CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

Size: px
Start display at page:

Download "CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M."

Transcription

1 CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear Model with Additive Heterogeneity 3. Linear Model with Correlated Random Slopes 4. A Modeling Approach for Nonlinear Models 5. Estimating Average Partial Effects 6. Application to Probit/Fractional Response 1

2 1. Introduction In linear model with additive heterogeneity, unbalanced panels cause no serious issues provided certain assumptions about selection hold. FE is more robust than RE in the sense that with RE selection must be assumed uncorrelated with heterogeneity as well as with idiosyncratic shocks. FE allows arbitrary correlation between selection and c i. 2

3 For nonlinear models, RE easily adapts to unbalanced panels, but the selection assumptions are strong. Conditional MLE allows unbalanced panels when it applies (logit, Poisson). Treating the c i as parameters to estimate so called fixed effects does not require a balanced panel (but still has an incidental parameters problem with small T). 3

4 Drawback to correlated RE approach compared with CMLE or FE: Not clear how to allow for unbalanced panels when we want heterogeneity to be correlated with covariates and selection. Want to be able to handle nonlinear CRE models. Without specifically modeling the selection rule say, using a Heckman-type approach all methods assume selection is independent of shocks. We can test this assumption. 4

5 2. Linear Model with Additive Heterogeneity We still draw a random sample from the cross section, with units indexed by i. But now we may not observe a complete set of time series observations. Model this situation using a sequence of selection indicators, s i1,...,s it, where s it 1 if time period t for unit i can be used in estimation. Usually this means that we observe all elements of x it, y it. 5

6 We only use an i, t pair when a full set of data is observed, as happens when software is used for RE and FE on unbalanced panels. So we focus on complete-case estimators without imputation. We still write the population model for a random draw i as y it x it c i u it, t 1,...,T, where x it can generally include a fully set of time dummies, or other aggregate time variables. 6

7 Along with a rank condition, a sufficient (but not necessary) condition for consistency of FE on the unbalanced panel is E u it x i, c i, s i 0, t 1,...,T, where x i x i1, x i2,...,x it and s i s i1, s i2,...,s it. Allows selection in any time period to be correlated with x i, c i, but selection in all time periods must be unrelated to the idiosyncratic shocks. 7

8 The time-demeaned data now uses different time periods for different i. Let T 1 ÿ it y it T i r 1 T 1 ẍ it x it T i r 1 s ir y ir y it ȳ i s ir x ir x it x i T where T i r 1 s ir is random. 8

9 The FE estimator is then N N 1 i 1 T t 1 N N 1 i 1 1 s it ẍ it ẍ it T t 1 1 s it ẍ it ẍ it Consistency (fixed T, N ) follows if N N 1 i 1 T t 1 N N 1 i 1 s it ẍ it ÿ it T t 1 s it ẍ it u it rank T t 1 E s it ẍ it ẍ it K E s it ẍ it u it 0, t 1,...,T. 9

10 In the balanced case we know that if we estimate the equation y it x it x i v it by pooled OLS or RE we get the FE estimate for. Conveniently, the same result holds for unbalanced case, with a caveat: the time averages are defined only for the complete-case observations. 10

11 In other words, estimate the equation y it x it x i v it by pooled OLS using the s it 1 observations. The coefficient vector is identical to the fixed effects. 11

12 Unlike in the balanced case, any aggregate time variables, including time dummies, should be part of x it, and their time averages must be included in x i to get the FE estimates on the other variables. Wooldridge (2010, working paper) shows a more general result. Recall that the RE estimator can be obtained from a pooled OLS regression. On an unbalanced panel, it is where y it i ȳ i on x it i x i, 1 i x i, 1 i z i if s it 1, varies because T i varies. i 1 u 2 / u 2 T i c 2 1/2 12

13 Algebraic Fact: Let be the vector (K 1 of coefficients on x it i x i in the POLS regression above. Then FE, the fixed effects estimate on the unbalanced panel. Note that z i can contain any time-constant variables, including functions of T i, or interactions of the form T i x i, or allow a different set of coefficients on x i for each different T i. The coefficients on x i may changed widely across T i, yet the estimate on x it is still the FE estimate. 13

14 The algebraic equivalence still justifies the robust, regression-based Hausman test. Write a model with time-constant variables z i as where z i includes a constant. Use the Mundlak equation and estimate this by RE. y it x it z i c i u it, t 1,...,T, y it x it x i z i a i u it 14

15 The regression based Hausman test is just a (robust) Wald test of H 0 : 0 after RE estimation of the augmented equation. Remember that FE (and RE) assume that selection in any time period is not correlated with u it. Might worry that a shock today affects being in the sample in the future. Wooldridge (2010, MIT Press): Using FE on the unbalanced panel, estimate the equation y it x it s i,t 1 c i u it, t 1,...,T 1 and test H 0 : 0 using a robust test. 15

16 If we use RE we can (and should) test if c i is correlated with functions of T i. For example, define dummies d ir 1 T i r, r 1,...,T 1 and add these to the usual RE estimation. Significance of d i d i1,...,d i,t 1 casts serious doubt on the usual RE analysis. FE does not care. 16

17 We can jointly test the assumption that E c i x i1,...,x it, s i1,...,s it E c i by estimating the equation y it x it x i d i v it by RE using the unbalanced panel and jointly testing H 0 : 0, 0. 17

18 3. Linear Model with Correlated Random Slopes Now consider a model with all slopes heterogeneous: E y it x i, a i, b i a i x it b i, so, in the population, x it : t 1,...,T is strictly exogenous conditional on a i, b i. Write a i c i, b i d i and y it x it c i x it d i u it where E u it x i, a i, b i E u it x i, c i, d i 0 for all t. 18

19 Selection may be related to x i, a i, b i but not the idiosyncratic shocks: E y it x i, a i, b i, s i E y it x i, a i, b i or E u it x i, a i, b i, s i 0, t 1,...,T. With enough time periods, we could obtain â i, b i ( fixed effects ) and then average these. But need T i K 1. 19

20 Unfortunately, if selection is correlated with b i, there are no intuitive robustness results for the usual FE estimator. How might we use a CRE approach? 20

21 Multiply through by selection: s it y it s it s it x it s it c i s it x it d i s it u it Because we only use observations with s it 1, we handle the heterogeneity by conditioning on the entire history of selection and the values of the covariates if selected. That is, s it, s it x it : t 1,...,T. If s it 0 the observation is not used; if s it 1, the observation is used, and we observe x it. 21

22 If we condition on the larger information set x i1, s i1, x i2, s i2,..., x it, s it and heterogeneity depends only on the covariates, we would be left with an equation that is not estimable unless the covariates are always observed. 22

23 Write h i h it : t 1,...,T s it, s it x it : t 1,...,T. Then, extending Mundlak (1978) and Chamberlain (1982), we work with E s it y it h i s it s it x it s it E c i h i s it x it E d i h i and then make assumptions concerning E c i h i and E d i h i. Alternatively, we could eliminate c i using the within transformation, and focus just on E d i h i. 23

24 A simple approach is to model the expectations as exchangeable functions of h it : t 1,...,T extending the balanced case considered by Altonji and Matzkin (2005). From the model with constant slopes, natural to start with w i T i, x i. Fairly natural extension of Mundlak (1978). 24

25 A flexible specification with g ir 1 T i r : where T E c i T i, x i r 1 T r g ir r r 1 T T E d i T i, x i g ir r r r 1 r 1 g ir x i r r r P T i r E 1 T i r r E x i T i r g ir x i r I K r, 25

26 As a practical matter, the formulation above is identical to running separate regressions for each T i : y it on 1, x it, x i, x i r x it,fors it 1 where r N r 1 N i 1 1 T i r x i and N r is the number of observations with T i r. The coefficient on x it, r, is the APE given T i r. We can average these across r to obtain the overall APE. 26

27 Notice that we lose the T i 1 observations just as in a standard fixed effects estimation. We could use grouping based on T i so that the APEs include the entire population. 27

28 In any formulation, including the basic FE estimation, have a simple test for dynamic selection bias that is, for the null E y it x i, a i, b i, s i E y it x i, a i, b i. Assumes that our model for E c i, d i s it, s it x it : t 1,...,T is correctly specified. Then, no other functions of s it, s it x it : t 1,...,T should appear in E s i y it h i. 28

29 Assuming we have some T i with T i 3, add as extra regressors at time t the variables s i,t 1, s i,t 1 x i,t 1. We can compute a fully robust (to serial correlation and heteroskedasticity) Wald test of joint significance. Produces a joint test of ignorable selection and strictly exogenous covariates. 29

30 4. A Modeling Approach for Nonlinear Models Interested in the conditional distribution D y it x it, c i, or maybe just an expectation. Assume strictly exogenous covariates conditional on c i and ignorable selection: D y it x i, c i, s i D y it x it, c i, t 1,...,T. 30

31 Focus on methods that only exploit assumptions about the marginal distributions, D y it x it, c i, not joint distributions. Average partial effects are generally identified without restricting conditional dependence. Let g t y t x it, c i ; be a parametric density conditional on x it, c i. As with the linear model, specify models for D c i s it, s it x it : t 1,...,T. Let w i be a vector of known functions of s it, s it x it : t 1,...,T that act as sufficient statistics, so that D c i s it, s it x it : t 1,...,T D c i w i 31

32 Because D y it x it, c i, s it 1 D y it x it, c i and we have a density for the latter, we can obtain the density given x it, w i, s it 1 as f t y t x it, w i ;, R M g t y t x it, c; h c w i ; dc where h c w i ; is a parametric density for D c i w i and M is the dimension of c i. The partial log-likelihood function for the entire sample is N i 1 T t 1 s it log f t y it x it, w i ;,. Need a robust sandwich estimator for serial correlation (and maybe other misspecifications). 32

33 Same basic arguments carry through if we focus on estimating conditional means. We start with E y it x it, c i m t x it, c i and obtain E y it x it, w i by integrating m t x it, c i with respect to the density of c i given w i (which, again, is valid for the mean when s it 1). Or, we just assert a model for E y it x it, w i. Can then use a host of quasi-log likelihoods on the selected sample, including the Bernoulli for fractional responses, the gamma for nonnegative (continuous) responses, and the Poisson for nonnegative (count) responses. 33

34 5. Estimating Average Partial Effects In most nonlinear models, the parameters appearing in f t y t x t, c; provide only part of the story for the effect of x t on y t. Using pooled methods we often cannot fully identify or the distribution of c i. Even in the unbalanced case we can use the Blundell and Powell (2003) approach provided a vector w i suitably proxies for correlation between c i and s it, s it x it : t 1,...,T. 34

35 Let q t x t, w; denote the mean associated with f t y t x t, w;. Then ASF x t E wi q t x t, w i ;, that is, we can obtain the ASF by averaging out the observed vector of sufficient statistics, w i, from E y it x t, w i, s it 1 rather than averaging out c i from E y it x t, c i. Consistent estimation is then easy: N ASF x t N 1 i 1 q t x t, w i ; Panel bootstrap is still justified with unbalanced panel. 35

36 Tricky to estimate an overall average partial effect. With a balanced panel, it is natural to average APE tj x it, w i across the distribution of x it, w i, and then possibly across t, too. But if selection s it depends on x it, averaging across the selected sample does not consistently estimate E xit,w i APE tj x it, w i. Presumably we still have an idea of useful values to plug in for x t. 36

37 6. Application to Probit/Fractional Response The probit model, either for true binary responses or fractional responses, is natural for applying the previous approach. The model is P y it 1 x i, c i P y it 1 x it, c i x it c i, t 1,...,T where x it can include time dummies or other aggregate time variables. Once we specify P y it 1 x i, c i and assume that selection is conditionally ignorable for all t, that is, P y it 1 x i, c i, s i P y it 1 x i, c i, all that is left is to specify a model for D c i w i for suitably chosen functions w i of s it, s it x it : t 1,...,T. 37

38 A specification linear in x i but with intercept and slopes different for each T i is T E c i w i r 1 T r 1 T i r r 1 1 T i r x i r At a minimum, should let the variance of c i change with T i : Var c i w i exp T 1 r 1 1 T i r r 38

39 If we also maintain that D c i w i is normal, then we obtain the following response probability for s it 1 (with normalization that does not affect estimation of APEs): P y it 1 x it, w i, s it 1 x T it r 1 T r g ir r 1 g ir x i r 1/2 g ir r T exp r 2 so that the denominator is unity when all r are zero. (Recall g ir 1 T i r.) No difficulty in adding g ir x i for r 1,...,T to the variance function. 39

40 Computational bonus: Estimable by so-called heteroskedastic probit software, where the explanatory variables at time t are 1, x it, g i1,...,g it, g i1 x i,...,g it x i and the explanatory variables in the variance are simply the dummy variables g i2,...,g it, or also add g i1 x i,...,g it x i. 40

41 The APEs are easy to obtain from the estimated ASF: N ASF x t N 1 i 1 x T t r 1 T rg ir r 1 T exp r 2 g ir r g ir x i r 1/2 where the coefficients with ^ are from the pooled heteroskedastic probit estimation. The functions of T i, x i are averaged out, leaving the result a function of x t. If, say, x tj is continuous, its APE is estimated as APE tj x t j N N 1 i 1 T x t r 1 T rg ir r 1 T exp r 2 g ir r g ir x i r 1/2. 41

42 The above procedure applies, without change, if y it is a fractional response; that is, 0 y it 1. The original model is E y it x it, c i x it c i and APEs are on the mean response. Could allow to change with each T i (losing T i 1). Then, just estimate an APE for each T i 2 and average the results. Just a separate probit with explanatory variables 1, x it, x i. Or, allow heteroskedasticity as a function of x i, with ASF T ASF x t N 1 r 2 N i 1 g ir x t r r x i r exp x i r/2. 42

43 Same ideas apply to ordered probit and even multinomial logit, as well as Tobit and count data models. With count data, Poisson FE estimator has very nice robustness properties, including for sample selection E y it x i, c i, s i E y it x it, c i exp x it c i is sufficient but only for the scalar, additive heterogeneity case. 43

44 Scope for CRE approach in more complicated models, such as E y it x i, c i exp a i x it b i. CMLE approach makes no sense with all coefficients heterogeneous (and it is not clear a CMLE exists, anyway). FE approach can be implemented with lots of large T i T i K 1, but how well? CRE approach is available but with restrictions on D c i s it, s it x it : t 1,...,T. 44

45 EXAMPLE: Effects of Spending on School Test Pass Rates. use meap94_98. tab year 1992 school yr Freq. Percent Cum , , , , , Total 7, egen tobs sum(1), by(schid). tab tobs number of time periods Freq. Percent Cum , , , Total 7, gen tobs3 tobs 3. gen tobs4 tobs 4 45

46 . egen lavgrexpb mean(lavgrexp), by(schid). egen lunchb mean(lunch), by(schid). egen lenrolb mean(lenrol), by(schid). egen y95b mean(y95), by(schid). egen y96b mean(y96), by(schid). egen y97b mean(y97), by(schid). egen y98b mean(y98), by(schid). gen tobs4 tobs 4. gen tobs3 tobs 3. gen tobs3_lavgrexp tobs3*lavgrexp. gen tobs4_lavgrexp tobs4*lavgrexp 46

47 . xtreg math4 lavgrexp lunch lenrol y95 y96 y97 y98, fe cluster(schid) Fixed-effects (within) regression Number of obs 7150 Group variable: schid Number of groups 1683 R-sq: within Obs per group: min 3 between avg 4.2 overall max 5 F(7,1682) corr(u_i, Xb) Prob F (Std. Err. adjusted for 1683 clusters in schid) Robust math4 Coef. Std. Err. t P t [95% Conf. Interval] lavgrexp lunch lenrol y y y y _cons sigma_u sigma_e rho (fraction of variance due to u_i)

48 . xtreg math4 lavgrexp tobs3_lavgrexp tobs4_lavgrexp lunch lenrol y95 y96 y97 y98, fe cluster(schid) Fixed-effects (within) regression Number of obs 7150 Group variable: schid Number of groups 1683 R-sq: within Obs per group: min 3 between avg 4.2 overall max 5 F(9,1682) corr(u_i, Xb) Prob F (Std. Err. adjusted for 1683 clusters in schid) Robust math4 Coef. Std. Err. t P t [95% Conf. Interval] lavgrexp tobs3_lavgrexp tobs4_lavgrexp lunch lenrol y y y y _cons sigma_u sigma_e rho (fraction of variance due to u_i)

49 . test tobs3_lavgrexp tobs4_lavgrexp ( 1) tobs3_lavgrexp 0 ( 2) tobs4_lavgrexp 0 F( 2, 1682) 2.67 Prob F

50 . * Now fractional response.. replace math4 math4/100 (7150 real changes made). replace lunch lunch/100 (7146 real changes made).. capture program drop frac_het.. program frac_het 1. version args llf xb zg 3. quietly replace llf $ML_y1*log(normal( xb *exp(- zg ))) /// (1 - $ML_y1)*log(1 - normal( xb *exp(- zg ))) 4. end.. ml model lf frac_het (math4 lavgrexp lunch lenrol y95 y96 y97 y98 lavgrexpb /// lunchb lenrolb y95b y96b y97b y98b tobs3 tobs4) (tobs3 tobs4, nocons), /// vce(cluster distid). ml max 50

51 Number of obs 7150 Wald chi2(16) Log pseudolikelihood Prob chi (Std. Err. adjusted for 467 clusters in distid) Robust math4 Coef. Std. Err. z P z [95% Conf. Interval] eq1 lavgrexp lunch lenrol y y y y lavgrexpb lunchb lenrolb y95b y96b y97b y98b tobs tobs _cons eq2 tobs tobs

52 . glm math4 lavgrexp lunch lenrol y95 y96 y97 y98 lavgrexpb lunchb lenrolb y95b y96b y97b y98b tobs3 tobs4, fam(bin) link(probit) cluster(schid) note: math4 has noninteger values Generalized linear models No. of obs 7150 Optimization : ML Residual df 7133 (Std. Err. adjusted for 1683 clusters in schid) Robust math4 Coef. Std. Err. z P z [95% Conf. Interval] lavgrexp lunch lenrol y y y y lavgrexpb lunchb lenrolb y95b y96b y97b y98b tobs tobs _cons

53 . margins, dydx(lavgrexp) Average marginal effects Number of obs 7150 Model VCE : Robust Expression : Predicted mean math4, predict() dy/dx w.r.t. : lavgrexp Delta-method dy/dx Std. Err. z P z [95% Conf. Interval] lavgrexp

Jeffrey M. Wooldridge Michigan State University

Jeffrey M. Wooldridge Michigan State University Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional

More information

Thoughts on Heterogeneity in Econometric Models

Thoughts on Heterogeneity in Econometric Models Thoughts on Heterogeneity in Econometric Models Presidential Address Midwest Economics Association March 19, 2011 Jeffrey M. Wooldridge Michigan State University 1 1. Introduction Much of current econometric

More information

CORRELATED RANDOM EFFECTS MODELS WITH UNBALANCED PANELS

CORRELATED RANDOM EFFECTS MODELS WITH UNBALANCED PANELS CORRELATED RANDOM EFFECTS MODELS WITH UNBALANCED PANELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038 wooldri1@msu.edu July 2009 I presented an earlier

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity

fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity Richard Bluhm May 26, 2013 Introduction Stata can easily estimate a binary response probit models

More information

Control Function and Related Methods: Nonlinear Models

Control Function and Related Methods: Nonlinear Models Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Empirical Application of Panel Data Regression

Empirical Application of Panel Data Regression Empirical Application of Panel Data Regression 1. We use Fatality data, and we are interested in whether rising beer tax rate can help lower traffic death. So the dependent variable is traffic death, while

More information

New Developments in Econometrics Lecture 16: Quantile Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile

More information

Quantitative Methods Final Exam (2017/1)

Quantitative Methods Final Exam (2017/1) Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Econometrics - Exam May 11, 2011 1 Exam Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Problem 1: (15 points) A researcher has data for the year 2000 from

More information

Lecture 3 Linear random intercept models

Lecture 3 Linear random intercept models Lecture 3 Linear random intercept models Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The response is measures at n different times, or under

More information

Problem Set 10: Panel Data

Problem Set 10: Panel Data Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h.

Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. This is an open book examination where all printed and written resources, in addition to a calculator, are allowed. If you are

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p ) Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was

More information

Fixed and Random Effects Models: Vartanian, SW 683

Fixed and Random Effects Models: Vartanian, SW 683 : Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:

More information

Econometrics Homework 4 Solutions

Econometrics Homework 4 Solutions Econometrics Homework 4 Solutions Computer Question (Optional, no need to hand in) (a) c i may capture some state-specific factor that contributes to higher or low rate of accident or fatality. For example,

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Problem Set 5 ANSWERS

Problem Set 5 ANSWERS Economics 20 Problem Set 5 ANSWERS Prof. Patricia M. Anderson 1, 2 and 3 Suppose that Vermont has passed a law requiring employers to provide 6 months of paid maternity leave. You are concerned that women

More information

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models Applied Econometrics Lecture 3: Introduction to Linear Panel Data Models Måns Söderbom 4 September 2009 Department of Economics, Universy of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,

More information

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

Fortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7

Fortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7 Fortin Econ 495 - Econometric Review 1 Contents 1 Panel Data Methods 2 1.1 Fixed Effects......................... 2 1.1.1 Dummy Variables Regression............ 7 1.1.2 First Differencing Methods.............

More information

Generalized linear models

Generalized linear models Generalized linear models Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Generalized linear models Boston College, Spring 2016 1 / 1 Introduction

More information

Exercices for Applied Econometrics A

Exercices for Applied Econometrics A QEM F. Gardes-C. Starzec-M.A. Diaye Exercices for Applied Econometrics A I. Exercice: The panel of households expenditures in Poland, for years 1997 to 2000, gives the following statistics for the whole

More information

1 The basics of panel data

1 The basics of panel data Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Related materials: Steven Buck Notes to accompany fixed effects material 4-16-14 ˆ Wooldridge 5e, Ch. 1.3: The Structure of Economic Data ˆ Wooldridge

More information

ERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data

ERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data ERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data Måns Söderbom Friday 16 January 2009 1 Introduction The methods discussed thus far in the course are well suited for

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

Specification Tests in Unbalanced Panels with Endogeneity.

Specification Tests in Unbalanced Panels with Endogeneity. Specification Tests in Unbalanced Panels with Endogeneity. Riju Joshi Jeffrey M. Wooldridge June 22, 2017 Abstract This paper develops specification tests for unbalanced panels with endogenous explanatory

More information

point estimates, standard errors, testing, and inference for nonlinear combinations

point estimates, standard errors, testing, and inference for nonlinear combinations Title xtreg postestimation Postestimation tools for xtreg Description The following postestimation commands are of special interest after xtreg: command description xttest0 Breusch and Pagan LM test for

More information

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University. Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

Missing dependent variables in panel data models

Missing dependent variables in panel data models Missing dependent variables in panel data models Jason Abrevaya Abstract This paper considers estimation of a fixed-effects model in which the dependent variable may be missing. For cross-sectional units

More information

Lecture#12. Instrumental variables regression Causal parameters III

Lecture#12. Instrumental variables regression Causal parameters III Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function

More information

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) 1 2 Panel Data Panel data is obtained by observing the same person, firm, county, etc over several periods. Unlike the pooled cross sections,

More information

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.

More information

Monday 7 th Febraury 2005

Monday 7 th Febraury 2005 Monday 7 th Febraury 2 Analysis of Pigs data Data: Body weights of 48 pigs at 9 successive follow-up visits. This is an equally spaced data. It is always a good habit to reshape the data, so we can easily

More information

Outline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes

Outline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes Lecture 2.1 Basic Linear LDA 1 Outline Linear OLS Models vs: Linear Marginal Models Linear Conditional Models Random Intercepts Random Intercepts & Slopes Cond l & Marginal Connections Empirical Bayes

More information

****Lab 4, Feb 4: EDA and OLS and WLS

****Lab 4, Feb 4: EDA and OLS and WLS ****Lab 4, Feb 4: EDA and OLS and WLS ------- log: C:\Documents and Settings\Default\Desktop\LDA\Data\cows_Lab4.log log type: text opened on: 4 Feb 2004, 09:26:19. use use "Z:\LDA\DataLDA\cowsP.dta", clear.

More information

multilevel modeling: concepts, applications and interpretations

multilevel modeling: concepts, applications and interpretations multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models

More information

Panel data methods for policy analysis

Panel data methods for policy analysis IAPRI Quantitative Analysis Capacity Building Series Panel data methods for policy analysis Part I: Linear panel data models Outline 1. Independently pooled cross sectional data vs. panel/longitudinal

More information

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II Jeff Wooldridge IRP Lectures, UW Madison, August 2008 5. Estimating Production Functions Using Proxy Variables 6. Pseudo Panels

More information

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008 Panel Data Seminar Discrete Response Models Romain Aeberhardt Laurent Davezies Crest-Insee 11 April 2008 Aeberhardt and Davezies (Crest-Insee) Panel Data Seminar 11 April 2008 1 / 29 Contents Overview

More information

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes ECON 4551 Econometrics II Memorial University of Newfoundland Panel Data Models Adapted from Vera Tabakova s notes 15.1 Grunfeld s Investment Data 15.2 Sets of Regression Equations 15.3 Seemingly Unrelated

More information

Lecture 3.1 Basic Logistic LDA

Lecture 3.1 Basic Logistic LDA y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data

More information

Longitudinal Data Analysis. RatSWD Nachwuchsworkshop Vorlesung von Josef Brüderl 25. August, 2009

Longitudinal Data Analysis. RatSWD Nachwuchsworkshop Vorlesung von Josef Brüderl 25. August, 2009 Longitudinal Data Analysis RatSWD Nachwuchsworkshop Vorlesung von Josef Brüderl 25. August, 2009 Longitudinal Data Analysis Traditional definition Statistical methods for analyzing data with a time dimension

More information

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panels. Chapter Introduction Autoregressive Model Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this

More information

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N 1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models Sebastian Kripfganz University of Exeter Business School, Department of Economics, Exeter, UK UK Stata Users Group

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers 1. Consider a binary random variable y i that describes a Bernoulli trial in which the probability of observing y i = 1 in any draw is given

More information

Generalized method of moments estimation in Stata 11

Generalized method of moments estimation in Stata 11 Generalized method of moments estimation in Stata 11 David M. Drukker StataCorp Stata Conference Washington, DC 2009 1 / 27 Outline 1 A quick introduction to GMM 2 gmm examples Ordinary least squares Two-stage

More information

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a

More information

ECON 497 Final Exam Page 1 of 12

ECON 497 Final Exam Page 1 of 12 ECON 497 Final Exam Page of 2 ECON 497: Economic Research and Forecasting Name: Spring 2008 Bellas Final Exam Return this exam to me by 4:00 on Wednesday, April 23. It may be e-mailed to me. It may be

More information

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 6 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 21 Recommended Reading For the today Advanced Panel Data Methods. Chapter 14 (pp.

More information

1. Overview of the Basic Model

1. Overview of the Basic Model IRP Lectures Madison, WI, August 2008 Lectures 3 & 4, Monday, August 4, 11:15-12:30 and 1:30-2:30 Linear Panel Data Models These notes cover some recent topics in linear panel data models. They begin with

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

8. Nonstandard standard error issues 8.1. The bias of robust standard errors

8. Nonstandard standard error issues 8.1. The bias of robust standard errors 8.1. The bias of robust standard errors Bias Robust standard errors are now easily obtained using e.g. Stata option robust Robust standard errors are preferable to normal standard errors when residuals

More information

A simple alternative to the linear probability model for binary choice models with endogenous regressors

A simple alternative to the linear probability model for binary choice models with endogenous regressors A simple alternative to the linear probability model for binary choice models with endogenous regressors Christopher F Baum, Yingying Dong, Arthur Lewbel, Tao Yang Boston College/DIW Berlin, U.Cal Irvine,

More information

Estimating and Interpreting Effects for Nonlinear and Nonparametric Models

Estimating and Interpreting Effects for Nonlinear and Nonparametric Models Estimating and Interpreting Effects for Nonlinear and Nonparametric Models Enrique Pinzón September 18, 2018 September 18, 2018 1 / 112 Objective Build a unified framework to ask questions about model

More information

Handout 11: Measurement Error

Handout 11: Measurement Error Handout 11: Measurement Error In which you learn to recognise the consequences for OLS estimation whenever some of the variables you use are not measured as accurately as you might expect. A (potential)

More information

Practice 2SLS with Artificial Data Part 1

Practice 2SLS with Artificial Data Part 1 Practice 2SLS with Artificial Data Part 1 Yona Rubinstein July 2016 Yona Rubinstein (LSE) Practice 2SLS with Artificial Data Part 1 07/16 1 / 16 Practice with Artificial Data In this note we use artificial

More information

Christopher Dougherty London School of Economics and Political Science

Christopher Dougherty London School of Economics and Political Science Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this

More information

Econometric Analysis of Panel Data. Final Examination: Spring 2013

Econometric Analysis of Panel Data. Final Examination: Spring 2013 Econometric Analysis of Panel Data Professor William Greene Phone: 212.998.0876 Office: KMC 7-90 Home page:www.stern.nyu.edu/~wgreene Email: wgreene@stern.nyu.edu URL for course web page: people.stern.nyu.edu/wgreene/econometrics/paneldataeconometrics.htm

More information

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018 Econometrics I KS Module 1: Bivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: March 12, 2018 Alexander Ahammer (JKU) Module 1: Bivariate

More information

Estimating Panel Data Models in the Presence of Endogeneity and Selection

Estimating Panel Data Models in the Presence of Endogeneity and Selection ================ Estimating Panel Data Models in the Presence of Endogeneity and Selection Anastasia Semykina Department of Economics Florida State University Tallahassee, FL 32306-2180 asemykina@fsu.edu

More information

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C = Economics 130 Lecture 6 Midterm Review Next Steps for the Class Multiple Regression Review & Issues Model Specification Issues Launching the Projects!!!!! Midterm results: AVG = 26.5 (88%) A = 27+ B =

More information

Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see

Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see Title stata.com hausman Hausman specification test Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see Description hausman

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

1

1 i 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Foreign Liabilities/ Dependen Variabel Rasio Foreign Liabilities Total Debt (1) (2) Konstanta 0.216*** 0.219*** Ekspor/Total Sales -0.064-0.055

More information

Dealing With and Understanding Endogeneity

Dealing With and Understanding Endogeneity Dealing With and Understanding Endogeneity Enrique Pinzón StataCorp LP October 20, 2016 Barcelona (StataCorp LP) October 20, 2016 Barcelona 1 / 59 Importance of Endogeneity Endogeneity occurs when a variable,

More information

Basic Regressions and Panel Data in Stata

Basic Regressions and Panel Data in Stata Developing Trade Consultants Policy Research Capacity Building Basic Regressions and Panel Data in Stata Ben Shepherd Principal, Developing Trade Consultants 1 Basic regressions } Stata s regress command

More information

Regression #8: Loose Ends

Regression #8: Loose Ends Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch

More information

Course Econometrics I

Course Econometrics I Course Econometrics I 3. Multiple Regression Analysis: Binary Variables Martin Halla Johannes Kepler University of Linz Department of Economics Last update: April 29, 2014 Martin Halla CS Econometrics

More information

Ch 7: Dummy (binary, indicator) variables

Ch 7: Dummy (binary, indicator) variables Ch 7: Dummy (binary, indicator) variables :Examples Dummy variable are used to indicate the presence or absence of a characteristic. For example, define female i 1 if obs i is female 0 otherwise or male

More information

Greene, Econometric Analysis (7th ed, 2012)

Greene, Econometric Analysis (7th ed, 2012) EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 2 3: Classical Linear Regression The classical linear regression model is the single most useful tool in econometrics.

More information

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model -redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model Mark B. Stewart University of Warwick January 2006 1 The model The latent equation for the random effects

More information

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley Panel Data Models James L. Powell Department of Economics University of California, Berkeley Overview Like Zellner s seemingly unrelated regression models, the dependent and explanatory variables for panel

More information

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata

David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata Estimating the ATE of an endogenously assigned treatment from a sample with endogenous selection by regression adjustment using an extended regression models David M. Drukker Executive Director of Econometrics

More information

GMM Estimation in Stata

GMM Estimation in Stata GMM Estimation in Stata Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets 1 Outline Motivation 1 Motivation 2 3 4 2 Motivation 3 Stata and

More information

Lecture 8: Instrumental Variables Estimation

Lecture 8: Instrumental Variables Estimation Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano

More information

EC327: Advanced Econometrics, Spring 2007

EC327: Advanced Econometrics, Spring 2007 EC327: Advanced Econometrics, Spring 2007 Wooldridge, Introductory Econometrics (3rd ed, 2006) Chapter 14: Advanced panel data methods Fixed effects estimators We discussed the first difference (FD) model

More information

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner Econometrics II Nonstandard Standard Error Issues: A Guide for the Practitioner Måns Söderbom 10 May 2011 Department of Economics, University of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,

More information

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x)

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x) ECO 6375 Prof Millimet Problem Set #2: Answer Key Stata problem 2 Q 3 Q (a) The sample average of the individual-specific marginal effects is 0039 for educw and -0054 for white Thus, on average, an extra

More information