Instrumental Variable Regression

Size: px
Start display at page:

Download "Instrumental Variable Regression"

Transcription

1 Topic 6 Instrumental Variable Regression ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà

2 Outline of this topic Randomized Experiments, natural experiments and causation Instrumental variables: causation versus correlation. Endogeneity bias Derivation of IV with GMM Properties of IV Tests

3 Experiments and quasi-experiments i The ideal environment is one in which we can control an experiment and randomly assign a treatment across individual units. Think of a pharmaceutical trial where you could control every single variable in your patients lives and you could randomly assign a treatment and a placebo. Experiments are expensive and in economics, usually rare so we often have to rely on quasiexperiments or natural experiments: a situation where something similar to a treatment assignment took place.

4 Puerto Rico and Hurricane Betsy Puerto Rico and the effects of class size on earnings: in 1956 there was a hurricane (Betsy) that traversed the island of Puerto Rico. Not many casualties but much infrastructure damage. As a result, children along the hurricane s path had to attend neighboring schools, effectively doubling class sizes in some districts but not others. The hurricane effectively assigned at random what school districts had class sizes doubled.

5 Program evaluation In statistics/econometrics this refers to studies designed to assess the effects of public policy: e.g. class size reduction, anti-smoking campaigns, etc. Three types of experiments: 1. Clinical i l drug trial: does a proposed drug lower cholesterol? Y = cholesterol level X = treatment t t or control group (or dose of drug) 2. Job training program (Job Training Partnership Act) Y = has a job, or not (or Y = wage income) X = went through experimental program, or not 3. Class size effect (Tennessee class size experiment) Y = test score (Stanford Achievement Test) X = class size treatment group (regular, regular + aide, small)

6 Causality An ideal randomized controlled experiment assigns subjects to treatment and control groups. More generally, the treatment level X is randomly assigned y i = x i + ² i Random assignment ensures that x i and ² i are independent and hence E(² i jx i )=0which is what we need to ensure ^ ^ is unbiased. Suppose x i is a binary variable (i.e. 1 for treatment, 0 for control) then

7 Treatment Effect the average treatment effect is simply: AT E = E(y i jx i =1) E(y i jx i =0); AT ^ E =¹y x=1 ¹y x=0 This is sometimes called the differences estimator. In practice, several reasons things fail: omitted variable bias: we omit important explanatory variables correlated with x. Errors-in-variables bias. Simultaneous causality bias.

8 Omitted Variables Bias Suppose the data are generated by the model: y = X + W± + ² But you estimate the model: Then: y = X + u ^ 1 1 ^ =(X 0 X) 1 (X 0 y)=(x 0 X) 1 (X 0 (X + W± + ²) = = +(X 0 X) 1 (X 0 W)± +(X 0 X) 1 (X 0 ²) So the bias depends d on the correlation between X and W and. Notice: E(ujX) = E(X 0 W±jX) = 0

9 Errors-in-variables Suppose the data are generated by: y i 0 = 1 + 2x 0 i + ² 0 i However, we observe our variables with error, e.g.: x i = x 0 i + v xi ;andy i = yi 0 + v yi For simplicity (although perhaps not realistically) assume the errors are iid i.i.d. D(0; ¾ 2 j ); j = x; y Substituting the observed y, x into the original regression, we have

10 Errors-in-Variables y = 0 i 1 + 2(x i v xi ) + ² i + v yt = 1 + 2x i + f² 0 i + v yi 2v xi g Measurement error clearly increases the error variance, since: But this is a minor problem compared to: u i V (u i )=¾² 2 + ¾y ¾ x 2 E(u i jx i )=E(² 0 i + v yi 2v xi jx 0 i + v xi )= 2v xi 6=0

11 Simultaneous Causality Classic example: a regression of quantity of wheat (Q) on the price of wheat (P). Suppose you want to estimate the elasticity of demand for wheat: ln(q i ) = ln(p i ) + u i In principle ^ 1 would be an estimate of this elasticity (why?). In practice, it is not To see this, suppose we have data on quantities and prices for different years

12 Simultaneous Causality Observations of quantities and prices over time:

13 Simultaneous Causality The interaction of supply and demand generates the following scatter of points

14 Simultaneous Causality Now suppose that I use the variable rainfall as a way to isolate shifts in the supply curve out of this cloud of points(assuming that rainfall does nothing to change the demand for wheat):

15 What is the common thread? In all of these situations, the regression parameters are estimated t with bias. The source of the bias is that the residuals of the actual regression that we estimate are correlated with the regressors E(Xi² 0 i ) 6= 0: Omitted variable bias: the regressors are correlated with the omitted variables. In the regression of test scores on class size omitting parental income will bias the effect of class size (our policy variable of interest) since richer districts i t tend to have smaller classes and richer parents have more resources (unmeasured) to help their children do well in school

16 The common thread (cont.) Errors-in-variables: we saw that even in the benign case that t the measurement error is random and well-behaved we had bias. Of course, this only gets worse if the bias is not random (e.g. the size of the measurement error is related to the value the regressor takes). Simultaneous causality: perhaps the more fundamental source of bias (often called endogeneity bias). Many variables are jointly dt determined dso some work needs to be done to filter the portion of the causality direction that we want to measure

17 The Solution: Instrumental t Variables Instrumental Variables: Z i is an l 1vectorsuchthat E(Zi² 0 i )=0andE(ZiX 0 i ) 6= 0withl k Remarks: Not all elements of X i need be endogenous. Those that are exogenous could be used as elements in Z i. In fact, if all the X i were exogenous, we would recover the usual moment condition we used for the method of moments derivation of OLS. If l = k we say the model is just identified, d otherwise we say the model is overidentified.

18 Two Stage Least Squares So the instruments are correlated with that part of the regressors that is uncorrelated with the error term. We could get at that part by regressing the regressors on the instruments: X = Z + U n ll k n k n k ^X = Z ^ =Z(Z 0 Z) 1 Z 0 X = P z X with P z idempotent Next, plug-in X into the linear regression

19 TSLS (continued) Y = X + ² =(Z +U) + ² = Z +(U + ²) =Z + V Notice that: E(Z 0 V )=E(Z 0 U )+E(Z 0 ²)=0 ^ =(Z 0 Z) 1 Z 0 Y and from before ^ =(Z 0 Z) 1 Z 0 X Let s think of the just identified case, i.e., = k 1 k kk 1! = 1 l = k ^ IV = (Z 0 Z) 1 Z 0 X 1 (Z 0 Z) 1 Z 0 Y =(Z 0 X) 1 (Z 0 Y )

20 TSLS Now using method of moments: E(Z 0 ²)=0! E(Z 0 (Y X )) = 0! E(Z 0 Y )=E(Z 0 X) And the analogy principle: ^ IV = μ Z 0 X n 1 μ Z 0 Y n =(Z 0 X) 1 (Z 0 Y ) However, what happens when we have overidentification, i.e. l >k ^ l k is not directly invertible

21 TSLS (cont.) However, notice ^ = ^ ^! (Z 0 Z) 1 Z 0 Y =(Z 0 Z) 1 (Z 0 X) l k Pre-multiplying both sides by X Z X 0 Z(Z 0 Z) 1 Z 0 Y = X 0 Z(Z 0 Z) 1 (Z 0 X) ^ k 1 ^ k k k 1 hence ^ k 1 1 = X 0 Z(Z 0 Z) 1 Z 0 X X 0 Z(Z 0 Z) 1 (Z 0 Y ) ^ IV = (X 0 P z X) ) 1 (X 0 P z Y )

22 TSLS The General Case Let s go back. First stage: X = Z +U ^X = Z ^ =Z(Z 0 Z) 1 Z 0 X = P z X Second Stage: do OLS on the auxiliary regression Y = ^X + V ^ IV =(^X 0 ^X) 1 ^X0 Y =(X 0 P z X) 1 (X 0 P z Y ) Since P z P z = P z

23 Remarks When doing TSLS note that the standard errors of the second stage regression (the one where you substitute X with ^X ) are incorrect. The reason is that the usual OLS formulas in the second stage do not take into account the estimation error in ^X^X. As we will see in more detail, the same issues with heteroscedasticity can arise in instrumental variable regression as well.

24 Example: Demand for Cigarretes We want to estimate: ln(q i d )= 0 + 1ln(p i )+² i Data: a panel of observations across states and time Annual cigarette consumption and average prices paid (including tax) 48 continental US states, Proposed instrumental variable: Z i = general sales tax per pack in the state = SalesTax i Is this a valid instrument? (1) Relevant? (2) Exogenous?

25 STATA example: First Stage Instrument = Z = rtaxso = general sales tax (real $/pack) X Z. reg lravgprs rtaxso if year==1995, r; Regression with robust standard errors Number of obs = 48 F( 1, 46) = Prob > F = R-squared = Root MSE = Robust lravgprs Coef. Std. Err. t P> t [95% Conf. Interval] rtaxso _cons X-hat. predict lravphat; Now we have the predicted values from the 1 st stage

26 Second Stage Y X-hat. reg lpackpc lravphat if year==1995, r; Regression with robust standard errors Number of obs = 48 F( 1, 46) = Prob > F = R-squared = Root MSE = Robust lpackpc Coef. Std. Err. t P> t [95% Conf. Interval] lravphat _cons These coefficients are the TSLS estimates The standard errors are wrong because they ignore the fact that ARE/ECN the 240A first stage was estimated

27 Using STATA s built-in ivreg command Y X Z. ivreg lpackpc (lravgprs = rtaxso) if year==1995, r; IV (2SLS) regression with robust standard errors Number of obs = 48 F( 1, 46) = Prob > F = R-squared = Root MSE = Robust lpackpc Coef. Std. Err. t P> t [95% Conf. Interval] lravgprs _cons Instrumented: lravgprs This is the endogenous regressor Instruments: rtaxso This is the instrumental varible OK, the change in the SEs was small this time...but not always! ln(^qd i d )= 9:72 1:08 ln(^p i), n = 48 (1:53) (0:32)

28 Elaborating on the Example cigarettes cigarettes ln( Q ) = β 0 + β 1 ln( P i ) + β 2 ln(income i i) ) + u i ( i i Z 1i = general sales tax i Z 2i = cigarette-specific ifi tax i cigarettes Endogenous variable: ln( P i )( one X ) ) Included exogenous variable: ln(income i ) ( one W ) Instruments (excluded endogenous variables): general sales tax, cigarette-specific ifi tax ( two Zs ) Is the demand elasticity β 1 overidentified, exactly identified, or underidentified?

29 Cigarrete Demand: One Instrument t Y W X Z. ivreg lpackpc lperinc (lravgprs = rtaxso) if year==1995, r; IV (2SLS) regression with robust standard errors Number of obs = 48 F( 2, 45) = 8.19 Prob > F = R-squared = Root MSE = Robust lpackpc Coef. Std. Err. t P> t [95% Conf. Interval] lravgprs lperinc _cons Instrumented: lravgprs Instruments: lperinc rtaxso STATA lists ALL the exogenous regressors as instruments slightly different terminology than we have been using Running IV as a single command yields correct SEs Use, r for heteroskedasticity-robust SEs

30 Cigarrete Demand: Two Instruments t Y W X Z 1 Z 2. ivreg lpackpc lperinc (lravgprs = rtaxso rtax) if year==1995, r; IV (2SLS) regression with robust standard errors Number of obs = 48 F( 2, 45) = Prob > F = R-squared = Root MSE = Robust lpackpc Coef. Std. Err. t P> t [95% Conf. Interval] lravgprs lperinc _cons Instrumented: lravgprs Instruments: lperinc rtaxso rtax STATA lists ALL the exogenous regressors as instruments slightly different terminology than we have been using

31 IV as Generalized Method of Moments Rather than examining the properties of the justidentified or the overidentified TSLS-IV estimator, it is easier to examine IV using GMM LetZ Z i ; X i be 1 l and 1 k vectors respectively Suppose E(Zi² 0 i )=0 Hence define: g i(y 0 i (y i ; X i ; Z i ; ) = Z i i X i ) From the population moment condition, the sample analog is 1 n nx i=1 g i (y i ;X i ;Z i ; ) =g n ( ) = Z0 (Y X ) n =0

32 GMM The generic GMM objective function is: ^Q n ( ) = g n ( ) 0 ^Wgn ( ) For some weighting l l matrix Notice that ^G n( = Z 0 X ^W p! W So if I apply directly (big if since I have not checked assumptions) the results from extremum estimators p ^ n( 0) d 1 1! N(0; (G 0 WG) 1 G 0 W ÐWG(G 0 WG) 1 ) Were. If homoscedastic Ð = E(ZiZ i Z i ² i ) Ð = ¾ (Z Z)

33 GMM in detail μ Z 0 (Y X ) 0 max n First order conditions: Recall: μ Z 0 X 0 2 ^W n W^W μ Z 0 (Y X ) n μ Z 0 (Y X ) =0 n X 0 Z ^WZ 0 Y = X 0 Z ^WZ 0 X ^ GMM = (X 0 Z ^W Z 0 X) ) 1 X 0 Z ^W Z 0 Y 1 ^ IV = X 0 Z(Z 0 Z) 1 Z 0 X 1 X 0 Z(Z 0 Z) 1 (Z 0 Y )

34 GMM (cont.) Intriguing, so if I choose ^W =(Z 0 Z) 1 then ^ GMM = ^ IV In fact, since ^G = G = Z 0 X then under homoscedasticity Ð=¾ 2 (Z 0 Z) and hence p n( ^ 0) d! N(0; (G 0 WG) 1 G 0 W ÐWG(G 0 WG) 1 ) p d n( ^ 0) )! N(0;¾ 2 (X 0 Z(Z 0 Z) 11 Z 0 X) 11 X 0 Z(Z 0 Z) 11 (Z 0 Z)(Z 0 Z) 11 Z 0 X(X 0 Z(Z 0 Z) 11 Z 0 X) p n( ^ 0) d! N(0;¾ 2 (X 0 Z(Z 0 Z) 1 Z 0 X) 1 ) p n( ^ 0) d! N(0;¾ 2 (X 0 P z X) 1 )

35 GMM (cont.) Ask yourself, why (and when) would choosing be optimal? ^W =(Z 0 Z) 1 In general, we know that the optimal choice of weighting matrix is Ã! 11 1 nx ^W = g i (y i ;X i ;Z i ; n ^ )g i (y i ;X i ;Z i ; ^ ) 0 i=1 ^W p! W = E(g i ( )g i ( ) 0 )=E(Z 0 iz i ² 2 i )

36 A Recap: A Lot to Digest Take GMM as the primitive way of thinking about the instrumental variable problem and TSLS as a special case Under just-identification (and linearity in the regression), things are simple: the choice of weighting matrix is irrelevant because we have the same number of moments and parameters (unique solution). Hence ^ IV = ^ GMM =(Z 0 X) 1 (Z 0 Y )

37 Recap Continued In the over-identified case we have more moment conditions than parameters. Hence the weighting matrix matters. Under certain conditions, ^ TSLS = ^ GMM =(X 0 P z X) 1 (X 0 P z Y ) And from the usual GMM results p n( ^ 0) d! N(0;¾ 2 (X 0 P z X) 1 ) But, there are very specific assumptions for this to work. Two critical assumptions about the instruments

38 Assumptions on the Instruments 1. Instrument Relevance: it is easiest to think about why this matters in the context of TSLS. If the Z do not explain the X, then in the first stage regression ^X X = Z +U we get lousy for the second stage. 2. Instrument Exogeneity: it better be the case that E(Z 0 ²)=0, otherwise there is no advantage with respect to using the X themselves.

39 Distribution of GMM Estimator Assume that ^W! p W and let: E(ZiX 0 i )=Q. Further assume that Then: Hence: n( )! N(0; V ) Ð=E(Z 0 iz i ² 2 i )=E(g 0 ig i )whereg i = Z 0 i² i μ μ 1 1 n X0 Z ^W n Z0 X μ μ 1 n X0 Z ^W p 1 Z 0 ² n p ( ^ ) d N(0 V ) p! Q 0 WQ d! Q 0 WN(0; Ð) V =(Q 0 WQ) 1 (Q 0 W ÐWQ)(Q 0 WQ) 1

40 Efficient GMM If then: W =Ð 1 p n( ^ ) d! N(0; (Q 0 Ð 1 Q) 1 ) In small samples: Ã 1 n! 1 X W ^W = (^g (g ¹g) 0 (^g ¹g) ^g 0^² n i (g i with g i = Z i ² i i=1 And in the linear model under some assumptions one could use W ^W = (Z 0 Z) 11 as a first stage estimator for ^ and hence construct ^² i and thus ^g i

41 Assessing IV Instrument t Relevance Let s think back to TSLS. The just-identified case: ^ IV =(Z 0 X) 1 (Z 0 Y ) In the special case: and E(z 2 i ² 2 i )=¾ 2 ² ¾ 2 z y i = 0 + 1x i + ² i with E(z i ² i ) =0 ^ IV = S ZY S ZX p! ¾ zy ¾ zx with V ( ^ IV )= ¾2 ² ¾ 2 z (¾ zx ) 2 So, if ¾ zx! 0 then ^ IV! 1 but V ( IV ^ IV )! 1 as well

42 What Happens Asymptotically When E(z ix 0 i )=0 then by the CLT 1 nx p z 0 d n ix i! N1» N(0;E(zi 2 x 2 i )) i=1 1 nx p z 0 d n i ² i! N2» N(0;E(zi 2 ² 2 i )) n i=1 Therefore: ^ = 1 n P n p n i=1 z0 i ² i 1 P n p n i=1 z0 i x i d! N 2» Cauchy N 1

43 Weak Identification The Cauchy distribution has a mean, variance, and in fact higher moments, that are not defined (although its mode and its median are). Let s think of a less extreme case, where there is correlation between the instrument and the endogenous variable, but it is weak. One way to do this asymptotically (in a one variable-one instrument case: y i = x i + ² i x i = z i + u i = n 1=2 c

44 Weak Identification (cont.) Clearly, this device assures that asymptotically the instrument and the endogenous variable are uncorrelated. Specifically, n n n 1 nx p z n ix 0 i = 1 nx p z n i nx p z i u i n i=1 = 1 n i=1 nx z i 2 c + p 1 n i=1 d! Qc + N 1 i=1 nx z i u i i=1 ^ d d! N 2 Qc + N 1

45 Remarks ^ is inconsistent for The asymptotic distribution of ^ ^ is non-normal Standard t-test have non-standard distributions

46 Example: Sampling Distribution of TSLS with weak instruments t Light line: strong instruments Dark line: weak instruments

47 Some things to check in TSLS One way to check for the strength of the instruments is with the F-stat of the first stage n regressions X = Z + U n k n l l k n k At a minimum, you should reject the null that (excluding any included exogenous variables) they should be jointly zero. Staiger and Stock (1997) and Stock and Wright (1998) are two standard references.

48 Overidentification Test (a.k.a J-Test) We have checked one of the conditions for IV to work: instrument relevance To check the other, exogeneity, we rely on having more moment conditions than parameters. The reason is that then we have a situation where, if the model is correctly specified, then all moment conditions should be satisfied. If it is not, then some will be violated.

49 Overidentification Test (cont.) Recall the GMM objective function: ^Q n ( ) = g n ( ) 0 ^Wgn ( ) where 1 n n i=1 nx gi (y i ;X i ;Z i ; ) =g n ( ) = Z0 (Y X ) n =0 when then only as and the model is correctly specified because the moment conditions only hold in populations, not in sample.

50 Overidentification Test (cont.) The specification test on the model is therefore based on the distance between the sample moment conditions and zero. It is important to note that to apply the test one must use the optimal weighting matrix. This is because we rely on the asymptotic result that 1 nx 1 nx d pn g i ( ) = p Z n i=1 n i² 0 i! N(0; Ð) i=1 Ð=E(g 0 = E(ZiZ 0 ² 2 i ( ) g i ( )) ( i i i )

51 Overidentification Test (cont.) Hence: as long as n ^Q n ( ^ ) = ng n ( ^ ) 0 ^Wgn ( ^ ) d! Â 2 l k ^W = 1 X n gi ^ ) 0 ^ ) = 1 X n 0 W g i^² 2 n i ( ) g i ( ) Z i=1 n i Z i ² i i=1 Remarks: Careful, this test tends to have low power.

52 The Distance Statistic Hypothesis tests could be constructed with the asymptotic covariance GMM estimate and the Wald principle, as we have often seen. However, if the hypotheses are nonlinear, it is often better to use directly the GMM criterion function. Suppose we want to test H = k r 0 : h( ) 0 for h : R! R

53 The Distance Statistic Let the estimates under the alternative be ^ =argmin Let the estimates under the null be ~ =argmin ~Q n ( ) Let h( )=0 ^Q n ( ) D = n( Q ~ n ( ~ ) ^Q n ( ^ )) Then D 0andD! d  2 r. Further, if h is linear then D equals the Wald statistic.

54 Fixed-effects model of cigarette demandd ln( Q cigarettes it ) = α i + β 1 ln( P cigarettes it ) + β 2 ln(income it ) + u it i = 1,,48, t = 1985, 1986,,1995 α i reflects unobserved omitted factors that t vary across states t but not over time, e.g. attitude towards smoking cigarettes Still, corr(ln( ( P it ),u it) ) is plausibly nonzero because of supply/demand interactions Estimation strategy: Use panel ldata regression methods to eliminate i α i Use TSLS to handle simultaneous causality bias Use T = 2 with changes ( changes method) look at long-term response, not short-term dynamics (short- v. long-run elasticities)

55 The changes method (when T=2) T One way to model long-term effects is to consider 10-year changes, between 1985 and 1995 Rewrite the regression in changes form: cigarettes cigarettes ln( Q i1995 ) ln( Q i1985 ) cigarettes cigarettes = β 1 [ln( P i1995 ) ln( P i1985 )] +β 2 [ln(income i1995 ) ln(income i1985 )] + (u i1995 u i1985 ) Create 10-year change variables, for example: 10-year change in log price = ln(p i1995 ) ln(p i1985 ) Then estimate the demand elasticity by TSLS using 10-year changes in the instrumental variables

56 STATA: Cigarette demand First create 10-year change variables 10-year change in log price = ln(p it ) ln(p it 10 ) = ln(p it /P it 10 ). gen dlpackpc = log(packpc/packpc[_n-10]); _n-10 is the 10-yr lagged value. gen dlavgprs = log(avgprs/avgprs[_n-10]);. gen dlperinc = log(perinc/perinc[_n-10]);. gen drtaxs = rtaxs-rtaxs[_n-10];. gen drtax = rtax-rtax[_n-10];. gen drtaxso = rtaxso-rtaxso[_n-10];

57 Use TSLS to estimate the demand elasticity by using the 10-year changes specification Y W X Z. ivreg dlpackpc p dlperinc (dlavgprs = drtaxso), r; IV (2SLS) regression with robust standard errors Number of obs = 48 F( 2, 45) = Prob > F = R-squared = Root MSE = Robust dlpackpc Coef. Std. Err. t P> t [95% Conf. Interval] dlavgprs dlperinc _cons Instrumented: dlavgprs Instruments: dlperinc drtaxso NOTE: - All the variables Y, X, W, and Z s are in 10-year changes - Estimated elasticity =.94 (SE =.21) surprisingly elastic! - Income elasticity small, not statistically different from zero - Must check whether the instrument is relevant

58 Check instrument relevance: compute first-stage t F. reg dlavgprs drtaxso dlperinc, r; Regression with robust standard errors Number of obs = 48 F( 2, 45) = Prob > F = R-squared = Root MSE = Robust dlavgprs Coef. Std. Err. t P> t [95% Conf. Interval] drtaxso dlperinc _cons test drtaxso; We didn t need to run test here because with m=1 instrument, the ( 1) drtaxso = 0 F-statistic is the square of the t-statistic, that is, F( 1, 45) = *5.80 = Prob > F = First stage F = 33.7 > 10 so instrument is not weak Can we check instrument exogeneity? No: l = k

59 Check instrument relevance: compute first-stage t F X Z1 Z2 W. reg dlavgprs drtaxso drtax dlperinc, r; Regression with robust standard errors Number of obs = 48 F( 3, 44) = Prob > F = R-squared = Root MSE = Robust dlavgprs Coef. Std. Err. t P> t [95% Conf. Interval] drtaxso drtax dlperinc _ cons test drtaxso drtax; ( 1) drtaxso = 0 ( 2) drtax = 0 F( ARE/ECN 2, 240A44) = > 10 so instruments aren t weak Prob > F =

60 What about two instruments (i (cig-only tax, sales tax)?. ivreg dlpackpc dlperinc (dlavgprs = drtaxso drtax), r; IV (2SLS) regression with robust standard errors Number of obs = 48 F( 2, 45) = Prob > F = R-squared = Root MSE = Robust dlpackpc Coef. Std. Err. t P> t [95% Conf. Interval] dlavgprs dlperinc _cons Instrumented: dlavgprs Instruments: dlperinc drtaxso drtax drtaxso = general sales tax only drtax = cigarette-specific tax only Estimated elasticity is -1.2, even more elastic than using general sales tax only With l > k, we can test the overidentifying restrictions

61 Test the overidentifying i restrictions ti. predict e, resid; Computes predicted values for most recently estimated regression (the previous TSLS regression). reg e drtaxso drtax dlperinc; Regress e on Z s and W s Source SS df MS Number of obs = F( 3, 44) = 1.64 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = e Coef. Std. Err. t P> t [95% Conf. Interval] drtaxso drtax dlperinc _cons test drtaxso drtax; ( 1) drtaxso = 0 Compute J-statistic, which is m*f, ( 2) drtax = 0 where F tests whether coefficients on the instruments are zero F( ARE/ECN 2, 240A 44) = 2.47 so J = = 4.93 Prob > F = ** WARNING this uses the wrong d.f. **

62 The correct degrees of freedom for the J-statistic J ti ti is l k: l k J = lf, where F = the F-statistic testing the coefficients on Z 1i,,ZZ li in a regression of the TSLS residuals against Z 1i,,Z li, W 1i,,W li. Under the null hypothesis that all the instruments are exogeneous, J has a chi-squared distribution with l k degrees of freedom Here, J = 4.93, distributed chi-squared with d.f. = 1; the 5% critical value is 3.84, so reject at 5% sig. level. In STATA:. dis "J-stat = " r(df)*r(f) " p-value = " chiprob(r(df)-1,r(df)*r(f)); J-stat = p-value = J = = 4.93 p-value l from chi-squared(1) distribution ib ti Now ARE/ECN what??? 240A

63 Tbl Tabular summary of fthese results:

64 How should we interpret the J-test rejection? J-test rejects the null hypothesis that both the instruments are exogenous This means that either rtaxso is endogenous, or rtax is endogenous, or both The J-test doesn t tell us which!! You must exercise judgment Why might rtax (cig-only tax) be endogenous? Political forces: history of smoking or lots of smokers political pressure for low cigarette taxes If so, cig-only tax is endogenous This reasoning doesn t apply to general sales tax use ARE/ECN just 240A one instrument, the general sales tax

65 The Demand for Cigarettes: Summary of Empirical i Results Use the estimated elasticity based on TSLS with the general sales tax as the only instrument: Elasticity = -.94, SE =.21 This elasticity is surprisingly large (not inelastic) a 1% increase in prices reduces cigarette sales by nearly 1%. This is much more elastic than conventional wisdom in the health economics literature. This is a long-run (ten-year change) elasticity. What would you expect a short-run (one-year change) elasticity to be more or less elastic?

Multivariate Regression: Part I

Multivariate Regression: Part I Topic 1 Multivariate Regression: Part I ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà Outline of this topic Statement of the objective: we want to explain the behavior of one variable as a

More information

ECON Introductory Econometrics. Lecture 17: Experiments

ECON Introductory Econometrics. Lecture 17: Experiments ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

An explanation of Two Stage Least Squares

An explanation of Two Stage Least Squares Introduction Introduction to Econometrics An explanation of Two Stage Least Squares When we get an endogenous variable we know that OLS estimator will be inconsistent. In addition OLS regressors will also

More information

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables Applied Statistics and Econometrics Giuseppe Ragusa Lecture 15: Instrumental Variables Outline Introduction Endogeneity and Exogeneity Valid Instruments TSLS Testing Validity 2 Instrumental Variables Regression

More information

Instrumental Variables, Simultaneous and Systems of Equations

Instrumental Variables, Simultaneous and Systems of Equations Chapter 6 Instrumental Variables, Simultaneous and Systems of Equations 61 Instrumental variables In the linear regression model y i = x iβ + ε i (61) we have been assuming that bf x i and ε i are uncorrelated

More information

Experiments and Quasi-Experiments

Experiments and Quasi-Experiments Experiments and Quasi-Experiments (SW Chapter 13) Outline 1. Potential Outcomes, Causal Effects, and Idealized Experiments 2. Threats to Validity of Experiments 3. Application: The Tennessee STAR Experiment

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics STAT-S-301 Experiments and Quasi-Experiments (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Why study experiments? Ideal randomized controlled experiments

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

Handout 12. Endogeneity & Simultaneous Equation Models

Handout 12. Endogeneity & Simultaneous Equation Models Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression

More information

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.

More information

Dynamic Panel Data Models

Dynamic Panel Data Models Models Amjad Naveed, Nora Prean, Alexander Rabas 15th June 2011 Motivation Many economic issues are dynamic by nature. These dynamic relationships are characterized by the presence of a lagged dependent

More information

Econometrics Midterm Examination Answers

Econometrics Midterm Examination Answers Econometrics Midterm Examination Answers March 4, 204. Question (35 points) Answer the following short questions. (i) De ne what is an unbiased estimator. Show that X is an unbiased estimator for E(X i

More information

8. Instrumental variables regression

8. Instrumental variables regression 8. Instrumental variables regression Recall: In Section 5 we analyzed five sources of estimation bias arising because the regressor is correlated with the error term Violation of the first OLS assumption

More information

Econometrics Problem Set 11

Econometrics Problem Set 11 Econometrics Problem Set WISE, Xiamen University Spring 207 Conceptual Questions. (SW 2.) This question refers to the panel data regressions summarized in the following table: Dependent variable: ln(q

More information

Hypothesis Tests and Confidence Intervals in Multiple Regression

Hypothesis Tests and Confidence Intervals in Multiple Regression Hypothesis Tests and Confidence Intervals in Multiple Regression (SW Chapter 7) Outline 1. Hypothesis tests and confidence intervals for one coefficient. Joint hypothesis tests on multiple coefficients

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Nonlinear Regression Functions

Nonlinear Regression Functions Nonlinear Regression Functions (SW Chapter 8) Outline 1. Nonlinear regression functions general comments 2. Nonlinear functions of one variable 3. Nonlinear functions of two variables: interactions 4.

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

Lecture 8: Instrumental Variables Estimation

Lecture 8: Instrumental Variables Estimation Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests ECON4150 - Introductory Econometrics Lecture 5: OLS with One Regressor: Hypothesis Tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 5 Lecture outline 2 Testing Hypotheses about one

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

At this point, if you ve done everything correctly, you should have data that looks something like:

At this point, if you ve done everything correctly, you should have data that looks something like: This homework is due on July 19 th. Economics 375: Introduction to Econometrics Homework #4 1. One tool to aid in understanding econometrics is the Monte Carlo experiment. A Monte Carlo experiment allows

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics STAT-S-301 Panel Data (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Regression with Panel Data A panel dataset contains observations on multiple entities

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Chapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression

Chapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression Chapter 7 Hypothesis Tests and Confidence Intervals in Multiple Regression Outline 1. Hypothesis tests and confidence intervals for a single coefficie. Joint hypothesis tests on multiple coefficients 3.

More information

Graduate Econometrics Lecture 4: Heteroskedasticity

Graduate Econometrics Lecture 4: Heteroskedasticity Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

Introduction to Econometrics. Regression with Panel Data

Introduction to Econometrics. Regression with Panel Data Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Regression with Panel Data Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 Regression with Panel

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries

More information

Introduction to Econometrics. Multiple Regression (2016/2017)

Introduction to Econometrics. Multiple Regression (2016/2017) Introduction to Econometrics STAT-S-301 Multiple Regression (016/017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 OLS estimate of the TS/STR relation: OLS estimate of the Test Score/STR relation:

More information

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panels. Chapter Introduction Autoregressive Model Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this

More information

Essential of Simple regression

Essential of Simple regression Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Handout 11: Measurement Error

Handout 11: Measurement Error Handout 11: Measurement Error In which you learn to recognise the consequences for OLS estimation whenever some of the variables you use are not measured as accurately as you might expect. A (potential)

More information

Introduction to Econometrics. Review of Probability & Statistics

Introduction to Econometrics. Review of Probability & Statistics 1 Introduction to Econometrics Review of Probability & Statistics Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com Introduction 2 What is Econometrics? Econometrics consists of the application of mathematical

More information

Linear Regression with Multiple Regressors

Linear Regression with Multiple Regressors Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests ECON4150 - Introductory Econometrics Lecture 7: OLS with Multiple Regressors Hypotheses tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 7 Lecture outline 2 Hypothesis test for single

More information

Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015

Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 This lecture borrows heavily from Duncan s Introduction to Structural

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

Empirical Application of Simple Regression (Chapter 2)

Empirical Application of Simple Regression (Chapter 2) Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap Deviations from the standard

More information

Extensions to the Basic Framework II

Extensions to the Basic Framework II Topic 7 Extensions to the Basic Framework II ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà Outline of this topic Nonlinear regression Limited Dependent Variable regression Applications of

More information

Problem Set 1 ANSWERS

Problem Set 1 ANSWERS Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval] Problem Set #3-Key Sonoma State University Economics 317- Introduction to Econometrics Dr. Cuellar 1. Use the data set Wage1.dta to answer the following questions. a. For the regression model Wage i =

More information

ECON Introductory Econometrics. Lecture 13: Internal and external validity

ECON Introductory Econometrics. Lecture 13: Internal and external validity ECON4150 - Introductory Econometrics Lecture 13: Internal and external validity Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 9 Lecture outline 2 Definitions of internal and external

More information

ECON Introductory Econometrics. Lecture 4: Linear Regression with One Regressor

ECON Introductory Econometrics. Lecture 4: Linear Regression with One Regressor ECON4150 - Introductory Econometrics Lecture 4: Linear Regression with One Regressor Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 4 Lecture outline 2 The OLS estimators The effect of

More information

Regression #8: Loose Ends

Regression #8: Loose Ends Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap of the course Introduction.

More information

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a

More information

5.2. a. Unobserved factors that tend to make an individual healthier also tend

5.2. a. Unobserved factors that tend to make an individual healthier also tend SOLUTIONS TO CHAPTER 5 PROBLEMS ^ ^ ^ ^ 5.1. Define x _ (z,y ) and x _ v, and let B _ (B,r ) be OLS estimator 1 1 1 1 ^ ^ ^ ^ from (5.5), where B = (D,a ). Using the hint, B can also be obtained by 1 1

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 7 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 68 Outline of Lecture 7 1 Empirical example: Italian labor force

More information

Nonrecursive models (Extended Version) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015

Nonrecursive models (Extended Version) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 Nonrecursive models (Extended Version) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 NOTE: This lecture borrows heavily from Duncan s Introduction

More information

Introduction to Econometrics. Multiple Regression

Introduction to Econometrics. Multiple Regression Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 OLS estimate of the TS/STR

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

CHAPTER 6: SPECIFICATION VARIABLES

CHAPTER 6: SPECIFICATION VARIABLES Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero

More information

Instrumental Variables and the Problem of Endogeneity

Instrumental Variables and the Problem of Endogeneity Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38 Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] =

More information

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity

More information

Lecture 14. More on using dummy variables (deal with seasonality)

Lecture 14. More on using dummy variables (deal with seasonality) Lecture 14. More on using dummy variables (deal with seasonality) More things to worry about: measurement error in variables (can lead to bias in OLS (endogeneity) ) Have seen that dummy variables are

More information

Control Function and Related Methods: Nonlinear Models

Control Function and Related Methods: Nonlinear Models Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

Question 1 [17 points]: (ch 11)

Question 1 [17 points]: (ch 11) Question 1 [17 points]: (ch 11) A study analyzed the probability that Major League Baseball (MLB) players "survive" for another season, or, in other words, play one more season. They studied a model of

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 5 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 44 Outline of Lecture 5 Now that we know the sampling distribution

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours

More information

Ec1123 Section 7 Instrumental Variables

Ec1123 Section 7 Instrumental Variables Ec1123 Section 7 Instrumental Variables Andrea Passalacqua Harvard University andreapassalacqua@g.harvard.edu November 16th, 2017 Andrea Passalacqua (Harvard) Ec1123 Section 7 Instrumental Variables November

More information

Econ 510 B. Brown Spring 2014 Final Exam Answers

Econ 510 B. Brown Spring 2014 Final Exam Answers Econ 510 B. Brown Spring 2014 Final Exam Answers Answer five of the following questions. You must answer question 7. The question are weighted equally. You have 2.5 hours. You may use a calculator. Brevity

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

ECONOMETRICS HONOR S EXAM REVIEW SESSION

ECONOMETRICS HONOR S EXAM REVIEW SESSION ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes

More information

Econometrics. 9) Heteroscedasticity and autocorrelation

Econometrics. 9) Heteroscedasticity and autocorrelation 30C00200 Econometrics 9) Heteroscedasticity and autocorrelation Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Heteroscedasticity Possible causes Testing for

More information

ECON 836 Midterm 2016

ECON 836 Midterm 2016 ECON 836 Midterm 2016 Each of the eight questions is worth 4 points. You have 2 hours. No calculators, I- devices, computers, and phones. No open boos, and everything must be on the floor. Good luc! 1)

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Department of Economics University of Wisconsin-Madison September 27, 2016 Treatment Effects Throughout the course we will focus on the Treatment Effect Model For now take that to

More information

Quantitative Methods Final Exam (2017/1)

Quantitative Methods Final Exam (2017/1) Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with

More information

Lab 6 - Simple Regression

Lab 6 - Simple Regression Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello

More information

Chapter 6: Linear Regression With Multiple Regressors

Chapter 6: Linear Regression With Multiple Regressors Chapter 6: Linear Regression With Multiple Regressors 1-1 Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

Greene, Econometric Analysis (7th ed, 2012)

Greene, Econometric Analysis (7th ed, 2012) EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 2 3: Classical Linear Regression The classical linear regression model is the single most useful tool in econometrics.

More information

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical

More information

Extensions to the Basic Framework I

Extensions to the Basic Framework I Topic 5 Extensions to the Basic Framework I ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà Outline of this topic Heteroskedasticity: reminder of OLS results and White (1980) corrected standard

More information

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as

More information

Practice exam questions

Practice exam questions Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.

More information

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N 1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.

More information

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11 Econ 495 - Econometric Review 1 Contents 1 Linear Regression Analysis 4 1.1 The Mincer Wage Equation................. 4 1.2 Data............................. 6 1.3 Econometric Model.....................

More information

Lecture 5. In the last lecture, we covered. This lecture introduces you to

Lecture 5. In the last lecture, we covered. This lecture introduces you to Lecture 5 In the last lecture, we covered. homework 2. The linear regression model (4.) 3. Estimating the coefficients (4.2) This lecture introduces you to. Measures of Fit (4.3) 2. The Least Square Assumptions

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information