Single-Equation GMM (II)

Size: px
Start display at page:

Download "Single-Equation GMM (II)"

Transcription

1 Single-Equation GMM (II) Guochang Zhao RIEM, SWUFE Week 12, Fall 2016 December 8, / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

2 Overidentification test If the equation is exactly identified, it is possible to choose δ so that all the elements of sample moments g n ( g n ) are zero and the distance (objective function) is zero. If the equation is over-identified, then the distance cannot be exactly set to zero, but we we would expect the minimized distance to be close to zero. It turns out that, if the weighting matrix Ŵ is chosen optimally so that plim Ŵ = S 1, then the minimized distance is asymptotically chi-squared. 2 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

3 oposition Overidentificaiton 3.6 (Hansen's test: test of Hansen s overidentifying J test restrictions (Hansen, 1982 ppose there is available a consistent estimator, $, of S (= E(gig:)). Und Suppose there is available a consistent estimator, Ŝ, of sumptions , S(= E(g i g i )). Under Assumptions 1-5: This is specification test. The test is not consistent against failures of the orthogonality test. J test in small samples exceeds the normal size (i.e., the test rejects too often). 3 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

4 GMM Testing subsets of orthogonality conditions I : } I : } [ [ Divide the K instruments into two groups: } K1 } rows, K1 rows, xi = xi = K - K1 rows. K - K1 rows. e model we where wish to xtest it is are suspect. we wish to test is The part of the model we wish to test is n is testable if there are at least as many nonsuspect instruments as ficients able if so there that This are K1 restriction p at L. least The basic is many testable idea nonsuspect is to compare if thereinstruments two arej atstatistics leastas as many rate o that GMM K1 estimators p nonsuspect L. The of basic the instruments same idea coefficient is to compare as there vector are two 6, one coefficients J using statistics only so that ts included in xil, and the other using also the suspect instruments xi2 M estimators K 1 of the L. same coefficient vector 6, one using only xil. If the inclusion of the suspect instruments significantly increases ed in xil, and If the restriction other using is also true, the then suspect theinstruments inclusion ofxi2 the suspect, that is a good reason for doubting the predeterrninedness of xi2. ance e inclusion with the instruments of partition the suspect of cannot xi, instruments the sample significantly orthogonality significantly increase conditions increases the J statistic. n good be written reason for doubting the predeterrninedness of xi2. th 4 / the 35 partition Guochang of Zhao xi, the RIEM, sample SWUFE orthogonality Single-Equation conditions GMM

5 gn (8) and S can be written as where where where where where In In particular, In particular, gln(6") gln(6") can can be written be written as In particular, gln(6") can be written as as In particular, gln(6") can be written as where where where where where For For a consistent a consistent estimate estimate 3 of 3 of S, S, the the efficient efficient GMM GMM estimator estimator using using all all the the K K For 5 instruments For / 35 a consistent a Guochang and estimate its Zhao associated 3 RIEM, of J S, SWUFE statistic the efficient have Single-Equation already GMM been GMM estimator derived using using this all all the and the K the K

6 previous section. Reproducing them, Chapter 3 For a consistent estimate Ŝ of S, the efficient GMM estimator using previous all the section. K instruments Reproducing and them, its associated J statistic: The efficient GMM estimator of the same coefficient vector 6 using only the first K1 instruments and its associated J The For efficient a consistent GMM estimator estimate of the ˆ statistic are obtained by replacing xi by xil in S 11 same of S 11 coefficient, the efficient vector GMM 6 using estimator these expressions. So only the first K1 using instruments all the and K 1 its instruments associated J and statistic its associated are obtained J by statistic: replacing xi by xil in these expressions. So A where (Testing Sll is a consistent a subset of estimate orthogonality of Sll. conditions): Suppose assumptions 1-3 The hold. A test is Let based x i1 on be the a subvector following proposition of x i, and specifying strengthen the Assumption asymptotic 4 distriwhere Sll is a consistent estimate of Sll. bution by requiring of J - J1 that (the the proof rank is left condition as an optional for identification exercise). is satisfied for x i1 The (So test E(x is based i1 z on the following proposition specifying the asymptotic distrii1 ) is of full column rank). Then, for any consistent bution of J - J1 (the proof is left as an optional exercise). Proposition estimator 3.7 Ŝ of (testing S and a Ŝ11 subset of S of 11, orthogonality conditions13): Suppose Assumptions hold. C Let Jxil Jbe 1 a subvector d χ 2 (K of K 1 xi, ) and strengthen Assump- Proposition 3.7 (testing a subset of orthogonality conditions13): Suppose tion 3.4 by requiring that the rank condition for identification is satisfied for xil (SO Assumptions where K = #x i and hold. K Let = #x xil i1 be. a subvector of xi, and strengthen Assump- E(xil zi) is of full column rank). Then, for any consistent estimators g of S and gl I tion 3.4 by requiring that the rank condition for identification is satisfied for xil (SO 6 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

7 Testing a subset of orthogonality conditions Remarks The choice of Ŝ and Ŝ11 does not matter asymptotically as long as they are consistent. In finite sample, the test statistic C can be negative. The negative C can be avoided if the same Ŝ is used throughout: (1) Do the efficient two-step GMM with full instruments x i to obtain Ŝ from the first step, ˆδ and J from the second step. (2) Extract the submatrix Ŝ11 from Ŝ obtained from (1), calculate ˆδ using this Ŝ11 and J 1. Then take the difference in J. 7 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

8 LR, Wald and LM tests Consider maximum likelihood estimation of a parameter θ and a test of hypothesis H 0 : c(θ) = 0: Likelihood ratio test If the restriction c(θ) = 0 is valid, then imposing it should not lead to a large reduction in the log-likelihood function. Wald test If the restriction is valid, then c(θ MLE ) should be close to zero since the MLE is consistent. Lagrange multiplier test If the restriction is valid, then the restricted estimator should be near the point that maximizes the log-likelihood. Test ideas The LR test is based on the distance between restricted and unrestricted maximum log likelihood; the Wlad test is based on the distance between c(θ MLE ) and 0; while LM test is based on the distance between slope of the likelihood function at the restricted estimator and 0. 8 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

9 ln L( ) d ln L( ) d c( ) d ln L( ) d ln L Likelihood ratio ln L R ln L( ) Lagrange multiplier c( ) Wald 0 ^ R ^ MLE 9 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

10 LR, Wald and LM tests (cont.) LR test: LR statistic = 2 ln λ d χ 2 #r where λ = ˆL R /ˆL U, and #r is the number of restrictions. Wald test: [ ] 1 W = c(ˆθ) Avar(c(ˆθ)) c(ˆθ) d χ 2 where #r is the number of restrictions. LM test: ( ) ( ) ln L(ˆθ R ) LM = ˆθ [I(ˆθ R )] 1 ln L(ˆθ R ) R ˆθ d χ 2 #r R where I(ˆθ R ) is the Hessian matrix, #r is the number of restrictions. 10 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM #r

11 Hypothesis testing by the Likelihood-Ratio principle The three principle to construct test statistics can be easily generalized to GMM. The restricted efficient GMM estimator is defined as restricted efficient GMM : δ(ŝ) 1 argmin J( δ, Ŝ 1 ) s.t. H 0 δ The LR principle suggests that LR J( δ(ŝ 1 ), Ŝ 1 ) J(ˆδ(Ŝ 1 ), Ŝ 1 ) d χ 2 #r where #r is the number of restrictions. 11 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

12 Implications of conditional homoskedasticity er conditional homoskedasticity, the matrix of fourth moments S (= = E(E? Assumption xixi)) can be 7 written : Conditional as a product homoskedasticity of second moments: E(ε 2 x i ) = σ 2., = E(xix:). UnderAs conditional in Chapter 2, homoskedasticity: this decomposition of S has several impli- S E[g i g i] = E[ε 2 x i x i] = σ 2 Σ xx, S is nonsingular by Assumption 3.5, this decomposition implies that a2 > Ex, is where nonsingular. Σ xx = E(x i x i ). stimator Since exploiting S is this nonsigular, structure of Σ xx S is is nonsigular. The estimator exploiting this structure of S is where ˆσ G2 is some consistent 2 is some consistent estimator to be specified. estimator to be specified below. By ergodic station- S, +, X,. Thus, provided that G2 is consistent, we do not need the -moment 12 / 35 assumption Guochang Zhao (Assumption RIEM, SWUFE 3.6) for g Single-Equation to be consistent. GMM

13 Implications of conditional homoskedasticity (cont.) Efficient GMM Becomes 2SLS In the efficient GMM estimation, the weighting matrix is g-'. If we set to (3.8.2), Efficient the GMM GMM estimator becomes 2SLS which For does the not efficient depend on GMM G2. In the estimator, general case, there is whole no need point of tothe dofirst thestep in the efficient first step two-step under GNIM conditional was to obtain homoskedasticity, a consistent estimator because of S. the Under conditional second homoskedasticity, step estimator there collapse is no need to to the do the GMM first step estimator because with the second- S xx, used for Ŝ, δ(s 1 xx). This estimator is called the TWo-Stage Least Squares. 13 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

14 timator." he 3): ML counterpart of 2SLS, called the "limited-information maximum likelihood Implications of conditional homoskedasticity (cont.) stimator." The expression for ~ var(b~~~~) can be obtained by substituting (3.8.1) into.5.13): The Efficient expression GMM for ~ becomes var(b~~~~) 2SLS can be (cont.) obtained by substituting (3.8.1) into ): For 32, consider the sample variance of the 2SLS residuals: The asymptotical variance of the 2SLS is ural estimator of this is natural estimator of this is A natural estimate for this is natural estimator of this is 2, consider (Some the authors sample divide For ˆσ variance the sum of of the squares 2SLS by residuals: n - L, not by n, to calculate Z2.) By Proposition 3.2, 2, consider (3.8.6) the -+, sample a2 if E(zizi) variance exists of the and 2SLS is finite. residuals: Thus defined in r 32, (3.8.2) consider with the this sample 32 is consistent variance of for the S. 2SLS residuals: or 32, consider the sample variance of the 2SLS residuals: Substituting (3.8.2) into (3.5.15) and (3.5.16), the t-ratio and the Wald statistic become t ratio and Wald statistic e authors divide the sum of squares by n - L, not by n, to calculate Z2.) By ome sition authors 3.2, (3.8.6) divide -+, the sum a2 if of E(zizi) squares exists by n - and L, is not finite. by n, Thus to calculate defined Z2.) in By Some authors divide the sum of squares by n - oposition ) with this 3.2, 32 is (3.8.6) consistent L, not by n, to calculate Z2.) By -+, a2 for if S. E(zizi) exists and is finite. Thus defined in roposition 3.2, (3.8.6) -+, a2 if E(zizi) exists and is finite. Thus defined in.8.2) bstituting with this (3.8.2) 32 is into consistent (3.5.15) for and S. (3.5.16), the t-ratio and the Wald statistic 3.8.2) e with this 32 is consistent for S. Substituting (3.8.2) into (3.5.15) and (3.5.16), the t-ratio and the Wald statistic Substituting (3.8.2) into (3.5.15) and (3.5.16), the t-ratio and the Wald statistic come 14 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

15 Implications of conditional homoskedasticity (cont.) Under Assumptions , the 2SLS estimator (3.8.3) is consistent. Assumption 3.5 is added, the estimator is asymptotically normal with the asy totic 15 / 35 variance Guochang given Zhaoby (3.5.1) RIEM, SWUFE with W Single-Equation = (a2~,,)-'. GMMIf Assumption 3.7 (co Becomes Sargan's Statistic 8 J Becomes Sargan s Statistic Chapte hen is set When Ŵ to is (ii2sxx)-', set to the distance defined in (3.4.6) becomes (ˆσ2 S xx ) 1, the distance becomes position 3.6 then implies that the distance evaluated at the efficient GMM es tor under conditional homoskedasticity, ~ ~SLS, is asymptotically chi-squar is distance is called Sargan's statistic (Sargan, 1958): This distance is called Sargan s Statistic: (sxy - ~x~i2sls)~sx;: (sxy - ~xzi2sls) Sargan's statistic = n.. ( ~his estimator was first proposed by Theil (1953). summarize our results so far as oposition 3.9 (asymptotic properties of 2SLS): &2

16 (a) Under Assumptions , the 2SLS estimator (3.8.3) is consistent. If Assumption 3.5 is added, the estimator is asymptotically normal with the asymptotic variance given by (3.5.1) with W = (a2~,,)-'. If Assumption 3.7 (conditional homoskedasticity) is added to Assumptions , then the estimator Asymptotic properties of 2SLS: is the efficient GMM estimator. Furthermore, Underif Assumption E(zi z:) exists 1-4, and the is finite, 2SLS estimator is consistent. If l5 then Assumption 5 is added, the estimator is asymptotically normal. (b) the If asymptotic Assumption variance 7 is is added consistently to Assumptions estimated by 1-5, (3.8.5), then estimator is (c) te in the (3.8.7) efficient +d N GMM (0, I), estimator. W in (3.8.8) +d X2(#r), and Implications of conditional homoskedasticity (cont.) The asymptotical variance is consistently estimated by... (4 the Sargan statistic in (3.8.10) +d (K- L). t l d N(0, 1), W d χ 2 (#r) Proposition The Sargan 3.8 states Statistic that the LR d χ 2 statistic, (K L). which is the difference in J with and without the imposition of the null hypothesis, is asymptotically chi-squared. Since J LR can be test written in 2SLS as (3.8.9), we have If the hypothesis is linear, the LR is numerically equal to the Wald statistic W. where 8 is the restricted 2SLS estimator which minimizes (3.8.9) under the null hypothesis.16 In Proposition 3.8, the use of the same guaranteed the statistic 16 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

17 Small properties of 2SLS There is fairly large literature on the finite-sample distribution of the 2SLS estimator. The analytical results, however, are not very useful for empirical researchers, because they are derived under the restrictive assumptions of fixed instruments and normal errors. If the instrumental variable is weak, a mass of the distribution of the sampling error ˆδ 2SLS δ remain apart from zero until the sample size gets really large. How weak is weak? The F-statistic for the instrument variable should be larger than 10 (rule of thumb). 17 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

18 one instrument.) Alternative Derivations of 2SLS Alternative Derivations of PSLS If we define data matrices as Define data matrix as then it is easy to see that the 2SLS estimator and associated statistics can be Chapter Then, the 2SLS estimator can be written as written 3 as where P where r X(XIX)-'X' P = X(X is the X) projection 1 X is matrix, the projection matrix..,.. A -2 EE a = - where P E y - Z62SLS. n 18 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

19 Alternative Derivations of 2SLS (cont.) 19 / 35 Using Guochang this formula, Zhao RIEM, we can SWUFE provide Single-Equation two other derivations GMM of the 2SLS estimato where TheP estimate r X(XIX)-'X' for asymptotical is the projection variance, matrix, t-ration, Wald statistic and overidentification test...,.. -2 EE A a = - where P E y - Z62SLS. n P'PP Sargan's statistic = -. 62

20 P'PP 2SLS as an IV estimator Sargan's statistic = g this formula, we can provide two other derivations of the 2SLS estimator. Now Generate we generate L instruments, those L instruments ẑ i, with from the xi as Kfollows. raw instruments, The t-th instrument is S as an the IV fitted x i Estimator : regress value from z il regressing on x i, and zit obtain (the t-th the regressor) fitted on values xi. The n-vector of fitted value ẑ l is = x(x'x)-' (X X 1 Xfze, )X zwhere l for each is the l. n-vector of the t-th regressor (i.e., the t-th i (L x 1) be the vector of L instruments (which will be generated from xi as column of Z). Therefore, the n x L data matrix of instruments is ribed below) Therefore, the L the regressors, n L data and let matrix 2 be of the the n x instruments L data matrix is of those A truments. So the i-th row of 2 is 2;. The IV estimator of 6 with li serving as Z = (x(x'x)-i X1zl...., x(x'x)-'x'z~) = x(x'x)-'x'z = PZ, (3.8.13) ments is, by (3.4.4), where Then P is theprojection IV estimator matrix. issubstituting this into the IV format (3.8.12) yields the 2SLS estimator (see (3.8.3')). we generate Replace those L Ẑ instruments with PZ from the above xi as follows. equation, The we t-th get the instrument 2SLS is tted value from estimator. regressing zit (the t-th regressor) on xi. The n-vector of fitted is x(x'x)-' Xfze, where is the n-vector of the t-th regressor (i.e., the t-th n of Z). Therefore, the n x L data matrix of instruments is 20 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

21 2SLS as two regressions First stage: regress the L regressors on x i and obtain fitted values ẑ i. Second stage: regress y i on these fitted values. For those regressors that are predetermined, there is no need to carry out the first stage. This derivation of the 2SLS is useful and it justifies the naming of the estimator, but there is a pitfall - the OLS standard errors and estimated asymptotic variance from the second stage cannot be used for statistical inference. 21 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

22 When skedastic, regressors there is a close are connection predetermined between the distance function J for efficient GMM and the sum of squared residuals (SSR). From (3.8.9') on page 230, - y'py - 2y1zii + S1zlzS - (since PZ = Z when zi c xi) &2 The efficient GMM estimator is OLS; The restrictive GMM is restrictive OLS; The Wald statistic can be calculated as the difference in SSR with and without the restriction, normalized to ˆσ / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

23 GMM estimator is the 2SLS estimator under conditional homoskedasticity, Testing a subset of orthogonality conditions GMM estimators are given by = (z'pz)-'z'p~ with P = x(x'x)~'x', 6= (Z'P~Z)~'Z'P~~ with PI =xi(x',xi)-'x',. And the C statistic becomes the difference in two Sargan statistics: where -,, i'i i-y-z6, i=y-z6, a G-. n CAs isseen guaranteed for the case to be without nonnegative conditional in finite homoskedasticity, samples if thec same is guarantee matrix nonnegative Ŝ is used in finite throughout. samples if the same matrix is used throughout, whic conditional homoskedasticity amounts to using the same estimate of the er ance, C2, to deflate both i'pi and ifpl i, as in (3.8.17). By Proposition / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

24 Hausman test ermore, u have as been you exposed will be asked to the to Hausman show (Analytical test in the Exercise ML context, 9), under you can rechesis e MM this guaranteeing as the GMM C version to be asymptotically of the Hausman chi-squared, principle.) By Proposition , (i) is consistently estimated by ermore, as you will be asked to show (Analytical Exercise 9), n GMM ˆδ is asymptotically more efficient than δ under because ˆδ 233 exploits thesis guaranteeing C to be asymptotically chi-squared, more orthogonality conditions. Therefore een rthermore, exposed as to you the will Hausman be asked test in to the show ML (Analytical context, you Exercise can rec- 9), under Avar( δ) Avar(ˆδ). pothesis GMM guaranteeing version of the C Hausman to be asymptotically principle.) By chi-squared, Proposition 3.9, larly, a consistent estimator of Avar(6) is istently estimated een exposed And to the we by Hausman also have test in the ML context, you can recthe GMM version of the Hausman principle.) By Proposition 3.9, sistently estimated by e been exposed In addition, to the Hausman we alsotest have, as in the calculation of the C statistic, the in same the ML estimate, context, 6', you is used can through- recnsistent estimator of Avar(6) is n as order the GMM to guarantee version the of test the statistic Hausman below principle.) to be nonnegative. By Proposition The resulting 3.9, onsistently ator of ~var(8 estimated - i) is by nsistent estimator of Avar(6) is calculation of the C statistic, the same estimate, 6', is used throughguarantee the test statistic below to be nonnegative. The resulting man calculation consistent ar(8 and - i) Taylor of estimator The is (1980) above the C statistic, of have Avar(6) matrix shown in the same is finite that (1) sample this matrix is positive in finite semidefinite samples is estimate, 6', is used throughive semidefinite but not (nonnegative necessarily definite) nonsingular. but not necessarily nonsingular, but o guarantee the test statistic below to be nonnegative. The resulting var(8 r any generalized - i) is inverse17 of this matrix, the Hausman statistic 24 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

25 Hausman test (cont.) Hausman and Taylor (1980) have shown that (1) this matrix in finite samples is positive semidefinite (nonnegative definite) but not necessarily nonsingular, but (2) for any generalized inverse17 of this matrix, the Hausman statistic Hausman test: is invariant Thisto isthe invariant choice tof the the choice generalized of generalized inverse inverse and is and asymptotically is chisquared with asymptotically min(k - K1, chi-squared L - s) degrees with min(k of freedom, k 1, Lwhere s) degrees of freedom, where s is the number of regressors which are retained s = #zi asn instruments xil = number in of x i1 regressors. which are retained as instruments in xi,. The relationship between H and C: If K K 1 L s so that both H and C have the same degrees What of is freedom, the relationship then H between = C (numerically C and H under equal). conditional Otherwise, homoskedasticity? the two It can be statistics shown (see arenewey, numerically 1985) different that: and have different degrees of freedom. "A generalized inverse, A-, of a matrix A is any matrix satisfying AA-A = A. If A is square and nonsin- 25 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

26 Testing conditional homoskedasticity The nr 2 statistic for the 2SLS residuals does not have the desired asymptotic distribution. A test statistic that is asymptotic chi-squared is available but is extremely cumbersome. Testing for serial correlation A modified Box-Pierce Q statistic that is asymptotically chi-squared under the null of no serial correlation. 26 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

27 Returns from schooling Mincer equation: ln W = β 0 + β 1 edu + β 2 exp + β 3 exp ɛ It is obvious that the β 1 is positive, but it can be due to the the causal effect of education on wage the correlation between education and unobserved ability How to isolate the causal effect from the ability? 27 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

28 Table 3.1: Characteristics of Young Men from the National Longitudinal Survey Variable Means and Standard Deviations (in parentheses) Sample size 1,362 Age in 1969 Schooling in years in 1969 (S) Logarithm of hourly wages (in cents) in 1969 (LW) 5.68 (0.40) Score on the Knowledge of the World of Work test (KWW) 35.1 (7.9) IQ score (ZQ) Experience in years in 1969 SOURCE: Griliches (1976, Table 1). 28 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

29 Omitted Variable Bias variables bias Sometimes Omitted the Variable data set Bias you work with has no measures of A (this is true for the og Wage Current Equation Sometimes Population the data Survey, set you for work example). with has What no measures is the consequence of A (this is of true ignoring for the wage ability equation Current estimated in the literature is the semi-log form: The by omitting model Population in A Griliche from Survey, the for wage (1976): example). equation? What We is know the consequence from Section of ignoring 2.9 that the regression ability by of omitting LW on A a constant, from the wage S, and equation? h provides We a consistent know from estimator Section 2.9 of that the corresponding the regression least of squares LW on projection, a constant, S, which and h can provides be written a consistent as estimator of the corresponding least squares projection, which can be written as is the log -* wage When rate there for the isindividual, no a measure S is schooling on ability: in years, A is a E (LW-* I 1, S, h) = E*(a + BS ability, h is E the (LW vector I 1, S, of h) observable = E*(a + + ya characteristics BS + + 6'h ya + + E I 1, S, h) (by (3.9.1)) 6'h of + E the I 1, individual S, h) (by (3.9.1)) perience and location dummies), =a +BS+ =a +BS+ 6 is 6'h+ the 6'h+ associated YE*(A 1 1, YE*(A vector S, h) 1 1, S, h) of +E*(E +E*(E coeffi- I 1, S, h). (3.9.2) I 1, S, h). (3.9.2) E is the unobservable error term with zero mean (in this section, the Since ubscript i the regressors will be the dropped regressors in (3.9.1) for in notational (3.9.1) are are all predetermined, all simplicity). predetermined, we The we have semi-log have E(E) Since E(ε) = 0, E(Sε) = 0, and E(h ε) = 0, we have E(E) = 0, E(S 0, E(S E) E) = 0, and n for schooling 0, E(h-E) and S E(h-E) is often = Ê justified 0. So E*(E by ( appealing 1, S, h) = to 0. the Writing well-established Z*(A ( 1, S, h) 81 13s = 0. So E*(E ( 1, S, h) = 0. Writing Z*(A ( 1, S, h) (ε 1, Sh) = 0. Writing 0hh, (3.9.2) becomes Ê (A 1, Sh) = θ 1 + θ s S + θ h = h, s S + 0hh, (3.9.2) becomes t from large the cross-section above equation data (such becomes as the Current Population Survey) tionship between log wages and schooling is linear.19 ooling coefficient p measures the percentage increase in the wage rate al would Therefore, receive Therefore, Omitted the if she OLS the variable had OLS coefficient one bias: more estimate year of can education. can be be asymptotically It therefore biased for all the included included regressors. regressors. This This phenomenon is called the plim ˆβ is called the omitted variable bias. In ctional form issue, particular, see Card (1995). OLS = β + γθ S particular, 29 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

30 measurement error, IQ is related to A as -in-variables Errors-in-variables rse the IQ score may not be an error-free measure of ability. If q is t rement error, IQ is related to A as with IQE(q) can= be0. athe measure interpretation of ability, of q but as measurement it has some error makes it reaso to assume measurement that q is error: uncorrelated with A, S, h, and the wage equation error te Substituting (3.9.3) into (3.9.1), we obtain IQ = φ + A + η (q) = 0. The interpretation of q as measurement error makes it reasonab LW=(cr-y$)+BS+ yiq+sfh+(&- yq). ( me that q where is uncorrelated η is the measurement with A, S, h, error and and the E[η] wage = equation 0. error term tuting (3.9.3) We now Substitute into illustrate (3.9.1), for the IQwe this into obtain example the general result that, if at least one the equation (3.9.1), we have regressors is measured with error, the OLS estimates of all the regression c cients LW=(cr-y$)+BS+ can be asymptotically biased. yiq+sfh+(&- yq). (3.9 To examine the consequence of using the error-ridden measure IQ for A w illustrate sider Then the for corresponding the this OLS example projections least the squares general becomes projection: result that, if at least one of t sors is measured with error, the OLS estimates of all the regression coef can be asymptotically biased. examine the consequence of using the error-ridden measure IQ for A, co he corresponding least squares projection: 30 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

31 ingle-equation GMM 241 = E(q E) (since E(E) = 0, Cov(A, E) = 0) Errors-in-variables (cont.) Now consider E*(E ( E(IQ 1, = S, 0. IQ, E) = h) E[(@ in (3.9.5). + A + q) Since El E(E) (by (3.9.3)) = 0 and (S, h) are ncorrelated with In the E, (1, above S, h) equation, are orthogonal = E(q Ê E) (ε 1, to (since E. S, IQ h) is E(E) = also 0, = orthogonal because 0, Cov(A, (1, to E) S, E = h) because 0) are ~hus, E*(E orthogonal I 1, S, IQ, h) to = ε, o and in = (3.9.5). 0. So the E(IQ biases, E) if = any, E[(@ equal + A -y + q) E*(q El I (by 1, S, (3.9.3)) IQ, h) in (3.9.5). Let (Os, OIQ, &) be the projection ~hus, E*(E coefficients = I 1, E(q S, E) IQ, of h) (since (S, = I o Q, E(E) in (3.9.5). h) = in 0, g*(q Cov(A, 1 1, S, E) I = Q, 0) h). By the formula (2.9.7) from So Chapter the biases, 2, if any, equal -y E*(q I 1, S, IQ, h) in (3.9.5). Let (Os, OIQ, &) = 0. be the projection coefficients of (S, I Q, h) in g*(q 1 1, S, I Q, h). By the formula Let (θ S, θ IQ, θ h ) be the projection coefficients of (S, IQ, h) on hus, E*(E I (2.9.7) 1, Ê S, IQ, from h) = o Var(S) Chapter (1, S, IQ, h) in Cov(S, : (3.9.5). 2, IQ) Cov(S, h') So the biases, if any, equal -y E*(q I 1, S, IQ, h) in (3.9.5). Let (Os, OIQ, &) Var(S) Cov(S, IQ) Cov(S, h') the projection coefficients Cov(h, S) of Cov(h, (S, I Q, IQ) h) in g*(q Var(h) 1 1, S, I Q, h). By the formula.9.7) from Chapter 2, In this expression, Cov(S, Cov(h, q) and S) Cov(h, q) IQ) are zero Var(h) by assumption. Cov(IQ, q), however, is not zero because In Inthis Var(S) thexpression, abovecov(s, expression, Cov(S, IQ) q) Cov(S, Cov(S, and Cov(h, η) h') and q) are Cov(h, zero by η) assumption. are zero butcov(iq, q), however, Cov(IQ, is q) not = zero E(IQ because q) (since E(q) = 0) Cov(h, S) Cov(h, IQ) Var(h) = E[(@ + A + v) q1 (by (3.9.3)) Cov(IQ, q) = E(IQ q) (since E(q) = 0) this expression, Cov(S, = and E(q) Cov(h, + E(A q) v) are zero by assumption. Cov(IQ, q), = E[(@ + + E(v2) A v) q1 (by (3.9.3)) wever, is not zero because = Var(q) (since E(q) = 0 and Cov(A, q) = 0). E(q) + E(A v) + E(v2) 31 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

32 correlated with the measured value. Using the fact that correlated with the measured value. Using the fact that Cov(S, q) = 0, Cov(IQ, q) = Var(q), and Cov(h, q) = 0, Using Cov(S, the q) fact = that 0, Cov(h, Cov(IQ, η) q) = 0, Var(q), Cov(S, η) and = 0Cov(h, and q) = 0, the projection Cov(IQ, coefficients η) = V ar(η), can be therewritten projection as coefficients can be written the projection as coefficients can be rewritten as Errors-in-variables (cont.) where where where Var(S) Var(S) Cov(S, Cov(S, IQ) IQ) Cov(S, Cov(S, h') h') a = second column of Cov(IQ, h') 242 a = second column of Cov(IQ, S) Var(IQ) Cov(IQ, h') Chapter 3 Cov(h, S) Cov(h, IQ) Var(h) Therefore, in the regression of LW on a constant, S, IQ, and h, Therefore, in the regression of LW on a constant, S, h and IQ, plim &LS = p - y. Var(r]).(lst element of a), (3.9.7a) plim POLS = y - y. Var(r]).(2nd element of a). (3.9.7b) Since the second element of a is positive (it is a diagonal element of the inverse of 32 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

33 Table 3.2: Parameter Estimates Estimation equation: LW = a + /?S + ylq + other controls Line no. ~ ~ Coefficient ~ of i ~ ~ Which regressor ~ i Excluded ~ ~ SER R~ predetermined technique S is endogenous'? lq variables 1 OLS none - (13.2) 2 OLS none - (10.7) (2.8) 2SLS (7.0) (2.4) --- MED, KWW, age, age squared, background variables SOURCE: Line 1: equation (Bl) in Griliches (1976. Table 2). Line 2: equation (B3) in Griliches's Table 2. Line 3: line 3 in Griliches's Table 5. Figures in parentheses are t-values rather than standard errors. 33 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

34 2SLS to correct for the bias Instrumental variables: (1, S, h) and age, age squared, KWW, mothers education and some other background variables of the individual. Are KWW and MED valid instruments? Subsequent developments Twins data Fixed effect model The change in Compulsory Schooling Laws (Angrist and Kruger, 1991; Card and Kruger, 1996) 34 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

35 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

36 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

37 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

38 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

39 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

40 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

41 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

42 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

43 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

44 Implementation of IV/GMM in Stata 35 / 35 Guochang Zhao RIEM, SWUFE Single-Equation GMM

Single-Equation GMM (II)

Single-Equation GMM (II) Single-Equation GMM (II) Guochang Zhao RIEM, SWUFE Week 12, Fall 2016 December 1, 2016 1 / 26 Guochang Zhao RIEM, SWUFE Single-Equation GMM Overidentification test If the equation is exactly identified,

More information

Single Equation Linear GMM

Single Equation Linear GMM Single Equation Linear GMM Eric Zivot Winter 2013 Single Equation Linear GMM Consider the linear regression model Engodeneity = z 0 δ 0 + =1 z = 1 vector of explanatory variables δ 0 = 1 vector of unknown

More information

Multiple Equation GMM with Common Coefficients: Panel Data

Multiple Equation GMM with Common Coefficients: Panel Data Multiple Equation GMM with Common Coefficients: Panel Data Eric Zivot Winter 2013 Multi-equation GMM with common coefficients Example (panel wage equation) 69 = + 69 + + 69 + 1 80 = + 80 + + 80 + 2 Note:

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

We begin by thinking about population relationships.

We begin by thinking about population relationships. Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where

More information

Econ 583 Final Exam Fall 2008

Econ 583 Final Exam Fall 2008 Econ 583 Final Exam Fall 2008 Eric Zivot December 11, 2008 Exam is due at 9:00 am in my office on Friday, December 12. 1 Maximum Likelihood Estimation and Asymptotic Theory Let X 1,...,X n be iid random

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Asymptotics Asymptotics Multiple Linear Regression: Assumptions Assumption MLR. (Linearity in parameters) Assumption MLR. (Random Sampling from the population) We have a random

More information

Econometrics II - EXAM Answer each question in separate sheets in three hours

Econometrics II - EXAM Answer each question in separate sheets in three hours Econometrics II - EXAM Answer each question in separate sheets in three hours. Let u and u be jointly Gaussian and independent of z in all the equations. a Investigate the identification of the following

More information

Instrumental Variables, Simultaneous and Systems of Equations

Instrumental Variables, Simultaneous and Systems of Equations Chapter 6 Instrumental Variables, Simultaneous and Systems of Equations 61 Instrumental variables In the linear regression model y i = x iβ + ε i (61) we have been assuming that bf x i and ε i are uncorrelated

More information

The outline for Unit 3

The outline for Unit 3 The outline for Unit 3 Unit 1. Introduction: The regression model. Unit 2. Estimation principles. Unit 3: Hypothesis testing principles. 3.1 Wald test. 3.2 Lagrange Multiplier. 3.3 Likelihood Ratio Test.

More information

The regression model with one fixed regressor cont d

The regression model with one fixed regressor cont d The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8

More information

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over

More information

Instrumental Variables

Instrumental Variables Università di Pavia 2010 Instrumental Variables Eduardo Rossi Exogeneity Exogeneity Assumption: the explanatory variables which form the columns of X are exogenous. It implies that any randomness in the

More information

Econometrics II - EXAM Outline Solutions All questions have 25pts Answer each question in separate sheets

Econometrics II - EXAM Outline Solutions All questions have 25pts Answer each question in separate sheets Econometrics II - EXAM Outline Solutions All questions hae 5pts Answer each question in separate sheets. Consider the two linear simultaneous equations G with two exogeneous ariables K, y γ + y γ + x δ

More information

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panels. Chapter Introduction Autoregressive Model Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this

More information

Instrumental Variables

Instrumental Variables Università di Pavia 2010 Instrumental Variables Eduardo Rossi Exogeneity Exogeneity Assumption: the explanatory variables which form the columns of X are exogenous. It implies that any randomness in the

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Economics 582 Random Effects Estimation

Economics 582 Random Effects Estimation Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0

More information

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour Instrumental Variables and GMM: Estimation and Testing Christopher F Baum, Boston College Mark E. Schaffer, Heriot Watt University Steven Stillman, New Zealand Department of Labour March 2003 Stata Journal,

More information

Greene, Econometric Analysis (6th ed, 2008)

Greene, Econometric Analysis (6th ed, 2008) EC771: Econometrics, Spring 2010 Greene, Econometric Analysis (6th ed, 2008) Chapter 17: Maximum Likelihood Estimation The preferred estimator in a wide variety of econometric settings is that derived

More information

Instrumental Variables and the Problem of Endogeneity

Instrumental Variables and the Problem of Endogeneity Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38 Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] =

More information

Slide Set 14 Inference Basded on the GMM Estimator. Econometrics Master in Economics and Finance (MEF) Università degli Studi di Napoli Federico II

Slide Set 14 Inference Basded on the GMM Estimator. Econometrics Master in Economics and Finance (MEF) Università degli Studi di Napoli Federico II Slide Set 14 Inference Basded on the GMM Estimator Pietro Coretto pcoretto@unisa.it Econometrics Master in Economics and Finance (MEF) Università degli Studi di Napoli Federico II Version: Saturday 9 th

More information

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible

More information

ECON 4160, Autumn term Lecture 1

ECON 4160, Autumn term Lecture 1 ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least

More information

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E.

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E. Forecasting Lecture 3 Structural Breaks Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, 2013 1 / 91 Bruce E. Hansen Organization Detection

More information

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford.

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford. Generalized Method of Moments: I References Chapter 9, R. Davidson and J.G. MacKinnon, Econometric heory and Methods, 2004, Oxford. Chapter 5, B. E. Hansen, Econometrics, 2006. http://www.ssc.wisc.edu/~bhansen/notes/notes.htm

More information

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna November 23, 2013 Outline Introduction

More information

Specification testing in panel data models estimated by fixed effects with instrumental variables

Specification testing in panel data models estimated by fixed effects with instrumental variables Specification testing in panel data models estimated by fixed effects wh instrumental variables Carrie Falls Department of Economics Michigan State Universy Abstract I show that a handful of the regressions

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Introduction to Estimation Methods for Time Series models Lecture 2

Introduction to Estimation Methods for Time Series models Lecture 2 Introduction to Estimation Methods for Time Series models Lecture 2 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 2 SNS Pisa 1 / 21 Estimators:

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

The returns to schooling, ability bias, and regression

The returns to schooling, ability bias, and regression The returns to schooling, ability bias, and regression Jörn-Steffen Pischke LSE October 4, 2016 Pischke (LSE) Griliches 1977 October 4, 2016 1 / 44 Counterfactual outcomes Scholing for individual i is

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental

More information

Generalized Method of Moment

Generalized Method of Moment Generalized Method of Moment CHUNG-MING KUAN Department of Finance & CRETA National Taiwan University June 16, 2010 C.-M. Kuan (Finance & CRETA, NTU Generalized Method of Moment June 16, 2010 1 / 32 Lecture

More information

Ma 3/103: Lecture 24 Linear Regression I: Estimation

Ma 3/103: Lecture 24 Linear Regression I: Estimation Ma 3/103: Lecture 24 Linear Regression I: Estimation March 3, 2017 KC Border Linear Regression I March 3, 2017 1 / 32 Regression analysis Regression analysis Estimate and test E(Y X) = f (X). f is the

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

Economics 620, Lecture 7: Still More, But Last, on the K-Varable Linear Model

Economics 620, Lecture 7: Still More, But Last, on the K-Varable Linear Model Economics 620, Lecture 7: Still More, But Last, on the K-Varable Linear Model Nicholas M. Kiefer Cornell University Professor N. M. Kiefer (Cornell University) Lecture 7: the K-Varable Linear Model IV

More information

Asymptotics for Nonlinear GMM

Asymptotics for Nonlinear GMM Asymptotics for Nonlinear GMM Eric Zivot February 13, 2013 Asymptotic Properties of Nonlinear GMM Under standard regularity conditions (to be discussed later), it can be shown that where ˆθ(Ŵ) θ 0 ³ˆθ(Ŵ)

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 1 Random Vectors Let a 0 and y be n 1 vectors, and let A be an n n matrix. Here, a 0 and A are non-random, whereas y is

More information

Birkbeck Working Papers in Economics & Finance

Birkbeck Working Papers in Economics & Finance ISSN 1745-8587 Birkbeck Working Papers in Economics & Finance Department of Economics, Mathematics and Statistics BWPEF 1809 A Note on Specification Testing in Some Structural Regression Models Walter

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December

More information

Specification Test for Instrumental Variables Regression with Many Instruments

Specification Test for Instrumental Variables Regression with Many Instruments Specification Test for Instrumental Variables Regression with Many Instruments Yoonseok Lee and Ryo Okui April 009 Preliminary; comments are welcome Abstract This paper considers specification testing

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Increasing the Power of Specification Tests. November 18, 2018

Increasing the Power of Specification Tests. November 18, 2018 Increasing the Power of Specification Tests T W J A. H U A MIT November 18, 2018 A. This paper shows how to increase the power of Hausman s (1978) specification test as well as the difference test in a

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

Chapter 8 Heteroskedasticity

Chapter 8 Heteroskedasticity Chapter 8 Walter R. Paczkowski Rutgers University Page 1 Chapter Contents 8.1 The Nature of 8. Detecting 8.3 -Consistent Standard Errors 8.4 Generalized Least Squares: Known Form of Variance 8.5 Generalized

More information

Exam D0M61A Advanced econometrics

Exam D0M61A Advanced econometrics Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is

More information

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations. Exercises for the course of Econometrics Introduction 1. () A researcher is using data for a sample of 30 observations to investigate the relationship between some dependent variable y i and independent

More information

Testing Linear Restrictions: cont.

Testing Linear Restrictions: cont. Testing Linear Restrictions: cont. The F-statistic is closely connected with the R of the regression. In fact, if we are testing q linear restriction, can write the F-stastic as F = (R u R r)=q ( R u)=(n

More information

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

1 The Multiple Regression Model: Freeing Up the Classical Assumptions 1 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions were crucial for many of the derivations of the previous chapters. Derivation of the OLS estimator

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Advanced Econometrics I

Advanced Econometrics I Lecture Notes Autumn 2010 Dr. Getinet Haile, University of Mannheim 1. Introduction Introduction & CLRM, Autumn Term 2010 1 What is econometrics? Econometrics = economic statistics economic theory mathematics

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE 1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +

More information

Economics 583: Econometric Theory I A Primer on Asymptotics

Economics 583: Econometric Theory I A Primer on Asymptotics Economics 583: Econometric Theory I A Primer on Asymptotics Eric Zivot January 14, 2013 The two main concepts in asymptotic theory that we will use are Consistency Asymptotic Normality Intuition consistency:

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

Multiple Regression Analysis: Heteroskedasticity

Multiple Regression Analysis: Heteroskedasticity Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing

More information

Dealing With Endogeneity

Dealing With Endogeneity Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 3 Jakub Mućk Econometrics of Panel Data Meeting # 3 1 / 21 Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2

More information

1 Outline. 1. Motivation. 2. SUR model. 3. Simultaneous equations. 4. Estimation

1 Outline. 1. Motivation. 2. SUR model. 3. Simultaneous equations. 4. Estimation 1 Outline. 1. Motivation 2. SUR model 3. Simultaneous equations 4. Estimation 2 Motivation. In this chapter, we will study simultaneous systems of econometric equations. Systems of simultaneous equations

More information

Advanced Econometrics

Advanced Econometrics Advanced Econometrics Dr. Andrea Beccarini Center for Quantitative Economics Winter 2013/2014 Andrea Beccarini (CQE) Econometrics Winter 2013/2014 1 / 156 General information Aims and prerequisites Objective:

More information

Motivation for multiple regression

Motivation for multiple regression Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS.

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS. 0.1. Panel Data. Suppose we have a panel of data for groups (e.g. people, countries or regions) i =1, 2,..., N over time periods t =1, 2,..., T on a dependent variable y it and a kx1 vector of independent

More information

Maximum Likelihood (ML) Estimation

Maximum Likelihood (ML) Estimation Econometrics 2 Fall 2004 Maximum Likelihood (ML) Estimation Heino Bohn Nielsen 1of32 Outline of the Lecture (1) Introduction. (2) ML estimation defined. (3) ExampleI:Binomialtrials. (4) Example II: Linear

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral, IAE, Barcelona GSE and University of Gothenburg U. of Gothenburg, May 2015 Roadmap Testing for deviations

More information

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE Eric Zivot Winter 013 1 Wald, LR and LM statistics based on generalized method of moments estimation Let 1 be an iid sample

More information

Chapter 1. GMM: Basic Concepts

Chapter 1. GMM: Basic Concepts Chapter 1. GMM: Basic Concepts Contents 1 Motivating Examples 1 1.1 Instrumental variable estimator....................... 1 1.2 Estimating parameters in monetary policy rules.............. 2 1.3 Estimating

More information

Spatial Regression. 9. Specification Tests (1) Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Spatial Regression. 9. Specification Tests (1) Luc Anselin.   Copyright 2017 by Luc Anselin, All Rights Reserved Spatial Regression 9. Specification Tests (1) Luc Anselin http://spatial.uchicago.edu 1 basic concepts types of tests Moran s I classic ML-based tests LM tests 2 Basic Concepts 3 The Logic of Specification

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63 1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:

More information

Nonlinear GMM. Eric Zivot. Winter, 2013

Nonlinear GMM. Eric Zivot. Winter, 2013 Nonlinear GMM Eric Zivot Winter, 2013 Nonlinear GMM estimation occurs when the GMM moment conditions g(w θ) arenonlinearfunctionsofthe model parameters θ The moment conditions g(w θ) may be nonlinear functions

More information

(c) i) In ation (INFL) is regressed on the unemployment rate (UNR):

(c) i) In ation (INFL) is regressed on the unemployment rate (UNR): BRUNEL UNIVERSITY Master of Science Degree examination Test Exam Paper 005-006 EC500: Modelling Financial Decisions and Markets EC5030: Introduction to Quantitative methods Model Answers. COMPULSORY (a)

More information

Statistics and econometrics

Statistics and econometrics 1 / 36 Slides for the course Statistics and econometrics Part 10: Asymptotic hypothesis testing European University Institute Andrea Ichino September 8, 2014 2 / 36 Outline Why do we need large sample

More information

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Practical Econometrics. for. Finance and Economics. (Econometrics 2) Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics

More information

Economics 241B Estimation with Instruments

Economics 241B Estimation with Instruments Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.

More information

ECON 4160, Spring term 2015 Lecture 7

ECON 4160, Spring term 2015 Lecture 7 ECON 4160, Spring term 2015 Lecture 7 Identification and estimation of SEMs (Part 1) Ragnar Nymoen Department of Economics 8 Oct 2015 1 / 55 HN Ch 15 References to Davidson and MacKinnon, Ch 8.1-8.5 Ch

More information

Heteroskedasticity. Part VII. Heteroskedasticity

Heteroskedasticity. Part VII. Heteroskedasticity Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least

More information

Chapter 2. Dynamic panel data models

Chapter 2. Dynamic panel data models Chapter 2. Dynamic panel data models School of Economics and Management - University of Geneva Christophe Hurlin, Université of Orléans University of Orléans April 2018 C. Hurlin (University of Orléans)

More information

ECON 5350 Class Notes Functional Form and Structural Change

ECON 5350 Class Notes Functional Form and Structural Change ECON 5350 Class Notes Functional Form and Structural Change 1 Introduction Although OLS is considered a linear estimator, it does not mean that the relationship between Y and X needs to be linear. In this

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

A Course on Advanced Econometrics

A Course on Advanced Econometrics A Course on Advanced Econometrics Yongmiao Hong The Ernest S. Liu Professor of Economics & International Studies Cornell University Course Introduction: Modern economies are full of uncertainties and risk.

More information

Short T Panels - Review

Short T Panels - Review Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of

More information

Fixed Effects Models for Panel Data. December 1, 2014

Fixed Effects Models for Panel Data. December 1, 2014 Fixed Effects Models for Panel Data December 1, 2014 Notation Use the same setup as before, with the linear model Y it = X it β + c i + ɛ it (1) where X it is a 1 K + 1 vector of independent variables.

More information

Graduate Econometrics Lecture 4: Heteroskedasticity

Graduate Econometrics Lecture 4: Heteroskedasticity Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model

More information

Greene, Econometric Analysis (5th ed, 2003)

Greene, Econometric Analysis (5th ed, 2003) EC771: Econometrics, Spring 2004 Greene, Econometric Analysis (5th ed, 2003) Chapters 4 5: Properties of LS and IV estimators We now consider the least squares estimator from the statistical viewpoint,

More information

Econometrics Homework 4 Solutions

Econometrics Homework 4 Solutions Econometrics Homework 4 Solutions Question 1 (a) General sources of problem: measurement error in regressors, omitted variables that are correlated to the regressors, and simultaneous equation (reverse

More information

A GMM approach for dealing with missing data on regressors and instruments

A GMM approach for dealing with missing data on regressors and instruments A GMM approach for dealing with missing data on regressors and instruments Jason Abrevaya Department of Economics University of Texas Stephen G. Donald Department of Economics University of Texas This

More information

(θ θ ), θ θ = 2 L(θ ) θ θ θ θ θ (θ )= H θθ (θ ) 1 d θ (θ )

(θ θ ), θ θ = 2 L(θ ) θ θ θ θ θ (θ )= H θθ (θ ) 1 d θ (θ ) Setting RHS to be zero, 0= (θ )+ 2 L(θ ) (θ θ ), θ θ = 2 L(θ ) 1 (θ )= H θθ (θ ) 1 d θ (θ ) O =0 θ 1 θ 3 θ 2 θ Figure 1: The Newton-Raphson Algorithm where H is the Hessian matrix, d θ is the derivative

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = 0 + 1 x 1 + x +... k x k + u 6. Heteroskedasticity What is Heteroskedasticity?! Recall the assumption of homoskedasticity implied that conditional on the explanatory variables,

More information

Multivariate Regression

Multivariate Regression Multivariate Regression The so-called supervised learning problem is the following: we want to approximate the random variable Y with an appropriate function of the random variables X 1,..., X p with the

More information

Instrumental Variables Estimation in Stata

Instrumental Variables Estimation in Stata Christopher F Baum 1 Faculty Micro Resource Center Boston College March 2007 1 Thanks to Austin Nichols for the use of his material on weak instruments and Mark Schaffer for helpful comments. The standard

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information