Maria Elena Bontempi Roberto Golinelli this version: 5 September 2007

Size: px
Start display at page:

Download "Maria Elena Bontempi Roberto Golinelli this version: 5 September 2007"

Transcription

1 INSTRUMENTAL VARIABLES (IV) ESTIMATION A Maria Elena Bontempi e.bontempi@economia.unife.it Roberto Golinelli roberto.golinelli@unibo.it this version: 5 September The instrumental variables approach This note refers to the estimation methods to be used when the specification assumption E( ε X ) = 0 (i.e. weak exogeneity of the model regressors) fails. If the error term of the classical regression model is correlated with the explanatory variables we have biased and inconsistent OLS/GLS parameter estimates. In such circumstances, explanatory variables are correlated with the error term. The explanatory variables are correlated with the error term in three circumstances: 1. Measurement errors in the explanatory variables. If the explanatory x i is measured with error, the x i * = x i + v i, observations are available (where v i is a random variable measurement error ), instead of the genuine measure x i. In the model y i = β x i * + ε i * (with ε i * = ε i βv i ), x i * is stochastic (because v i is stochastic) and correlated with the model s error term ε i * (they both embody v i ). Therefore, OLS/GLS estimator of β is biased and inconsistent. This problem does not emerge if the measurement error characterises only the dependent variable of the model. 2. Endogeneity, i.e. economic simultaneity. The explanatory x i is generated inside the same economic system also generating y i. In this circumstance, in the equation y i = b x i + ε i, x i is correlated by definition with ε i error term. Assume that price p i and quantity q i are simultaneously determined in the i th market by the interplay of supply and demand. In the supply structural equation q i = b p i + ε i, the explanatory p i is stochastic and correlated with the ε i error term, i.e. endogenous (see Section 2.). Therefore, the OLS estimator of b is biased and inconsistent. Another classical example is that of the consumption function (limited-information model) nested in the IS-LM (full-information) model. 3. Autocorrelated errors in the dynamic model. In this case, the OLS estimator is inconsistent because the explanatory lagged dependent variable is correlated with the (autocorrelated) error term. However, if the model is static, errors autocorrelation simply causes OLS inefficiency (such as the heteroskedasticity case discussed above). In the static model with both autocorrelated and heteroskedastic errors, the GLS estimator is BLUE (i.e. the estimator with the best statistical properties) because it accounts for this through a non-scalar variancecovariance matrix of errors. If we face one of the previous three circumstances, in which the OLS estimator is biased and inconsistent, we can at least recover consistency (but neither unbiasedness nor efficiency) using the Instrumental Variables (IV) estimator, sometimes labelled as Two Stages Least Squares (2SLS). Very preliminary. Comments welcome. 1

2 A relevant IV-2SLS (henceforth only IV) feature is that it needs further information, in addition to that specified by the classical linear regression model. In particular, the IV estimator requires the additional w i variable, i.e. a valid instrument. To be valid, the instrument must be: (i) correlated with x i (instrument s relevance condition); (ii) unrelated with the model s error (instrument s exogeneity condition). Remember that the necessary condition for the parameters to be identified is that the number of instruments (exogenous) be equal to (or greater than) the number of explanatory endogenous variables (to be instrumented). If valid instruments are available, in matrix form the IV estimator is defined as: βˆ IV = ( W X) W y, i.e. the ratio of the sample covariance between the dependent variable y i and the instrument w i to the sample covariance between the explanatory x i and the instrument w i. The instrument s relevance condition sub (i) prevents the denominator of the ratio from being zero. If the explanatory x i * is measured with error (i.e. affected by the measurement error v i, see point 1 above), the instrument w i should be correlated with x i and unrelated with both the model s error (ε i ), and the measurement error (v i ). w i is chosen on the basis of both the economic nature and the statistical measurement of the variables of interest. For an equation with one endogenous explanatory variable x i (see point 2 above), the w i instrument is an exogenous variable that interacts with the solution of the whole model (in which the equation of interest is nested), without that such solution feeds-back to on w i. The specification of the full model often suggests the instrument to be used (see as an example the price-quantity model described in Section 2). The point 3 above can only arise over time, i.e. data must be time-series or panels. With timeseries, the improvement of the model dynamics is a better way than IV to cope with lagged explanatory variables related with the error. Contrary to what happens with time-series, dynamic panel models imply inconsistent OLS estimator even if errors are white noise (not autocorrelated). The latter point is tackled in the dynamic panel models lecture. 2. The limited information approach: the supply equation estimate In this first example we tackle the classical issue of the estimation of the parameters in the (full) supply-demand interaction model.. use kmenta, clear. descr Contains data obs: 20 vars: 4 5 Nov :22 size: 340 (99.9% of memory free) - storage display value variable name type format label variable label - codice byte %8.0g q float %9.0g Q p float %9.0g P r float %9.0g R - 2

3 20 geographical zones (or markets) q per capita consumption of food p relative price of food with respect to the overall consumption price index r disposable income at constant prices Full model in structural form (SF): q s = a 0 + a 1 p + u s q d = a 2 + a 3 p + a 4 r + u d q s q d In the model above only the supply equation is identified. However, the supply equation parameters OLS estimates are biased and inconsistent because the OLS assumption of price exogeneity is not valid. OLS estimate. reg q p Source SS df MS Number of obs = F( 1, 18) = 0.17 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = p _cons predict ress, res. sktest ress Skewness/Kurtosis tests for Normality joint Variable Pr(Skewness) Pr(Kurtosis) adj chi2(2) Prob>chi ress hettest Cook-Weisberg test for heteroskedasticity using fitted values of q Ho: Constant variance chi2(1) = 0.05 Prob > chi2 = ovtest Ramsey RESET test using powers of the fitted values of q Ho: model has no omitted variables F(3, 15) = 0.97 Prob > F =

4 The usual diagnostic tests do not suggest any deviation from the errors specification hypotheses of the classical regression model. However, they are not able to assess for the possible correlation between model s explanatory variables and errors. 2SLS estimation 1) FIRST STAGE: OLS estimation of the unrestricted reduced form (URF). Explanatory endogenous variables of the structural equation are estimated as a function of all the exogenous variables (both included and excluded in the structural equation). The identifying condition requires that the number of excluded exogenous variables (instruments) is the number of explanatory endogenous (if > we have an over-identified model; if = we have an exactly identified model). This stage is based on the asymptotic efficiency condition COV(X, W) 0, that in the present case becomes: COV(p, r) 0.. reg p r Source SS df MS Number of obs = F( 1, 18) = 8.51 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = p Coef. Std. Err. t P> t [95% Conf. Interval] r _cons ) Store both fitted values and residuals.. predict pfit (option xb assumed; fitted values). predict resp, res Remember that, by definition, OLS decompose the endogenous explanatory p in two parts (p = pfit + resp): the first is correlated with the exogenous instrument, the second is correlated with the supply-demand shocks (hence correlated with the disturbance term of the supply equation). With OLS, the two parts (components of p) are orthogonal by definition:. corr pfit resp (obs=20) pfit resp pfit resp ) SECOND STAGE: OLS estimate of the structural equation in which endogenous explanatory variables are replaced by the fitted values at the first stage (pfit in place of p). Given that pfit is the part of the endogenous explanatory variable not correlated with the error term of the structural equation of interest, this stage is based on the necessary condition for IV estimator being consistent, COV(ε, W) = 0, that in our case becomes COV(ε, r) = 0. 4

5 . reg q pfit Source SS df MS Number of obs = F( 1, 18) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = pfit _cons IV estimate. ivreg q (p=r) Instrumental variables (2SLS) regression Source SS df MS Number of obs = F( 1, 18) = 4.14 Model Prob > F = Residual R-squared = Adj R-squared =. Total Root MSE = p _cons Instrumented: p Instruments: r Note that, contrary to OLS, in the IV case R 2 may fall outside the 0-1 range because the orthogonality conditions of the residuals refer to the instruments and not to model s explanatory variables. For this reason, Stata does not report it. Note that standard errors of the second stage of 2SLS estimates are different from standard errors of IV estimates: the true explanatory variable is p and not pfit; the right standard errors are those of IV. See the following summary and comparison. Given the equation Y = Xβ + ε, we have what follows. OLS: min( Y Xβ ) ( Y Xβ ) ˆ βols = ( X X ) X Y ˆ ˆ 1 ) 1 with ˆ ˆ ε ˆ ε ( Y XβOLS ) ( Y XβOLS Var( β OLS ) = ( X X ) = ( X X ). N K N K IV: Y X PW Y X ˆ min( β ) ( β ) β IV = ( X PW X ) X PW Y where P W = W ( W W ) W is the matrix projecting Y upon the sub-space spanned by the columns of X that lies in the space spanned 5

6 by the columns of W. Note that, if W has the same dimension (N K) of X, it follows βˆ IV = ( X PW X ) X PW Y = ( X W ( W W ) W X ) X W ( W W ) W Y =. = ( W X ) W W ( X W ) X W ( W W ) W Y = ( W X ) W Y ˆ ˆ ) 1 The estimated variance of the estimator is ˆ ( Y Xβ IV ) ( Y Xβ IV Var( β IV ) = ( X PW X ), where N asymptotically it makes no difference whether we divide for N or N-K. 2SLS: min( Y PW Xβ ) ( Y PW Xβ ) ˆ β 2 SLS = ( X PW PW X ) X PW PW Y = ( X PW X ) X PW Y because P W is idempotent (P W = P W P W ). The estimated variance of the estimator is ˆ ˆ ˆ ( Y PW Xβ 2SLS ) ( Y PW Xβ 2SLS ) Var( β 2 SLS ) = ( X PW X ), that coincides with the IV estimated N K variance only if X=P W X Testing for exogeneity of the model s regressors The command hausman performs Hausman's (1978) specification test, also known as the Durbin- Wu-Hausman test 2, based on the following comparison: OLS IV H 0 : E(ε X)=0 Consistent and Efficient Consistent but not efficient H 1 : E(ε X) 0 Not consistent Consistent The idea is testing whether the difference ˆ δ = ˆ β ˆ IV βols is statistically significant: the rejection of the null H : plim ˆ 0 δ = 0 implies that the event E(ε X)=0 can not be accepted and that we must estimate our model by using IV, instead of OLS. Note that the Hausman test can be generalised: for example, we will see that it is applied in choosing between fixed effects and random effects in panel data model. The steps in Stata 8/9 are the following (note that the procedure to perform Hausman test has been totally revised if compared to the one of Stata 7). (1) Obtain an estimator that is consistent whether or not H 0 is true. (2) Store the estimation results under a name-consistent using the command estimates store. (3) Obtain an estimator that is efficient (and consistent) under H 0, but inconsistent otherwise. (4) If you want, store the estimation results under a name-efficient using again the command estimates store. (5) Use the command hausman name-consistent name-efficient to perform the test. If you do not want to run step (4), you may use a period (.) to refer to the last estimation results, even if these were not already stored. 1 2SLS is a special case of generated regressors, that provide consistent estimates of the parameters, but not consistent estimates of the covariance matrix of the parameter estimates. See Pagan A. R. (1984) Econometric issues in the analysis of regressions with generated regressors, International Economic Review, 25, and Pagan A. R. (1986) Two stage and related estimators and their applications, Review of Economic Studies, 53, Following the works of Durbin J. (1954) Errors in Variables, Review of International Statistical Institute, 23-32, Wu De-Min (1973) Alternative Tests of Independence between Stochastic Regressors and Disturbances, Econometrica, , and Hausman J.A. (1978) Specification Tests in Econometrics, Econometrica, 46,

7 The order of computing the two estimators may be reversed, but you have to be careful though to specify to hausman the models in the order always consistent first and efficient under H 0 second (see below). In fact, the test-statistic is ˆ δ Var [ ( ˆ) δ ] ˆ δ, where Var (δˆ ) is positive definite (>0 in the bivariate case), i.e. Var( ˆ β ) ( ˆ IV > Var βols ). Under the null, the test-statistic is distributed as a chi-squared with m degrees of freedom, where m is the number of regressors specified as endogenous in the original instrumental variable regression. The important result of Hausman is that of demonstrating that, under the null, Var( ˆ) δ = Var( ˆ β ) ( ˆ IV Var βols ) and not Var( ˆ) δ = Var( ˆ β ) ( ˆ ) 2 ( ˆ, ˆ IV + Var βols Cov β IV βols ). The proof strategy is to show that βˆ OLS can not be efficient if the result the variance of differences equals the difference of variances does not hold (see Maddala G. S., 1992, Introduction to Econometrics, Maxwell Macmillan, second edition, pp ). The order in which you specify the regressors in each model does not matter, but it is your responsibility to assure that the estimators and models are comparable, and satisfy the theoretical conditions (1) and (3) above. The command hausman may be used in any context, but you have to pay attention to the options (for example, constant specifies that the estimated intercept(s) are to be included in the model comparison, while by default they are excluded). The option sigmamore can only be specified when both estimators save e(sigma) or e(rmse) 3. It allows you to specify that the (co)variance matrices used in the test be based on a common estimate of disturbance variance (σ 2 ), namely the variance from the efficient estimator (obtained under H 0 ). This option provides a proper estimate of the contrast variance for so-called tests of exogeneity and over-identification in instrumental variables regression. (see Baltagi, 1998, Econometrics, p. 291). It is recommended when comparing fixed-effects and random-effects linear regressions because they are much less likely to produce a nonpositive-definite differenced covariance matrix (although the tests are asymptotically equivalent whether or not one of the options is specified). Save the results of the previous IV estimate, and then compare them with the OLS results:. est store IV. reg q p Source SS df MS Number of obs = F( 1, 18) = 0.17 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = p _cons hausman IV., sigmamore 3 See also the option sigmaless. 7

8 ---- Coefficients ---- (b) (B) (b-b) sqrt(diag(v_b-v_b)) IV. Difference S.E. p b = consistent under Ho and Ha; obtained from ivreg B = inconsistent under Ha, efficient under Ho; obtained from regress Test: Ho: difference in coefficients not systematic chi2(1) = (b-b)'[(v_b-v_b)^(-1)](b-b) = Prob>chi2 = Note that, by default, the estimated intercept(s) are excluded in the model comparison. This is appropriate for models in which the constant does not have a common interpretation across the two models; for example, when comparing fixed effects versus random effects in panel data. Note that V_b-V_B = Var( βˆ IV ) may be negative, in such cases the Hausman test cannot be performed. However, be careful to list, after the hausman command, the names of the stored estimates in the right order: first the IV estimator (consistent under both the null and the alternative), and then the OLS (efficient under the null but inconsistent under the alternative) )-Var( βˆ OLS C. F. Baum, M. E. Schaffer and S. Stillman have written a very useful procedure, ivendog, that reports two tests. The "Durbin-Wu-Hausman" (DWH) test is equivalent to the standard "Hausman test" obtained by using hausman, sigmamore. The other test-statistic is the "Wu-Hausman" T2 statistic of Wu (1973); Hausman demonstrates that it could be calculated straightforwardly through the use of auxiliary regressions. The test statistic, under the null, is distributed like a F(m, N-K), where m is the number of regressors specified as endogenous in the original instrumental variables regression. Note that, differently from hausman, ivendog is not valid with robust covariance estimators.. qui ivreg q (p=r). ivendog Tests of endogeneity of: p H0: Regressor is exogenous Wu-Hausman F test: F(1,17) P-value = Durbin-Wu-Hausman chi-sq test: Chi-sq(1) P-value = You can also make by-hand Hausman test. The regression test is based on the following steps: 1) estimate the reduced form (reg p r); 2) save the residuals (predict resp, res); 3) estimate the augmented regression (reg q p resp) that is the structural equation plus the residual from the reduced form; 4) test H 0 : parameter associated to resp=0 by using a t statistic (or an F test in case we are testing for endogeneity of multiple explanatory variables; of course, for each of them, we must obtain the reduced form residuals as in 1) and 2) steps). 8

9 . reg q p resp Source SS df MS Number of obs = F( 2, 17) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = p resp _cons di (_b[resp]/_se[resp])^ testparm resp ( 1) resp = 0 F( 1, 17) = Prob > F = If we reject H 0, we conclude that p is endogenous: resp is the part of the p endogenous explanatory variable filtered from the effect of all the exogenous variables (remember again the OLS decomposition of the first-stage regression). Thus, if resp parameter is statistically significant, resp is significantly correlated with q (i.e. with the error term of the structural equation). Despite the OLS regression, the estimate of p parameter above is equal to the structural estimate previously obtained with IV because resp in this OLS regression clears up the endogeneity of p. Of course, in this test regression the standard errors of the parameters are not appropriate. Note also that the squared t associated to resp (test H 0 : parameter associated to resp=0 by using a F statistic) corresponds to the Wu-Hausman F test reported by the ivendog procedure Testing for residuals heteroskedasticity M. E. Schaffer has also written a procedure, ivhettest, to perform various tests of heteroskedasticity for both OLS and IV estimation. After OLS estimates, ivhettest performs Breusch-Pagan/Godfrey/Cook-Weisberg test (the same obtained by hettest) and a general version of White/Koenker test (the same obtained by hettest, iid). The same results of the whitetst procedure is obtained by appropriately selecting the indicator variables. ivhettest command has many options 4 : 1) the default uses the regressors after OLS estimates, and the full set of instruments (exogenous variables, excluding the constant) after IV estimates; 2) ivlev corresponds to the default; 3) ivsq uses the full set of instruments (excluding the constant) and their squares; 4 Remember that the trade-off in the choice of indicator variables in all these tests is that a smaller set of indicator variables will preserve degrees of freedom, at the cost of being unable to detect heteroskedasticity in certain directions. 9

10 4) ivcp uses the full set of instruments (excluding the constant), their squares, and cross-products (save case as whitetst); 5) fitsq uses the fitted value of the dependent variable and its square; 6) varlist specifies a user-defined set of indicator variables. 7) fitlev uses the "fitted value" of the dependent variable; 5 For example, after an OLS estimate, we can obtain:. reg q p Source SS df MS Number of obs = F( 1, 18) = 0.17 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = p _cons hettest Breusch-Pagan / Cook-Weisberg test for heteroskedasticity Ho: Constant variance Variables: fitted values of q chi2(1) = 0.05 Prob > chi2 = hettest, iid Breusch-Pagan / Cook-Weisberg test for heteroskedasticity Ho: Constant variance Variables: fitted values of q chi2(1) = 0.06 Prob > chi2 = whitetst White's general test statistic : Chi-sq( 2) P-value =.963. ivhettest, all OLS heteroskedasticity test(s) using levels of IVs only Ho: Disturbance is homoskedastic White/Koenker nr2 test statistic : Chi-sq(1) P-value = Breusch-Pagan/Godfrey/Cook-Weisberg : Chi-sq(1) P-value = ivhettest, all ivcp OLS heteroskedasticity test(s) using levels and cross products of all IVs Ho: Disturbance is homoskedastic 5 In the IV regression case, fitted values are not calculated directly from the regressors and the estimated coefficients. Rather, the endogenous regressors are first replaced with their first stage fitted values, i.e., with fitted values from regression of the endogenous regressors on the full set of instruments. The exogenous regressors, the fitted values of the endogenous regressors, and the estimated coefficients from the main regression are then used to calculate the fitted values of the dependent variable. This because the indicator variables must be functions of exogenous variables (instruments) only. 10

11 White/Koenker nr2 test statistic : Chi-sq(2) P-value = Breusch-Pagan/Godfrey/Cook-Weisberg : Chi-sq(2) P-value = where the option all is used to report all applicable statistics. After IV estimates, ivhettest, all also adds two versions of Pagan and Hall's (1983) 6 teststatistic: the default, the general test-statistic, and the test-statistic assuming that the error term in the IV regression is normally distributed. These tests will be valid tests for heteroskedasticity in an IV regression only if heteroskedasticity is present in that equation and nowhere else in the system. The White/Koenker nr-squared test-statistic and the standard Breusch-Pagan/Godfrey/Cook- Weisberg test-statistic (assuming that the error term is normally distributed) will be distributed as chi-squared under the null only if the system covariance matrix is homoskedastic.. qui ivreg q (p=r). ivhettest, all IV heteroskedasticity test(s) using levels of IVs only Ho: Disturbance is homoskedastic Pagan-Hall general test statistic : Chi-sq(1) P-value = Pagan-Hall test w/assumed normality : Chi-sq(1) P-value = White/Koenker nr2 test statistic : Chi-sq(1) P-value = Breusch-Pagan/Godfrey/Cook-Weisberg : Chi-sq(1) P-value = Testing for the over-identifying restrictions When the number of excluded exogenous variables is higher than the number of explanatory endogenous variables we can test the overidentifying restrictions. The steps are the following. 1) Estimate the structural model by IV and obtain the residuals (ressf). 2) Estimate the auxiliary regression in which the residuals above are regressed on all the exogenous variables, and compute the test-statistic NR 2. 3) Under the null that all the IV instruments are unrelated with the error term of the structural equation, the test-statistic N R 2 χ 2 q, where q is the number of the overidentifying restrictions, i.e. the difference between the number of excluded exogenous variables (instruments) and the number of endogenous explanatory variables. If N R 2 exceeds the critical value of the chi-squared distribution, we reject H 0. In such case, at least some of the IV instruments are not valid (i.e., unrelated with the error term and correctly excluded from the structural model), given that the high value of the R 2 indicates that they should have been included as explanatory variables in the structural model. C. F. Baum, V. Wiggins, S. Stillman and M. E. Schaffer have written a procedure, overid, that computes the versions of Sargan's (1958) and Basmann's (1960) tests for overidentifying restrictions. 7 6 Pagan, A. R. and D. Hall. (1983), Diagnostic tests as residual analysis, Econometric Reviews, 2(2), Sargan, J.D. (1958) The Estimation of Economic Relationships Using Instrumental Variables, Econometrica, 26, Basmann, R.L. (1960) On Finite Sample Distributions of Generalized Classical Linear Identifiability Test Statistics, Journal of the American Statistical Association, 55, 292, The test-statistic is ( Y X ˆ β IV ) PW ( Y X ˆ β IV ) ( Y X ˆ β ˆ IV ) PW ( Y Xβ IV ) in the Sargan s case and, where V is the ( Y X ˆ β IV ) ( Y X ˆ ˆ ˆ β IV ) ( Y Xβ IV ) ( I PW )( Y Xβ IV ) N ( N V ) number of IVs, in the Basmann s case. The two denominators can be interpreted as two different estimates of the error variance of the estimated equation, both of which are consistent. Sargan s statistics can be calculated via steps 1)-3) 11

12 This procedure will not produce a result if either the robust or cluster options are employed in the preceding IV. The test for overidentifying restrictions that is robust to heteroskedasticity in the errors is Hansen's J statistic 8 that can be obtained via ivreg2 or xtabond2, two procedures that will be deeply shown in the lectures on panel data. Here, we just illustrate main points with an example.. g r2=r^2. ivreg q (p=r r2) Instrumental variables (2SLS) regression Source SS df MS Number of obs = F( 1, 18) = 3.67 Model Prob > F = Residual R-squared = Adj R-squared =. Total Root MSE = p _cons Instrumented: p Instruments: r r2. overid Tests of overidentifying restrictions: Sargan N*R-sq test Chi-sq(1) P-value = Basmann test Chi-sq(1) P-value = ivreg2 q (p=r r2), small 9 Instrumental variables (2SLS) regression Number of obs = 20 F( 1, 18) = 3.67 Prob > F = Total (centered) SS = Centered R2 = Total (uncentered) SS = Uncentered R2 = Residual SS = Root MSE = p _cons previously illustrated (see R. Davidson and J. G. MacKinnon (1993), Estimation and Inference in Econometrics, Oxford University Press, ). 8 Hansen L. P. (1982) Large Sample Properties of Generalised Method of Moments Estimators, Econometrica, 50, Davidson-MacKinnon do not call it the J statistic, because of the name of their non-nested models test (R. Davidson and J. G. MacKinnon (1981), Several tests for model specification in the presence of alternative hypotheses, Econometrica, 49, ). 9 The option small performs small sample corrections: Root MSE is computed as sqrt(rss/(n-k)) [while sqrt(rss/n) is the default]; if robust is chosen, the finite sample adjustment to the robust variance-covariance matrix is N/(N-K) [while 1 is the default]; if cluster is chosen, the finite sample adjustment is (N-1)/(N-K) N c /(N c -1), where N c is the number of clusters [while 1 is the default]. Note that the Sargan statistic uses error variance = RSS/N, i.e., there is no small sample correction. 12

13 Anderson canon. corr. LR statistic (identification/iv relevance test): Chi-sq(2) P-val = Sargan statistic (overidentification test of all instruments): Chi-sq(1) P-val = Instrumented: p Excluded instruments: r r2. ivreg q (p=r r2), robust Instrumental variables (2SLS) regression Number of obs = 20 F( 1, 18) = 6.89 Prob > F = R-squared =. Root MSE = Robust p _cons Instrumented: p Instruments: r r2. overid Test not valid with robust covariance matrix: use ivreg2 r(198);. ivreg2 q (p=r r2), small robust IV (2SLS) regression with robust standard errors Number of obs = 20 F( 1, 18) = 6.89 Prob > F = Total (centered) SS = Centered R2 = Total (uncentered) SS = Uncentered R2 = Residual SS = Root MSE = Robust p _cons Anderson canon. corr. LR statistic (identification/iv relevance test): Chi-sq(2) P-val = Hansen J statistic (overidentification test of all instruments): Chi-sq(1) P-val = Instrumented: p Excluded instruments: r r2 3. Hidden endogeneity With this example, we show how the estimation results (and the related quantitative findings) may be misleading when we refer to only one estimation approach, without comparing the robustness of what discovered in the light of only one regression. 13

14 . use orange, clear. descr Contains data obs: 45 vars: 7 5 Nov :25 size: 1,260 (96.3% of memory free) - storage display value variable name type format label variable label - anno int %8.0g rev float %9.0g REV inc int %8.0g INC qty float %9.0g QTY curadv float %9.0g CURADV aveadv float %9.0g AVEADV cpi float %9.0g CPI - Nerlove-Waugh (1961) discovered a number of problems of a previous paper results that were supportive of the link between oranges sales and their advertising expenses in the USA. The databank includes: rev orange sales at constant prices, in dollars; inc consumers income, in dollars; qty orange sales in quantity; curadv current advertising expenses at constant prices; aveadv average advertising expenditure over the previous 10 years at constant prices; cpi consumer price index; Transformation in logarithms:. g lrev=ln(rev). g linc=ln(inc). g lqty=ln(qty). g lcuradv=ln(curadv). g laveadv=ln(aveadv). g lcpi=ln(cpi) NOTE: rev = qty pri, where pri is the oranges price. Therefore, we can obtain orange price as:. g lpri=lrev-lqty OLS estimate. reg lrev lqty linc lcuradv laveadv Source SS df MS Number of obs = F( 4, 40) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = lrev Coef. Std. Err. t P> t [95% Conf. Interval] lqty linc lcuradv laveadv _cons

15 . est store OLS The demand equation we just estimated, might hide an endogeneity problem of lqty regressor, while this problem could not affect any other explanatory variable of the model. In fact, the dependent variable is by definition: lrev = lqty + lpri. If we solve and normalise the previous relationship for the quantity of orange sales lqty we have: lqty + lpri lqty = 0.92 linc lqty = -lpri linc +... lqty = (-1/1.38) lpri + (0.92/1.38) linc +... The signs of the parameter estimates of the demand equation are those predicted by the economic theory: the price effect is negative, the consumers income effect is positive. In this context, advertising has a significantly positive effect on the orange real demand. However, if we instrument potentially endogenous orange prices (hidden in lqty) with the USA overall consumer price index, lcpi, estimation results change dramatically, and the Hausman test clearly rejects the null of lqty weakly exogenous. IV estimate. ivreg lrev (lqty=lcpi) linc lcuradv laveadv Instrumental variables (2SLS) regression Source SS df MS Number of obs = F( 4, 40) = 7.69 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = lrev Coef. Std. Err. t P> t [95% Conf. Interval] lqty linc lcuradv laveadv _cons Instrumented: lqty Instruments: linc lcuradv laveadv lcpi. est store IV. ivendog Tests of endogeneity of: lqty H0: Regressor is exogenous Wu-Hausman F test: F(1,39) P-value = Durbin-Wu-Hausman chi-sq test: Chi-sq(1) P-value = hausman IV OLS, sigmamore Note: the rank of the differenced variance matrix (1) does not equal the number of coefficients being tested (4); be sure this is what you expect, or there may be problems computing the test. Examine the output of your estimators for anything unexpected and possibly consider scaling your variables so that the coefficients are on a similar scale Coefficients ---- (b) (B) (b-b) sqrt(diag(v_b-v_b)) IV OLS Difference S.E. 15

16 lqty linc lcuradv laveadv b = consistent under Ho and Ha; obtained from ivreg B = inconsistent under Ha, efficient under Ho; obtained from regress Test: Ho: difference in coefficients not systematic chi2(1) = (b-b)'[(v_b-v_b)^(-1)](b-b) = 9.09 Prob>chi2 = (V_b-V_B is not positive definite) Note how, without the sigmamore option, the Hausman s test is quite misleading:. hausman IV OLS ---- Coefficients ---- (b) (B) (b-b) sqrt(diag(v_b-v_b)) IV OLS Difference S.E. lqty linc lcuradv laveadv b = consistent under Ho and Ha; obtained from ivreg B = inconsistent under Ha, efficient under Ho; obtained from regress Test: Ho: difference in coefficients not systematic chi2(4) = (b-b)'[(v_b-v_b)^(-1)](b-b) = 2.48 Prob>chi2 = In the latter case, the following option prevents degrees of freedom problems:. hausman IV OLS, df(1) Coefficients ---- (b) (B) (b-b) sqrt(diag(v_b-v_b)) IV OLS Difference S.E. lqty linc lcuradv laveadv b = consistent under Ho and Ha; obtained from ivreg B = inconsistent under Ha, efficient under Ho; obtained from regress Test: Ho: difference in coefficients not systematic chi2(1) = (b-b)'[(v_b-v_b)^(-1)](b-b) = 2.48 Prob>chi2 = The df(#) option specifies the degrees of freedom to be used in the Hausman s test. The default is the matrix rank of the variance of the difference between the coefficients of the two estimators. 16

17 With IV for endogenous orange prices, the demand relationship has profoundly change in meaning: the advertising-supporting OLS results where based on big statistical problems. However, note that the IV results heavily rely on the specific instrument we used and on the exogeneity hypothesis of all the other explanatory variables. Summarising the findings of this lecture, we note the following relevant point. The choice between IV-2SLS and OLS-GLS must be based on the reliability of the exogeneity assumption for all the model s explanatory variables, and not on the estimation outcome; in doing so, the Hausman test may be of some help. The first example is constructive: it shows that the inappropriate use of OLS leads to the rejection of the existence of a demand equation, while IV supports such relationship. Viceversa, the second case is quite destructive since it shows that OLS would give quite reasonable estimation results that, however, are completely reversed by the use of IV. In other terms, the reasonableness (failure) of the estimation results cannot per se give support (cast doubts) to the estimation method used. Statistical testing (Hausman) and economic reasoning are the right ways to make the estimator choice. 17

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour Instrumental Variables and GMM: Estimation and Testing Christopher F Baum, Boston College Mark E. Schaffer, Heriot Watt University Steven Stillman, New Zealand Department of Labour March 2003 Stata Journal,

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

IV and IV-GMM. Christopher F Baum. EC 823: Applied Econometrics. Boston College, Spring 2014

IV and IV-GMM. Christopher F Baum. EC 823: Applied Econometrics. Boston College, Spring 2014 IV and IV-GMM Christopher F Baum EC 823: Applied Econometrics Boston College, Spring 2014 Christopher F Baum (BC / DIW) IV and IV-GMM Boston College, Spring 2014 1 / 1 Instrumental variables estimators

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

Instrumental Variables, Simultaneous and Systems of Equations

Instrumental Variables, Simultaneous and Systems of Equations Chapter 6 Instrumental Variables, Simultaneous and Systems of Equations 61 Instrumental variables In the linear regression model y i = x iβ + ε i (61) we have been assuming that bf x i and ε i are uncorrelated

More information

Handout 12. Endogeneity & Simultaneous Equation Models

Handout 12. Endogeneity & Simultaneous Equation Models Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to

More information

Instrumental Variables Estimation in Stata

Instrumental Variables Estimation in Stata Christopher F Baum 1 Faculty Micro Resource Center Boston College March 2007 1 Thanks to Austin Nichols for the use of his material on weak instruments and Mark Schaffer for helpful comments. The standard

More information

Microeconometrics (PhD) Problem set 2: Dynamic Panel Data Solutions

Microeconometrics (PhD) Problem set 2: Dynamic Panel Data Solutions Microeconometrics (PhD) Problem set 2: Dynamic Panel Data Solutions QUESTION 1 Data for this exercise can be prepared by running the do-file called preparedo posted on my webpage This do-file collects

More information

Instrumental variables and GMM: Estimation and testing

Instrumental variables and GMM: Estimation and testing Boston College Economics Working Paper 545, 02 November 2002 Instrumental variables and GMM: Estimation and testing Christopher F. Baum Boston College Mark E. Schaffer Heriot Watt University Steven Stillman

More information

Lab 11 - Heteroskedasticity

Lab 11 - Heteroskedasticity Lab 11 - Heteroskedasticity Spring 2017 Contents 1 Introduction 2 2 Heteroskedasticity 2 3 Addressing heteroskedasticity in Stata 3 4 Testing for heteroskedasticity 4 5 A simple example 5 1 1 Introduction

More information

Graduate Econometrics Lecture 4: Heteroskedasticity

Graduate Econometrics Lecture 4: Heteroskedasticity Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model

More information

Lecture 8: Instrumental Variables Estimation

Lecture 8: Instrumental Variables Estimation Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano

More information

Эконометрика, , 4 модуль Семинар Для Группы Э_Б2015_Э_3 Семинарист О.А.Демидова

Эконометрика, , 4 модуль Семинар Для Группы Э_Б2015_Э_3 Семинарист О.А.Демидова Эконометрика, 2017-2018, 4 модуль Семинар 3 160418 Для Группы Э_Б2015_Э_3 Семинарист ОАДемидова * Stata program * copyright C 2010 by A Colin Cameron and Pravin K Trivedi * used for "Microeconometrics

More information

Econometrics. 9) Heteroscedasticity and autocorrelation

Econometrics. 9) Heteroscedasticity and autocorrelation 30C00200 Econometrics 9) Heteroscedasticity and autocorrelation Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Heteroscedasticity Possible causes Testing for

More information

Fixed and Random Effects Models: Vartanian, SW 683

Fixed and Random Effects Models: Vartanian, SW 683 : Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Problem Set 10: Panel Data

Problem Set 10: Panel Data Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005

More information

Topic 7: Heteroskedasticity

Topic 7: Heteroskedasticity Topic 7: Heteroskedasticity Advanced Econometrics (I Dong Chen School of Economics, Peking University Introduction If the disturbance variance is not constant across observations, the regression is heteroskedastic

More information

Instrumental variables estimation using heteroskedasticity-based instruments

Instrumental variables estimation using heteroskedasticity-based instruments Instrumental variables estimation using heteroskedasticity-based instruments Christopher F Baum, Arthur Lewbel, Mark E Schaffer, Oleksandr Talavera Boston College/DIW Berlin, Boston College, Heriot Watt

More information

ECON2228 Notes 7. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 41

ECON2228 Notes 7. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 41 ECON2228 Notes 7 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 6 2014 2015 1 / 41 Chapter 8: Heteroskedasticity In laying out the standard regression model, we made

More information

Instrumental variables estimation using heteroskedasticity-based instruments

Instrumental variables estimation using heteroskedasticity-based instruments Instrumental variables estimation using heteroskedasticity-based instruments Christopher F Baum, Arthur Lewbel, Mark E Schaffer, Oleksandr Talavera Boston College/DIW Berlin, Boston College, Heriot Watt

More information

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Econometrics - Exam May 11, 2011 1 Exam Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Problem 1: (15 points) A researcher has data for the year 2000 from

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see

Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see Title stata.com hausman Hausman specification test Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see Description hausman

More information

Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part)

Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part) Séminaire d Analyse Economique III (LECON2486) Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part) Frédéric Docquier & Sara Salomone IRES UClouvain

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Economics 308: Econometrics Professor Moody

Economics 308: Econometrics Professor Moody Economics 308: Econometrics Professor Moody References on reserve: Text Moody, Basic Econometrics with Stata (BES) Pindyck and Rubinfeld, Econometric Models and Economic Forecasts (PR) Wooldridge, Jeffrey

More information

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect

More information

Handout 11: Measurement Error

Handout 11: Measurement Error Handout 11: Measurement Error In which you learn to recognise the consequences for OLS estimation whenever some of the variables you use are not measured as accurately as you might expect. A (potential)

More information

Lecture 8: Heteroskedasticity. Causes Consequences Detection Fixes

Lecture 8: Heteroskedasticity. Causes Consequences Detection Fixes Lecture 8: Heteroskedasticity Causes Consequences Detection Fixes Assumption MLR5: Homoskedasticity 2 var( u x, x,..., x ) 1 2 In the multivariate case, this means that the variance of the error term does

More information

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1 Mediation Analysis: OLS vs. SUR vs. ISUR vs. 3SLS vs. SEM Note by Hubert Gatignon July 7, 2013, updated November 15, 2013, April 11, 2014, May 21, 2016 and August 10, 2016 In Chap. 11 of Statistical Analysis

More information

Autocorrelation. Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time

Autocorrelation. Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time Autocorrelation Given the model Y t = b 0 + b 1 X t + u t Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time This could be caused

More information

Heteroskedasticity. (In practice this means the spread of observations around any given value of X will not now be constant)

Heteroskedasticity. (In practice this means the spread of observations around any given value of X will not now be constant) Heteroskedasticity Occurs when the Gauss Markov assumption that the residual variance is constant across all observations in the data set so that E(u 2 i /X i ) σ 2 i (In practice this means the spread

More information

Lecture#12. Instrumental variables regression Causal parameters III

Lecture#12. Instrumental variables regression Causal parameters III Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Heteroskedasticity. Occurs when the Gauss Markov assumption that the residual variance is constant across all observations in the data set

Heteroskedasticity. Occurs when the Gauss Markov assumption that the residual variance is constant across all observations in the data set Heteroskedasticity Occurs when the Gauss Markov assumption that the residual variance is constant across all observations in the data set Heteroskedasticity Occurs when the Gauss Markov assumption that

More information

For more information on the Stata Journal, including information for authors, see the web page.

For more information on the Stata Journal, including information for authors, see the web page. The Stata Journal Editor H. Joseph Newton Department of Statistics Texas A & M University College Station, Texas 77843 979-845-3142 979-845-3144 FAX jnewton@stata-journal.com Associate Editors Christopher

More information

. *DEFINITIONS OF ARTIFICIAL DATA SET. mat m=(12,20,0) /*matrix of means of RHS vars: edu, exp, error*/

. *DEFINITIONS OF ARTIFICIAL DATA SET. mat m=(12,20,0) /*matrix of means of RHS vars: edu, exp, error*/ . DEFINITIONS OF ARTIFICIAL DATA SET. mat m=(,,) /matrix of means of RHS vars: edu, exp, error/. mat c=(5,-.6, \ -.6,9, \,,.) /covariance matrix of RHS vars /. mat l m /displays matrix of means / c c c3

More information

Quantitative Methods Final Exam (2017/1)

Quantitative Methods Final Exam (2017/1) Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with

More information

point estimates, standard errors, testing, and inference for nonlinear combinations

point estimates, standard errors, testing, and inference for nonlinear combinations Title xtreg postestimation Postestimation tools for xtreg Description The following postestimation commands are of special interest after xtreg: command description xttest0 Breusch and Pagan LM test for

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication G. S. Maddala Kajal Lahiri WILEY A John Wiley and Sons, Ltd., Publication TEMT Foreword Preface to the Fourth Edition xvii xix Part I Introduction and the Linear Regression Model 1 CHAPTER 1 What is Econometrics?

More information

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N 1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.

More information

Essential of Simple regression

Essential of Simple regression Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship

More information

Econometrics - 30C00200

Econometrics - 30C00200 Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business

More information

Introduction to Econometrics. Heteroskedasticity

Introduction to Econometrics. Heteroskedasticity Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory

More information

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panels. Chapter Introduction Autoregressive Model Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this

More information

Final Exam. 1. Definitions: Briefly Define each of the following terms as they relate to the material covered in class.

Final Exam. 1. Definitions: Briefly Define each of the following terms as they relate to the material covered in class. Name Answer Key Economics 170 Spring 2003 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment

More information

10) Time series econometrics

10) Time series econometrics 30C00200 Econometrics 10) Time series econometrics Timo Kuosmanen Professor, Ph.D. 1 Topics today Static vs. dynamic time series model Suprious regression Stationary and nonstationary time series Unit

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE 1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +

More information

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.

More information

Lecture 4: Heteroskedasticity

Lecture 4: Heteroskedasticity Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan

More information

Dynamic Panel Data Models

Dynamic Panel Data Models Models Amjad Naveed, Nora Prean, Alexander Rabas 15th June 2011 Motivation Many economic issues are dynamic by nature. These dynamic relationships are characterized by the presence of a lagged dependent

More information

Heteroskedasticity Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 30, 2015

Heteroskedasticity Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 30, 2015 Heteroskedasticity Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 30, 2015 These notes draw heavily from Berry and Feldman, and, to a lesser extent, Allison,

More information

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 In Chap. 11 of Statistical Analysis of Management Data (Gatignon, 2014), tests of mediation are

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Instrumental Variables and the Problem of Endogeneity

Instrumental Variables and the Problem of Endogeneity Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38 Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] =

More information

Christopher Dougherty London School of Economics and Political Science

Christopher Dougherty London School of Economics and Political Science Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this

More information

Greene, Econometric Analysis (7th ed, 2012)

Greene, Econometric Analysis (7th ed, 2012) EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 4, 8: Properties of LS and IV estimators We now consider the least squares estimator from the statistical viewpoint,

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

Topic 10: Panel Data Analysis

Topic 10: Panel Data Analysis Topic 10: Panel Data Analysis Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Introduction Panel data combine the features of cross section data time series. Usually a panel

More information

Empirical Application of Simple Regression (Chapter 2)

Empirical Application of Simple Regression (Chapter 2) Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget

More information

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors ECON4150 - Introductory Econometrics Lecture 6: OLS with Multiple Regressors Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 6 Lecture outline 2 Violation of first Least Squares assumption

More information

Greene, Econometric Analysis (7th ed, 2012)

Greene, Econometric Analysis (7th ed, 2012) EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 2 3: Classical Linear Regression The classical linear regression model is the single most useful tool in econometrics.

More information

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48 ECON2228 Notes 10 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 10 2014 2015 1 / 48 Serial correlation and heteroskedasticity in time series regressions Chapter 12:

More information

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes ECON 4551 Econometrics II Memorial University of Newfoundland Panel Data Models Adapted from Vera Tabakova s notes 15.1 Grunfeld s Investment Data 15.2 Sets of Regression Equations 15.3 Seemingly Unrelated

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Birkbeck Working Papers in Economics & Finance

Birkbeck Working Papers in Economics & Finance ISSN 1745-8587 Birkbeck Working Papers in Economics & Finance Department of Economics, Mathematics and Statistics BWPEF 1809 A Note on Specification Testing in Some Structural Regression Models Walter

More information

Multiple Regression Analysis: Heteroskedasticity

Multiple Regression Analysis: Heteroskedasticity Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

2.1. Consider the following production function, known in the literature as the transcendental production function (TPF).

2.1. Consider the following production function, known in the literature as the transcendental production function (TPF). CHAPTER Functional Forms of Regression Models.1. Consider the following production function, known in the literature as the transcendental production function (TPF). Q i B 1 L B i K i B 3 e B L B K 4 i

More information

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 54

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 54 ECON2228 Notes 10 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 10 2014 2015 1 / 54 erial correlation and heteroskedasticity in time series regressions Chapter 12:

More information

Week 11 Heteroskedasticity and Autocorrelation

Week 11 Heteroskedasticity and Autocorrelation Week 11 Heteroskedasticity and Autocorrelation İnsan TUNALI Econ 511 Econometrics I Koç University 27 November 2018 Lecture outline 1. OLS and assumptions on V(ε) 2. Violations of V(ε) σ 2 I: 1. Heteroskedasticity

More information

Greene, Econometric Analysis (5th ed, 2003)

Greene, Econometric Analysis (5th ed, 2003) EC771: Econometrics, Spring 2004 Greene, Econometric Analysis (5th ed, 2003) Chapters 4 5: Properties of LS and IV estimators We now consider the least squares estimator from the statistical viewpoint,

More information

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests ECON4150 - Introductory Econometrics Lecture 5: OLS with One Regressor: Hypothesis Tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 5 Lecture outline 2 Testing Hypotheses about one

More information

9) Time series econometrics

9) Time series econometrics 30C00200 Econometrics 9) Time series econometrics Timo Kuosmanen Professor Management Science http://nomepre.net/index.php/timokuosmanen 1 Macroeconomic data: GDP Inflation rate Examples of time series

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford.

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford. Generalized Method of Moments: I References Chapter 9, R. Davidson and J.G. MacKinnon, Econometric heory and Methods, 2004, Oxford. Chapter 5, B. E. Hansen, Econometrics, 2006. http://www.ssc.wisc.edu/~bhansen/notes/notes.htm

More information

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in

More information

Exercices for Applied Econometrics A

Exercices for Applied Econometrics A QEM F. Gardes-C. Starzec-M.A. Diaye Exercices for Applied Econometrics A I. Exercice: The panel of households expenditures in Poland, for years 1997 to 2000, gives the following statistics for the whole

More information

LECTURE 11. Introduction to Econometrics. Autocorrelation

LECTURE 11. Introduction to Econometrics. Autocorrelation LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct

More information

Instrumental Variables

Instrumental Variables Università di Pavia 2010 Instrumental Variables Eduardo Rossi Exogeneity Exogeneity Assumption: the explanatory variables which form the columns of X are exogenous. It implies that any randomness in the

More information

Statistical Modelling in Stata 5: Linear Models

Statistical Modelling in Stata 5: Linear Models Statistical Modelling in Stata 5: Linear Models Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 07/11/2017 Structure This Week What is a linear model? How good is my model? Does

More information

Econometrics II. Lecture 4: Instrumental Variables Part I

Econometrics II. Lecture 4: Instrumental Variables Part I Econometrics II Lecture 4: Instrumental Variables Part I Måns Söderbom 12 April 2011 mans.soderbom@economics.gu.se. www.economics.gu.se/soderbom. www.soderbom.net 1. Introduction Recall from lecture 3

More information

Day 2A Instrumental Variables, Two-stage Least Squares and Generalized Method of Moments

Day 2A Instrumental Variables, Two-stage Least Squares and Generalized Method of Moments Day 2A nstrumental Variables, Two-stage Least Squares and Generalized Method of Moments c A. Colin Cameron Univ. of Calif.- Davis Frontiers in Econometrics Bavarian Graduate Program in Economics. Based

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

Greene, Econometric Analysis (7th ed, 2012) Chapters 9, 20: Generalized Least Squares, Heteroskedasticity, Serial Correlation

Greene, Econometric Analysis (7th ed, 2012) Chapters 9, 20: Generalized Least Squares, Heteroskedasticity, Serial Correlation EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 9, 20: Generalized Least Squares, Heteroskedasticity, Serial Correlation The generalized linear regression model The

More information

ADVANCED ECONOMETRICS I. Course Description. Contents - Theory 18/10/2017. Theory (1/3)

ADVANCED ECONOMETRICS I. Course Description. Contents - Theory 18/10/2017. Theory (1/3) ADVANCED ECONOMETRICS I Theory (1/3) Instructor: Joaquim J. S. Ramalho E.mail: jjsro@iscte-iul.pt Personal Website: http://home.iscte-iul.pt/~jjsro Office: D5.10 Course Website: http://home.iscte-iul.pt/~jjsro/advancedeconometricsi.htm

More information

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models Applied Econometrics Lecture 3: Introduction to Linear Panel Data Models Måns Söderbom 4 September 2009 Department of Economics, Universy of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics STAT-S-301 Introduction to Time Series Regression and Forecasting (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Introduction to Time Series Regression

More information

Introduction to Eco n o m et rics

Introduction to Eco n o m et rics 2008 AGI-Information Management Consultants May be used for personal purporses only or by libraries associated to dandelon.com network. Introduction to Eco n o m et rics Third Edition G.S. Maddala Formerly

More information

the error term could vary over the observations, in ways that are related

the error term could vary over the observations, in ways that are related Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance Var(u i x i ) = σ 2 is common to all observations i = 1,..., n In many applications, we may

More information

Iris Wang.

Iris Wang. Chapter 10: Multicollinearity Iris Wang iris.wang@kau.se Econometric problems Multicollinearity What does it mean? A high degree of correlation amongst the explanatory variables What are its consequences?

More information