Econometric Methods. Prediction / Violation of A-Assumptions. Burcu Erdogan. Universität Trier WS 2011/2012
|
|
- Adam Manning
- 6 years ago
- Views:
Transcription
1 Econometric Methods Prediction / Violation of A-Assumptions Burcu Erdogan Universität Trier WS 2011/2012 (Universität Trier) Econometric Methods / 42
2 Moving on to... 1 Prediction 2 Violation of Assumption A1 3 Model Selection Adjusted coefficient of determination Other Criteria: AIC, SC and PC F-Test t-test Non-nested F-Test J-Test (Universität Trier) Econometric Methods / 42
3 Prediction Point prediction The aim of point prediction is forecasting y-values based on known x-values. In the case of a multiple regression with two regressors, for x 10 and x 20, we have ŷ 0 = α + β 1 x 10 + β 2 x 20 The estimated model for barley production is ŷ t = 0, , x 1t + 0, x 2t. For a parcel with 29 kg/ha phosphate (x 10 = ln 29 = 3, 36730) and 120 kg/ha nitrogen (x 20 = ln 120 = 4, 78749), the model predicts ŷ 0 = 4, 21994, which corresponds to a barley output of about 68 dt/ha. (Universität Trier) Econometric Methods / 42
4 Prediction Point prediction and prediction error In the case of multiple regression, for the vector x 0 = [1 x x K0 ] we predict ŷ 0 = x 0 β. The prediction error is thus ) ŷ 0 y 0 = x 0 β x 0β u 0 = x 0 ( β β u 0. Expected value of the prediction error is E (ŷ 0 y 0 ) = x 0E ( β β) E (u 0 ) = 0. (Universität Trier) Econometric Methods / 42
5 Prediction Variance of the prediction error The variance of the prediction error is obtained from var(ŷ 0 y 0 ) = E [[(ŷ 0 y 0 ) E (ŷ 0 y 0 )] 2] = E [(ŷ 0 y 0 )(ŷ 0 y 0 )] [[ ) ] [ ) ]] = E x 0 ( β β u 0 x 0 ( β β u 0 [ [ )] 2 [ ) ] ] = E x 0 ( β β 2 x 0 ( β β u 0 + u0 2 [ ) ] x0 ] = E x 0( β β)( β β 2x 0E [( β β )u 0 +E (u0), 2 At the last step, pay attention that x 0 ( β β) = ( β β) x 0. Since the elements of β depend solely on the observations t = 1, 2,..., T, and u 0 is the disturbance of the additional observation, β and u 0 are independent; and ) therefore also ( β β) and u 0 : cov (( β β), u 0 = 0. (Universität Trier) Econometric Methods / 42
6 Prediction Variance of the prediction error In addition, since E (u 0 ) = 0 and E ( β β) = 0, Moreover, Hence [[ ] E [( β β)u 0 ] = E ( β β) E ( β β) ) = cov (( β β), u 0 = 0. E (u 2 0) = E var(ŷ 0 y 0 ) = x 0E ] [u 0 E (u 0 )] [ [u 0 E (u 0 )] 2] = var(u 0 ) = σ 2. [ ) ] ( β β ( β β) x 0 + σ 2 = x 0C( β)x 0 + σ 2 = x 0σ 2 ( X X ) 1 x0 + σ 2 [ = σ x 0 ( X X ) ] 1 x0. (Universität Trier) Econometric Methods / 42
7 Prediction Prediction Interval Standardize the prediction error as follows: t = (ŷ 0 y 0 ) E (ŷ 0 y 0 ) ŝe(ŷ 0 y 0 ) = ŷ0 y 0 ŝe(ŷ 0 y 0 ) where ŝe(ŷ 0 y 0 ) = var(ŷ 0 y 0 ). We know that t t (T 3) { } Pr t a/2 ŷ0 y 0 ŝe(ŷ 0 y 0 ) t a/2 = 1 a. Solving for y 0, we get the prediction interval [ŷ 0 t a/2.ŝe(ŷ 0 y 0 ); ŷ 0 + t a/2.ŝe(ŷ 0 y 0 )] (Universität Trier) Econometric Methods / 42
8 Prediction Prediction Interval For barley production, the estimated variance of prediction error is var(ŷ 0 y 0 ) = 0, 00502, hence ŝe(ŷ 0 y 0 ) = 0, , If a = 0, 05, the critical t-value with 27 degrees of freedom is 2,0518. Then, the prediction interval is [4, , , 07088; 4, , , 07088] = [4, 0745; 4, 3654] (Universität Trier) Econometric Methods / 42
9 Moving on to... 1 Prediction 2 Violation of Assumption A1 3 Model Selection Adjusted coefficient of determination Other Criteria: AIC, SC and PC F-Test t-test Non-nested F-Test J-Test (Universität Trier) Econometric Methods / 42
10 Violation of Assumption A1 Consider the following multiple regression model: y t = α + β 1 x 1t + β 2 x 2t β K x Kt + u t. y = X β + u Assumption A1: The econometric model does not lack any relevant exogenous variables and the exogenous variables used are not irrelevant. A true variable selection is achieved when the econometric model 1 does not exclude any relevant exogenous variables and 2 does not contain any irrelevant exogenous variables. (Universität Trier) Econometric Methods / 42
11 Violation of Assumption A1 Example: Factors that affect wages (y t ) in a company will be analyzed. Duration of job training (x 1t ), age (x 2t ) and duration of employment in the company (x 3t ) are considered to be possible exogenous variables. t y t x 1t x 2t x 3t t y t x 1t x 2t x 3t (Universität Trier) Econometric Methods / 42
12 Violation of Assumption A1 Three competing models: Model I (Underspecified model) II (True model) in vector notation y t = α + β 1 x 1t + u t y t = α + β 1 x 1t + β 2 x 2t + u t III (Overspecified model) y t = α + β 1 x 1t + β 2 x 2t + β 3 x 3t + u t where y t represents the wage, x 1t years of education, x 2t age and x 3t duration of employment in the company. Model I (Underspecified model) II (True model) III (Overspecified model) in matrix notation y = X 1 β 1 + ũ y = X 1 β 1 + X 2 β 2 + u y = X 1 β 1 + X 2 β 2 + X 3 β 3 + ν (Universität Trier) Econometric Methods / 42
13 Violation of Assumption A1 True model Model y = Xβ + u in a partitioned notation: where β = (β 1 β 2 ) and y = X 1 β 1 + X 2 β 2 + u 1 x 11 x K1 1 1 x 12 x K1 2 X 1 = x 1T x K1 T x K x K x K1, X x K x K x K2 2 = x K1 +1 T x K1 +2 T x KT. (Universität Trier) Econometric Methods / 42
14 Violation of Assumption A1 Estimation results Table: Overview of estimation results. Model Variable Coeff. ŝe (.) t-stat. p-value I Constant 1354,7 94,2 14,377 < 0,001 Training 89,3 19,8 4,505 < 0,001 II Constant 1027,8 164,5 6,249 < 0,001 Training 62,6 21,2 2,953 0,009 Age 10,6 4,6 2,317 0,033 III Constant 1000,5 225,7 4,432 < 0,001 Training 62,4 21,8 2,859 0,011 Age 12,4 10,7 1,159 0,263 Duration of Emp. -2,6 14,3-0,183 0,857 (Universität Trier) Econometric Methods / 42
15 Violation of Assumption A1 Figure: Interdependencies in the true model. (Universität Trier) Econometric Methods / 42
16 Violation of Assumption A1 Omitted Variable Impact on the expected value of the disturbances The disturbances of the misspecified model (ũ) are ũ = X 2 β 2 + u and this results in a violation of the Assumption B1: E (ũ) = E (X 2 β 2 + u t ) = X 2 β 2 + E (u t ) = X 2 β = 0. Consequences for the point estimator The OLS estimator of the incomplete model is denoted by β 1, which is as usual: β1 = ( X 1X 1 ) 1 X 1 y = ( X 1X 1 ) 1 X 1 (X 1 β 1 + X 2 β 2 + u) = ( X 1X 1 ) 1 X 1 X 1 β 1 + ( X 1X 1 ) 1 X 1 X 2 β 2 + ( X 1X 1 ) 1 X 1 u = β 1 + ( X 1X 1 ) 1 X 1 X 2 β 2 + ( X 1X 1 ) 1 X 1 u. (Universität Trier) Econometric Methods / 42
17 Violation of Assumption A1 Omitted Variable Take expected value of both sides (E (β 1 ) = β 1 and E (u) = o): E ( β1 ) = β 1 + ( X 1X 1 ) 1 X 1 X 2 β 2 = β 1. Consequences for the interval estimator [ β 1 t a/2 ŝe( β 1) ; β 1 + t a/2 ŝe( β 1)] var( β 1 ) must be estimated unbiased, because se( β 1 var( β ) = 1 ). ) var ( β 1 = c 22 where c 22 is the corresponding element of the variance-covariance matrix of the estimators C ( β). We need an unbiased estimate of σ 2. The estimated variance based on the incomplete model is σ 2 ũ ũ = T K 1 1. (Universität Trier) Econometric Methods / 42
18 Violation of Assumption A1 Omitted Variable - Bias of the variance σ 2 = E (ũ ũ) = E [(X 2 β 2 + u) (X 2 β 2 + u)] = E [X 2 β 2 + u] 2 = E [(X 2 β 2 ) (X 2 β 2 ) + (X 2 β 2 ) u + u X 2 β 2 + u u] (X 2 β 2 ) u is a scalar, so (X 2 β 2 ) u = u X 2 β 2. E (ũ ũ) = E [(X 2 β 2 ) (X 2 β 2 ) + 2 (X 2 β 2 ) u + u u] (Universität Trier) Econometric Methods / 42
19 Violation of Assumption A1 Omitted Variable - Bias of the variance = E [(X 2 β 2 ) (X 2 β 2 )] + 2 E [(X 2 β 2 ) u] + E [u u] = E [(X 2 β 2 ) 2 ] + σ 2 > σ 2 as E [(X 2 β 2 ) 2 ] > 0 The underlying true variance of the incomplete model is larger than that of the true model. Consequently, the estimated variance is also biased upwards, and var( β 1 ) is also biased. (Universität Trier) Econometric Methods / 42
20 Violation of Assumption A1 Excursion - Variance in the true model û = y ŷ ŷ = X β = X ( X X ) 1 X y û = y X ( X X ) 1 X y [ = I T X ( X X ) ] 1 X y = My, (Universität Trier) Econometric Methods / 42
21 Violation of Assumption A1 Excursion - Variance in the true model û = M (Xβ + u) = MXβ + Mu [ = I T X X ( X X ) ] 1 X X β + Mu = [I T X XI K+1 ] β + Mu = Mu. Consequently, the sum of squared residuals û û(= Sûû ), is û û = u M Mu = u Mu Matrix M is symmetric and idempotent, hence: M M = M (a scalar) û û = tr ( u Mu ) = tr ( uu M ). (Universität Trier) Econometric Methods / 42
22 Violation of Assumption A1 E [ û û ] = tr [[ E [uu ] ] M ] = tr [[V(u)] M] = tr [ σ 2 I T M ] = σ 2 tr [M] = σ 2 tr [I T X ( X X ) ] 1 X [ = σ 2 tr [I T ] tr [X ( X X ) ]] 1 X [ = σ 2 tr [I T ] tr [X X ( X X ) ]] 1 = σ 2 [tr [I T ] tr [I K +1 ]] = σ 2 [T K 1]. E [ σ 2 ] = E [[ û û ] /T K 1 ] = σ2 [T K 1] = σ 2 T K 1 (Universität Trier) Econometric Methods / 42
23 Violation of Assumption A1 Omitted Variable - Bias of the estimated sigma σ 2 = ũ ũ/(t K 1 1). ũ = y ỹ = = y X 1 β1 ( ) y X 1 X 1 1 X 1 X 1 y = M 1 y, M 1 = I T X 1 ( X 1 X 1 ) 1 X 1 Matrix M 1 is usually referred to as residual generating matrix. Matrix M 1 is also symmetric and idempotent, we have: M 1M 1 = M 1. (Universität Trier) Econometric Methods / 42
24 Violation of Assumption A1 ũ = M 1 y = M 1 (X 1 β 1 + X 2 β 2 + u). ũ = [I T X 1 X 1 ( X 1 X 1 ) 1 X 1 X 1 ] β 1 + M 1 X 2 β 2 + M 1 u = M 1 X 2 β 2 + M 1 u. Consequently, the following holds for the sum of squared residuals ũ ũ: ũ ũ = [ β 2 X 2M 1 + u M 1] [M1 X 2 β 2 + M 1 u] = β 2 X 2M 1M 1 X 2 β 2 + β 2 X 2M 1M 1 u + u M 1M 1 X 2 β 2 + u M 1M 1 u = β 2 X 2M 1 X 2 β 2 + β 2 X 2M 1 u + u M 1 X 2 β 2 + u M 1 u, where u M 1 u and β 2 X 2 M 1u are scalars. ũ ũ = β 2 X 2M 1 X 2 β β 2 X 2M 1 u + u M 1 u, (Universität Trier) Econometric Methods / 42
25 Violation of Assumption A1 As u M 1 u is a scalar: u M 1 u = tr[u M 1 u] and tr[u M 1 u] =tr[uu M 1 ] E [ ũ ũ] = β 2 X 2M 1 X 2 β β 2 X 2M 1 E (u) + E [tr[uu M 1 ]]. E [tr[uu M 1 ]] = σ 2 [T K 1 1] E [ ũ ũ] = β 2 X 2M 1 X 2 β 2 + σ 2 [T K 1 1]. E ( σ 2 ) = E [ ũ ũ] /(T K 1 1) = σ 2 + β 2 X 2M 1 X 2 β 2 / (T K 1 1) Since the second term is a quadratic form, it is positive definite, and accordingly it is a scalar greater than zero (compare von Auer, p. 303). Therefore, the estimated residual variance is also biased. (Universität Trier) Econometric Methods / 42
26 Violation of Assumption A1 Omitted Variable Conclusion: Omitting relevant variables leads to: biased point estimators biased interval estimators invalid hypothesis tests (Universität Trier) Econometric Methods / 42
27 Violation of Assumption A1 Using irrelevant variables First, suppose that we estimate Model III: y = X 1 β 1 + X 2 β 2 + X 3 β 3 + ν This can be represented in a simpler form as where: y = X β + ν X= [ ] X 1 X 2 X 3, β = Since Model III is wrong, β 3 = o. β 1 β 2 β 3 = β 1 β 2 o (Universität Trier) Econometric Methods / 42
28 Violation of Assumption A1 Using irrelevant variables Consequences for the point estimator β = ( X X ) 1 X y y =X β + ν = X 1 β 1 + X 2 β 2 + X 3 o + ν β = β + ( X X ) 1 X u. E ( β) = E (β + ( X X ) 1 X u) = E (β) + E ( ( X X ) 1 X u) = β. Conclusion: Since the elements of the vector β 3 are zero, beta estimator is not systematically biased. So, there is not a problem with the expected value of β. (Universität Trier) Econometric Methods / 42
29 Violation of Assumption A1 Using irrelevant variables Consequences for the interval estimator [ β 1 t a/2 ŝe( β 1 ) ; β 1 + t a/2 ŝe( β 1 )] It is crucial that the variance is not biased. u = ν + X 3 β 3 ν = u X 3 β 3 = u o = u It is thus clear that the expected value is E (ν) = 0. Moreover, the estimated variance is unbiased. The only problem is that the point estimator is not efficient, which means that there is another efficient estimator, namely of the true model. (In other words, the matrix X is now another one, and therefore the estimated variance-covariance matrix of parameter estimates is greater, see von Auer p. 304.) (Universität Trier) Econometric Methods / 42
30 Violation of Assumption A1 Using irrelevant variables Conclusion: Using irrelevant variables leads to: unbiased, but inefficient point estimates unbiased, but inefficient interval estimates valid, but not powerful hypothesis testing (Universität Trier) Econometric Methods / 42
31 Moving on to... 1 Prediction 2 Violation of Assumption A1 3 Model Selection Adjusted coefficient of determination Other Criteria: AIC, SC and PC F-Test t-test Non-nested F-Test J-Test (Universität Trier) Econometric Methods / 42
32 Adjusted coefficient of determination Coefficient of determination R 2 = (S yy Sûû )/S yy = 1 Sûû /S yy is a meaningful criterion if the following conditions are met: 1 the endogenous variables of the models are identical, i.e. their numerical values match, 2 the number of exogenous variables is identical in the models, 3 the models have a level parameter α. (Universität Trier) Econometric Methods / 42
33 Adjusted coefficient of determination Econometric software provides us the following coefficients of determination, α- and β 1 -variances for the three wage models: Table: Coefficients of determination and estimated coefficient variances of the competing three models. Model R 2 in % var( α) var( β 1 ) I 52, , 824 II 64, , 043 III 64, , 745 (Universität Trier) Econometric Methods / 42
34 Adjusted coefficient of determination The adjusted coefficient of determination: R 2 = 1 S / ûû (T K 1) / = 1 S ûû (T 1) S yy (T 1) S yy (T K 1) = 1 ( 1 R 2) T 1 T K 1 Comparison of the three wage models: Model I : R 2 = 1 (1 0, 5299) 20 1 = 0, Model II : R 2 = 1 (1 0, 6427) 20 1 = 0, Model III : R 2 = 1 (1 0, 6434) 20 1 = 0, (Universität Trier) Econometric Methods / 42
35 Other Criteria: AIC, SC and PC Akaike information criterion: AIC = ln ( Sûû T ) 2(K + 1) + T Schwarz criterion: SC = ln ( Sûû T ) (K + 1) ln T + T Prediction criterion: PC = S ûû[1 + (K + 1)/T ] T K 1 (Universität Trier) Econometric Methods / 42
36 Other Criteria: AIC, SC and PC We obtain the following Sûû -values for the three competing wage models I, II and III: , und Thus, the following results arise: Table: AIC, SC and PC of the three models to compare. Model AIC SC PC I 11,251 11, ,715 II 11,077 11, ,701 III 11,174 11, ,877 (Universität Trier) Econometric Methods / 42
37 F-Test One could perform an F -Test to compare Model I and Model III; H 0 : β 2 = β 3 = 0. F -statistic is calculated as For our example, we have F = (S 0 ûû S ûû)/l Sûû /(T K 1). F = ( ) / / (20 4) = 2, 548. The critical F -value at a significance level of a = 0, 05 for 2 and 16 degrees of freedom is: F 0,05 = 3, 634. Hence, the two variables are not jointly significant. (Universität Trier) Econometric Methods / 42
38 t-test To compare Model II and III, we could test H 0 : β 3 = 0: t = β 3 q ŝe( β 3 ) = β 3 0 ŝe( β 3 ). Normally, the respective t-statistics should be determined for all exogenous variables of the competing models. The critical value t a/2 at a significance level of a = 0, 05 is 2, 1098 for Model II (17 degrees of freedom) and 2, 1199 for Model III (16 degrees of freedom). The estimation results in the table (slide 14) agree on the true model overall. (Universität Trier) Econometric Methods / 42
39 Non-nested F-Test Let Model IV be an alternative candidate to the Model II: y t = α + β 2 x 2t + β 3 x 3t + u t. Table: Estimation results for the model. Variable Coeff. ŝe(ˆ ) t-stat p-value Constant 921, 4 267, 2 3, 449 0, 003 Age 20, 7 12, 2 1, 691 0, 109 Duration of emp. 4, 1 17, 0 0, 241 0, 812 At a significance level of 5%, neither age nor duration of employment are significant. (Universität Trier) Econometric Methods / 42
40 Non-nested F-Test Non-nested F -Test proceeds in two steps: 1 From the two models, a mega-model is formed: y t = α + β 1 x 1t + β 2 x 2t + β 3 x 3t + u t [corresponds to Model III]. 2 For this model, two F-test are performed. First, joint significance of betas, which appear in the first model but not in the alternative, are tested. Second, joint significance of betas, which appear in the alternative model but not in the first model, are tested. In our example, the null hypotheses H 0 : β 1 = 0 and H 0 : β 3 = 0 are tested successively with an F -Test (t-tests would also be acceptable here). For H 0 : β 1 = 0, F = 8, 174 and for H 0 : β 3 = 0, F = 0, 033 The critical F -value is F 0,05 = 4, 494. Model II is, therefore, preferable to the alternative model. (Universität Trier) Econometric Methods / 42
41 J-Test Models II and IV should be compared again. J-test proceeds in the following steps: 1 The estimated values of the endogenous variable ŷ t are calculated for Model II. 2 This variable is used as an additional variable in Model IV: y t = α + β 2 x 2t + β 3 x 3t + β 4 ŷ t + u t. (Model IV*) 3 A t-test for the null hypothesis H 0 : β 4 = 0 is run. 4 The same test is also carried out in reverse: Model IV is estimated and the corresponding values of ŷt are included as additional variables in Model II: y t = α + β 1 x 1t + β 2 x 2t + β 4ŷ t + u t. (Model II*) A t-test is conducted for the null hypothesis H 0 : β 4 = 0. (Universität Trier) Econometric Methods / 42
42 J-Test Econometric software delivers the estimated values ŷ t for Model II. Based on these values, an OLS estimation of Model IV can be run. This delivers ˆβ 4 = 1, 0 and (for H 0 : β 4 = 0) t = 2, 859. The critical t-value is t a/2 = 2, 1199 (16 degrees of freedom, at a significance level of a = 0, 05). The null hypothesis is rejected. Econometric software delivers the estimated values ŷ t for Model IV. OLS estimation of Model II delivers ˆβ 4 = 0, 637 and (for H 0 : β 4 = 0) t = 0, 183. The critical t-value is again t a/2 = 2, 1199 (16 degrees of freedom, at a significance level of a = 0, 05). The null hypothesis H 0 : β 4 = 0 cannot be rejected. J-test would prefer Model II to Model IV. (Universität Trier) Econometric Methods / 42
5. Erroneous Selection of Exogenous Variables (Violation of Assumption #A1)
5. Erroneous Selection of Exogenous Variables (Violation of Assumption #A1) Assumption #A1: Our regression model does not lack of any further relevant exogenous variables beyond x 1i, x 2i,..., x Ki and
More information3. Linear Regression With a Single Regressor
3. Linear Regression With a Single Regressor Econometrics: (I) Application of statistical methods in empirical research Testing economic theory with real-world data (data analysis) 56 Econometrics: (II)
More informationCHAPTER 6: SPECIFICATION VARIABLES
Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero
More informationTopic 4: Model Specifications
Topic 4: Model Specifications Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Functional Forms 1.1 Redefining Variables Change the unit of measurement of the variables will
More information1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i
1/34 Outline Basic Econometrics in Transportation Model Specification How does one go about finding the correct model? What are the consequences of specification errors? How does one detect specification
More informationThe Multiple Regression Model Estimation
Lesson 5 The Multiple Regression Model Estimation Pilar González and Susan Orbe Dpt Applied Econometrics III (Econometrics and Statistics) Pilar González and Susan Orbe OCW 2014 Lesson 5 Regression model:
More informationFöreläsning /31
1/31 Föreläsning 10 090420 Chapter 13 Econometric Modeling: Model Speci cation and Diagnostic testing 2/31 Types of speci cation errors Consider the following models: Y i = β 1 + β 2 X i + β 3 X 2 i +
More informationMultiple Regression Analysis. Part III. Multiple Regression Analysis
Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant
More informationAnswers to Problem Set #4
Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2
More informationThe Linear Regression Model
The Linear Regression Model Carlo Favero Favero () The Linear Regression Model 1 / 67 OLS To illustrate how estimation can be performed to derive conditional expectations, consider the following general
More information2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
More informationMotivation for multiple regression
Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope
More informationThe Statistical Property of Ordinary Least Squares
The Statistical Property of Ordinary Least Squares The linear equation, on which we apply the OLS is y t = X t β + u t Then, as we have derived, the OLS estimator is ˆβ = [ X T X] 1 X T y Then, substituting
More information1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE
1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +
More informationApplied Econometrics (QEM)
Applied Econometrics (QEM) based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #3 1 / 42 Outline 1 2 3 t-test P-value Linear
More informationThe Simple Regression Model. Part II. The Simple Regression Model
Part II The Simple Regression Model As of Sep 22, 2015 Definition 1 The Simple Regression Model Definition Estimation of the model, OLS OLS Statistics Algebraic properties Goodness-of-Fit, the R-square
More informationThe general linear regression with k explanatory variables is just an extension of the simple regression as follows
3. Multiple Regression Analysis The general linear regression with k explanatory variables is just an extension of the simple regression as follows (1) y i = β 0 + β 1 x i1 + + β k x ik + u i. Because
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationHeteroskedasticity. Part VII. Heteroskedasticity
Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationHomoskedasticity. Var (u X) = σ 2. (23)
Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This
More informationLinear Regression. Junhui Qian. October 27, 2014
Linear Regression Junhui Qian October 27, 2014 Outline The Model Estimation Ordinary Least Square Method of Moments Maximum Likelihood Estimation Properties of OLS Estimator Unbiasedness Consistency Efficiency
More informationWISE International Masters
WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are
More information10. Time series regression and forecasting
10. Time series regression and forecasting Key feature of this section: Analysis of data on a single entity observed at multiple points in time (time series data) Typical research questions: What is the
More informationINTRODUCTORY ECONOMETRICS
INTRODUCTORY ECONOMETRICS Lesson 2b Dr Javier Fernández etpfemaj@ehu.es Dpt. of Econometrics & Statistics UPV EHU c J Fernández (EA3-UPV/EHU), February 21, 2009 Introductory Econometrics - p. 1/192 GLRM:
More informationIntroductory Econometrics
Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna November 23, 2013 Outline Introduction
More informationEmpirical Economic Research, Part II
Based on the text book by Ramanathan: Introductory Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna December 7, 2011 Outline Introduction
More informationEconometrics Multiple Regression Analysis: Heteroskedasticity
Econometrics Multiple Regression Analysis: João Valle e Azevedo Faculdade de Economia Universidade Nova de Lisboa Spring Semester João Valle e Azevedo (FEUNL) Econometrics Lisbon, April 2011 1 / 19 Properties
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model
Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Most of this course will be concerned with use of a regression model: a structure in which one or more explanatory
More informationSimple Linear Regression: The Model
Simple Linear Regression: The Model task: quantifying the effect of change X in X on Y, with some constant β 1 : Y = β 1 X, linear relationship between X and Y, however, relationship subject to a random
More informationReview of Econometrics
Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,
More information9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures
FE661 - Statistical Methods for Financial Engineering 9. Model Selection Jitkomut Songsiri statistical models overview of model selection information criteria goodness-of-fit measures 9-1 Statistical models
More informationSection 2 NABE ASTEF 65
Section 2 NABE ASTEF 65 Econometric (Structural) Models 66 67 The Multiple Regression Model 68 69 Assumptions 70 Components of Model Endogenous variables -- Dependent variables, values of which are determined
More informationEconometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in
More informationF3: Classical normal linear rgression model distribution, interval estimation and hypothesis testing
F3: Classical normal linear rgression model distribution, interval estimation and hypothesis testing Feng Li Department of Statistics, Stockholm University What we have learned last time... 1 Estimating
More informationInterpreting Regression Results
Interpreting Regression Results Carlo Favero Favero () Interpreting Regression Results 1 / 42 Interpreting Regression Results Interpreting regression results is not a simple exercise. We propose to split
More information08 Endogenous Right-Hand-Side Variables. Andrius Buteikis,
08 Endogenous Right-Hand-Side Variables Andrius Buteikis, andrius.buteikis@mif.vu.lt http://web.vu.lt/mif/a.buteikis/ Introduction Consider a simple regression model: Y t = α + βx t + u t Under the classical
More informationAdvanced Econometrics I
Lecture Notes Autumn 2010 Dr. Getinet Haile, University of Mannheim 1. Introduction Introduction & CLRM, Autumn Term 2010 1 What is econometrics? Econometrics = economic statistics economic theory mathematics
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47
ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with
More informationLesson 17: Vector AutoRegressive Models
Dipartimento di Ingegneria e Scienze dell Informazione e Matematica Università dell Aquila, umberto.triacca@ec.univaq.it Vector AutoRegressive models The extension of ARMA models into a multivariate framework
More informationEconometrics Honor s Exam Review Session. Spring 2012 Eunice Han
Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity
More informationLECTURE 2 LINEAR REGRESSION MODEL AND OLS
SEPTEMBER 29, 2014 LECTURE 2 LINEAR REGRESSION MODEL AND OLS Definitions A common question in econometrics is to study the effect of one group of variables X i, usually called the regressors, on another
More informationThe regression model with one fixed regressor cont d
The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8
More informationLecture 5: Omitted Variables, Dummy Variables and Multicollinearity
Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the
More informationModel Specification and Data Problems. Part VIII
Part VIII Model Specification and Data Problems As of Oct 24, 2017 1 Model Specification and Data Problems RESET test Non-nested alternatives Outliers A functional form misspecification generally means
More informationSimultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations
Simultaneous Equation Models. Introduction: basic definitions 2. Consequences of ignoring simultaneity 3. The identification problem 4. Estimation of simultaneous equation models 5. Example: IS LM model
More informationEconometrics I. Andrea Beccarini. Summer 2011
Econometrics I Andrea Beccarini Summer 2011 Outline Very brief review of statistical basics Simple linear regression model (specification, point estimation, interval estimation, hypothesis tests, forecasting,
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within
More informationMultiple Regression Analysis
Chapter 4 Multiple Regression Analysis The simple linear regression covered in Chapter 2 can be generalized to include more than one variable. Multiple regression analysis is an extension of the simple
More informationECON Introductory Econometrics. Lecture 16: Instrumental variables
ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More informationstatistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:
Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares
Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit
More informationAnswer all questions from part I. Answer two question from part II.a, and one question from part II.b.
B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries
More informationApplied Econometrics (MSc.) Lecture 3 Instrumental Variables
Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.
More informationEconometrics Summary Algebraic and Statistical Preliminaries
Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L
More informationEconometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage
More informationSpatial Econometrics
Spatial Econometrics Lecture 5: Single-source model of spatial regression. Combining GIS and regional analysis (5) Spatial Econometrics 1 / 47 Outline 1 Linear model vs SAR/SLM (Spatial Lag) Linear model
More informationF9 F10: Autocorrelation
F9 F10: Autocorrelation Feng Li Department of Statistics, Stockholm University Introduction In the classic regression model we assume cov(u i, u j x i, x k ) = E(u i, u j ) = 0 What if we break the assumption?
More informationIntroduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017
Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental
More informationEconometrics - 30C00200
Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business
More informationWeek 5 Quantitative Analysis of Financial Markets Modeling and Forecasting Trend
Week 5 Quantitative Analysis of Financial Markets Modeling and Forecasting Trend Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 :
More informationPractice Questions for the Final Exam. Theoretical Part
Brooklyn College Econometrics 7020X Spring 2016 Instructor: G. Koimisis Name: Date: Practice Questions for the Final Exam Theoretical Part 1. Define dummy variable and give two examples. 2. Analyze the
More informationBasic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler
Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as
More informationMultiple Linear Regression
Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there
More informationMultiple Linear Regression CIVL 7012/8012
Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for
More informationAn overview of applied econometrics
An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationEconometrics. 8) Instrumental variables
30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates
More informationOSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity
OSU Economics 444: Elementary Econometrics Ch.0 Heteroskedasticity (Pure) heteroskedasticity is caused by the error term of a correctly speciþed equation: Var(² i )=σ 2 i, i =, 2,,n, i.e., the variance
More informationEconometrics. 7) Endogeneity
30C00200 Econometrics 7) Endogeneity Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Common types of endogeneity Simultaneity Omitted variables Measurement errors
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationECONOMETRICS HONOR S EXAM REVIEW SESSION
ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes
More informationChapter 11 Specification Error Analysis
Chapter Specification Error Analsis The specification of a linear regression model consists of a formulation of the regression relationships and of statements or assumptions concerning the explanator variables
More informationMultiple Regression Analysis: Estimation ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD
Multiple Regression Analysis: Estimation ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD Outline Motivation. Mechanics and Interpretation. Expected Values and Variances of the Estimators. Motivation for multiple
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More informationMath 423/533: The Main Theoretical Topics
Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)
More informationECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes
ECON 4551 Econometrics II Memorial University of Newfoundland Panel Data Models Adapted from Vera Tabakova s notes 15.1 Grunfeld s Investment Data 15.2 Sets of Regression Equations 15.3 Seemingly Unrelated
More informationUnless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold.
Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Spring 2015 Instructor: Martin Farnham Unless provided with information to the contrary,
More informationProblem Set #6: OLS. Economics 835: Econometrics. Fall 2012
Problem Set #6: OLS Economics 835: Econometrics Fall 202 A preliminary result Suppose we have a random sample of size n on the scalar random variables (x, y) with finite means, variances, and covariance.
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Per Pettersson-Lidbom Number of creds: 7,5 creds Date of exam: Thursday, January 15, 009 Examination
More informationReview of probability and statistics 1 / 31
Review of probability and statistics 1 / 31 2 / 31 Why? This chapter follows Stock and Watson (all graphs are from Stock and Watson). You may as well refer to the appendix in Wooldridge or any other introduction
More informationTest of hypotheses with panel data
Stochastic modeling in economics and finance November 4, 2015 Contents 1 Test for poolability of the data 2 Test for individual and time effects 3 Hausman s specification test 4 Case study Contents Test
More informationIntroduction to Estimation Methods for Time Series models. Lecture 1
Introduction to Estimation Methods for Time Series models Lecture 1 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 1 SNS Pisa 1 / 19 Estimation
More informationBrief Suggested Solutions
DEPARTMENT OF ECONOMICS UNIVERSITY OF VICTORIA ECONOMICS 366: ECONOMETRICS II SPRING TERM 5: ASSIGNMENT TWO Brief Suggested Solutions Question One: Consider the classical T-observation, K-regressor linear
More informationMatematické Metody v Ekonometrii 7.
Matematické Metody v Ekonometrii 7. Multicollinearity Blanka Šedivá KMA zimní semestr 2016/2017 Blanka Šedivá (KMA) Matematické Metody v Ekonometrii 7. zimní semestr 2016/2017 1 / 15 One of the assumptions
More informationUnit roots in vector time series. Scalar autoregression True model: y t 1 y t1 2 y t2 p y tp t Estimated model: y t c y t1 1 y t1 2 y t2
Unit roots in vector time series A. Vector autoregressions with unit roots Scalar autoregression True model: y t y t y t p y tp t Estimated model: y t c y t y t y t p y tp t Results: T j j is asymptotically
More informationRegression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T,
Regression Analysis The multiple linear regression model with k explanatory variables assumes that the tth observation of the dependent or endogenous variable y t is described by the linear relationship
More informationSimple Linear Regression
Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)
More information6. Assessing studies based on multiple regression
6. Assessing studies based on multiple regression Questions of this section: What makes a study using multiple regression (un)reliable? When does multiple regression provide a useful estimate of the causal
More informationMultiple Regression Analysis: Heteroskedasticity
Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing
More information7. Integrated Processes
7. Integrated Processes Up to now: Analysis of stationary processes (stationary ARMA(p, q) processes) Problem: Many economic time series exhibit non-stationary patterns over time 226 Example: We consider
More informationMultivariate Regression Analysis
Matrices and vectors The model from the sample is: Y = Xβ +u with n individuals, l response variable, k regressors Y is a n 1 vector or a n l matrix with the notation Y T = (y 1,y 2,...,y n ) 1 x 11 x
More informationModel Mis-specification
Model Mis-specification Carlo Favero Favero () Model Mis-specification 1 / 28 Model Mis-specification Each specification can be interpreted of the result of a reduction process, what happens if the reduction
More informationSimple Linear Regression
Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator
More information