Multiple Regression Analysis: Further Issues
|
|
- Curtis Anthony
- 5 years ago
- Views:
Transcription
1 Multiple Regression Analysis: Further Issues Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) MLR: Further Issues 1 / 36
2 Effects of Data Scaling on OLS Statistics Effects of Data Scaling on OLS Statistics Ping Yu (HKU) MLR: Further Issues 2 / 36
3 Effects of Data Scaling on OLS Statistics Section 2.4: SLR The fitted regression line in the example of CEO salary and return on equity is \salary = roe, n = 209,R 2 =.0132 How will the intercept and slope estimates change when the units of measurement of the dependent and independent variables changes? Suppose the salary is measured in dollars rather than thousands of dollars, the intercept should be 963,191 and the slope should be 18,501. (why?) Solution: convert a new problem to an old problem whose solution is known. Ping Yu (HKU) MLR: Further Issues 3 / 36
4 Effects of Data Scaling on OLS Statistics Relationship between Objective Functions Recall that where arg means arguments. If y i = w 1 y i and x i bβ 0, β b n 1 = arg min β 0,β 1 i=1 = w 2 x i, then (y i β 0 β 1 x i ) 2, n min (y β 0,β i β 0 β 1 x i ) 2 1 i=1 n = min (w 1 y i β β 0,β 0 β 1 w 2x i ) 2 1 i=1 = min w 2 n β β 0,β y i β w 2 1 x w 1 i=1 1 w i. 1 Ping Yu (HKU) MLR: Further Issues 4 / 36
5 Effects of Data Scaling on OLS Statistics Relationship between b β s and bσ 2 s So (why?) Also, bσ 2 = = = bβ 0 = bβ 0 and β b w 1 = β b w w 1 =) b β 0 = w 1 b β 0 and b β 1 = w 1 w 2 b β 1 n 1 n 2 bu i 2 = 1 n n 2 yi i=1 i=1 w1 2 n n 2 y i i=1 β b 0 2 β b 1 x i w1 2 n n 2 bu i 2 i=1 = w 2 1 bσ 2. bβ 0 bβ 2 1 x i Or, the standard error of the regression (SER) bσ = w 1 bσ. Ping Yu (HKU) MLR: Further Issues 5 / 36
6 Effects of Data Scaling on OLS Statistics Relationship between R 2 s and Standard Errors It turns out that R 2 = R 2 : R 2 = 1 = 1 SSR SST n i=1 yi n i=1 bβ 0 y i y 2 bβ 2 1 x i Standard Errors: = 1 = 1 w 2 1 n i=1 bu2 i w 2 1 n i=1 (y i y) 2 SSR SST = R2 bβ 0 = w b 1β 0 and β b 1 = w 1 β b w 1 2 =) se bβ 0 = w 1 se bβ 0 and se bβ 1 = w 1 se bβ w 1 2 Ping Yu (HKU) MLR: Further Issues 6 / 36
7 Effects of Data Scaling on OLS Statistics Relationship between t Statistics and CIs (Section 6.1: MLR) It is natural to predict that t statistic is the same as before: t bβ 1 bβ 1 = b β 1 se = w 1 w 2 h bβ 1 = w 1 b w 2 β 1 β = b 1 = t w bβ. 1 w 2 se bβ 1 se bβ 1 1 The CI for β 1 is the original CI multiplied by w 1 w 2 : h bβ se bβ 1, β b se bβ i se bβ 1, β b se bβ 1 i. The results for β 0 are similar; the only difference is to replace w 1 w 2 by w 1. Ping Yu (HKU) MLR: Further Issues 7 / 36
8 Effects of Data Scaling on OLS Statistics Unit Change in Logarithmic Form Changing the unit of measurement of y and x, when they appear in logarithmic form, does not affect any of the slope estimates, but may affect the intercept estimate. why? so n min [log(y β 0,β i ) β 0 β 1 log(x i )] 2 1 i=1 n = min [log(y i ) + log(w 1 ) β β 0,β 0 β 1 log(x i ) β 1 log(w 2)] 2 1 i=1 n = min [log(y i ) (β β 0,β 0 + β 1 log(w 2) log(w 1 )) β 1 log(x i )] 2, 1 i=1 bβ 0 = b β 0 + b β 1 log(w 2) log(w 1 ) and b β 1 = b β 1 =) b β 0 = b β 0 b β 1 log(w 2 ) + log(w 1 ) and b β 1 = b β 1. I.e., the elasticity is invariant to the units of measurement of either y or x, and the intercept is related to both the original intercept and slope. Ping Yu (HKU) MLR: Further Issues 8 / 36
9 Effects of Data Scaling on OLS Statistics Example: Japanese Learning Suppose we want to study the SLR, y i = β 0 + β 1 x i + u i, where y i = exam mark in Japanese language course x i = hours of study per day during the semester (average hours) The fitted regression line is by i = x i (5.6) (3.5) If you do not study at all, the predicted mark is 20. One additional hour of study per day increases exam mark by 14 marks. At the mean value x = 3 hours of study per day is expected to result in a mark of y = 62. Ping Yu (HKU) MLR: Further Issues 9 / 36
10 Effects of Data Scaling on OLS Statistics continue Now if we report hours of study per week xi rather than per day, the variable has been scaled: xi = w 2 x i = 7x i and yi = w 1 y i = y i. If we run the regression based on xi and y i, we get by i = xi (5.6) (0.5) Each additional hour of study per week increases the exam mark by 2 marks (intuition here). 2 t statistic remains the same: 0.5 = The CI for β 1, [ , ] = [1.02,2.98], is 1/7 of the CI for β 1, which is [ , ] = [7.14,20.98]. Note that the means x and y will still be on the newly estimated regression line: y = x 62 = Ping Yu (HKU) MLR: Further Issues 10 / 36
11 More on Functional Form More on Functional Form Ping Yu (HKU) MLR: Further Issues 11 / 36
12 More on Functional Form More on Using Logarithmic Functional Forms Logarithmic transformations have the convenient percentage/elasticity interpretation. Slope coefficients of logged variables are invariant to rescalings. Taking logs often eliminates/mitigates problems with outliers. (why? figure here) Taking logs often helps to secure normality (e.g., log(wage) vs. wage) and homoskedasticity (see Chapter 8 for an example). Variables measured in units such as years (e.g., education, experience, tenure, age, etc) should not be logged. Variables measured in percentage points (e.g., unemployment rate, participation rate of a pension plan, etc.) should also not be logged. Logs must not be used if variables take on zero or negative values (e.g., hours of work during a month). It is hard to reverse the log-operation when constructing predictions. (we will discuss more on this point later in this chapter) Ping Yu (HKU) MLR: Further Issues 12 / 36
13 More on Functional Form logx Figure: logx < x and lim x! x = 0 Ping Yu (HKU) MLR: Further Issues 13 / 36
14 More on Functional Form Models with Quadratics Example: Suppose the fitted regression line for the wage equation is \wage = exper.0061exper 2 (.35) (.041) (.0009) n = 526,R 2 =.093 (quite small) The predicted wage is a concave function of exper. [figure here] The marginal effect of exper on wage is wage exper = b β b β 2 exper = exper. The first year of experience increases the wage by some $.30, the second year by.298 2(.0061)(1) = $.29 < $.30 etc. Ping Yu (HKU) MLR: Further Issues 14 / 36
15 More on Functional Form Figure: Wage Maximum with Respect to Work Experience Mincer (1974, Schooling, Experience and Earnings) assumed log(wage) = β 0 + β 1 educ + β 2 exper + β 3 exper 2 + u. Ping Yu (HKU) MLR: Further Issues 15 / 36
16 More on Functional Form History of the Wage Equation Jacob Mincer ( ), Columbia, father of modern labor economics Ping Yu (HKU) MLR: Further Issues 16 / 36
17 More on Functional Form Example: Effects of Pollution on Housing Prices The fitted regression line is \ log(price) = log(nox).087log(dist) (.57) (.115) (.043).545rooms +.062rooms 2.048stratio (.165) (.013) (.006) n = 506,R 2 =.603 where nox = nitrogen oxide in air dist = distance from employment centers, in miles stratio = student/teacher ratio The predicted log(price) is a convex function of rooms. [figure here] The coefficient of rooms is negative. Does this mean that, at a low number of rooms, more rooms are associated with lower prices? The marginal effect of rooms on log(price) is log(price) rooms = price/price rooms = rooms. Ping Yu (HKU) MLR: Further Issues 17 / 36
18 More on Functional Form Figure: Calculation of the Turnaround Point Ping Yu (HKU) MLR: Further Issues 18 / 36
19 More on Functional Form Other Possibilities Using Quadratics Along with Logarithms: which implies log(price) = β 0 + β 1 log(nox) + β 2 log(nox) 2 +β 3 crime + β 4 rooms + β 5 rooms 2 + β 6 stratio + u, log(price) log(nox) = % price % nox = β 1 + 2β 2 log(nox). Higher Order Polynomials: It is often assumed that the total cost takes the following form, cost = β 0 + β 1 quantity + β 2 quantity 2 + β 3 quantity 3 + u, which implies a U-shaped marginal cost (MC). [figure here] Ping Yu (HKU) MLR: Further Issues 19 / 36
20 More on Functional Form TVC TC TFC 0 0 Figure: Quadratic MC Implies Cubic TC: q is the inflection point Ping Yu (HKU) MLR: Further Issues 20 / 36
21 More on Functional Form Models with Interaction Terms In the model price = β 0 + β 1 sqrft + β 2 bdrms + β 3 sqrft bdrms + β 4 bthrms + u, sqrft bdrms is the interaction term. The marginal effect of bdrms on price is price bdrms = β 2 + β 3 sqrft. The effect of the number of bedrooms depends on the level of square footage. Interaction effects complicate interpretation of parameters: β 2 is the effect of number of bedrooms, but for a square footage of zero. How to avoid this interpretation difficulty? Ping Yu (HKU) MLR: Further Issues 21 / 36
22 More on Functional Form Reparametrization of Interaction Effects The model can be reparametrized as y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 1 x 2 + u y = α 0 + δ 1 x 1 + δ 2 x 2 + β 3 (x 1 µ 1 ) (x 2 µ 2 ) + u, where µ 1 = E [x 1 ] and µ 2 = E [x 2 ] are population means of x 1 and x 2, and can be replaced by their sample means. What is the relationship between bα 0, δ b 1, δ b 2 and bβ 0, β b 1, β b 2? (Exercise) Now, y x 2 = δ 2 + β 3 (x 1 µ 1 ), i.e., δ 2 is the effect of x 2 if all variables take on their mean values. Advantages of reparametrization: - It is easy to interpret all parameters. - Standard errors for partial effects at the mean values are available. - If necessary, interaction may be centered at other interesting values. Ping Yu (HKU) MLR: Further Issues 22 / 36
23 More on Goodness-of-Fit and Selection of Regressors More on Goodness-of-Fit and Selection of Regressors Ping Yu (HKU) MLR: Further Issues 23 / 36
24 More on Goodness-of-Fit and Selection of Regressors Adjusted R-Squared General remarks on R-squared: - A high R-squared does not imply that there is a causal interpretation. - A low R-squared does not preclude precise estimation of partial effects. Recall that R 2 SSR/n = 1 SST /n = 1 eσ 2 u eσ 2, y so R 2 is estimating the population R-squared ρ 2 = 1 σ 2 u σ 2, y the proportion of the variation in y in the population explained by the independent variables. Adjusted R-Squared: R 2 = 1 SSR/ (n k 1) SST / (n 1) = 1 bσ 2 u bσ 2, y is sometimes also called R-bar squared, where bσ 2 u and bσ 2 y are unbiased estimators of σ 2 u and σ 2 y due to the correction of dfs. Ping Yu (HKU) MLR: Further Issues 24 / 36
25 History of R 2 More on Goodness-of-Fit and Selection of Regressors Henri Theil ( ), Chicago and Florida Ping Yu (HKU) MLR: Further Issues 25 / 36
26 More on Goodness-of-Fit and Selection of Regressors continue R 2 takes into account degrees of freedom of the numerator and denominator, so is generally a better measure of goodness-of-fit. R 2 imposes a penalty for adding new regressors: k " =) R 2 # R 2 increases if and only if the t statistic of a newly added regressor is greater than one in absolute value. [proof not required] - Compared with y = β 0 + β 1 x 1 + u, the regression y = β 0 + β 1 x 1 + β 2 x 2 + u has a larger R 2 if and only if tbβ 2 > 1. Relationship between R 2 and R 2 : by we have 1 R 2 = SSR SST = n k 1 n 1 R 2 = 1 SSR/ (n k 1) SST / (n 1) unless k = 0 or R 2 = 1. Note that R 2 even gets negative if R 2 < n k 1. 1 R 2 n 1 n k 1 < R2 = n k 1 1 R 2, n 1 Ping Yu (HKU) MLR: Further Issues 26 / 36
27 More on Goodness-of-Fit and Selection of Regressors Using Adjusted R-squared to Choose between Nonnested Models Models are nonnested if neither model is a special case of the other. For example, to incorporate diminishing return of sales to R&D, we consider two models: where rdintens = R&D intensity. rdintens = β 0 + β 1 log(sales) + u, rdintens = β 0 + β 1 sales + β 2 sales 2 + u, R 2 =.061 and R 2 =.030 in model 1 and R 2 =.148 and R 2 =.090 in model 2. A comparison between the R-squared of both models would be unfair to the first model because the first model contains fewer parameters. In the given example, even after adjusting for the difference in degrees of freedom, the quadratic model is preferred. Ping Yu (HKU) MLR: Further Issues 27 / 36
28 More on Goodness-of-Fit and Selection of Regressors Comparing Models with Different Dependent Variables R-squared or adjusted R-squared must not be used to compare models which differ in their definition of the dependent variable. Example (CEO Compensation and Firm Performance): and \salary = sales roe (223.90)(.0089) (11.08) n = 209,R 2 =.029,R 2 =.020,SST = 391,732,982 \ lsalary = lsales roe (0.29)(.033) (.0040) n = 209,R 2 =.282,R 2 =.275,SST = There is much less variation in log(salary) that needs to be explained than in salary, so it is not fair to compare R 2 and R 2 of the two models. (we will discuss how to compare the fitting of these two models later in this chapter) Ping Yu (HKU) MLR: Further Issues 28 / 36
29 More on Goodness-of-Fit and Selection of Regressors Controlling for Too Many Factors in Regression Analysis In some cases, certain variables should not be held fixed: - In a regression of traffic fatalities on state beer taxes (and other factors) one should not directly control for beer consumption. - why? Beer taxes influence traffic fatalities only through beer consumption; if beer consumption is controlled, then the coefficient of beer taxes measures the indirect effect of beer taxes, which is hardly interesting. - In a regression of family health expenditures on pesticide usage among farmers one should not control for doctor visits. - why? Health expenditures include doctor visits, an we would like to pick up all effects of pesticide use on health expenditure. Different regressions may serve different purposes: - In a regression of house prices on house characteristics, one would include price assessments and also housing attributes if the purpose of the regression is to study the validity of assessments; one should not include price assessments if the purpose of the regression is to estimate a hedonic price model, which measures the marginal values of various housing attributes. Ping Yu (HKU) MLR: Further Issues 29 / 36
30 More on Goodness-of-Fit and Selection of Regressors Adding Regressors to Reduce the Error Variance Recall that Var bβ j = σ 2 SST j 1 R 2 j. - Adding regressors may exacerbate multicollinearity problems (R 2 j "). - On the other hand, adding regressors reduces the error variance (σ 2 #). Variables that are uncorrelated with other regressors should be added because they reduce error variance (σ 2 #) without increasing multicollinearity (Rj 2 remains the same). However, such uncorrelated variables may be hard to find. Example (Individual Beer Consumption and Beer Prices): Including individual characteristics in a regression of beer consumption on beer prices leads to more precise estimates of the price elasticity if individual characteristics are uncorrelated with beer prices. log(cons) = β 0 + β 1 log(price) + indchar {z } + u. uncorrelated with log(price) Ping Yu (HKU) MLR: Further Issues 30 / 36
31 Prediction and Residual Analysis (*) Prediction and Residual Analysis Ping Yu (HKU) MLR: Further Issues 31 / 36
32 Prediction and Residual Analysis Predicting y When log(y) is the Dependent Variable We study only this prediction problem as promised. Note that implies log(y) = β 0 + β 1 x β k x k + u y = exp(β 0 + β 1 x β k x k )exp(u) = m(x)exp(u). Under the additional assumption that u is independent of (x 1,,x k ), we have E [yjx] = exp(β 0 + β 1 x β k x k )E [exp(u)jx] = exp(β 0 + β 1 x β k x k )E [exp(u)] m(x)α 0, where the second equality is due to the independence between u and x, so the predicted y is by = bm(x)bα 0 where bm(x) = exp bβ 0 + β b 1 x β b k x k and bα 0 = 1 n n exp(bu i ). i=1 Ping Yu (HKU) MLR: Further Issues 32 / 36
33 Prediction and Residual Analysis E [exp(u)] 1 Recall that E [u] = 0, so E [exp(u)] exp(e [u]) = exp(0) = 1. In the following figure, suppose u takes only two values u 1 and u 2 with probability 1 2 and 1 2, respectively. Since E [u] = 1 2 (u 1 + u 2 ) = 0, u 1 = u 2. Now, E [exp(u)] = 1 2 (exp(u 1) + exp(u 2 )) 1 exp 2 (u 1 + u 2 ) = exp(e [u]) = 1, where the equality is achieved only if u 1 = u 2 = 0, i.e., u = 0. As a result, ey = bm(x) = exp \ log(y) under-estimates E [yjx]! Ping Yu (HKU) MLR: Further Issues 33 / 36
34 Prediction and Residual Analysis Figure: Illustration of Jensen s Inequality Ping Yu (HKU) MLR: Further Issues 34 / 36
35 Prediction and Residual Analysis Comparing R-Squared of a Logged and an Unlogged Specification Reconsider the CEO salary problem: and \salary = sales mktval ceoten (65.23) (.0100) (.0095) (5.61) n = 177,R 2 =.201 \ lsalary = lsales mktval ceoten (0.257)(.039) (.050) (.0053) n = 177, e R 2 =.318 R 2 and e R 2 are the R-squareds for the predictions of the unlogged salary variable (although the second regression is originally for logged salaries). Both R-squareds can now be directly compared Ping Yu (HKU) MLR: Further Issues 35 / 36
36 Prediction and Residual Analysis About e R 2 Recall that where by is the predicted value of y. R 2 = [ Corr (y, by) 2, When lsalary is the dependent variable, the predicted value of y is bm(x)bα 0 = bα 0 ey. Since bα 0 > 0, [Corr (y, by) = [ Corr (y, bα 0 ey) = [ Corr (y, ey) invariant to bα 0, where recall that for any a > 0, Corr (X,aY ) = Corr (X,Y ) = Cov (X,Y ) p Var (X )Var (Y ). As a result, er 2 = [ Corr (y, ey) 2. Ping Yu (HKU) MLR: Further Issues 36 / 36
7. Prediction. Outline: Read Section 6.4. Mean Prediction
Outline: Read Section 6.4 II. Individual Prediction IV. Choose between y Model and log(y) Model 7. Prediction Read Wooldridge (2013), Chapter 6.4 2 Mean Prediction Predictions are useful But they are subject
More informationMultiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators
1 2 Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE Hüseyin Taştan 1 1 Yıldız Technical University Department of Economics These presentation notes are based on Introductory
More informationMultiple Linear Regression CIVL 7012/8012
Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for
More informationHeteroskedasticity (Section )
Heteroskedasticity (Section 8.1-8.4) Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Heteroskedasticity 1 / 44 Consequences of Heteroskedasticity for OLS Consequences
More informationOutline. 2. Logarithmic Functional Form and Units of Measurement. Functional Form. I. Functional Form: log II. Units of Measurement
Outline 2. Logarithmic Functional Form and Units of Measurement I. Functional Form: log II. Units of Measurement Read Wooldridge (2013), Chapter 2.4, 6.1 and 6.2 2 Functional Form I. Functional Form: log
More informationECNS 561 Topics in Multiple Regression Analysis
ECNS 561 Topics in Multiple Regression Analysis Scaling Data For the simple regression case, we already discussed the effects of changing the units of measurement Nothing different here Coefficients, SEs,
More informationMore on Specification and Data Issues
More on Specification and Data Issues Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Specification and Data Issues 1 / 35 Functional Form Misspecification Functional
More informationInference in Regression Analysis
ECNS 561 Inference Inference in Regression Analysis Up to this point 1.) OLS is unbiased 2.) OLS is BLUE (best linear unbiased estimator i.e., the variance is smallest among linear unbiased estimators)
More informationStatistical Inference. Part IV. Statistical Inference
Part IV Statistical Inference As of Oct 5, 2017 Sampling Distributions of the OLS Estimator 1 Statistical Inference Sampling Distributions of the OLS Estimator Testing Against One-Sided Alternatives Two-Sided
More informationStatistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression
Model 1 2 Ordinary Least Squares 3 4 Non-linearities 5 of the coefficients and their to the model We saw that econometrics studies E (Y x). More generally, we shall study regression analysis. : The regression
More informationCHAPTER 4. > 0, where β
CHAPTER 4 SOLUTIONS TO PROBLEMS 4. (i) and (iii) generally cause the t statistics not to have a t distribution under H. Homoskedasticity is one of the CLM assumptions. An important omitted variable violates
More informationx i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.
Exercises for the course of Econometrics Introduction 1. () A researcher is using data for a sample of 30 observations to investigate the relationship between some dependent variable y i and independent
More informationLeast Squares Estimation-Finite-Sample Properties
Least Squares Estimation-Finite-Sample Properties Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Finite-Sample 1 / 29 Terminology and Assumptions 1 Terminology and Assumptions
More informationMultiple Regression Analysis. Part III. Multiple Regression Analysis
Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant
More informationThe general linear regression with k explanatory variables is just an extension of the simple regression as follows
3. Multiple Regression Analysis The general linear regression with k explanatory variables is just an extension of the simple regression as follows (1) y i = β 0 + β 1 x i1 + + β k x ik + u i. Because
More informationECO375 Tutorial 4 Wooldridge: Chapter 6 and 7
ECO375 Tutorial 4 Wooldridge: Chapter 6 and 7 Matt Tudball University of Toronto St. George October 6, 2017 Matt Tudball (University of Toronto) ECO375H1 October 6, 2017 1 / 36 ECO375 Tutorial 4 Welcome
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 6: Multiple regression analysis: Further issues
Wooldridge, Introductory Econometrics, 4th ed. Chapter 6: Multiple regression analysis: Further issues What effects will the scale of the X and y variables have upon multiple regression? The coefficients
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model
Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Most of this course will be concerned with use of a regression model: a structure in which one or more explanatory
More informationECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47
ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with
More informationEnvironmental Econometrics
Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;
More informationProblem 4.1. Problem 4.3
BOSTON COLLEGE Department of Economics EC 228 01 Econometric Methods Fall 2008, Prof. Baum, Ms. Phillips (tutor), Mr. Dmitriev (grader) Problem Set 3 Due at classtime, Thursday 14 Oct 2008 Problem 4.1
More informationEconometrics I Lecture 3: The Simple Linear Regression Model
Econometrics I Lecture 3: The Simple Linear Regression Model Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 32 Outline Introduction Estimating
More informationModel Specification and Data Problems. Part VIII
Part VIII Model Specification and Data Problems As of Oct 24, 2017 1 Model Specification and Data Problems RESET test Non-nested alternatives Outliers A functional form misspecification generally means
More informationEconomics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects
Economics 113 Simple Regression Models Simple Regression Assumptions Simple Regression Derivation Changing Units of Measurement Nonlinear effects OLS and unbiased estimates Variance of the OLS estimates
More informationIntroductory Econometrics Exercises for tutorials (Fall 2014)
Introductory Econometrics Exercises for tutorials (Fall 2014) Dept. of Econometrics, Uni. of Economics, Prague, zouharj@vse.cz September 23, 2014 Tutorial 1: Review of basic statistical concepts Exercise
More information1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11
Econ 495 - Econometric Review 1 Contents 1 Linear Regression Analysis 4 1.1 The Mincer Wage Equation................. 4 1.2 Data............................. 6 1.3 Econometric Model.....................
More information5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is
Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do
More informationECON The Simple Regression Model
ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In
More informationEconometrics Homework 1
Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z
More informationInference in Regression Model
Inference in Regression Model Christopher Taber Department of Economics University of Wisconsin-Madison March 25, 2009 Outline 1 Final Step of Classical Linear Regression Model 2 Confidence Intervals 3
More information4.1 Least Squares Prediction 4.2 Measuring Goodness-of-Fit. 4.3 Modeling Issues. 4.4 Log-Linear Models
4.1 Least Squares Prediction 4. Measuring Goodness-of-Fit 4.3 Modeling Issues 4.4 Log-Linear Models y = β + β x + e 0 1 0 0 ( ) E y where e 0 is a random error. We assume that and E( e 0 ) = 0 var ( e
More informationFöreläsning /31
1/31 Föreläsning 10 090420 Chapter 13 Econometric Modeling: Model Speci cation and Diagnostic testing 2/31 Types of speci cation errors Consider the following models: Y i = β 1 + β 2 X i + β 3 X 2 i +
More informationIn Chapter 2, we learned how to use simple regression analysis to explain a dependent
3 Multiple Regression Analysis: Estimation In Chapter 2, we learned how to use simple regression analysis to explain a dependent variable, y, as a function of a single independent variable, x. The primary
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationIntermediate Econometrics
Intermediate Econometrics Markus Haas LMU München Summer term 2011 15. Mai 2011 The Simple Linear Regression Model Considering variables x and y in a specific population (e.g., years of education and wage
More informationExercise sheet 3 The Multiple Regression Model
Exercise sheet 3 The Multiple Regression Model Note: In those problems that include estimations and have a reference to a data set the students should check the outputs obtained with Gretl. 1. Let the
More informationThe Simple Regression Model. Part II. The Simple Regression Model
Part II The Simple Regression Model As of Sep 22, 2015 Definition 1 The Simple Regression Model Definition Estimation of the model, OLS OLS Statistics Algebraic properties Goodness-of-Fit, the R-square
More informationSimple Linear Regression
Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)
More informationECNS 561 Multiple Regression Analysis
ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking
More informationBasic Regression Analysis with Time Series Data
Basic Regression Analysis with Time Series Data Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Basic Time Series 1 / 50 The Nature of Time Series Data The Nature of Time
More informationSolutions to Problem Set 5 (Due November 22) Maximum number of points for Problem set 5 is: 220. Problem 7.3
Solutions to Problem Set 5 (Due November 22) EC 228 02, Fall 2010 Prof. Baum, Ms Hristakeva Maximum number of points for Problem set 5 is: 220 Problem 7.3 (i) (5 points) The t statistic on hsize 2 is over
More informationLab 6 - Simple Regression
Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More informationIn Chapter 2, we learned how to use simple regression analysis to explain a dependent
C h a p t e r Three Multiple Regression Analysis: Estimation In Chapter 2, we learned how to use simple regression analysis to explain a dependent variable, y, as a function of a single independent variable,
More informationECONOMET RICS P RELIM EXAM August 24, 2010 Department of Economics, Michigan State University
ECONOMET RICS P RELIM EXAM August 24, 2010 Department of Economics, Michigan State University Instructions: Answer all four (4) questions. Be sure to show your work or provide su cient justi cation for
More informationHomoskedasticity. Var (u X) = σ 2. (23)
Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This
More informationThe Simple Regression Model. Simple Regression Model 1
The Simple Regression Model Simple Regression Model 1 Simple regression model: Objectives Given the model: - where y is earnings and x years of education - Or y is sales and x is spending in advertising
More informationBasic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler
Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as
More information:Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics?
MRA: Further Issues :Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics? 1. Scaling the explanatory variables Suppose
More informationMotivation for multiple regression
Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope
More informationCHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157.
CHAPTER 7 SOLUTIONS TO PROBLEMS 7. (i) The coefficient on male is 87.75, so a man is estimated to sleep almost one and one-half hours more per week than a comparable woman. Further, t male = 87.75/34.33
More informationMultiple Regression Analysis
Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators
More informationWednesday, October 10 Handout: One-Tailed Tests, Two-Tailed Tests, and Logarithms
Amherst College Department of Economics Economics 360 Fall 2012 Wednesday, October 10 Handout: One-Tailed Tests, Two-Tailed Tests, and Logarithms Preview A One-Tailed Hypothesis Test: The Downward Sloping
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More information0. Introductory econometrics
0. Introductory econometrics 0.1 Structure of economic data Cross-sectional data: Data wic are collected from units of te underlying population at a given time period (wic may vary occasionally) (te arrangement
More informationLecture 3: Multiple Regression
Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u
More informationMultiple Regression: Inference
Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled
More informationCh 13 & 14 - Regression Analysis
Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more
More informationEconometrics Multiple Regression Analysis: Heteroskedasticity
Econometrics Multiple Regression Analysis: João Valle e Azevedo Faculdade de Economia Universidade Nova de Lisboa Spring Semester João Valle e Azevedo (FEUNL) Econometrics Lisbon, April 2011 1 / 19 Properties
More informationRewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35
Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate
More informationApplied Regression Analysis. Section 4: Diagnostics and Transformations
Applied Regression Analysis Section 4: Diagnostics and Transformations 1 Regression Model Assumptions Y i = β 0 + β 1 X i + ɛ Recall the key assumptions of our linear regression model: (i) The mean of
More informationIntroduction to Econometrics Chapter 4
Introduction to Econometrics Chapter 4 Ezequiel Uriel Jiménez University of Valencia Valencia, September 2013 4 ypothesis testing in the multiple regression 4.1 ypothesis testing: an overview 4.2 Testing
More informationChapter 9: The Regression Model with Qualitative Information: Binary Variables (Dummies)
Chapter 9: The Regression Model with Qualitative Information: Binary Variables (Dummies) Statistics and Introduction to Econometrics M. Angeles Carnero Departamento de Fundamentos del Análisis Económico
More informationECON3150/4150 Spring 2016
ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and
More informationApplied Econometrics (QEM)
Applied Econometrics (QEM) based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #3 1 / 42 Outline 1 2 3 t-test P-value Linear
More informationRegression Models - Introduction
Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent
More informationEconometrics Problem Set 4
Econometrics Problem Set 4 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions in shown in Table 1 computed using data for 1988 from the CPS.
More informationRegression Analysis with Cross-Sectional Data
89782_02_c02_p023-072.qxd 5/25/05 11:46 AM Page 23 PART 1 Regression Analysis with Cross-Sectional Data P art 1 of the text covers regression analysis with cross-sectional data. It builds upon a solid
More informationAdditional Topics on Linear Regression
Additional Topics on Linear Regression Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Additional Topics 1 / 49 1 Tests for Functional Form Misspecification 2 Nonlinear
More information2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0
Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct
More informationMultiple Regression Analysis: Heteroskedasticity
Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing
More informationChapter 10 Nonlinear Models
Chapter 10 Nonlinear Models Nonlinear models can be classified into two categories. In the first category are models that are nonlinear in the variables, but still linear in terms of the unknown parameters.
More informationECON3150/4150 Spring 2016
ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression
More informationThe Simple Linear Regression Model
The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate
More informationWrite your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).
STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods in Economics 2 Course code: EC2402 Examiner: Peter Skogman Thoursie Number of credits: 7,5 credits (hp) Date of exam: Saturday,
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours
More informationRegression Models. Chapter 4. Introduction. Introduction. Introduction
Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager
More informationstatistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:
Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility
More informationAnnouncements. Unit 7: Multiple linear regression Lecture 3: Confidence and prediction intervals + Transformations. Uncertainty of predictions
Housekeeping Announcements Unit 7: Multiple linear regression Lecture 3: Confidence and prediction intervals + Statistics 101 Mine Çetinkaya-Rundel November 25, 2014 Poster presentation location: Section
More informationHomework Set 2, ECO 311, Fall 2014
Homework Set 2, ECO 311, Fall 2014 Due Date: At the beginning of class on October 21, 2014 Instruction: There are twelve questions. Each question is worth 2 points. You need to submit the answers of only
More informationCHAPTER 5 LINEAR REGRESSION AND CORRELATION
CHAPTER 5 LINEAR REGRESSION AND CORRELATION Expected Outcomes Able to use simple and multiple linear regression analysis, and correlation. Able to conduct hypothesis testing for simple and multiple linear
More informationProblem C7.10. points = exper.072 exper guard forward (1.18) (.33) (.024) (1.00) (1.00)
BOSTON COLLEGE Department of Economics EC 228 02 Econometric Methods Fall 2009, Prof. Baum, Ms. Phillips (TA), Ms. Pumphrey (grader) Problem Set 5 Due Tuesday 10 November 2009 Total Points Possible: 160
More information1. The shoe size of five randomly selected men in the class is 7, 7.5, 6, 6.5 the shoe size of 4 randomly selected women is 6, 5.
Economics 3 Introduction to Econometrics Winter 2004 Professor Dobkin Name Final Exam (Sample) You must answer all the questions. The exam is closed book and closed notes you may use calculators. You must
More informationRegression with Qualitative Information. Part VI. Regression with Qualitative Information
Part VI Regression with Qualitative Information As of Oct 17, 2017 1 Regression with Qualitative Information Single Dummy Independent Variable Multiple Categories Ordinal Information Interaction Involving
More informationIntroduction to Econometrics. Heteroskedasticity
Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationRegression #8: Loose Ends
Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch
More informationEmpirical Application of Simple Regression (Chapter 2)
Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget
More informationEcon107 Applied Econometrics
Econ107 Applied Econometrics Topics 2-4: discussed under the classical Assumptions 1-6 (or 1-7 when normality is needed for finite-sample inference) Question: what if some of the classical assumptions
More informationChapter 4. Regression Models. Learning Objectives
Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More informationLECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit
LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define
More informationRegression Analysis II
Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationMathematics for Economics MA course
Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between
More information5.1 Model Specification and Data 5.2 Estimating the Parameters of the Multiple Regression Model 5.3 Sampling Properties of the Least Squares
5.1 Model Specification and Data 5. Estimating the Parameters of the Multiple Regression Model 5.3 Sampling Properties of the Least Squares Estimator 5.4 Interval Estimation 5.5 Hypothesis Testing for
More informationNotes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market
Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market Heckman and Sedlacek, JPE 1985, 93(6), 1077-1125 James Heckman University of Chicago
More informationChapter 14 Simple Linear Regression (A)
Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables
More informationLecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables
Lecture 8. Using the CLR Model Relation between patent applications and R&D spending Variables PATENTS = No. of patents (in 000) filed RDEP = Expenditure on research&development (in billions of 99 $) The
More information