6. Assessing studies based on multiple regression
|
|
- Eustacia Nelson
- 6 years ago
- Views:
Transcription
1 6. Assessing studies based on multiple regression Questions of this section: What makes a study using multiple regression (un)reliable? When does multiple regression provide a useful estimate of the causal effect under consideration? Conceptual framework: Internal and external validity 139
2 Definition 6.1: (Internal and external validity) A statistical analysis is said to have internal validity if the statistical inferences about causal effects are valid for the population being studied. The analysis is said to have external validity if its inferences and conclusions can be generalized from the population and setting studied to other populations and settings. Terminology: The population studied is the population of entities (people, companies,...) from which the sample was drawn The population of interest is the population of entities to which the causal inferences from the study are to be applied By setting we mean the institutional, legal, social, and economic environment 140
3 Threats to internal validity: The estimator of a causal effect should be unbiased and consistent Hypothesis tests should have the desired significance level Confidence intervals should have the desired confidence levels Requirements for internal validity are that the OLS estimator is unbiased and consistent and that standard errors are computed in a way that makes confidence intervals have the desired confidence levels These requirements might not be met for various reasons Threats to internal validity that lead to failures of our OLS assumptions from Slide
4 Threats to external validity: Differences between the population being studied and the population of interest Example: Medical studies that use animal populations like mice (the population being studied), but aim at transfering the results to human populations (the population of interest) Even if the population studied and the population of interest are identical, the study results may not be generalized due to differences in the settings Example: The effect of an antidrinking advertising campaign on the drinking behaviour of a group of first-term students might differ at two universities if the legal penalties for drinking differ at both universities 142
5 Threats to external validity: [continued] Important questions with respect to external validity are: How to assess the external validity of a study? How to design an externally valid study? Both issues require specific knowledge of the populations and settings being studied and those of interest A rigorous treatment of both issues is beyond the scope of this lecture (cf. Shadish et al., 2002, for details) We focus on aspects of internal validity 143
6 6.1. Threats to internal validity of multiple regression analysis Objectives: Survey of five reasons why the OLS estimator of a multiple regression coefficient may be biased even in large samples (Sections ) All five sources of bias arise because the regressor is correlated with the error term in the population regression Violation of the first OLS assumption from Slide 43 What can be done to reduce this bias? 144
7 Omitted variable bias Omitted variable bias: If an omitted variable is a determinant of Y i and if it is correlated with at least one of the regressors, then the OLS estimator of at least one of the coefficients will have omitted variable bias (see Definition 3.5 on Slide 52) This bias persists even in large samples OLS estimator(s) is (are) inconsistent Mathematically, under omitted variable bias at least one of the regressors is correlated with the error term u i implying that E(u i X 1i,..., X ki ) 0 145
8 Mitigation of omitted variable bias: Inclusion of control variables in the regression equation Definition 6.2: (Control variable) A control variable is not the object of interest in a regression analysis, but is rather a regressor included to hold constant factors that, if neglected, could lead the estimated causal effect of interest to suffer from omitted variable bias. Remarks: Up to now: OLS assumptions on Slide 43 treat all regressors symmetrically Now: explicit distinction between regressors of interest and control variables 146
9 Mathematical motivation: Consider a regression with two variables X 1i (the regressor) and X 2i (the control variable) Y i = β 0 + β 1 X 1i + β 2 X 2i + u i Replace the first OLS assumption E(u i X 1i, X 2i ) = 0 by the so-called conditional-mean-independence assumption E(u i X 1i, X 2i ) = E(u i X 2i ) (6.1) β 1 has a causal interpretation, but β 2 does not (see class for details) 147
10 Intuition of (6.1): The inclusion of the control variable X 2i makes the regressor X 1i uncorrelated with u i so that the OLS estimator ˆβ 1 can estimate the causal effect on Y i of a change in X 1i By contrast, the control variable X 2i remains correlated with u i so that its coefficient β 2 is subject to omitted variable bias and does not have a causal interpretation The control variable X 2i is included because it controls for omitted factors that affect Y i and are correlated with X 1i it might (but need not) have a causal effect itself When a control variable is used, it is controlling for both, (1) its own direct causal effect (if any), and (2) for the effect of correlated omitted factors 148
11 Terminology: Complete phrasing: The coefficient β 1 on the regressor X 1i is the causal effect on Y i of a change in X 1i using the control variable X 2i both (1) to hold constant the direct effect of X 2i, and (2) to control for factors correlated with X 1i Conventional, less awkward phrasing: The coefficient β 1 on X 1i is the effect on Y i controlling for X 2i 149
12 Example: Consider the student-performance dataset Consider the regression results of TEST SCORE on STR and PCTEL (see left panel on Slide 151) Potentially omitted factor could be outside learning opportunities Factor outside learning opportunities is difficult to measure, but correlated with the students economic background Include a measure of economic background to control for omitted income-related determinants of TEST SCORE like outside learning opportunities Such a control variable is MEAL PCT measuring the percentage of students receiving a free or subsidized lunch 150
13 TEST SCORE regression results with and without the control variable MEAT PCT Dependent Variable: TEST_SCORE Method: Least Squares Date: 19/05/12 Time: 18:00 Sample: Included observations: 420 White heteroskedasticity-consistent standard errors & covariance Variable Coefficient Std. Error t-statistic Prob. C STR PCTEL R-squared Adjusted R-squared S.E. of regression Sum squared resid Log likelihood F-statistic Prob(F-statistic) tatis Mean dependent var S.D. dependent var Akaike info criterion Schwarz criterion Hannan-Quinn criter Durbin-Watson stat Dependent Variable: TEST_SCORE Method: Least Squares Date: 19/05/12 Time: 17:52 Sample: Included observations: 420 White heteroskedasticity-consistent standard errors & covariance Variable Coefficient Std. Error t-statistic Prob. C STR PCTEL MEAL_PCT R-squared Adjusted R-squared S.E. of regression Sum squared resid Log likelihood F-statistic Prob(F-statistic) tatis Mean dependent var S.D. dependent var Akaike info criterion Schwarz criterion Hannan-Quinn criter Durbin-Watson stat
14 Example: [continued] Including the control variable MEAL PCT does not substantially change the effect of STR on TEST SCORE (ˆβ 1 changes from to ) changes the size (but not the sign) of the effect of PCTEL on TEST SCORE (ˆβ 2 changes from to ) The estimated coefficient ˆβ 3 = is not reasonable, since if it were we could boost TEST SCORE by eliminating the reduced-price lunch programme Do not treat β 3 as causal 152
15 Solutions to omitted variable bias: Distinction between two situations, namely (1) one in which data on the omitted variable or on adequate control variables are available, and (2) one in which data are not available Situation #1: If data on the omitted variable is available, include it in the regression equation If you have data on adequate control variables (with the hope of achieving conditional mean independence), include these variables in the regression equation 153
16 Trade-off: Adding a variable to a regression has both costs and benefits On the one hand, omitting the variable could result in omitted variable bias On the other hand, including a variable that is not a relevant regressor (that is, when its population regression coefficient is zero) reduces the precision of the estimators of the other regression coefficients (in the form of higher variances of the OLS estimators) 154
17 Situation #2: If no data are available there are three potential ways of mitigating the omitted variable bias The use of panel data (see Stock & Watson, 2011, Section 8) The use of instrumental variables (see Section 7) The conduct of randomized controlled experiments (see Stock & Watson, 2011, Section 13) 155
18 Guidelines for deciding whether to include an additional variable: Be specific about the coefficients of interest Use a priori reasoning to identify the most important potential sources of omitted variable bias Consider a baseline specification and some questionable variables Test whether additional questionable variables have nonzero coefficients Provide full disclosure representative tabulations of your results so that others can see the effect of including questionable variables on the coefficients of interest Observe, if your results change after including a questionable control variable 156
19 Misspecification of the functional form of the regression function Definition 6.3: (Functional form misspecification) Functional form misspecification arises when the functional form of the estimated function differs from the (true) functional form of the population regression function. Two aspects of misspecification: If the (true) population regression function is nonlinear, but we estimate a linear regression equation, then the estimator of the coefficients suffer from omitted variable bias If the (true) population regression function is linear, but we estimate a nonlinear regression equation, then we estimate non-existent coefficients 157
20 Solutions to functional form misspecification: Detection of misspecification by using statistical specification tests, for example the Regression Specification Error Test (RESET) (see Econometrics I) Plot the data and the estimated regression function Correct the misspecification by trying alternative functional forms of the regression function 158
21 Measurement error and errors-in-variable bias Definition 6.4: (Errors-in-variable bias) Errors-in-variables bias in the OLS estimator arises when an independent variable is measured imprecisely. The bias depends on the nature of the measurement error and persists even if the sample size is large. Sources of measurement errors: Wrong answer of a respondent to a survey question (e.g. about her/his income) Typographical errors in data collected from computerized administrative records 159
22 Consequence: Consider a regression with a single regressor X i X i is imprecisely measured by X i The true population regression equation is Y i = β 0 + β 1 X i + u i = β 0 + β 1 X i + [ β 1 ( X i X i ) + ui ] = β 0 + β 1 X i + v i where v i = β 1 (X i X i ) + ui The population regression equation in terms of X i has an error term containing the measurement error X i X i If X i X i is correlated with X i, then the regressor X i will be correlated with v i 160
23 Consequence: [continued] Violation of OLS assumption 1 on Slide 18 ˆβ 1 will be biased and inconsistent Classical measurement error model: Suppose the measured value X i equals the unmeasured value X i plus a purely random component w i with expected value 0 and variance σ 2 w Suppose further that Corr(w i, X i ) = 0 and Corr(w i, u i ) = 0 It then follows that (see class for details) plim ˆβ 1 = σ2 X σ 2 X + σ2 w β 1 161
24 Solutions to errors-in-variables bias: Try to obtain an accurate measure of X (if possible) Use instrumental variables (see Section 7) 162
25 Missing data and sample selection We consider three cases: 1. Data are missing completely at random 2. Data are missing based on a regressor 3. Data are missing because of a selection process that is related to Y beyond depending on X (sample selection bias) 163
26 Case #1: When the data are missing completely at random (for reasons unrelated to the values of X or Y ) the effect is to reduce the sample size but not introduce bias Case #2: When the data are missing based on the value of a regressor, the effect also is to reduce the sample size but not introduce bias 164
27 Case #3: If the data are missing because of a selection process that is related to the value of the dependent variable Y beyond depending on the regressors X 1,..., X k then this selection process can introduce correlation between the error term and the regressors Bias in the OLS estimators that persists in large samples Definition 6.5: (Sample selection bias) Sample selection bias arises when a selection process influences the availability of data and that process is related to the dependent variable, beyond depending on the regressors. 165
28 Example: We consider the question as to whether stock mutual funds outperform the market To this end, many studies compare future returns on mutual funds that had high returns over the past year to future returns on other funds and on the market as a whole Some databases include historical data on funds currently available for purchase This approach implies that the most poorly performing funds are omitted from the dataset because they went out of business or were merged into other funds Only the better funds survive to be in the data set (survivorship bias) 166
29 Solutions to sample selection bias: Beyond the scope of this lecture 167
30 Simultaneous causality Definition 6.6: (Simultaneous causality bias) Simultaneous causality bias, also called simultaneous equation bias, arises in a regression of Y on X when, in addition to the causal link of interest from X to Y, there is a causal link from Y to X. Remark: The reverse causality makes X correlated with the error term Bias in the OLS estimators that persists in large samples 168
31 Example: Consider the following two regression equations that hold simultaneously: Y i = β 0 + β 1 X i + u i (6.2) X i = γ 0 + γ 1 Y i + v i (6.3) From Eq. (6.2) it follows that if u i < 0 then Y i decreases If γ 1 > 0, then a low value of Y i leads to a low value of X i in Eq. (6.3) If γ 1 > 0, then Corr(X i, u i ) > 0 in Eq. (6.2) (see class for details) 169
32 Solutions to simultaneous causality bias: Use instrumental variables (see Section 7) 170
33 Sources of inconsistency of OLS standard errors OLS standard errors: Inconsistent standard errors pose a threat to internal validity Even if the OLS estimators are consistent and the sample size is large, inconsistent standard errors will produce invalid hypothesis tests and confidence intervals Main reasons for inconsistent standard errors: Heteroskedasticity of the error terms u i Autocorrelation among the error terms u i 171
34 Remedies: Use heteroskedasticity-consistent standard errors (see Section ) Use heteroskedasticity-autocorrelation consistent (HAC) standard errors (see Section ) 172
35 6.2. Summary Five threats to internal validity: 1. Omitted variables 2. Functional form misspecification 3. Errors-in-variables 4. Sample selection 5. Simultaneous causality 173
36 Remarks: Each of these, if present, result in failure of the first OLS assumption from Slide 18: E(u i X 1i,..., X ki ) = 0 Bias in the OLS estimators that persists in large samples Incorrect calculation of standard errors poses a further threat to internal validity Applying this list of threats to a multiple regression study provides a systematic way to assess the internal validity of that study 174
2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
More information4. Nonlinear regression functions
4. Nonlinear regression functions Up to now: Population regression function was assumed to be linear The slope(s) of the population regression function is (are) constant The effect on Y of a unit-change
More information10. Time series regression and forecasting
10. Time series regression and forecasting Key feature of this section: Analysis of data on a single entity observed at multiple points in time (time series data) Typical research questions: What is the
More information5. Erroneous Selection of Exogenous Variables (Violation of Assumption #A1)
5. Erroneous Selection of Exogenous Variables (Violation of Assumption #A1) Assumption #A1: Our regression model does not lack of any further relevant exogenous variables beyond x 1i, x 2i,..., x Ki and
More informationIntroduction to Econometrics. Assessing Studies Based on Multiple Regression
Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Assessing Studies Based on Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1
More informationAssessing Studies Based on Multiple Regression
Assessing Studies Based on Multiple Regression Outline 1. Internal and External Validity 2. Threats to Internal Validity a. Omitted variable bias b. Functional form misspecification c. Errors-in-variables
More informationCHAPTER 6: SPECIFICATION VARIABLES
Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero
More information7. Integrated Processes
7. Integrated Processes Up to now: Analysis of stationary processes (stationary ARMA(p, q) processes) Problem: Many economic time series exhibit non-stationary patterns over time 226 Example: We consider
More informationReview of Econometrics
Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,
More informationECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors
ECON4150 - Introductory Econometrics Lecture 6: OLS with Multiple Regressors Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 6 Lecture outline 2 Violation of first Least Squares assumption
More informationChapter 9: Assessing Studies Based on Multiple Regression. Copyright 2011 Pearson Addison-Wesley. All rights reserved.
Chapter 9: Assessing Studies Based on Multiple Regression 1-1 9-1 Outline 1. Internal and External Validity 2. Threats to Internal Validity a) Omitted variable bias b) Functional form misspecification
More informationECON Introductory Econometrics. Lecture 17: Experiments
ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.
More information8. Instrumental variables regression
8. Instrumental variables regression Recall: In Section 5 we analyzed five sources of estimation bias arising because the regressor is correlated with the error term Violation of the first OLS assumption
More informationPractice Questions for the Final Exam. Theoretical Part
Brooklyn College Econometrics 7020X Spring 2016 Instructor: G. Koimisis Name: Date: Practice Questions for the Final Exam Theoretical Part 1. Define dummy variable and give two examples. 2. Analyze the
More information13. Time Series Analysis: Asymptotics Weakly Dependent and Random Walk Process. Strict Exogeneity
Outline: Further Issues in Using OLS with Time Series Data 13. Time Series Analysis: Asymptotics Weakly Dependent and Random Walk Process I. Stationary and Weakly Dependent Time Series III. Highly Persistent
More informationLECTURE 11. Introduction to Econometrics. Autocorrelation
LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct
More informationHeteroscedasticity 1
Heteroscedasticity 1 Pierre Nguimkeu BUEC 333 Summer 2011 1 Based on P. Lavergne, Lectures notes Outline Pure Versus Impure Heteroscedasticity Consequences and Detection Remedies Pure Heteroscedasticity
More informationMultiple Regression Analysis. Part III. Multiple Regression Analysis
Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant
More informationThe general linear regression with k explanatory variables is just an extension of the simple regression as follows
3. Multiple Regression Analysis The general linear regression with k explanatory variables is just an extension of the simple regression as follows (1) y i = β 0 + β 1 x i1 + + β k x ik + u i. Because
More information7. Integrated Processes
7. Integrated Processes Up to now: Analysis of stationary processes (stationary ARMA(p, q) processes) Problem: Many economic time series exhibit non-stationary patterns over time 226 Example: We consider
More information3. Linear Regression With a Single Regressor
3. Linear Regression With a Single Regressor Econometrics: (I) Application of statistical methods in empirical research Testing economic theory with real-world data (data analysis) 56 Econometrics: (II)
More informationECONOMETRICS HONOR S EXAM REVIEW SESSION
ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes
More informationAnswers to Problem Set #4
Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2
More informationThe Simple Regression Model. Part II. The Simple Regression Model
Part II The Simple Regression Model As of Sep 22, 2015 Definition 1 The Simple Regression Model Definition Estimation of the model, OLS OLS Statistics Algebraic properties Goodness-of-Fit, the R-square
More informationInternal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.
Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More information11. Simultaneous-Equation Models
11. Simultaneous-Equation Models Up to now: Estimation and inference in single-equation models Now: Modeling and estimation of a system of equations 328 Example: [I] Analysis of the impact of advertisement
More informationECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests
ECON4150 - Introductory Econometrics Lecture 7: OLS with Multiple Regressors Hypotheses tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 7 Lecture outline 2 Hypothesis test for single
More informationModel Specification and Data Problems. Part VIII
Part VIII Model Specification and Data Problems As of Oct 24, 2017 1 Model Specification and Data Problems RESET test Non-nested alternatives Outliers A functional form misspecification generally means
More informationHypothesis Tests and Confidence Intervals in Multiple Regression
Hypothesis Tests and Confidence Intervals in Multiple Regression (SW Chapter 7) Outline 1. Hypothesis tests and confidence intervals for one coefficient. Joint hypothesis tests on multiple coefficients
More informationEconometrics Honor s Exam Review Session. Spring 2012 Eunice Han
Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity
More informationECON Introductory Econometrics. Lecture 13: Internal and external validity
ECON4150 - Introductory Econometrics Lecture 13: Internal and external validity Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 9 Lecture outline 2 Definitions of internal and external
More informationAbout the seasonal effects on the potential liquid consumption
About the seasonal effects on the potential liquid consumption Lucie Ravelojaona Guillaume Perrez Clément Cousin ENAC 14/01/2013 Consumption raw data Figure : Evolution during one year of different family
More information2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0
Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct
More informationWISE International Masters
WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are
More informationTjalling C. Koopmans Research Institute
Tjalling C. Koopmans Research Institute Tjalling C. Koopmans Research Institute Utrecht School of Economics Utrecht University Janskerkhof 12 3512 BL Utrecht The Netherlands telephone +31 30 253 9800 fax
More informationChapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression
Chapter 7 Hypothesis Tests and Confidence Intervals in Multiple Regression Outline 1. Hypothesis tests and confidence intervals for a single coefficie. Joint hypothesis tests on multiple coefficients 3.
More informationBrief Sketch of Solutions: Tutorial 3. 3) unit root tests
Brief Sketch of Solutions: Tutorial 3 3) unit root tests.5.4.4.3.3.2.2.1.1.. -.1 -.1 -.2 -.2 -.3 -.3 -.4 -.4 21 22 23 24 25 26 -.5 21 22 23 24 25 26.8.2.4. -.4 - -.8 - - -.12 21 22 23 24 25 26 -.2 21 22
More informationApplied Econometrics. Applied Econometrics Second edition. Dimitrios Asteriou and Stephen G. Hall
Applied Econometrics Second edition Dimitrios Asteriou and Stephen G. Hall MULTICOLLINEARITY 1. Perfect Multicollinearity 2. Consequences of Perfect Multicollinearity 3. Imperfect Multicollinearity 4.
More informationEcon 427, Spring Problem Set 3 suggested answers (with minor corrections) Ch 6. Problems and Complements:
Econ 427, Spring 2010 Problem Set 3 suggested answers (with minor corrections) Ch 6. Problems and Complements: 1. (page 132) In each case, the idea is to write these out in general form (without the lag
More informationHeteroskedasticity. Part VII. Heteroskedasticity
Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least
More informationEconomics 471: Econometrics Department of Economics, Finance and Legal Studies University of Alabama
Economics 471: Econometrics Department of Economics, Finance and Legal Studies University of Alabama Course Packet The purpose of this packet is to show you one particular dataset and how it is used in
More informationChapter 6: Linear Regression With Multiple Regressors
Chapter 6: Linear Regression With Multiple Regressors 1-1 Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution
More informationEastern Mediterranean University Department of Economics ECON 503: ECONOMETRICS I. M. Balcilar. Midterm Exam Fall 2007, 11 December 2007.
Eastern Mediterranean University Department of Economics ECON 503: ECONOMETRICS I M. Balcilar Midterm Exam Fall 2007, 11 December 2007 Duration: 120 minutes Questions Q1. In order to estimate the demand
More informationEcon 1123: Section 5. Review. Internal Validity. Panel Data. Clustered SE. STATA help for Problem Set 5. Econ 1123: Section 5.
Outline 1 Elena Llaudet 2 3 4 October 6, 2010 5 based on Common Mistakes on P. Set 4 lnftmpop = -.72-2.84 higdppc -.25 lackpf +.65 higdppc * lackpf 2 lnftmpop = β 0 + β 1 higdppc + β 2 lackpf + β 3 lackpf
More informationMultiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =
Economics 130 Lecture 6 Midterm Review Next Steps for the Class Multiple Regression Review & Issues Model Specification Issues Launching the Projects!!!!! Midterm results: AVG = 26.5 (88%) A = 27+ B =
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 7 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 68 Outline of Lecture 7 1 Empirical example: Italian labor force
More informationFinal Exam Financial Data Analysis at the University of Freiburg (Winter Semester 2008/2009) Friday, November 14, 2008,
Professor Dr. Roman Liesenfeld Final Exam Financial Data Analysis at the University of Freiburg (Winter Semester 2008/2009) Friday, November 14, 2008, 10.00 11.30am 1 Part 1 (38 Points) Consider the following
More informationPractical Econometrics. for. Finance and Economics. (Econometrics 2)
Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics
More informationLinear Regression with one Regressor
1 Linear Regression with one Regressor Covering Chapters 4.1 and 4.2. We ve seen the California test score data before. Now we will try to estimate the marginal effect of STR on SCORE. To motivate these
More informationLinear Regression with Multiple Regressors
Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution
More informationRegression with a Single Regressor: Hypothesis Tests and Confidence Intervals
Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression
More informationIntroduction to Econometrics. Multiple Regression (2016/2017)
Introduction to Econometrics STAT-S-301 Multiple Regression (016/017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 OLS estimate of the TS/STR relation: OLS estimate of the Test Score/STR relation:
More informationLinear Regression with Multiple Regressors
Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution
More informationExercises (in progress) Applied Econometrics Part 1
Exercises (in progress) Applied Econometrics 2016-2017 Part 1 1. De ne the concept of unbiased estimator. 2. Explain what it is a classic linear regression model and which are its distinctive features.
More informationThe GARCH Analysis of YU EBAO Annual Yields Weiwei Guo1,a
2nd Workshop on Advanced Research and Technology in Industry Applications (WARTIA 2016) The GARCH Analysis of YU EBAO Annual Yields Weiwei Guo1,a 1 Longdong University,Qingyang,Gansu province,745000 a
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3
More informationIntroduction to Econometrics
Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle
More informationIntroduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017
Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent
More informationECON 366: ECONOMETRICS II. SPRING TERM 2005: LAB EXERCISE #10 Nonspherical Errors Continued. Brief Suggested Solutions
DEPARTMENT OF ECONOMICS UNIVERSITY OF VICTORIA ECON 366: ECONOMETRICS II SPRING TERM 2005: LAB EXERCISE #10 Nonspherical Errors Continued Brief Suggested Solutions 1. In Lab 8 we considered the following
More informationThe Multiple Regression Model Estimation
Lesson 5 The Multiple Regression Model Estimation Pilar González and Susan Orbe Dpt Applied Econometrics III (Econometrics and Statistics) Pilar González and Susan Orbe OCW 2014 Lesson 5 Regression model:
More informationEconometrics I Lecture 3: The Simple Linear Regression Model
Econometrics I Lecture 3: The Simple Linear Regression Model Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 32 Outline Introduction Estimating
More informationIntroduction to Econometrics. Multiple Regression
Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 OLS estimate of the TS/STR
More informationLECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity
LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists
More informationTopic 4: Model Specifications
Topic 4: Model Specifications Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Functional Forms 1.1 Redefining Variables Change the unit of measurement of the variables will
More informationStatistical Inference. Part IV. Statistical Inference
Part IV Statistical Inference As of Oct 5, 2017 Sampling Distributions of the OLS Estimator 1 Statistical Inference Sampling Distributions of the OLS Estimator Testing Against One-Sided Alternatives Two-Sided
More informationARDL Cointegration Tests for Beginner
ARDL Cointegration Tests for Beginner Tuck Cheong TANG Department of Economics, Faculty of Economics & Administration University of Malaya Email: tangtuckcheong@um.edu.my DURATION: 3 HOURS On completing
More informationECON Introductory Econometrics. Lecture 16: Instrumental variables
ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental
More informationEconometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in
More informationEconometrics Problem Set 11
Econometrics Problem Set WISE, Xiamen University Spring 207 Conceptual Questions. (SW 2.) This question refers to the panel data regressions summarized in the following table: Dependent variable: ln(q
More informationLecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)
Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model
More informationIntroduction to Econometrics
Introduction to Econometrics STAT-S-301 Panel Data (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Regression with Panel Data A panel dataset contains observations on multiple entities
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level
More informationHypothesis Tests and Confidence Intervals. in Multiple Regression
ECON4135, LN6 Hypothesis Tests and Confidence Intervals Outline 1. Why multipple regression? in Multiple Regression (SW Chapter 7) 2. Simpson s paradox (omitted variables bias) 3. Hypothesis tests and
More informationFrequency Forecasting using Time Series ARIMA model
Frequency Forecasting using Time Series ARIMA model Manish Kumar Tikariha DGM(O) NSPCL Bhilai Abstract In view of stringent regulatory stance and recent tariff guidelines, Deviation Settlement mechanism
More informationCORRELATION, ASSOCIATION, CAUSATION, AND GRANGER CAUSATION IN ACCOUNTING RESEARCH
CORRELATION, ASSOCIATION, CAUSATION, AND GRANGER CAUSATION IN ACCOUNTING RESEARCH Alireza Dorestani, Northeastern Illinois University Sara Aliabadi, Northeastern Illinois University ABSTRACT In this paper
More informationRomanian Economic and Business Review Vol. 3, No. 3 THE EVOLUTION OF SNP PETROM STOCK LIST - STUDY THROUGH AUTOREGRESSIVE MODELS
THE EVOLUTION OF SNP PETROM STOCK LIST - STUDY THROUGH AUTOREGRESSIVE MODELS Marian Zaharia, Ioana Zaheu, and Elena Roxana Stan Abstract Stock exchange market is one of the most dynamic and unpredictable
More informationBrief Suggested Solutions
DEPARTMENT OF ECONOMICS UNIVERSITY OF VICTORIA ECONOMICS 366: ECONOMETRICS II SPRING TERM 5: ASSIGNMENT TWO Brief Suggested Solutions Question One: Consider the classical T-observation, K-regressor linear
More informationProblem set 1: answers. April 6, 2018
Problem set 1: answers April 6, 2018 1 1 Introduction to answers This document provides the answers to problem set 1. If any further clarification is required I may produce some videos where I go through
More informationOutline. 11. Time Series Analysis. Basic Regression. Differences between Time Series and Cross Section
Outline I. The Nature of Time Series Data 11. Time Series Analysis II. Examples of Time Series Models IV. Functional Form, Dummy Variables, and Index Basic Regression Numbers Read Wooldridge (2013), Chapter
More informationOLS Assumptions Violation and Its Treatment: An Empirical Test of Gross Domestic Product Relationship with Exchange Rate, Inflation and Interest Rate
J. Appl. Environ. Biol. Sci., 6(5S)43-54, 2016 2016, TextRoad Publication ISSN: 2090-4274 Journal of Applied Environmental and Biological Sciences www.textroad.com OLS Assumptions Violation and Its Treatment:
More informationLecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables
Lecture 8. Using the CLR Model Relation between patent applications and R&D spending Variables PATENTS = No. of patents (in 000) filed RDEP = Expenditure on research&development (in billions of 99 $) The
More information2 Prediction and Analysis of Variance
2 Prediction and Analysis of Variance Reading: Chapters and 2 of Kennedy A Guide to Econometrics Achen, Christopher H. Interpreting and Using Regression (London: Sage, 982). Chapter 4 of Andy Field, Discovering
More informationEconometrics 1. Lecture 8: Linear Regression (2) 黄嘉平
Econometrics 1 Lecture 8: Linear Regression (2) 黄嘉平 中国经济特区研究中 心讲师 办公室 : 文科楼 1726 E-mail: huangjp@szu.edu.cn Tel: (0755) 2695 0548 Office hour: Mon./Tue. 13:00-14:00 The linear regression model The linear
More informationLecture 8. Using the CLR Model
Lecture 8. Using the CLR Model Example of regression analysis. Relation between patent applications and R&D spending Variables PATENTS = No. of patents (in 1000) filed RDEXP = Expenditure on research&development
More informationEcon Spring 2016 Section 9
Econ 140 - Spring 2016 Section 9 GSI: Fenella Carpena March 31, 2016 1 Assessing Studies Based on Multiple Regression 1.1 Internal Validity Threat to Examples/Cases Internal Validity OVB Example: wages
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationFinQuiz Notes
Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable
More informationEmpirical Economic Research, Part II
Based on the text book by Ramanathan: Introductory Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna December 7, 2011 Outline Introduction
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 2: Simple Regression Egypt Scholars Economic Society Happy Eid Eid present! enter classroom at http://b.socrative.com/login/student/ room name c28efb78 Outline
More information1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i
1/34 Outline Basic Econometrics in Transportation Model Specification How does one go about finding the correct model? What are the consequences of specification errors? How does one detect specification
More informationFöreläsning /31
1/31 Föreläsning 10 090420 Chapter 13 Econometric Modeling: Model Speci cation and Diagnostic testing 2/31 Types of speci cation errors Consider the following models: Y i = β 1 + β 2 X i + β 3 X 2 i +
More informationNovember 9th-12th 2010, Port of Spain, Trinidad
By: Damie Sinanan and Dr. Roger Hosein 42nd Annual Monetary Studies Conference Financial Stability, Crisis Preparedness and Risk Management in the Caribbean November 9th-12th 2010, Port of Spain, Trinidad
More informationIntroduction to Econometrics. Regression with Panel Data
Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Regression with Panel Data Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 Regression with Panel
More informationApplied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables
Applied Statistics and Econometrics Giuseppe Ragusa Lecture 15: Instrumental Variables Outline Introduction Endogeneity and Exogeneity Valid Instruments TSLS Testing Validity 2 Instrumental Variables Regression
More informationContest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.
Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationIntroductory Econometrics
Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna November 23, 2013 Outline Introduction
More informationIris Wang.
Chapter 10: Multicollinearity Iris Wang iris.wang@kau.se Econometric problems Multicollinearity What does it mean? A high degree of correlation amongst the explanatory variables What are its consequences?
More information