An explanation of Two Stage Least Squares
|
|
- Eunice Gardner
- 5 years ago
- Views:
Transcription
1 Introduction Introduction to Econometrics An explanation of Two Stage Least Squares When we get an endogenous variable we know that OLS estimator will be inconsistent. In addition OLS regressors will also be biased. Two Stage Least Squares is a general solution to the problem of inconsistent estimators. When we have one endogenous variable and one excluded exogenous variable the model is exactly identified which allows us to use it as a direct instrument. If there is more than one instrument available we use the best linear combination of all the exogenous variables including the instruments to construct a variable which we can use in place of the endogenous variable. The following demonstrates the Two Stage Least Squares method. The following two section both explain the methodology of 2SLS but I have used two separate ways, and slightly different notation, to present 2SLS as it differs across the spectrum of textbooks and online resources. The Two Stage Least Squares (General Notation) Whenever we have more than one instrument we say that the model is over identified. Let z i1,, z im where E[z im u i ] = 0. This means that each instrument is exogenous. There are many combinations of the instruments to use but the 2SLS is the most efficient IV estimator. The Vector of all exogenous variables is z i = [x i1,.., x 1, z i1, z im ] The linear combination of z i most correlated with x is given by the linear projection of x on z i. This is known as the reduced form model. x i1 = π 0 + π 1 x i1 + + π k 1 x ik 1 + π K z i1 + + π K+M 1 z im + v i (i) Where v i is uncorrelated with each of the right hand side variables and has zero mean. By definition any linear combination of z i is uncorrelated with ε i we have, E(v i ) = 0, Cov(v i, x i1 ) = 0,, Cov(v i, z im ) = 0 therefore v i drops out of the equation. x = π 0 + π 1 x i1 + + π k 1 x ik 1 + π K z i1 + + π K+M 1 z im x is often interpreted as the part of x that is uncorrelated with ε i.as long as we make the assumption that there are no exact linear dependencies amount the exogenous variables, we can consistently estimate the parameters in equation (i). We have to estimate x as it is not observed and therefore not feasible. Stage 1: Obtain fitted values of x from the reduced form regression x = π 0 + π 1 x i1 + + π k 1 x ik 1 + π K z i1 + + π K+M 1 z im 1
2 Stage 2: Run the OLS regression y i = β 0 + β 1 x i1 + + β (k 1) x i(k 1) + β k x + ε i Where x i = [x i1,.., x 1, x ] we can define the 2SLS estimator as β IV(2SLS) = x i x 1 i x i y i = (X X) 1 X y The 2SLS estimator turns out to be an OLS estimator. Note that X = Z(Z Z) 1 Z X = P Z X where tis projection matrix is idempotent and symmetric which means that P Z P Z = P Z. Therefore we have X X = X P Z X=(P Z X) P Z X. From this it is clear to see that X X = X X. Now we can show that the 2SLS estimator uses linear combination of instruments, x can be written as, β IV(2SLS) = x i x 1 i x i y i = (X X ) 1 X y This is the standard formula where we regress y on X. It is important to note that β IV(2SLS) = β IV are identical when there is only one instrument for the endogenous variable x. Two Stage Least Squares (Alternative Notation) The Structural Model y i = β 0 + β 1 x i1 + + β (k 1) x i(k 1) + β k x + ε i (1) This is our original linear model where the endogenous variable is denoted as x. Reduced Form Model x = β 0 + β 1 x i1 + + β (k 1) x i(k 1) + δ 1 z i1 + + δ m z im + v i (2) In this model the z i s represent the exogenous instrumental variables. 1 The best IV for x is the linear combination of the exogenous variables which we call x. By definition any linear combination of z i is uncorrelated with ε i we have, E(v i ) = 0, Cov(v i, x i1 ) = 0,, Cov(v i, z im ) = 0 therefore v i drops out of the equation x = β 0 + β 1 x i1 + + β (k 1) x i(k 1) + δ 1 z i1 + + δ m z im Aside: Although correct, it is not enough to say that z i1 is correlated with x. It is more precise to say that z i1 is partially correlated with x since there are other variables in equation (2). 1 For a basic understanding of the role of instrumental variables you should consult the PDF Instrumental Variables 2
3 First Stage: This stage is essentially just running the reduced form regression as shown in (3). What we are actually doing is regressing the endogenous problem variable on all the exogenous variables. This includes all the x s from the original structural equation which are exogenous plus all the instruments z's which are obviously exogenous because they are from outside the model. 1st Stage regression (Reduced Form Regression) x = β 0 + β 1x i1 + + β (k 1) x i(k 1) + δ 1z i1 + + δ mz im (3) Second Stage: Now we have an estimate for the endogenous variable x which is the best linear combination of exogenous variables x. This is an exogenous variable that we can use as a replacement for x which is the endogenous variable from the structural model (1). This is why Model (4) below uses x i in the equation. The 2SLS fist purges x of its correlation with ε i before doing the OLS regression we can show the as x = x + v i the composite error will be ε i + β k v i y i = β 0 + β 1 x i1 + + β (k 1) x i(k 1) + β k x + ε i + β k v i Where ε i + β k v i has a zero mean and is uncorrelated with all the right hand side variables. 2SLS Regression y i = β 0 + β 1 x i1 + + β (k 1) x i(k 1) + β k x + ε i (4) Therefore we have β IV(2SLS) = ( x i x i ) 1 x i y i In this final equation we only have exogenous variables. Therefore two stage least squares has prevented inconsistency coming from the fact that we have an endogenous explanatory variable. It is important to notice that although two stage least squares will be consistent it will never be unbiased. Example 1 Suppose we are interested in estimating wages and are using the following model (5). The explanatory variables are what we think effects a person wages. However we know from the OLS assumptions that these variables cannot be correlated with the error term. The error term contains everything that is unobserved and contains variables le ability. (5) is the model we wish to estimate where error represents the population error whist in (6) u i is the residual error from the sample we observe. log (wage) i = β 0 + β 1 exper i + β 2 exper i 2 + β 3 age i + β 4 region i + β 5 educ i + error i (5) log (wage) i = β 0 + β 1 exper i + β 2 exper i 2 + β 3 age i + β 4 region i + β 5 educ i + u i (6) By inspecting the structure of the error terms we can notice that ability is an unobserved variable that is related to the education variable. For instance those with more ability are 3
4 expected to have on average more years of education than those with less ability. We will assume that ability is not correlated with the experience variable although there are examples where this could be the case. u i = ability i + error (educ i u i ) 0 More specifically we could write the last equation in the following two equations to explicitly demonstrate the origin of this endogeneity. (educ i ability i ) 0, and (educ i error i ) 0 If education is correlated to the residual u i through the ability variable it is true that we will have a biased and inconsistent estimator for not only β 5 but for all the variables. Therefore we need to use an instrument or a set of instruments to solve for the endogeneity problem. At this stage we need to find at least one instrument for the one endogenous variable. When we have more than one excluded exogenous variable we use two stage least squares to create the best linear combination of instruments to rule out the endogeneity of education. We could have the variables mother s education and father s education. Consequently if the two instruments satisfy the requirements of being uncorrelated with the error but correlated with education then we can use them. It is worth noting that there are methods for testing for endogeneity and testing over identifying restrictions however we shall not look at them here. Stage 1 educ i = β 0 + β 1exper i + β 2exper 2 i + β 3age i + β 4region i + δ 1motheduc i + δ 2fatheduc i + e i Now we have an estimate of education that is constructed from the exogenous variables in the structural equation plus the outside exogenous variables. The β s are acting as instruments for themselves. Stage 2 log (wage) i = β 0 + β 1 exper i + β 2 exper 2 i + β 3 age i + β 4 region i + β 5 educ i + u i Example 2 The following regression attempts to model income based on cigarettes consumed, education and the age of a person. Log(income) = β 0 + β 1 cigs + β 2 educ + β 3 age + β 4 age 2 + ε 1 (i) How do you interpret the coefficient β 1? 4
5 . regress lincome cigs educ age agesq F( 4, 802) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = lincome Coef. Std. Err. t P> t [95% Conf. Interval] cigs educ age agesq _cons more cigarette smoked per day results in a % increase in income. cigs is the endogenous variable because it is based on income amongst other factors. We would probably expect consumption of more cigarettes to decrease income because of the health reasons but that is not the case here. We expect this result is caused because of the endogeneity To reflect the fact that cigarette consumption might be jointly determined with income, a demand for cigarettes equation is cigs = γ 0 + γ 1 log(income) + γ 2 edcu + γ 3 age + γ 4 age 2 + γ 5 log(cigpric) + γ 6 resaurn + ε 2 where cigpric is the price of a pack of cigarettes (in cents), and restaurn is a binary variable equal to one if the person lives in a state with restaurant smoking restrictions. (ii) Assuming these are exogenous to the individual, what signs would you expect for 5 and 6? We would expect γ 5 to have a negative sign as the higher the price of cigarettes the less you will consume. However as smoking is addictive the demand would most lely be inelastic so we don t expect a large change. We would expect γ 6 to be negatively related to cigarette consumption. If the restaurants do not allow smoking we would expect less consumption. Although I doubt the magnitude would be too large. The regression below confirms expectations.. regress cigs lincome educ age agesq lcigpric restaurn F( 6, 800) = 7.42 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = cigs Coef. Std. Err. t P> t [95% Conf. Interval] lincome educ age agesq lcigpric restaurn _cons
6 (iii) Under what assumption is the income equation identified? For identification there needs to be an equal number of equations and endogenous variables. There needs to be at least one excluded exogenous variable for every endogenous variable. In the cigarette consumption equation we have the log(cigpric) and resaurn which we assume are both exogenous. We can use these to identify the income equation. We have 2 possible IV s for the endogenous variable cig so we are over identified and will use the OLS combination of the two as long as they are strong enough instruments with respect to correlation. This means we should always look at the reduced form first stage regression. SEM can also suffer from weak instruments thus vigilance is warranted. (iv) Estimate the income equation by OLS and discuss the estimate of β 1.. regress lincome cigs educ age agesq F( 4, 802) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = lincome Coef. Std. Err. t P> t [95% Conf. Interval] cigs educ age agesq _cons β 1 is the coefficient on cigarettes and shows that given a 1 more cigarette smoked per day results in a % increase in income. We would expect to have a negative sign as the number of cigarettes increase we would expect to see a decline in health and income due to less hours worked. This is not the case here because of the endogeneity problem where cigs is based on other variables that are not in the income equation and cause bias and inconsistent estimators. (v) Estimate the reduced form for cigs. Are log(cigpric) and restaurn significant?. regress cigs lincome educ age agesq lcigpric restaurn F( 6, 800) = 7.42 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = cigs Coef. Std. Err. t P> t [95% Conf. Interval] lincome educ age agesq lcigpric restaurn _cons
7 log(cigpric) is not significant at the with a p-value of which suggests it is not a strong instrument for cigs. This intuitively makes sense because cigarettes are highly addictive we don t expect to see a large effect on consumption. resaurn is significant and has a large t-stat of so we can assume that this is an good instrument to use for cigs in the income equation. I would not expect the banning of cigarettes in public restaurants to have such a large effect on cigarette consumption however the results suggest otherwise. (vi) Now, estimate the income equation by 2SLS. Discuss how the estimate of β 1 compares with the OLS estimate.. ivregress 2sls lincome educ age agesq (cigs = lcigpric restaurn) Instrumental variables (2SLS) regression Number of obs = 807 Wald chi2(4) = Prob > chi2 = R-squared =. Root MSE = lincome Coef. Std. Err. z P> z [95% Conf. Interval] cigs educ age agesq _cons Instrumented: cigs Instruments: educ age agesq lcigpric restaurn β 1(2SLS) = which is now negative and has a higher P-Value than the original β 1 = This is the expected relationship between cigarette consumption and income. It shows that the OLS estimate is biased upwards. This is the original equation we want to estimate with the education replaced by its estimate which uses the best linear combination of exogenous variables. This regression now has no endogeneity problems because educ i is now an exogenous term and unrelated to the error. 7
Lecture 8: Instrumental Variables Estimation
Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano
More informationWarwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation
Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory
More informationInstrumental Variables, Simultaneous and Systems of Equations
Chapter 6 Instrumental Variables, Simultaneous and Systems of Equations 61 Instrumental variables In the linear regression model y i = x iβ + ε i (61) we have been assuming that bf x i and ε i are uncorrelated
More informationExercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation
Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation ECONOMETRICS I. UC3M 1. [W 15.1] Consider a simple model to estimate the e ect of personal computer (P C) ownership on the
More informationSimultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser
Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor
More informationProblem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]
Problem Set #3-Key Sonoma State University Economics 317- Introduction to Econometrics Dr. Cuellar 1. Use the data set Wage1.dta to answer the following questions. a. For the regression model Wage i =
More informationEconometrics. 8) Instrumental variables
30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates
More informationCourse Econometrics I
Course Econometrics I 4. Heteroskedasticity Martin Halla Johannes Kepler University of Linz Department of Economics Last update: May 6, 2014 Martin Halla CS Econometrics I 4 1/31 Our agenda for today Consequences
More information4 Instrumental Variables Single endogenous variable One continuous instrument. 2
Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................
More information5.2. a. Unobserved factors that tend to make an individual healthier also tend
SOLUTIONS TO CHAPTER 5 PROBLEMS ^ ^ ^ ^ 5.1. Define x _ (z,y ) and x _ v, and let B _ (B,r ) be OLS estimator 1 1 1 1 ^ ^ ^ ^ from (5.5), where B = (D,a ). Using the hint, B can also be obtained by 1 1
More information4 Instrumental Variables Single endogenous variable One continuous instrument. 2
Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................
More informationMeasurement Error. Often a data set will contain imperfect measures of the data we would ideally like.
Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and
More informationEcmt 675: Econometrics I
Ecmt 675: Econometrics I Assignment 7 Problem 1 a. reg hours lwage educ age kidslt6 kidsge6 nwifeinc, r Linear regression Number of obs = 428 F( 6, 421) = 3.93 Prob > F = 0.0008 R-squared = 0.0670 Root
More informationHandout 11: Measurement Error
Handout 11: Measurement Error In which you learn to recognise the consequences for OLS estimation whenever some of the variables you use are not measured as accurately as you might expect. A (potential)
More informationECON Introductory Econometrics. Lecture 17: Experiments
ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.
More informationProblem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics
Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics C1.1 Use the data set Wage1.dta to answer the following questions. Estimate regression equation wage =
More informationECON Introductory Econometrics. Lecture 16: Instrumental variables
ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental
More informationPractice exam questions
Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.
More informationLecture#12. Instrumental variables regression Causal parameters III
Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function
More informationEconometrics Midterm Examination Answers
Econometrics Midterm Examination Answers March 4, 204. Question (35 points) Answer the following short questions. (i) De ne what is an unbiased estimator. Show that X is an unbiased estimator for E(X i
More informationHandout 12. Endogeneity & Simultaneous Equation Models
Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to
More informationECON3150/4150 Spring 2016
ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and
More informationProblem Set 10: Panel Data
Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005
More informationLab 07 Introduction to Econometrics
Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand
More informationDynamic Panel Data Models
Models Amjad Naveed, Nora Prean, Alexander Rabas 15th June 2011 Motivation Many economic issues are dynamic by nature. These dynamic relationships are characterized by the presence of a lagged dependent
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares
Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit
More informationProblem Set # 1. Master in Business and Quantitative Methods
Problem Set # 1 Master in Business and Quantitative Methods Contents 0.1 Problems on endogeneity of the regressors........... 2 0.2 Lab exercises on endogeneity of the regressors......... 4 1 0.1 Problems
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More informationAnswer all questions from part I. Answer two question from part II.a, and one question from part II.b.
B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More informationInstrumental Variable Regression
Topic 6 Instrumental Variable Regression ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà Outline of this topic Randomized Experiments, natural experiments and causation Instrumental variables:
More informationQuantitative Methods Final Exam (2017/1)
Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationFixed and Random Effects Models: Vartanian, SW 683
: Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:
More informationDealing With and Understanding Endogeneity
Dealing With and Understanding Endogeneity Enrique Pinzón StataCorp LP October 20, 2016 Barcelona (StataCorp LP) October 20, 2016 Barcelona 1 / 59 Importance of Endogeneity Endogeneity occurs when a variable,
More informationMediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013
Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 In Chap. 11 of Statistical Analysis of Management Data (Gatignon, 2014), tests of mediation are
More informationLecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)
Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model
More informationEconomics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 2 Fall 2016 Instructor: Martin Farnham
Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 2 Fall 2016 Instructor: Martin Farnham Last name (family name): First name (given name): Student ID
More informationPractice 2SLS with Artificial Data Part 1
Practice 2SLS with Artificial Data Part 1 Yona Rubinstein July 2016 Yona Rubinstein (LSE) Practice 2SLS with Artificial Data Part 1 07/16 1 / 16 Practice with Artificial Data In this note we use artificial
More informationExercise sheet 6 Models with endogenous explanatory variables
Exercise sheet 6 Models with endogenous explanatory variables Note: Some of the exercises include estimations and references to the data files. Use these to compare them to the results you obtained with
More informationMultiple Regression Analysis: Estimation. Simple linear regression model: an intercept and one explanatory variable (regressor)
1 Multiple Regression Analysis: Estimation Simple linear regression model: an intercept and one explanatory variable (regressor) Y i = β 0 + β 1 X i + u i, i = 1,2,, n Multiple linear regression model:
More informationBasic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler
Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as
More informationExercices for Applied Econometrics A
QEM F. Gardes-C. Starzec-M.A. Diaye Exercices for Applied Econometrics A I. Exercice: The panel of households expenditures in Poland, for years 1997 to 2000, gives the following statistics for the whole
More informationSpatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE
Spatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE Intruduction Spatial regression models are usually intended to estimate parameters related to the interaction of
More informationUniversity of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points
EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.
More informationApplied Econometrics (MSc.) Lecture 3 Instrumental Variables
Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.
More informationSpecification Error: Omitted and Extraneous Variables
Specification Error: Omitted and Extraneous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 5, 05 Omitted variable bias. Suppose that the correct
More informationApplied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables
Applied Statistics and Econometrics Giuseppe Ragusa Lecture 15: Instrumental Variables Outline Introduction Endogeneity and Exogeneity Valid Instruments TSLS Testing Validity 2 Instrumental Variables Regression
More informationGraduate Econometrics Lecture 4: Heteroskedasticity
Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model
More informationBrief Suggested Solutions
DEPARTMENT OF ECONOMICS UNIVERSITY OF VICTORIA ECONOMICS 366: ECONOMETRICS II SPRING TERM 5: ASSIGNMENT TWO Brief Suggested Solutions Question One: Consider the classical T-observation, K-regressor linear
More informationECO375 Tutorial 8 Instrumental Variables
ECO375 Tutorial 8 Instrumental Variables Matt Tudball University of Toronto Mississauga November 16, 2017 Matt Tudball (University of Toronto) ECO375H5 November 16, 2017 1 / 22 Review: Endogeneity Instrumental
More informationDealing With Endogeneity
Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics
More informationEconometrics Homework 1
Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z
More information2. (3.5) (iii) Simply drop one of the independent variables, say leisure: GP A = β 0 + β 1 study + β 2 sleep + β 3 work + u.
BOSTON COLLEGE Department of Economics EC 228 Econometrics, Prof. Baum, Ms. Yu, Fall 2003 Problem Set 3 Solutions Problem sets should be your own work. You may work together with classmates, but if you
More informationProblem set - Selection and Diff-in-Diff
Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into
More information14.32 Final : Spring 2001
14.32 Final : Spring 2001 Please read the entire exam before you begin. You have 3 hours. No books or notes should be used. Calculators are allowed. There are 105 points. Good luck! A. True/False/Sometimes
More informationECON3150/4150 Spring 2016
ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression
More informationECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013
ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013 Instructions: Answer all six (6) questions. Point totals for each question are given in parentheses. The parts within
More informationCase of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1
Mediation Analysis: OLS vs. SUR vs. ISUR vs. 3SLS vs. SEM Note by Hubert Gatignon July 7, 2013, updated November 15, 2013, April 11, 2014, May 21, 2016 and August 10, 2016 In Chap. 11 of Statistical Analysis
More informationStatistical Inference with Regression Analysis
Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing
More informationLecture 4: Multivariate Regression, Part 2
Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above
More informationЭконометрика, , 4 модуль Семинар Для Группы Э_Б2015_Э_3 Семинарист О.А.Демидова
Эконометрика, 2017-2018, 4 модуль Семинар 3 160418 Для Группы Э_Б2015_Э_3 Семинарист ОАДемидова * Stata program * copyright C 2010 by A Colin Cameron and Pravin K Trivedi * used for "Microeconometrics
More informationProblem Set 1 ANSWERS
Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one
More informationEconomics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Fall 2016 Instructor: Martin Farnham
Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Fall 2016 Instructor: Martin Farnham Last name (family name): First name (given name):
More informationSIMULTANEOUS EQUATION MODEL
SIMULTANEOUS EQUATION MODEL ONE Equation Model (revisited) Characteristics: One dependent variable (Y): as a regressand One ore more independent variables (X): as regressors One way causality relationship:
More informationESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.
More informationNonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015
Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 This lecture borrows heavily from Duncan s Introduction to Structural
More informationEconometrics II Censoring & Truncation. May 5, 2011
Econometrics II Censoring & Truncation Måns Söderbom May 5, 2011 1 Censored and Truncated Models Recall that a corner solution is an actual economic outcome, e.g. zero expenditure on health by a household
More informationMultiple Regression Analysis
Multiple Regression Analysis y = 0 + 1 x 1 + x +... k x k + u 6. Heteroskedasticity What is Heteroskedasticity?! Recall the assumption of homoskedasticity implied that conditional on the explanatory variables,
More informationECON Introductory Econometrics. Lecture 13: Internal and external validity
ECON4150 - Introductory Econometrics Lecture 13: Internal and external validity Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 9 Lecture outline 2 Definitions of internal and external
More informationChapter 14. Simultaneous Equations Models Introduction
Chapter 14 Simultaneous Equations Models 14.1 Introduction Simultaneous equations models differ from those we have considered in previous chapters because in each model there are two or more dependent
More informationApplied Health Economics (for B.Sc.)
Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative
More informationAbility Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006
Ability Bias, Errors in Variables and Sibling Methods James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006 1 1 Ability Bias Consider the model: log = 0 + 1 + where =income, = schooling,
More informationMultiple Linear Regression CIVL 7012/8012
Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for
More information1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE
1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +
More informationECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests
ECON4150 - Introductory Econometrics Lecture 5: OLS with One Regressor: Hypothesis Tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 5 Lecture outline 2 Testing Hypotheses about one
More informationECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors
ECON4150 - Introductory Econometrics Lecture 6: OLS with Multiple Regressors Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 6 Lecture outline 2 Violation of first Least Squares assumption
More informationIntroduction to Econometrics. Heteroskedasticity
Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory
More informationTopics in Applied Econometrics and Development - Spring 2014
Topic 2: Topics in Applied Econometrics and Development - Spring 2014 Single-Equation Linear Model The population model is linear in its parameters: y = β 0 + β 1 x 1 + β 2 x 2 +... + β K x K + u - y,
More informationEconometrics II. Lecture 4: Instrumental Variables Part I
Econometrics II Lecture 4: Instrumental Variables Part I Måns Söderbom 12 April 2011 mans.soderbom@economics.gu.se. www.economics.gu.se/soderbom. www.soderbom.net 1. Introduction Recall from lecture 3
More informationEssential of Simple regression
Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship
More informationExam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h.
Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. This is an open book examination where all printed and written resources, in addition to a calculator, are allowed. If you are
More informationRegression #8: Loose Ends
Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch
More informationEconometrics. 9) Heteroscedasticity and autocorrelation
30C00200 Econometrics 9) Heteroscedasticity and autocorrelation Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Heteroscedasticity Possible causes Testing for
More informationControl Function and Related Methods: Nonlinear Models
Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear
More informationTHE MULTIVARIATE LINEAR REGRESSION MODEL
THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus
More informationQuestion 1 [17 points]: (ch 11)
Question 1 [17 points]: (ch 11) A study analyzed the probability that Major League Baseball (MLB) players "survive" for another season, or, in other words, play one more season. They studied a model of
More informationGMM Estimation in Stata
GMM Estimation in Stata Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets 1 Outline Motivation 1 Motivation 2 3 4 2 Motivation 3 Stata and
More informationGreene, Econometric Analysis (7th ed, 2012)
EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 2 3: Classical Linear Regression The classical linear regression model is the single most useful tool in econometrics.
More informationRockefeller College University at Albany
Rockefeller College University at Albany PAD 705 Handout: Simultaneous quations and Two-Stage Least Squares So far, we have studied examples where the causal relationship is quite clear: the value of the
More informationUNIVERSITY OF WARWICK. Summer Examinations 2015/16. Econometrics 1
UNIVERSITY OF WARWICK Summer Examinations 2015/16 Econometrics 1 Time Allowed: 3 Hours, plus 15 minutes reading time during which notes may be made (on the question paper) BUT NO ANSWERS MAY BE BEGUN.
More informationEcon 836 Final Exam. 2 w N 2 u N 2. 2 v N
1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.
More informationAutocorrelation. Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time
Autocorrelation Given the model Y t = b 0 + b 1 X t + u t Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time This could be caused
More informationSimultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations
Simultaneous Equation Models. Introduction: basic definitions 2. Consequences of ignoring simultaneity 3. The identification problem 4. Estimation of simultaneous equation models 5. Example: IS LM model
More informationLecture 4: Multivariate Regression, Part 2
Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above
More informationECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48
ECON2228 Notes 10 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 10 2014 2015 1 / 48 Serial correlation and heteroskedasticity in time series regressions Chapter 12:
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationsociology 362 regression
sociology 36 regression Regression is a means of studying how the conditional distribution of a response variable (say, Y) varies for different values of one or more independent explanatory variables (say,
More informationEmpirical Application of Simple Regression (Chapter 2)
Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget
More information