5.2. a. Unobserved factors that tend to make an individual healthier also tend

Size: px
Start display at page:

Download "5.2. a. Unobserved factors that tend to make an individual healthier also tend"

Transcription

1 SOLUTIONS TO CHAPTER 5 PROBLEMS ^ ^ ^ ^ 5.1. Define x _ (z,y ) and x _ v, and let B _ (B,r ) be OLS estimator ^ ^ ^ ^ from (5.5), where B = (D,a ). Using the hint, B can also be obtained by partitioned regression: ^ (i) Regress x onto v and save the residuals, say 1 x 1. (ii) Regress y onto x. 1 1 ^ ^ But when we regress z1 onto v, the residuals are just z1 since v is N ^ orthogonal in sample to z. (More precisely, S z i1v i = 0.) Further, because i=1 ^ ^ ^ ^ we can write y = y + v, where y and v are orthogonal in sample, the ^ residuals from regressing y onto v are simply the first stage fitted values, ^ ^ y. In other words, x 1 = (z 1,y ). But the SLS estimator of B1 is obtained ^ exactly from the OLS regression y on z, y a. Unobserved factors that tend to make an individual healthier also tend to make that person exercise more. For example, if health is a cardiovascular measure, people with a history of heart problems are probably less likely to exercise. Unobserved factors such as prior health or family history are contained in u, and so we are worried about correlation between exercise and 1 u. Self self-selection into exercising predicts that the benefits of 1 exercising will be, on average, overestimated. Ideally, the amount of exercise could be randomized across a sample of people, but this can be difficult. b. If people do not systematically choose the location of their homes and jobs relative to health clubs based on unobserved health characteristics, then it is reasonable to believe that disthome and 7

2 distwork are uncorrelated with u. But the location of health clubs is not 1 necessarily exogenous. Clubs may tend to be built near neighborhoods where residents have higher income and wealth, on average, and these factors can certainly affect overall health. It may make sense to choose residents from neighborhoods with very similar characteristics but where one neighborhood is located near a health club. c. The reduced form for exercise is exercise = p 0 + p1age + pweight + p3height + p male + p work + p disthome + p distwork + u, For identification we need at least one of p and p to be different from 6 7 zero. This assumption can fail if the amount that people exercise is not systematically related to distances to the nearest health club. d. An F test of H : p = 0, p = 0 is the simplest way to test the identification assumption in part (c). As usual, it would be a good idea to compute a heteroskedasticty-robust version a. There may be unobserved health factors correlated with smoking behavior that affect infant birth weight. For example, women who smoke during pregnancy may, on average, drink more coffee or alcohol, or eat less nutritious meals. b. Basic economics says that packs should be negatively correlated with cigarette price, although the correlation might be small (especially because price is aggregated at the state level). At first glance it seems that cigarette price should be exogenous in equation (5.54), but we must be a little careful. One component of cigarette price is the state tax on cigarettes. States that have lower taxes on cigarettes may also have lower 8

3 quality of health care, on average. Quality of health care is in u, and so maybe cigarette price fails the exogeneity requirement for an IV. c. OLS is followed by SLS (IV, in this case):. reg lbwght male parity lfaminc packs Source SS df MS Number of obs = F( 4, 1383) = 1.55 Model Prob > F = Residual R-squared = Adj R-squared = 0.03 Total Root MSE = lbwght Coef. Std. Err. t P> t [95% Conf. Interval] male parity lfaminc packs _cons reg lbwght male parity lfaminc packs (male parity lfaminc cigprice) (SLS) Source SS df MS Number of obs = F( 4, 1383) =.39 Model Prob > F = Residual R-squared = Adj R-squared =. Total Root MSE =.3017 lbwght Coef. Std. Err. t P> t [95% Conf. Interval] packs male parity lfaminc _cons (Note that Stata automatically shifts endogenous explanatory variables to the beginning of the list when it reports coefficients, standard errors, and so on.) The difference between OLS and IV in the estimated effect of packs on 9

4 bwght is huge. With the OLS estimate, one more pack of cigarettes is estimated to reduce bwght by about 8.4%, and is statistically significant. The IV estimate has the opposite sign, is huge in magnitude, and is not statistically significant. The sign and size of the smoking effect are not realistic. d. We can see the problem with IV by estimating the reduced form for packs:. reg packs male parity lfaminc cigprice Source SS df MS Number of obs = F( 4, 1383) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE =.9448 packs Coef. Std. Err. t P> t [95% Conf. Interval] male parity lfaminc cigprice _cons The reduced form estimates show that cigprice does not significantly affect packs; in fact, the coefficient on cigprice is not the sign we expect. Thus, cigprice fails as an IV for packs because cigprice is not partially correlated with packs (with a sensible sign for the correlation). This is separate from the problem that cigprice may not truly be exogenous in the birth weight equation a. Here are the OLS results:. reg lwage educ exper expersq black south smsa reg661-reg668 smsa66 30

5 Source SS df MS Number of obs = F( 15, 994) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE =.378 lwage Coef. Std. Err. t P> t [95% Conf. Interval] educ exper expersq black south smsa reg reg reg reg reg reg reg reg smsa _cons The estimated return to education is about 7.5%, with a very large t statistic. These reproduce the estimates from Table, Column () in Card (1995). b. The reduced form for educ is. reg educ exper expersq black south smsa reg661-reg668 smsa66 nearc4 Source SS df MS Number of obs = F( 15, 994) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = educ Coef. Std. Err. t P> t [95% Conf. Interval] exper expersq

6 black south smsa reg reg reg reg reg reg reg reg smsa nearc _cons The important coefficient is on nearc4. Statistically, educ and nearc4 are partially correlated, in a sensible way: holding other factors in the reduced form fixed, someone living near a four-year college at age 16 has, on average, almost one-third a year more education than a person not near a four-year college at age 16. This is not trivial effect, so nearc4 passes the requirement that it is partially correlated with educ. c. Here are the IV estimates:. reg lwage educ exper expersq black south smsa reg661-reg668 smsa66 (nearc4 exper expersq black south smsa reg661-reg668 smsa66) (SLS) Source SS df MS Number of obs = F( 15, 994) = Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = lwage Coef. Std. Err. t P> t [95% Conf. Interval] educ exper expersq black south smsa reg reg

7 reg reg reg reg reg reg smsa _cons The estimated return to education has increased to about 13.%, but notice how wide the 95% confidence interval is:.4% to 3.9%. By contrast, the OLS confidence interval is about 6.8% to 8.%, which is much tighter. Of course, OLS could be inconsistent, in which case a tighter CI is of little value. But the estimated return to education is higher with IV, something that seems a bit counterintuitive. One possible explanation is that educ suffers from classical errors-in-variables. Therefore, while OLS would tend to overestimate the return to schooling because of omitted "ability," the measurement error in educ leads to an attenuation bias. Measurement error may contribute to the larger IV estimate, but it is not especially convincing. It seems unlikely that educ satisfies the CEV assumptions. For example, if we think the measurement error is due to truncation -- people are asked about highest grade completed, not actually years of schooling -- then educ is always less than or equal to educ. The measurement error could not be independent of educ, either. If we think it is unobserved quality of schooling, then it seems likely that quality of schooling -- part of the measurement error -- is positively correlated with actual amount of schooling. This, too, violates the CEV assumptions. Another possibility for the much higher IV estimate comes out of the recent treatment effect literature, which is covered in Section Of course, we must also remember that the point estimates -- more precisly, the IV estimate -- is subject to substantial 33

8 sampling variation. At this point, we do not even know of OLS and IV are statistically different from each other. See Problem 6.1. d. When nearc is added to the reduced form of educ it has a coefficient (standard error) of.13 (.077), compared with.31 (.089) for nearc4. Therefore, nearc4 has a much stronger ceteris paribus relationship with educ; nearc is only marginally statistically significant once nearc4 has been included. The SLS estimate of the return to education becomes about 15.7%, with 95% CI given by 5.4% to 6%. The CI is still very wide Under the null hypothesis that q and z are uncorrelated, z and z are 1 exogenous in (5.55) because each is uncorrelated with u. Unfortunately, y 1 is correlated with u, and so the regression of y on z, y, z does not produce a consistent estimator of 0 on z even when E(z q) = 0. We could find ^ that J1 from this regression is statistically different from zero even when q and z are uncorrelated -- in which case we would incorrectly conclude that z is not a valid IV candidate. Or, we might fail to reject H 0: J 1 = 0 when z and q are correlated -- in which case we incorrectly conclude that the elements in z are valid as instruments. The point of this exercise is that one cannot simply add instrumental variable candidates in the structural equation and then test for significance of these variables using OLS. This is the sense in which identification cannot be tested. With a single endogenous variable, we must take a stand that at least one element of z is uncorrelated with q a. By definition, the reduced form is the linear projection 34

9 L(q 1,x,q ) = p + xp + p q, and we want to show that P 1 = 0 when q is uncorrelated with x. Now, because q is a linear function of q and a, and a is uncorrelated with x, q is uncorrelated with x if and only if q is uncorrelated with x. Assuming then that q and x are uncorrelated, q is also uncorrelated with x. A basic fact 1 about linear projections is that, because q and q are each uncorrelated with 1 the vector x, P = 0. This follows from Property LP.7: P can be obtained by 1 1 first projecting x on 1, q and obtaining the population residuals, say r. Then, project q onto r. But since x and q are orthogonal, r = x - M. 1 x Projecting q on (x - M ) just gives the zero vector because E[(x - M ) q ] = 1 x x 1 0. Therefore, P = 0. 1 b. If q and x are correlated then P $ 0, and x appears in the reduced 1 form for q. It is not realistic to assume that q and x are uncorrelated. 1 Under the multiple indicator assumptions, assuming x and q are uncorrelated is the same as assuming q and x are uncorrelated. If we believe q and x are uncorrelated then there is no need to collect indicators on q to consistently estimate B: we could simply put q into the error term and estimate B from an OLS regression of y on 1, x a. If we plug q = (1/d )q - (1/d )a into equation (5.45) we get y = b + b x b x + h q + v - h a, (5.56) K K where h 1 _ (1/d 1). Now, since the zh are redundant in (5.45), they are uncorrelated with the structural error, v (by definition of redundancy). Further, we have assumed that the z are uncorrelated with a. Since each x h 1 j is also uncorrelated with v - h a, we can estimate (5.56) by SLS using 1 1 instruments (1,x,...,x,z,z,...,z ) to get consistent of the b and h. 1 K 1 M j 1 35

10 Given all of the zero correlation assumptions, what we need for identification is that at least one of the z appears in the reduced form for h q. More formally, in the linear projection 1 q = p + p x p x + p z p z + r, K K K+1 1 K+M M 1 at least one of p,..., p must be different from zero. K+1 K+M b. We need family background variables to be redundant in the log(wage) equation once ability (and other factors, such as educ and exper), have been controlled for. The idea here is that family background may influence ability but should have no partial effect on log(wage) once ability has been accounted for. For the rank condition to hold, we need family background variables to be correlated with the indicator, q, say IQ, once the x have been netted 1 j out. This is likely to be true if we think that family background and ability are (partially) correlated. c. Applying the procedure to the data set in NLS80.RAW gives the following results:. reg lwage exper tenure educ married south urban black iq (exper tenure educ married south urban black meduc feduc sibs) Instrumental variables (SLS) regression Source SS df MS Number of obs = F( 8, 713) = 5.81 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = lwage Coef. Std. Err. t P> t [95% Conf. Interval] iq tenure educ married south urban

11 black exper _cons reg lwage exper tenure educ married south urban black kww (exper tenure educ married south urban black meduc feduc sibs) Instrumental variables (SLS) regression Source SS df MS Number of obs = F( 8, 713) = 5.70 Model Prob > F = Residual R-squared = Adj R-squared = Total Root MSE = lwage Coef. Std. Err. t P> t [95% Conf. Interval] kww tenure educ married south urban black exper _cons Even though there are 935 men in the sample, only 7 are used for the estimation, because data are missing on meduc and feduc. What we could do is define binary indicators for whether the corresponding variable is missing, set the missing values to zero, and then use the binary indicators as instruments along with meduc, feduc, and sibs. This would allow us to use all 935 observations. The return to education is estimated to be small and insignificant whether IQ or KWW used is used as the indicator. This could be because family background variables do not satisfy the appropriate redundancy condition, or they might be correlated with a. (In both first-stage regressions, the F 1 37

12 statistic for joint significance of meduc, feduc, and sibs have p-values below.00, so it seems the family background variables are sufficiently partially correlated with the ability indicators.) 5.8. a. Plug in the indicator q for q and the measurement x for x, being 1 K K sure to keep track of the errors: y = g + b x b x + g q + v - b e + g a, K K 1 1 K K 1 1 _ g 0 + b1x bkx K + g1q 1 + u where g 1 = 1/d 1. Now, if the variables z 1,..., zm are redundant in the structural equation (so they are uncorrelated with v), and uncorrelated with the measurement error e and the indicator error a, we can use these as IVs K 1 for x and q in SLS. We need M > because we have two explanatory K 1 variables, x and q, that are possibly correlated with the composite error u. K 1 b. The Stata results are. reg lwage exper tenure married south urban black educ iq (exper tenure married south urban black kww meduc feduc sibs) (SLS) Source SS df MS Number of obs = F( 8, 713) = Model Prob > F = Residual R-squared = Adj R-squared =. Total Root MSE =.4 lwage Coef. Std. Err. t P> t [95% Conf. Interval] educ iq exper tenure married south urban black _cons

13 The estimated return to education is very large, but imprecisely estimated. The 95% confidence interval is very wide, and easily includes zero. Interestingly, the the coefficient on iq is actually negative, and not statistically different from zero. The large IV estimate of the return to education and the insignificant ability indicator lend some support to the idea that omitted ability is less of a problem than schooling measurement error in the standard log-wage model estimated by OLS. But the evidence is not very convincing given the very wide confidence interval for the educ coefficient Define q = b - b, so that b = b + q. Plugging this expression into the equation and rearranging gives log(wage) = b 0 + b1exper + bexper + b 3(twoyr + fouryr) + q4fouryr + u = b + b exper + b exper + b totcoll + q fouryr + u, where totcoll = twoyr + fouryr. Now, just estimate the latter equation by SLS using exper, exper, distyr and dist4yr as the full set of instruments. ^ We can use the t statistic on q to test H : q = 0 against H : q > ^ a. For b, the lower right hand element in the general formula (5.4) 1 with x = (1,x) and z = (1,z) is s [Cov(z,x) /Var(z)] ^ Alternatively, you can derive this formula directly by writing rn(b1 - b 1) = # -1/ N ( -1 N )$ N S (zi - z)u i/ N S (zi - z)(xi - x). Now, r zx = Cov(z,x) /(szs x), i=1 i=1 so simple algebra shows that the asymptotic variance is s /(rzxs x). The asymptotic variance for the OLS estimator is s /s. Thus, the difference is x 39

14 the presence of rzx in the denominator of the IV asymptotic variance. b. Naturally, as the error variance s increases so does the asymptotic variance of the IV estimator. More variation in x in the population is better: as sx increases the asymptotic variance decreases. These effects are identical to those for OLS. A larger correlation between z and x reduces the asymptotic variance of the IV estimator. As rzx L 0 the asymptotic variance increases without bound. This is why an instrument that is only weakly correlated with x can lead to very imprecise IV estimators Following the hint, let y be the linear projection of y on z, let a be the projection error, and assume that L is known. (The results on generated regressors in Section show that the argument carries over to 0 the case when L is estimated.) Plugging in y = y + a gives 0 y = z D + a y + a a + u Effectively, we regress y on z, y. The key consistency condition is that 1 1 each explanatory is orthogonal to the composite error, a a + u. By assumption, E(z u ) = 0. Further, E(y a ) = 0 by construction. The problem 1 is that E(z a ) $ 0 necessarily because z was not included in the linear 1 1 projection for y. Therefore, OLS will be inconsistent for all parameters in general. Contrast this with SLS when y is the projection on z and z : y 1 = y + r = zp + r, where E(z r ) = 0. The second step regression (assuming that P is known) is essentially y = z D + a y + a r + u Now, r is uncorrelated with z, and so E(z r ) = 0 and E(y r ) = 0. The 1 lesson is that one must be very careful if manually carrying out SLS by explicitly doing the first- and second-stage regressions. 40

15 5.1. This problem is essentially proven by the hint. Given the description of ^, the only way the K columns of ^ can be linearly dependent is if the last column can be written as a linear combination of the first K - 1 columns. This is true if and only if each qj is zero. Thus, if at least one qj is different from zero, rank ^ = K a. In a simple regression model with a single IV, the IV estimate of the ^ & N & N slope can be written as b 1 = S (zi - z)(yi - y) / S (zi - z)(xi - x) = 7i=1 8 7i=1 8 & N & N S z i(yi - y) / S z i(xi - x). Now the numerator can be written as 7i=1 8 7i=1 8 N N & N S z i(yi - y) = S ziyi - S zi y = N1y1 - N1y = N 1(y1 - y). i=1 i=1 7i=1 8 N where N 1 = S zi is the number of observations in the sample with z i = 1 and i= y1 is the average of the yi over the observations with z i = 1. Next, write y as a weighted average: y = (N 0/N)y 0 + (N 1/N)y 1, where the notation should be clear. Straightforward algebra shows that y1 - y = [(N - N 1)/N]y1 - (N 0/N)y = (N 0/N)(y1 - y 0). So the numerator of the IV estimate is (N0N 1/N)(y1 - y 0) The same argument shows that the denominator is (N0N 1/N)(x1 - x 0). Taking the ratio proves the result b. If x is also binary -- representing some "treatment" -- x1 is the fraction of observations receiving treatment when z i = 1 and x0 is the fraction receiving treatment when z i = 0. So, suppose x i = 1 if person i participates in a job training program, and let z = 1 if person i is eligible i - for participation in the program. Then x1 is the fraction of people participating in the program out of those made eligibile, and x0 is the fraction of people participating who are not eligible. (When eligibility is necessary for participation, x = 0.) Generally, x - x is the difference in

16 participation rates when z = 1 and z = 0. So the difference in the mean response between the z = 1 and z = 0 groups gets divided by the difference in participation rates across the two groups a. Taking the linear projection of (5.1) under the assumption that (x,...,x,z,...,z ) are uncorrelated with u gives 1 K-1 1 M L(y z) = b + b x b x + b L(x z) + L(u z) K-1 K-1 K K since L(u z) = 0. = b + b x b x + b x K-1 K-1 K K b. By the law of iterated projections, L(y 1,x,...,x,x ) = b + b x 1 K-1 K b x + b x. Consistency of OLS for the b from the regression y K-1 K-1 K K j on 1, x,..., x, x follows immediately from our treatment of OLS from 1 K-1 K Chapter 4: OLS consistently estimates the parameters in a linear projection provided we can assume no perfect collinearity in (1,x,...,x,x ). 1 K-1 K c. I should have said explicitly to assume E(z z) is nonsingular -- that is, SLS.a holds. Then, x is not a perfect linear combination of K (x,...,x ) if and only if at least one element of z,..., z has a nonzero 1 K-1 1 M coefficient in L(x 1,x,...,x,z,...,z ). In the model with a single K 1 K-1 1 M endogenous explanatory variable, we know this condition is equivalent to Assumption SLS.b, the standard rank condition. ( ) ^ In L(x z) = z^, we can write ^ =, where I is the K x K ^1 IK 9 0 K identity matrix, 0 is the L x K zero matrix, ^ is L x K, and ^ is K x K. As in Problem 5.1, the rank condition holds if and only if rank(^) = K. 1 a. If for some x, the vector z does not appear in L(x z), then ^ has j 1 j 11 4

17 a column which is entirely zeros. But then that column of ^ can be written as a linear combination of the last K elements of ^, which means rank(^) < K. Therefore, a necessary condition for the rank condition is that no columns of ^11 be exactly zero, which means that at least one zh must appear in the reduced form of each x, j = 1,...,K. j 1 b. Suppose K = and L =, where z appears in the reduced form form both x and x, but z appears in neither reduced form. Then the x matrix 1 ^11 has zeros in its second row, which means that the second row of ^ is all zeros. It cannot have rank K, in that case. Intuitively, while we began with two instruments, only one of them turned out to be partially correlated with x and x. 1 c. Without loss of generality, we assume that z appears in the reduced j form for x ; we can simply reorder the elements of z to ensure this is the j 1 case. Then ^11 is a K1 x K1 diagonal matrix with nonzero diagonal elements. ( ) ^11 0 Looking at ^ =, we see that if ^11 is diagonal with all nonzero ^1 I 9 K0 diagonals then ^ is lower triangular with all nonzero diagonal elements. Therefore, rank ^ = K. 43

Lecture 8: Instrumental Variables Estimation

Lecture 8: Instrumental Variables Estimation Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

Problem Set # 1. Master in Business and Quantitative Methods

Problem Set # 1. Master in Business and Quantitative Methods Problem Set # 1 Master in Business and Quantitative Methods Contents 0.1 Problems on endogeneity of the regressors........... 2 0.2 Lab exercises on endogeneity of the regressors......... 4 1 0.1 Problems

More information

An explanation of Two Stage Least Squares

An explanation of Two Stage Least Squares Introduction Introduction to Econometrics An explanation of Two Stage Least Squares When we get an endogenous variable we know that OLS estimator will be inconsistent. In addition OLS regressors will also

More information

THE MULTIVARIATE LINEAR REGRESSION MODEL

THE MULTIVARIATE LINEAR REGRESSION MODEL THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus

More information

Quantitative Methods Final Exam (2017/1)

Quantitative Methods Final Exam (2017/1) Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval] Problem Set #3-Key Sonoma State University Economics 317- Introduction to Econometrics Dr. Cuellar 1. Use the data set Wage1.dta to answer the following questions. a. For the regression model Wage i =

More information

Handout 11: Measurement Error

Handout 11: Measurement Error Handout 11: Measurement Error In which you learn to recognise the consequences for OLS estimation whenever some of the variables you use are not measured as accurately as you might expect. A (potential)

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

Problem Set 10: Panel Data

Problem Set 10: Panel Data Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005

More information

Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics

Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics C1.1 Use the data set Wage1.dta to answer the following questions. Estimate regression equation wage =

More information

Multiple Regression: Inference

Multiple Regression: Inference Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled

More information

Regression #8: Loose Ends

Regression #8: Loose Ends Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

Ecmt 675: Econometrics I

Ecmt 675: Econometrics I Ecmt 675: Econometrics I Assignment 7 Problem 1 a. reg hours lwage educ age kidslt6 kidsge6 nwifeinc, r Linear regression Number of obs = 428 F( 6, 421) = 3.93 Prob > F = 0.0008 R-squared = 0.0670 Root

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

Practice exam questions

Practice exam questions Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.

More information

Lab 10 - Binary Variables

Lab 10 - Binary Variables Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2

More information

2. (3.5) (iii) Simply drop one of the independent variables, say leisure: GP A = β 0 + β 1 study + β 2 sleep + β 3 work + u.

2. (3.5) (iii) Simply drop one of the independent variables, say leisure: GP A = β 0 + β 1 study + β 2 sleep + β 3 work + u. BOSTON COLLEGE Department of Economics EC 228 Econometrics, Prof. Baum, Ms. Yu, Fall 2003 Problem Set 3 Solutions Problem sets should be your own work. You may work together with classmates, but if you

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

Handout 12. Endogeneity & Simultaneous Equation Models

Handout 12. Endogeneity & Simultaneous Equation Models Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013 ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013 Instructions: Answer all six (6) questions. Point totals for each question are given in parentheses. The parts within

More information

Problem Set 1 ANSWERS

Problem Set 1 ANSWERS Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one

More information

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a

More information

Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation

Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation ECONOMETRICS I. UC3M 1. [W 15.1] Consider a simple model to estimate the e ect of personal computer (P C) ownership on the

More information

Lecture 14. More on using dummy variables (deal with seasonality)

Lecture 14. More on using dummy variables (deal with seasonality) Lecture 14. More on using dummy variables (deal with seasonality) More things to worry about: measurement error in variables (can lead to bias in OLS (endogeneity) ) Have seen that dummy variables are

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

Essential of Simple regression

Essential of Simple regression Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship

More information

Econometrics Midterm Examination Answers

Econometrics Midterm Examination Answers Econometrics Midterm Examination Answers March 4, 204. Question (35 points) Answer the following short questions. (i) De ne what is an unbiased estimator. Show that X is an unbiased estimator for E(X i

More information

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests ECON4150 - Introductory Econometrics Lecture 7: OLS with Multiple Regressors Hypotheses tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 7 Lecture outline 2 Hypothesis test for single

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

Topics in Applied Econometrics and Development - Spring 2014

Topics in Applied Econometrics and Development - Spring 2014 Topic 2: Topics in Applied Econometrics and Development - Spring 2014 Single-Equation Linear Model The population model is linear in its parameters: y = β 0 + β 1 x 1 + β 2 x 2 +... + β K x K + u - y,

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

Problem 13.5 (10 points)

Problem 13.5 (10 points) BOSTON COLLEGE Department of Economics EC 327 Financial Econometrics Spring 2013, Prof. Baum, Mr. Park Problem Set 2 Due Monday 25 February 2013 Total Points Possible: 210 points Problem 13.5 (10 points)

More information

1 Independent Practice: Hypothesis tests for one parameter:

1 Independent Practice: Hypothesis tests for one parameter: 1 Independent Practice: Hypothesis tests for one parameter: Data from the Indian DHS survey from 2006 includes a measure of autonomy of the women surveyed (a scale from 0-10, 10 being the most autonomous)

More information

Greene, Econometric Analysis (7th ed, 2012)

Greene, Econometric Analysis (7th ed, 2012) EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 2 3: Classical Linear Regression The classical linear regression model is the single most useful tool in econometrics.

More information

Graduate Econometrics Lecture 4: Heteroskedasticity

Graduate Econometrics Lecture 4: Heteroskedasticity Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model

More information

Instrumental Variables

Instrumental Variables Università di Pavia 2010 Instrumental Variables Eduardo Rossi Exogeneity Exogeneity Assumption: the explanatory variables which form the columns of X are exogenous. It implies that any randomness in the

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

ECON Introductory Econometrics. Lecture 17: Experiments

ECON Introductory Econometrics. Lecture 17: Experiments ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Inference in Regression Model

Inference in Regression Model Inference in Regression Model Christopher Taber Department of Economics University of Wisconsin-Madison March 25, 2009 Outline 1 Final Step of Classical Linear Regression Model 2 Confidence Intervals 3

More information

Problem set - Selection and Diff-in-Diff

Problem set - Selection and Diff-in-Diff Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 7 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 68 Outline of Lecture 7 1 Empirical example: Italian labor force

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

1. The shoe size of five randomly selected men in the class is 7, 7.5, 6, 6.5 the shoe size of 4 randomly selected women is 6, 5.

1. The shoe size of five randomly selected men in the class is 7, 7.5, 6, 6.5 the shoe size of 4 randomly selected women is 6, 5. Economics 3 Introduction to Econometrics Winter 2004 Professor Dobkin Name Final Exam (Sample) You must answer all the questions. The exam is closed book and closed notes you may use calculators. You must

More information

Lab 6 - Simple Regression

Lab 6 - Simple Regression Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello

More information

Empirical Application of Simple Regression (Chapter 2)

Empirical Application of Simple Regression (Chapter 2) Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Course Econometrics I

Course Econometrics I Course Econometrics I 4. Heteroskedasticity Martin Halla Johannes Kepler University of Linz Department of Economics Last update: May 6, 2014 Martin Halla CS Econometrics I 4 1/31 Our agenda for today Consequences

More information

Multivariate Regression: Part I

Multivariate Regression: Part I Topic 1 Multivariate Regression: Part I ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà Outline of this topic Statement of the objective: we want to explain the behavior of one variable as a

More information

Solutions to Problem Set 5 (Due November 22) Maximum number of points for Problem set 5 is: 220. Problem 7.3

Solutions to Problem Set 5 (Due November 22) Maximum number of points for Problem set 5 is: 220. Problem 7.3 Solutions to Problem Set 5 (Due November 22) EC 228 02, Fall 2010 Prof. Baum, Ms Hristakeva Maximum number of points for Problem set 5 is: 220 Problem 7.3 (i) (5 points) The t statistic on hsize 2 is over

More information

4.8 Instrumental Variables

4.8 Instrumental Variables 4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap Deviations from the standard

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression

More information

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem -

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem - First Exam: Economics 388, Econometrics Spring 006 in R. Butler s class YOUR NAME: Section I (30 points) Questions 1-10 (3 points each) Section II (40 points) Questions 11-15 (10 points each) Section III

More information

Nonlinear Regression Functions

Nonlinear Regression Functions Nonlinear Regression Functions (SW Chapter 8) Outline 1. Nonlinear regression functions general comments 2. Nonlinear functions of one variable 3. Nonlinear functions of two variables: interactions 4.

More information

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE 1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +

More information

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Inference ME104: Linear Regression Analysis Kenneth Benoit August 15, 2012 August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Stata output resvisited. reg votes1st spend_total incumb minister

More information

Hypothesis Tests and Confidence Intervals. in Multiple Regression

Hypothesis Tests and Confidence Intervals. in Multiple Regression ECON4135, LN6 Hypothesis Tests and Confidence Intervals Outline 1. Why multipple regression? in Multiple Regression (SW Chapter 7) 2. Simpson s paradox (omitted variables bias) 3. Hypothesis tests and

More information

Question 1 [17 points]: (ch 11)

Question 1 [17 points]: (ch 11) Question 1 [17 points]: (ch 11) A study analyzed the probability that Major League Baseball (MLB) players "survive" for another season, or, in other words, play one more season. They studied a model of

More information

Basic Linear Model. Chapters 4 and 4: Part II. Basic Linear Model

Basic Linear Model. Chapters 4 and 4: Part II. Basic Linear Model Basic Linear Model Chapters 4 and 4: Part II Statistical Properties of Least Square Estimates Y i = α+βx i + ε I Want to chooses estimates for α and β that best fit the data Objective minimize the sum

More information

Model Specification and Data Problems. Part VIII

Model Specification and Data Problems. Part VIII Part VIII Model Specification and Data Problems As of Oct 24, 2017 1 Model Specification and Data Problems RESET test Non-nested alternatives Outliers A functional form misspecification generally means

More information

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.

More information

ECO375 Tutorial 8 Instrumental Variables

ECO375 Tutorial 8 Instrumental Variables ECO375 Tutorial 8 Instrumental Variables Matt Tudball University of Toronto Mississauga November 16, 2017 Matt Tudball (University of Toronto) ECO375H5 November 16, 2017 1 / 22 Review: Endogeneity Instrumental

More information

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11 Econ 495 - Econometric Review 1 Contents 1 Linear Regression Analysis 4 1.1 The Mincer Wage Equation................. 4 1.2 Data............................. 6 1.3 Econometric Model.....................

More information

Sociology Exam 2 Answer Key March 30, 2012

Sociology Exam 2 Answer Key March 30, 2012 Sociology 63993 Exam 2 Answer Key March 30, 2012 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher has constructed scales

More information

ECON2228 Notes 7. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 41

ECON2228 Notes 7. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 41 ECON2228 Notes 7 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 6 2014 2015 1 / 41 Chapter 8: Heteroskedasticity In laying out the standard regression model, we made

More information

Interaction effects between continuous variables (Optional)

Interaction effects between continuous variables (Optional) Interaction effects between continuous variables (Optional) Richard Williams, University of Notre Dame, https://www.nd.edu/~rwilliam/ Last revised February 0, 05 This is a very brief overview of this somewhat

More information

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Control Function and Related Methods: Nonlinear Models

Control Function and Related Methods: Nonlinear Models Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear

More information

Multiple Regression Analysis. Part III. Multiple Regression Analysis

Multiple Regression Analysis. Part III. Multiple Regression Analysis Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant

More information

ECON 836 Midterm 2016

ECON 836 Midterm 2016 ECON 836 Midterm 2016 Each of the eight questions is worth 4 points. You have 2 hours. No calculators, I- devices, computers, and phones. No open boos, and everything must be on the floor. Good luc! 1)

More information

At this point, if you ve done everything correctly, you should have data that looks something like:

At this point, if you ve done everything correctly, you should have data that looks something like: This homework is due on July 19 th. Economics 375: Introduction to Econometrics Homework #4 1. One tool to aid in understanding econometrics is the Monte Carlo experiment. A Monte Carlo experiment allows

More information

Fixed and Random Effects Models: Vartanian, SW 683

Fixed and Random Effects Models: Vartanian, SW 683 : Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:

More information

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 In Chap. 11 of Statistical Analysis of Management Data (Gatignon, 2014), tests of mediation are

More information

Specification Error: Omitted and Extraneous Variables

Specification Error: Omitted and Extraneous Variables Specification Error: Omitted and Extraneous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 5, 05 Omitted variable bias. Suppose that the correct

More information

Chapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression

Chapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression Chapter 7 Hypothesis Tests and Confidence Intervals in Multiple Regression Outline 1. Hypothesis tests and confidence intervals for a single coefficie. Joint hypothesis tests on multiple coefficients 3.

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157.

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157. CHAPTER 7 SOLUTIONS TO PROBLEMS 7. (i) The coefficient on male is 87.75, so a man is estimated to sleep almost one and one-half hours more per week than a comparable woman. Further, t male = 87.75/34.33

More information

CHAPTER 4. > 0, where β

CHAPTER 4. > 0, where β CHAPTER 4 SOLUTIONS TO PROBLEMS 4. (i) and (iii) generally cause the t statistics not to have a t distribution under H. Homoskedasticity is one of the CLM assumptions. An important omitted variable violates

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

We begin by thinking about population relationships.

We begin by thinking about population relationships. Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where

More information

FTE Employment before FTE Employment after

FTE Employment before FTE Employment after 1. (25 points) In 1992, there was an increase in the (state) minimum wage in one U.S. state (New Jersey) but not in a neighboring location (eastern Pennsylvania). The study provides you with the following

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information