The returns to schooling, ability bias, and regression

Size: px
Start display at page:

Download "The returns to schooling, ability bias, and regression"

Transcription

1 The returns to schooling, ability bias, and regression Jörn-Steffen Pischke LSE October 4, 2016 Pischke (LSE) Griliches 1977 October 4, / 44

2 Counterfactual outcomes Scholing for individual i is described by a binary random variable, D i = {0, 1} say D i = 1 denotes completing high school, and D i = 0 denotes dropping out. The outcome of interest, log earnings is denoted by y i. Potential outcomes: What would have happened to someone who completes if they had dropped out and vice versa. Hence, for everybody there are two potential outcomes = { Y1i if D i = 1 Y 0i if D i = 0. Pischke (LSE) Griliches 1977 October 4, / 44

3 Potential and observed outcomes The effect of completing school for individual i is Y 1i Y 0i. The observed outcome, Y i, can be written in terms of potential outcomes as { Y1i if D Y i = i = 1 Y 0i if D i = 0 = Y 0i + (Y 1i Y 0i )D i. We only observe either Y 1i or Y 0i for a single individual. Pischke (LSE) Griliches 1977 October 4, / 44

4 The selection problem We can write the observed difference in earnings for the treated and untreated as the sum of two terms: E [Y i D i = 1] E [Y i D i = 0] }{{} Observed diff in earnings cond. on schooling = E [Y 1i D i = 1] E [Y 0i D i = 0] = E [Y 1i D i = 1] E [Y 0i D i = 1] }{{} average treatment effect on the treated +E [Y 0i D i = 1] E [Y 0i D i = 0] }{{} ability or selection bias Pischke (LSE) Griliches 1977 October 4, / 44

5 Selection on observables The key to regression is that conditional on observable variables, e.g. ability, counterfactual outcomes are mean independent of schooling: Then: E [Y 0i A i, D i ] = E [Y 0i A i ] (1) E [Y i A i, D i = 1] E [Y i A i, D i = 0] }{{} Observed difference in average log earnings = E [Y 1i A i, D i = 1] E [Y 0i A i, D i = 0] = E [Y 1i A i, D i = 1] E [Y 0i A i, D i = 1] }{{} if (1) holds get to switch D i = E [Y 1i Y 0i A i, D i = 1] }{{} average treatment effect on the treated in groups defined by A i Pischke (LSE) Griliches 1977 October 4, / 44

6 Selection on observables leads to matching We have just estimated group specific treatment effects defined by the variable A i : E [Y i A i = a, D i = 1] E [Y i A i = a, D i = 0] = E [Y 1i Y 0i A i = a, D i = 1] ρ a Do this for all values the variable A i takes on. To get from these group conditional averages to the overall average use the rules for conditional probability E [Y 1i Y 0i D i = 1] = E [Y 1i Y 0i A i = a, D i = 1] Pr (A i = a D i = 1) a = ρ a Pr (A i = a D i = 1) a Pischke (LSE) Griliches 1977 October 4, / 44

7 From matching to regression How to get from this to a regression is easiest to see in a constant effects setting. So let ρ a = ρ and write Using Y 0i = α + η i Y 1i = Y 0i + ρ Y i = Y 0i + (Y 1i Y 0i )D i = E (Y 0i ) + (Y 1i Y 0i ) D }{{}}{{} i + [Y 0i E (Y 0i )] }{{} α ρ η i = α + ρd i + η i Pischke (LSE) Griliches 1977 October 4, / 44

8 Still on the road to regression Using the CIA (1) Y i = α + ρd i + η i (2) E [Y 0i A i, D i ] = E [Y 0i A i ] E [η i A i, D i ] = E [η i A i ] we see that η i is a function of A i, which is related to D i. So (2) is not a regression yet. Approximate E [η i A i ] by a linear function Substitute in (2): E [η i A i ] = γa i η i = γa i + e i E [e i A i ] = 0 Y i = α + ρd i + γa i + e i which is a regression (i.e. E [e i A i, D i ] = 0). Pischke (LSE) Griliches 1977 October 4, / 44

9 Regression anatomy A key tool in understanding multivariate regression the regression anatomy formula (or Frisch-Waugh-Lowell theorem). Consider a generic bivariate regression: Then Y i = α + β 1 X 1i + β 2 X 2i + e i β 1 = Cov(Y i, X 1i ) Var( X 1i ) where X 1i is the residual from a regression of X 1i on X 2i : X 1i = π 0 + π 1 X 2i + X 1i. Pischke (LSE) Griliches 1977 October 4, / 44

10 The regression anatomy formula derived Substitute the expression for the long regression into Cov(Y i, X 1i ) Var( X 1i ) = Cov(α + β 1 X 1i + β 2 X 2i + e i, X 1i ) Var( X 1i ) =Var ( X 1i ) =0 =0 {}}{{}}{{}}{ = β 1Cov(X 1i, X 1i ) + β 2 Cov(X 2i, X 1i ) + Cov(e i, X 1i ) Var( X 1i ) = β 1 Var( X 1i ) Var( X 1i ) = β 1. In words: the coeffi cient from a bivariate regression of Y i on X 1i is numerically the same as the long regression coeffi cient on X 1i from a regression of Y i on X 1i and X 2i. Pischke (LSE) Griliches 1977 October 4, / 44

11 Alternative versions of regression anatomy Regression anatomy can also be written as β 1 = Cov(Y i, X 1i ) Var( X 1i ) = Cov(Ỹ i, X 1i ) Var( X 1i ) i.e. also partialling out X 2i from the dependent variable. It is not enough to partial out the covariate from the dependent variable alone Cov(Ỹ i, X 1i ) Var(X 1i ) = Cov(Ỹ i, X 1i ) Var( X 1i ) Var( X 1i ) Var(X 1i ) = β 1 (you can partial out X 2i from Y i but you must partial it out from X 1i ). Regression anatomy works for multiple covariates just as well. Pischke (LSE) Griliches 1977 October 4, / 44

12 Ability bias and the returns to schooling We would like to run the long regression Y i = α + ρs i + γa i + e i where Y i is log earnings, S i is schooling and A i is ability. If we don t have a measure of ability we can only run the short regression What do we get? Y i = α s + ρ s S i + e s i. Pischke (LSE) Griliches 1977 October 4, / 44

13 The omitted variables bias formula The relationship between the long and short regression coeffi cients is given by the omitted variables bias (OVB) formula ρ s = Cov(Y i, S i ) Var(S i ) = ρ + γδ AS where δ AS = Cov(A i,s i ) Var(S i ) is the regeression coeffi cient from a regression of A i (the omitted variable) on S i (the included variable). Exercise: Derive the OVB formula (works just like for regression anatonmy). The OVB formula is a mechanical relationship between two regressions: it holds regardless of the causal interpretation of any of the coeffcients. Pischke (LSE) Griliches 1977 October 4, / 44

14 Griliches (1977) regressions The conventional wisdom is Cov(A i, S i ) > 0, so returns to schooling estimates will be biased up. Short regression estimates using the NLS Long regression estimates Y i = const (0.003) S i + experience Y i = const (0.003) S i (0.0005) IQ i + experience The results are consistent with the conventional wisdom. Pischke (LSE) Griliches 1977 October 4, / 44

15 Classical measurement error Measurement error leads to bias. mismeasured. Many economic variables are Look at a generic example and start with a simple bivariate regression We don t observe X i where Y i = α + βx i + e i. but X i Xi = X i + m i Cov(Xi, m i ) = 0 Cov(e i, m i ) = 0. This is called classical measurement error. Pischke (LSE) Griliches 1977 October 4, / 44

16 Attenuation from classical measurement error The bivariate regression coeffi cient we estimate is β = Cov(Y i, X i ) Var(X i ) = Cov(α + βx i + e i, Xi + m i ) Var(Xi + m i ) = Var(Xi ) β Var(Xi ) + Var(m i ) = βλ. We see that β is biased towards zero by an attenuation factor λ = Var(X i ) Var(X i ) + Var(m i ) which is the variance in the signal divided by the variance in the signal plus noise. Pischke (LSE) Griliches 1977 October 4, / 44

17 How measurement error works Y X True data Mismeasured data Pischke (LSE) Griliches 1977 October 4, / 44

18 Measurement error in the returns to schooling Think of Y i as log earnings, and X i as schooling. experience for the moment. Ignore age or Ashenfelter and Krueger (1994) find λ = 0.9 for schooling. This means if the true return to schooling is 0.1, we would expect an estimate of Pischke (LSE) Griliches 1977 October 4, / 44

19 Measurement error with two regressors The bivariate regression is not particularly interesting, since we typically want to use regression to control for other factors. Consider Y i = α + β 1 X 1i + β 2 X 2i + e i. and only X 1i is subject to classical measurement error, i.e. Cov(X 1i, m i ) = Cov(X 2i, m i ) = 0. Starting from X 1i = X 1i + m i and using the classical measurement error assumptions we have X 1i = X 1i + m i. Pischke (LSE) Griliches 1977 October 4, / 44

20 Measurement error with two regressors The estimator of β 1 is now (using regression anatomy): ( ) β 1 = Cov Y i, X 1i ( ) Var X 1i ( ) = Cov α + β 1 X1i + β 2 X 2i + e i, X 1i + m i ( ) Var X 1i ( ) Var X 1i = β 1 ( ) = β 1 λ Var X 1i + Var (m i ) It is easy to see that λ λ whenever X 1i and X 2i are correlated. Fact Adding correlated regressors makes attenuation bias from measurement error worse. Pischke (LSE) Griliches 1977 October 4, / 44

21 Comparing the short and long regression The short regression (on just X 1i ) coeffi cient is β 1,short = λβ 1 + β 2 δ X2 X 1 = λ ( β 1 + β 2 δ X2 X 1 ) where the estimate of β 1 is biased both because of attenuation due to measurement error, and because of omitted variables bias (the part β 2 δ X2 X 1 where δ X2 X 1 is the coeffi cient from a regression of X 2i on X 1i ). The coeffi cient from the long regression is β 1,long = β 1 λ and since but λ < λ β 1,short β 1,long. Pischke (LSE) Griliches 1977 October 4, / 44

22 Comparing the short and long regression Notice that it is more diffi cult to compare the bias from the short regression and the long regression now. λ < λ implies that the attenuation bias goes up when another regressor is entered which is correlated with X 1i. There is less attenuation in the short regression but there is also OVB now. Not clear what the net effect is. Pischke (LSE) Griliches 1977 October 4, / 44

23 Measurement error in the control What about the coeffi cient β 2? Even when there is no measurement error in X 2i, the estimate of β 2 will be biased: β 2 = β 1 δ X2 X 1 Note that the bias will be larger the larger the measurement error the correlation between X 1i and X 2i The intuition is that ( ) 1 λ + β 2. β 1 is attenuated, and hence does not reflect the full effect of X 1i β 2 will capture part of the effect of X 1i through the correlation with X 2i Pischke (LSE) Griliches 1977 October 4, / 44

24 Measurement error in the returns to schooling controlling for ability We want to run the regression y i = α + ρs i + γa i + e i where Si is schooling and A i is ability. Suppose we only have a mismeasured version of schooling, S i (so S i takes on the role of X 1i before, and A i takes on the role of X 2i ). Then the short regression will give and the long regression ρ short = λρ + γδ AS ρ long = λρ If ability bias is upwards (δ AS > 0) it is not possible to say a priori which estimate will be closer to ρ. Pischke (LSE) Griliches 1977 October 4, / 44

25 Putting some numbers on the Griliches example Pick some numbers for the regression and set y i = 0.1S i λ = A i + e i σ S = 3, σ A = 15, σ AS = Then and δ AS = σ AS σ 2 S = = 2.5 ρ short = λρ + γδ AS = = Pischke (LSE) Griliches 1977 October 4, / 44

26 What about the long regression? We first need which is λ = Var ( S i ) Var ( S i ) + Var (m i ) = λ R2 AS 1 R 2 AS R 2 AS = λ = Then the long regression coeffi cient is ( ) 2 ( ) σas = = 0.25 σ S σ A = ρ long = λρ = = so the short regression coeffi cient is too large and the long regression coeffi cient is too small. Pischke (LSE) Griliches 1977 October 4, / 44

27 Measurement error in ability Now suppose years of schooling is measured perfectly but instead of Ai we only have mismeasured ability A i (so S i takes on the role of X 2i before, and A i takes on the role of X 1i ). Then ) ρ = γδ AS (1 λ + ρ. If ability bias is upwards (δ AS > 0) then the returns to schooling will be biased up but by less than in the short regression. Controlling for A i is better than controlling for nothing, but not as good as controlling for true ability A i. Pischke (LSE) Griliches 1977 October 4, / 44

28 Instrumental variables solve the measurement error problem Suppose you have an instrument Z i, correlated with the signal Xi uncorrelated with the error m i. In the bivariate regression you get and β IV = Cov(Y i, Z i ) Cov(X i, Z i ) = Cov(α + βx i + e i, Z i ) Cov(Xi = βcov(x i, Z i ) + m i, Z i ) Cov(Xi, Z i ) = β In the multivariate regression you get for similar reasons: β 1,IV = β 1 β 2,IV = β 2 (notice that only the mismeasured X 1i is instrumented) Pischke (LSE) Griliches 1977 October 4, / 44

29 More Griliches (1977) regressions Recall the Griliches long regression estimates y i = const (0.003) S i (0.0005) IQ i + experience Instrumenting IQ with results from the Knowledge of the World of Work test he gets y i = const (0.004) S i (0.0009) IQ i + experience This is still consistent with two stories: Pischke (LSE) Griliches 1977 October 4, / 44

30 More Griliches (1977) regressions Recall the Griliches long regression estimates y i = const (0.003) S i (0.0005) IQ i + experience Instrumenting IQ with results from the Knowledge of the World of Work test he gets y i = const (0.004) S i (0.0009) IQ i + experience This is still consistent with two stories: there is upward ability bias in the bivariate return to schooling and measurement error in IQ but not schooling (in this case the estimate of is correct) Pischke (LSE) Griliches 1977 October 4, / 44

31 More Griliches (1977) regressions Recall the Griliches long regression estimates y i = const (0.003) S i (0.0005) IQ i + experience Instrumenting IQ with results from the Knowledge of the World of Work test he gets y i = const (0.004) S i (0.0009) IQ i + experience This is still consistent with two stories: there is upward ability bias in the bivariate return to schooling and measurement error in IQ but not schooling (in this case the estimate of is correct) there is measurement error in both schooling and ability (in this case the true coeffi cient could be anything). Pischke (LSE) Griliches 1977 October 4, / 44

32 What to control for? In the quest for identifying causal effects, which variables belong on the right hand side of a regression equation? Yes: Variables determining the treatment and correlated with the outcome (e.g. ability). in general these variables will be fixed characteristics or pre-determined by the time of treatment (e.g. schooling) but remember the warnings on measurement error just discussed: the kitchen sink is no panacea Yes: Variables uncorrelated with the treatment but correlated with the outcome these variables may help reducing standard errors No: Variables which are outcomes of the treatment itself. bad controls. These are Pischke (LSE) Griliches 1977 October 4, / 44

33 Bad control Some researchers regressing earnings on schooling (and experience) include controls for occupation. Does this make sense? Clearly we can think of schooling affecting the access to higher level occupations, e.g. you need a Ph.D. to become a college professor. This gives rise to a two equation system Y i = α + ρs i + γo i + e i O i = λ 0 + λ 1 S i + u i You could think about these as a simultaneous equations system. Occupation O i is an endogenous variable. As a result, you could not necessarily estimate the first equation by OLS. Occupation is a bad control. Pischke (LSE) Griliches 1977 October 4, / 44

34 Bad control example 1 Pischke (LSE) Griliches 1977 October 4, / 44

35 Bad control example 2 Pischke (LSE) Griliches 1977 October 4, / 44

36 Proxy control Sometimes we control for a variable in the best of intentions. our regression of schooling on earnings Suppose Y i = α + ρs i + γa i + e i has a causal interpretation conditional on ability. Instead of ability we only have a test score taken at age 18, call it A li for late ability. The problem is that schooling will already have influenced the late ability (some students will have dropped out by 18). Suppose A li = π 0 + π 1 S i + π 2 A i i.e. that age 18 test scores are influenced by both schooling and true ability. Pischke (LSE) Griliches 1977 October 4, / 44

37 What do we get with proxy control? Substituting for A i in our regression above we get ( y i = α γ π ) ( 0 + ρ γ π ) 1 S i + γ A li + e i. π 2 π 2 π 2 If ρ > 0 γ > 0 π 1 > 0, π 2 > 0 then ( ρ γ π ) 1 < ρ π 2 and we will estimate a return to schooling that is too small. Pischke (LSE) Griliches 1977 October 4, / 44

38 Are our results robust? Suppose you have an identification strategy, say Y i = α + ρs i + γ 1 A i + e i. You have another variable, X i, which could potentially be an additional confounder. What do you do with it? Is ρ l = ρ? Y i = α l + ρ l S i + γ 1l A i + γ 2 X i + e l i Is δ = 0? X i = δ 0 + δs i + δ 1 A i + u i Pischke (LSE) Griliches 1977 October 4, / 44

39 A tale of two tests Y i = α + ρs i + γ 1 A i + e i Y i = α l + ρ l S i + γ 1l A i + γ 2 X i + e l i X i = δ 0 + δs i + δ 1 A i + u i The test ρ l = ρ is a coeffi cient comparison test. generalized Hausman test. The test δ = 0 is a balancing test. How are they related? Through the OVB formula: Formally a ρ ρ l = γ 2 δ. Under the maintained assumption γ 2 = 0, the two tests test the same null hypothesis δ = 0. Pischke (LSE) Griliches 1977 October 4, / 44

40 Measurement error in controls again Remember ( ) ρ l = γ 2 δ 1 λ + ρ γ 2 = γ 2 λ. Classical measurement error in X i biases ρ l towards ρ, and γ 2 towards 0. ρ l closer to ρ: power of the coeffi cient comparison test directly affected. Variation from measurement error ends up in ei l, raising standard errors and reducing power There is no bias in the estimate of δ Variation from measurement error ends up in u i, raising standard errors and reducing power Pischke (LSE) Griliches 1977 October 4, / 44

41 Theoretical Power Functions Homoskedastic, plain standard errors Rejection probability d Balancing test, θ=0 (baseline) Balancing test, θ=.7 Balancing test, θ=.85 CC test, θ=0 (baseline) CC test, θ=.7 CC test, θ=.85 Pischke (LSE) Griliches 1977 October 4, / 44

42 Simulated Rejection Rates Heteroskedasticity, robust standard errors Rejection probability d Balancing test, baseline Balancing test, θ=0, robust Balancing test, θ=.85, robust CC test, baseline CC test, θ=0, robust CC test, θ=.85, robust Pischke (LSE) Griliches 1977 October 4, / 44

43 Simulated Rejection Rates Mean reverting measurement error, robust standard errors Rejection probability d Balancing test, baseline CC test, baseline Balancing test, σ 2 µ=.75 CC test, σ 2 µ=.75 Balancing test, σ 2 µ=2.25 CC test, σ 2 µ=2.25 Pischke (LSE) Griliches 1977 October 4, / 44

44 Baseline Regressions for Returns to Schooling Mother's years of Library card Log hourly earnings education at age 14 (1) (2) (3) (6) (7) Years of education Mother's years of education Library card at age (0.0040) (0.0042) (0.0040) (0.0300) (0.0040) (0.0029) (0.0183) p-values Coefficient comparison test Balancing test Pischke (LSE) Griliches 1977 October 4, / 44

45 Returns to Schooling controlling for KWW score Mother's years of Library card Log hourly earnings education at age 14 (1) (2) (3) (6) (7) Years of education KWW score Mother's years of education Library card at age (0.0059) (0.0060) (0.0059) (0.0422) (0.0059) (0.0015) (0.0016) (0.0016) (0.0107) (0.0016) (0.0037) (0.0215) Body height in inches p-values Coefficient comparison test Balancing test Pischke (LSE) Griliches 1977 October 4, / 44

46 Returns to Schooling instrumenting the KWW score Mother's years of Library card Log hourly earnings education at age 14 (1) (2) (3) (6) (7) Years of education KWW score instrumented by IQ Mother's years of education Library card at age (0.0139) (0.0139) (0.0138) (0.0952) (0.0134) (0.0063) (0.0063) (0.0063) (0.0422) (0.0060) (0.0039) (0.0245) Body height in inches p-values Coefficient comparison test Balancing test Pischke (LSE) Griliches 1977 October 4, / 44

Poorly Measured Confounders are More Useful on the Left Than on the Right

Poorly Measured Confounders are More Useful on the Left Than on the Right Poorly Measured Confounders are More Useful on the Left Than on the Right Jörn-Steffen Pischke LSE Hannes Schwandt University of Zürich April 2016 Abstract Researchers frequently test identifying assumptions

More information

Lecture Notes on Measurement Error

Lecture Notes on Measurement Error Steve Pischke Spring 2000 Lecture Notes on Measurement Error These notes summarize a variety of simple results on measurement error which I nd useful. They also provide some references where more complete

More information

Economics 241B Estimation with Instruments

Economics 241B Estimation with Instruments Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012 Problem Set #6: OLS Economics 835: Econometrics Fall 202 A preliminary result Suppose we have a random sample of size n on the scalar random variables (x, y) with finite means, variances, and covariance.

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

Essential of Simple regression

Essential of Simple regression Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

REGRESSION RECAP. Josh Angrist. MIT (Fall 2014)

REGRESSION RECAP. Josh Angrist. MIT (Fall 2014) REGRESSION RECAP Josh Angrist MIT 14.387 (Fall 2014) Regression: What You Need to Know We spend our lives running regressions (I should say: "regressions run me"). And yet this basic empirical tool is

More information

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE 1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +

More information

1 Correlation between an independent variable and the error

1 Correlation between an independent variable and the error Chapter 7 outline, Econometrics Instrumental variables and model estimation 1 Correlation between an independent variable and the error Recall that one of the assumptions that we make when proving the

More information

EC402 - Problem Set 3

EC402 - Problem Set 3 EC402 - Problem Set 3 Konrad Burchardi 11th of February 2009 Introduction Today we will - briefly talk about the Conditional Expectation Function and - lengthily talk about Fixed Effects: How do we calculate

More information

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C = Economics 130 Lecture 6 Midterm Review Next Steps for the Class Multiple Regression Review & Issues Model Specification Issues Launching the Projects!!!!! Midterm results: AVG = 26.5 (88%) A = 27+ B =

More information

Omitted Variable Bias, Coefficient Stability and Other Issues. Chase Potter 6/22/16

Omitted Variable Bias, Coefficient Stability and Other Issues. Chase Potter 6/22/16 Omitted Variable Bias, Coefficient Stability and Other Issues Chase Potter 6/22/16 Roadmap 1. Omitted Variable Bias 2. Coefficient Stability Oster 2015 Paper 3. Poorly Measured Confounders Pischke and

More information

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006 Ability Bias, Errors in Variables and Sibling Methods James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006 1 1 Ability Bias Consider the model: log = 0 + 1 + where =income, = schooling,

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

ECO375 Tutorial 8 Instrumental Variables

ECO375 Tutorial 8 Instrumental Variables ECO375 Tutorial 8 Instrumental Variables Matt Tudball University of Toronto Mississauga November 16, 2017 Matt Tudball (University of Toronto) ECO375H5 November 16, 2017 1 / 22 Review: Endogeneity Instrumental

More information

EC 533 Labour Economics Problem Set 1 Answers. = w r. S = f S. f r = 0. log y = log w + log f(s, A)

EC 533 Labour Economics Problem Set 1 Answers. = w r. S = f S. f r = 0. log y = log w + log f(s, A) Steve Pischke EC 533 Labour Economics Problem Set 1 Answers 1 Optimal Choice of Schooling and Returns to Schooling (a) The individual maximizes or alternatively V = S ye rt dt = wf(s, A) e rt dt S = w

More information

Econ 2120: Section 2

Econ 2120: Section 2 Econ 2120: Section 2 Part I - Linear Predictor Loose Ends Ashesh Rambachan Fall 2018 Outline Big Picture Matrix Version of the Linear Predictor and Least Squares Fit Linear Predictor Least Squares Omitted

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Yona Rubinstein July 2016 Yona Rubinstein (LSE) Instrumental Variables 07/16 1 / 31 The Limitation of Panel Data So far we learned how to account for selection on time invariant

More information

Final Exam. Economics 835: Econometrics. Fall 2010

Final Exam. Economics 835: Econometrics. Fall 2010 Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each

More information

The Regression Tool. Yona Rubinstein. July Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35

The Regression Tool. Yona Rubinstein. July Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35 The Regression Tool Yona Rubinstein July 2016 Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35 Regressions Regression analysis is one of the most commonly used statistical techniques in social and

More information

Econometrics Homework 4 Solutions

Econometrics Homework 4 Solutions Econometrics Homework 4 Solutions Question 1 (a) General sources of problem: measurement error in regressors, omitted variables that are correlated to the regressors, and simultaneous equation (reverse

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

NBER WORKING PAPER SERIES POORLY MEASURED CONFOUNDERS ARE MORE USEFUL ON THE LEFT THAN ON THE RIGHT. Zhuan Pei Jörn-Steffen Pischke Hannes Schwandt

NBER WORKING PAPER SERIES POORLY MEASURED CONFOUNDERS ARE MORE USEFUL ON THE LEFT THAN ON THE RIGHT. Zhuan Pei Jörn-Steffen Pischke Hannes Schwandt NBER WORKING PAPER SERIES POORLY MEASURED CONFOUNDERS ARE MORE USEFUL ON THE LEFT THAN ON THE RIGHT Zhuan Pei Jörn-Steffen Pischke Hannes Schwandt Working Paper 33 http://www.nber.org/papers/w33 NATIONAL

More information

We begin by thinking about population relationships.

We begin by thinking about population relationships. Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where

More information

8. Instrumental variables regression

8. Instrumental variables regression 8. Instrumental variables regression Recall: In Section 5 we analyzed five sources of estimation bias arising because the regressor is correlated with the error term Violation of the first OLS assumption

More information

Problem Set - Instrumental Variables

Problem Set - Instrumental Variables Problem Set - Instrumental Variables 1. Consider a simple model to estimate the effect of personal computer (PC) ownership on college grade point average for graduating seniors at a large public university:

More information

Recitation Notes 6. Konrad Menzel. October 22, 2006

Recitation Notes 6. Konrad Menzel. October 22, 2006 Recitation Notes 6 Konrad Menzel October, 006 Random Coefficient Models. Motivation In the empirical literature on education and earnings, the main object of interest is the human capital earnings function

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Department of Economics University of Wisconsin-Madison September 27, 2016 Treatment Effects Throughout the course we will focus on the Treatment Effect Model For now take that to

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

CHAPTER 6: SPECIFICATION VARIABLES

CHAPTER 6: SPECIFICATION VARIABLES Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero

More information

PSC 504: Instrumental Variables

PSC 504: Instrumental Variables PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured

More information

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43 Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression

More information

Lecture 4: Linear panel models

Lecture 4: Linear panel models Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)

More information

Ec1123 Section 7 Instrumental Variables

Ec1123 Section 7 Instrumental Variables Ec1123 Section 7 Instrumental Variables Andrea Passalacqua Harvard University andreapassalacqua@g.harvard.edu November 16th, 2017 Andrea Passalacqua (Harvard) Ec1123 Section 7 Instrumental Variables November

More information

Motivation for multiple regression

Motivation for multiple regression Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope

More information

The regression model with one stochastic regressor (part II)

The regression model with one stochastic regressor (part II) The regression model with one stochastic regressor (part II) 3150/4150 Lecture 7 Ragnar Nymoen 6 Feb 2012 We will finish Lecture topic 4: The regression model with stochastic regressor We will first look

More information

Gov 2000: 9. Regression with Two Independent Variables

Gov 2000: 9. Regression with Two Independent Variables Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics

More information

ECNS 561 Multiple Regression Analysis

ECNS 561 Multiple Regression Analysis ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking

More information

Exam D0M61A Advanced econometrics

Exam D0M61A Advanced econometrics Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Most of this course will be concerned with use of a regression model: a structure in which one or more explanatory

More information

Introductory Econometrics

Introductory Econometrics Introductory Econometrics Violation of basic assumptions Heteroskedasticity Barbara Pertold-Gebicka CERGE-EI 16 November 010 OLS assumptions 1. Disturbances are random variables drawn from a normal distribution.

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

EC 423 Labour Economics Problem Set 3 Answers. 1. Optimal Choice of Schooling and Returns to Schooling. ye rt dt = wf(s, A) e rt dt. = w r.

EC 423 Labour Economics Problem Set 3 Answers. 1. Optimal Choice of Schooling and Returns to Schooling. ye rt dt = wf(s, A) e rt dt. = w r. Steve Pischke EC 43 Labour Economics Problem Set 3 Answers 1 Optimal Choice of Schooling and Returns to Schooling (a) The individual maximizes or alternatively V = The first order condition is Notice that

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables

More information

Teaching Causal Inference in Undergraduate Econometrics

Teaching Causal Inference in Undergraduate Econometrics Teaching Causal Inference in Undergraduate Econometrics October 24, 2012 Abstract This paper argues that the current way in which the undergraduate introductory econometrics course is taught is neither

More information

Econ 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias

Econ 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias Contact Information Elena Llaudet Sections are voluntary. My office hours are Thursdays 5pm-7pm in Littauer Mezzanine 34-36 (Note room change) You can email me administrative questions to ellaudet@gmail.com.

More information

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the

More information

Lecture #8 & #9 Multiple regression

Lecture #8 & #9 Multiple regression Lecture #8 & #9 Multiple regression Starting point: Y = f(x 1, X 2,, X k, u) Outcome variable of interest (movie ticket price) a function of several variables. Observables and unobservables. One or more

More information

Testing Linear Restrictions: cont.

Testing Linear Restrictions: cont. Testing Linear Restrictions: cont. The F-statistic is closely connected with the R of the regression. In fact, if we are testing q linear restriction, can write the F-stastic as F = (R u R r)=q ( R u)=(n

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

Using Instrumental Variables to Find Causal Effects in Public Health

Using Instrumental Variables to Find Causal Effects in Public Health 1 Using Instrumental Variables to Find Causal Effects in Public Health Antonio Trujillo, PhD John Hopkins Bloomberg School of Public Health Department of International Health Health Systems Program October

More information

Honors General Exam Part 3: Econometrics Solutions. Harvard University

Honors General Exam Part 3: Econometrics Solutions. Harvard University Honors General Exam Part 3: Econometrics Solutions Harvard University April 6, 2016 Lead Pipes and Homicides In the second half of the nineteenth century, many U.S. cities built water supply systems. Some

More information

Econometrics Problem Set 4

Econometrics Problem Set 4 Econometrics Problem Set 4 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions in shown in Table 1 computed using data for 1988 from the CPS.

More information

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

More on Roy Model of Self-Selection

More on Roy Model of Self-Selection V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income

More information

EC4051 Project and Introductory Econometrics

EC4051 Project and Introductory Econometrics EC4051 Project and Introductory Econometrics Dudley Cooke Trinity College Dublin Dudley Cooke (Trinity College Dublin) Intro to Econometrics 1 / 23 Project Guidelines Each student is required to undertake

More information

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors: Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility

More information

Freeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94

Freeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94 Freeing up the Classical Assumptions () Introductory Econometrics: Topic 5 1 / 94 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions needed for derivations

More information

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of

More information

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including

More information

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover). STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods in Economics 2 Course code: EC2402 Examiner: Peter Skogman Thoursie Number of credits: 7,5 credits (hp) Date of exam: Saturday,

More information

Econometrics. 7) Endogeneity

Econometrics. 7) Endogeneity 30C00200 Econometrics 7) Endogeneity Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Common types of endogeneity Simultaneity Omitted variables Measurement errors

More information

Causal Inference with and without Experiments I

Causal Inference with and without Experiments I http://www.econ.ucsd.edu/~elib/250a/ Causal Inference with and without Experiments I 1. The Big Picture: Data, Statistics, Economic Theory and Applied Econometrics 2. Causal Inference: The SAT Prep example

More information

14.32 Final : Spring 2001

14.32 Final : Spring 2001 14.32 Final : Spring 2001 Please read the entire exam before you begin. You have 3 hours. No books or notes should be used. Calculators are allowed. There are 105 points. Good luck! A. True/False/Sometimes

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Per Pettersson-Lidbom Number of creds: 7,5 creds Date of exam: Thursday, January 15, 009 Examination

More information

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations Simultaneous Equation Models. Introduction: basic definitions 2. Consequences of ignoring simultaneity 3. The identification problem 4. Estimation of simultaneous equation models 5. Example: IS LM model

More information

Linear Regression with Multiple Regressors

Linear Regression with Multiple Regressors Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article

More information

ECON The Simple Regression Model

ECON The Simple Regression Model ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In

More information

Potential Outcomes Model (POM)

Potential Outcomes Model (POM) Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics

More information

Introduction to Econometrics. Multiple Regression

Introduction to Econometrics. Multiple Regression Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 OLS estimate of the TS/STR

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 Homework 4 (version 3) - posted October 3 Assigned October 2; Due 11:59PM October 9 Problem 1 (Easy) a. For the genetic regression model: Y

More information

Answers to Problem Set #4

Answers to Problem Set #4 Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2

More information

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

Research Design: Causal inference and counterfactuals

Research Design: Causal inference and counterfactuals Research Design: Causal inference and counterfactuals University College Dublin 8 March 2013 1 2 3 4 Outline 1 2 3 4 Inference In regression analysis we look at the relationship between (a set of) independent

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Introduction to Econometrics Third Edition James H. Stock Mark W. Watson The statistical analysis of economic (and related) data

Introduction to Econometrics Third Edition James H. Stock Mark W. Watson The statistical analysis of economic (and related) data Introduction to Econometrics Third Edition James H. Stock Mark W. Watson The statistical analysis of economic (and related) data 1/2/3-1 1/2/3-2 Brief Overview of the Course Economics suggests important

More information

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Econ 423 Lecture Notes: Additional Topics in Time Series 1 Econ 423 Lecture Notes: Additional Topics in Time Series 1 John C. Chao April 25, 2017 1 These notes are based in large part on Chapter 16 of Stock and Watson (2011). They are for instructional purposes

More information

FNCE 926 Empirical Methods in CF

FNCE 926 Empirical Methods in CF FNCE 926 Empirical Methods in CF Lecture 3 Causality Professor Todd Gormley Announcement n You should have uploaded Exercise #1 and your DO files to Canvas already n Exercise #2 due two weeks from today

More information

Introduction to Econometrics. Multiple Regression (2016/2017)

Introduction to Econometrics. Multiple Regression (2016/2017) Introduction to Econometrics STAT-S-301 Multiple Regression (016/017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 OLS estimate of the TS/STR relation: OLS estimate of the Test Score/STR relation:

More information

Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS

Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS 1a) The model is cw i = β 0 + β 1 el i + ɛ i, where cw i is the weight of the ith chick, el i the length of the egg from which it hatched, and ɛ i

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

Single-Equation GMM: Endogeneity Bias

Single-Equation GMM: Endogeneity Bias Single-Equation GMM: Lecture for Economics 241B Douglas G. Steigerwald UC Santa Barbara January 2012 Initial Question Initial Question How valuable is investment in college education? economics - measure

More information

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator

More information

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation SS 2010 WS 2014/15 Alexander Spermann Evaluation With Non-Experimental Approaches Selection on Unobservables Natural Experiment (exogenous variation in a variable) DiD Example: Card/Krueger (1994) Minimum

More information