The returns to schooling, ability bias, and regression
|
|
- Hugo Allison
- 6 years ago
- Views:
Transcription
1 The returns to schooling, ability bias, and regression Jörn-Steffen Pischke LSE October 4, 2016 Pischke (LSE) Griliches 1977 October 4, / 44
2 Counterfactual outcomes Scholing for individual i is described by a binary random variable, D i = {0, 1} say D i = 1 denotes completing high school, and D i = 0 denotes dropping out. The outcome of interest, log earnings is denoted by y i. Potential outcomes: What would have happened to someone who completes if they had dropped out and vice versa. Hence, for everybody there are two potential outcomes = { Y1i if D i = 1 Y 0i if D i = 0. Pischke (LSE) Griliches 1977 October 4, / 44
3 Potential and observed outcomes The effect of completing school for individual i is Y 1i Y 0i. The observed outcome, Y i, can be written in terms of potential outcomes as { Y1i if D Y i = i = 1 Y 0i if D i = 0 = Y 0i + (Y 1i Y 0i )D i. We only observe either Y 1i or Y 0i for a single individual. Pischke (LSE) Griliches 1977 October 4, / 44
4 The selection problem We can write the observed difference in earnings for the treated and untreated as the sum of two terms: E [Y i D i = 1] E [Y i D i = 0] }{{} Observed diff in earnings cond. on schooling = E [Y 1i D i = 1] E [Y 0i D i = 0] = E [Y 1i D i = 1] E [Y 0i D i = 1] }{{} average treatment effect on the treated +E [Y 0i D i = 1] E [Y 0i D i = 0] }{{} ability or selection bias Pischke (LSE) Griliches 1977 October 4, / 44
5 Selection on observables The key to regression is that conditional on observable variables, e.g. ability, counterfactual outcomes are mean independent of schooling: Then: E [Y 0i A i, D i ] = E [Y 0i A i ] (1) E [Y i A i, D i = 1] E [Y i A i, D i = 0] }{{} Observed difference in average log earnings = E [Y 1i A i, D i = 1] E [Y 0i A i, D i = 0] = E [Y 1i A i, D i = 1] E [Y 0i A i, D i = 1] }{{} if (1) holds get to switch D i = E [Y 1i Y 0i A i, D i = 1] }{{} average treatment effect on the treated in groups defined by A i Pischke (LSE) Griliches 1977 October 4, / 44
6 Selection on observables leads to matching We have just estimated group specific treatment effects defined by the variable A i : E [Y i A i = a, D i = 1] E [Y i A i = a, D i = 0] = E [Y 1i Y 0i A i = a, D i = 1] ρ a Do this for all values the variable A i takes on. To get from these group conditional averages to the overall average use the rules for conditional probability E [Y 1i Y 0i D i = 1] = E [Y 1i Y 0i A i = a, D i = 1] Pr (A i = a D i = 1) a = ρ a Pr (A i = a D i = 1) a Pischke (LSE) Griliches 1977 October 4, / 44
7 From matching to regression How to get from this to a regression is easiest to see in a constant effects setting. So let ρ a = ρ and write Using Y 0i = α + η i Y 1i = Y 0i + ρ Y i = Y 0i + (Y 1i Y 0i )D i = E (Y 0i ) + (Y 1i Y 0i ) D }{{}}{{} i + [Y 0i E (Y 0i )] }{{} α ρ η i = α + ρd i + η i Pischke (LSE) Griliches 1977 October 4, / 44
8 Still on the road to regression Using the CIA (1) Y i = α + ρd i + η i (2) E [Y 0i A i, D i ] = E [Y 0i A i ] E [η i A i, D i ] = E [η i A i ] we see that η i is a function of A i, which is related to D i. So (2) is not a regression yet. Approximate E [η i A i ] by a linear function Substitute in (2): E [η i A i ] = γa i η i = γa i + e i E [e i A i ] = 0 Y i = α + ρd i + γa i + e i which is a regression (i.e. E [e i A i, D i ] = 0). Pischke (LSE) Griliches 1977 October 4, / 44
9 Regression anatomy A key tool in understanding multivariate regression the regression anatomy formula (or Frisch-Waugh-Lowell theorem). Consider a generic bivariate regression: Then Y i = α + β 1 X 1i + β 2 X 2i + e i β 1 = Cov(Y i, X 1i ) Var( X 1i ) where X 1i is the residual from a regression of X 1i on X 2i : X 1i = π 0 + π 1 X 2i + X 1i. Pischke (LSE) Griliches 1977 October 4, / 44
10 The regression anatomy formula derived Substitute the expression for the long regression into Cov(Y i, X 1i ) Var( X 1i ) = Cov(α + β 1 X 1i + β 2 X 2i + e i, X 1i ) Var( X 1i ) =Var ( X 1i ) =0 =0 {}}{{}}{{}}{ = β 1Cov(X 1i, X 1i ) + β 2 Cov(X 2i, X 1i ) + Cov(e i, X 1i ) Var( X 1i ) = β 1 Var( X 1i ) Var( X 1i ) = β 1. In words: the coeffi cient from a bivariate regression of Y i on X 1i is numerically the same as the long regression coeffi cient on X 1i from a regression of Y i on X 1i and X 2i. Pischke (LSE) Griliches 1977 October 4, / 44
11 Alternative versions of regression anatomy Regression anatomy can also be written as β 1 = Cov(Y i, X 1i ) Var( X 1i ) = Cov(Ỹ i, X 1i ) Var( X 1i ) i.e. also partialling out X 2i from the dependent variable. It is not enough to partial out the covariate from the dependent variable alone Cov(Ỹ i, X 1i ) Var(X 1i ) = Cov(Ỹ i, X 1i ) Var( X 1i ) Var( X 1i ) Var(X 1i ) = β 1 (you can partial out X 2i from Y i but you must partial it out from X 1i ). Regression anatomy works for multiple covariates just as well. Pischke (LSE) Griliches 1977 October 4, / 44
12 Ability bias and the returns to schooling We would like to run the long regression Y i = α + ρs i + γa i + e i where Y i is log earnings, S i is schooling and A i is ability. If we don t have a measure of ability we can only run the short regression What do we get? Y i = α s + ρ s S i + e s i. Pischke (LSE) Griliches 1977 October 4, / 44
13 The omitted variables bias formula The relationship between the long and short regression coeffi cients is given by the omitted variables bias (OVB) formula ρ s = Cov(Y i, S i ) Var(S i ) = ρ + γδ AS where δ AS = Cov(A i,s i ) Var(S i ) is the regeression coeffi cient from a regression of A i (the omitted variable) on S i (the included variable). Exercise: Derive the OVB formula (works just like for regression anatonmy). The OVB formula is a mechanical relationship between two regressions: it holds regardless of the causal interpretation of any of the coeffcients. Pischke (LSE) Griliches 1977 October 4, / 44
14 Griliches (1977) regressions The conventional wisdom is Cov(A i, S i ) > 0, so returns to schooling estimates will be biased up. Short regression estimates using the NLS Long regression estimates Y i = const (0.003) S i + experience Y i = const (0.003) S i (0.0005) IQ i + experience The results are consistent with the conventional wisdom. Pischke (LSE) Griliches 1977 October 4, / 44
15 Classical measurement error Measurement error leads to bias. mismeasured. Many economic variables are Look at a generic example and start with a simple bivariate regression We don t observe X i where Y i = α + βx i + e i. but X i Xi = X i + m i Cov(Xi, m i ) = 0 Cov(e i, m i ) = 0. This is called classical measurement error. Pischke (LSE) Griliches 1977 October 4, / 44
16 Attenuation from classical measurement error The bivariate regression coeffi cient we estimate is β = Cov(Y i, X i ) Var(X i ) = Cov(α + βx i + e i, Xi + m i ) Var(Xi + m i ) = Var(Xi ) β Var(Xi ) + Var(m i ) = βλ. We see that β is biased towards zero by an attenuation factor λ = Var(X i ) Var(X i ) + Var(m i ) which is the variance in the signal divided by the variance in the signal plus noise. Pischke (LSE) Griliches 1977 October 4, / 44
17 How measurement error works Y X True data Mismeasured data Pischke (LSE) Griliches 1977 October 4, / 44
18 Measurement error in the returns to schooling Think of Y i as log earnings, and X i as schooling. experience for the moment. Ignore age or Ashenfelter and Krueger (1994) find λ = 0.9 for schooling. This means if the true return to schooling is 0.1, we would expect an estimate of Pischke (LSE) Griliches 1977 October 4, / 44
19 Measurement error with two regressors The bivariate regression is not particularly interesting, since we typically want to use regression to control for other factors. Consider Y i = α + β 1 X 1i + β 2 X 2i + e i. and only X 1i is subject to classical measurement error, i.e. Cov(X 1i, m i ) = Cov(X 2i, m i ) = 0. Starting from X 1i = X 1i + m i and using the classical measurement error assumptions we have X 1i = X 1i + m i. Pischke (LSE) Griliches 1977 October 4, / 44
20 Measurement error with two regressors The estimator of β 1 is now (using regression anatomy): ( ) β 1 = Cov Y i, X 1i ( ) Var X 1i ( ) = Cov α + β 1 X1i + β 2 X 2i + e i, X 1i + m i ( ) Var X 1i ( ) Var X 1i = β 1 ( ) = β 1 λ Var X 1i + Var (m i ) It is easy to see that λ λ whenever X 1i and X 2i are correlated. Fact Adding correlated regressors makes attenuation bias from measurement error worse. Pischke (LSE) Griliches 1977 October 4, / 44
21 Comparing the short and long regression The short regression (on just X 1i ) coeffi cient is β 1,short = λβ 1 + β 2 δ X2 X 1 = λ ( β 1 + β 2 δ X2 X 1 ) where the estimate of β 1 is biased both because of attenuation due to measurement error, and because of omitted variables bias (the part β 2 δ X2 X 1 where δ X2 X 1 is the coeffi cient from a regression of X 2i on X 1i ). The coeffi cient from the long regression is β 1,long = β 1 λ and since but λ < λ β 1,short β 1,long. Pischke (LSE) Griliches 1977 October 4, / 44
22 Comparing the short and long regression Notice that it is more diffi cult to compare the bias from the short regression and the long regression now. λ < λ implies that the attenuation bias goes up when another regressor is entered which is correlated with X 1i. There is less attenuation in the short regression but there is also OVB now. Not clear what the net effect is. Pischke (LSE) Griliches 1977 October 4, / 44
23 Measurement error in the control What about the coeffi cient β 2? Even when there is no measurement error in X 2i, the estimate of β 2 will be biased: β 2 = β 1 δ X2 X 1 Note that the bias will be larger the larger the measurement error the correlation between X 1i and X 2i The intuition is that ( ) 1 λ + β 2. β 1 is attenuated, and hence does not reflect the full effect of X 1i β 2 will capture part of the effect of X 1i through the correlation with X 2i Pischke (LSE) Griliches 1977 October 4, / 44
24 Measurement error in the returns to schooling controlling for ability We want to run the regression y i = α + ρs i + γa i + e i where Si is schooling and A i is ability. Suppose we only have a mismeasured version of schooling, S i (so S i takes on the role of X 1i before, and A i takes on the role of X 2i ). Then the short regression will give and the long regression ρ short = λρ + γδ AS ρ long = λρ If ability bias is upwards (δ AS > 0) it is not possible to say a priori which estimate will be closer to ρ. Pischke (LSE) Griliches 1977 October 4, / 44
25 Putting some numbers on the Griliches example Pick some numbers for the regression and set y i = 0.1S i λ = A i + e i σ S = 3, σ A = 15, σ AS = Then and δ AS = σ AS σ 2 S = = 2.5 ρ short = λρ + γδ AS = = Pischke (LSE) Griliches 1977 October 4, / 44
26 What about the long regression? We first need which is λ = Var ( S i ) Var ( S i ) + Var (m i ) = λ R2 AS 1 R 2 AS R 2 AS = λ = Then the long regression coeffi cient is ( ) 2 ( ) σas = = 0.25 σ S σ A = ρ long = λρ = = so the short regression coeffi cient is too large and the long regression coeffi cient is too small. Pischke (LSE) Griliches 1977 October 4, / 44
27 Measurement error in ability Now suppose years of schooling is measured perfectly but instead of Ai we only have mismeasured ability A i (so S i takes on the role of X 2i before, and A i takes on the role of X 1i ). Then ) ρ = γδ AS (1 λ + ρ. If ability bias is upwards (δ AS > 0) then the returns to schooling will be biased up but by less than in the short regression. Controlling for A i is better than controlling for nothing, but not as good as controlling for true ability A i. Pischke (LSE) Griliches 1977 October 4, / 44
28 Instrumental variables solve the measurement error problem Suppose you have an instrument Z i, correlated with the signal Xi uncorrelated with the error m i. In the bivariate regression you get and β IV = Cov(Y i, Z i ) Cov(X i, Z i ) = Cov(α + βx i + e i, Z i ) Cov(Xi = βcov(x i, Z i ) + m i, Z i ) Cov(Xi, Z i ) = β In the multivariate regression you get for similar reasons: β 1,IV = β 1 β 2,IV = β 2 (notice that only the mismeasured X 1i is instrumented) Pischke (LSE) Griliches 1977 October 4, / 44
29 More Griliches (1977) regressions Recall the Griliches long regression estimates y i = const (0.003) S i (0.0005) IQ i + experience Instrumenting IQ with results from the Knowledge of the World of Work test he gets y i = const (0.004) S i (0.0009) IQ i + experience This is still consistent with two stories: Pischke (LSE) Griliches 1977 October 4, / 44
30 More Griliches (1977) regressions Recall the Griliches long regression estimates y i = const (0.003) S i (0.0005) IQ i + experience Instrumenting IQ with results from the Knowledge of the World of Work test he gets y i = const (0.004) S i (0.0009) IQ i + experience This is still consistent with two stories: there is upward ability bias in the bivariate return to schooling and measurement error in IQ but not schooling (in this case the estimate of is correct) Pischke (LSE) Griliches 1977 October 4, / 44
31 More Griliches (1977) regressions Recall the Griliches long regression estimates y i = const (0.003) S i (0.0005) IQ i + experience Instrumenting IQ with results from the Knowledge of the World of Work test he gets y i = const (0.004) S i (0.0009) IQ i + experience This is still consistent with two stories: there is upward ability bias in the bivariate return to schooling and measurement error in IQ but not schooling (in this case the estimate of is correct) there is measurement error in both schooling and ability (in this case the true coeffi cient could be anything). Pischke (LSE) Griliches 1977 October 4, / 44
32 What to control for? In the quest for identifying causal effects, which variables belong on the right hand side of a regression equation? Yes: Variables determining the treatment and correlated with the outcome (e.g. ability). in general these variables will be fixed characteristics or pre-determined by the time of treatment (e.g. schooling) but remember the warnings on measurement error just discussed: the kitchen sink is no panacea Yes: Variables uncorrelated with the treatment but correlated with the outcome these variables may help reducing standard errors No: Variables which are outcomes of the treatment itself. bad controls. These are Pischke (LSE) Griliches 1977 October 4, / 44
33 Bad control Some researchers regressing earnings on schooling (and experience) include controls for occupation. Does this make sense? Clearly we can think of schooling affecting the access to higher level occupations, e.g. you need a Ph.D. to become a college professor. This gives rise to a two equation system Y i = α + ρs i + γo i + e i O i = λ 0 + λ 1 S i + u i You could think about these as a simultaneous equations system. Occupation O i is an endogenous variable. As a result, you could not necessarily estimate the first equation by OLS. Occupation is a bad control. Pischke (LSE) Griliches 1977 October 4, / 44
34 Bad control example 1 Pischke (LSE) Griliches 1977 October 4, / 44
35 Bad control example 2 Pischke (LSE) Griliches 1977 October 4, / 44
36 Proxy control Sometimes we control for a variable in the best of intentions. our regression of schooling on earnings Suppose Y i = α + ρs i + γa i + e i has a causal interpretation conditional on ability. Instead of ability we only have a test score taken at age 18, call it A li for late ability. The problem is that schooling will already have influenced the late ability (some students will have dropped out by 18). Suppose A li = π 0 + π 1 S i + π 2 A i i.e. that age 18 test scores are influenced by both schooling and true ability. Pischke (LSE) Griliches 1977 October 4, / 44
37 What do we get with proxy control? Substituting for A i in our regression above we get ( y i = α γ π ) ( 0 + ρ γ π ) 1 S i + γ A li + e i. π 2 π 2 π 2 If ρ > 0 γ > 0 π 1 > 0, π 2 > 0 then ( ρ γ π ) 1 < ρ π 2 and we will estimate a return to schooling that is too small. Pischke (LSE) Griliches 1977 October 4, / 44
38 Are our results robust? Suppose you have an identification strategy, say Y i = α + ρs i + γ 1 A i + e i. You have another variable, X i, which could potentially be an additional confounder. What do you do with it? Is ρ l = ρ? Y i = α l + ρ l S i + γ 1l A i + γ 2 X i + e l i Is δ = 0? X i = δ 0 + δs i + δ 1 A i + u i Pischke (LSE) Griliches 1977 October 4, / 44
39 A tale of two tests Y i = α + ρs i + γ 1 A i + e i Y i = α l + ρ l S i + γ 1l A i + γ 2 X i + e l i X i = δ 0 + δs i + δ 1 A i + u i The test ρ l = ρ is a coeffi cient comparison test. generalized Hausman test. The test δ = 0 is a balancing test. How are they related? Through the OVB formula: Formally a ρ ρ l = γ 2 δ. Under the maintained assumption γ 2 = 0, the two tests test the same null hypothesis δ = 0. Pischke (LSE) Griliches 1977 October 4, / 44
40 Measurement error in controls again Remember ( ) ρ l = γ 2 δ 1 λ + ρ γ 2 = γ 2 λ. Classical measurement error in X i biases ρ l towards ρ, and γ 2 towards 0. ρ l closer to ρ: power of the coeffi cient comparison test directly affected. Variation from measurement error ends up in ei l, raising standard errors and reducing power There is no bias in the estimate of δ Variation from measurement error ends up in u i, raising standard errors and reducing power Pischke (LSE) Griliches 1977 October 4, / 44
41 Theoretical Power Functions Homoskedastic, plain standard errors Rejection probability d Balancing test, θ=0 (baseline) Balancing test, θ=.7 Balancing test, θ=.85 CC test, θ=0 (baseline) CC test, θ=.7 CC test, θ=.85 Pischke (LSE) Griliches 1977 October 4, / 44
42 Simulated Rejection Rates Heteroskedasticity, robust standard errors Rejection probability d Balancing test, baseline Balancing test, θ=0, robust Balancing test, θ=.85, robust CC test, baseline CC test, θ=0, robust CC test, θ=.85, robust Pischke (LSE) Griliches 1977 October 4, / 44
43 Simulated Rejection Rates Mean reverting measurement error, robust standard errors Rejection probability d Balancing test, baseline CC test, baseline Balancing test, σ 2 µ=.75 CC test, σ 2 µ=.75 Balancing test, σ 2 µ=2.25 CC test, σ 2 µ=2.25 Pischke (LSE) Griliches 1977 October 4, / 44
44 Baseline Regressions for Returns to Schooling Mother's years of Library card Log hourly earnings education at age 14 (1) (2) (3) (6) (7) Years of education Mother's years of education Library card at age (0.0040) (0.0042) (0.0040) (0.0300) (0.0040) (0.0029) (0.0183) p-values Coefficient comparison test Balancing test Pischke (LSE) Griliches 1977 October 4, / 44
45 Returns to Schooling controlling for KWW score Mother's years of Library card Log hourly earnings education at age 14 (1) (2) (3) (6) (7) Years of education KWW score Mother's years of education Library card at age (0.0059) (0.0060) (0.0059) (0.0422) (0.0059) (0.0015) (0.0016) (0.0016) (0.0107) (0.0016) (0.0037) (0.0215) Body height in inches p-values Coefficient comparison test Balancing test Pischke (LSE) Griliches 1977 October 4, / 44
46 Returns to Schooling instrumenting the KWW score Mother's years of Library card Log hourly earnings education at age 14 (1) (2) (3) (6) (7) Years of education KWW score instrumented by IQ Mother's years of education Library card at age (0.0139) (0.0139) (0.0138) (0.0952) (0.0134) (0.0063) (0.0063) (0.0063) (0.0422) (0.0060) (0.0039) (0.0245) Body height in inches p-values Coefficient comparison test Balancing test Pischke (LSE) Griliches 1977 October 4, / 44
Poorly Measured Confounders are More Useful on the Left Than on the Right
Poorly Measured Confounders are More Useful on the Left Than on the Right Jörn-Steffen Pischke LSE Hannes Schwandt University of Zürich April 2016 Abstract Researchers frequently test identifying assumptions
More informationLecture Notes on Measurement Error
Steve Pischke Spring 2000 Lecture Notes on Measurement Error These notes summarize a variety of simple results on measurement error which I nd useful. They also provide some references where more complete
More informationEconomics 241B Estimation with Instruments
Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More informationECON Introductory Econometrics. Lecture 16: Instrumental variables
ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental
More informationProblem Set #6: OLS. Economics 835: Econometrics. Fall 2012
Problem Set #6: OLS Economics 835: Econometrics Fall 202 A preliminary result Suppose we have a random sample of size n on the scalar random variables (x, y) with finite means, variances, and covariance.
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationEMERGING MARKETS - Lecture 2: Methodology refresher
EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different
More informationEssential of Simple regression
Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship
More informationThe Simple Linear Regression Model
The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares
Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit
More informationEconometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage
More informationREGRESSION RECAP. Josh Angrist. MIT (Fall 2014)
REGRESSION RECAP Josh Angrist MIT 14.387 (Fall 2014) Regression: What You Need to Know We spend our lives running regressions (I should say: "regressions run me"). And yet this basic empirical tool is
More information1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE
1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +
More information1 Correlation between an independent variable and the error
Chapter 7 outline, Econometrics Instrumental variables and model estimation 1 Correlation between an independent variable and the error Recall that one of the assumptions that we make when proving the
More informationEC402 - Problem Set 3
EC402 - Problem Set 3 Konrad Burchardi 11th of February 2009 Introduction Today we will - briefly talk about the Conditional Expectation Function and - lengthily talk about Fixed Effects: How do we calculate
More informationMultiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =
Economics 130 Lecture 6 Midterm Review Next Steps for the Class Multiple Regression Review & Issues Model Specification Issues Launching the Projects!!!!! Midterm results: AVG = 26.5 (88%) A = 27+ B =
More informationOmitted Variable Bias, Coefficient Stability and Other Issues. Chase Potter 6/22/16
Omitted Variable Bias, Coefficient Stability and Other Issues Chase Potter 6/22/16 Roadmap 1. Omitted Variable Bias 2. Coefficient Stability Oster 2015 Paper 3. Poorly Measured Confounders Pischke and
More informationAbility Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006
Ability Bias, Errors in Variables and Sibling Methods James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006 1 1 Ability Bias Consider the model: log = 0 + 1 + where =income, = schooling,
More informationECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47
ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationECO375 Tutorial 8 Instrumental Variables
ECO375 Tutorial 8 Instrumental Variables Matt Tudball University of Toronto Mississauga November 16, 2017 Matt Tudball (University of Toronto) ECO375H5 November 16, 2017 1 / 22 Review: Endogeneity Instrumental
More informationEC 533 Labour Economics Problem Set 1 Answers. = w r. S = f S. f r = 0. log y = log w + log f(s, A)
Steve Pischke EC 533 Labour Economics Problem Set 1 Answers 1 Optimal Choice of Schooling and Returns to Schooling (a) The individual maximizes or alternatively V = S ye rt dt = wf(s, A) e rt dt S = w
More informationEcon 2120: Section 2
Econ 2120: Section 2 Part I - Linear Predictor Loose Ends Ashesh Rambachan Fall 2018 Outline Big Picture Matrix Version of the Linear Predictor and Least Squares Fit Linear Predictor Least Squares Omitted
More informationControlling for Time Invariant Heterogeneity
Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors
More informationInstrumental Variables
Instrumental Variables Yona Rubinstein July 2016 Yona Rubinstein (LSE) Instrumental Variables 07/16 1 / 31 The Limitation of Panel Data So far we learned how to account for selection on time invariant
More informationFinal Exam. Economics 835: Econometrics. Fall 2010
Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each
More informationThe Regression Tool. Yona Rubinstein. July Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35
The Regression Tool Yona Rubinstein July 2016 Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35 Regressions Regression analysis is one of the most commonly used statistical techniques in social and
More informationEconometrics Homework 4 Solutions
Econometrics Homework 4 Solutions Question 1 (a) General sources of problem: measurement error in regressors, omitted variables that are correlated to the regressors, and simultaneous equation (reverse
More informationRewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35
Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate
More informationNBER WORKING PAPER SERIES POORLY MEASURED CONFOUNDERS ARE MORE USEFUL ON THE LEFT THAN ON THE RIGHT. Zhuan Pei Jörn-Steffen Pischke Hannes Schwandt
NBER WORKING PAPER SERIES POORLY MEASURED CONFOUNDERS ARE MORE USEFUL ON THE LEFT THAN ON THE RIGHT Zhuan Pei Jörn-Steffen Pischke Hannes Schwandt Working Paper 33 http://www.nber.org/papers/w33 NATIONAL
More informationWe begin by thinking about population relationships.
Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where
More information8. Instrumental variables regression
8. Instrumental variables regression Recall: In Section 5 we analyzed five sources of estimation bias arising because the regressor is correlated with the error term Violation of the first OLS assumption
More informationProblem Set - Instrumental Variables
Problem Set - Instrumental Variables 1. Consider a simple model to estimate the effect of personal computer (PC) ownership on college grade point average for graduating seniors at a large public university:
More informationRecitation Notes 6. Konrad Menzel. October 22, 2006
Recitation Notes 6 Konrad Menzel October, 006 Random Coefficient Models. Motivation In the empirical literature on education and earnings, the main object of interest is the human capital earnings function
More informationInstrumental Variables
Instrumental Variables Department of Economics University of Wisconsin-Madison September 27, 2016 Treatment Effects Throughout the course we will focus on the Treatment Effect Model For now take that to
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationCHAPTER 6: SPECIFICATION VARIABLES
Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero
More informationPSC 504: Instrumental Variables
PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationLecture 4: Linear panel models
Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)
More informationEc1123 Section 7 Instrumental Variables
Ec1123 Section 7 Instrumental Variables Andrea Passalacqua Harvard University andreapassalacqua@g.harvard.edu November 16th, 2017 Andrea Passalacqua (Harvard) Ec1123 Section 7 Instrumental Variables November
More informationMotivation for multiple regression
Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope
More informationThe regression model with one stochastic regressor (part II)
The regression model with one stochastic regressor (part II) 3150/4150 Lecture 7 Ragnar Nymoen 6 Feb 2012 We will finish Lecture topic 4: The regression model with stochastic regressor We will first look
More informationGov 2000: 9. Regression with Two Independent Variables
Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics
More informationECNS 561 Multiple Regression Analysis
ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking
More informationExam D0M61A Advanced econometrics
Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model
Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Most of this course will be concerned with use of a regression model: a structure in which one or more explanatory
More informationIntroductory Econometrics
Introductory Econometrics Violation of basic assumptions Heteroskedasticity Barbara Pertold-Gebicka CERGE-EI 16 November 010 OLS assumptions 1. Disturbances are random variables drawn from a normal distribution.
More informationWarwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation
Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory
More informationEC 423 Labour Economics Problem Set 3 Answers. 1. Optimal Choice of Schooling and Returns to Schooling. ye rt dt = wf(s, A) e rt dt. = w r.
Steve Pischke EC 43 Labour Economics Problem Set 3 Answers 1 Optimal Choice of Schooling and Returns to Schooling (a) The individual maximizes or alternatively V = The first order condition is Notice that
More informationInstrumental Variables
Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables
More informationTeaching Causal Inference in Undergraduate Econometrics
Teaching Causal Inference in Undergraduate Econometrics October 24, 2012 Abstract This paper argues that the current way in which the undergraduate introductory econometrics course is taught is neither
More informationEcon 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias
Contact Information Elena Llaudet Sections are voluntary. My office hours are Thursdays 5pm-7pm in Littauer Mezzanine 34-36 (Note room change) You can email me administrative questions to ellaudet@gmail.com.
More informationLecture 5: Omitted Variables, Dummy Variables and Multicollinearity
Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the
More informationLecture #8 & #9 Multiple regression
Lecture #8 & #9 Multiple regression Starting point: Y = f(x 1, X 2,, X k, u) Outcome variable of interest (movie ticket price) a function of several variables. Observables and unobservables. One or more
More informationTesting Linear Restrictions: cont.
Testing Linear Restrictions: cont. The F-statistic is closely connected with the R of the regression. In fact, if we are testing q linear restriction, can write the F-stastic as F = (R u R r)=q ( R u)=(n
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationUsing Instrumental Variables to Find Causal Effects in Public Health
1 Using Instrumental Variables to Find Causal Effects in Public Health Antonio Trujillo, PhD John Hopkins Bloomberg School of Public Health Department of International Health Health Systems Program October
More informationHonors General Exam Part 3: Econometrics Solutions. Harvard University
Honors General Exam Part 3: Econometrics Solutions Harvard University April 6, 2016 Lead Pipes and Homicides In the second half of the nineteenth century, many U.S. cities built water supply systems. Some
More informationEconometrics Problem Set 4
Econometrics Problem Set 4 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions in shown in Table 1 computed using data for 1988 from the CPS.
More informationEmpirical approaches in public economics
Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental
More informationMore on Roy Model of Self-Selection
V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income
More informationEC4051 Project and Introductory Econometrics
EC4051 Project and Introductory Econometrics Dudley Cooke Trinity College Dublin Dudley Cooke (Trinity College Dublin) Intro to Econometrics 1 / 23 Project Guidelines Each student is required to undertake
More informationstatistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:
Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility
More informationFreeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94
Freeing up the Classical Assumptions () Introductory Econometrics: Topic 5 1 / 94 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions needed for derivations
More informationRegression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y
Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of
More informationApplied Econometrics (MSc.) Lecture 3 Instrumental Variables
Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.
More informationWooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems
Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including
More informationWrite your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).
STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods in Economics 2 Course code: EC2402 Examiner: Peter Skogman Thoursie Number of credits: 7,5 credits (hp) Date of exam: Saturday,
More informationEconometrics. 7) Endogeneity
30C00200 Econometrics 7) Endogeneity Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Common types of endogeneity Simultaneity Omitted variables Measurement errors
More informationCausal Inference with and without Experiments I
http://www.econ.ucsd.edu/~elib/250a/ Causal Inference with and without Experiments I 1. The Big Picture: Data, Statistics, Economic Theory and Applied Econometrics 2. Causal Inference: The SAT Prep example
More information14.32 Final : Spring 2001
14.32 Final : Spring 2001 Please read the entire exam before you begin. You have 3 hours. No books or notes should be used. Calculators are allowed. There are 105 points. Good luck! A. True/False/Sometimes
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Per Pettersson-Lidbom Number of creds: 7,5 creds Date of exam: Thursday, January 15, 009 Examination
More informationSimultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations
Simultaneous Equation Models. Introduction: basic definitions 2. Consequences of ignoring simultaneity 3. The identification problem 4. Estimation of simultaneous equation models 5. Example: IS LM model
More informationLinear Regression with Multiple Regressors
Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution
More informationLogistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015
Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article
More informationECON The Simple Regression Model
ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In
More informationPotential Outcomes Model (POM)
Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics
More informationIntroduction to Econometrics. Multiple Regression
Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 OLS estimate of the TS/STR
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint
More informationBTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014
BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 Homework 4 (version 3) - posted October 3 Assigned October 2; Due 11:59PM October 9 Problem 1 (Easy) a. For the genetic regression model: Y
More informationAnswers to Problem Set #4
Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2
More informationRegression with a Single Regressor: Hypothesis Tests and Confidence Intervals
Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More informationResearch Design: Causal inference and counterfactuals
Research Design: Causal inference and counterfactuals University College Dublin 8 March 2013 1 2 3 4 Outline 1 2 3 4 Inference In regression analysis we look at the relationship between (a set of) independent
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More informationIntroduction to Econometrics Third Edition James H. Stock Mark W. Watson The statistical analysis of economic (and related) data
Introduction to Econometrics Third Edition James H. Stock Mark W. Watson The statistical analysis of economic (and related) data 1/2/3-1 1/2/3-2 Brief Overview of the Course Economics suggests important
More informationEcon 423 Lecture Notes: Additional Topics in Time Series 1
Econ 423 Lecture Notes: Additional Topics in Time Series 1 John C. Chao April 25, 2017 1 These notes are based in large part on Chapter 16 of Stock and Watson (2011). They are for instructional purposes
More informationFNCE 926 Empirical Methods in CF
FNCE 926 Empirical Methods in CF Lecture 3 Causality Professor Todd Gormley Announcement n You should have uploaded Exercise #1 and your DO files to Canvas already n Exercise #2 due two weeks from today
More informationIntroduction to Econometrics. Multiple Regression (2016/2017)
Introduction to Econometrics STAT-S-301 Multiple Regression (016/017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 OLS estimate of the TS/STR relation: OLS estimate of the Test Score/STR relation:
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS 1a) The model is cw i = β 0 + β 1 el i + ɛ i, where cw i is the weight of the ith chick, el i the length of the egg from which it hatched, and ɛ i
More informationLecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)
Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model
More informationSingle-Equation GMM: Endogeneity Bias
Single-Equation GMM: Lecture for Economics 241B Douglas G. Steigerwald UC Santa Barbara January 2012 Initial Question Initial Question How valuable is investment in college education? economics - measure
More informationLinear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons
Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator
More informationIV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation
SS 2010 WS 2014/15 Alexander Spermann Evaluation With Non-Experimental Approaches Selection on Unobservables Natural Experiment (exogenous variation in a variable) DiD Example: Card/Krueger (1994) Minimum
More information