Recitation 7. and Kirkebøen, Leuven, and Mogstad (2014) Spring Peter Hull

Similar documents
Potential Outcomes Model (POM)

Recitation Notes 6. Konrad Menzel. October 22, 2006

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives

II. MATCHMAKER, MATCHMAKER

The Generalized Roy Model and Treatment Effects

Recitation Notes 5. Konrad Menzel. October 13, 2006

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

PSC 504: Instrumental Variables

Recitation 10

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 11 Roy model, MTE, PRTE

More on Roy Model of Self-Selection

Regression Discontinuity Designs.

AGEC 661 Note Fourteen

Lecture 11/12. Roy Model, MTE, Structural Estimation

International Trade Lecture 16: Gravity Models (Theory)

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market

Empirical approaches in public economics

Gov 2002: 5. Matching

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments

Front-Door Adjustment

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Single-Equation GMM: Endogeneity Bias

PSC 504: Differences-in-differeces estimators

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters

Gov 2002: 4. Observational Studies and Confounding

Treatment Effects with Normal Disturbances in sampleselection Package

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

Advanced Econometrics I

Instrumental Variables in Action: Sometimes You get What You Need

What s New in Econometrics? Lecture 14 Quantile Methods

Causality and Experiments

Estimation of Treatment Effects under Essential Heterogeneity

Applied Econometrics Lecture 1

The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures

Impact Evaluation Technical Workshop:

Econometric Causality

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

Exam D0M61A Advanced econometrics

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1

Exploring Marginal Treatment Effects

Instrumental Variables

Recitation 5. "Fréchet-ing up" Roy (Hsieh et al. 2013) Spring Peter Hull

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Internation1al Trade

Recitation 1: Regression Review. Christina Patterson

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

CEMMAP Masterclass: Empirical Models of Comparative Advantage and the Gains from Trade 1 Lecture 3: Gravity Models

Quantitative Economics for the Evaluation of the European Policy

REGRESSION RECAP. Josh Angrist. MIT (Fall 2014)

Flexible Estimation of Treatment Effect Parameters

Counterfactuals via Deep IV. Matt Taddy (Chicago + MSR) Greg Lewis (MSR) Jason Hartford (UBC) Kevin Leyton-Brown (UBC)

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

Transparent Structural Estimation. Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro)

New Developments in Econometrics Lecture 16: Quantile Estimation

Fall 2017 Recitation 10 Notes

Groupe de lecture. Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings. Abadie, Angrist, Imbens

Introduction to Econometrics

Multivariate Distributions

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

The problem of causality in microeconometrics.

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

VALUATION USING HOUSEHOLD PRODUCTION LECTURE PLAN 15: APRIL 14, 2011 Hunt Allcott

Randomized trials for policy

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Outline. Overview of Issues. Spatial Regression. Luc Anselin

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

The regression model with one stochastic regressor (part II)

Rising Wage Inequality and the Effectiveness of Tuition Subsidy Policies:

Final Exam. Economics 835: Econometrics. Fall 2010

Estimating and Testing Non-Linear Models Using Instrumental Variables

Weighting for External Validity

Identification and Extrapolation with Instrumental Variables

Using Instrumental Variables to Find Causal Effects in Public Health

Recitation #4: Solving Endogenous Growth Models

Prediction and causal inference, in a nutshell

ECON 4160: Econometrics-Modelling and Systems Estimation Lecture 9: Multiple equation models II

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Exam ECON5106/9106 Fall 2018

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

NBER WORKING PAPER SERIES ESTIMATING AND TESTING NON-LINEAR MODELS USING INSTRUMENTAL VARIABLES. Lance Lochner Enrico Moretti

Linear IV and Simultaneous Equations

Chapter 2 Regression Discontinuity Design: When Series Interrupt

Identification of Models of the Labor Market

The problem of causality in microeconometrics.

A measurement error model approach to small area estimation

Instrumental Variables

The Economics of European Regions: Theory, Empirics, and Policy

Sensitivity checks for the local average treatment effect

MIT PhD International Trade Lecture 15: Gravity Models (Theory)

Transcription:

14.662 Recitation 7 The Roy Model, isolateing, and Kirkebøen, Leuven, and Mogstad (2014) Peter Hull Spring 2015

The Roy Model Motivation 1/14 Selection: an Applied Microeconomist s Best Friend Life (and data) is all about choices E.g. schooling (labor), location (urban), insurance (PF), goods (IO) How can a dataset of zeros and ones tell us about meaningful latent economic parameters? Natural starting point: agents select optimally on potential gains Now obvious, but wasn t always: longstanding belief that job choice developed by the process of historical accident (Roy, 1951) With enough structure, link from observed to latent is straightforward (e.g. Roy (1951), Heckman (1979), Borjas (1987)) Nature of selection characterized by small set of parameters Still a lot to do on relaxing structure while staying tractable Recent attempts: Kirkebøen, Leuven, and Mogstad (2014), Hull (2015)

The Roy Model Solving Roy Borjas (1987) Roy Notation and Setup Potential wages for individual i with schooling level j {0,1}: w ij = E [w ij ] + (w ij E [w ij ]) µ j + ε ij Residuals distributed by [ ] ([ ε σ 2 ]) i0 0 0 σ N, 01 ε σ 2 i1 0 σ 01 1 Individual i chooses schooling j = 1 iff w i1 w i0 > c µ 1 µ 0 c > ε i0 ε }{{}}{{ i1 } z ν i Where c denotes relative cost (assume constant for now) Question: what is E [w ij z > ν i ] for each group? 2/14

The Roy Model Solving Roy Some Essential Normal Facts 1. Law of Iterated Expectations (not just normals): for nonrandom f ( ), E [Y f (X)] = E[E [Y X ] f (X)] 2. Linear Conditional Expectations: if X and Y are jointly normal E[Y X = x] = µ Y + σ XY (x µ X ) σ 2 X 3. Inverse Mills Ratio: if X N(µ,σ 2 ), k constant ( ) k µ φ σ E [X X > k] = µ + σ ( ) k µ 1 Φ σ ( ) k µ φ σ E [X X < k] = µ σ ( ) k µ Φ Key to remembering: E [X X < k] should be smaller than E[X ] σ 3/14

The Roy Model Solving Roy 4/14 Solving Roy Note first that [ ] ([ ] [ ε i0 0 σ 2 0 σ 2 ]) 0 σ N, 01 ν i 0 σ 2 0 σ 01 σ 2 0 + σ 2 1 2σ 01 By Fact #1, By Fact #2, So: E [w i0 z > ν i ] = µ 0 + E [ε i0 z > ν i ] = µ 0 + E [E [ε i0 ν i ] z > ν i ] 0 σ 01 E [ε i0 ν i ] = σ 2 2 2 2 ν i, where σ ν σ 0 + σ 1 2σ 01 σ ν 2 0 σ 01 E[w i0 z > ν i ] = µ 0 + σ 2 E [ν i ν i < z] σ ν 2

The Roy Model Solving Roy 5/14 Solving Roy (cont.) By Fact #3, 0 σ 01 E[w i0 i chooses 1] = µ 0 + σ 2 E [ν i ν i < z] σ ν 2 σ 01 σ 2 0 φ (z/σ ν ) = µ 0 + σ ν Φ(z/σ ν ) ( ) σ 0 σ 0 σ 1 φ (z/σ ν ) = µ 0 + ρ 01 σ1 σ ν Φ(z/σ ν ) The same steps give us ( ) σ 1 σ 0 σ 1 φ (z/σ ν ) E[w i1 i chooses 1] = µ 1 + ρ 01 σ 0 σ ν Φ(z/σ ν ) When are observed j = 1 workers above average?

The Roy Model Solving Roy Positive and Negative Roy Selection Positive selection (avg. j = 1 wage above avg. in both groups): ρ 01 σ 0 σ 1 > 0 and ρ01 > 0 σ 1 σ [ 0 ] σ 0 σ 1 = ρ 01, σ 1 σ 0 = Distribution of productivity with schooling more unequal Negative selection (avg. j = 1 wage below avg. in both sectors): ρ 01 σ 0 σ 1 < 0 and ρ01 < 0 σ 1 σ [ 0 ] σ 1 σ 0 = ρ 01, σ 0 σ 1 = Distribution of productivity without schooling more unequal [ ] σ 0 σ Also can have refugee selection, where ρ 1 01 < min, (but σ 1 σ 0 σ 0 σ 1 can t have the other case, where ρ 01 > max, 1 ) σ 1 σ 0 6/14

The Roy Model Estimating Roy Bringing Roy to Data Let D i = 1 if i selects j = 1. What does OLS of w i on D i give? σ01 σ0 2 φ (z/σ ν ) E[w i D i = 0] = µ 0 + σν Φ(z/σ ν ) σ01 σ1 2 φ ( z/σ ν ) E[w i D i = 1] = µ 1 + σν Φ( z/σ ν ) E[w i D i = 1] E[w i D i = 0] = µ 1 µ 0 }{{} +(selection bias) "treatment effect" Suppose costs are random: c i {0,1},c i (ε i1 ε i0 ) : w i = µ 0 + (µ 1 µ 0 + ε i1 ε i0 )D i + ε i0 D i = 1{µ 1 µ 0 + ε i1 ε i0 > c i } Then IV gives LATE; with Roy selection: E[µ 1 µ 0 + ε i1 ε i0 0 < ε i1 ε i0 (µ 1 µ 0 ) 1] µ }{{} 1 µ }{{} 0 LATE ATE 7/14

More on Roy Estimation Multi-Armed Roy 8/14 Estimation with Multi-Armed Roy W/Roy + unrestricted heterogeneity, valid instrument isn t enough Problem even worse with many sectors; suppose: w i = µ 0 + (w ia w i0 )A i + (w ib w i0 )B i + ε i0 With binary, independent Z i that reduces cost of sector a, IV identifies E [w ai w ai A 1i > A 0i ] =E [w ai w 0i A 1i > A 0i,B 0i = 0]P(B 0i = 0 A 1i > A 0i ) + E [w ai w bi A 1i > A 0i,B 0i = 1]P(B 0i = 1 A 1i > A 0i ) weighted average across compliers with fallback b and with fallback 0 Heckman et al. (2006), Heckman and Urzua (2010): unordered treatment and Roy selection demands a parametric model

More on Roy Estimation Hull (2015) 9/14 isolateing: a semi-parametric solution Want to deconvolute E [w ai w ai A 1i > A 0i ] into its two causal parts Can identify ω P(B 0i = 1 A 1i > A 0i ): just the first stage of B i on Z i If you can split the data into two parts ( strata ) where ω differs but E [w ai w 0i A 1i > A 0i,B 0i = j] doesn t, can solve out ( isolate ) It turns out (see Hull, 2015) two-endogenous variable IV can automate this deconvolution (and give SEs for free!) Problem: if E [w ai w 0i A 1i > A 0i,B 0i = j] also varies across strata (as you d expect with Roy selection and ω varying), isolate is biased Possible solution (work in progress!) assume no Roy selection conditional on rich enough covariates X i, weight cond. IV over X i Similar to Angrist and Fernandez-Val (2013) solution to LATE!= ATE, Angrist and Rokkanen (2016) solution to RD extrapolation

More on Roy Estimation Kirkebøen, Leuven, and Mogstad (2014) 10/14 KLM (2014): a data-driven solution Kirkebøen, Leuven, and Mogstad (2014) have data on centralized post-secondary admissions and earnings in Norway Interested in estimating the returns to fields and selection patterns Note that when (A 1i = A 0i = 0) = (B 1i = B 0i ), IV conditional on (A 0i = B 0i ) = 0 identifies E [w ai w 0i A 1i > A 0i,B 0i = 0] Application score running variable for assignment into ranked fields Sequential dictatorship assignment: truth-telling a dominant strategy Observe completed field/education and earnings Assume ranking reveals potential behavior (plausible? Could test); run fuzzy RD for each next-best field k: y = β jk d j + x ' γ k + λ jk + ε j=k d j = π jk z j + x ' ψ jk + η jk + u, j = k j=k

More on Roy Estimation Kirkebøen, Leuven, and Mogstad (2014) KLM (2014) First Stages Courtesy of Lars Kirkebøen, Edwin Leuven, and Magne Mogstand. Used with permission. 11/14

More on Roy Estimation Kirkebøen, Leuven, and Mogstad (2014) KLM (2014) IV Estimates Courtesy of Lars Kirkebøen, Edwin Leuven, and Magne Mogstand. Used with permission. 12/14

More on Roy Estimation Kirkebøen, Leuven, and Mogstad (2014) Testing for Comparative Advantage With selection on gains would expect E[Y j Y k j k] > E[Y j Y k k j] Courtesy of Lars Kirkebøen, Edwin Leuven, and Magne Mogstand. Used with permission. 13/14

Conclusions 14/14 Roy Takeaways Selection on potential gains a powerful, natural assumption Should be comfortable with basic Roy formalization and how to solve Above statistics facts are common labor tools Tight link between theory and empirics (all ID roads lead to sorting) Post-credibility revolution, we care more about what causal parameters actually represent and how they inform theory Nature of sorting bias can be just as interesting as a treatment effect With Roy selection and unknown heterogeneity, a valid instrument is not enough (ATE vs. LATE, fallback heterogeneity, RD locality) How much structure is needed/plausible? Are model-free, data-driven assumptions satisfying (e.g. isolate, KLM 14)? Or is Heckman right that we need a selection model? Would love to hear your thoughts!

MIT OpenCourseWare http://ocw.mit.edu 14.662 Labor Economics II Spring 2015 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.