Identification in Triangular Systems using Control Functions

Similar documents
Econ 2148, fall 2017 Instrumental variables II, continuous treatment

A Discontinuity Test for Identification in Nonparametric Models with Endogeneity

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Flexible Estimation of Treatment Effect Parameters

Generated Covariates in Nonparametric Estimation: A Short Review.

Unconditional Quantile Regression with Endogenous Regressors

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Partial Identification of Nonseparable Models using Binary Instruments

Nonadditive Models with Endogenous Regressors

Using data to inform policy

Final Exam. Economics 835: Econometrics. Fall 2010

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

Why experimenters should not randomize, and what they should do instead

Principles Underlying Evaluation Estimators

Control Variables, Discrete Instruments, and. Identification of Structural Functions

Nonparametric Instrumental Variables Identification and Estimation of Nonseparable Panel Models

IDENTIFICATION, ESTIMATION, AND SPECIFICATION TESTING OF NONPARAMETRIC TRANSFORMATION MODELS

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

Semiparametric Estimation of Structural Functions in Nonseparable Triangular Models

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

Identification of Nonparametric Simultaneous Equations Models with a Residual Index Structure

Correct Specification and Identification

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Instrumental Variables

Trimming for Bounds on Treatment Effects with Missing Outcomes *

4. Distributions of Functions of Random Variables

NBER WORKING PAPER SERIES

New Developments in Econometrics Lecture 16: Quantile Estimation

The Distribution of Treatment Effects in Experimental Settings: Applications of Nonlinear Difference-in-Difference Approaches. Susan Athey - Stanford

The Generalized Roy Model and Treatment Effects

What s New in Econometrics? Lecture 14 Quantile Methods

ESTIMATION OF NONPARAMETRIC MODELS WITH SIMULTANEITY

Conditions for the Existence of Control Functions in Nonseparable Simultaneous Equations Models 1

IDENTIFICATION OF MARGINAL EFFECTS IN NONSEPARABLE MODELS WITHOUT MONOTONICITY

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53

IDENTIFICATION IN NONPARAMETRIC SIMULTANEOUS EQUATIONS

Information Structure and Statistical Information in Discrete Response Models

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

Identification with Latent Choice Sets: The Case of the Head Start Impact Study

Identification of Models of the Labor Market

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Using data to inform policy

Semiparametric Estimation of Invertible Models

Necessary and sufficient conditions for nonlinear parametric function identification

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

The Economics of European Regions: Theory, Empirics, and Policy

A Note on Demand Estimation with Supply Information. in Non-Linear Models

Who wins, who loses? Tools for distributional policy evaluation

Midterm #1. Lecture 10: Joint Distributions and the Law of Large Numbers. Joint Distributions - Example, cont. Joint Distributions - Example

Identification of Endogenous Social Effects: The Reflection Problem. Manski

13 Endogeneity and Nonparametric IV

Limited Information Econometrics

1 Motivation for Instrumental Variable (IV) Regression

Nonparametric Instrumental Variable Estimation Under Monotonicity

Dealing with a Technological Bias: The Difference-in-Difference Approach

CAEPR Working Paper # Identifying Multiple Marginal Effects with a Single Binary Instrument or by Regression Discontinuity

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Financial Econometrics / 49

Math 181B Homework 1 Solution

Identification and Estimation of Bidders Risk Aversion in. First-Price Auctions

Regression Discontinuity Designs.

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix)

ECO Class 6 Nonparametric Econometrics

Generalized instrumental variable models, methods, and applications

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Censored Regression Quantiles with Endogenous Regressors

Semi and Nonparametric Models in Econometrics

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Identification of Additive and Polynomial Models of Mismeasured Regressors Without Instruments

COUNTERFACTUAL MAPPING AND INDIVIDUAL TREATMENT EFFECTS IN NONSEPARABLE MODELS WITH DISCRETE ENDOGENEITY*

Lectures on Identi cation 2

Random Coefficients on Endogenous Variables in Simultaneous Equations Models

treatment effect models

Testing Rank Similarity

Identification of Instrumental Variable. Correlated Random Coefficients Models

Quantitative Economics for the Evaluation of the European Policy

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Problem Solving. Correlation and Covariance. Yi Lu. Problem Solving. Yi Lu ECE 313 2/51

STA 2201/442 Assignment 2

Binary Models with Endogenous Explanatory Variables

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Gov 2002: 4. Observational Studies and Confounding

Nonseparable multinomial choice models in cross-section and panel data

Quantile and Average Effects in Nonseparable Panel Models 1

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Problem Set 2 Solution Sketches Time Series Analysis Spring 2010

Decomposing Real Wage Changes in the United States

Quantile Structural Treatment Effect: Application to Smoking Wage Penalty and its Determinants

δ -method and M-estimation

1 Hotz-Miller approach: avoid numeric dynamic programming

Statistical Properties of Numerical Derivatives

9.1 Orthogonal factor model.

Policy-Relevant Treatment Effects

Identification and Estimation of Triangular Simultaneous Equations Models Without Additivity

Control Functions in Nonseparable Simultaneous Equations Models 1

CS145: Probability & Computing Lecture 11: Derived Distributions, Functions of Random Variables

Lecture 4: Proofs for Expectation, Variance, and Covariance Formula

Chapter 7. Confidence Sets Lecture 30: Pivotal quantities and confidence sets

Causal Analysis After Haavelmo: Definitions and a Unified Analysis of Identification of Recursive Causal Models

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

Transcription:

Identification in Triangular Systems using Control Functions Maximilian Kasy Department of Economics, UC Berkeley Maximilian Kasy (UC Berkeley) Control Functions 1 / 19

Introduction Introduction There is a lively literature on nonparametric IV, control functions, e.g., Newey, Powell, and Vella (1999), Imbens and Newey (2009). These papers discuss identification under assumptions on the first stage relationship (additive residual/monotonicity in one-dimensional residual). Question: Generalizability? What are necessary and sufficient conditions for the existence of control functions? Answer: Dimensionality restrictions on unobserved heterogeneity/family of conditional distributions. No control function exists in the context of a generic random coefficient model. Maximilian Kasy (UC Berkeley) Control Functions 2 / 19

Introduction The nonparametric, continuous triangular system setup Y = g(x, ɛ) (1) X = h(z, η) (2) where we assume Z (ɛ, η) (3) with Z, X, Y each continuously distributed in R. Z is the exogenous instrument, X is the treatment, Y is the outcome variable. The object of interest is the structural function g. Maximilian Kasy (UC Berkeley) Control Functions 3 / 19

Introduction Control functions Idea: find a function C ( control function ) of X and Z such that, for V = C(X, Z), X ɛ V. (4) Compare Newey, Powell, and Vella (1999), Imbens and Newey (2009). In this talk, we will discuss: Conditions which are both necessary and sufficient for the existence of control functions that satisfy conditional independence and support requirements. Maximilian Kasy (UC Berkeley) Control Functions 4 / 19

Introduction Roadmap Review Counterexample with random coefficient first stage, failure of conditional independence Characterization of triangular systems, for which a control function C exist such that V = C(X, Z) is a function of first stage unobservables η alone Characterization of triangular systems, for which C exist such that V satisfies conditional independence X ɛ V Proof that no control function exists in the random coefficient model Conclusion Maximilian Kasy (UC Berkeley) Control Functions 5 / 19

Why care Review Recall the definition of the average structural function by Blundell and Powell (2003), ASF (x) := E ɛ [g(x, ɛ)]. Given a control function, the ASF is identified by ASF (x) = E V [E[g(X, ɛ) V, X = x]] = E V [E[Y V, X = x]]. (5) The first equality requires conditional independence. Identification of E[Y V, X = x] for all V requires full support of V given X. Under the same conditions, the quantile structural function (QSF) is identified. Maximilian Kasy (UC Berkeley) Control Functions 6 / 19

Review Control functions proposed in literature Newey, Powell, and Vella (1999): V = C(X, Z) = X E[X Z]. (6) Justified by an additive model for h, h(z, η) = h(z) + η. Imbens and Newey (2009): V = C(X, Z) = F [X Z]. (7) Justified by a first stage h that is strictly monotonic in a one-dimensional η. In either case conditional independence follows from V being a function of η alone. Maximilian Kasy (UC Berkeley) Control Functions 7 / 19

Counterexample Counterexample - random coefficient first stage Assume X = η 1 + η 2 Z = η (1, Z) (8) (η 1, η 2, ɛ) N(µ, Σ) (9) Z (η, ɛ), (10) and let Then V = F (X Z) = Φ ( ) (X µ η1 Zµ η2 ). (11) Var(X Z) E[ɛ V, X ] = E[ɛ V, Z] = E[ɛ X, Z] = = µ ɛ + Φ 1 Σ η1,ɛ + ZΣ η2,ɛ (V ). (12) Ση1,η 1 + 2ZΣ η1,η 2 + Z 2 Σ η2,η 2 Maximilian Kasy (UC Berkeley) Control Functions 8 / 19

Counterexample Conditional independence is violated. Q: Is there another function C for this model, such that conditional independence holds? More generally: Under what conditions does a valid control function exist? Maximilian Kasy (UC Berkeley) Control Functions 9 / 19

First characterization First characterization Sufficient condition for conditional independence: Proposition If V = C(h(Z, η), Z) does not depend on Z given η, then conditional independence Z ɛ V holds. Proof: By assumption, Z (η, ɛ). As we can write V as a function of η, Z (V (η), ɛ) Z. (13) Z ɛ V is equivalent to X ɛ V if there exists a mapping (Z, V ) (X, V ), which is true if C is invertible. Maximilian Kasy (UC Berkeley) Control Functions 10 / 19

First characterization The sufficient condition implies a one dimensional first stage: Proposition If V = C(h(Z, η), Z) does not depend on Z given η for a C(X, Z) that is smooth and almost surely invertible in X, then {h(, η)} is a one-dimensional family of functions in Z. Sketch of proof: Since C is smooth and invertible with range independent of Z, V must have one dimensional range. Inverting C gives a function h such that X = h(z, V ). The assumption that V does not depend on Z given η (!) makes the first stage structural in the sense that we can write h(z, η) = h(z, V (η)). (14) Maximilian Kasy (UC Berkeley) Control Functions 11 / 19

First characterization Remarks Identification of the ASF requires additionally that V has full support given X = x, i.e., the range of C(X, Z) must be independent of X. The family of functions {h(., η)} is one-dimensional if and only if it is possible to predict the counterfactual X under manipulation of Z from knowledge of X and Z. a much stronger requirement than identification of the ASF for the first stage relationship. Maximilian Kasy (UC Berkeley) Control Functions 12 / 19

First characterization The reverse of the last proposition holds as well: Proposition If {h(., η)} is a one-dimensional family of functions in Z and almost surely h(z, η 1 ) h(z, η 2 ) for independent draws Z, η 1, η 2 from the respective distributions of Z and η, then there exists a control function V = C(h(Z, η), Z) which does not depend on Z given η. Sketch of Proof: Choose C(X, Z) = h(z 0, h 1 (Z, X )). Then C(h(Z, η), Z) = h(z 0, η), which is a function of η alone. Maximilian Kasy (UC Berkeley) Control Functions 13 / 19

First characterization Application to the random coefficient model Here no control function satisfying the sufficient condition of proposition 1 and invertibility in X can exist. The family of functions is two-dimensional. h(z, η 1, η 2 ) = η 1 + η 2 Z (15) This implies that we cannot predict the counterfactual X under a manipulation setting Z = z, h(z, η), for a given observational unit from X and Z alone. Maximilian Kasy (UC Berkeley) Control Functions 14 / 19

Second characterization Second characterization Conditional independence can hold if and only if P(ɛ X, Z) is a one dimensional family of distributions: Proposition There exists a control function V = C(X, Z) such that conditional independence X ɛ V holds and which is invertible in Z if and only if P(ɛ X, Z) is an at most one-dimensional family of distributions that is not constant in Z if it is not constant. Sketch of Proof: By invertibility, P(ɛ X, Z) = P(ɛ X, V ). By conditional independence, P(ɛ X, V ) = P(ɛ V ). By invertibility of C, dim(v ) = dim(z) = 1. Reversely, let θ parametrize P(ɛ X, Z). Take C = θ. Maximilian Kasy (UC Berkeley) Control Functions 15 / 19

Second characterization No control function in the random coefficient model Corollary There exists no control function invertible in Z in the generic random coefficient model discussed before, such that conditional independence X ɛ V holds. Sketch of proof: ( Cov(X, ɛ Z) ɛ X, Z N µ ɛ + (X µ η1 µ η2 Z) Var(X Z), Var(ɛ) Cov ) 2 (X, ɛ Z), (16) Var(X Z) This is is a two-dimensional family for generic Σ. Maximilian Kasy (UC Berkeley) Control Functions 16 / 19

Conclusion Conclusion No control function exists in the random coefficient model. Examples of models for first stage structural relationships, where control functions do exist: First stage relationships that are monotonic in unobserved heterogeneity, of the form X = h( Z η ), which could describe the loss from missing an unknown target η, of the form X = h(z) η, where h is of non-constant sign. Maximilian Kasy (UC Berkeley) Control Functions 17 / 19

Conclusion Impossible to fully identify structural features such as the ASF or the QSF without assumptions which are hard to justify. Maybe more promising to look for identification of features that have some interpretable dependence on first stage parameters, e.g. the LATE as introduced in Imbens and Angrist (1994). Alternatively: partial identification approach pioneered by works such as Manski (2003) set identification of fully structural features under similarly weak assumptions. Maximilian Kasy (UC Berkeley) Control Functions 18 / 19

Conclusion Thanks for your time! Maximilian Kasy (UC Berkeley) Control Functions 19 / 19