Technical Track Session I:

Similar documents
Technical Track Session I: Causal Inference

CAUSAL INFERENCE. Technical Track Session I. Phillippe Leite. The World Bank

Click to edit Master title style

Econometric Causality

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.

An example to start off with

1 Impact Evaluation: Randomized Controlled Trial (RCT)

Treatment Effects. Christopher Taber. September 6, Department of Economics University of Wisconsin-Madison

Quantitative Economics for the Evaluation of the European Policy

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Research Design: Causal inference and counterfactuals

Statistical Models for Causal Analysis

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

Causal Inference. Prediction and causation are very different. Typical questions are:

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Notes 6: Multivariate regression ECO 231W - Undergraduate Econometrics

Potential Outcomes Model (POM)

Selection on Observables: Propensity Score Matching.

One Economist s Perspective on Some Important Estimation Issues

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles

Differences in Differences (Dif-in and Panel Data. Technical Session III. Manila, December Manila, December Christel Vermeersch

Predicting the Treatment Status

Recitation Notes 6. Konrad Menzel. October 22, 2006

Impact Evaluation of Rural Road Projects. Dominique van de Walle World Bank

A Distinction between Causal Effects in Structural and Rubin Causal Models

Empirical approaches in public economics

Bounding the Probability of Causation in Mediation Analysis

6. Assessing studies based on multiple regression

CompSci Understanding Data: Theory and Applications

Causality II: How does causal inference fit into public health and what it is the role of statistics?

From Causality, Second edition, Contents

Job Training Partnership Act (JTPA)

The Distribution of Treatment Effects in Experimental Settings: Applications of Nonlinear Difference-in-Difference Approaches. Susan Athey - Stanford

Probability and Probability Distributions. Dr. Mohammed Alahmed

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region

Causal Inference with Big Data Sets

I CAN STATEMENTS 6TH GRADE SOCIAL STUDIES

Notes on causal effects

CORRELATION, ASSOCIATION, CAUSATION, AND GRANGER CAUSATION IN ACCOUNTING RESEARCH

The World Bank Health System Performance Reinforcement Project (P156679)

ECON Introductory Econometrics. Lecture 17: Experiments

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

External validity, causal interaction and randomised trials

Exploratory Causal Analysis in Bivariate Time Series Data Abstract

The World Bank TOGO Community Development and Safety Nets Project (P127200)

The World Bank Ghana - Maternal, Child Health and Nutrition Project (P145792)

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Controlling for Time Invariant Heterogeneity

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

EMERGING MARKETS - Lecture 2: Methodology refresher

Causal Analysis in Social Research

Causal Sensitivity Analysis for Decision Trees

Supplemental Material for Policy Deliberation and Voter Persuasion: Experimental Evidence from an Election in the Philippines

Rubin s Potential Outcome Framework. Motivation. Example: Crossfit versus boot camp 2/20/2012. Paul L. Hebert, PhD

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Chapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes

Analysis of Panel Data: Introduction and Causal Inference with Panel Data

Causal mediation analysis: Definition of effects and common identification assumptions

Causality and Experiments

Experiments and Quasi-Experiments

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (

REVIEW OF SOCIAL ECONOMY, VOL. LXIII, NO. 1, MARCH Book Reviews

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015

Introduction to Panel Data Analysis

The World Bank CG Rep. LISUNGI Safety Nets System Project (P145263)

causal inference at hulu

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Sensitivity checks for the local average treatment effect

Applied Microeconometrics (L5): Panel Data-Basics

Introduction to Causal Calculus

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX

Counterfactual worlds

Electronic Research Archive of Blekinge Institute of Technology

Gov 2002: 4. Observational Studies and Confounding

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Ignoring the matching variables in cohort studies - when is it valid, and why?

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Analysis of propensity score approaches in difference-in-differences designs

Introduction to Econometrics

Differences-in- Differences. November 10 Clair

Effect Modification and Interaction

Randomization-Based Inference With Complex Data Need Not Be Complex!

Advanced Statistical Methods for Observational Studies L E C T U R E 0 1

Lecture Note 2 Causal Inference in Economics, with an Application to the Minimum Wage Debate

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (

Introduction to Statistical Inference

Running head: GEOGRAPHICALLY WEIGHTED REGRESSION 1. Geographically Weighted Regression. Chelsey-Ann Cu GEOB 479 L2A. University of British Columbia

Combining Experimental and Non-Experimental Design in Causal Inference

Microeconometrics. Bernd Süssmuth. IEW Institute for Empirical Research in Economics. University of Leipzig. April 4, 2011

Causal analysis after Haavelmo. 8th/last Lecture - Hedibert Lopes

Public Disclosure Copy

Lecture 34 Woodward on Manipulation and Causation

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

The Doubly-Modifiable Structural Model (DMSM) The Modifiable Structural Model (MSM)

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Econometrics of Policy Evaluation (Geneva summer school)

Transcription:

Impact Evaluation Technical Track Session I: Click to edit Master title style Causal Inference Damien de Walque Amman, Jordan March 8-12, 2009 Click to edit Master subtitle style Human Development Human Network Development Network Middle East and North Africa Middle East Region and North Africa Region World Bank Institute Spanish Impact Evaluation Fund

Motivation The research questions that motivate most studies in the health sciences are causal in nature. For example: What is the efficacy of a given drug a given population? What fraction of deaths from a given disease could have been avoided by a given treatment or policy? 2

Motivation The most challenging empirical questions in economics also involve causal-effect relationships: Does school decentralization improve schools quality? Does one more year of education causes higher income? Does conditional cash transfers causes better health outcomes in children? 3

Motivation Interest in these questions is motivated by: Policy concerns Do public programs reduce poverty? Theoretical considerations Problems facing individual decision makers 4

Intuition of the problem: a hypothetical example A conditional cash transfer (CCT) program for pregnant women They will be under periodic medical checkouts The program focuses on better nutrition Periodic meetings about risks of smoking and drinking Key question: does the program have any impact on children s birth weight? 5

Cash transfers (CCTs) and birth weight Suppose your data are: women in a town with CCTs and women in a nearby town without CCTs Y = Average birth weight in grams Before the CCTs started?? Women Women Difference Double in town in nearby town Difference with CCT without CCT Once the CCTs started 3,250 3,100 150? 6

Cash transfers (CCTs) and birth weight Suppose your data are: women in a town with the program and women in a nearby town (without the program) Y = Average birth weight in grams Women Women Difference Double in town in nearby town Difference with CCTs without CCTs Before the program started 3,025 2,840 185 Once the program started 3,250 3,100 150-35 7

Cash transfers (CCTs) and birth weight Suppose you have data on women who got assigned to the program randomly Y = average birth weight in grams Women Women Difference Double in town in nearby town Difference with CCTs without CCTs Before the program started 3,025 2,840 185 Once the program started 3,250 3,100 150-35 Women Women Difference Double in town in nearby town Difference with CCTs BY RANDOMIZATION Before the program started 3,028 3,028 0 Once the program started 3,250 3,105 145 145 8

Standard Statistical Analysis Tools: likelihood and other estimation techniques Aim: to infer parameters of a distribution from samples drawn of that distribution. Uses: With the help of such parameters, one can: Infer association among variables, Estimate the likelihood of past and future events, Update the likelihood of events in light of new evidence or new measurement. Condition for this to work well: experimental conditions must remain the same. 9

Causal Analysis Causal Analysis goes one step further than Standard Statistical Analysis: Its aim is to infer aspects of the data generation process. With the help of such aspects, one can deduce the likelihood of events under static conditions, (as in Standard Statistical Analysis) and also the dynamics of events under changing conditions. 10

Causal Analysis dynamics of events under changing conditions includes : 1. Predicting the effects of interventions 2. Predicting the effects of spontaneous changes 3. Identifying causes of reported events 11

Causation versus Correlation Standard statistical analysis/ probability theory: The word cause is not in its vocabulary Allows us to say is that two events are mutually correlated, or dependent if we find one, we can expect to find the other This is not enough for policy makers They look rationales for policy decisions: if we do XXX, then will we get YYY? Hence we must supplement the language of probability with a vocabulary for causality. 12

The Rubin Causal Model Define the population by U. Each unit in U is denoted by u. For each u U, there is an associated value Y(u) of the variable of interest Y, which we call: the response variable. Rubin takes the position that causes are only those things that could be treatments in hypothetical experiments. 13

For simplicity, we assume that there are just two causes or level of treatment. Let D be a variable that indicates the cause to which each unit in U is exposed: D 1 if unit u is exposed to treatment = 0 if unit u is exposed tocontrol In a controlled study, D is constructed by the experimenter. In an uncontrolled study, D is determined by factors beyond the experimenter s control. 14

The response Y is potentially affected by whether u receives treatment or not. Thus, we need two response variables: Y( u) is the outcome if unit u is exposed to treatment 1 Y ( u) is the outcome if unit u is exposed to control 0 15

D 1 if unit u is exposed to treatment = 0 if unit u is exposed to control Y( u) is the outcome if unit u is exposed to treatment 1 Y ( u) is the outcome if unit u is exposed to control 0 Then, the outcome of each unit u can be written as: Y ( u) = DY ( u) + (1 D) Y ( u) 1 0 Note: This definition assumes that the treatment status of one unit does not affect the potential outcomes of other units. 16

Definition: For every unit u, treatment causes the effect δ = Y( u) Y ( u) u 1 0 Fundamental Problem of Causal Inferene c : For a given u, we observe either Y( u) OR Y ( u) 1 1 0 We cannot observe the value of Y ( u) and Y ( u) on the same unit u it is impossible to observe the effect of treatment on u by itself. 0 Issue: We do not have the counterfactual evidence for u i.e. what would have happened to u in the absence of treatment. 17

Given that the treatment effect for a single unit u cannot be observed, we aim to identify the average treatment effect for the population U (or for subpopulations). The average treatment effect ATE over U (or subpopulations of U): TE u 1 ATE = E [ Y ( u) Y ( u)] U u = = δ U 1 1 0 ( u) Y ( u) 0 1 0 E [ Y( u)] E [ Y ( u)] U Y = Y Y = δ = U 0 ( 1) 18

The statistical solution replaces the impossible-to-observe treatment effect of t on a specific unit u with the possibleto-estimate average treatment effect of t over a population U of such units. Although E U (Y 1 ) and E U (Y 0 ) cannot both be calculated, they can be estimated. Most econometrics methods attempt to construct from observational data consistent estimates of E ( Y) = Y and E ( Y ) = Y U 1 1 U 0 0 19

So we are trying to estimate: ATE = E [ Y( u)] E [ Y ( u)] U U 1 U 0 = Y Y 1 0 (1) Consider the following simple estimator of ATE U : ˆ [ ˆ 1]-[ ˆ δ = Y D= Y D= 0] (2) 1 0 equation (1) is defined for the whole population, equation (2) is an estimator to be evaluated on a sample drawn from that population 20

Lemma: If we assume that [ Y D= 1] = [ Y D= 0 ] 1 1 and [ Y D = 1] = [ Y D = 0 0 0 then ˆ ˆ ˆ δ = [ Y D= 1] -[ Y D= 0] 1 0 is a consistent estimator of δ = Y -Y 1 0 21

Thus, a sufficient condition for the simple estimator to consistently estimate the true ATE is that: [ Y D= 1] = [ Y D= 0 ] 1 1 and [ Y D= 1] = [ Y D= 0 0 0 The average outcome under treatment Y 1 is the same for the treatment (D=1) and the control (D=0) groups The average outcome under control is the same for the treatment (D=1) and the control (D=0) groups Y 0 23

When will those conditions be satisfied? It is sufficient that treatment assignment D be uncorrelated with the potential outcome distributions of Y 0 and Y 1. Intuitively: there can be no correlation between Whether someone gets the treatment How much that person potentially benefits from the treatment The easiest way to achieve this uncorrelatedness is through random assignment of treatment. 24

Another way of looking at it After a bit of algebra, it can be shown that: ˆ δ = ( ) δ + Y D= Y D= 0 0 simple estimator true impact [ 1] [ 0 ] Baseline Difference ( ) + (1 π) δ δ { D= 1} { D = 0} Treatment Heterogeneity 25

Another way of looking at it (in words) There are two sources of biases that need to be eliminated from estimates of causal effects from observational studies. Baseline difference. (selection bias) Treatment Heterogeneity. Most of the methods available only deal with selection bias 26

Treatment on the Treated ATE is not always the parameter of interest. Often, it is the average treatment effect for the treated that is of substantive interest: TOT = E [ Y( u) Y ( u) D = 1] 1 0 = E [ Y( u) D = 1] E [ Y ( u) D = 1] 1 0 27

Treatment on the Treated If we need to estimate TOT TOT = E [ Y( u) D = 1] E [ Y ( u) D = 1] 1 0 Then the simple estimator (2) ˆ δ = [ Y ˆ D= 1] -[ Y ˆ D= 0] 1 0 consistently estimates TOT if: [ Y D= 1] = [ Y D= 0 0 0 No baseline difference between the treated and control groups 28

References Judea Pearl (2000): Causality: Models, Reasoning and Inference, Cambridge University press. (Book) Chapters 1, 5 and 7. Trygve Haavelmo (1944): The probability approach in econometrics, Econometrica 12, pp. iii-vi+1-115. Arthur Goldberger (1972): Structural Equations Methods in the Social Sciences, Econometrica 40, pp. 979-1002. Donald B. Rubin (1974): Estimating causal effects of treatments in randomized and nonrandomized experiments, Journal of Educational Psychology 66, pp. 688-701. Paul W. Holland (1986): Statistics and Causal Inference, Journal of the American Statistical Association 81, pp. 945-70, with discussion. 29