CAUSAL INFERENCE. Technical Track Session I. Phillippe Leite. The World Bank

Similar documents
INSTRUMENTAL VARIABLES

4th Indian Institute of Astrophysics - PennState Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur. Correlation and Regression

MATCHING TECHNIQUES. Technical Track Session VI. Emanuela Galasso. The World Bank

Evaluating enterprise support: state of the art and future challenges. Dirk Czarnitzki KU Leuven, Belgium, and ZEW Mannheim, Germany

Technical Track Session I:

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Technical Track Session I: Causal Inference

REGRESSION DISCONTINUITY (RD) Technical Track Session V. Dhushyanth Raju Julieta Trias The World Bank

Bootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) >

CHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came.

MATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank

Lecture 13: Markov Chain Monte Carlo. Gibbs sampling

Lesson Plan. Recode: They will do a graphic organizer to sequence the steps of scientific method.

Functional Form and Nonlinearities

In the OLG model, agents live for two periods. they work and divide their labour income between consumption and

A Matrix Representation of Panel Data

What is Statistical Learning?

THE LIFE OF AN OBJECT IT SYSTEMS

Assessment Primer: Writing Instructional Objectives

Name: Block: Date: Science 10: The Great Geyser Experiment A controlled experiment

Competency Statements for Wm. E. Hay Mathematics for grades 7 through 12:

Five Whys How To Do It Better

A Quick Overview of the. Framework for K 12 Science Education

Simple Linear Regression (single variable)

READING STATECHART DIAGRAMS

Comparing Several Means: ANOVA. Group Means and Grand Mean

A New Evaluation Measure. J. Joiner and L. Werner. The problems of evaluation and the needed criteria of evaluation

How do scientists measure trees? What is DBH?

NAME: Prof. Ruiz. 1. [5 points] What is the difference between simple random sampling and stratified random sampling?

Hypothesis Tests for One Population Mean

Differentiation Applications 1: Related Rates

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff

(2) Even if such a value of k was possible, the neutrons multiply

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff

Lab 1 The Scientific Method

Resampling Methods. Chapter 5. Chapter 5 1 / 52

Unit 1: Introduction to Biology

Modelling of Clock Behaviour. Don Percival. Applied Physics Laboratory University of Washington Seattle, Washington, USA

If (IV) is (increased, decreased, changed), then (DV) will (increase, decrease, change) because (reason based on prior research).

Admin. MDP Search Trees. Optimal Quantities. Reinforcement Learning

Excessive Social Imbalances and the Performance of Welfare States in the EU. Frank Vandenbroucke, Ron Diris and Gerlinde Verbist

Introduction to Quantitative Genetics II: Resemblance Between Relatives

PSU GISPOPSCI June 2011 Ordinary Least Squares & Spatial Linear Regression in GeoDa

Science Unit: School year: 2006/2007

WRITING THE REPORT. Organizing the report. Title Page. Table of Contents

Reinforcement Learning" CMPSCI 383 Nov 29, 2011!

Math Foundations 20 Work Plan

IB Sports, Exercise and Health Science Summer Assignment. Mrs. Christina Doyle Seneca Valley High School

CONSTRUCTING STATECHART DIAGRAMS

IN a recent article, Geary [1972] discussed the merit of taking first differences

1 The limitations of Hartree Fock approximation

Emphases in Common Core Standards for Mathematical Content Kindergarten High School

Building Consensus The Art of Getting to Yes

Thermodynamics Partial Outline of Topics

Collocation Map for Overcoming Data Sparseness

Maximum A Posteriori (MAP) CS 109 Lecture 22 May 16th, 2016

MACE For Conformation Traits

Multiple Source Multiple. using Network Coding

A Regression Solution to the Problem of Criterion Score Comparability

Writing Guidelines. (Updated: November 25, 2009) Forwards

SPECIMEN. Candidate Surname. Candidate Number

How T o Start A n Objective Evaluation O f Your Training Program

CS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007

Cambridge Assessment International Education Cambridge Ordinary Level. Published

BASD HIGH SCHOOL FORMAL LAB REPORT

AP Literature and Composition. Summer Reading Packet. Instructions and Guidelines

ENSC Discrete Time Systems. Project Outline. Semester

Chapter 3: Cluster Analysis

Phys. 344 Ch 7 Lecture 8 Fri., April. 10 th,

The Law of Total Probability, Bayes Rule, and Random Variables (Oh My!)

Section 11 Simultaneous Equations

Math 10 - Exam 1 Topics

SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis

Least Squares Optimal Filtering with Multirate Observations

Chapter 8: The Binomial and Geometric Distributions

Chairville Elementary Science Fair Guide. Everything you need to succeed At the Chairville Science Fair

Do big losses in judgmental adjustments affect experts behaviour? Fotios Petropoulos, Robert Fildes and Paul Goodwin

Weathering. Title: Chemical and Mechanical Weathering. Grade Level: Subject/Content: Earth and Space Science

Systems and Modeling Introduction

EASTERN ARIZONA COLLEGE Introduction to Statistics

Sequential Allocation with Minimal Switching

AP Statistics Notes Unit Five: Randomness and Probability

SIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST. Mark C. Otto Statistics Research Division, Bureau of the Census Washington, D.C , U.S.A.

Statistics, Numerical Models and Ensembles

Optimization Programming Problems For Control And Management Of Bacterial Disease With Two Stage Growth/Spread Among Plants

UN Committee of Experts on Environmental Accounting New York, June Peter Cosier Wentworth Group of Concerned Scientists.

CS 109 Lecture 23 May 18th, 2016

Document for ENES5 meeting

Getting Involved O. Responsibilities of a Member. People Are Depending On You. Participation Is Important. Think It Through

1b) =.215 1c).080/.215 =.372

We can see from the graph above that the intersection is, i.e., [ ).

CHAPTER 3 INEQUALITIES. Copyright -The Institute of Chartered Accountants of India

k-nearest Neighbor How to choose k Average of k points more reliable when: Large k: noise in attributes +o o noise in class labels

Statistical Learning. 2.1 What Is Statistical Learning?

Eric Klein and Ning Sa

Source Coding and Compression

CESAR Science Case The differential rotation of the Sun and its Chromosphere. Introduction. Material that is necessary during the laboratory

English 10 Pacing Guide : Quarter 2

Kinetic Model Completeness

Transcription:

CAUSAL INFERENCE Technical Track Sessin I Phillippe Leite The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Phillippe Leite fr the purpse f this wrkshp Plicy questins are causal in nature Cause-effect relatinships are a part f what plicy makers d: Des schl decentralizatin imprve schl quality? Des ne mre year f educatin cause higher incme? Des cnditinal cash transfers cause better health utcmes in children? Hw d we imprve student learning? 1

Standard Statistical Analysis Tls Likelihd and ther estimatin techniques. Aim T infer parameters f a distributin frm samples drawn f that distributin. Uses With the help f such parameters, ne can: Infer assciatin amng variables, Estimate the likelihd f past and future events, Update the likelihd f events in light f new evidence r new measurement. Standard Statistical Analysis Cnditin Fr this t wrk well, experimental cnditins must remain the same. But ur plicy questins were: If I decentralize schls, will quality imprve? If I find a way t make a child g t schl lnger, will she earn mre mney? If I start giving cash t families, will their children becme healthier? If I train teachers, will students learn mre? 2

Causal Analysis Fr causal questins, we need t infer aspects f the data generatin prcess. We need t be able t deduce: 1. the likelihd f events under static cnditins, (as in Standard Statistical Analysis) and als 2. the dynamics f events under changing cnditins. Causal Analysis dynamics f events under changing cnditins includes: 1. Predicting the effects f interventins. 2. Predicting the effects f spntaneus changes. 3. Identifying causes f reprted events. 3

Causatin vs. Crrelatin Standard statistical analysis/prbability thery: The wrd cause is nt in its vcabulary. Allws us t say is that tw events are mutually crrelated, r dependent. This is nt enugh fr plicy makers They lk at ratinales fr plicy decisins: if we d XXX, then will we get YYY? We need a vcabulary fr causality. THE RUBIN CAUSAL MODEL Vcabulary fr Causality 4

Ppulatin & Outcme Variable Define the ppulatin by U. Each unit in U is dented by u. The utcme f interest is Y. Als called the respnse variable. Fr each u U, there is an assciated value Y(u). Causes/Treatment experiments. Causes are thse things that culd be treatments in hypthetical Fr simplicity, we assume that there are just tw pssible states: Unit u is expsed t treatment and Unit u is expsed t cmparisn. - Rubin 5

The Treatment Variable Let D be a variable that indicates the state t which each unit in U is expsed. D = 1 If unit u is expsed t treatment 0 If unit u is expsed t cmparisn Where des D cme frm? In a cntrlled study: cnstructed by the experimenter. In an uncntrlled study: determined by factrs beynd the experimenter s cntrl. Linking Y and D Y=respnse variable D= treatment variable The respnse Y is ptentially affected by whether u receives treatment r nt. Thus, we need tw respnse variables: Y 1 (u) is the utcme if unit u is expsed t treatment. Y 0 (u) is the utcme if unit u is expsed t cmparisn. 6

The effect f treatment n utcme Treatment variable D D = 1 If unit u is expsed t treatment 0 If unit u is expsed t cmparisn Respnse variable Y Y 1 (u) is the utcme if unit u is expsed t treatment Y 0 (u) is the utcme if unit u is expsed t cmparisn Fr any unit u, treatment causes the effect δ u = Y 1 (u) - Y 0 (u) But there is a prblem: Fr any unit u, treatment causes the effect δ u = Y 1 (u) - Y 0 (u) Fundamental prblem f causal inference Fr a given unit u, we bserve either Y 1 (u) r Y 0 (u), it is impssible t bserve the effect f treatment n u by itself! We d nt bserve the cunterfactual If we give u treatment, then we cannt bserve what wuld have happened t u in the absence f treatment. 7

S what d we d? Instead f measuring the treatment effect n unit u, we identify the average treatment effect fr the ppulatin U (r fr sub-ppulatins) Y ( u) Y ( u) u ATE E [ Y ( u) Y ( u)] U Y U U Y E [ Y ( u)] E [ Y ( u)] 1 U 0 ( 1) Estimating the ATE S, (1) Replace the impssible-t-bserve treatment effect f D n a specific unit u, (2) with the pssible-t-estimate average treatment effect f D ver a ppulatin U f such units. Althugh E U (Y 1 ) and E U (Y 0 ) cannt bth be calculated, they can be estimated. Mst ecnmetrics methds attempt t cnstruct frm bservatinal data cnsistent estimatrs f: E U (Y 1 ) = Y 1 and E U (Y 0 )= Y 0 8

A simple estimatr f ATE U S we are trying t estimate: ATE U = E U (Y 1 ) - E U (Y 0 ) = Y 1 - Y 0 Cnsider the fllwing simple estimatr: δ = [ Y 1 D = 1] - [ Y 0 D =0 ] (1) (2) Nte Equatin (1) is defined fr the whle ppulatin. Equatin (2) is an estimatr t be cmputed n a sample drawn frm that ppulatin. An imprtant lemma Lemma: If we assume that [ Y D 1] [ Y D 0] 1 1 and [ Y D 1] [ Y D 0] 0 0 then ˆ [ Y ˆ D 1] -[ Y ˆ D 0] is a cnsistent estimatr f Y -Y 9

Fundamental Cnditins Thus, a sufficient cnditin fr the simple estimatr t cnsistently estimate the true ATE is that: And [Y 1 D=1]=[Y 1 D=0] The average utcme under treatment Y 1 is the same fr the treatment (D=1) and the cmparisn (D=0) grups [Y 0 D=1]=[Y 0 D=0] The average utcme under cmparisn Y 0 is the same fr the treatment (D=1) and the cmparisn (D=0) grups When will thse cnditins be satisfied? It is sufficient that treatment assignment D be uncrrelated with the ptential utcme distributins f Y 0 and Y 1 Intuitively, there can be n crrelatin between (1) Whether smene gets the treatment and (2) Hw much that persn ptentially benefits frm the treatment. The easiest way t achieve this uncrrelatedness is thrugh randm assignment f treatment. 10

Anther way f lking at it With sme algebra, it can be shwn that: ˆ simple true estimatr impact [ Y D 1] [ Y D 0] 0 0 (1 ) Baseline Difference { D 1} { D 0} Hetergeneus Respnse t Treatment Anther way f lking at it (in wrds) There are tw surces f bias that need t be eliminated frm estimates f causal effects : Baseline difference/selectin bias Hetergeneus respnse t the treatment Mst f the methds available nly deal with selectin bias. 11

Treatment n the Treated treated that is f substantive interest: Average Treatment Effect is nt always the parameter f interest ften, it is the average treatment effect fr the TOT E [ Y ( u) Y ( u) D 1] E [ Y ( u) D 1] E [ Y ( u) D 1] Treatment n the Treated If we need t estimate Treatment n the Treated: TOT E [ Y ( u) D 1] E [ Y ( u) D 1] Then the simple estimatr ˆ [ Y ˆ D 1]-[ Y ˆ D 0] [ Y D 1] [ Y D 0] 0 0 (2) cnsistently estimates Treatment n the Treated if: N baseline difference between the treatment and cmparisn grups. 12

References Judea Pearl (2000): Causality: Mdels, Reasning and Inference, Cambridge University press. (Bk) Chapters 1, 5 and 7. Trygve Haavelm (1944): The prbability apprach in ecnmetrics, Ecnmetrica 12, pp. iii-vi+1-115. Arthur Gldberger (1972): Structural Equatins Methds in the Scial Sciences, Ecnmetrica 40, pp. 979-1002. Dnald B. Rubin (1974): Estimating causal effects f treatments in randmized and nnrandmized experiments, Jurnal f Educatinal Psychlgy 66, pp. 688-701. Paul W. Hlland (1986): Statistics and Causal Inference, Jurnal f the American Statistical Assciatin 81, pp. 945-70, with discussin. Thank Yu 13

? Q & A 14