MATCHING FOR EE AND DR IMPACTS

Similar documents
Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Gov 2002: 5. Matching

The Balance-Sample Size Frontier in Matching Methods for Causal Inference

Matching is a statistically powerful and conceptually

Matching. Stephen Pettigrew. April 15, Stephen Pettigrew Matching April 15, / 67

Propensity Score Methods for Causal Inference

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

Comments on Best Quasi- Experimental Practice

Massive Experiments and Observational Studies: A Linearithmic Algorithm for Blocking/Matching/Clustering

Optimal Blocking by Minimizing the Maximum Within-Block Distance

Propensity Score Matching

Selection on Observables: Propensity Score Matching.

Dynamics in Social Networks and Causality

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Section 10: Inverse propensity score weighting (IPSW)

(Mis)use of matching techniques

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Combining Experimental and Non-Experimental Design in Causal Inference

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Propensity Score Matching and Genetic Matching : Monte Carlo Results

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Matching for Causal Inference Without Balance Checking

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

PROPENSITY SCORE MATCHING. Walter Leite

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Propensity Score Weighting with Multilevel Data

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

CompSci Understanding Data: Theory and Applications

A Theory of Statistical Inference for Matching Methods in Causal Research

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Discussion of Papers on the Extensions of Propensity Score

A Theory of Statistical Inference for Matching Methods in Applied Causal Research

Job Training Partnership Act (JTPA)

Gov 2002: 4. Observational Studies and Confounding

c 2015 by Jason James Sauppe. All rights reserved.

Measuring Social Influence Without Bias

Causal Inference by Minimizing the Dual Norm of Bias. Nathan Kallus. Cornell University and Cornell Tech

Quantitative Economics for the Evaluation of the European Policy

The problem of causality in microeconometrics.

Estimating the Marginal Odds Ratio in Observational Studies

Causal Inference Basics

Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and. Analysis of Observational Studies

Targeted Maximum Likelihood Estimation in Safety Analysis

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

Small domain estimation using probability and non-probability survey data

Multivariate Analysis Cluster Analysis

Causal inference in multilevel data structures:

Metric-based classifiers. Nuno Vasconcelos UCSD

Lecture 3: Causal inference

Recommendations as Treatments: Debiasing Learning and Evaluation

Background of Matching

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1

Causal Analysis in Social Research

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr

Chapter 7: Sampling Distributions

Estimation of the Conditional Variance in Paired Experiments

Flexible Estimation of Treatment Effect Parameters

Analyzing the NYC Subway Dataset

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

What s New in Econometrics. Lecture 1

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

finite-sample optimal estimation and inference on average treatment effects under unconfoundedness

How to Use the Internet for Election Surveys

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

EMERGING MARKETS - Lecture 2: Methodology refresher

Marginal, crude and conditional odds ratios

Can a Pseudo Panel be a Substitute for a Genuine Panel?

Analyses of Variance. Block 2b

Analytics Software. Beyond deterministic chain ladder reserves. Neil Covington Director of Solutions Management GI

Inference for Average Treatment Effects

Implementing Matching Estimators for. Average Treatment Effects in STATA

Balancing Covariates via Propensity Score Weighting: The Overlap Weights

Propensity Score for Causal Inference of Multiple and Multivalued Treatments

Impact Evaluation of Mindspark Centres

Multiple Linear Regression for the Supervisor Data

DATA-ADAPTIVE VARIABLE SELECTION FOR

CSE446: non-parametric methods Spring 2017

The Balance-Sample Size Frontier in Matching Methods for Causal Inference: Supplementary Appendix

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield

arxiv: v1 [stat.me] 15 May 2011

Unit 2. Describing Data: Numerical

MA 1125 Lecture 33 - The Sign Test. Monday, December 4, Objectives: Introduce an example of a non-parametric test.

Chapter 7: Sampling Distributions

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings

Lecture 4 January 23

Classification 2: Linear discriminant analysis (continued); logistic regression

Potential Outcomes Model (POM)

Review for Algebra Final Exam 2015

Exploring EDA. Exploring EDA, Clustering and Data Preprocessing Lecture 1. Exploring EDA. Vincent Croft NIKHEF - Nijmegen

Causal Sensitivity Analysis for Decision Trees

Econometric Analysis of Cross Section and Panel Data

Variable selection and machine learning methods in causal inference

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Rubin s Potential Outcome Framework. Motivation. Example: Crossfit versus boot camp 2/20/2012. Paul L. Hebert, PhD

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

A Pilot Study on Functional Processing: Inferences of Pairwise Relationships in Systems of Three Variables. Kent L. Norman and Benjamin K.

Transcription:

MATCHING FOR EE AND DR IMPACTS Seth Wayland, Opinion Dynamics August 12, 2015

A Proposal Always use matching Non-parametric preprocessing to reduce model dependence Decrease bias and variance Better understand your data EE, DR Quasi-Experiment Randomized Experiment IEPEC 2015 2

Agenda Review current best practice for impact evaluation Review some matching methods Matching example IEPEC 2015 3

Impact Estimation Best Practice RCT + Model (to reduce bias and variance) Quasi-experiment + Matching + Model IEPEC 2015 4

Methods Model Difference-in-Difference Linear Fixed Effects Lagged Dependent Variable Match Propensity Score Mahalanobis Distance Coarsened Exact Matching Frontier IEPEC 2015 5

Feeling Lucky? Randomized experiments are guaranteed to be unbiased over repeated experiments There is only one actual experiment How sure can we be that this one is unbiased? Check the balance of treatment versus control What can we do? Match to reduce imbalance Model to correct for dependence on known and (fixed) unknown covariates IEPEC 2015 6

Applying the Rubin Causal Model For a particular unit, the causal effect of a treatment at time t is the difference between what would have happened at time t if the unit was exposed to the treatment and what would have happened at time t if the unit was not exposed to the treatment. IEPEC 2015 7

Applying the Rubin Causal Model The customer cannot be simultaneously exposed to the treatment and not exposed to the treatment We need to make some assumptions SUTVA Ignorable treatment assignment IEPEC 2015 8

Ignorable treatment assignment Model Parametrically adjust for the effect of covariates Match Non-parametrically improve balance of all included covariates Both also usually reduce variance Matching yields insight into the data IEPEC 2015 9

Matching Procedure 1. Select a distance measure 2. Select and implement a matching method 3. Assess balance, return to 1 or 2 as necessary 4. Use the matched data to perform analysis IEPEC 2015 10

Matching Procedure - Considerations Choice of treatment effect (ITT, ATE, ATT, SATT, FSATT, etc.) Choice of variables to include in matching Choice of matching method Choice of model in distance metric for Propensity Score matching Choice of balance checks IEPEC 2015 11

Example Home energy report program with an RCT design IEPEC 2015 12

Matching Methods Exact K nearest neighbors Coarsened Exact Matching Frontier Many others IEPEC 2015 13

Balance Checks Difference in Means Check all variables (don t use statistical significance) Average Mahalanobis Imbalance Mean Mahalanobis distance between all matched pairs Median L1 Distance Distance between multivariate histograms IEPEC 2015 14

When Matching Doesn t Help Coincident non-treatment changes Some whole-house programs Missing information about treatment assignment Opt-in bias? Modeling doesn t help either IEPEC 2015 15

Coarsened Exact N = 9,408, Nc = 9,355 Median L1 distance: 0.09 Much better Average mean distance: 0.37 kwh/day Somewhat worse IEPEC 2015 16

Coarsened Exact Feasible Group Non-Feasible Group IEPEC 2015 17

Coarsened Exact FSATT (N f =9,408, N c =9,355) savings = 4.3% NFSATT (N nf =592, N c =644) savings = 9.6% Weighted SATT (N=N c =10,000) savings = 4.6% Full Sample SATT (N=N c =10,000) savings = 4.8% weighted SATT = FSATT N f + NFSATT N nf N IEPEC 2015 18

A Second Proposal How do we evaluate what are the best methods/approaches for impact evaluation? We need published data and well-defined metrics Common Task Method Everyone works on the same problem Method Publish data Define evaluation metrics Periodic public evaluation of methods IEPEC 2015 19

For More Information Seth Wayland, Associate Director Opinion Dynamics swayland@opiniondynamics.com IEPEC 2015 20

Thank you xkcd.com/925 IEPEC 2015 21

Distance Metrics Exact Propensity Score Mahalanobis Euclidian is a special case IEPEC 2015 22

Match Anyway Methods K nearest neighbors (1:1) with SATT Propensity score distance Mahalanobis distance Coarsened Exact with weighted SATT L1 distance IEPEC 2015 23

Balance Metrics Treated group N = 10,000 Comparison group N c = 10,000 Average mean difference for the 12 months of the pre-period: 0.03 kwh/day Median L1 distance: 0.56 IEPEC 2015 24

K Nearest Neighbors Propensity score metric Simple model with a variable for each month of pre-period usage N = 10,000 and N c = 5,580 Average mean difference: -0.22 kwh/day Balance is a little worse IEPEC 2015 25

K Nearest Neighbors Mahalanobis distance N = 10,000 and N c = 5,762 Average mean difference: 2.9 Balance is much worse IEPEC 2015 26