Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Similar documents
Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Balancing Covariates via Propensity Score Weighting

Observational Studies and Propensity Scores

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

Gov 2002: 5. Matching

Propensity Score Methods for Causal Inference

PROPENSITY SCORE MATCHING. Walter Leite

Balancing Covariates via Propensity Score Weighting: The Overlap Weights

Overlap Propensity Score Weighting to Balance Covariates

MATCHING FOR EE AND DR IMPACTS

Matching. Stephen Pettigrew. April 15, Stephen Pettigrew Matching April 15, / 67

Selection on Observables: Propensity Score Matching.

Propensity Score Matching

Rerandomization to Balance Covariates

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Section 10: Inverse propensity score weighting (IPSW)

An Introduction to Causal Analysis on Observational Data using Propensity Scores

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

Job Training Partnership Act (JTPA)

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings

Propensity Score Weighting with Multilevel Data

THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Combining Experimental and Non-Experimental Design in Causal Inference

Introduction to Propensity Score Matching: A Review and Illustration

A Course in Applied Econometrics. Lecture 2 Outline. Estimation of Average Treatment Effects. Under Unconfoundedness, Part II

Difference-in-Differences Methods

The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Dynamics in Social Networks and Causality

II. MATCHMAKER, MATCHMAKER

CompSci Understanding Data: Theory and Applications

Background of Matching

Estimating the Marginal Odds Ratio in Observational Studies

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Variable selection and machine learning methods in causal inference

DOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS. A SAS macro to estimate Average Treatment Effects with Matching Estimators

Quantitative Economics for the Evaluation of the European Policy

Lab 4, modified 2/25/11; see also Rogosa R-session

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Randomization-Based Inference With Complex Data Need Not Be Complex!

arxiv: v1 [stat.me] 15 May 2011

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Data Integration for Big Data Analysis for finite population inference

Implementing Matching Estimators for. Average Treatment Effects in STATA

Assessing Studies Based on Multiple Regression

Authors and Affiliations: Nianbo Dong University of Missouri 14 Hill Hall, Columbia, MO Phone: (573)

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1

Robustness to Parametric Assumptions in Missing Data Models

Matching for Causal Inference Without Balance Checking

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Comments on Best Quasi- Experimental Practice

What s New in Econometrics. Lecture 1

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr

Causal modelling in Medical Research

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

The Balance-Sample Size Frontier in Matching Methods for Causal Inference: Supplementary Appendix

(Mis)use of matching techniques

Potential Outcomes Model (POM)

MMWS Software Program Manual

Achieving Optimal Covariate Balance Under General Treatment Regimes

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

PSC 504: Differences-in-differeces estimators

Propensity Score Analysis with Hierarchical Data

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Statistical Models for Causal Analysis

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

Imbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1

Integrated approaches for analysis of cluster randomised trials

PSC 504: Instrumental Variables

Causal inference in multilevel data structures:

Introduction to Econometrics. Review of Probability & Statistics

Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand

150C Causal Inference

PSC 504: Dynamic Causal Inference

Balancing Covariates via Propensity Score Weighting

Ignoring the matching variables in cohort studies - when is it valid, and why?

Counterfactual Model for Learning Systems

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Front-Door Adjustment

AGEC 661 Note Fourteen

Empirical approaches in public economics

Weighting Missing Data Coding and Data Preparation Wrap-up Preview of Next Time. Data Management

Combining Difference-in-difference and Matching for Panel Data Analysis

CSC321 Lecture 4 The Perceptron Algorithm

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies

Advising on Research Methods: A consultant's companion. Herman J. Ader Gideon J. Mellenbergh with contributions by David J. Hand

Lecture #11: Classification & Logistic Regression

Matching Methods for Observational Microarray Studies

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Marginal, crude and conditional odds ratios

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Subgroup Balancing Propensity Score

Transcription:

STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Frequency 0 2 4 6 8 Quiz 2 Histogram of Quiz2 10 12 14 16 18 20 Quiz2 > summary(quiz2) Min. 1st Qu. Median Mean 3rd Qu. Max. 11.0 14.0 15.0 15.5 17.0 20.0 Quiz 2 one-sided or two-sided p-value? (depends on question being asked) imputation: use observed control outcomes to impute missing treatment outcomes and vice versa. o class year: use observed outcomes from control sophomores to impute missing outcomes for treatment sophomores biased or unbiased : Find control units to match the units in the treatment group Restrict the sample to matched units Analyze the difference for each match (analyze as matched pair experiment) Useful when one group (usually the control) is much larger than the other Estimand Changes the estimand: now estimating the causal effect for the subpopulation of treated units ATE: Average treatment effect ATT: Average treatment effect for the treated Exact For exact matching, covariate values must match exactly 21 year old female in treatment group must be matched with 21 year old female in control group ATC: Average treatment effect for the controls 1

Inexact Matches Often, exact matching is not feasible, and matches are just as close as possible The farther apart the matches are, the more bias there will be Bias: covariate imbalance There are ways of adjusting for bias (ch 18) Can use calipers: only matches within a certain caliper are acceptable (remove units without an acceptable match) 2

? observed 35 uh oh Ideal Matches Ideal: minimize total (or average) covariate distance for pairs Hard to do computationally, especially for large sample sizes Greedy Greedy matching orders the treated units, and then sequentially chooses the closest control (ignoring effect on later matches) When doing this, helps to first match units that will be hardest to match One possibility: order by decreasing propensity score (treated units with highest propensity scores are most unlike controls) with Replacement can be done with replacement Pros: o Easier computationally (ideal matches overall same as just closest for each unit) o Better matches Cons: o Variance of estimator higher (controls can be used more than once, so less information) o Variance is harder to estimate (no longer independent) 3

with Replacement with replacement is necessary if the group you want to make inferences about is the smaller group with replacement also allows you to make inferences about the entire sample (find a match for every unit, from opposite group) Units more similar to those in the opposite group will be selected more Multiple Covariates With multiple covariates, how do you know which to prioritize? 21 year old female Which is a better match: o 18 year old female o 21 year old male Want a way to measure multivariate covariate distance Distance Metric Lots of different possible distance metrics Mahalanobis distance? Sum of squared (standardized) covariate difference in means? Difference in propensity scores? Linearized propensity score Difference between propensity scores of 0.001 and 0.01 is larger than difference between propensity scores 0.1 and 0.109 Better option: linearized propensity score, the log odds of propensity score: e(x) log 1 e(x) e(x) Logistic regression: log 1 e(x) = α + β x PS Linearized PS 0.001-3 0.01-2 0.10-0.95 0.109-0.91 0.5 0 0.9 0.95 Note: the linearized propensity score is recommended for subclassification as well, although it isn t as important in that setting Won t change subclasses, but will change your view of whether a subclass is small enough 4

Hybrid In hybrid matching, match on more than one criteria Example: exact matches are required for some covariates, and other covariates are just as close as possible o Example: 21 year old female; look for closest age only within female controls Example: match on propensity score and important covariate(s) Multiple Matches Paired matching is called 1:1 matching (1 control to 1 treated) If the control group is much bigger than the treatment group, can do 2:1 matching (2 controls to 1 treatment unit), or more to one matching Another option: caliper matching in which all controls within a certain distance (based on some metric) of a treated unit are matched with that unit Like propensity score estimation and like subclassification. there are no right matches If the matches you choose give good covariate balance, then you did a good job! Decisions Estimating propensity score: o What variables to include? o How many units to trim, if any? Subclassification: o How many subclasses and where to break? : o with or without replacement? o 1:1, 2:1,? o how to weight variables / distance measure? o exact matching for any variable(s)? o calipers for which a match is acceptable? o Lalonde Data Analyze the causal effect of a job training program on wages Data on 185 treated (participated in job training program) and 2490 controls (did not participate in job training program) Read Ch 15, 18 To Do Homework 4 (due Monday) GOAL: achieve covariate balance! 5