Matching for Causal Inference Without Balance Checking

Similar documents
Gov 2002: 5. Matching

Multivariate Matching Methods That are Monotonic Imbalance Bounding

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

A Theory of Statistical Inference for Matching Methods in Applied Causal Research

Background of Matching

A Theory of Statistical Inference for Matching Methods in Causal Research

Dynamics in Social Networks and Causality

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Propensity Score Matching

Achieving Optimal Covariate Balance Under General Treatment Regimes

Matching is a statistically powerful and conceptually

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

Alternative Balance Metrics for Bias Reduction in. Matching Methods for Causal Inference

The Balance-Sample Size Frontier in Matching Methods for Causal Inference

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Job Training Partnership Act (JTPA)

Propensity Score Matching and Genetic Matching : Monte Carlo Results

AFFINELY INVARIANT MATCHING METHODS WITH DISCRIMINANT MIXTURES OF PROPORTIONAL ELLIPSOIDALLY SYMMETRIC DISTRIBUTIONS

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

On the Use of Linear Fixed Effects Regression Models for Causal Inference

Covariate Balancing Propensity Score for General Treatment Regimes

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

PROPENSITY SCORE MATCHING. Walter Leite

Matching and Weighting Methods for Causal Inference

CEM: Software for Coarsened Exact Matching

Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies

Flexible Estimation of Treatment Effect Parameters

Implementing Matching Estimators for. Average Treatment Effects in STATA

Journal of Statistical Software

Matching. Stephen Pettigrew. April 15, Stephen Pettigrew Matching April 15, / 67

arxiv: v1 [stat.me] 15 May 2011

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Variable selection and machine learning methods in causal inference

MATCHING FOR EE AND DR IMPACTS

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region

The propensity score with continuous treatments

Selection on Observables: Propensity Score Matching.

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Gov 2002: 4. Observational Studies and Confounding

The Balance-Sample Size Frontier in Matching Methods for Causal Inference: Supplementary Appendix

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Propensity Score Methods for Causal Inference

Measuring Social Influence Without Bias

Authors and Affiliations: Nianbo Dong University of Missouri 14 Hill Hall, Columbia, MO Phone: (573)

Discussion: Can We Get More Out of Experiments?

Data Mining. CS57300 Purdue University. Jan 11, Bruno Ribeiro

Genetic Matching for Estimating Causal Effects:

Flexible, Optimal Matching for Comparative Studies Using the optmatch package

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Massive Experiments and Observational Studies: A Linearithmic Algorithm for Blocking/Matching/Clustering

Stratified Randomized Experiments

arxiv: v1 [stat.me] 18 Nov 2018

What s New in Econometrics. Lecture 1

Propensity Score for Causal Inference of Multiple and Multivalued Treatments

Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and. Analysis of Observational Studies

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Bounds on least squares estimates of causal effects in the presence of heterogeneous assignment probabilities

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

The Essential Role of Pair Matching in. Cluster-Randomized Experiments. with Application to the Mexican Universal Health Insurance Evaluation

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018

Eric Shou Stat 598B / CSE 598D METHODS FOR MICRODATA PROTECTION

Chapter 15 Sampling Distribution Models

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Optimal Blocking by Minimizing the Maximum Within-Block Distance

New Developments in Econometrics Lecture 16: Quantile Estimation

DOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS. A SAS macro to estimate Average Treatment Effects with Matching Estimators

review session gov 2000 gov 2000 () review session 1 / 38

For more information about how to cite these materials visit

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Combining Experimental and Non-Experimental Design in Causal Inference

A Course in Applied Econometrics. Lecture 2 Outline. Estimation of Average Treatment Effects. Under Unconfoundedness, Part II

Section 10: Inverse propensity score weighting (IPSW)

Matching with Multiple Control Groups, and Adjusting for Group Differences

Gov 2002: 3. Randomization Inference

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

HOMEWORK ANALYSIS #2 - STOPPING DISTANCE

Chapter 1 Statistical Inference

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

COMPARING THE MATCHING PROPERTIES OF COARSENED EXACT MATCHING, PROPENSITY SCORE MATCHING, AND GENETIC MATCHING IN A NATIONWIDE

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1

Section 9: Matching without replacement, Genetic matching

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Describing Change over Time: Adding Linear Trends

finite-sample optimal estimation and inference on average treatment effects under unconfoundedness

A Introduction to Matrix Algebra and the Multivariate Normal Distribution

CompSci Understanding Data: Theory and Applications

Section 8.1: Interval Estimation

Univariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation

ECO Class 6 Nonparametric Econometrics

Regression Discontinuity Designs

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Discussion of Papers on the Extensions of Propensity Score

Randomized trials for policy

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Transcription:

Matching for ausal Inference Without Balance hecking Gary King Institute for Quantitative Social Science Harvard University joint work with Stefano M. Iacus (Univ. of Milan) and Giuseppe Porro (Univ. of rieste) (talk at the Society for Political Methodology, University of Michigan, 7/12/08) () (talk at the Society for Political M / 16

Preview Slide: oarsened Exact Matching (EM) A very simple method of causal inference, with surprisingly powerful properties Preprocess (X, ) with EM: 1 emporarily coarsen X as much as you re willing e.g., income (±$1, 000); education (high school, college, graduate); etc. Most variables are pre-coarsened (e.g., men, women, wars, non-wars) Easy to understand, or can be automated as for a histogram 2 Perform exact matching on the coarsened X, (X ) Sort observations into strata, each with unique values of (X ) Prune any stratum with 0 treated or 0 control units 3 Pass on original (uncoarsened) units except those pruned (Optionally: apply another matching method within strata) Analyze as without matching (adding weights for stratum-size) A version of EM: Last studied 40 years ago by ochran First used many decades before that We prove: many new properties, uses, & extensions, and show how it resolves many problems in the literature Gary King (Harvard, IQSS) Matching without Balance hecking 2 / 16

haracteristics of Observational Data Plenty of data Data is of uncertain origin. reatment assignment: not random, not controlled by investigator, not known Bias-Variance radeoff. Bias: huge concern; Variance: he idea of matching: sacrifice some data to avoid bias Removing heterogeneous data will often reduce variance too not so much (Medical experiments are the reverse: small-n with random treatment assignment; don t match unless something goes wrong) Gary King (Harvard, IQSS) Matching without Balance hecking 3 / 16

Model Dependence (King and Zeng, 2006: fig.4 Political Analysis) What to do? Preprocess I: Eliminate extrapolation region (a separate step) Preprocess II: Match (prune bad matches) within interpolation region Model remaining imbalance (same methods as without matching) Gary King (Harvard, IQSS) Matching without Balance hecking 4 / 16

Matching within the Interpolation Region (Ho, Imai, King, Stuart, 2007: fig.1, Political Analysis) 5 0 5 10 15 20 25 30 4.6 4.7 4.8 4.9 5.0 5.1 5.2 Before Matching X Y Linear Model, reated Group Linear Model, ontrol Group Quadratic Model, reated Group Quadratic Model, ontrol Group 5 0 5 10 15 20 25 30 4.6 4.7 4.8 4.9 5.0 5.1 5.2 After Matching X Y Linear Model, reated Group Linear Model, ontrol Group Quadratic Model, reated Group Quadratic Model, ontrol Group Matching reduces model dependence, bias, and variance Gary King (Harvard, IQSS) Matching without Balance hecking 5 / 16

he Goals, with some more precision Notation: Y i Dependent variable i reatment variable (dichotomous) X i ovariates reatment Effect for treated ( i = 1) observation i: E i = Y i ( i = 1) Y i ( i = 0) = observed unobserved Estimate Y i ( i = 0) with Y j from controls with X j where X i X j Sample Average reatment effect on the reated: SA = 1 n i { i =1} E i Gary King (Harvard, IQSS) Matching without Balance hecking 6 / 16

Matching Methods Goal: Find matching control units to estimate Y i ( i = 0) Exact matching (X i = X j ): rarely works (except with lame X s) Approximate matching methods (X i X j ): Minimize scalar Mahalanobis distance between X i and X j Propensity score: match on scalar π =logit( X ) Others: optimal, full, genetic, caliper, subclassification, etc. All methods attempt to achieve good balance between treated and control groups, so that X is unimportant Gary King (Harvard, IQSS) Matching without Balance hecking 7 / 16

Problems With Existing Matching Methods Don t eliminate extrapolation region Don t work with multiply imputed data Most violate the congruence principle For most, the objective function is not optimized directly Not well designed for observational data: Least important (variance): matched n chosen ex ante Most important (bias): imbalance reduction checked ex post Hard to use: Improving balance on 1 variable can reduce it on others Best practice: choose n-match-check, tweak-match-check, tweak-match-check, tweak-match-check, Actual practice: choose n, match, publish, SOP. (Is balance even improved?) Gary King (Harvard, IQSS) Matching without Balance hecking 8 / 16

Largest lass of Methods: Equal Percent Bias Reducing Goal: changing balance on 1 variable should not harm others For EPBR to be useful, it requires: (a) X drawn randomly from a specified population X, (b) X Normal (or DMPES) (c) Matching algorithm is invariant to linear transformations of X. (d) All X s have the same importance w.r.t. Y (e) Y is a linear function of X. EPBR Definition: Matched sample size set ex ante, and matched original E( X m X m ) =γe( X X ) When conditions hold: Reducing mean-imbalance on one variable, reduces it on all n set ex ante; balance calculated ex post Methods can only be potentially EPBR (depends on the data) EPBR controls only univariate expected mean-imbalance Gary King (Harvard, IQSS) Matching without Balance hecking 9 / 16

A New lass of Methods: Monotonic Imbalance Bounding No restrictions on data types Designed for observational data (reversing EPBR): Most important (bias): degree of balance chosen ex ante Least important (variance): matched n checked ex post Balance is measured in sample (like blocked designs), not merely in expectation (like complete randomization) overs all forms of imbalance: means, interactions, nonlinearities, moments, multivariate histogram, etc. One adjustable tuning parameter per variable onvenient monotonicity property: Reducing maximum imbalance on one X : no effect on others MIB Formally: D(X ɛ J,, X ɛ J, ) γ J(ɛ J ) D(X ɛ H,, X ɛ H, ) γ H(ɛ H ) vars to adjust (J) remaining vars (H) reated and control X variables to adjust Remaining treated and control X variables Imbalance given chosen distance metric Bounds on Gary King (Harvard, IQSS) Matching without Balance hecking 10 / 16

EM as an MIB Method ɛ defines coarsening (ɛ = 0 is exact matching) We Prove: EM bounds the treated-control group difference, within strata and globally, for: means, variances, skewness, covariances, comoments, coskewness, co-kurtosis, quantiles, and full multivariate histogram. = User sets ɛ and thus controls all multivariate differences, interactions, and nonlinearities, up to the chosen level Matched n is determined ex post If ɛ is set too large: you re left modeling remaining imbalances If ɛ is set too small: n may be too small What if ɛ is set appropriately, but n is still too small? No magic method of matching can save you; you re stuck modeling or collecting better data Gary King (Harvard, IQSS) Matching without Balance hecking 11 / 16

Other EM properties we prove Meets the congruence principle he principle: data space = analysis space Estimators that violate it are nonrobust and counterintuitive EM: ɛ j is set using each variable s units E.g., calipers (strata centered on each unit): would bin college drop out with 1st year grad student; and not bin Bill Gates & Warren Buffett Automatically eliminates extrapolation region (no separate step) Approximate invariance to measurement error: EM pscore Mahalanobis Genetic % ommon Units 96.5 70.2 80.9 80.0 Bounds model dependence (not merely imbalance) Bounds causal effect estimation error (not merely model dependence) omputing: extremely fast and memory-efficient even for extremely large data sets; can be fully automated Simple to teach: coarsen, then exact match Gary King (Harvard, IQSS) Matching without Balance hecking 12 / 16

EM in Practice: EPBR-ompliant Data Monte arlo: X N 5 (0, Σ) and X N 5 (1, Σ). n = 2, 000 Allow MAH & PS to match with replacement; use automated EM Difference in means: X 1 X 2 X 3 X 4 X 5 Seconds initial 1.00 1.00 1.00 1.00 1.00 MAH.20.20.20.20.20.28 PS.11.06.03.06.03.16 EM.04.02.06.06.04.08 Local and multivariate imbalance: X 1 X 2 X 3 X 4 X 5 L 1 initial 1.24 PS 2.38 1.25.74 1.25.74 1.18 MAH.56.36.29.36.29 1.13 EM.42.26.17.22.19.78 EM dominates EPBR-methods in EPBR Data Gary King (Harvard, IQSS) Matching without Balance hecking 13 / 16

EM in Practice: Non-EPBR Data Monte arlo: Exact replication of Diamond and Sekhon (2005), using data from Dehejia and Wahba (1999). EM coarsening automated. BIAS SD RMSE Seconds L 1 initial 423.7 1566.5 1622.6.00 1.28 MAH 784.8 737.9 1077.2.03 1.08 PS 260.5 1025.8 1058.4.02 1.23 GEN 78.3 499.5 505.6 27.38 1.12 EM.8 111.4 111.4.03.76 EM works well in non-epbr data too Gary King (Harvard, IQSS) Matching without Balance hecking 14 / 16

Extensions of EM EM and Multiple Imputation for Missing Data 1 put missing observation in stratum where plurality of imputations fall 2 pass on uncoarsened imputations to analysis stage 3 Use the usual MI combining rules to analyze Multicategory treatments: No modification necessary; keep all strata with at least one value of each value of Blocking in Randomized Experiments: no modification needed; randomly assign within EM strata Automating user choices Histogram bin size calculations, Estimated SA error bound, Progressive oarsening Detecting Extreme ounterfactuals (related to convex hull) Improving Existing Matching Methods: Other methods applied within EM strata inherit EM properties Propensity or Prognosis Score matching Genetic Matching (now in GenMatch) Synthetic Matching (Synth by Abadie, Diamond, & Hainmueller) Nonparametric Adjustment (Abadie & Imbens) & whatever else you all come up with. Gary King (Harvard, IQSS) Matching without Balance hecking 15 / 16

For more information http://gking.harvard.edu/cem Gary King (Harvard, IQSS) Matching without Balance hecking 16 / 16