PROPENSITY SCORE MATCHING. Walter Leite
|
|
- Doris Moore
- 5 years ago
- Views:
Transcription
1 PROPENSITY SCORE MATCHING Walter Leite 1
2 EXAMPLE Question: Does having a job that provides or subsidizes child care increate the length that working mothers breastfeed their children? Treatment: Working for a company that provides or subsidizes child care Outcome: age of the child in weeks when breastfeeding ended Data source: National Longitudinal Survey of Youth 1979 (NLSY79) and the NLSY79 Children and Youth Sample size: Child care was provided or subsidized in 107 (8.85%) of 1209 cases. 2
3 ESTIMATION OF PROPENSITY SCORES FOR EXAMPLE 31 covariates were selected; Examples: benefits provided by the mother s current job (i.e., life insurance, dental insurance, profit sharing, retirement, training opportunities), the mother s education level, hours worked per week, and employment sector, family size, amount of public assistance received by the family, and whether a cesarean section was performed. Estimation was performed using logistic regression with the glm function of the base R package. 3
4 PROPENSITY SCORES FOR MATCHING It is advantageous to match on the linear propensity score (i.e., the logit of the propensity score) rather than the propensity score itself, because it avoids compression around zero and one. log(e(x)) log ex ( ) 1 ex ( ) 4
5 Common support region Cases excluded Range of matched cases. Participants Nonparticipants Predicted Probability 5
6 COMMON SUPPORT REQUIREMENTS Matching to estimate the ATT only requires that the distribution of propensity scores for the treated is contained within the distribution of the untreated. 6
7 MATCHING METHODS TAXONOMY Replacement: Matching with replacement Matching without replacement Algorithms: Greedy matching: Optimal matching Genetic matching Ratio: Pair matching (1 to 1) Fixed ratio (1 to k) variable ratio Full 7
8 GREEDY MATCHING For each individual in the treated sample, select the best available match without accounting for the quality of the match of the entire treated sample. Greedy matching has the advantage of allowing any analysis to estimate the treatment effect after matching. Types of Greedy Matching: Nearest neighbor propensity score matching Nearest neighbor propensity score matching within a caliper Mahalanobis metric matching Mahalanobis metric matching within a propensity score caliper 8
9 LIMITATIONS OF GREEDY MATCHING While trying to maximize exact matches (i.e., within the common support region or within a caliper), cases may be excluded due to incomplete matching (no available matches). While trying to maximize cases (i.e., widen the region), inexact matching may result. To prevent inexact or incomplete matching when the number of untreated is not much larger than the treated, use matching with replacement. 9
10 NEAREST NEIGHBOR MATCHING 1. Randomly order the treated and untreated individuals 2. select the first treated individual i and find the untreated individual j with closest propensity score. 3. If matching is without replacement, remove j from the pool. 4. Repeat the above process until matches are found for all participants. 10
11 NEAREST NEIGHBOR WITHIN A CALIPER Caliper: a required common-support region for each match, usually in standard deviation units. Rosembaum and Rubin (1985) suggest a caliper of.25 standard deviations. Matching Method: Within the caliper of each treated observation, select the untreated observation with closest propensity score. 11
12 MAHALANOBIS DISTANCE 1 di (, j) ( uv) C ( uv) T u and v are values of the matching variables for participant i and nonparticipant j, C is the sample covariance matrix of the matching variables from the full set of nonparticipants; 12
13 MAHALANOBIS METRIC MATCHING 1. Randomly order sample 2. Calculate the Mahalanobis distance between the first treated individual and all untreated individuals based on all covariates; 3. Choose the untreated individual, j, with the minimum distance d(i,j) as the match for treated individual i; 4. If matching is without replacement, remove j from the pool. 5. Repeat the above process until matches are found for all participants. 13
14 MAHALANOBIS METRIC MATCHING WITHIN PROPENSITY SCORE CALIPER 1. Randomly order sample 2. Calculate the Mahalanobis distance between the first treated individual and all untreated individuals within an propensity score caliper based on all covariates except the propensity score; 3. Choose the untreated individual, j, with the minimum distance d(i,j) as the match for treated individual i; 4. If matching is without replacement, remove j from the pool. 5. Repeat the above process until matches are found for all participants. 14
15 RESULTS OF SIMULATION STUDIES COMPARING DISTANCE MEASURES Gu and Rosenbaum (1993): Propensity score performed better than Mahalanobis distance and Mahalanobis within propensity calipers when there were many covariates for use in the study. Zhao (2004): Propensity score matching performed better than Mahalanobis metric matching in conditions with high correlations between covariates and the treatment participation indicator. Propensity score matching did not work well when the sample size used in the simulation was small. 15
16 OPTIMAL MATCHING Optimal matching is a network flow optimization problem that can be solved by linear programming methods. Matching is performed to minimize total weighted sample distance (which is the minimum cost in the network). Optimal matching will perform as well or better than greedy matching with respect to minimum distance and balance. 16
17 SIMULATION STUDIES COMPARING MATCHING ALGORITHMS Gu and Rosenbaum (1993): Optimal matching outperformed greedy matching except when there were a large number of control units available for matching to treated units. Optimal matching performed slightly better than greedy matching when comparing both matching methods based on the propensity distance. Cepeda, Boston, Farrar and Strom(2003): Optimal matching produced a larger reduction in bias when using a variable number of control units compared to using a fixed number of control units. 17
18 NUMBER OF MATCHES: PAIR MATCHING For ATT: Each treated is matched to a single control. For ATE: Each treated is matched to a single control and each control to a single treated. 18
19 NUMBER OF MATCHES: 1 TO K For ATT: Each treated is matched to K controls. For ATE: Each treated is matched to K controls and each control to K treated. 19
20 NUMBER OF MATCHES: VARIABLE RATIO OR FULL Variable ratio matching where each treated is matched to many controls. Full matching were each treated is matched to many controls and each control is matched to many treated. Full matching can be considered a form of stratification to a maximum number of strata containing at least one treated and one untreated. 20
21 SIMULATION STUDIES COMPARING NUMBER OF MATCHES. Gu & Rosenbaum (1993): Full matching performed better than 1 to k matching in terms of distance within matched sets as well as producing greater balance, especially when the number of covariates was large. Cepeda, Boston, Farrar and Strom(2003): Overall, optimal matching with variable number of controls removes more bias than with a fixed number of controls. Rosembaum (1989) points out that optimal pair matching is not actually optimal, but optimal full matching is. 21
22 SIMULATION RESULTS ABOUT THE EFFECT OF THE NUMBER OF AVAILABLE CONTROLS Gu & Rosenbaum: When there were many controls available for every one that will be used in pair matching, there was little difference between optimal and greedy matching When there is only one control available for matching to each treated unit, optimal matching is noticeably better than greedy matching. Cepeda, Boston, Farrar and Strom(2003): Both optimal matching with a fixed and variable number of control units produced identical reduction in bias when the treated to control ratio was 1/5, but optimal matching with a variable number of controls performed better with 1/2, 1/3 and 1/4 ratios. Regardless of method, reduction of bias with optimal matching decreases as the number of available controls decreases. 22
23 MATCHING WITH A GENETIC ALGORITHM Minimizes a multivariate weighted distance where weights are chosen to maximize a measure of covariate balance (e.g., standardized mean difference). Weight Matrix. If weights are 1, d is the Mahalanobis distance Cholesky decomposition of the sample covariance matrix S of the matching variables, which is a lower triangular matrix L where S = LL. 23
24 MATCHING METHODS USED FOR THE EXAMPLE Matching Method One-to-one greedy with replacement and caliper Variable ratio greedy with replacement and caliper Variable ratio genetic with replacement (propensity score [PS] + covariates) Variable ratio genetic with replacement (PS only) One-to-one optimal without replacement Full matching 24
25 COVARIATE BALANCE FOR THE EXAMPLE Matching Method Maximum standardized difference Unbalanced Covariates One-to-one greedy with replacement and caliper (26.1) Variable ratio greedy with replacement and caliper (9.5) Variable ratio genetic with replacement (PS (28.6) covariates) Variable ratio genetic with replacement (PS only) (19.0) One-to-one optimal without replacement (26.1) Full matching (9.5) 25
26 SHOULD MATCHED DATA BE TREATED AS RELATED SAMPLES? Schafer and Kang (2008) and Stuart (2010): matched samples should be treated as independent data because matching does not produce correlations between outcomes of matched individuals. Austin (2011): because the covariates which have similar distributions for matched and treated groups are related to outcomes, the distributions of outcomes will be more similar for treated and matched samples than from randomly selected samples. 26
27 ABADIE AND IMBENS SIMPLE MATCHING ESTIMATOR n 1 ATE Yˆ Yˆ n 1 0 i 1 0 ATT Y i i Yi i n1 1 ˆ ˆ n1 i T ˆ1 i 1 Y M i Y j J i M if Z 1 () i Y j i if Z 0 i ˆ 0 i 1 Y M i Y i j J () i M if Z 0 Y i j if Z 1 i 27
28 TREATMENT EFFECT ESTIMATES WITH VARIABLE RATIO GENETIC MATCHING USING THE MATCHING PACKAGE Matching only: Estimate AI SE T-stat p.val Matching with additional bias adjustment by regressing the outcomes on covariates only with the matched data: Estimate AI SE T-stat p.val
29 WEIGHTS FOR TREATMENT EFFECT ESTIMATION WITH 1 TO K, VARIABLE RATIO OR FULL MATCHING One-to-k or variable ratio: weighs are the inverse of the total number of matches unit received. Matching with replacement: weights for each untreated unit are summed across the multiple matched groups it was included in. Then, weights of the matched cases are multiplied by the ratio of the total number of matched units and total number of treated units. w i n n 0 1 n 1 if Z =1 i m 1 i 1 if Zi = 0 M m 29
30 ESTIMATION OF TREATMENT EFFECT WITH MATCHING Horvitz and Thompson Estimator of the treatment effect (Rosenbaum, 1987) n 1 wy i1 i1 i0 i0 i1 i1 n n 1 0 w i1 i0 i1 i1 n 0 wy w Estimation with Weighted regression: Y BZ e i 0 1 i i
31 TREATMENT EFFECT ESTIMATES USING FULL MATCHING WITH WEIGHTED REGRESSION Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) <2e-16 *** childcaretrue
32 ROSENBAUM S SENSITIVITY ANALYSIS Rosenbaum (2002) proposed a method based on the Wilcoxon signed ranks test The sensitivity analysis increases the level of hidden bias to obtain an upper and lower bound for how the p-value of association might be affected. By varying the assumed magnitude of hidden bias, we can find out how much hidden bias is necessary to render our p-value not significant. 32
33 BASIC CONCEPTS OF ROSENBAUM S METHOD If two matched individuals have the same observed covariates but different probabilities of receiving the treatment, the odds ratio of these units receiving the treatment is: j /(1 j) j(1 k) /(1 ) (1 ) k k k 1 If there is hidden bias,the odds ratio will be larger than one and smaller than a constant. 1 (1 ) j k (1 ) k 1 33
34 MEASURING THE DEGREE OF HIDDEN BIAS (gamma) measures the degree of departure from a study that is free of hidden bias. A study is sensitive to hidden bias if small values of lead to change in inferences. A study is insensitive to hidden bias if large values of do not lead to change in inferences. 34
35 STEPS OF WILCOXON S SIGNED RANK TEST FOR SENSITIVITY ANALYSIS IN A MATCHED PAIRS STUDY 1. Compute the differences between matched pairs and rank them. 2. Compute the Wilcoxon signed rank statistic for the outcome difference between treated and control. 3. Compute the expectation and variance of the Wilcoxon signed rank statistic under the null hypothesis of no treatment effect. 4. Compute the Z score for the observed signed rank statistic given the expected value and variance. For the test against the null hypothesis, the lower and upper bounds of the p value are the same. 35
36 STEPS OF WILCOXON S SIGNED RAND TEST FOR SENSITIVITY ANALYSIS IN A MATCHED PAIRS STUDY 5. Compute the expectation of the lower bound of the Wilcoxon signed rank statistic under the null hypothesis of = 2 (or any other value higher than 1) and its variance. 6. Compute the Z score associated with the lower bound and associated p value. 7. Compute the expectation of the upper bound of the Wilcoxon signed rank statistic under the null hypothesis of = 2 (or any other value higher than 1) and its variance. 8. Compute the Z score associated with the upper bound and associated p value. 9. Check whether range between the p values of the lower and upper bounds does not cross 0.05, which would change conclusions about the statistical significance of results. 36
37 RESULTS OF ROSENBAUM S SENSITIVITY ANALYSIS WITH ESTIMATES FROM GENETIC MATCHING Rosenbaum Sensitivity Test for Wilcoxon Signed Rank P-Value Unconfounded estimate Gamma Lower bound Upper bound
38 R PACKAGES USEFUL FOR PROPENSITY SCORE MATCHING Package Function Objective MatchIt matchit Implement greedy matching and as interface for genetic, optimal, and full matching Matching GenMatch, Match, MatchBalance Obtain covariate weights for genetic matching, implement genetic matching with weights, as well as greedy matching rbounds psens Implement Rosenbaum s sensitivity analysis method 38
Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14
STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Frequency 0 2 4 6 8 Quiz 2 Histogram of Quiz2 10 12 14 16 18 20 Quiz2
More informationDynamics in Social Networks and Causality
Web Science & Technologies University of Koblenz Landau, Germany Dynamics in Social Networks and Causality JProf. Dr. University Koblenz Landau GESIS Leibniz Institute for the Social Sciences Last Time:
More informationGov 2002: 5. Matching
Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders
More informationPropensity Score Methods for Causal Inference
John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good
More informationESTIMATION OF TREATMENT EFFECTS VIA MATCHING
ESTIMATION OF TREATMENT EFFECTS VIA MATCHING AAEC 56 INSTRUCTOR: KLAUS MOELTNER Textbooks: R scripts: Wooldridge (00), Ch.; Greene (0), Ch.9; Angrist and Pischke (00), Ch. 3 mod5s3 General Approach The
More informationSince the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai
Chapter 1 Propensity Score Analysis Concepts and Issues Wei Pan Haiyan Bai Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity score analysis, research using propensity score analysis
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More information(Mis)use of matching techniques
University of Warsaw 5th Polish Stata Users Meeting, Warsaw, 27th November 2017 Research financed under National Science Center, Poland grant 2015/19/B/HS4/03231 Outline Introduction and motivation 1 Introduction
More informationPropensity Score Matching
Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In
More informationLab 4, modified 2/25/11; see also Rogosa R-session
Lab 4, modified 2/25/11; see also Rogosa R-session Stat 209 Lab: Matched Sets in R Lab prepared by Karen Kapur. 1 Motivation 1. Suppose we are trying to measure the effect of a treatment variable on the
More informationIntroduction to Propensity Score Matching: A Review and Illustration
Introduction to Propensity Score Matching: A Review and Illustration Shenyang Guo, Ph.D. School of Social Work University of North Carolina at Chapel Hill January 28, 2005 For Workshop Conducted at the
More informationJob Training Partnership Act (JTPA)
Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training
More informationNISS. Technical Report Number 167 June 2007
NISS Estimation of Propensity Scores Using Generalized Additive Models Mi-Ja Woo, Jerome Reiter and Alan F. Karr Technical Report Number 167 June 2007 National Institute of Statistical Sciences 19 T. W.
More informationAn Introduction to Causal Analysis on Observational Data using Propensity Scores
An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut
More informationPropensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018
Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Schedule and outline 1:00 Introduction and overview 1:15 Quasi-experimental vs. experimental designs
More informationEcon 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem
Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects The Problem Analysts are frequently interested in measuring the impact of a treatment on individual behavior; e.g., the impact of job
More informationBackground of Matching
Background of Matching Matching is a method that has gained increased popularity for the assessment of causal inference. This method has often been used in the field of medicine, economics, political science,
More informationWeighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai
Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving
More informationCovariate Balancing Propensity Score for General Treatment Regimes
Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian
More informationWeighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.
Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University
More informationarxiv: v1 [stat.me] 15 May 2011
Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland
More informationPropensity Score Analysis with Hierarchical Data
Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationFlexible Estimation of Treatment Effect Parameters
Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both
More informationEvaluating the performance of propensity score matching methods: A simulation study
James Madison University JMU Scholarly Commons Dissertations The Graduate School Spring 2017 Evaluating the performance of propensity score matching methods: A simulation study Jessica N. Jacovidis James
More informationSection 10: Inverse propensity score weighting (IPSW)
Section 10: Inverse propensity score weighting (IPSW) Fall 2014 1/23 Inverse Propensity Score Weighting (IPSW) Until now we discussed matching on the P-score, a different approach is to re-weight the observations
More informationMatching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region
Impact Evaluation Technical Session VI Matching Techniques Jed Friedman Manila, December 2008 Human Development Network East Asia and the Pacific Region Spanish Impact Evaluation Fund The case of random
More informationWhat s New in Econometrics. Lecture 1
What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and
More informationImbens/Wooldridge, IRP Lecture Notes 2, August 08 1
Imbens/Wooldridge, IRP Lecture Notes 2, August 08 IRP Lectures Madison, WI, August 2008 Lecture 2, Monday, Aug 4th, 0.00-.00am Estimation of Average Treatment Effects Under Unconfoundedness, Part II. Introduction
More informationMatching. Stephen Pettigrew. April 15, Stephen Pettigrew Matching April 15, / 67
Matching Stephen Pettigrew April 15, 2015 Stephen Pettigrew Matching April 15, 2015 1 / 67 Outline 1 Logistics 2 Basics of matching 3 Balance Metrics 4 Matching in R 5 The sample size-imbalance frontier
More informationEstimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing
Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Alessandra Mattei Dipartimento di Statistica G. Parenti Università
More informationEmpirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design
1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary
More informationPropensity Score for Causal Inference of Multiple and Multivalued Treatments
Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2016 Propensity Score for Causal Inference of Multiple and Multivalued Treatments Zirui Gu Virginia Commonwealth
More informationStratified Randomized Experiments
Stratified Randomized Experiments Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Stratified Randomized Experiments Stat186/Gov2002 Fall 2018 1 / 13 Blocking
More informationPropensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr
Propensity Score Analysis Using teffects in Stata SOC 561 Programming for the Social Sciences Hyungjun Suh Apr. 25. 2016 Overview Motivation Propensity Score Weighting Propensity Score Matching with teffects
More informationControlling for overlap in matching
Working Papers No. 10/2013 (95) PAWEŁ STRAWIŃSKI Controlling for overlap in matching Warsaw 2013 Controlling for overlap in matching PAWEŁ STRAWIŃSKI Faculty of Economic Sciences, University of Warsaw
More informationSummary and discussion of The central role of the propensity score in observational studies for causal effects
Summary and discussion of The central role of the propensity score in observational studies for causal effects Statistics Journal Club, 36-825 Jessica Chemali and Michael Vespe 1 Summary 1.1 Background
More informationImplementing Matching Estimators for. Average Treatment Effects in STATA
Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University West Coast Stata Users Group meeting, Los Angeles October 26th, 2007 General Motivation Estimation
More informationImplementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston
Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University Stata User Group Meeting, Boston July 26th, 2006 General Motivation Estimation of average effect
More informationDOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS. A SAS macro to estimate Average Treatment Effects with Matching Estimators
DOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS A SAS macro to estimate Average Treatment Effects with Matching Estimators Nicolas Moreau 1 http://cemoi.univ-reunion.fr/publications/ Centre d'economie
More informationNew Developments in Nonresponse Adjustment Methods
New Developments in Nonresponse Adjustment Methods Fannie Cobben January 23, 2009 1 Introduction In this paper, we describe two relatively new techniques to adjust for (unit) nonresponse bias: The sample
More informationHOW A SUPPRESSOR VARIABLE AFFECTS THE ESTIMATION OF CAUSAL EFFECT: EXAMPLES OF CLASSICAL AND RECIPROCAL SUPPRESSIONS. Yun-Jia Lo
HOW A SUPPRESSOR VARIABLE AFFECTS THE ESTIMATION OF CAUSAL EFFECT: EXAMPLES OF CLASSICAL AND RECIPROCAL SUPPRESSIONS By Yun-Jia Lo A DISSERTATION Submitted to Michigan State University in partial fulfillment
More informationCausal Sensitivity Analysis for Decision Trees
Causal Sensitivity Analysis for Decision Trees by Chengbo Li A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Master of Mathematics in Computer
More informationstudies, situations (like an experiment) in which a group of units is exposed to a
1. Introduction An important problem of causal inference is how to estimate treatment effects in observational studies, situations (like an experiment) in which a group of units is exposed to a well-defined
More informationLongitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts
Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationCausal Inference Basics
Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationAn Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016
An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions
More informationPrimal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing
Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationSIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS
SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS TOMMASO NANNICINI universidad carlos iii de madrid UK Stata Users Group Meeting London, September 10, 2007 CONTENT Presentation of a Stata
More informationSensitivity analysis for average treatment effects
The Stata Journal (2007) 7, Number 1, pp. 71 83 Sensitivity analysis for average treatment effects Sascha O. Becker Center for Economic Studies Ludwig-Maximilians-University Munich, Germany so.b@gmx.net
More informationH-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL
H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL Intesar N. El-Saeiti Department of Statistics, Faculty of Science, University of Bengahzi-Libya. entesar.el-saeiti@uob.edu.ly
More informationMatching for Causal Inference Without Balance Checking
Matching for ausal Inference Without Balance hecking Gary King Institute for Quantitative Social Science Harvard University joint work with Stefano M. Iacus (Univ. of Milan) and Giuseppe Porro (Univ. of
More informationIntroduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017
Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent
More informationSection 9: Matching without replacement, Genetic matching
Section 9: Matching without replacement, Genetic matching Fall 2014 1/59 Matching without replacement The standard approach is to minimize the Mahalanobis distance matrix (In GenMatch we use a weighted
More informationVector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings
Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings Jessica Lum, MA 1 Steven Pizer, PhD 1, 2 Melissa Garrido,
More informationMachine Learning Linear Classification. Prof. Matteo Matteucci
Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationTable B1. Full Sample Results OLS/Probit
Table B1. Full Sample Results OLS/Probit School Propensity Score Fixed Effects Matching (1) (2) (3) (4) I. BMI: Levels School 0.351* 0.196* 0.180* 0.392* Breakfast (0.088) (0.054) (0.060) (0.119) School
More informationChapter 60 Evaluating Social Programs with Endogenous Program Placement and Selection of the Treated
See discussions, stats, and author profiles for this publication at: http://www.researchgate.net/publication/222400893 Chapter 60 Evaluating Social Programs with Endogenous Program Placement and Selection
More informationOMITTED VARIABLES, R, AND BIAS REDUCTION IN MATCHING HIERARCHICAL DATA: A MONTE CARLO STUDY
Journal of Statistics: Advances in Theory and Applications Volume 17, Number 1, 017, Pages 43-81 Available at http://scientificadvances.co.in DOI: http://dx.doi.org/10.1864/jsata_710011791 OMITTED VARIABLES,
More informationEstimating the Marginal Odds Ratio in Observational Studies
Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios
More informationUse of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:
Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 13, 2009 Kosuke Imai (Princeton University) Matching
More informationSection 9c. Propensity scores. Controlling for bias & confounding in observational studies
Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a
More informationGeneralized Linear Models. Last time: Background & motivation for moving beyond linear
Generalized Linear Models Last time: Background & motivation for moving beyond linear regression - non-normal/non-linear cases, binary, categorical data Today s class: 1. Examples of count and ordered
More informationDifference-in-Differences Methods
Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics
More informationFour Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil
Four Parameters of Interest in the Evaluation of Social Programs James J. Heckman Justin L. Tobias Edward Vytlacil Nueld College, Oxford, August, 2005 1 1 Introduction This paper uses a latent variable
More informationAn Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies
Paper 177-2015 An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies Yan Wang, Seang-Hwane Joo, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta
More informationMATCHING FOR EE AND DR IMPACTS
MATCHING FOR EE AND DR IMPACTS Seth Wayland, Opinion Dynamics August 12, 2015 A Proposal Always use matching Non-parametric preprocessing to reduce model dependence Decrease bias and variance Better understand
More informationA SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS
A SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS TOMMASO NANNICINI universidad carlos iii de madrid North American Stata Users Group Meeting Boston, July 24, 2006 CONTENT Presentation of
More informationPropensity Score Methods for Estimating Causal Effects from Complex Survey Data
Propensity Score Methods for Estimating Causal Effects from Complex Survey Data Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School
More informationMatching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ
Matching James J. Heckman Econ 312 This draft, May 15, 2007 1 / 169 Introduction The assumption most commonly made to circumvent problems with randomization is that even though D is not random with respect
More informationAlternative Balance Metrics for Bias Reduction in. Matching Methods for Causal Inference
Alternative Balance Metrics for Bias Reduction in Matching Methods for Causal Inference Jasjeet S. Sekhon Version: 1.2 (00:38) I thank Alberto Abadie, Jake Bowers, Henry Brady, Alexis Diamond, Jens Hainmueller,
More informationStatistics Handbook. All statistical tables were computed by the author.
Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance
More informationEstimating Causal Effects from Observational Data with the CAUSALTRT Procedure
Paper SAS374-2017 Estimating Causal Effects from Observational Data with the CAUSALTRT Procedure Michael Lamm and Yiu-Fai Yung, SAS Institute Inc. ABSTRACT Randomized control trials have long been considered
More informationThe Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates
The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates Eun Sook Kim, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta E. Lanehart, Aarti Bellara,
More informationAAEC/ECON 5126 FINAL EXAM: SOLUTIONS
AAEC/ECON 5126 FINAL EXAM: SOLUTIONS SPRING 2013 / INSTRUCTOR: KLAUS MOELTNER This exam is open-book, open-notes, but please work strictly on your own. Please make sure your name is on every sheet you
More informationLecture 12: Effect modification, and confounding in logistic regression
Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression
More informationBusiness Statistics. Lecture 10: Correlation and Linear Regression
Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form
More informationTHE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.
THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. RUBIN My perspective on inference for causal effects: In randomized
More informationAsymptotic equivalence of paired Hotelling test and conditional logistic regression
Asymptotic equivalence of paired Hotelling test and conditional logistic regression Félix Balazard 1,2 arxiv:1610.06774v1 [math.st] 21 Oct 2016 Abstract 1 Sorbonne Universités, UPMC Univ Paris 06, CNRS
More informationA Theory of Statistical Inference for Matching Methods in Causal Research
A Theory of Statistical Inference for Matching Methods in Causal Research Stefano M. Iacus Gary King Giuseppe Porro October 4, 2017 Abstract Researchers who generate data often optimize efficiency and
More informationAnalysis of propensity score approaches in difference-in-differences designs
Author: Diego A. Luna Bazaldua Institution: Lynch School of Education, Boston College Contact email: diego.lunabazaldua@bc.edu Conference section: Research methods Analysis of propensity score approaches
More informationImbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1
Imbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1 Lectures on Evaluation Methods Guido Imbens Impact Evaluation Network October 2010, Miami Methods for Estimating Treatment
More informationBiostat 2065 Analysis of Incomplete Data
Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh September 13 & 15, 2005 1. Complete-case analysis (I) Complete-case analysis refers to analysis based on
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationData Integration for Big Data Analysis for finite population inference
for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation
More informationAnalysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.
Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a
More informationESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.
More informationTutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:
Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported
More informationStat 412/512 TWO WAY ANOVA. Charlotte Wickham. stat512.cwick.co.nz. Feb
Stat 42/52 TWO WAY ANOVA Feb 6 25 Charlotte Wickham stat52.cwick.co.nz Roadmap DONE: Understand what a multiple regression model is. Know how to do inference on single and multiple parameters. Some extra
More informationLecture 11/12. Roy Model, MTE, Structural Estimation
Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings
More informationNear/Far Matching. Building a Stronger Instrument in an Observational Study of Perinatal Care for Premature Infants
Near/Far Matching Building a Stronger Instrument in an Observational Study of Perinatal Care for Premature Infants Joint research: Mike Baiocchi, Dylan Small, Scott Lorch and Paul Rosenbaum What this talk
More informationfinite-sample optimal estimation and inference on average treatment effects under unconfoundedness
finite-sample optimal estimation and inference on average treatment effects under unconfoundedness Timothy Armstrong (Yale University) Michal Kolesár (Princeton University) September 2017 Introduction
More informationOUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores
OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS 776 1 15 Outcome regressions and propensity scores Outcome Regression and Propensity Scores ( 15) Outline 15.1 Outcome regression 15.2 Propensity
More informationDouble Robustness. Bang and Robins (2005) Kang and Schafer (2007)
Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random
More informationMatching with Multiple Control Groups, and Adjusting for Group Differences
Matching with Multiple Control Groups, and Adjusting for Group Differences Donald B. Rubin, Harvard University Elizabeth A. Stuart, Mathematica Policy Research, Inc. estuart@mathematica-mpr.com KEY WORDS:
More informationHigh Dimensional Propensity Score Estimation via Covariate Balancing
High Dimensional Propensity Score Estimation via Covariate Balancing Kosuke Imai Princeton University Talk at Columbia University May 13, 2017 Joint work with Yang Ning and Sida Peng Kosuke Imai (Princeton)
More informationMoving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand
Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand Richard K. Crump V. Joseph Hotz Guido W. Imbens Oscar Mitnik First Draft: July 2004
More information