THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.

Similar documents
Job Training Partnership Act (JTPA)

Achieving Optimal Covariate Balance Under General Treatment Regimes

Technical Track Session I:

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity

Propensity Score Weighting with Multilevel Data

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

Technical Track Session I: Causal Inference

Low-Income African American Women's Perceptions of Primary Care Physician Weight Loss Counseling: A Positive Deviance Study

Analyzing the Geospatial Rates of the Primary Care Physician Labor Supply in the Contiguous United States

Potential Outcomes and Causal Inference I

Statistical Models for Causal Analysis

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Causal Inference with Big Data Sets

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

Selection on Observables: Propensity Score Matching.

New Developments in Nonresponse Adjustment Methods

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Truncation and Censoring

CompSci Understanding Data: Theory and Applications

Introduction to Statistical Inference

Propensity Score Analysis with Hierarchical Data

Combining Non-probability and Probability Survey Samples Through Mass Imputation

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Lecture 4 Multiple linear regression

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Notes 6: Multivariate regression ECO 231W - Undergraduate Econometrics

Observational Studies and Propensity Scores

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Philip J Cimo DDS PA 650 West Bough Lane Ste #160 Houston TX

Introduction to Linear Regression Analysis

Instrumental Variables in Action: Sometimes You get What You Need

THE SURVEY OF INCOME AND PROGRAM PARTICIPATION

This report details analyses and methodologies used to examine and visualize the spatial and nonspatial

AGEC 661 Note Fourteen

T test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016

BROOKINGS May

The Church Demographic Specialists

Jun Tu. Department of Geography and Anthropology Kennesaw State University

Measuring community health outcomes: New approaches for public health services research

Summary and discussion of The central role of the propensity score in observational studies for causal effects

PROPENSITY SCORE MATCHING. Walter Leite

GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX

Near/Far Matching. Building a Stronger Instrument in an Observational Study of Perinatal Care for Premature Infants

In matrix algebra notation, a linear model is written as

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Near/Far Matching. Building a Stronger Instrument in an Observational Study of Perinatal Care for Premature Infants

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Rockefeller College University at Albany

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Applied Microeconometrics (L5): Panel Data-Basics

14.74 Lecture 10: The returns to human capital: education

Econometrics Problem Set 4

Chapter 11. Regression with a Binary Dependent Variable

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

II. MATCHMAKER, MATCHMAKER

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Implementing Matching Estimators for. Average Treatment Effects in STATA

Balancing Covariates via Propensity Score Weighting

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017

Lecture 1 January 18

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Bayesian Causal Forests

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr

A Decision Theoretic Approach to Causality

TriMatch: An R Package for Propensity Score Matching of Non-binary Treatments

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

Have you... Unit 1: Introduction to data Lecture 1: Data collection, observational studies, and experiments. Readiness assessment

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture Notes 12 Advanced Topics Econ 20150, Principles of Statistics Kevin R Foster, CCNY Spring 2012

Kausalanalyse. Analysemöglichkeiten von Paneldaten

What s New in Econometrics. Lecture 1

Gov 2002: 4. Observational Studies and Confounding

Item Response Theory for Conjoint Survey Experiments

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Combining multiple observational data sources to estimate causal eects

Does city structure cause unemployment?

Gov 2002: 5. Matching

Causal Inference. Prediction and causation are very different. Typical questions are:

Assessing Social Vulnerability to Biophysical Hazards. Dr. Jasmine Waddell

Modeling Mediation: Causes, Markers, and Mechanisms

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

The Impacts of the Affordable Care Act on Preventive Services among Racial Groups UNDERGRADUATE RESEARCH THESIS

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

The propensity score with continuous treatments

Transcription:

THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. RUBIN

My perspective on inference for causal effects: In randomized experiments In observational studies The RCM (Holland, 1986): 1974, 1975, 1976, 1977, 1978, 1979, 1980 Two essential parts, one optional part(*) 1. Define estimands using potential outcomes conceptual; pre data Observable results of real or hypothetical manipulations (treatments) No causation without manipulation 1975 Formal notation due to Neyman (1923) in randomized experiments Intuitive idea old, but general use of formal notation only from 1974 X, Y(0), Y(1) Prior to 1974 everyone used Y obs in observational studies Y obs,i = W i Y i (1) + (1 W i )Y i (0)

2. Posit assignment mechanism Reasons for missing and observed values of Y(0) and Y(1) Pr(W X, Y(0), Y(1)) Special cases much discussed Randomized experiments Neyman, Fisher (1925) Supply-Demand Haavelmo (1944) Optimizing Behavior Roy (1951) General formulation only from 1975 *3. Full probability model to predict the missing potential outcomes Pr(X, Y(0), Y(1)) Bayes formalized in 1975, 1978 #2 and #3 Posterior distribution of all causal estimands

Here, focus on the second step: design in observational studies. Template is generalized randomized experiment: Pr(W, X, Y(0), Y(1)) = Pr (W X) Pr( Wi = 1 Xi) = propensity scores 0 < propensity scores < 1 Design of randomized experiments is without Y(0), Y(1) Approximate randomized experiment in observational studies Blocking, matching subclassification just use X Old history of matching in statistics summarized in 2006. Structure an observational study as a broken randomized experiment, without using any values of potential outcomes Try to reconstruct underlying experiment

FDA analogy Pharmas do randomization over and over until they get a favorable result? Or use regression over and over on one sample until drug looks good? Instead, they carefully try to design study, followed by Specification of primary analysis & secondary supporting analyses And must live with answers that follow from proposed analyses Should we accept lower standards in social science? Objectivity can be attained in observational studies Outcome-free design before any analysis with observed outcomes

Very Recent Personal Examples: Tobacco litigation Value Added Assessment Peer effects of smoking Job training in Germany Treffer Illustrate with first example Using Propensity Scores to Help Design Observational Studies: Application to the Tobacco Litigation. Health Services & Outcomes Research Methodology (2002).

Table 1 Variables Used in Propensity Model Description Seatbelt 5 levels of reported seat belt use Arthritis Whether reported suffering from arthritis Census Division 9 census regions Champ Insurance Diabetes Doctor ever told having diabetes Down time 6 levels of reported Down time Dump time 6 levels of reported Dump time Employment Indicating employment status each quarter English English is a primary language Retirement Indicator for Retirement status Number of Friends 7 levels measuring the number of friends Membership in Clubs 6 levels measuring memberships in clubs Education Completed years of Education HMO coverage Indicating HMO coverage each quarter High blood pressure Doctor ever told having high blood pressure Industry Code 14 Industry codes Age Age of the respondent Labor Union Indicator for a member of labor union Log Height Natural Logarithm of Height Log Weight Natural Logarithm of Weight Marital Status Marital status in each quarter Medicaid On medicaid (each quarter) Medicare On medicare (each quarter) Occupation Occupation code (13 levels) Public Assistance Other Public assistance program (each quarter) Friends over Frequency of having Friends over (7 levels) Physical Activity Indicator Variable for Physically active Population density 3 levels Poverty Status 6 levels Pregnant 1987 Pregnancy status in 1987 (women) Private Insurance Other Private Insurance (each quarter) Race 4 levels Rated Health 5-point Self rating of health status Home ownership Indicator for Owning home Rheumatism Indicator for Suffering from rheumatism Share Life Indicator variable for having somebody to share their life

Region MSA Risk Uninsured Veteran Incapler Agesq Educat.sq Age_wt Age_educt Age_ht Educat_wt Educat_ht Loght_logwt Loghtsq Logwtsq 4 levels of region of the country 4 levels indicating types of Metropolitan Statistical Area General risk taking attitude (5 levels) Indicator for lack insurance (each quarter) Indicator for Veteran status Survey Weight in NMES database Age*Age Education*Education Age*logwt Age*Education Age*Loght Education*Logwt Education*loght Loght*logwt Loght*loght Logwt*logwt

TABLE 2 ESTIMATED PROPENSITY SCORES ON THE LOGIT SCALE FOR "SMOKERS" VERSUS NEVER SMOKERS IN FULL NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Treated B R <1/2 >1/2 and >4/5 and >5/4 and >2 Group < 4/5 < 5/4 < 2 Current 1.09 1.00 3 9 57 26 5 N = 3,510 Former 1.06 0.82 2 15 61 15 7 N = 3,384 Current 1.03 0.85 1 15 59 23 2 N = 3,434 Former 0.65 1.02.5 7 85 7.5 N = 2,657

Figure 1: Propensity Scores in NMES for Smokers versus Never Smokers Controls Treated Matched Controls Current Current Former Former

TABLE 3 ESTIMATED PROPENSITY SCORES FROM FULL NMES ON THE LOGIT SCALE FOR "SMOKERS" VERSUS NEVER SMOKERS IN MATCHED NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Treated B R <1/2 >1/2 and >4/5 and >5/4 and >2 Group < 4/5 < 5/4 < 2 Current.08 1.16 1 3 90 6 0 N = 3,510 Former.04 0.99 1 1 94 3 1 N = 3,384 Current.04 0.94 1 1 93 5 0 N = 3,434 Former.06 1.02 0 2 91 7 0 N = 2,657

TABLE 4 ESTIMATED PROPENSITY SCORES ON THE LOGIT SCALE FOR "SMOKERS" VERSUS NEVER SMOKERS IN MATCHED NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Treated B R <1/2 >1/2 and >4/5 and >5/4 and >2 Group < 4/5 < 5/4 < 2 Current 0.39 1.33 0 4 88 8 0 N = 3,510 Former 0.32 1.33 0 1 95 3 1 N = 3,384 Current 0.35 1.18 1 1 92 6 0 N = 3,434 Former 0.31 1.09 0 2 91 7 0 N = 2,657

TABLE 5A PROPENSITY SUBCLASSIFICATION ANALYSES ON THE LOGIT SCALE FOR CURRENT VS. NEVER-SMOKER MALES IN MATCHED NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Number of B R <1/2 >1/2 and >4/5 and >5/4 and >2 Subclasses < 4/5 < 5/4 < 2 2 0.18 1.36 0 2 98 0 0 4 0.10 1.25 0 1 99 0 0 6 0.09 1.30 0 0 100 0 0 8 0.08 1.16 0 0 100 0 0 10 0.07 1.12 0 0 100 0 0

TABLE 5B PROPENSITY SUBCLASSIFICATION ANALYSES ON THE LOGIT SCALE FOR FORMER VS. NEVER-SMOKER MALES IN MATCHED NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Number of B R <1/2 >1/2 and >4/5 and >5/4 and >2 Subclasses < 4/5 < 5/4 < 2 2 0.16 1.38 0 1 98 1 0 4 0.09 1.32 0 1 98 1 0 6 0.07 1.30 0 0 99 1 0 8 0.07 1.37 0 0 99 1 0 10 0.07 1.31 0 0 99 1 0

TABLE 5C PROPENSITY SUBCLASSIFICATION ANALYSES ON THE LOGIT SCALE FOR CURRENT VS. NEVER-SMOKER FEMALES IN MATCHED NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Number of B R <1/2 >1/2 and >4/5 and >5/4 and >2 Subclasses < 4/5 < 5/4 < 2 2 0.14 1.26 0 1 98 1 0 4 0.08 1.44 0 1 99 0 0 6 0.06 1.69 0 1 99 0 0 8 0.05 1.69 0 0 100 0 0 10 0.05 1.70 0 0 100 0 0

TABLE 5D PROPENSITY SUBCLASSIFICATION ANALYSES ON THE LOGIT SCALE FOR FORMER VS. NEVER-SMOKER FEMALES IN MATCHED NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Number of B R <1/2 >1/2 and >4/5 and >5/4 and >2 Subclasses < 4/5 < 5/4 < 2 2 0.13 1.09 0 0 97 3 0 4 0.08 0.85 0 0 99 1 0 6 0.07 0.85 0 0 100 0 0 8 0.06 0.77 0 0 100 0 0 10 0.06 0.92 0 0 100 0 0

TABLE 6 WEIGHTED PROPENSITY SCORE ANALYSES BASED ON INVERSE PROBABILITIES FOR WEIGHTS IN MATCHED NMES B = Bias, R = Ratio of "smoker" to never-smoker variances; also displayed is the distribution of the ratio of variances in the covariates orthogonal to the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Treated B R <1/2 >1/2 and >4/5 and >5/4 and >2 Group < 4/5 < 5/4 < 2 Current 0.03 1.19 0 0 100 0 0 N = 3,510 Former 0.08 0.22 0 0 100 0 0 N = 3,384 Current 0.03 1.70 0 0 100 0 0 N = 3,434 Former 0.03 0.66 0 0 100 0 0 N = 2,657

Table 7 ESTIMATED PROPENSITY SCORES ON THE LOGIT SCALE FOR MALE CURRENT SMOKERS (N = 3510) VERSUS MALE NEVER SMOKERS IN NMES B = Bias, R = Ratio of "smoker" to never-smoker variance; also displayed is the distribution of the ratio of variances in the covariates after adjusting for the propensity score Percent of covariates with specified variance ratio after adjustment for propensity score Analysis B R <1/2 >1/2 and >4/5 and >5/4 and >2 < 4/5 < 5/4 < 2 Full 1.09 1.00 3 9 57 26 5 N = 4,297 Matched 0.08 1.16 1 3 90 6 0 N = 3,510 Matched K = 1 0.39 1.33 0 4 88 8 0 K = 2 0.18 1.36 0 2 98 0 0 K = 4 0.10 1.25 0 1 99 0 0 K = 6 0.09 1.30 0 0 100 0 0 K = 8 0.08 1.16 0 0 100 0 0 K = 10 0.07 1.12 0 0 100 0 0 K = 0.03 1.19 0 0 100 0 0