Advanced Statistical Methods for Observational Studies L E C T U R E 0 1

Similar documents
Advanced Statistical Methods for Observational Studies L E C T U R E 0 1

Near/Far Matching. Building a Stronger Instrument in an Observational Study of Perinatal Care for Premature Infants

Near/Far Matching. Building a Stronger Instrument in an Observational Study of Perinatal Care for Premature Infants

Advanced Statistical Methods for Observational Studies L E C T U R E 0 6

1 Impact Evaluation: Randomized Controlled Trial (RCT)

Causal Inference. Prediction and causation are very different. Typical questions are:

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016

Gov 2002: 4. Observational Studies and Confounding

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Uncertainty. Michael Peters December 27, 2013

An introduction to biostatistics: part 1

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Potential Outcomes Model (POM)

Technical Track Session I: Causal Inference

Statistical Models for Causal Analysis

CPSC 320 Sample Solution, Reductions and Resident Matching: A Residentectomy

Selection on Observables: Propensity Score Matching.

Two-sample Categorical data: Testing

Propensity Score Methods for Causal Inference

1 Probabilities. 1.1 Basics 1 PROBABILITIES

Differences-in- Differences. November 10 Clair

The Generalized Roy Model and Treatment Effects

The Simple Linear Regression Model

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

Treatment Effects. Christopher Taber. September 6, Department of Economics University of Wisconsin-Madison

Modeling Log Data from an Intelligent Tutor Experiment

multilevel modeling: concepts, applications and interpretations

ECON Introductory Econometrics. Lecture 17: Experiments

review session gov 2000 gov 2000 () review session 1 / 38

PSC 504: Instrumental Variables

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Experiments and Quasi-Experiments

CIS 2033 Lecture 5, Fall

Casual Mediation Analysis

Causal inference in multilevel data structures:

Propensity Score Matching

Modeling Mediation: Causes, Markers, and Mechanisms

P (E) = P (A 1 )P (A 2 )... P (A n ).

Econometric Causality

Comments on Best Quasi- Experimental Practice

Chapter 3. Estimation of p. 3.1 Point and Interval Estimates of p

Chapter 7: Hypothesis testing

Causal Inference Basics

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Introduction to Econometrics

Instrumental variables estimation in the Cox Proportional Hazard regression model

University of Pennsylvania and The Children s Hospital of Philadelphia

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Lecture 5. 1 Review (Pairwise Independence and Derandomization)

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Comparative effectiveness of dynamic treatment regimes

Studies. Frank E Harrell Jr. NIH AHRQ Methodologic Challenges in CER 2 December 2010

Causal Inference with Big Data Sets

Analysis of propensity score approaches in difference-in-differences designs

Module 9: Sampling IPDET. Sampling. Intro Concepts Types Confidence/ Precision? How Large? Intervention or Policy. Evaluation Questions

Sampling and Sample Size. Shawn Cole Harvard Business School

Rubin s Potential Outcome Framework. Motivation. Example: Crossfit versus boot camp 2/20/2012. Paul L. Hebert, PhD

Impact Evaluation of Mindspark Centres

Nondeterministic finite automata

NISS. Technical Report Number 167 June 2007

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

STA111 - Lecture 1 Welcome to STA111! 1 What is the difference between Probability and Statistics?

EMERGING MARKETS - Lecture 2: Methodology refresher

Confidence intervals CE 311S

Last few slides from last time

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

In Defence of a Naïve Conditional Epistemology

Basics of Proofs. 1 The Basics. 2 Proof Strategies. 2.1 Understand What s Going On

Causal modelling in Medical Research

Technical Track Session I:

PSC 504: Differences-in-differeces estimators

CompSci Understanding Data: Theory and Applications

LECTURE 15: SIMPLE LINEAR REGRESSION I

Using Instrumental Variables to Find Causal Effects in Public Health

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

Dynamics in Social Networks and Causality

Introduction to Econometrics. Review of Probability & Statistics

Regression with Nonlinear Transformations

1. When applied to an affected person, the test comes up positive in 90% of cases, and negative in 10% (these are called false negatives ).

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Treatment of Error in Experimental Measurements

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

A little (more) about me.

Ignoring the matching variables in cohort studies - when is it valid, and why?

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling

Heterogeneity and Causality: Unit Heterogeneity and Design Sensitivity in Observational Studies. Paul R. Rosenbaum, University of Pennsylvania

What s New in Econometrics. Lecture 1

Quantitative Economics for the Evaluation of the European Policy

Lecture Notes 22 Causal Inference

PROBLEMS OF CAUSAL ANALYSIS IN THE SOCIAL SCIENCES

Probabilistic Index Models

Episode 1: Phis wants to be a physicist

Targeted Maximum Likelihood Estimation in Safety Analysis

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Transcription:

Advanced Statistical Methods for Observational Studies L E C T U R E 0 1

introduction

this class Website Expectations Questions

observational studies The world of observational studies is kind of hard to get into because it grew up in several distinct, but overlapping, disciplines: Epidemiology Demography Economics (econometrics) Political Science Sociology Biostatistics Statistics Psychology (psychometrics) Computer Science

a small bit about me I do causal inference: Observational studies of: NICU, surgical interventions, and smoking cessation. Randomized studies: six interventions here at Stanford to improve educational outcomes, two large trials of a sexual assault prevention program.

an aside You can call me Mike If you want to use my last name, Baiocchi, totally feel free to if you say it this way I ll definitely know you re talking to me: bye-oh-key

potential outcomes framework Design of Observational Studies: section 2.2

causal inference Our goal is to figure out what the change in the outcome will be for a person if we change from the control to the treatment: Y i t = 1 Y i t = 0 = Δ i

causal inference Y i t = 1 Y i t = 0 Fundamental problem of causality: We cannot observe both Y i t = 1 and Y i t = 0 at the same time.

notation DOS notation r C = response when the control is applied r T = response when the treatment is applied r Ti = response for observational unit i when the treatment is applied. Z i = assignment to treatment for unit i. R i = Z i r Ti 1 Z i r Ci = the observed response for unit i.

a table person i r_{c_i} r_{t_i} z_i r_i Delta_i 1-6 -15 1-15 -9 2-8 -19 0-8 -11 3 6 9 0 6 3 4 9 15 1 15 6 5-10 -23 1-23 -13 6 7 11 1 11 4 7 3 3 0 3 0 8 0-3 1-3 -3 9 0-3 1-3 -3 10-6 -15 0-6 -9 11 8 13 1 13 5 12-5 -13 0-5 -8

a table person i r_{c_i} r_{t_i} z_i r_i Delta_i 1-6 -15 1-15 -9 2-8 -19 0-8 -11 3 6 9 0 6 3 4 9 15 1 15 6 5-10 -23 1-23 -13 6 7 11 1 11 4 7 3 3 0 3 0 8 0-3 1-3 -3 9 0-3 1-3 -3 10-6 -15 0-6 -9 11 8 13 1 13 5 12-5 -13 0-5 -8 Sometimes called: The Science Table

potential outcomes framework The potential outcomes framework is meant to clarify the challenge: We only get to observe half of the data we actually want. How do we get at the unobserved half?

study design vs. inference Don Rubin: For objective causal inference, design trumps analysis

study design vs. inference 90% of statistics classes are about inference Why? It s useful, getting you those confidence intervals and p-values. The math is pretty cool. It feels hard. Because many of us don t really know much about the real world

design R A N D O M I Z A T I O N A N D S A M P L I N G

where does the data come from? We design trials. Assign groups that are similar at baseline Examine counterfactuals We also design sampling schemes. Representative groups Understand population from subsets of those populations Both use elements of control and randomness

an example: randomization Want to study a pill. Design the study Uniform randomization Matched pairs randomization Crossover design Cluster-randomized Inference t-test Matched-pairs t-test Repeated measures model Generalized linear mixed model But maybe all of those could be GLMM.

an example: sampling Want to study an election. Design the study Simple random sample Stratified sampling Snowball sampling Inference t-test Inverse probability weighting Generalized linear mixed model But maybe all of those could be GLMM.

different beliefs about where data come from RCT and sampling Structural equation modeling y i = β 0 + β 1 x 1,i + + β p x p,i + ε i If you want to be disabused of SEM spend some time reading

where data come from If you d like to be abused by SEM please see

inference

picking inference Inference requires assumptions Linear regression: Linearity and additivity Independent errors Homoskedastiticity Normality of errors Permutation test: Known assignment mechanism to T or C Fancier methods tend to have more assumptions and thus leave you open to more lines of attack. These attacks can be obviated by careful preparation during the design phase.

picking inference Use the simplest method that gets the job done. If you want to accomplish more, collect more data or do additional analyses. ( If have to use something more complicated than a t-test then someone messed up ) The fewer assumptions there are, the easier it will be to perform a sensitivity analysis build an argument to beat back the haters.

picking inference Another option: Proof by intimidation This paper presents a breakthrough in rhetorical logic, a promising field of science, of great values to those writing research proposals. It provides new, and utterly convincing tools for closing embarrassing gaps in your reasoning, without having to resort to brute-force methods such as actually thinking about the problem in the first place. The Craske-Trump Theorem Conjecture will allow researchers in any field to use the technique of Proof by Intimidation fully. - Michael Wilkinson (Annals of Improbable Research 2000)

observational study design N E O N A T A L I N T E N S I V E C A R E U N I T S

Application: Regionalization Hospitals vary in their ability to care for premature infants. The American Academy of Pediatrics recognizes levels: 1, 2, 3A, 3B, 3C, 3D and Regional Centers. Regionalization of care refers to a policy that suggests or requires that high-risk mothers deliver at hospitals with greater levels of capabilities.

Outcome Outcome

Outcome Outcome

The data Every baby delivered in a 10+ year period California Pennsylvania Missouri Mothers information ICD9 codes Delivery Post-delivery complications Some pre-delivery Some SES information Zip code of residence Birth/death certificates Census information PA and MO have zip code level CA will have block group Pre-delivery Severity?

H H

H H

naïve model for observational studies Design of Observational Studies: section 3.1-3.3

the variation comes from somewhere The model starts with thinking about the assignment mechanism: π i = Pr(Z i = 1 r Ti, r Ci, x i, u i ) where π i will likely vary from person to person, x i are the observed covariates and u i are the unobserved covariates. Consider the vector of all treatment assignments, Z. This can be written as: Pr Z = z r T, r C, x, u = π 1 z 1 1 π 1 1 z 1 π n z n 1 π n 1 z n

motivation for matching Imagine we can find two subjects, k and l, such that Z k + Z l = 1 but π k = π l Recall the model π i = Pr(Z i = 1 r Ti, r Ci, x i, u i ) we can only match on a subset of these. If we were in the RCT framework then we could create these trivially by doing a matched pairs randomization and assigning k and l to the same pair. We re building a story about the randomization.

H H Excess Travel Time

H H Excess Travel Time

H H Excess Travel Time

H H McClellan, McNeil & Newhouse; "Does more intensive treatment of acute myocardial infarction reduce mortality? JAMA. 272(11): 859-66, September 1994

motivation for matching Recall the model π i = Pr(Z i = 1 r Ti, r Ci, x i, u i ) we can only match on a subset of these. r Ti, r Ci are whether baby lives or dies. Z i is whether baby is delivered at low or high level NIC. x i are the 40+ covariates of mom and baby, as well as where they live. u i are all the unmeasured features that influenced the baby, mom and the process of how they ended up delivering at the hospital they did.

H H

H H

motivation: natural experiments Perhaps the π i behave in ways that will allow us to draw inferences in ways that are similar to experiments. Loosely speaking, this will tend to happen when the assignment mechanism is haphazard in its assignment not using prognostically relevant covariates to sort the units into levels of the treatment. What happens if we can find matches such that Z k + Z l = 1 and π k = π l?

motivation: natural experiments What happens if we can find matches such that π k = π l? Pr Z k = z k, Z l = z l r Tk, r Ck, x k, u k, r Tl, r Cl, x l, u l = π k z k 1 π k 1 z k π l z l 1 πl 1 z l = π k z k +z l 1 πk 1 z k +(1 z l ) What if we design our study such that Z l + Z k = 1? Pr Z k = 1, Z l = 0, Z l + Z k = 1

motivation: natural experiments What happens if we can find matches such that π k = π l? Pr Z k = z k, Z l = z l r Tk, r Ck, x k, u k, r Tl, r Cl, x l, u l = π k z k +z l 1 πk 1 z k +(1 z l ) What if we design our study such that Z l + Z k = 1? Pr Z k = 1, Z l = 0, Z l + Z k = 1 = Pr Z k = 1, Z l = 0 Pr Z k = 1, Z l = 0 +Pr Z k = 0, Z l = 1 = π k 1+0 1 πk 1 1 +(1 0) Pr Z k = 1, Z l = 0 +Pr Z k = 0, Z l = 1

motivation: natural experiments What if we design our study such that Z l + Z k = 1? Pr Z k = 1, Z l = 0, Z l + Z k = 1 = Pr Z k = 1, Z l = 0 Pr Z k = 1, Z l = 0 +Pr Z k = 0, Z l = 1 = π k 1+0 1 πk 1 1 +(1 0) Pr Z k = 1, Z l = 0 +Pr Z k = 0, Z l = 1 = π k 1+0 1 πk 1 1 +(1 0) π k 1+0 1 π k 1 1 +(1 0) +π k 0+1 1 π k 1 0 +(1 1) = 1 2 IF we can do this then we get to use the same tools developed for RCTs!

the naïve model Those that look alike (in our data set) are alike: π i = Pr Z i = 1 r Ti, r Ci, x i, u i = Pr Z i = 1 x i and with 0 < π i < 1 for all i = 1, 2,, n Pr Z = z r T, r C, x, u = n i=1 π i z i 1 πi 1 z i We could make this model true by randomly assigning, potentially using biased coins based on the observed covariates.

at the core: the propensity score This quantity has a name: e(x i ) = Pr Z i = 1 x i often, e(x i ) is referred to as the propensity score. It s the probability of assignment to treatment. It links the features of an individual unit to its assignment to the treatment. It s a description of the assignment mechanism. It s not magic. But it does have several cool features that are not immediately obvious: Propensity: facilitates randomization (Fisher) Score: short for balancing score which facilitates balance (Mill)

wishes If the ideal match and the naïve model were true then we could just match on the observed covariates and analyze data using traditional techniques for RCTs. Unfortunately, it s extraordinarily unlikely that strongly ignorable treatment assignment actually holds. It can be more plausible or less plausible given different situations, but it can never be proven.

takeaways

takeaways The potential outcomes framework helps organize our thinking on counterfactuals Design comes in two flavors (actually, three but the third one is not very healthy) In prospective studies design is an obvious consideration and one that MUST be passed through in order to obtain data In retrospective studies, design is a less obvious consideration but one that MUST be passed through unfortunately without much attention paid

fin. C H E C K O U T T H E W E B S I T E.