The Role of the Propensity Score in Observational Studies with Complex Data Structures

Similar documents
Propensity Score Weighting with Multilevel Data

Identification and estimation of treatment and interference effects in observational studies on networks

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017

Selection on Observables: Propensity Score Matching.

What s New in Econometrics. Lecture 1

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

The propensity score with continuous treatments

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region

Estimating the Marginal Odds Ratio in Observational Studies

Covariate Balancing Propensity Score for General Treatment Regimes

High Dimensional Propensity Score Estimation via Covariate Balancing

Gov 2002: 4. Observational Studies and Confounding

Balancing Covariates via Propensity Score Weighting

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Implementing Matching Estimators for. Average Treatment Effects in STATA

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1

Ratio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Propensity Score Analysis with Hierarchical Data

Propensity score weighting for causal inference with multi-stage clustered data

Combining multiple observational data sources to estimate causal eects

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Quantitative Economics for the Evaluation of the European Policy

Flexible Estimation of Treatment Effect Parameters

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

An Introduction to Causal Analysis on Observational Data using Propensity Scores

SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Propensity Score Methods for Causal Inference

Propensity score matching for multiple treatment levels: A CODA-based contribution

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Bootstrapping Sensitivity Analysis

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Robustness to Parametric Assumptions in Missing Data Models

A SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS

Balancing Covariates via Propensity Score Weighting: The Overlap Weights

Achieving Optimal Covariate Balance Under General Treatment Regimes

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

arxiv: v1 [stat.me] 15 May 2011

Causal Inference Basics

arxiv: v1 [stat.me] 8 Jun 2016

The problem of causality in microeconometrics.

Statistical Models for Causal Analysis

Propensity Score Matching

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

Discussion of Papers on the Extensions of Propensity Score

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Empirical approaches in public economics

Observational Studies and Propensity Scores

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

Estimating Post-Treatment Effect Modification With Generalized Structural Mean Models

Estimating direct effects in cohort and case-control studies

Causal Inference Lecture Notes: Selection Bias in Observational Studies

CompSci Understanding Data: Theory and Applications

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments

Section 10: Inverse propensity score weighting (IPSW)

Propensity Score for Causal Inference of Multiple and Multivalued Treatments

EMERGING MARKETS - Lecture 2: Methodology refresher

Dan Graham Professor of Statistical Modelling. Centre for Transport Studies

Marginal, crude and conditional odds ratios

Identification and Estimation of Causal Mediation Effects with Treatment Noncompliance

arxiv: v2 [stat.me] 19 Jan 2017

Telescope Matching: A Flexible Approach to Estimating Direct Effects *

Covariate selection and propensity score specification in causal inference

AGEC 661 Note Fourteen

A Sampling of IMPACT Research:

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Causal modelling in Medical Research

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Lecture 2: Constant Treatment Strategies. Donglin Zeng, Department of Biostatistics, University of North Carolina

Composite Causal Effects for. Time-Varying Treatments and Time-Varying Outcomes

Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and. Analysis of Observational Studies

NONPARAMETRIC ESTIMATION OF AVERAGE TREATMENT EFFECTS UNDER EXOGENEITY: A REVIEW*

Measuring Social Influence Without Bias

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

Extending causal inferences from a randomized trial to a target population

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures

Causal Sensitivity Analysis for Decision Trees

Causal inference in multilevel data structures:

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Potential Outcomes and Causal Inference I

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula.

Parametric and Non-Parametric Weighting Methods for Mediation Analysis: An Application to the National Evaluation of Welfare-to-Work Strategies

The Evaluation of Social Programs: Some Practical Advice

Imbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Gov 2002: 13. Dynamic Causal Inference

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Transcription:

The Role of the Propensity Score in Observational Studies with Complex Data Structures Fabrizia Mealli mealli@disia.unifi.it Department of Statistics, Computer Science, Applications University of Florence TATE Talks, UNC, School of Social Work May 22, 2017

Introduction and rationale of the talk Arpino B. and Mealli F. (2011). The specification of the propensity score in multilevel observational studies. Computational Statistics and Data Analysis 55, 1770 1780 Forastiere L., Airoldi E. M., and Mealli F. (2016). Identification and estimation of treatment and interference effects in observational studies on networks. Arxiv working paper (http://arxiv.org/abs/1609.06245) Papadogeorgou G., Mealli F., and Zigler C. (2017). Inverse probability weighted estimators under partial interference. (Work in progress, poster at ACIC 2017, Causal Inference for interfering units under treatment regimes that incorporate covariate information in the counterfactual treatment assignment) Common feature of these papers is that they use the (generalized) propensity score to propose methods to adjust for covariates in complex settings under various form of unconfoundedness and SUTVA

Notation Each unit (in a population of N) is characterized by a K-vector of characteristics, denoted by X i for unit i, with X denoting the N K matrix of characteristics Let W i denote the treatment, to which unit i is assigned : W i W = {0, 1} Stable Unit Treatment Value Assumption (SUTVA) SUTVA: the potential outcomes for any unit do not vary with the treatments assigned to any other units, and there are no different versions of the treatment SUTVA is a form of exclusion restriction: assumptions that rely on outside information to rule out the possibility of any causal effect of a particular treatment For each unit, let Y i (0) and Y i (1) denote the outcomes under the two values of the treatment Potential outcomes (Y(0), Y(1)) = [(Y i (0), Y i (1))] N i=1 and assignments W = [W i] N i=1 jointly determine the values of the observed and missing outcomes: Y obs i Y i (W i ) = W i Y i (1) + (1 W i ) Y i (0) Y mis i Y i (1 W i ) = (1 W i ) Y i (1) + W i Y i (0)

Basics of Propensity Scores The assignment mechanism (AM) gives the conditional probability of each vector of assignments given the covariates and potential outcomes: p(w X, Y(0), Y(1)) Given a population of N units, the AM defines the probability of receiving the treatment for each unit i as a function of the covariates and the potential outcomes: p i (X, Y(0), Y(1)) = p(w X, Y(0), Y(1)) i = 1, 2,..., N W:W i =1 Restrictions on the AM: Individualistic, Probabilistic and Unconfounded Individualistic Assignment: p i (X, Y(0), Y(1)) = p(w i = 1 X i, Y i (0), Y i (1)) Probabilistic Assignment: For each possible X, Y(0), Y(1) 0 < p i (X, Y(0), Y(1)) < 1 i = 1, 2,..., N i = 1, 2,..., N Unconfounded Assignment: An AM is unconfounded if it does not depend on the potential outcomes: p(w X, Y(0), Y(1)) = p(w X)

Propensity scores Propensity score for binary treatments. The propensity score at x is the average unit assignment probability for units with X i = x e(x) = 1 N(x) p i(x, Y(0), Y(1)) i:x i =x where N(x) = {i = 1,..., N X i = x} is the number of units with X i = x (e(x) 0 if N(x) = 0) Unconfoundedness and Individualistic assignment implies that the propensity score is the unit-level assignment probability of receiving the treatment e(x) = p(w i = 1 X i ) Observational studies: An assignment mechanism corresponds to an observational study if it is an unknown function of its arguments

Properties of the propensity score Balancing property of the propensity score: The probability of receiving the active treatment given the covariates is free of dependence on the covariates given the propensity score W i X i e(x i ) Unconfoundedness given the popensity score: Suppose assignment to treatment is unconfounded. Then assignment is unconfounded given the propensity score only: If W i Y i (0), Y i (1) X i ) then W i Y i (0), Y i (1) e(x i ) Unconfoundedness given the propensity score has generated methods of adjusting based on the propensity score: weighting, regression, subclassification, matching

Propensity score with multilevel data Clustered data: individual- and cluster-level covariates Treatment assignment at cluster level (Keele and Zubizarreta, 2017; Pimentel at al., 2017) Treatment assignment at individual level (Kim and Seltzer, 2007; Rosenbaum et al., 2007; Aussems, 2008; Su, 2008; Li et al., 2013; Arpino and Mealli, 2012)

The specification of the propensity score in multilevel studies Assignment mechanism may depend on individual- and cluster-level covariates Mimic block randomized experiments or multi-site experiments Arpino and Mealli (2012) consider cases of omitted variable bias due to unobserved cluster-level covariates Matching within clusters achieves perfect balance in cluster-level covariates but often not feasible and leading to poor balance in individual-level covariates

The specification of the propensity score in multilevel studies (Arpino and Mealli, 2012) Different specification of the propensity score (logit link): Random-effect multilevel models Fixed-effect models Models that ignore clustering Simulations showing bias/efficiency of nearest-neighbour PS matching estimators Motivating example: analyzing the effects of childbearing events on economic wellbeing in Vietnam, where community characteristics play important roles

Overall results and implications Fixed-effect specification of the PS outperforms in terms of bias and efficiency Robust to different distribution of cluster-level covariates Good even with small and/or imbalanced cluster size Still good when irrelevant variables included The inclusion of fixed-effects specifies a model for the PS more general than the ideal if cluster-level variables were available When conducting PS analysis it is safer to specify a more general model than pursuing model parsimony

Interference So far, we have assummed SUTVA, according to which the potential outcomes for any unit do not vary with the treatments assigned to any other units SUTVA allows us to write that for a unit i there are two potential outcomes Y i (0), Y i (1) In the presence of interference, a unit s outcome depends on the individual treatment, but also on the treatment of others For example, neighbor s vaccination status can affect an individual s outcome Under interference, the set of potential outcomes is {Y i (w), w {0, 1} n } This allows for 2 n potential outcomes for every unit, where n is the number of observations The treatment of any other observation can affect the outcome of unit i

Partial interference Units can be clustered in groups within which there is interference, but not among them Denote k {1, 2,..., K} to be a cluster with n k individuals. W(n) = {0, 1} n : set of vectors of possible treatment allocations of length n Let W ki to be the treatment indicator of unit i in cluster k, and write W k = (W k1,..., W knk ), and W k, i = (W k1,..., W kj 1, W kj+1,..., W knk ) Partial interference. Let k(i) {1,..., K} denote the class to which unit i belongs, and decompose W = (W 1..., W K ). For all W and W such that W k(i) = W k(i) we have Y i(w) = Y i (W ) Then, unit s i potential outcomes are {Y(w k ), w k W(n)} X ki = vector of fixed individual and group-level covariates; X k, X k, i similarly to W k, W k, i

Observed and counterfactual treatment allocation (Papadogeorgou, Mealli, Zigler, 2017) Observed treatment allocation The mechanism that has assigned the observed treatment Clinical trials (randomization), observational studies (covariates) Counterfactual treatment allocation What is the intervention that we are imagining? In what hypothesized world are we estimating the effect of interest? Interpretation of the effects requires a hypothesized treatment allocation that is applicable Previous literature on interference has considered Randomized observed treatment allocation Covariate-dependent observed treatment allocation Randomization-based counterfactual treatment allocation We propose the estimation of causal effects in the presence of interference under realistic interventions

Counterfactual treatment allocation We consider counterfactual treatment allocation that Incorporate covariates as treatment predictors Allow for dependence of treatments within the cluster Intervention takes place at the cluster level Denote p k (W k ; X k, α) to be the probability of allocating treatment W k to cluster k, when the cluster average propensity of treatment is equal to α The individual average potential outcome under w {0, 1} is defined as Y ki (w; X k, α) = Y ki (W ki = w, W k, i = w k, i )p k (W k, i = w k, i ; W ki = w, X k, α) where the summation is over w k, i W(n k 1) Group average potential outcome: Y k (w; X k, α) = 1 n k Y ki (w; X k, α) Population average potential outcome: Y(w; X, α) = 1 K Y k(w; X k, α)

Direct and indirect effects If the percentage of units in the cluster that are treated is equal to α, what is the effect of treatment? Individual direct effect: DE ki (X k, α) = Y ki (0; X k, α) Y ki (1; X k, α) Group average direct effect: DE k (X k, α) = 1 n k nk i=1 DE ki(x k, α) Population average direct effect: DE(X, α) = 1 K K k=1 DE k(x k, α) Among control units, what is the effect of changing α from α 1 to α 2? Individual indirect effect: IE ki (α 1, α 2 ; X k ) = Y ki (0; X k, α 1 ) Y ki (0; X k, α 2 ) Group and population level estimands IE k (α 1, α 2 ; X k ), IE(α 1, α 2 ; X) are defined similarly

Allocation average potential outcome and effects What if we are interested in evaluating the effect of interventions that shift the distribution of observed α from F α to F α For example, federal regulations could target the increase of state-specific vaccination rates Each state s compliance could be different We cannot know in advance which α each state/city will accept Allocation average individual potential outcome under F α Y ki (w; X k, F α ) = Y ki (w; X k, α) df α (α) Allocation average population direct effect DE(X, F α ) = DE(X, a) dfα(α) Allocation average population indirect effect IE(X, F α, F α) = Y(0; X, α) df α (α) Y(0; X, α) df α(α)

Assumptions, estimator, and asymptotic results Positivity: p(w k = w k X k ) > δ 0 > 0 for all w k W(n k ) Unconfoudedness: W k Y k ( ) X k Estimators Ŷ k (w; X k, α) = 1 n k Ŷ K (w, X, α) = 1 K n k i=1 p(w k, i W ki = w, X k, α) I{W ki = w}yki obs p(w k X k ) K Ŷ k (w, X k, α) k=1 Theorem 1 (Unbiasedness). Unbiased for Y k(w; X k, α), Y(w; X, α) Theorem 2 (Consistency). Let F 0 be the distribution of (Y k, X k, W k) in the whole population. lim Ŷ K (w, X, α) = E F0 [Y k (w, X, α)] K a.s. and so in probability Theorem 3 (Asymptotic normality). If positivity and unconfoudedness hold, the propensity scores are known or estimated from the correctly specified propensity score model, and outcome and cluster size are bounded, then Ŷ K (w, X, α) is asymptotically normal standard errors!

The use of the propensity score When the observed treatment allocation is not known (most times), p(w k X k ) needs to be estimated We model W ki Bern(p ki ) where logit(p ki ) = b k + X ki, b k N(0, σ 2 b), and X ki includes both individual and cluster level covariates [ ] n k Then p(w k X k ) = p(w ki b k, X ki ) f (b i σb) 2 db i i=1 The numerator is set equal to p(w kj X kj, α) where b k has been set equal to j i b α k, for the bα k that satisfies 1 p(w kj b k = b α k, X kj ) = α n k 1 j i The propensity score has been used to capture the covariate-treatment relationship in the observed (denominator) and counterfactual (numerator) treatment allocation

Estimating treatment and spillover effects in observational social network data using GPSs (Forastiere, Airoldi, Mealli, 2016) N = (V, E): Social Network i = 1,..., N = V : Node (Unit) N i ={j V : e ij = 1} : Neighborhood of unit i N i = N i : Degree of unit i W {0, 1} N : Treatment Vector Y i (W): Potential Outcomes Under SUTVA: Y i (W i, W Ni, W Ni ) = Y i (W i, W N i, W N i ) W Ni, W N i, W Ni, W N i SUTVA is untenable in the presence of network data Neighborhood-Level SUTVA: Y i (W i, W Ni, W Ni ) = Y i (W i, W Ni, W N i ) W Ni, W N i G-Neighborhood-Level SUTVA (SUTNVA). Let g i ( ) be a function g i : {0, 1} N i G i R Y i (W i, W Ni, W Ni ) = Y i (W i, W N i, W N i ) W Ni, W N i and W Ni, W N i : g i (W Ni ) = g i (W N i )

Main Effects and Spillover Effects A potential outcome Y i (w, g) is defined only for a subset of nodes V g = {i V : g G i } Main Effect: Average effect of the individual treatment, when the neighborhood treatment is set to g τ(g) = E [ Y i (W i = 1, G i = g) Y i (W i = 0, G i = g) i V g ] Overall Main Effect: Average effect of the individual treatment, averaged over the neighborhood treatment distribution τ = g G τ(g)p(g i = g) G = i G i Spillover Effect: average effect of having the neighborhood treatment at level g versus 0, when the individual treatment is set to z δ(g; w) = E [ Y i (W i = w, G i = g) Y i (W i = w, G i = 0) i V g ] Overall Spillover Effect (w): average of spillover effects δ(g; w) over the neighborhood treatment distribution (w) = g G δ(g; w)p(g i = g) G = i G i

Joint Treatment and Identifying Assumptions (Z i, G i ): Joint Treatment Observational study: The assignment mechanism is unknown and depends on covariates, Pr(W, G X, {Y(w, g), w = 0, 1; g G}) X = [X i] i = [X ind i, X neig i ] i where X ind i = Individual characteristics and X neig i = Summary of individual characteristics in neighboring units + Neighborhood structure (N i, Shared friends,...) Unconfoudedness Assumption of Joint Treatment Y i(w, g) W i, G i X i Under SUTNVA and unconfoudedness of the joint treatment an unbiased estimator for the adjusted average of Yi obs, conditional on the joint treatment Y obs w,g,x := E X [E[Y obs i W i = w, G i = g, X i, i V g ] W i = w, G i = g, i V g ] is unbiased for the marginal mean E[Y i(w, g) i V g]: Y obs w,g,x i = E[Y i(w, g) i V g]

Bias for main effects and overall main effects when SUTVA is wrongly assumed Observed adjusted mean difference τx obs = [ Y obs i W i = 1, X = x ] E [ Yi obs W i = 0, X = x ] p(x = x) x X E Under SUTVA and if Y i (w) W i X i, unbiased covariate-adjusted estimators of τ obs X are unbiased for E[Y i (1)] E[Y i (0)] Theorem 1. If Y i (w, g) W i, G i X i, then unbiased estimators of τ obs X are biased for τ Corollary 1. If Y i (w, g) W i, G i X i and W i G i X i, an unbiased estimator of τx obs is unbiased for τ, even in the presence of interference: τx obs = τ Corollary 2. If Y i (w, g) W i, G i X i but W i G i X i, then an unbiased estimator of is biased for τ τ obs X The bias depends on the level of interference and the association between W i and G i conditional on X i Theorem 2. If Y i (w, g) W i, G i X i, this bias due to interference is combined with the bias due to unmeasured confounders (U = X i \X i )

Joint propensity score Joint propensity score ψ(w; g; x) := Pr(W i = w, G i = g X i = x) = Pr(G i = g W i = w, X g i = x g ) Pr(W i = w X w i = x w ) }{{}}{{} λ(g;w;x g ) ϕ(w;x w ) Neighborhood Propensity Score Individual Propensity Score The cardinality of the neighborhood treatment depends of the function g i

Continuous treatments and the generalized propensity score Continuous treatment: Let G R be the set of values for the treatment Average dose-response function: µ(g) = E[Y i(g)] Weak unconfoundedness: G i Y i (g) X g i for all g G Let λ(g, x) = f G X g(g x g ) be the conditional density of the treatment given the covariates The GPS for a continuous treatments is Λ i = λ(g i, X g i ) Properties of the GPS The GPS is a balancing score: f G (g X g i ) = f G(g X g i, λ(g, X i)) = f G (g λ(g, X g i )) Weak unconfoundedness given the GPS. If the assignment to the treatment is weakly unconfounded given pretreatment variables X g, then, for every g, f G (g λ(g, X g i ), Y i(g)) = f G (g λ(g, X g i )) Bias Removal with GPS. If the assignment to the treatment is weakly unconfounded given pretreatment variables X, then, β(g, λ) = E [Y i (g) λ(g, X g i ) = λ] = E [ Yi obs G i = g, Λ i = λ ] µ(g) = E [β(g, λ(g, X g i ))] (e.g., Hirano and Imbens, 2004; Imai and Van Dyk, 2004; Bia and Mattei, 2008, 2012; Flores et al., 2012; Kluve et al., 2012; Zhao et al., 2013, Bia et al., 2014)

The generalized propensity score Matching usually unfeasible GPS allows avoiding to specify a model for the relationship between potential outcomes and covariates How to use GPS Estimate the GPS, e.g., using a flexible parametric approach: Let ˆΛ i be the estimated GPS Estimate the conditional expectation function of Yi obs given G i and Λ i as a flexible function of its two arguments: Ê [ Yi obs G i = g, Λ i = λ ] Estimate the[ average dose-response function at treatment level w averaging Ê Yi obs G i = g, ˆλ(g, ] X g i ) over ˆλ(g, X g i )

Joint propensity score Joint propensity score ψ(w; g; x) := Pr(W i = w, G i = g X i = x) = Pr(G i = g W i = w, X g i = x g ) Pr(W i = w X w i = x w ) }{{}}{{} λ(g;w;x g ) ϕ(w;x w ) Neighborhood Propensity Score Individual Propensity Score The joint propensity score is a balancing score: p(w i = w, G i = g X i, ψ(w; g; X i )) = p(w i = w, G i = g ψ(w; g; X i )) Conditional unconfoundedness of W i and G i given the joint / (individual + neighborhood) PS: If Y i (w, g) W i, G i X i then Y i (w, g) W i, G i ψ(w; g; X i ) Y i (w, g) W i, G i λ(g; w; X g i ), ϕ(w; Xw i )

Propensity Score-Based-Estimator (Subclassification + GPS) Subclassification on ϕ(1; X w i ) 1. Estimate ϕ(1; X w i ) (logistic regression for W i conditional on covariates X w i ) 2. Predict ϕ(1; X w i ) for each unit 3. Identify J subclasses B j, with j = 1,..., J, where X w i W i i B j Within each subclass B j estimate µ j (w, g) = E [ Y i (w, g) i B g ] j, where B g j = V g B j : 1. Estimate a model for the neighborhood propensity score λ(g; w; X g i ). 2. Use the observed data (Yi obs, W i, G i, X g i and Λ = λ(w i ; G i ; X g i )) to estimate a model Y i(w, g) λ(w; g; X g i ) f (w, g, λ(w; g; Xg i )) 3. For each unit i B g j, predict λ(w; g; Xg i ), and use it to predict Y i(w, g) 4. Estimate the dose-response function averaging the conditional potential outcomes over λ(w; g; X g i ) : i B g Ŷ i (w, g) j µ j(w, g) = B g j Derive the average dose-response function J µ(w, g) = µ j(w, g)π g j π g j = j=1 i V g 1(ϕ(1; X z i ) B j ) Standard errors and confidence intervals are derived using bootstrap methods v g

Some concluding remarks Propensity scores are powerful tools Must however be used with care Underlying assumptions are crucial They determine how the propensity score should be specified and estimated

References Achy-Brou A.C., Frangakis C.E. and Griswold M. (2010). Estimating treatment effects of longitudinal designs using regression models on propensity scores. Biometrics 66, 824-833. Arpino B. and Mealli F. (2011). The specification of the propensity score in multilevel observational studies. Computational Statistics and Data Analysis 55, 1770-1780 Aussems M.E. (2008). Multilevel data and propensity scores: an application to a virtual Y after-school program. Mimeo. Bia M. and Mattei A. (2008). A STATA Package for the estimation of the dose-response function through adjustment for the generalized propensity score. The STATA Journal 8, 354-373. Bia M. and Mattei A. (2012). Assessing the effect of the amount of financial aids to Piedmont firms using the generalized propensity score. Statistical Methods and Applications, 21, 485-516. Bia M., Flores A. C., Flores-Lagunes A., Mattei A. (2014). A Stata package for the application of semiparametric estimators of dose-response functions. The STATA Journal 14, 580-604. Cattaneo M. (2010). Efficient semiparametric estimation of multi-valued treatment effects under ignorability. Journal of Econometrics 155, 138-154. Flores C. A., Flores-Lagunes A. and Neumann T. (2012). Estimating the effects of length of exposure to instruction in a training program: The case of Job Corps. The Review of Economics and Statistics 94, 153-171. Forastiere L., Airoldi E. M., and Mealli F. (2016). Identification and estimation of treatment and interference effects in observational studies on networks. Arxiv working paper (http://arxiv.org/abs/1609.06245) Hirano K. and Imbens G.W. (2004). The propensity score with continuous treatments. In Applied Bayesian Modelling and Causal Inference from Missing Data Perspectives. Imai K. and Van Dyk D. (2004). Causal inference with general treatment regimes: generalizing the propensity score. JASA 99, 854-866.

References Imai K. and Ratkovic M. (2014). Covariate balancing propensity score. JRSS-B 76, 243-263. Imai K. and Ratkovic M. (2015). Robust Estimation of Inverse Probability Weights for Marginal Structural Models. JASA 110, 1013-1023. Imbens G.W. (2000). The role of the propensity score in estimating dose-response functions. Biometrika 87, 706-710. Kim J. and Seltzer M. (2007). Causal inference in multilevel settings in which selection process vary across schools. Working Paper 708. Center for the Study of Evaluation (CSE), Los Angeles. Keele L.J. and Zubizarreta J. (2017). Optimal Multilevel Matching in Clustered Observational Studies: A Case Study of the School Voucher System in Chile. JASA. Forthcoming. Kluve J., Schneider H., Uhlendorff A. and Zhao Z. (2012) Evaluating continuous training programmes by using the generalized propensity score. JRSS-A 175, 587-617 Lechner M. (2001) Identification and estimation of causal effects of multiple treatments under the conditional independence assumption. Econometric Evaluations of Active Labor Market Policies in Europe, 43-58 Li F., Zaslavsky A.M and Landrum M.B. (2013). Propensity score weighting with multilevel data. Statistics in Medicine 32, 3373-3387. Mattei A. and Mealli F. (2015) Commentary on On Bayesian estimation of marginal structural models by Saarela O., Stephens D. A., Moodie E. E. M., Klein M. B. Biometrics 71, 293-296. Papadogeorgou G., Zigler C. and Mealli F. (2017). Inverse probability weighted estimators under partial interference. (Work in progress) Pimentel S., Page L., Matthew L., and Lindsay and Keele, L. J. (2015). Optimal Multilevel Matching Using Network Flows: An Application to a Summer Reading Intervention. Mimeo (http://lukekeele.com/wp-content/uploads/2016/03/myon.pdf) Robins J. M., Hernan M. A. and Brumback B. (2000). Marginal structural models and causal inference in epidemiology. Epidemiology 11, 550-560. Su, Y., 2008. Causal inference of repeated observations: a synthesis of matching method and multilevel modeling. Paper Presented at the Annual Meeting of the APSA 2008, Boston, Massachusetts. Yang S., Imbens G.W., Cui Z., Faries D. E. and Kadziola Z. (2016). Propensity score matching and subclassification in observational studies with multi-level treatments. Biometrics 72,1055-1065. Zhao S., Van Dyk D. and Imai K. (2013) Propensity-score based methods for causal inference in observational studies with fixed non-binary treatments. Mimeo (http://imai.princeton.edu/research/files/gpscore.pdf)