Causal Inference with Counterfactuals

Size: px
Start display at page:

Download "Causal Inference with Counterfactuals"

Transcription

1 Causal Inference with Counterfactuals Robin Evans Hilary Introduction What does it mean to say that a (possibly random) variable X is a cause of the random variable Y? Certainly X does not determine the value of Y, so how can we claim causality through this noise? Example 1.1. Suppose you have a headache, and decide to take an aspirin. An hour later the headache is gone is this because you took the aspirin? Note we are not asking whether aspirin cures headaches in some more general sense, we wish to know whether this specific headache went away because of the decision to take aspirin. The only sensible way to answer this sort of question in the case of a specific event is to compare the outcome which you observed with the counterfactual outcome which you would have observed if you had chosen not to take the aspirin. Let X = 1 if the aspirin is taken, and 0 otherwise. Denote by Y = 1 the event that the headache has disappeared after an hour, Y = 0 otherwise. We might imagine that there is really a second, outcome which, contrary-to-fact, corresponds to what would have happened if you had not taken the aspirin. We can imagine this as a piece of missing data (Neyman, 1923; Rubin, 1974). We imagine therefore that we are characterized by a pair of variables (Y 0, Y 1 ), with four possible values. There is an individual-level causal effect if Y 0 Y 1 : the possible pairs are characterized in this table. Y 0 Y 1 Type t Y 0 0 Never Recover NR 0 1 Helped HE 1 0 Hurt HU 1 1 Always Recover AR Of course, we can logically only ever observe one of these two outcomes Y 0 or Y 1 : this is called the fundamental problem of causal inference. Definition 1.2. We will define Y Y X. This is sometimes formulated as the consistency assumption: X = x = Y = Y x. 1

2 C X Y X Y (a) (b) Figure 1: Graphs corresponding to (a) no confounding and (b) confounding by C. 1.1 Ignorability The fundamental problem means that is essentially impossible to identify individual causal effects. However, a lesser goal would be to consider E[Y 1 Y 0 ]. By consistency, if X = x, then Y x = Y. Hence E[Y X = x] = E[Y x X = x] = EY x if Y x X. Then inference about Y x can be done just by looking at the observed values of Y such that X = x. Definition 1.3. If Y x X then the marginal distribution of Y x is identifiable (and is the same as that of Y X = x). This assumption is called ignorability or sometimes exchangability. Remark 1.4. In our example ignorability requires that Y 0 X and Y 1 X, but not the joint independence (Y 0, Y 1 ) X, sometimes called strong ignorability. The latter is, of course, untestable, since it involves assumptions about different worlds. Although it is hard to think of examples in which the marginal independences hold but the joint independences do not, it is good practice to avoid untestable assumptions wherever possible! Ignorability might be interpreted as saying that the mechanism which assigns treatments is independent of the mechanism which turns those treatments into outcomes. Note that it does not imply that Y X: clearly Y Y X depends upon X in general. The idea of separating inputs and mechanisms for causal inferences has been exploited in other contexts (Janzing et al., 2012). Ignorability is related, but not identical, to a lack of confounding. Confounding is, in essence, when there is a common cause of X and Y, as represented by the node C in Figure 1(b). Any correlation between X and Y will be a combination of the causal effect of X on Y (represented by the direct arrow) and spurious correlation due to the common cause. In 2

3 C X x Y (x) X x Y (x) (a) (b) Figure 2: Single world intervention templates (SWITs) resulting from splitting node X in the graphs in Figure 1. the counterfactual world, this means that X is correlated with Y (x), as in Figure 2(b). When ignorability holds as in Figure 1(a) splitting the node X into its two components shows that X Y (x) (Richardson and Robins, 2013). Of course, the most common reason for ignorability holding is that treatment is randomized, and the estimator given is just the difference between two treatment groups. If confounding is measured, then we may obtain conditional ignorability, which requires that X Y (x) C for each x. In this case inference can proceed more or less as in the unconfounded case. In some cases randomization may be conditional upon a covariate (over sampling certain groups is common), or we may simply pick covariates which we believe are confounders in order to get sensible estimates. 1.2 Causal Effects Definition 1.5. The average causal effect is defined as ACE X Y E[Y (1) Y (0)]. Given i.i.d. observations (X i, Y i ), i = 1,..., n and assuming ignorability, we can estimate the average causal effect by ÂCE X Y = i X iy i i X i(1 X i)y i i i (1 X i). This is essentially a trick to exploit the linearity of expectations. For a positive continuous outcome Y, a perfectly reasonable measure of the causal effect would be E[Y (1)/Y (0)], but there is no simple way to estimate this using ordinary data. The null hypothesis of no population-level causal effect simply states that Y (0) Y (1), so the two potential outcomes are exchangable. This means in particular that the average causal effect is zero. This does not imply that there is no individual-level causal effect, which would require that Y i (0) = Y i (1) for each i. This is called the sharp null hypothesis. 1.3 Non-Compliance Consider a randomized trial in which patients are assigned a treatment Z, which they may or may not then choose to follow. Let X be the treatment actually taken, and Y some 3

4 t X, t Y Z X Y Figure 3: Graphs representing the non-compliance model with the exclusion restriction. outcome of interest. Since X is not randomly assigned, we cannot assume ignorability of Y given X: perhaps those most likely to take the treatment are also the healthiest patients. Example 1.6. The data in the table are from a randomized clinical trial of a treatment for high cholesterol. A treatment, Z was randomly assigned, and the treatment actually taken was recorded as X. An outcome measure based on reduced cholesterol levels was recorded as Y. The data are discussed in detail by Efron and Feldman (1991); the original continuous measurements were dichotomized by Pearl (2010). z x y count z x y count We are interested in the causal effect of the treatment taken (X) on the outcome (Y ). We can obtain a log-odds ratio of 3.3 with 95% confidence interval (2.7, 3.9). This suggests a very strong effect but is, of course, is just an association. We can consider the intention-to-treat effect of Z on Y, which gives 2.6 (2, 3.2). However, the conditions which influence the intention-to-treat effect are likely to be different in the world outside the trial. To model the non-compliance, we allow X to be one of two potential outcomes X z, which define an individual s compliance type: X(0) X(1) Type t X 0 0 Never Taker NT 0 1 Complier CO 1 0 Defier DE 1 1 Always Taker AT In general Y may be one of four potential outcomes, Y (x, z), giving up to 16 different types. However, we will enforce the exclusion restriction: Y (x, z) = Y (x, z ) for all z, z, so that the treatment assigned has no effect on the outcome other than through the treatment actually taken. This may or may not be realistic! We can then reduce the types for Y to the four usual ones. Define γ x i = E[Y (x) t X = i], i.e. the mean outcome for compliance type i when receiving 4

5 treatment x. We can define the average causal effect in this case as ACE X Y = i = i π i (E[Y 1 t X = i] E[Y 0 t X = i]) π i (γ 1 i γ 0 i ). It is not immediately clear whether or not this quantity is identifiable. 1.4 Identifiability and Inference Definition 1.7. A parameter is identifiable if it is a function of the observable probability distribution. A parameter is semi- or partially identifiable if the range of values it can take is restricted for some values of the observable probability distribution. Otherwise we say it is unidentifiable. The definition essentially state that something is identifiable if it can be determined precisely from a sufficiently large amount of data. Partial idetifiability implies that we may be able to narrow down the range of possible values with data, but not identify it preciesly. In the case of semi-identifiability there will be a range of possible values which are compatible with the observed probability distribution, so inference respecting the likelihood principle will not distinguish between them. Of course, we can assign a prior to the value of a parameter, but inference will be very sensitive to the choice of prior within the range of compatible values (Richardson et al., 2011). Denote by p X Z (x z) P (X = x Z = z) and p Y XZ (y x, z) P (Y = y Z = z, X = x) the observable conditional probability distributions. We will ignore the marginal distribution of P (Z = z), since randomization means that this is chosen by design, and is independent of all the potential outcomes. Example 1.8. To simplify matters, suppose that the treatment is not available to patients unless they are assigned to it, so that X(0) = 0. In other words, there are no Always Takers or Defiers. Then ACE X Y = π CO (γ 1 CO γ 0 CO) + π NT (γ 1 NT γ 0 NT). The quantities π CO and π NT are identifiable since In addition, π CO = p X Z (1 1), π NT = 1 π CO. γ 1 CO = p Y XZ (1 1, 1) γ 0 NT = p Y XZ (1 0, 1), since under our assumptions anyone who is observed to take the treatment is a complier, and anyone who fails to take the treatment when assigned it is a never taker. Also, p XY Z (0, 1 0) = π CO γ 0 CO + π NT γ 0 NT, so γ 0 CO can be recovered using the other information. 5

6 However, γnt 1 is totally unidentifiable, since we never observe what happens to a Never Taker unless they take the drug. Hence (exercise) ACE X Y = p XY Z (1, 1 1) p XY Z (0, 1 0) + γ 1 NT p X Z (0 1), where γnt 1 [0, 1]. This gives (exercise) p XY Z (0, 1 0) + p XY Z (1, 0 0) ACE X Y 1 p XY Z (0, 1 0) p XY Z (1, 0 1). The parameter ACE X Y is semi-identifiable, because its value depends upon γnt 1, but we can obtain non-trivial bounds. If one is willing to make assumptions about the effect of the treatment on never takers, then we can obtain tighter bounds. This separation of causal quantities into completely identifiable and completely unidentifiable is a sensible way of keeping track of our assumptions. A prior on an unidentified parameter will never be updated, regardless of how much data we collect. 1.5 Where logical positivists fear to tread The advantage of a view which allows for counterfactuals is that one can define causal concepts which otherwise do not exist. It is not possible to make the assumption of no Always Takers and no Defiers by setting p XY Z (1, y 0) = 0 for y {0, 1}. Therefore this constraint has an interpretation even without assuming the existence of well-defined counterfactual outcomes. For example, the complier average causal effect is defined as the effect of the treatment on individuals for whom (X 0, X 1 ) = (0, 1). That is: ACE CO γ 1 CO γ 0 CO. Unsurprisingly, this quantity does not have an interpretation without assuming the existence of well-defined compliance types. However, it is identifiable under the same assumptions as above. Such quantities may, under certain circumstances, be viewed as being more relevant than an intention-to-treat effect for the whole population, or enable us to make inferences about what could be achieved by a future intervention. For example, how much benefit would there be in educating Never Takers about the advantages of the treatment? 6

7 2 Probability Bounds Suppose we have a binary joint probability distribution p xy = P (X = x, Y = y) for x, y {0, 1}. If we observe the marginal distributions p x+ and p +y, what can we deduce about the joint distribution? The answer comes in the form of the Fréchet bounds: for any events A, B, there exists a probability distribution with probabilities P(A), P(B), P(A B) if and only if max{0, P(A) + P(B) 1} P(A B) min{p(a), P(B)}. The result is almost trivial, but as an illustration we show how to prove this with algebraic variable elimination. 1 library(rporta) M = rbind(cbind(diag(4), 0,0,0), c(1,1,1,1,0,0,1), c(1,1,0,0,-1,0,0), c(1,0,1,0,0,-1,0)) # first four columns represent joint probs, next two marginal # probs, last column constants M ## [,1] [,2] [,3] [,4] [,5] [,6] [,7] ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] # hence M[6,] means p{00} + p{10} - p{+0} (which we will set =0) sign = c(rep(1,4),0,0,0) X = as.ieqfile(m, sign) X ## DIM = 6 ## ## INEQUALITIES_SECTION ## (1) 1x1 + 0x2 + 0x3 + 0x4 + 0x5 + 0x6 >= 0 ## (2) 0x1 + 1x2 + 0x3 + 0x4 + 0x5 + 0x6 >= 0 ## (3) 0x1 + 0x2 + 1x3 + 0x4 + 0x5 + 0x6 >= 0 ## (4) 0x1 + 0x2 + 0x3 + 1x4 + 0x5 + 0x6 >= 0 ## (5) 1x1 + 1x2 + 1x3 + 1x4 + 0x5 + 0x6 == 1 ## (6) 1x1 + 1x2 + 0x3 + 0x4-1x5 + 0x6 == 0 1 The package rporta is no longer on CRAN, but can be installed manually from the archive. 7

8 ## (7) 1x1 + 0x2 + 1x3 + 0x4 + 0x5-1x6 == 0 ## ## END # we will eliminate joint probs except p{00} X@elimination_order = c(0,1,2,3,0,0) fmel(x) ## DIM = 6 ## ## INEQUALITIES_SECTION ## (1) -1x1 + 0x2 + 0x3 + 0x4 + 0x5 + 0x6 <= 0 ## (2) 1x1 + 0x2 + 0x3 + 0x4-1x5 + 0x6 <= 0 ## (3) 1x1 + 0x2 + 0x3 + 0x4 + 0x5-1x6 <= 0 ## (4) -1x1 + 0x2 + 0x3 + 0x4 + 1x5 + 1x6 <= 1 ## ## END # one can check these are just the Frechet bounds The situation under the potential outcomes framework is very similar: in principle we can observe the marginal distributions of Y 0 and Y 1, but not their joint distribution. We cannot be certain of the proportion of people π DE who are defiers, for example. However, since p X Z (0 1) = π DE + π NT, we can deduce that π DE p X Z (0 1), and therefore it is semi-identifiable. Returning to the non-compliance setting, let π ij = P(X 0 = i, X 1 = j) be the proportions of individuals who are of a particular response type. Then, for example, max{0, P(X 0 = 0) + P(X 1 = 1) 1} π 01 min{p(x 0 = 0), P(X 1 = 1)}. If we assume monotonicity (so that no-one is hurt), then π 11 = P(X 0 = 1), π 00 = P(X 1 = 0), π 01 = 1 π 00 π 11, and the strata proportions magically become identifiable (recall that, under ignorability, P(X z = x) = P(X = x Z = z)). This may be realistic in certain non-compliance settings, for example, and often enables identification. 2.1 Causal Effect Bounds We can set up a much larger example to deal with the average causal effect in the noncompliance model. M = matrix(0, 26, 26) # define observed probabilities in terms of POs M[1,c(2,4,6,8)] = 1 # p(0,0 0) 8

9 M[2,c(11,12,15,16)] = 1 M[3,c(1,3,5,7)] = 1 M[4,c(9,10,13,14)] = 1 M[5,c(6,8,14,16)] = 1 M[6,c(3,4,11,12)] = 1 M[7,c(5,7,13,15)] = 1 M[8,c(1,2,9,10)] = 1 # p(1,1 1) M[1:8,17:24] = -diag(8) # set positive probabilities M[9:24,1:16] = diag(16) M[,26] = c(rep(0,25),1) # define ACE M[25,] = c(rep(c(0,1,-1,0),4),rep(0,8),-1,0) # sum to 1 M[26,] = c(rep(1,16),rep(0,8),0,1) sign = c(rep(0,8),rep(1,16),0,0) X = as.ieqfile(m, sign) X@elimination_order = c(1:16,rep(0,9)) out = fmel(x) # inequalities involving ACE out@inequalities@num[-c(1:8,11:12,19,20,22,26),17:26] ## [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] ## [8,] ## [9,] ## [10,] ## [11,] ## [12,] ## [13,] ## [14,] ## [15,] ## [16,] We obtain 28 inequalities, of which 8 are just a consequence of working with probabilities, 4 are the instrumental inequalities, first derived by Pearl (1995). The remaining 16 (shown above) actually involve the ACE, and can be used to partially identify it; they were derived by Balke and Pearl (1997), improving on Manski and Robins (separately). Note that these bounds can also be obtained without using the counterfactual framework, since the average causal effect has an equivalent interpretation in (for example) 9

10 Pearl s do-calculus framework (Cai et al., 2008). If we choose to enforce additional constraints, such as the absence of Always Takers and Defiers, we can obtain tighter bounds. nodefy = matrix(0,nrow=8,ncol=ncol(m)) nodefy[,9:16] = diag(8) M2 = rbind(m, nodefy) sign2 = c(sign, rep(0,8)) X2 = as.ieqfile(m2, sign2) X2@elimination_order = c(1:16,rep(0,9)) out2 = fmel(x2) out2@inequalities@num[,17:26] ## [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] ## [8,] ## [9,] ## [10,] ## [11,] ## [12,] ## [13,] ## [14,] ## [15,] ## [16,] Which means p XY Z (0, 1 0) + p XY Z (1, 0 0) ACE X Y 1 p XY Z (0, 1 0) p XY Z (1, 0 1), as we deduced in the previous section. Example 2.1. Applying the bounds to our lipid data, we find library(rje, warn.conflicts = FALSE) dat = array(c(158, 0, 14, 0, 52, 23, 12, 78), c(2, 2, 2)) dat2 = condition.table(dat, 1:2, 3) dat2 # p(x,y z) ##,, 1 ## 10

11 ## [,1] [,2] ## [1,] ## [2,] ## ##,, 2 ## ## [,1] [,2] ## [1,] ## [2,] mat = out2@inequalities@num bds = mat[, 17:24] %*% c(dat2) - mat[, 26] bds[c(13, 16)] ## [1] So ÂCE X Y If we work using the likelihood principle, everything within these bounds is indistinguishable. Of course, it s perfectly possible to place priors on these bounds. The average causal effect for compliers is gamco1 = condition.table(dat, 2, c(1, 3), c(2, 2))[2] gamnt0 = condition.table(dat, 2, c(1, 3), c(1, 2))[2] pico = condition.table(dat, 1, 3, 2)[2] pint = 1 - pico gamco0 = (condition.table(dat, 1:2, 3)[1, 2, 1] - pint * gamnt0)/pico gamco1 - gamco0 ## [1] This is high relative to the possible interval of overall average causal effect, because the Never Takers seem to do better than Compliers who are assigned the placebo (i.e. γ 0 CO < γ 0 NT ). 11

12 M X Y Figure 4: Graph representing mediation. 3 Mediators Sometimes we are not interested in the total causal effect of one quantity on another, but only the effect of a particular pathway. This particularly arises when we move from asking does it work? to how does it work?. Example 3.1. Suppose we administer an estrogen treatment X and observe it to increase the effect of cardiovascular disease (Y ). What is the mechanism for the increased risk? One hypothesis is an increase in serum lipid concentrations (M). The proposed causal pathways are shown in Figure 4 Suppose we could administer a treatment which reduced the serum lipid concentrations even after estrogen therapy: how much effect would this have on cardiovascular disease rates? In other words, how much of the effect of estrogen on CVD is mediated through serum lipid concentrations, and how much is due to other factors? Ideally of course, we d like to decompose the effect of X on Y into a direct effect, and an indirect effect through M. However, in general this is not possible, because there may be an interaction between the effects of X and M on Y. For example, suppose Y acts as an xor gate, so that Y = X + M mod 2. Even if we accept that the relation is causal, then X s direct effect on Y is either positive or negative, depending upon the state of M. This is an extreme example, and we might hope that combined effects are at least monotonic, and perhaps even approximately additive on some appropriate scale. We begin by giving two possible definitions of a direct effect, and then fixing the indirect effect to be the residual difference. Definition 3.2. The total effect of X on Y is EY (x) EY (0) = EY (x, M(x)) EY (0, M(0)). The total direct effect is the ordinary average causal effect: Alternatively the natural direct effect is TDE E Y (x, M(x)) E Y (0, M(x)). NDE E Y (x, M(0)) E Y (0, M(0)); i.e. keeping the mediator at a baseline level. Correspondingly the natural and total indirect effects are given by the differences of the total effect and relevant direct effect: NIE E Y (x, M(x)) E Y (x, M(0)) TIE E Y (x, M(0)) E Y (0, M(0)). 12

13 M U X Y Figure 5: outcome. Graph representing mediation with confounding between the mediator and If ignorability holds between X and Y (i.e. X Y (x, m)), then we can estimate the total causal effect of X on Y. In order to obtain the cross-world quantities such as Y (x, M(x )) where x x, we need more complex ignorability constraints. Suppose that X M(x), Y (x, m) and M(x) Y (x, m) for each x, m. This would hold if, for example, X and M were both randomized. By applying the same method as before twice, we find EY (x, m) = P(Y = 1 X = x, M = m). Then E Y (x, M(x )) = E[E[ Y (x, M(x )) M(x ) = m]] = m E[ Y (x, m) M(x ) = m] P(M(x ) = m) = m = m E[ Y (x, m)] P(M(x ) = m) P(Y = 1 X = x, M = m) P(M(x ) = m); note that the third equality uses Y (x, m) M(x ), where x is possibly different to x. This is a cross-world independence, and seems untestable even in principal. The strong ignorability assumption corresponding to mediation is that {M(x) x X } {Y (x, m) x X, m M}, a joint independence between all potential outcomes. This is plausible in the presence of randomization of some sort, but in general it is quite likely that there is confounding between the two mechanisms which generate M and Y. 3.1 Intermediate Confounding If there is unmeasured confounding between M(x) and Y (x, m), then we cannot identify the direct and indirect effects of X on Y. We can still, however, obtain semi-identification. Recall that the model in Figure 3 induces the instrumental inequalities on the observed distribution p XY Z. We can imagine that there is no direct causal effect of X on Y whatever, and apply the instrumental inequalities to find that p MY X (m, y 1 x) + p MY X (m, 1 y x) 1, m, x, y {0, 1}. Hence, if these inequalities are violated, we may deduce that there is some direct effect of X on Y. More generally, we can derive bounds: 13

14 idx = array(1:64, rep(2, 6)) obj = rep(0, 64) obj[c(idx[, 1, 2,,, ])] = 1 obj[c(idx[, 2,,, 2, ])] = obj[c(idx[, 2,,, 2, ])] + 1 obj[c(idx[, 1, 1,,, ])] = obj[c(idx[, 1, 1,,, ])] - 1 obj[c(idx[, 2,,, 1, ])] = obj[c(idx[, 2,,, 1, ])] - 1 trans = matrix(0, 8, 64) trans[1, c(idx[1,, 1,,, ])] = 1 trans[2, c(idx[1,, 2,,, ])] = 1 trans[3, c(idx[2,,,, 1, ])] = 1 trans[4, c(idx[2,,,, 2, ])] = 1 trans[5, c(idx[, 1,, 1,, ])] = 1 trans[6, c(idx[, 1,, 2,, ])] = 1 trans[7, c(idx[, 2,,,, 1])] = 1 trans[8, c(idx[, 2,,,, 2])] = 1 trans = rbind(cbind(trans, -diag(8), 0), cbind(diag(64), matrix(0, 64, 9))) trans = rbind(trans, c(rep(1, 64), rep(0, 9))) trans = rbind(trans, c(obj, rep(0, 8), -1)) rhs = c(rep(0, 72), 1, 0) eq = c(rep(0, 8), rep(1, 64), 0, 0) X = as.ieqfile(cbind(trans, rhs), eq) X@elimination_order = c(1:64, rep(0, 9)) out = fmel(x) out@inequalities@num[, -c(1:64)] ## [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] ## [8,] ## [9,] ## [10,] ## [11,] ## [12,] ## [13,] ## [14,] ## [15,] ## [16,]

15 This leads to six inequalities involving the average causal effect: 1 2p X Z (1 1) + 2p XY Z (1, 0 0) max 1 2p X Z (0 1) + 2p XY Z (0, 0 0) p X Z (1 1) 2p XY Z (0, 0 0) ACE X Y min 1 + 2p X Z (0 1) 2p XY Z (1, 0 0). 1 Looking at the bounds, unsurprisingly one often finds that the inequalities are trivial and just tell us that the average causal effect is between 1 and 1. Of course, if we are willing to add in additional assumptions, we will obtain tighter bounds. M = out@inequalities@num M = M[M[, 73]!= 0, ] p = c(0.05, 0.05, 0.8, 0.1, 0.05, 0.05, 0.1, 0.8) # gives bounds of 0.4 to 1. cbind(m[, 65:72] %*% p - M[, 74], M[, 73]) ## [,1] [,2] ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] Principal Stratification If M(0) = M(1) = m, then it is clear that the total effect is equal to the natural and total direct effects. In this case there is little cause to doubt that Y (1, m) Y (0, m) is the sensible measure of the direct (and total) effect of X on Y. This leads to the idea of stratifying the population based upon the values of (M i (0), M i (1)), or in other words the compliance type. For Always Takers (m = 1) and Never Takers (m = 0), the definition of the direct effect may sensibly be described as PSDE(m) = E[Y (1, m) Y (0, m) M(0) = M(1) = m]. This is the Principal Stratum Direct Effect (Frangakis and Rubin, 2002). One may posit that there is a sensible relationship between these causal effects and the direct effects for the other compliance types. It is worth noting that Principal Stratification is one of only a few approaches which has essentially no interpretation outside the potential outcomes framework. Estimation of the PSDE is somewhat non-trivial, however, since there is no way to ascertain which individuals are of which compliance type. Ignorability and monotoninicity of compliance (i.e. no defiers) is sufficient for identification. Otherwise, typical approaches are Bayesian and treat the unobserved potential outcomes as missing data. There are pitfalls with this approach, as noted above. It is not obvious how principal stratification can be extended to cases where the mediator has many categories or is continuous. 15

16 3.3 Parametric Relationships For dealing with continuous outcomes (or discrete outcomes with a larger state space) it is necessary to impose parametric relationships. Robust methods exist to ensure that model misspecification does not lead to incorrectly inferring a causal effect where none exists. See, for example Richardson et al. (2011); Hernán and Robins (2014). References A. Balke and J. Pearl. Bounds on treatment effects from studies with imperfect compliance. Journal of the American Statistical Association, 92(439): , Z. Cai, M. Kuroki, J. Pearl, and J. Tian. Bounds on direct effects in the presence of confounded intermediate variables. Biometrics, 64(3): , B. Efron and D. Feldman. Compliance as an explanatory variable in clinical trials. Journal of the American Statistical Association, 86(413):9 17, C. E Frangakis and D. B Rubin. Principal stratification in causal inference. Biometrics, 58(1):21 29, M. A. Hernán and J. M. Robins. Causal Inference. Chapman & Hall/CRC, See D. Janzing, J. Mooij, K. Zhang, J. Lemeire, J. Zscheischler, P. Daniušis, B. Steudel, and B. Schölkopf. Information-geometric approach to inferring causal directions. Artificial Intelligence, 182:1 31, J. Neyman. On the application of probability theory to agricultural experiments. essay on principles Translated in Statistical Science, 5(4) pp , J. Pearl. On the testability of causal models with latent and instrumental variables. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI- 95), pages Morgan Kaufmann Publishers Inc., J. Pearl. Causality: models, reasoning and inference. Cambridge University Press, second edition, T. S. Richardson and J. M. Robins. Single world intervention graphs (SWIGs): A unification of the counterfactual and graphical approaches to causality. Technical Report 128, CSSS, University of Washington, T. S. Richardson, R. J. Evans, and J. M. Robins. Transparent parameterizations of models for potential outcomes. Bayesian Statistics, 9: , D. B. Rubin. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688,

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS Donald B. Rubin Harvard University 1 Oxford Street, 7th Floor Cambridge, MA 02138 USA Tel: 617-495-5496; Fax: 617-496-8057 email: rubin@stat.harvard.edu

More information

Package noncompliance

Package noncompliance Type Package Package noncompliance February 15, 2016 Title Causal Inference in the Presence of Treatment Noncompliance Under the Binary Instrumental Variable Model Version 0.2.2 Date 2016-02-11 A finite-population

More information

Bounds on Direct Effects in the Presence of Confounded Intermediate Variables

Bounds on Direct Effects in the Presence of Confounded Intermediate Variables Bounds on Direct Effects in the Presence of Confounded Intermediate Variables Zhihong Cai, 1, Manabu Kuroki, 2 Judea Pearl 3 and Jin Tian 4 1 Department of Biostatistics, Kyoto University Yoshida-Konoe-cho,

More information

Bounding the Probability of Causation in Mediation Analysis

Bounding the Probability of Causation in Mediation Analysis arxiv:1411.2636v1 [math.st] 10 Nov 2014 Bounding the Probability of Causation in Mediation Analysis A. P. Dawid R. Murtas M. Musio February 16, 2018 Abstract Given empirical evidence for the dependence

More information

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh) WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, 2016 Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh) Our team! 2 Avi Feller (Berkeley) Jane Furey (Abt Associates)

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea)

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling A.P.Dawid 1 and S.Geneletti 2 1 University of Cambridge, Statistical Laboratory 2 Imperial College Department

More information

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea earl Department of Computer Science UCLA OUTLINE Statistical vs. Causal Modeling: distinction and mental barriers N-R vs. structural model: strengths

More information

Single World Intervention Graphs (SWIGs):

Single World Intervention Graphs (SWIGs): Single World Intervention Graphs (SWIGs): Unifying the Counterfactual and Graphical Approaches to Causality Thomas Richardson Department of Statistics University of Washington Joint work with James Robins

More information

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Thomas S. Richardson University of Washington James M. Robins Harvard University Working

More information

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla. THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Modeling: Statistical vs. Causal Causal Models and Identifiability to

More information

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death Kosuke Imai First Draft: January 19, 2007 This Draft: August 24, 2007 Abstract Zhang and Rubin 2003) derives

More information

A Decision Theoretic Approach to Causality

A Decision Theoretic Approach to Causality A Decision Theoretic Approach to Causality Vanessa Didelez School of Mathematics University of Bristol (based on joint work with Philip Dawid) Bordeaux, June 2011 Based on: Dawid & Didelez (2010). Identifying

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Comment on `Causal inference, probability theory, and graphical insights' (by Stuart G. Baker) Permalink https://escholarship.org/uc/item/9863n6gg Journal Statistics

More information

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small Bounds on Causal Effects in Three-Arm Trials with Non-compliance Jing Cheng Dylan Small Department of Biostatistics and Department of Statistics University of Pennsylvania June 20, 2005 A Three-Arm Randomized

More information

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017 Causal Inference Miguel A. Hernán, James M. Robins May 19, 2017 ii Causal Inference Part III Causal inference from complex longitudinal data Chapter 19 TIME-VARYING TREATMENTS So far this book has dealt

More information

Transparent Parametrizations of Models for Potential Outcomes

Transparent Parametrizations of Models for Potential Outcomes BAYESIAN STATISTICS 9, pp. 569 610. J. M. Bernardo, M. J. Bayarri, J. O. Berger, A. P. Dawid, D. Heckerman, A. F. M. Smith and M. West (Eds.) c Oxford University Press, 2011 Transparent Parametrizations

More information

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data Biometrics 000, 000 000 DOI: 000 000 0000 Discussion of Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing Data Dylan S. Small

More information

Graphical Representation of Causal Effects. November 10, 2016

Graphical Representation of Causal Effects. November 10, 2016 Graphical Representation of Causal Effects November 10, 2016 Lord s Paradox: Observed Data Units: Students; Covariates: Sex, September Weight; Potential Outcomes: June Weight under Treatment and Control;

More information

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy Department of Economics, Harvard University 1 / 40 Agenda instrumental variables part I Origins of instrumental

More information

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Comparison of Three Approaches to Causal Mediation Analysis Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Introduction Mediation defined using the potential outcomes framework natural effects

More information

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE insert p. 90: in graphical terms or plain causal language. The mediation problem of Section 6 illustrates how such symbiosis clarifies the

More information

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable

More information

Single World Intervention Graphs (SWIGs):

Single World Intervention Graphs (SWIGs): Single World Intervention Graphs (SWIGs): Unifying the Counterfactual and Graphical Approaches to Causality Thomas Richardson Department of Statistics University of Washington Joint work with James Robins

More information

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable

More information

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla. OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) Statistical vs. Causal vs. Counterfactual inference: syntax and semantics

More information

Sensitivity analysis and distributional assumptions

Sensitivity analysis and distributional assumptions Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu

More information

A proof of Bell s inequality in quantum mechanics using causal interactions

A proof of Bell s inequality in quantum mechanics using causal interactions A proof of Bell s inequality in quantum mechanics using causal interactions James M. Robins, Tyler J. VanderWeele Departments of Epidemiology and Biostatistics, Harvard School of Public Health Richard

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

arxiv: v2 [stat.me] 21 Nov 2016

arxiv: v2 [stat.me] 21 Nov 2016 Biometrika, pp. 1 8 Printed in Great Britain On falsification of the binary instrumental variable model arxiv:1605.03677v2 [stat.me] 21 Nov 2016 BY LINBO WANG Department of Biostatistics, Harvard School

More information

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causality II: How does causal inference fit into public health and what it is the role of statistics? Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Probabilistic Causal Models

Probabilistic Causal Models Probabilistic Causal Models A Short Introduction Robin J. Evans www.stat.washington.edu/ rje42 ACMS Seminar, University of Washington 24th February 2011 1/26 Acknowledgements This work is joint with Thomas

More information

ECO Class 6 Nonparametric Econometrics

ECO Class 6 Nonparametric Econometrics ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................

More information

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL BRENDAN KLINE AND ELIE TAMER Abstract. Randomized trials (RTs) are used to learn about treatment effects. This paper

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Growth Mixture Modeling and Causal Inference. Booil Jo Stanford University

Growth Mixture Modeling and Causal Inference. Booil Jo Stanford University Growth Mixture Modeling and Causal Inference Booil Jo Stanford University booil@stanford.edu Conference on Advances in Longitudinal Methods inthe Socialand and Behavioral Sciences June 17 18, 2010 Center

More information

arxiv: v3 [stat.me] 20 Feb 2016

arxiv: v3 [stat.me] 20 Feb 2016 Posterior Predictive p-values with Fisher Randomization Tests in Noncompliance Settings: arxiv:1511.00521v3 [stat.me] 20 Feb 2016 Test Statistics vs Discrepancy Variables Laura Forastiere 1, Fabrizia Mealli

More information

CompSci Understanding Data: Theory and Applications

CompSci Understanding Data: Theory and Applications CompSci 590.6 Understanding Data: Theory and Applications Lecture 17 Causality in Statistics Instructor: Sudeepa Roy Email: sudeepa@cs.duke.edu Fall 2015 1 Today s Reading Rubin Journal of the American

More information

Introduction to Causal Calculus

Introduction to Causal Calculus Introduction to Causal Calculus Sanna Tyrväinen University of British Columbia August 1, 2017 1 / 1 2 / 1 Bayesian network Bayesian networks are Directed Acyclic Graphs (DAGs) whose nodes represent random

More information

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH The First Step: SAMPLE SIZE DETERMINATION THE ULTIMATE GOAL The most important, ultimate step of any of clinical research is to do draw inferences;

More information

I. Partial identification of causal parameters with multiple proposed instruments

I. Partial identification of causal parameters with multiple proposed instruments Supplemental Materials Table of contents of Supplemental Materials: I. Partial identification of causal parameters with multiple proposed instruments II. Data example III. R code I. Partial identification

More information

From Causality, Second edition, Contents

From Causality, Second edition, Contents From Causality, Second edition, 2009. Preface to the First Edition Preface to the Second Edition page xv xix 1 Introduction to Probabilities, Graphs, and Causal Models 1 1.1 Introduction to Probability

More information

The compliance score as a regressor in randomized trials

The compliance score as a regressor in randomized trials Biostatistics (2003), 4, 3,pp. 327 340 Printed in Great Britain The compliance score as a regressor in randomized trials MARSHALL M. JOFFE, THOMAS R. TEN HAVE, COLLEEN BRENSINGER Department of Biostatistics

More information

Uni- and Bivariate Power

Uni- and Bivariate Power Uni- and Bivariate Power Copyright 2002, 2014, J. Toby Mordkoff Note that the relationship between risk and power is unidirectional. Power depends on risk, but risk is completely independent of power.

More information

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives Estimating the Dynamic Effects of a Job Training Program with Multiple Alternatives Kai Liu 1, Antonio Dalla-Zuanna 2 1 University of Cambridge 2 Norwegian School of Economics June 19, 2018 Introduction

More information

Recall from last time: Conditional probabilities. Lecture 2: Belief (Bayesian) networks. Bayes ball. Example (continued) Example: Inference problem

Recall from last time: Conditional probabilities. Lecture 2: Belief (Bayesian) networks. Bayes ball. Example (continued) Example: Inference problem Recall from last time: Conditional probabilities Our probabilistic models will compute and manipulate conditional probabilities. Given two random variables X, Y, we denote by Lecture 2: Belief (Bayesian)

More information

Distinguishing between Cause and Effect: Estimation of Causal Graphs with two Variables

Distinguishing between Cause and Effect: Estimation of Causal Graphs with two Variables Distinguishing between Cause and Effect: Estimation of Causal Graphs with two Variables Jonas Peters ETH Zürich Tutorial NIPS 2013 Workshop on Causality 9th December 2013 F. H. Messerli: Chocolate Consumption,

More information

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015 Introduction to causal identification Nidhiya Menon IGC Summer School, New Delhi, July 2015 Outline 1. Micro-empirical methods 2. Rubin causal model 3. More on Instrumental Variables (IV) Estimating causal

More information

Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications

Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications University of Miami Scholarly Repository Open Access Dissertations Electronic Theses and Dissertations 2013-07-11 Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications

More information

Potential Outcomes and Causal Inference I

Potential Outcomes and Causal Inference I Potential Outcomes and Causal Inference I Jonathan Wand Polisci 350C Stanford University May 3, 2006 Example A: Get-out-the-Vote (GOTV) Question: Is it possible to increase the likelihood of an individuals

More information

Conditional probabilities and graphical models

Conditional probabilities and graphical models Conditional probabilities and graphical models Thomas Mailund Bioinformatics Research Centre (BiRC), Aarhus University Probability theory allows us to describe uncertainty in the processes we model within

More information

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles CAUSALITY Models, Reasoning, and Inference Judea Pearl University of California, Los Angeles 1 CAMBRIDGE UNIVERSITY PRESS Preface page xiii 1 Introduction to Probabilities, Graphs, and Causal Models 1

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2004 Paper 155 Estimation of Direct and Indirect Causal Effects in Longitudinal Studies Mark J. van

More information

Modeling Mediation: Causes, Markers, and Mechanisms

Modeling Mediation: Causes, Markers, and Mechanisms Modeling Mediation: Causes, Markers, and Mechanisms Stephen W. Raudenbush University of Chicago Address at the Society for Resesarch on Educational Effectiveness,Washington, DC, March 3, 2011. Many thanks

More information

Estimation of direct causal effects.

Estimation of direct causal effects. University of California, Berkeley From the SelectedWorks of Maya Petersen May, 2006 Estimation of direct causal effects. Maya L Petersen, University of California, Berkeley Sandra E Sinisi Mark J van

More information

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization

More information

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work

More information

Identification and Estimation of Causal Effects from Dependent Data

Identification and Estimation of Causal Effects from Dependent Data Identification and Estimation of Causal Effects from Dependent Data Eli Sherman esherman@jhu.edu with Ilya Shpitser Johns Hopkins Computer Science 12/6/2018 Eli Sherman Identification and Estimation of

More information

Causal Models with Hidden Variables

Causal Models with Hidden Variables Causal Models with Hidden Variables Robin J. Evans www.stats.ox.ac.uk/ evans Department of Statistics, University of Oxford Quantum Networks, Oxford August 2017 1 / 44 Correlation does not imply causation

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

Sensitivity checks for the local average treatment effect

Sensitivity checks for the local average treatment effect Sensitivity checks for the local average treatment effect Martin Huber March 13, 2014 University of St. Gallen, Dept. of Economics Abstract: The nonparametric identification of the local average treatment

More information

Estimating direct effects in cohort and case-control studies

Estimating direct effects in cohort and case-control studies Estimating direct effects in cohort and case-control studies, Ghent University Direct effects Introduction Motivation The problem of standard approaches Controlled direct effect models In many research

More information

Comparative effectiveness of dynamic treatment regimes

Comparative effectiveness of dynamic treatment regimes Comparative effectiveness of dynamic treatment regimes An application of the parametric g- formula Miguel Hernán Departments of Epidemiology and Biostatistics Harvard School of Public Health www.hsph.harvard.edu/causal

More information

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables N-1 Experiments Suffice to Determine the Causal Relations Among N Variables Frederick Eberhardt Clark Glymour 1 Richard Scheines Carnegie Mellon University Abstract By combining experimental interventions

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

Introduction to Causal Inference. Solutions to Quiz 4

Introduction to Causal Inference. Solutions to Quiz 4 Introduction to Causal Inference Solutions to Quiz 4 Teppei Yamamoto Tuesday, July 9 206 Instructions: Write your name in the space provided below before you begin. You have 20 minutes to complete the

More information

Causal Mechanisms Short Course Part II:

Causal Mechanisms Short Course Part II: Causal Mechanisms Short Course Part II: Analyzing Mechanisms with Experimental and Observational Data Teppei Yamamoto Massachusetts Institute of Technology March 24, 2012 Frontiers in the Analysis of Causal

More information

Counterfactual Reasoning in Algorithmic Fairness

Counterfactual Reasoning in Algorithmic Fairness Counterfactual Reasoning in Algorithmic Fairness Ricardo Silva University College London and The Alan Turing Institute Joint work with Matt Kusner (Warwick/Turing), Chris Russell (Sussex/Turing), and Joshua

More information

Bootstrapping Sensitivity Analysis

Bootstrapping Sensitivity Analysis Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.

More information

arxiv: v2 [math.st] 4 Mar 2013

arxiv: v2 [math.st] 4 Mar 2013 Running head:: LONGITUDINAL MEDIATION ANALYSIS 1 arxiv:1205.0241v2 [math.st] 4 Mar 2013 Counterfactual Graphical Models for Longitudinal Mediation Analysis with Unobserved Confounding Ilya Shpitser School

More information

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Proceedings of Machine Learning Research vol 73:21-32, 2017 AMBN 2017 Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Jose M. Peña Linköping University Linköping (Sweden) jose.m.pena@liu.se

More information

Modeling Log Data from an Intelligent Tutor Experiment

Modeling Log Data from an Intelligent Tutor Experiment Modeling Log Data from an Intelligent Tutor Experiment Adam Sales 1 joint work with John Pane & Asa Wilks College of Education University of Texas, Austin RAND Corporation Pittsburgh, PA & Santa Monica,

More information

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016 Mediation analyses Advanced Psychometrics Methods in Cognitive Aging Research Workshop June 6, 2016 1 / 40 1 2 3 4 5 2 / 40 Goals for today Motivate mediation analysis Survey rapidly developing field in

More information

The problem of causality in microeconometrics.

The problem of causality in microeconometrics. The problem of causality in microeconometrics. Andrea Ichino University of Bologna and Cepr June 11, 2007 Contents 1 The Problem of Causality 1 1.1 A formal framework to think about causality....................................

More information

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017 Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability Alessandra Mattei, Federico Ricciardi and Fabrizia Mealli Department of Statistics, Computer Science, Applications, University

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (www.cs.ucla.

CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (www.cs.ucla. CAUSAL INFERENCE IN STATISTICS A Gentle Introduction Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/jsm12) OUTLINE Inference: Statistical vs. Causal, distinctions, and mental

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2010 Paper 117 Estimating Causal Effects in Trials Involving Multi-treatment Arms Subject to Non-compliance: A Bayesian Frame-work

More information

Harvard University. Harvard University Biostatistics Working Paper Series. On Partial Identification of the Pure Direct Effect

Harvard University. Harvard University Biostatistics Working Paper Series. On Partial Identification of the Pure Direct Effect Harvard University Harvard University Biostatistics Working Paper Series Year 2015 Paper 196 On Partial Identification of the Pure Direct Effect Caleb Miles Phyllis Kanki Seema Meloni Eric Tchetgen Tchetgen

More information

Casual Mediation Analysis

Casual Mediation Analysis Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction

More information

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks Y. Xu, D. Scharfstein, P. Mueller, M. Daniels Johns Hopkins, Johns Hopkins, UT-Austin, UF JSM 2018, Vancouver 1 What are semi-competing

More information

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance Julia Y. Lin Thomas R. Ten Have Michael R. Elliott Julia Y. Lin is a doctoral candidate (E-mail: jlin@cceb.med.upenn.edu),

More information

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables Causal inference in biomedical sciences: causal models involving genotypes Causal models for observational data Instrumental variables estimation and Mendelian randomization Krista Fischer Estonian Genome

More information

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014 Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Technical Track Session I: Causal Inference

Technical Track Session I: Causal Inference Impact Evaluation Technical Track Session I: Causal Inference Human Development Human Network Development Network Middle East and North Africa Region World Bank Institute Spanish Impact Evaluation Fund

More information

review session gov 2000 gov 2000 () review session 1 / 38

review session gov 2000 gov 2000 () review session 1 / 38 review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review

More information

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized. Clinical Trials Subject to Noncompliance.

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized. Clinical Trials Subject to Noncompliance. Draft June 6, 006 A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized Clinical Trials Subject to Noncompliance Roderick Little 1, Qi Long and Xihong Lin 3 Abstract We

More information

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance Identi cation of Positive Treatment E ects in Randomized Experiments with Non-Compliance Aleksey Tetenov y February 18, 2012 Abstract I derive sharp nonparametric lower bounds on some parameters of the

More information

Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding

Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Summer 8-13-2010 Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of

More information

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles (

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles ( CAUSAL INFERENCE AS COMUTATIONAL LEARNING Judea earl University of California Los Angeles www.cs.ucla.edu/~judea OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

Causal Inference from Experimental Data

Causal Inference from Experimental Data 30th Fisher Memorial Lecture 10 November 2011 hypothetical approach counterfactual approach data Decision problem I have a headache. Should I take aspirin? Two possible treatments: t: take 2 aspirin c:

More information

Guanglei Hong. University of Chicago. Presentation at the 2012 Atlantic Causal Inference Conference

Guanglei Hong. University of Chicago. Presentation at the 2012 Atlantic Causal Inference Conference Guanglei Hong University of Chicago Presentation at the 2012 Atlantic Causal Inference Conference 2 Philosophical Question: To be or not to be if treated ; or if untreated Statistics appreciates uncertainties

More information

Structure learning in human causal induction

Structure learning in human causal induction Structure learning in human causal induction Joshua B. Tenenbaum & Thomas L. Griffiths Department of Psychology Stanford University, Stanford, CA 94305 jbt,gruffydd @psych.stanford.edu Abstract We use

More information

Extending causal inferences from a randomized trial to a target population

Extending causal inferences from a randomized trial to a target population Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh

More information

Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials

Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials UW Biostatistics Working Paper Series 2-19-2009 Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials Leslie Taylor UW, taylorl@u.washington.edu Xiao-Hua

More information