GRAPHICAL REPRESENTATION OF CAUSAL EFFECTS (CHAPTER 6) BIOS Graphical Representation

Size: px
Start display at page:

Download "GRAPHICAL REPRESENTATION OF CAUSAL EFFECTS (CHAPTER 6) BIOS Graphical Representation"

Transcription

1 GRPHICL REPRESENTTION OF CUSL EFFECTS (CHPTER 6) BIOS Graphical Representation

2 Graphical representation of causal effects ( 6) Outline 6.1 Causal diagrams 6.2 Causal diagrams and marginal independence 6.3 Causal diagrams and conditional independence 6.4 Graphs, counterfactuals, and interventions 6.5 structural classification of bias 6.6 The structure of effect modification BIOS Graphical Representation

3 ( ) = =1 ( ). Consider Figure Causal diagrams L Figure 6.1 Figure 6.2 effect not mediated through any other va individual, or that we are unwilling to a do not exist. lternatively, the lack of a willingtoassume,that has no direct ca the population. For example, in Figure 6 either we know that disease severity affec transplant or that we are not willing to diagram does not distinguish whether an a protective effect. Furthermore, if, as i two causes, the diagram does not encode Causal diagrams like the one in Figu graphs, which is commonly abbreviated edges imply a direction: because the arro, but not the other way around. cy variable cannot cause itself, either direct Directed acyclic graphs have applicat we focus on causal directed acyclic graph is causal if the common causes of any p in the graph. For example, suppose in assigned to heart transplant with a pro of their disease. Then is a common included in the graph, as shown in the suppose in our study individuals are ra with the same probability regardless of a common cause of and and need n Figure 6.1 represents a conditionally ran 6.2 represents a marginally randomized e Figure 6.1 may also represent an obs Three nodes representing random variables (L,, ) and three directed edges (arrows). Time flows from left to right, and thus L is temporally prior to and, etc s in previous chapters, L,, and represent disease severity, heart transplant, and death, respectively BIOS Graphical Representation

4 6.1 Causal diagrams V W indicates there is a direct causal effect (not mediated through any other variables on the graph) for at least one individual, or unwilling to assume o/w ( ) = Lack of an arrow means that we know, or assume, that V has no direct causal effect on W for any individual Figure 6-1 unmeasured, of any pair of variables on the graph are themselves on the graph descendants. Causal DGs are of no practical use unless we make an assumption linki the DG to the data obtained in a study. This assumption, referred to as the c conditional on its direct causes, a variable is independent of any variable conditional on its parents, is independent of its non-descendants. This latter to the statement that the density ( ) of the variables in DG satisfies t L Figure 6.1 =1 ( ). effect not mediated through any other va individual, or that we are unwilling to a do not exist. lternatively, the lack of a willingtoassume,that has no direct ca the population. For example, in Figure 6 either we know that disease severity affec transplant or that we are not willing to diagram does not distinguish whether an disease severity affects the probability of receiving a heart transplant (or we are not willing to assume otherwise) a protective effect. Furthermore, if, as i two causes, the diagram does not encode Causal diagrams like the one in Figu graphs, which is commonly abbreviated edges imply a direction: because the arro, but not the other way around. cy variable cannot cause itself, either direct Directed acyclic graphs have applicat we focus on causal directed acyclic graph is causal if the common causes of any p in the graph. For example, suppose in assigned to heart transplant with a pro BIOS Graphical Representation

5 Direct cyclic Graphs (DGs) 6.1 Causal Diagrams Directed: L implies L causes but not the other direction cyclic: there are no cycles; a variable cannot cause itself, either directly or through other variables Causal DG is a DG where common causes of any pair of variables in the graph is also on the graph BIOS Graphical Representation

6 Technical Point 6.1 DG G is a graph whose nodes (vertices) are random variables V = (V 1,...,V m ) with directed edges (arrows) and no directed cycles ( ) = P m parents of V m, i.e., set of nodes from which there is a direct arrow into V m Figure 6.1: descendants. Causal DGs are of no practical use unless we make an assumption linki the DG to the data obtained in a study. This assumption, referred to as the c conditional on its direct causes, a variable is independent of any variable conditional on its parents, is independent of its non-descendants. This latter to the statement that the density ( ) of the variables in DG satisfies t L Figure 6.1 Let V 1 = L, V 2 =, V 3 =. Then P 3 = {, } =1 ( ). effect not mediated through any other va individual, or that we are unwilling to a do not exist. lternatively, the lack of a willingtoassume,that has no direct ca the population. For example, in Figure 6 either we know that disease severity affec transplant or that we are not willing to diagram does not distinguish whether an a protective effect. Furthermore, if, as i two causes, the diagram does not encode Causal diagrams like the one in Figu graphs, which is commonly abbreviated edges imply a direction: because the arro, but not the other way around. cy variable cannot cause itself, either direct Directed acyclic graphs have applicat we focus on causal directed acyclic graph is causal if the common causes of any p in the graph. For example, suppose in assigned to heart transplant with a pro of their disease. Then is a common included in the graph, as shown in the BIOS Graphical Representation

7 Technical Point 6.1 Two vertices joined by an edge are said to be adjacent path between X and consists of a sequence of distinct vertices that are adjacent X B C D V m is a descendant of V j (and V j is an ancestor of V m ) if there is a directed path V j V m BIOS Graphical Representation

8 Technical Point 6.1 Notational convention: if m > j then V m is not an ancestor of V j G is a causal DG if 1. Lack of arrow from V j to V m can be interpreted as the absence of a direct causal effect of V j on V m (relative to the other variables on the graph) 2. ll common causes, even if unmeasured, of any pair of variables on the graph are themselves on the graph 3. ny variable is a cause of its descendants BIOS Graphical Representation

9 Technical Point 6.1 Causal Markov assumption, which links DG to the observable random variable, states: Conditional on its direct causes, a variable V j is independent of any variable for which it is not a cause Conditional on its parents, V j is independent of its non-descendants Distribution f (V ) of variables V in DG G satisfies the Markov factorization f (v) = M j=1 f (v j pa j ) BIOS Graphical Representation

10 ( ) = =1 ( ). Figure Causal Diagrams L Figure 6.1 Figure 6.2 effect not mediated through any other va individual, or that we are unwilling to a do not exist. lternatively, the lack of a willingtoassume,that has no direct ca the population. For example, in Figure 6 either we know that disease severity affec transplant or that we are not willing to diagram does not distinguish whether an a protective effect. Furthermore, if, as i two causes, the diagram does not encode Causal diagrams like the one in Figu graphs, which is commonly abbreviated edges imply a direction: because the arro, but not the other way around. cy variable cannot cause itself, either direct Directed acyclic graphs have applicat we focus on causal directed acyclic graph is causal if the common causes of any p in the graph. For example, suppose in assigned to heart transplant with a pro of their disease. Then is a common included in the graph, as shown in the suppose in our study individuals are ra with the same probability regardless of a common cause of and and need n Figure 6.1 represents a conditionally ran 6.2 represents a marginally randomized e Figure 6.1 may also represent an obs Could represent (i) a conditional randomized experiment or (ii) observational study where we assume conditional exchangeability a L Note how the DG does not includes a. This disconnect will be ignored for now; formally incorporating potential outcomes into graphs via SWIGs (single world intervention graphs) will be discussed later BIOS Graphical Representation

11 Figure Causal Diagrams Figure 6.2 Directed acyclic graphs have applicat we focus on causal directed acyclic graph is causal if the common causes of any p in the graph. For example, suppose in assigned to heart transplant with a pro of their disease. Then is a common included in the graph, as shown in the suppose in our study individuals are ra with the same probability regardless of a common cause of and and need n Figure 6.1 represents a conditionally ran 6.2 represents a marginally randomized e Could represent (i) a marginally randomized experiment or (ii) observational study where we were willing to assume (unconditional) exchangeability a Below we discuss key results from graph theory that allows us to read off certain conditional independencies from a DG Heuristically, in Figure 6.2 we would expect and to be associated Figure 6.1 may also represent an obs BIOS Graphical Representation

12 Causal Diagrams and Marginal Independence Let L denote cigarette smoking, carrying a lighter, and lung cancer ssume does not affect Figure Causal diagrams and marginal independence L Figure 6.3 path between two variables and in a DG is a route that connects and by following a sequence Consider the following two examples. Fir has a preventive causal effect on the ri 1] 6= Pr[ =0 =1]. The causal diagram tion of this knowledge for an experiment unconditionally, assigned. Second, suppo has no causal effect (causative or prevent i.e., Pr[ =1 =1]=Pr[ =0 =1], and t effect on both carrying a lighter and l Figure 6.3 is the graphical translation of between and indicates that carrying on lung cancer; is depicted as a comm To draw Figures 6.2 and 6.3 we only u relations among the variables in the dia diagrams also encode information about Continuing with heuristic/intutive arguments, we might expect for this DG that and are associated due to common cause L exactly, the lack of them) among the var heuristically that, in general, the variabl Figure 6.2 and 6.3, and describe key rela Take first the randomized experiment one would expect that two variables would be associated. nd that is exac one knows that has a causal effect on also generally expect and to be ass with the fact that, in an ideal randomize changeability, causation Pr[ =1 =1] Pr[ =1 =1]6= Pr[ =1 =0], a tures the causation-association correspon alization of the paths between two varia Eg, suppose L is Louise s hair color, and and are the hair color of her two children lice and usuf association flows. ssociation, unlike ca between two variables; thus, when presen BIOS Graphical Representation

13 6.2 Causal Diagrams and Marginal Independence genetic haplotype, cigarette smoking, L heart disease ssume does not affect Figure 6.4 leads to the same methods described here, we do not include decision nodes in chapter. Because we were always explicit about the potential interventions on t represent the potential interventions) would be somewhat redundant. Figure 6.4 L terms, and are associated because th (or, equivalently, from to ) throug Let us now consider a third example. S haplotype has no causal effect on an smoker, i.e., Pr[ =1 =1]=Pr[ =0 and cigarette smoking have a causal The causal diagram in Figure 6.4 is the gr The lack of an arrow between and i have a causal effect on cigarette smoking, of and. In graph theory the commo on the path because two arro gain the question is whether an question, imagine that another investiga lotype on the risk of becoming a cigare is no effect but this is unknown to the in minations on a large number of children up becoming smokers. pollo is one of t pollo does not have the haplotype ( come a cigarette smoker ( =1) than th haplotype does not improve our abili the risk in those with ( =1)andwitho or Pr[ =1 =1]=Pr[ =1 =0]. conclude that and are not associat q. The knowledge that both and when considering the association betwee firms our intuition because it says that c the flow of association along the path o independent because the only path betw Lack of arrow from to reflects assumption does not affect L is a common effect of and L is a collider on the path L because two arrowheads collide on this node Note the concept of collider is path specific BIOS Graphical Representation

14 6.2 Causal Diagrams and Marginal Independence Figure 6.4 Figure 6.4 Intuitively, we might expect that and are independent L terms, and are associated because th (or, equivalently, from to ) throug Let us now consider a third example. S haplotype has no causal effect on an smoker, i.e., Pr[ =1 =1]=Pr[ =0 and cigarette smoking have a causal The causal diagram in Figure 6.4 is the gr The lack of an arrow between and i have a causal effect on cigarette smoking, of and. In graph theory the commo on the path because two arro gain the question is whether an question, imagine that another investiga lotype on the risk of becoming a cigare is no effect but this is unknown to the in minations on a large number of children up becoming smokers. pollo is one of t pollo does not have the haplotype ( come a cigarette smoker ( =1) than th haplotype does not improve our abili the risk in those with ( =1)andwitho or Pr[ =1 =1]=Pr[ =1 =0]. conclude that and are not associat q. The knowledge that both and when considering the association betwee firms our intuition because it says that c the flow of association along the path o independent because the only path betw the collider. In summary, two variables are (mar other, or if they share common causes. O dependent. The next section explores the Graph theory in fact confirms this intuition: colliders block the flow of association along the path on which they lie Eg, suppose and denote hair color of braham and olanda, and L denotes the hair color of their son Lionel BIOS Graphical Representation

15 6.2 Causal Diagrams and Marginal Independence Heuristic summary: Two variables are (marginally) associated if 1. one causes the other (Figure 6.2) 2. they share common causes (Figure 6.3) Otherwise they will be (marginally) independent (Figure 6.4) The next section explores the conditions under which two variables and may be independent conditionally on a third variable L BIOS Graphical Representation

16 6.3 Causal diagrams and conditional independence Consider in Figure 6.2 we had where aspirin affects the risk of death Suppose this is because aspirin reduces platelet aggregration B Is there an association between and conditional on B? Figure B Figure 6.5 Because no conditional independences are expected in complete causal diagrams (those in which all possible arrows are present), it is often said that information about associations is in the missing arrows. associated because aspirin has a causal we obtain an additional piece of informat because it reduces platelet aggregation into the causal diagram of Figure 6.5 t high, 0: low) as a mediator of the effect Once a third variable is introduced in question: is there an association between on)? Or, equivalently: when we alread mation about improve our ability to suppose data were collected on,, an and that we restrict the analysis to the s aggregation ( =0). The square box pl represents this restriction. (We would als were restricted to the subset of individu Individuals with low platelet aggregat risk of heart disease. Now take one of the the individual was treated ( =1)or that he has a lower than average risk be In fact, because aspirin use affects hea aggregation, learning an individual s trea additional information to predict his risk individuals with =0,treatment and The square box around B in Figure 6.5 indicates we are conditioning on B BIOS Graphical Representation

17 6.3 Causal diagrams and conditional independence If we assume aspirin use affects heart disease risk only through platelet aggregation, learning an individuals treatment status does not contribute any additional information to predict his risk of heart disease That is, B, even though and are marginally associated Indeed, graph theory states that a box placed around variable B blocks the flow of association through the path B BIOS Graphical Representation

18 causal diagrams (those in which all possible arrows are present), it is often said that information about as- 6.3 Causal diagrams sociations and is inconditional the missing arrows. independence Now consider Figure 6.3 where L Here carrying a lighter is associated with lung cancer because of L cigarette smoking Suppose we restrict study to non-smokers L = 0 Figure 6.6 L Figure 6.6 Blocking the flow of association between treatment and outcome through the common cause is the graph-based justification to use stratification as a method to achieve exchangeability. risk of heart disease. Now take one of th the individual was treated ( =1)or that he has a lower than average risk b In fact, because aspirin use affects hea aggregation, learning an individual s tre additional information to predict his risk individuals with =0,treatment an same informal argument can be made for Even though and are marginally a independent (unassociated) given be same in the treated and the untreated 1 = ] =Pr[ =1 =0 = ] graph theory states that a box placed association through the path Let us now return to Figure 6.3. We carrying a lighter was associated wi the path was open to the question we ask now is whether is ass Conditioning on L blocks the flow of association between and new question is represented by the box investigator restricts the study to nonsm that an individual carries a lighter ( lung cancer ( =1) because the entire on the fact that people carrying lighter argument is irrelevant when the study generally, to people who smoke with a and are marginally associated, and because the risk of lung cancer is the within levels of : Pr[ =1 =1 =. Thatis, q. Graphically, we sa and is interrupted because the pat around. Finally, consider Figure 6.4 again. that having the haplotype was inde because the path between and BIOS Graphical Representation

19 6.3 Causal diagrams and conditional independence That is, L even though and are marginally associated Eg, once we know you are a non-smoker, whether you are carrying a lighter tells us nothing about your risk of lung cancer Blocking the flow of association between treatment and outcome through the common cause is the graph-based justification to use stratification as a method to achieve exchangeability BIOS Graphical Representation

20 6.3 Causal diagrams and conditional independence Recall in Figure 6.4 we had L where haplotype, cigarette smoker, and L heart disease Suppose we condition on L Figure 6.7 Graphical representation of causal effects Figure 6.7 Figure 6.8 L L C disease ( =1). The square around in conditioning on a particular value of. disease lacks haplotype provides some in because, in the absence of, it is more as is present. That is, among people w smokers is increased among those without are inversely associated conditionally o a mistake if he concludes that has a ca are associated within levels of. In th causes of, then among people with hear would perfectly predict the presence of t indeed conditioning on a collider like o was blocked when the collider was not c two variables (the causes) are associated inthefuture(theireffect), but two cause associated once we stratify on the commo s another example, the causal diagram 6.7 a diuretic medication whose use is a disease. and are also associated withi effect of and. Graph theory shows affected by a collider also opens the pat Consider extreme scenario where and are only known causes of L Suppose L = 1, i.e., See thechapter person8 has for more heart ondisease. associations due to conditioning on com- Suppose also = 0, i.e., they lack themon haplotype effects. that increases susceptbility to heart disease. Then it is must be that the other cause of heart disease is present, i.e., = 1. I.e., L BIOS Graphical Representation

21 6.3 Causal diagrams and conditional independence Graph theory shows that indeed conditioning on a collider like L opens the path L, which was blocked when the collider was not conditioned on Intuitively, whether two variables (the causes) are associated cannot be influenced by an event in the future (their effect), but two causes of a given effect generally become associated once we stratify on the common effect Eg given Lionel has red hair, then if braham does not have red hair it is more likely that olanda does have red hair BIOS Graphical Representation

22 See Chapter 8 for more on associations due to conditioning on common effects. 6.3 Causal diagrams and conditional independence similar example is given in Figure 6.8 Figure 6.8 L C The mathematical theory underlying the graphical rules is known as d-separation (Pearl 1995). See Fine Point 6.3. Figure 6.9 S are associated within levels of. In th causes of, then among people with hea would perfectly predict the presence of t indeed conditioning on a collider like was blocked when the collider was not c two variables (the causes) are associate inthefuture(theireffect), but two caus associated once we stratify on the commo s another example, the causal diagra 6.7 a diuretic medication whose use is disease. and are also associated with effect of and. Graph theory show affected by a collider also opens the pa in the absence of conditioning on either t This and the previous section review t ables may be associated: one causes the they share a common effect and the analy common effect. long the way we introdu can be applied to any causal diagram to (conditionally) independent. The argume cal rules were heuristic and relied on our however, have been formalized and mathe for a systematic summary of the graphica There is another possible source of as we have not discussed yet: chance or rand reasons for an association between two v other, shared common causes, conditioni ability results in chance associations that study population increases. To focus our discussion on structural ciations,wecontinuetoassumeuntilchap every individual in a very large (perhaps Suppose C is a medicine prescribed as a consequence of heart disease L Then conditional on C, there will be an association between and Graph theory shows that conditioning on a variable C affected by a collider L also opens the path L whereas this path is blocked (i.e., there is no association between L and ) in the absence of conditioning on either the collider L or its descendant C BIOS Graphical Representation

23 Example (R): Conditioning on descendant of a collider > set.seed(1) > n < > a <- runif(n) > y <- runif(n) > l <- (a-y)^2 > c <- rbinom(n,1,l) > cor.test(a,y) p-value = > cor.test(a[c==0],y[c==0]) p-value < 2.2e-16 > cor.test(a[c==1],y[c==1]) p-value < 2.2e-16 BIOS Graphical Representation

24 6.3 Causal diagrams and conditional independence These examples have demonstrated heuristically three structural reasons why two variables may be associated 1. One causes the other 2. They share common causes 3. They share a commone effect and the analysis is restricted to a certain level of that common effect The formal underpinnings of these heuristic arguments are based on the notion of d-separation BIOS Graphical Representation

25 Fine Point 6.3: d-separation Key result: Suppose we have a DG G and a distribution over its nodes where each varibale is independent of its non-descendants conditional on its parents (Markov factorization) If two variables (e.g., and ) are d-separated given some other variable (e.g., L), then then the two variables are conditionally independent given the third L Notion extends to sets of variables { 1, 2...,}, { 1, 2,...} and {L 1,L 2,...} What is d-separation? First we need to define paths as open or blocked BIOS Graphical Representation

26 Fine Point 6.3: d-separation path is open or closed according to the following four rules 1. If there are no variables being coniditioned on, a path is blocked Graphical 74 representation of causal effects iff it contains a collider Eg, L is open where L is blocked disease associated ( =1). because The aspirin square has around a causa conditioning we obtain anon additional a particular piecevalue of inform of because it reduces platelet aggregati affected by a collider also opens the p was inindividuals the blocked absence with when of conditioning the =0,treatment collider on waseither not a Figure 6.7 disease lacks haplotype provides some because, into the in causal the absence diagramof of, Figure it is mo 6.5 B as high, is0: present. low) as That a mediator is, among the peopl effec smokers Onceis aincreased third variable among is introduced those witho Graphical representation question: is there an association betwe Figure 6.5 of causal effects are inversely associated conditionally aon) mistake? Or, if he equivalently: concludes that when we has alre a See Chapter 8 for more on associations due to conditioning on common mation are associated about within improve levels our of ability. Int disease causes suppose of ( data, =1). then were The among collected square people on around with,, he effects. conditioning would and that perfectly weon restrict apredict particular the the analysis presence valuetoofth o L disease indeed aggregation lacks conditioning haplotype ( =0). onthe a provides collider square like some box Figure 6.7 Because no conditional independences because, was represents blocked in this the when absence restriction. the collider of, (We itwas would is mor not as two were variables isrestricted present. (the to That the causes) is, subset among areofassocia individ people are expected in complete causal diagrams (those in which all possible arrows are present), it is oftenchapter smokers inthefuture(theireffect), Individuals is increased withamong low platelet those but two aggreg witho cau associated risk areof inversely heart once disease. associated we stratify Nowconditionally take the onecomm of t athe mistake s individual another if example, concludes was treated the that causal ( =1)o has diag a said that8 information more onabout associ- as- 6.7 that are a diuretic he associated has amedication lower within than levels average whose of. risk use In it sociations See due toisconditioning in the missing L onarrows. common C causes disease. In fact, of, because and then among are aspirin alsopeople associated use affects with wi he h effects. Figure 6.8 would effect aggregation, perfectly of and learning predict. Graph anthe individual s presence theory sho of tr indeed additional conditioning information on ato collider predict his likeris 2. ny path which contains a non-colliders L that which been conditioned on is blocked, eg, as in Figure collider that has been conditioned on does not block a path 4. collider that has a descendent that has been conditioned on does not block a path BIOS Graphical Representation

27 Fine Point descendants. 6.3: d-separation In summary: path is blocked iff (i) it contains a noncolllider that has been conditioned on or (ii) it contains a collider which has not been conditioned on and has no descendenant that have been conditioned on Two variables are d-separated if all paths between them are blocked Otherwise the two variables are d-connected Consider Figure 6-1 ( ). We will adopt the notational convention that if, is not an an causal DG is a DG in which 1) the lack of an arrow from node to of a direct causal effect of on (relative to the other variables on the g unmeasured, of any pair of variables on the graph are themselves on the graph Causal DGs are of no practical use unless we make an assumption linki the DG to the data obtained in a study. This assumption, referred to as the c conditional on its direct causes, a variable is independent of any variable conditional on its parents, is independent of its non-descendants. This latter to the statement that the density ( ) of the variables in DG satisfies t L Figure 6.1 re and d-separated or d-connected conditional on L? ( ) = =1 ( ). effect not mediated through any other va individual, or that we are unwilling to a do not exist. lternatively, the lack of a willingtoassume,that has no direct ca the population. For example, in Figure 6 either we know that disease severity affec transplant or that we are not willing to diagram does not distinguish whether an a protective effect. Furthermore, if, as i two causes, the diagram does not encode Causal diagrams like the one in Figu graphs, which is commonly abbreviated edges imply a direction: because the arro, but not the other way around. cy variable cannot cause itself, either direct Directed acyclic graphs have applicat we focus on causal directed acyclic graph is causal if the common causes of any p BIOS Graphical Representation

28 Consider Figure 6-6 Fine Point 6.3: d-separation L Figure 6.6 re and d-separated or d-connected conditional on L? Blocking the flow of association between treatment and outcome through the common cause is the graph-based justification to use stratification as a method to achieve exchangeability. Even though and are marginally a independent (unassociated) given be same in the treated and the untreated 1 = ] =Pr[ =1 =0 = ] graph theory states that a box placed association through the path Let us now return to Figure 6.3. We carrying a lighter was associated wi the path was open to the question we ask now is whether is ass new question is represented by the box investigator restricts the study to nonsm that an individual carries a lighter ( lung cancer ( =1) because the entire on the fact that people carrying lighter argument is irrelevant when the study generally, to people who smoke with a and are marginally associated, and because the risk of lung cancer is the within levels of : Pr[ =1 =1 =. Thatis, q. Graphically, we sa and is interrupted because the pat around. Finally, consider Figure 6.4 again. that having the haplotype was inde because the path between and collider. We now argue heuristicall conditionally associated within levels of the investigators, who are interested in on smoking status, restricted the stu So what can we conclude about the association between and? BIOS Graphical Representation

29 Fine Point 6.2: Faithfulness bsence of an arrow from to indicates that the sharp null hypothesis of no causal effect of on any individual s holds rrow from to indicates has causal effect on of at least one individual in the population Suppose has a positive effect on for half the population =1 and a negative effect for the other half of the population such that the overall effect of on is zero Then the DG Figure 6.1. In this DG, =3and we can choose 1 =, 2 =, and 3 ( ). We will adopt the notational convention that if, is not an an causal DG is a DG in which 1) the lack of an arrow from node to of a direct causal effect of on (relative to the other variables on the g unmeasured, of any pair of variables on the graph are themselves on the graph descendants. Causal DGs are of no practical use unless we make an assumption linki the DG to the data obtained in a study. This assumption, referred to as the c conditional on its direct causes, a variable is independent of any variable conditional on its parents, is independent of its non-descendants. This latter to the statement that the density ( ) of the variables in DG satisfies t L Figure 6.1 ( ) = ( ). effect not mediated through any other va individual, or that we are unwilling to a do not exist. lternatively, the lack of a willingtoassume,that has no direct ca the population. For example, in Figure 6 either we know that disease severity affec transplant or that we are not willing to diagram does not distinguish whether an is correct in the sense that has a causal effect on for at least one individual in the population, and yet the average causal effect is zero a protective effect. Furthermore, if, as i two causes, the diagram does not encode Causal diagrams like the one in Figu graphs, which is commonly abbreviated edges imply a direction: because the arro, but not the other way around. cy variable cannot cause itself, either direct Directed acyclic graphs have applicat BIOS Graphical Representation we focus on causal directed acyclic graph

30 Fine Point 6.2: Faithfulness Faithfulness is the assumption that this does not occur I.e., that if there is an arrow from to, i.e., there is an effect of on for at least one individual, then there is an average effect of on This assumption might be reasonable if we think perfect canelleations like on the previous slide are unlikely There are instances however where faithfullness is violated by design BIOS Graphical Representation

31 Figure 6.8 Fine Point 6.2: Faithfulness disease. and are also associated with effect of and. Graph theory show affected by a collider also opens the pa in the absence of conditioning on either This and the previous section review ables may be associated: one causes the they share a common effect and the analy Consider example described in 4.5 where we match on L in order The mathematical theory underlyingeffect the graphical of on rules is known to estimate the causal as d-separation (Pearl 1995). Let S = 1 indicate selection Fine Point into 6.3. the matched cohort, S = 0 otherwise L Figure 6.9 S See common effect. long the way we introd can be applied to any causal diagram to (conditionally) independent. The argum cal rules were heuristic and relied on ou however, have been formalized and math for a systematic summary of the graphic There is another possible source of a we have not discussed yet: chance or ran reasons for an association between two other, shared common causes, condition ability results in chance associations tha study population increases. To focus our discussion on structural ciations,wecontinuetoassumeuntilcha ccording to d-separation rules, there are two open paths between and L when conditioning on S, namely L and L S every individual in a very large (perhaps Thus one would expect L and to be associated conditional on S 6.4 Graphs, counterfactuals, and interventions However, matching ensures L and are not associated. Why? Unfaithfulness, i.e., perfect cancellation of assocations Pearl (2009) reviews quantitative methods for causal inference that are derived from graph theory. Causal diagrams encode qualitative expe the causal structure of a problem and h biases. Though causal diagrams are a us a causal inference problem, quantitativ causal effects. The identification formula in Chapter 2 can also be derived using our choice of counterfactual theory in Ch BIOS Graphical Representation

32 6.4 Graphs, counterfactuals, and interventions Causal diagrams encode qualitative expert knowledge, or assumptions, about the causal structure of a problem and hence about the causal determinant of biases Though causal diagrams are a useful tool to think conceptually about a causal inference problem, quantitative approaches are needed to draw inference causal effects Variables sufficient for d-separation between exposure and outcome can be used in conjunction with standardization and inverse probability weighting for such inference BIOS Graphical Representation

33 6.5 structural classification of bias Structural bias: Bias due to structural reasons There is structural bias whenever the causal effect measure and the corresponding association measure are not equal, e.g., E[ a=1 ] E[ a=0 ] E[ = 1] E[ = 0] bove is unconditional bias; can define conditional bias (given L) analogously Under faithfulness, presence of conditional bias implies unconditional bias Without the faithfulness assumption, positive bias in one stratum of L could cancel the negative bias in another stratum BIOS Graphical Representation

34 6.5 structural classification of bias Three forms of structural bias 1. Confounding: Treatment and outcome share a common cause ( 7) 2. Selection bias: Conditioning on common effects ( 8) 3. Measurement error: Bias induced by mis-measuring, L, and/or ( 9) The next three chapters will utilize DGs to examine these structural biases BIOS Graphical Representation

35 6.6 The structure of effect modification Causal DGs are helpful in helping determine potential sources of structural bias DGs are less helpful 6.6 in depicting The structure effectof modification effect modification E.g., consider Figure 6.10 M Figure 6.10 where M is quality of care, is heart transplant, is survival like that. The remaining chapters of Pa fuzzy boundary between experimenting a three chapters we turn our attention to t three classes of biases: bias due to the to the selection of individuals, and bias Before that, we take a brief detour to des of effect modification. Identifying potential sources of bias is a use our causal expert knowledge to draw association between treatment and outco to illustrate the concept of effect modific Suppose heart transplant wasran identify the average causal effect of on that there is no bias, and thus Figure Computing the effect of on the risk association is causation, the association Pr [ =1 =0] can be interpreted as 1] Pr[ =0 =1]. The investigators, how suspect that the causal effect of heart tran care offered in each hospital participating This DG is correct regardless of whether M is an effect modifier N Note if we were constructing a DG to assess the effect of on, we would not need to M include M on the DG b/c M is not a Figure 6.11 common cause of and BIOS Graphical Representation

Graphical Representation of Causal Effects. November 10, 2016

Graphical Representation of Causal Effects. November 10, 2016 Graphical Representation of Causal Effects November 10, 2016 Lord s Paradox: Observed Data Units: Students; Covariates: Sex, September Weight; Potential Outcomes: June Weight under Treatment and Control;

More information

Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence

Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence General overview Introduction Directed acyclic graphs (DAGs) and conditional independence DAGs and causal effects

More information

Introduction to Causal Calculus

Introduction to Causal Calculus Introduction to Causal Calculus Sanna Tyrväinen University of British Columbia August 1, 2017 1 / 1 2 / 1 Bayesian network Bayesian networks are Directed Acyclic Graphs (DAGs) whose nodes represent random

More information

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017 Causal Inference Miguel A. Hernán, James M. Robins May 19, 2017 ii Causal Inference Part III Causal inference from complex longitudinal data Chapter 19 TIME-VARYING TREATMENTS So far this book has dealt

More information

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causality II: How does causal inference fit into public health and what it is the role of statistics? Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS 776 1 15 Outcome regressions and propensity scores Outcome Regression and Propensity Scores ( 15) Outline 15.1 Outcome regression 15.2 Propensity

More information

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS 776 1 14 G-Estimation ( G-Estimation of Structural Nested Models 14) Outline 14.1 The causal question revisited 14.2 Exchangeability revisited

More information

Bounding the Probability of Causation in Mediation Analysis

Bounding the Probability of Causation in Mediation Analysis arxiv:1411.2636v1 [math.st] 10 Nov 2014 Bounding the Probability of Causation in Mediation Analysis A. P. Dawid R. Murtas M. Musio February 16, 2018 Abstract Given empirical evidence for the dependence

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS 776 1 14 G-Estimation G-Estimation of Structural Nested Models ( 14) Outline 14.1 The causal question revisited 14.2 Exchangeability revisited

More information

6.3 How the Associational Criterion Fails

6.3 How the Associational Criterion Fails 6.3. HOW THE ASSOCIATIONAL CRITERION FAILS 271 is randomized. We recall that this probability can be calculated from a causal model M either directly, by simulating the intervention do( = x), or (if P

More information

MINIMAL SUFFICIENT CAUSATION AND DIRECTED ACYCLIC GRAPHS. By Tyler J. VanderWeele and James M. Robins University of Chicago and Harvard University

MINIMAL SUFFICIENT CAUSATION AND DIRECTED ACYCLIC GRAPHS. By Tyler J. VanderWeele and James M. Robins University of Chicago and Harvard University Submitted to the Annals of Statistics MINIMAL SUFFICIENT CAUSATION AND DIRECTED ACYCLIC GRAPHS By Tyler J. VanderWeele and James M. Robins University of Chicago and Harvard University Notions of minimal

More information

Causal Inference & Reasoning with Causal Bayesian Networks

Causal Inference & Reasoning with Causal Bayesian Networks Causal Inference & Reasoning with Causal Bayesian Networks Neyman-Rubin Framework Potential Outcome Framework: for each unit k and each treatment i, there is a potential outcome on an attribute U, U ik,

More information

Causal Directed Acyclic Graphs

Causal Directed Acyclic Graphs Causal Directed Acyclic Graphs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Causal DAGs Stat186/Gov2002 Fall 2018 1 / 15 Elements of DAGs (Pearl. 2000.

More information

Help! Statistics! Mediation Analysis

Help! Statistics! Mediation Analysis Help! Statistics! Lunch time lectures Help! Statistics! Mediation Analysis What? Frequently used statistical methods and questions in a manageable timeframe for all researchers at the UMCG. No knowledge

More information

Directed Graphical Models

Directed Graphical Models CS 2750: Machine Learning Directed Graphical Models Prof. Adriana Kovashka University of Pittsburgh March 28, 2017 Graphical Models If no assumption of independence is made, must estimate an exponential

More information

A Decision Theoretic Approach to Causality

A Decision Theoretic Approach to Causality A Decision Theoretic Approach to Causality Vanessa Didelez School of Mathematics University of Bristol (based on joint work with Philip Dawid) Bordeaux, June 2011 Based on: Dawid & Didelez (2010). Identifying

More information

Job Training Partnership Act (JTPA)

Job Training Partnership Act (JTPA) Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training

More information

1. what conditional independencies are implied by the graph. 2. whether these independecies correspond to the probability distribution

1. what conditional independencies are implied by the graph. 2. whether these independecies correspond to the probability distribution NETWORK ANALYSIS Lourens Waldorp PROBABILITY AND GRAPHS The objective is to obtain a correspondence between the intuitive pictures (graphs) of variables of interest and the probability distributions of

More information

Motivation. Bayesian Networks in Epistemology and Philosophy of Science Lecture. Overview. Organizational Issues

Motivation. Bayesian Networks in Epistemology and Philosophy of Science Lecture. Overview. Organizational Issues Bayesian Networks in Epistemology and Philosophy of Science Lecture 1: Bayesian Networks Center for Logic and Philosophy of Science Tilburg University, The Netherlands Formal Epistemology Course Northern

More information

Arrowhead completeness from minimal conditional independencies

Arrowhead completeness from minimal conditional independencies Arrowhead completeness from minimal conditional independencies Tom Claassen, Tom Heskes Radboud University Nijmegen The Netherlands {tomc,tomh}@cs.ru.nl Abstract We present two inference rules, based on

More information

Intelligent Systems: Reasoning and Recognition. Reasoning with Bayesian Networks

Intelligent Systems: Reasoning and Recognition. Reasoning with Bayesian Networks Intelligent Systems: Reasoning and Recognition James L. Crowley ENSIMAG 2 / MoSIG M1 Second Semester 2016/2017 Lesson 13 24 march 2017 Reasoning with Bayesian Networks Naïve Bayesian Systems...2 Example

More information

Causal Models. Macartan Humphreys. September 4, Abstract Notes for G8412. Some background on DAGs and questions on an argument.

Causal Models. Macartan Humphreys. September 4, Abstract Notes for G8412. Some background on DAGs and questions on an argument. Causal Models Macartan Humphreys September 4, 2018 Abstract Notes for G8412. Some background on DAGs and questions on an argument. 1 A note on DAGs DAGs directed acyclic graphs are diagrams used to represent

More information

Part IV Statistics in Epidemiology

Part IV Statistics in Epidemiology Part IV Statistics in Epidemiology There are many good statistical textbooks on the market, and we refer readers to some of these textbooks when they need statistical techniques to analyze data or to interpret

More information

What Causality Is (stats for mathematicians)

What Causality Is (stats for mathematicians) What Causality Is (stats for mathematicians) Andrew Critch UC Berkeley August 31, 2011 Introduction Foreword: The value of examples With any hard question, it helps to start with simple, concrete versions

More information

Machine Learning Summer School

Machine Learning Summer School Machine Learning Summer School Lecture 1: Introduction to Graphical Models Zoubin Ghahramani zoubin@eng.cam.ac.uk http://learning.eng.cam.ac.uk/zoubin/ epartment of ngineering University of ambridge, UK

More information

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla. OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) Statistical vs. Causal vs. Counterfactual inference: syntax and semantics

More information

MINIMAL SUFFICIENT CAUSATION AND DIRECTED ACYCLIC GRAPHS 1. By Tyler J. VanderWeele and James M. Robins. University of Chicago and Harvard University

MINIMAL SUFFICIENT CAUSATION AND DIRECTED ACYCLIC GRAPHS 1. By Tyler J. VanderWeele and James M. Robins. University of Chicago and Harvard University MINIMAL SUFFICIENT CAUSATION AND DIRECTED ACYCLIC GRAPHS 1 By Tyler J. VanderWeele and James M. Robins University of Chicago and Harvard University Summary. Notions of minimal su cient causation are incorporated

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Counterfactual Reasoning in Algorithmic Fairness

Counterfactual Reasoning in Algorithmic Fairness Counterfactual Reasoning in Algorithmic Fairness Ricardo Silva University College London and The Alan Turing Institute Joint work with Matt Kusner (Warwick/Turing), Chris Russell (Sussex/Turing), and Joshua

More information

Causal Discovery Methods Using Causal Probabilistic Networks

Causal Discovery Methods Using Causal Probabilistic Networks ausal iscovery Methods Using ausal Probabilistic Networks MINFO 2004, T02: Machine Learning Methods for ecision Support and iscovery onstantin F. liferis & Ioannis Tsamardinos iscovery Systems Laboratory

More information

Challenges (& Some Solutions) and Making Connections

Challenges (& Some Solutions) and Making Connections Challenges (& Some Solutions) and Making Connections Real-life Search All search algorithm theorems have form: If the world behaves like, then (probability 1) the algorithm will recover the true structure

More information

Learning in Bayesian Networks

Learning in Bayesian Networks Learning in Bayesian Networks Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Berlin: 20.06.2002 1 Overview 1. Bayesian Networks Stochastic Networks

More information

Causal Discovery by Computer

Causal Discovery by Computer Causal Discovery by Computer Clark Glymour Carnegie Mellon University 1 Outline 1. A century of mistakes about causation and discovery: 1. Fisher 2. Yule 3. Spearman/Thurstone 2. Search for causes is statistical

More information

Probabilistic Causal Models

Probabilistic Causal Models Probabilistic Causal Models A Short Introduction Robin J. Evans www.stat.washington.edu/ rje42 ACMS Seminar, University of Washington 24th February 2011 1/26 Acknowledgements This work is joint with Thomas

More information

Conditional probabilities and graphical models

Conditional probabilities and graphical models Conditional probabilities and graphical models Thomas Mailund Bioinformatics Research Centre (BiRC), Aarhus University Probability theory allows us to describe uncertainty in the processes we model within

More information

CMPT Machine Learning. Bayesian Learning Lecture Scribe for Week 4 Jan 30th & Feb 4th

CMPT Machine Learning. Bayesian Learning Lecture Scribe for Week 4 Jan 30th & Feb 4th CMPT 882 - Machine Learning Bayesian Learning Lecture Scribe for Week 4 Jan 30th & Feb 4th Stephen Fagan sfagan@sfu.ca Overview: Introduction - Who was Bayes? - Bayesian Statistics Versus Classical Statistics

More information

Automatic Causal Discovery

Automatic Causal Discovery Automatic Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour Dept. of Philosophy & CALD Carnegie Mellon 1 Outline 1. Motivation 2. Representation 3. Discovery 4. Using Regression for Causal

More information

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization

More information

Causal Bayesian networks. Peter Antal

Causal Bayesian networks. Peter Antal Causal Bayesian networks Peter Antal antal@mit.bme.hu A.I. 4/8/2015 1 Can we represent exactly (in)dependencies by a BN? From a causal model? Suff.&nec.? Can we interpret edges as causal relations with

More information

Bayesian Network Representation

Bayesian Network Representation Bayesian Network Representation Sargur Srihari srihari@cedar.buffalo.edu 1 Topics Joint and Conditional Distributions I-Maps I-Map to Factorization Factorization to I-Map Perfect Map Knowledge Engineering

More information

Data, Design, and Background Knowledge in Etiologic Inference

Data, Design, and Background Knowledge in Etiologic Inference Data, Design, and Background Knowledge in Etiologic Inference James M. Robins I use two examples to demonstrate that an appropriate etiologic analysis of an epidemiologic study depends as much on study

More information

Probabilistic Models

Probabilistic Models Bayes Nets 1 Probabilistic Models Models describe how (a portion of) the world works Models are always simplifications May not account for every variable May not account for all interactions between variables

More information

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016 CS 2750: Machine Learning Bayesian Networks Prof. Adriana Kovashka University of Pittsburgh March 14, 2016 Plan for today and next week Today and next time: Bayesian networks (Bishop Sec. 8.1) Conditional

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. David Kaplan Department of Educational Psychology The General Theme

More information

Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges

Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges 1 PRELIMINARIES Two vertices X i and X j are adjacent if there is an edge between them. A path

More information

A counterfactual approach to bias and effect modification in terms of response types

A counterfactual approach to bias and effect modification in terms of response types uzuki et al. BM Medical Research Methodology 2013, 13:101 RARH ARTIL Open Access A counterfactual approach to bias and effect modification in terms of response types tsuji uzuki 1*, Toshiharu Mitsuhashi

More information

Causal Inference. Prediction and causation are very different. Typical questions are:

Causal Inference. Prediction and causation are very different. Typical questions are: Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting

More information

Bayesian networks as causal models. Peter Antal

Bayesian networks as causal models. Peter Antal Bayesian networks as causal models Peter Antal antal@mit.bme.hu A.I. 3/20/2018 1 Can we represent exactly (in)dependencies by a BN? From a causal model? Suff.&nec.? Can we interpret edges as causal relations

More information

Confounding, mediation and colliding

Confounding, mediation and colliding Confounding, mediation and colliding What types of shared covariates does the sibling comparison design control for? Arvid Sjölander and Johan Zetterqvist Causal effects and confounding A common aim of

More information

Causal Diagrams. By Sander Greenland and Judea Pearl

Causal Diagrams. By Sander Greenland and Judea Pearl Wiley StatsRef: Statistics Reference Online, 2014 2017 John Wiley & Sons, Ltd. 1 Previous version in M. Lovric (Ed.), International Encyclopedia of Statistical Science 2011, Part 3, pp. 208-216, DOI: 10.1007/978-3-642-04898-2_162

More information

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Thomas S. Richardson University of Washington James M. Robins Harvard University Working

More information

CS 484 Data Mining. Classification 7. Some slides are from Professor Padhraic Smyth at UC Irvine

CS 484 Data Mining. Classification 7. Some slides are from Professor Padhraic Smyth at UC Irvine CS 484 Data Mining Classification 7 Some slides are from Professor Padhraic Smyth at UC Irvine Bayesian Belief networks Conditional independence assumption of Naïve Bayes classifier is too strong. Allows

More information

Local Probability Models

Local Probability Models Readings: K&F 3.4, 5.~5.5 Local Probability Models Lecture 3 pr 4, 2 SE 55, Statistical Methods, Spring 2 Instructor: Su-In Lee University of Washington, Seattle Outline Last time onditional parameterization

More information

Naïve Bayes Classifiers

Naïve Bayes Classifiers Naïve Bayes Classifiers Example: PlayTennis (6.9.1) Given a new instance, e.g. (Outlook = sunny, Temperature = cool, Humidity = high, Wind = strong ), we want to compute the most likely hypothesis: v NB

More information

arxiv: v2 [math.st] 4 Mar 2013

arxiv: v2 [math.st] 4 Mar 2013 Running head:: LONGITUDINAL MEDIATION ANALYSIS 1 arxiv:1205.0241v2 [math.st] 4 Mar 2013 Counterfactual Graphical Models for Longitudinal Mediation Analysis with Unobserved Confounding Ilya Shpitser School

More information

Learning causal network structure from multiple (in)dependence models

Learning causal network structure from multiple (in)dependence models Learning causal network structure from multiple (in)dependence models Tom Claassen Radboud University, Nijmegen tomc@cs.ru.nl Abstract Tom Heskes Radboud University, Nijmegen tomh@cs.ru.nl We tackle the

More information

Recall from last time. Lecture 3: Conditional independence and graph structure. Example: A Bayesian (belief) network.

Recall from last time. Lecture 3: Conditional independence and graph structure. Example: A Bayesian (belief) network. ecall from last time Lecture 3: onditional independence and graph structure onditional independencies implied by a belief network Independence maps (I-maps) Factorization theorem The Bayes ball algorithm

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Comment on `Causal inference, probability theory, and graphical insights' (by Stuart G. Baker) Permalink https://escholarship.org/uc/item/9863n6gg Journal Statistics

More information

Lecture 5: Bayesian Network

Lecture 5: Bayesian Network Lecture 5: Bayesian Network Topics of this lecture What is a Bayesian network? A simple example Formal definition of BN A slightly difficult example Learning of BN An example of learning Important topics

More information

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH The First Step: SAMPLE SIZE DETERMINATION THE ULTIMATE GOAL The most important, ultimate step of any of clinical research is to do draw inferences;

More information

CS Lecture 3. More Bayesian Networks

CS Lecture 3. More Bayesian Networks CS 6347 Lecture 3 More Bayesian Networks Recap Last time: Complexity challenges Representing distributions Computing probabilities/doing inference Introduction to Bayesian networks Today: D-separation,

More information

Bayesian Networks Representation and Reasoning

Bayesian Networks Representation and Reasoning ayesian Networks Representation and Reasoning Marco F. Ramoni hildren s Hospital Informatics Program Harvard Medical School (2003) Harvard-MIT ivision of Health Sciences and Technology HST.951J: Medical

More information

CS 188: Artificial Intelligence Fall 2009

CS 188: Artificial Intelligence Fall 2009 CS 188: Artificial Intelligence Fall 2009 Lecture 14: Bayes Nets 10/13/2009 Dan Klein UC Berkeley Announcements Assignments P3 due yesterday W2 due Thursday W1 returned in front (after lecture) Midterm

More information

CAUSAL DIAGRAMS. From their inception in the early 20th century, causal systems models (more commonly known as structural-equations

CAUSAL DIAGRAMS. From their inception in the early 20th century, causal systems models (more commonly known as structural-equations Greenland, S. and Pearl, J. (2007). Article on Causal iagrams. In: Boslaugh, S. (ed.). ncyclopedia of pidemiology. Thousand Oaks, CA: Sage Publications, 149-156. TCHNICAL RPORT R-332 Causal iagrams 149

More information

The rest of the course

The rest of the course The rest of the course Subtleties involved with maximizing expected utility: Finding the right state space: The wrong state space leads to intuitively incorrect answers when conditioning Taking causality

More information

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour, and many others Dept. of Philosophy & CALD Carnegie Mellon Graphical Models --11/30/05 1 Outline 1. Motivation 2. Representation 3. Connecting

More information

Estimation of direct causal effects.

Estimation of direct causal effects. University of California, Berkeley From the SelectedWorks of Maya Petersen May, 2006 Estimation of direct causal effects. Maya L Petersen, University of California, Berkeley Sandra E Sinisi Mark J van

More information

CS 5522: Artificial Intelligence II

CS 5522: Artificial Intelligence II CS 5522: Artificial Intelligence II Bayes Nets Instructor: Alan Ritter Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley. All materials available at http://ai.berkeley.edu.]

More information

Formal Modeling in Cognitive Science

Formal Modeling in Cognitive Science Formal Modeling in Cognitive Science Lecture 9: Application of Bayes Theorem; Discrete Random Variables; Steve Renals (notes by Frank Keller) School of Informatics University of Edinburgh s.renals@ed.ac.uk

More information

Formal Modeling in Cognitive Science Lecture 19: Application of Bayes Theorem; Discrete Random Variables; Distributions. Background.

Formal Modeling in Cognitive Science Lecture 19: Application of Bayes Theorem; Discrete Random Variables; Distributions. Background. Formal Modeling in Cognitive Science Lecture 9: ; Discrete Random Variables; Steve Renals (notes by Frank Keller) School of Informatics University of Edinburgh s.renals@ed.ac.uk February 7 Probability

More information

Sensitivity analysis and distributional assumptions

Sensitivity analysis and distributional assumptions Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu

More information

Introduction to Artificial Intelligence. Unit # 11

Introduction to Artificial Intelligence. Unit # 11 Introduction to Artificial Intelligence Unit # 11 1 Course Outline Overview of Artificial Intelligence State Space Representation Search Techniques Machine Learning Logic Probabilistic Reasoning/Bayesian

More information

The identification of synergism in the sufficient-component cause framework

The identification of synergism in the sufficient-component cause framework * Title Page Original Article The identification of synergism in the sufficient-component cause framework Tyler J. VanderWeele Department of Health Studies, University of Chicago James M. Robins Departments

More information

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty Lecture 10: Introduction to reasoning under uncertainty Introduction to reasoning under uncertainty Review of probability Axioms and inference Conditional probability Probability distributions COMP-424,

More information

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Proceedings of Machine Learning Research vol 73:21-32, 2017 AMBN 2017 Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Jose M. Peña Linköping University Linköping (Sweden) jose.m.pena@liu.se

More information

On Causal Explanations of Quantum Correlations

On Causal Explanations of Quantum Correlations On Causal Explanations of Quantum Correlations Rob Spekkens Perimeter Institute From XKCD comics Quantum Information and Foundations of Quantum Mechanics workshop Vancouver, July 2, 2013 Simpson s Paradox

More information

Probabilistic Models. Models describe how (a portion of) the world works

Probabilistic Models. Models describe how (a portion of) the world works Probabilistic Models Models describe how (a portion of) the world works Models are always simplifications May not account for every variable May not account for all interactions between variables All models

More information

Generalized Do-Calculus with Testable Causal Assumptions

Generalized Do-Calculus with Testable Causal Assumptions eneralized Do-Calculus with Testable Causal Assumptions Jiji Zhang Division of Humanities and Social Sciences California nstitute of Technology Pasadena CA 91125 jiji@hss.caltech.edu Abstract A primary

More information

A Brief Introduction to Graphical Models. Presenter: Yijuan Lu November 12,2004

A Brief Introduction to Graphical Models. Presenter: Yijuan Lu November 12,2004 A Brief Introduction to Graphical Models Presenter: Yijuan Lu November 12,2004 References Introduction to Graphical Models, Kevin Murphy, Technical Report, May 2001 Learning in Graphical Models, Michael

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 4 Problems with small populations 9 II. Why Random Sampling is Important 10 A myth,

More information

I. Partial identification of causal parameters with multiple proposed instruments

I. Partial identification of causal parameters with multiple proposed instruments Supplemental Materials Table of contents of Supplemental Materials: I. Partial identification of causal parameters with multiple proposed instruments II. Data example III. R code I. Partial identification

More information

CS 343: Artificial Intelligence

CS 343: Artificial Intelligence CS 343: Artificial Intelligence Bayes Nets Prof. Scott Niekum The University of Texas at Austin [These slides based on those of Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188

More information

Undirected Graphical Models

Undirected Graphical Models Undirected Graphical Models 1 Conditional Independence Graphs Let G = (V, E) be an undirected graph with vertex set V and edge set E, and let A, B, and C be subsets of vertices. We say that C separates

More information

Conditional Independence

Conditional Independence Conditional Independence Sargur Srihari srihari@cedar.buffalo.edu 1 Conditional Independence Topics 1. What is Conditional Independence? Factorization of probability distribution into marginals 2. Why

More information

Building Bayesian Networks. Lecture3: Building BN p.1

Building Bayesian Networks. Lecture3: Building BN p.1 Building Bayesian Networks Lecture3: Building BN p.1 The focus today... Problem solving by Bayesian networks Designing Bayesian networks Qualitative part (structure) Quantitative part (probability assessment)

More information

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah Introduction to Graphical Models Srikumar Ramalingam School of Computing University of Utah Reference Christopher M. Bishop, Pattern Recognition and Machine Learning, Jonathan S. Yedidia, William T. Freeman,

More information

CSE 312, 2017 Winter, W.L.Ruzzo. 5. independence [ ]

CSE 312, 2017 Winter, W.L.Ruzzo. 5. independence [ ] CSE 312, 2017 Winter, W.L.Ruzzo 5. independence [ ] independence Defn: Two events E and F are independent if P(EF) = P(E) P(F) If P(F)>0, this is equivalent to: P(E F) = P(E) (proof below) Otherwise, they

More information

Axioms of Probability? Notation. Bayesian Networks. Bayesian Networks. Today we ll introduce Bayesian Networks.

Axioms of Probability? Notation. Bayesian Networks. Bayesian Networks. Today we ll introduce Bayesian Networks. Bayesian Networks Today we ll introduce Bayesian Networks. This material is covered in chapters 13 and 14. Chapter 13 gives basic background on probability and Chapter 14 talks about Bayesian Networks.

More information

Bayesian Networks for Causal Analysis

Bayesian Networks for Causal Analysis Paper 2776-2018 Bayesian Networks for Causal Analysis Fei Wang and John Amrhein, McDougall Scientific Ltd. ABSTRACT Bayesian Networks (BN) are a type of graphical model that represent relationships between

More information

CS 188: Artificial Intelligence Fall 2008

CS 188: Artificial Intelligence Fall 2008 CS 188: Artificial Intelligence Fall 2008 Lecture 14: Bayes Nets 10/14/2008 Dan Klein UC Berkeley 1 1 Announcements Midterm 10/21! One page note sheet Review sessions Friday and Sunday (similar) OHs on

More information

Decision Tree Learning

Decision Tree Learning Decision Tree Learning Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University References: 1. Machine Learning, Chapter 3 2. Data Mining: Concepts, Models,

More information

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling A.P.Dawid 1 and S.Geneletti 2 1 University of Cambridge, Statistical Laboratory 2 Imperial College Department

More information

Research Design: Causal inference and counterfactuals

Research Design: Causal inference and counterfactuals Research Design: Causal inference and counterfactuals University College Dublin 8 March 2013 1 2 3 4 Outline 1 2 3 4 Inference In regression analysis we look at the relationship between (a set of) independent

More information

The Doubly-Modifiable Structural Model (DMSM) The Modifiable Structural Model (MSM)

The Doubly-Modifiable Structural Model (DMSM) The Modifiable Structural Model (MSM) unified theory of counterfactual reasoning hristopher G. Lucas cglucas@cmu.edu Department of Psychology arnegie Mellon University harles Kemp ckemp@cmu.edu Department of Psychology arnegie Mellon University

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 6 (MWF) Conditional probabilities and associations Suhasini Subba Rao Review of previous lecture

More information

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population Online supplement Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in Breathlessness in the General Population Table S1. Comparison between patients who were excluded or included

More information