Equivalence in Non-Recursive Structural Equation Models

Size: px
Start display at page:

Download "Equivalence in Non-Recursive Structural Equation Models"

Transcription

1 Equivalence in Non-Recursive Structural Equation Models Thomas Richardson 1 Philosophy Department, Carnegie-Mellon University Pittsburgh, P 15213, US thomas.richardson@andrew.cmu.edu Introduction In the last decade, there has been considerable progress in understanding a certain class of statistical models, known as directed acyclic graph (DG) models, which encode independence, and conditional independence constraints. (See Pearl, 1988). This research has had fruitful results in many areas: there is now a relatively clear causal interpretation of these models, there are efficient procedures for determining the statistical indistinguishability of DG s, reliable algorithms for generating a class of DG models from sample data and background knowledge, etc. Two important elements in these investigations were: First, a purely graphical condition for calculating the conditional independence relations entailed by a DG. Second, a local characterization of equivalence between two graphs, in the sense that all of the same conditional independencies are entailed by each graph. Such a local characterization was essential in allowing the construction of efficient algorithms which could search the whole class of DG models and to find those which fitted the given data. 1 Non-Recursive Structural Equations and Cyclic Graphs 1.1 The limitations of DG models The DG formalism is very general: gamut of more familiar constructs such as recursive linear structural equation models with independent errors, regression models, factor analytic models, path models, and discrete latent variable models can be represented as DG models. However, as you might suspect from the name, directed acyclic graphs, DG models do exclude a kind of model familiar in engineering, economics, and the social sciences: those in which variable influences variable, and at the same time, influences. Economic and physical processes are often modelled by linear systems of this sort; so-called non-recursive structural equation models. nother use for such systems is modelling time series in which feedback is present, as well as structures in which causal influences propagate in different directions in certain subpopulations. 1 I thank P. Spirtes, C. Glymour, R. Scheines and C. Meek for helpful conversations. Research for this paper was supported by the Office of Naval Research through contract number N

2 This paper will give a survey of some recent research aimed at filling this gap; developing a theory of cyclic graphical models, that would allow the generalization of acyclic techniques and methods to the cyclic case. 1.2 Linear Structural Equation Models (linear SEMs) In an SEM, the variables are divided into two disjoint sets, the error terms, and the nonerror terms. ssociated to each non-error variable V there is a unique error term ε V. linear SEM contains a set of equations in which each non-error random variable V is written as a linear function of other non-error random variables and ε V. linear SEM also specifies a joint distribution over the error terms. In our discussion we will consider only linear SEMs with error terms that are jointly independent, but as we shall see, in an important sense, at least within the context of our discussion, nothing is lost by this restriction. The following is an example of such a model: X=ε X Y=ε Y = α 1 X + α 2 + ε = β 1 Y + β 2 + ε The ε V s are jointly independent standard normal error terms. structural equation model in which, for some ordering of the variables, the matrix of coefficients is in lower triangular form, is said to be recursive. 1.3 Graphs There is a directed graph, naturally associated with a given linear SEM, by the rule that there is an edge from X to Y (X Y) if and only if the coefficient of X, in the equation for Y, is non-zero. y convention we do not include error terms in the graph. Hence the graph relating to the model above is (here the error terms are omitted, being assumed jointly independent): X Y linear SEM with a jointly independent distribution over the error terms constitutes a parameterization of its associated graph. It is easy to see that the linear SEM associated with an acyclic graph will be a recursive structural equation model. 1.4 Linear Entailment directed graph containing disjoint sets of variables X, Y, and Z, 2 linearly entails that X is independent of Y given Z if and only if X is independent of Y given Z for all values 2 We use bold face letters (X) to denote sets of variables. -2-

3 of the non-zero linear coefficients and all distributions of the exogenous variables in which they have positive variances and are jointly independent. It is important to note that in any particular SEM with directed graph, there may be conditional independencies which hold even though they are not linearly entailed by. However, if a zero-correlation holds for some but not all parameterizations of, then the set of parameterizations in which this extra conditional independence holds, is of zero Lesbesgue measure over the set of all parameter value assignments to the non-zero linear coefficents. 1.5 Conditional Independencies and Equivalence in a graph. In an acyclic graph, there is a graphical path condition which holds between disjoint vertex sets X, Y and Z in the graph if and only if linearly entails that X Y Z. 3 Similarly, the same graphical path condition holds between X, Y and a set Z, not containing X or Y, if and only if in the partial correlation between X and Y controlling for Z, vanishes: ρ XY.Z = 0. We can calculate the partial correlations that are zero in all linear parametizations of in which X and Y have correlated errors in the following way. First, form a directed graph in which X and Y are the effects of a latent common cause T. The same graphical path condition holds in iff in every parameter assignment to in which X and Y have correlated errors, ρ XY.Z = 0. This observation is central to the usefulness of the graphical method. The task of generalizing this result to the cyclic case has already been accomplished: uilding on the work of Haavelmo(1943), Spirtes (1993) showed that the same graphical condition, the Geiger-Pearl-Verma d-separation criterion (defined in the ppendix) which determines whether a particular conditional independence relation or zero partial correlation is linearly entailed by a recursive structural equation model, can also be used with linear non-recursive models. Or equivalently, that the same technique used for reading conditional independencies from an acyclic graph can be applied in the cyclic case. We say that two graphs are equivalent if they both linearly entail the same set of conditional independencies. It is important to be clear what we are establishing when we work out the conditional independencies entailed by a given model: we are calculating the conditional independence consequences of having a certain form of linear equations, i.e. having linear equations in which certain coefficients are zero. We are not trying to estimate parameters, we are not making any distinction between latent and measured variables, and we are not constructing a model from data; though the development of efficient procedures for determining the equivalence of cyclic models will facilitate the construction of computer aids for model specification and updating. 3 X Y Z means that X is independent of Y given Z. -3-

4 2 Partial Results about Equivalence for Cyclic Graphs lthough there is an O(n 3 ) algorithm which can determine when two acyclic graphs are equivalent in our sense, a feasible algorithm which will establish this equivalence for cyclic graphs has not yet been obtained. Several preliminary results indicate that there is considerable heterogeneity in the equivalence class - much more than in the acyclic case. 2.1 Contrast with the cyclic case In the acyclic case, if and are dependent conditional on any subset of the other variables, then either, or, i.e. either is a direct cause of or is a direct cause of. However, this implication fails in the cyclic case. 4 For example: There are no independencies, conditional or otherwise entailed by this graph, and yet there is no edge between and C. This marks a significant difference between the cyclic and acyclic equivalence classes. C 2.2 Non-locality property In the acyclic case, if two graphs are not equivalent then there will be some conditional independence between variables separated by at most two edges, entailed by one graph, and not by the other. This means that we need only look at the structure of triples of adjacent vertices in order to establish that two graphs are equivalent. This is not true for the cyclic case, as the following example shows: 1 X X X X X X X X and 2 are not equivalent. lthough every conditional independence implied by 2, is also implied by 1, in 1,, while in 2, \. ut and are separated by more than two edges (in both graphs). It is simple to see that we could extend these graphs so that they continued to entail the same conditional independencies, with the exception of, while and were separated by arbitrarily many edges. This result, however leaves open whether a polynomial time algorithm exists for deciding equivalence. 4 similar point is made in Whittaker (1989). -4-

5 2.3 Criteria for detecting feedback: We give a set of conditions which are sufficient, though not necessary conditions for cyclicity. The following four conditional independence conditions can only be linearly entailed (simultaneously) by a cyclic graph. For some triple of vertices, the following hold: (i) For any set S, not containing or, \ S (ii) For any set S, not containing or C, \ C S (iii) There is a set T, with T and C T (iv) There is a set T *, with T * and C T *. s an illustration of this criterion, the following model, presented by Whittaker (1989), is not equivalent to any acyclic graph: X Here <,X,> and <,Y,> satisfy the conditions given above. Y 2.4 The Orientation of cycles Our last and possibly most interesting result says that given a graph with a cycle, there is an equivalent graph, in which is replaced by another cycle, having the opposite orientation to. Thus if is clockwise, is anti-clockwise, and vice-versa. C C E E 1 D 2 D In the above example, in 1, the cycle <C,,D,E> has anti-clockwise orientation, while in the equivalent graph 2, the corresponding cycle <C,,D,E> has clockwise orientation. One important consequence of this result is that it is not possible to orient a cycle merely using conditional independence information. ppendix Definition of d-separation: If there is an arrow from to or from to is called an edge between and. Given three vertices, and C such that there is an edge between and, and between and C, then if the edges collide at, (i.e. C) then we say is a -5-

6 collider between and C, relative to these edges. Otherwise we will say that is a noncollider between and C, relative to these edges. e.g. in the following cases is a noncollider: C, C, Β C. If there is an arrow from to ( ), then we say that is a parent of, and is a child of. We define the descendant relation as the transitive reflexive closure of child, and similarly, ancestor as the transitive reflexive closure of parent. sequence of distinct edges <E 1,,E n > in is an undirected path if and only if there exists a sequences of vertices <V 1, V n+1 > such that for 1 i n either <V i+1,v i >=E i or <V i,v i+1 >=E i. We are now in a position to define d-connection: For disjoint sets, X, Y and Z, X is d-connected to Y given Z if for some X X, and Y Y, there is a path from X to Y, satisfying the following conditions: (i) if, and C are adjacent vertices on the path, and Ζ, then is a collider between and C. (ii) If is a collider between and C, then there is a descendant D, of, and D Ζ. If X and Y are not d-connected given Z then X and Y are said to be d-separated by Z. The following important theorems gives the relationship between d-separation and the linear entailment of conditional independencies, and partial correlations. It was proved for the cyclic case by Spirtes (1993). Theorem (Spirtes): In a (cyclic or acyclic) graph, for disjoint sets, X, Y, Z, X and Y are d-separated given Z, if and only if linearly entails X Y Z. Theorem (Spirtes): In a (cyclic or acyclic) graph, for any set Z, not containing X or Y, X and Y are d-separated given Z, if and only if linearly entails ρ XY.Z =

7 REFERENCES: COX, D.R., and WERMUTH, N. (1993) Linear dependencies represented by chain graphs. In Statistical Science, 1993, 8 No.3, GEIGER, D. (1990). Graphoids: a qualitative framework for probabilistic inference. PhD dissertation, Univ. California, Los ngeles. HVELMO,T.(1943). The statistical implications of a system of simultaneous equations. Econometrica, 11, HEISE D.(1975). Causal nalysis. Wiley, New York. KIIVERI, H. and SPEED, T.P. (1982). Structural nalysis of multivariate data: review. In Sociological Methodology, 1982 (S. Leinhardt, ed.) Jossey ass, San Francisco. PERL, J. (1988) Probabilistic Reasoning in Intelligent Systems. Morgan Kaufman, San Mateo, C. SPIRTES, P. (1993) Directed Cyclic graphs, Conditional Independence and Non- Recursive Linear Structural Equation Models. Philosophy, Methodology and Logic Technical Report 35, Carnegie Mellon University SPIRTES, P., GLYMOUR C. and SCHEINES R. (1993), Causation, Prediction and Search. Lecture Notes in Statistics, Springer-Verlag. WHITTKER, J. (1989) Graphical Models in pplied Multivariate Statistics, Wiley. -7-

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables N-1 Experiments Suffice to Determine the Causal Relations Among N Variables Frederick Eberhardt Clark Glymour 1 Richard Scheines Carnegie Mellon University Abstract By combining experimental interventions

More information

Causal Inference in the Presence of Latent Variables and Selection Bias

Causal Inference in the Presence of Latent Variables and Selection Bias 499 Causal Inference in the Presence of Latent Variables and Selection Bias Peter Spirtes, Christopher Meek, and Thomas Richardson Department of Philosophy Carnegie Mellon University Pittsburgh, P 15213

More information

Causality in Econometrics (3)

Causality in Econometrics (3) Graphical Causal Models References Causality in Econometrics (3) Alessio Moneta Max Planck Institute of Economics Jena moneta@econ.mpg.de 26 April 2011 GSBC Lecture Friedrich-Schiller-Universität Jena

More information

Learning Multivariate Regression Chain Graphs under Faithfulness

Learning Multivariate Regression Chain Graphs under Faithfulness Sixth European Workshop on Probabilistic Graphical Models, Granada, Spain, 2012 Learning Multivariate Regression Chain Graphs under Faithfulness Dag Sonntag ADIT, IDA, Linköping University, Sweden dag.sonntag@liu.se

More information

Faithfulness of Probability Distributions and Graphs

Faithfulness of Probability Distributions and Graphs Journal of Machine Learning Research 18 (2017) 1-29 Submitted 5/17; Revised 11/17; Published 12/17 Faithfulness of Probability Distributions and Graphs Kayvan Sadeghi Statistical Laboratory University

More information

Automatic Causal Discovery

Automatic Causal Discovery Automatic Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour Dept. of Philosophy & CALD Carnegie Mellon 1 Outline 1. Motivation 2. Representation 3. Discovery 4. Using Regression for Causal

More information

The TETRAD Project: Constraint Based Aids to Causal Model Specification

The TETRAD Project: Constraint Based Aids to Causal Model Specification The TETRAD Project: Constraint Based Aids to Causal Model Specification Richard Scheines, Peter Spirtes, Clark Glymour, Christopher Meek and Thomas Richardson Department of Philosophy, Carnegie Mellon

More information

Measurement Error and Causal Discovery

Measurement Error and Causal Discovery Measurement Error and Causal Discovery Richard Scheines & Joseph Ramsey Department of Philosophy Carnegie Mellon University Pittsburgh, PA 15217, USA 1 Introduction Algorithms for causal discovery emerged

More information

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour, and many others Dept. of Philosophy & CALD Carnegie Mellon Graphical Models --11/30/05 1 Outline 1. Motivation 2. Representation 3. Connecting

More information

Learning Marginal AMP Chain Graphs under Faithfulness

Learning Marginal AMP Chain Graphs under Faithfulness Learning Marginal AMP Chain Graphs under Faithfulness Jose M. Peña ADIT, IDA, Linköping University, SE-58183 Linköping, Sweden jose.m.pena@liu.se Abstract. Marginal AMP chain graphs are a recently introduced

More information

Using Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs

Using Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs Using Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs Hei Chan Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo 190-8562, Japan

More information

Arrowhead completeness from minimal conditional independencies

Arrowhead completeness from minimal conditional independencies Arrowhead completeness from minimal conditional independencies Tom Claassen, Tom Heskes Radboud University Nijmegen The Netherlands {tomc,tomh}@cs.ru.nl Abstract We present two inference rules, based on

More information

Learning causal network structure from multiple (in)dependence models

Learning causal network structure from multiple (in)dependence models Learning causal network structure from multiple (in)dependence models Tom Claassen Radboud University, Nijmegen tomc@cs.ru.nl Abstract Tom Heskes Radboud University, Nijmegen tomh@cs.ru.nl We tackle the

More information

Inferring the Causal Decomposition under the Presence of Deterministic Relations.

Inferring the Causal Decomposition under the Presence of Deterministic Relations. Inferring the Causal Decomposition under the Presence of Deterministic Relations. Jan Lemeire 1,2, Stijn Meganck 1,2, Francesco Cartella 1, Tingting Liu 1 and Alexander Statnikov 3 1-ETRO Department, Vrije

More information

Causal Inference on Data Containing Deterministic Relations.

Causal Inference on Data Containing Deterministic Relations. Causal Inference on Data Containing Deterministic Relations. Jan Lemeire Kris Steenhaut Sam Maes COMO lab, ETRO Department Vrije Universiteit Brussel, Belgium {jan.lemeire, kris.steenhaut}@vub.ac.be, sammaes@gmail.com

More information

Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence

Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence General overview Introduction Directed acyclic graphs (DAGs) and conditional independence DAGs and causal effects

More information

Carnegie Mellon Pittsburgh, Pennsylvania 15213

Carnegie Mellon Pittsburgh, Pennsylvania 15213 A Tutorial On Causal Inference Peter Spirtes August 4, 2009 Technical Report No. CMU-PHIL-183 Philosophy Methodology Logic Carnegie Mellon Pittsburgh, Pennsylvania 15213 1. Introduction A Tutorial On Causal

More information

On the Identification of a Class of Linear Models

On the Identification of a Class of Linear Models On the Identification of a Class of Linear Models Jin Tian Department of Computer Science Iowa State University Ames, IA 50011 jtian@cs.iastate.edu Abstract This paper deals with the problem of identifying

More information

Respecting Markov Equivalence in Computing Posterior Probabilities of Causal Graphical Features

Respecting Markov Equivalence in Computing Posterior Probabilities of Causal Graphical Features Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10) Respecting Markov Equivalence in Computing Posterior Probabilities of Causal Graphical Features Eun Yong Kang Department

More information

Identifiability assumptions for directed graphical models with feedback

Identifiability assumptions for directed graphical models with feedback Biometrika, pp. 1 26 C 212 Biometrika Trust Printed in Great Britain Identifiability assumptions for directed graphical models with feedback BY GUNWOONG PARK Department of Statistics, University of Wisconsin-Madison,

More information

Identifying Linear Causal Effects

Identifying Linear Causal Effects Identifying Linear Causal Effects Jin Tian Department of Computer Science Iowa State University Ames, IA 50011 jtian@cs.iastate.edu Abstract This paper concerns the assessment of linear cause-effect relationships

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is an author's version which may differ from the publisher's version. For additional information about this

More information

Generalized Measurement Models

Generalized Measurement Models Generalized Measurement Models Ricardo Silva and Richard Scheines Center for Automated Learning and Discovery rbas@cs.cmu.edu, scheines@andrew.cmu.edu May 16, 2005 CMU-CALD-04-101 School of Computer Science

More information

A Relational Knowledge System

A Relational Knowledge System Relational Knowledge System by Shik Kam Michael Wong, Cory James utz, and Dan Wu Technical Report CS-01-04 January 2001 Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S

More information

Learning Semi-Markovian Causal Models using Experiments

Learning Semi-Markovian Causal Models using Experiments Learning Semi-Markovian Causal Models using Experiments Stijn Meganck 1, Sam Maes 2, Philippe Leray 2 and Bernard Manderick 1 1 CoMo Vrije Universiteit Brussel Brussels, Belgium 2 LITIS INSA Rouen St.

More information

CS Lecture 3. More Bayesian Networks

CS Lecture 3. More Bayesian Networks CS 6347 Lecture 3 More Bayesian Networks Recap Last time: Complexity challenges Representing distributions Computing probabilities/doing inference Introduction to Bayesian networks Today: D-separation,

More information

Learning the Structure of Linear Latent Variable Models

Learning the Structure of Linear Latent Variable Models Journal (2005) - Submitted /; Published / Learning the Structure of Linear Latent Variable Models Ricardo Silva Center for Automated Learning and Discovery School of Computer Science rbas@cs.cmu.edu Richard

More information

Motivation. Bayesian Networks in Epistemology and Philosophy of Science Lecture. Overview. Organizational Issues

Motivation. Bayesian Networks in Epistemology and Philosophy of Science Lecture. Overview. Organizational Issues Bayesian Networks in Epistemology and Philosophy of Science Lecture 1: Bayesian Networks Center for Logic and Philosophy of Science Tilburg University, The Netherlands Formal Epistemology Course Northern

More information

Augmented Bayesian Networks for Representing Information Equivalences and Extensions to the PC Learning Algorithm.

Augmented Bayesian Networks for Representing Information Equivalences and Extensions to the PC Learning Algorithm. Augmented Bayesian Networks for Representing Information Equivalences and Extensions to the PC Learning Algorithm. Jan Lemeire, Sam Maes, Erik Dirkx, Kris Steenhaut, Ann Nowé August 8, 2007 Abstract Data

More information

Confounding Equivalence in Causal Inference

Confounding Equivalence in Causal Inference In Proceedings of UAI, 2010. In P. Grunwald and P. Spirtes, editors, Proceedings of UAI, 433--441. AUAI, Corvallis, OR, 2010. TECHNICAL REPORT R-343 July 2010 Confounding Equivalence in Causal Inference

More information

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Proceedings of Machine Learning Research vol 73:21-32, 2017 AMBN 2017 Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Jose M. Peña Linköping University Linköping (Sweden) jose.m.pena@liu.se

More information

1. what conditional independencies are implied by the graph. 2. whether these independecies correspond to the probability distribution

1. what conditional independencies are implied by the graph. 2. whether these independecies correspond to the probability distribution NETWORK ANALYSIS Lourens Waldorp PROBABILITY AND GRAPHS The objective is to obtain a correspondence between the intuitive pictures (graphs) of variables of interest and the probability distributions of

More information

Generalized Do-Calculus with Testable Causal Assumptions

Generalized Do-Calculus with Testable Causal Assumptions eneralized Do-Calculus with Testable Causal Assumptions Jiji Zhang Division of Humanities and Social Sciences California nstitute of Technology Pasadena CA 91125 jiji@hss.caltech.edu Abstract A primary

More information

Markov properties for mixed graphs

Markov properties for mixed graphs Bernoulli 20(2), 2014, 676 696 DOI: 10.3150/12-BEJ502 Markov properties for mixed graphs KAYVAN SADEGHI 1 and STEFFEN LAURITZEN 2 1 Department of Statistics, Baker Hall, Carnegie Mellon University, Pittsburgh,

More information

Introduction to Probabilistic Graphical Models

Introduction to Probabilistic Graphical Models Introduction to Probabilistic Graphical Models Kyu-Baek Hwang and Byoung-Tak Zhang Biointelligence Lab School of Computer Science and Engineering Seoul National University Seoul 151-742 Korea E-mail: kbhwang@bi.snu.ac.kr

More information

The Role of Assumptions in Causal Discovery

The Role of Assumptions in Causal Discovery The Role of Assumptions in Causal Discovery Marek J. Druzdzel Decision Systems Laboratory, School of Information Sciences and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA 15260,

More information

Identification and Overidentification of Linear Structural Equation Models

Identification and Overidentification of Linear Structural Equation Models In D. D. Lee and M. Sugiyama and U. V. Luxburg and I. Guyon and R. Garnett (Eds.), Advances in Neural Information Processing Systems 29, pre-preceedings 2016. TECHNICAL REPORT R-444 October 2016 Identification

More information

Detecting marginal and conditional independencies between events and learning their causal structure.

Detecting marginal and conditional independencies between events and learning their causal structure. Detecting marginal and conditional independencies between events and learning their causal structure. Jan Lemeire 1,4, Stijn Meganck 1,3, Albrecht Zimmermann 2, and Thomas Dhollander 3 1 ETRO Department,

More information

Lecture 4 October 18th

Lecture 4 October 18th Directed and undirected graphical models Fall 2017 Lecture 4 October 18th Lecturer: Guillaume Obozinski Scribe: In this lecture, we will assume that all random variables are discrete, to keep notations

More information

Pairwise Cluster Comparison for Learning Latent Variable Models

Pairwise Cluster Comparison for Learning Latent Variable Models Pairwise Cluster Comparison for Learning Latent Variable Models Nuaman Asbeh Industrial Engineering and Management Ben-Gurion University of the Negev Beer-Sheva, 84105, Israel Boaz Lerner Industrial Engineering

More information

Identification of Conditional Interventional Distributions

Identification of Conditional Interventional Distributions Identification of Conditional Interventional Distributions Ilya Shpitser and Judea Pearl Cognitive Systems Laboratory Department of Computer Science University of California, Los Angeles Los Angeles, CA.

More information

Directed acyclic graphs and the use of linear mixed models

Directed acyclic graphs and the use of linear mixed models Directed acyclic graphs and the use of linear mixed models Siem H. Heisterkamp 1,2 1 Groningen Bioinformatics Centre, University of Groningen 2 Biostatistics and Research Decision Sciences (BARDS), MSD,

More information

ε X ε Y Z 1 f 2 g 2 Z 2 f 1 g 1

ε X ε Y Z 1 f 2 g 2 Z 2 f 1 g 1 Semi-Instrumental Variables: A Test for Instrument Admissibility Tianjiao Chu tchu@andrew.cmu.edu Richard Scheines Philosophy Dept. and CALD scheines@cmu.edu Peter Spirtes ps7z@andrew.cmu.edu Abstract

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Comment on `Causal inference, probability theory, and graphical insights' (by Stuart G. Baker) Permalink https://escholarship.org/uc/item/9863n6gg Journal Statistics

More information

Università degli Studi di Modena e Reggio Emilia. On Block Ordering of Variables in Graphical Modelling. Alberto Roverato Luca La Rocca

Università degli Studi di Modena e Reggio Emilia. On Block Ordering of Variables in Graphical Modelling. Alberto Roverato Luca La Rocca Università degli Studi di Modena e Reggio Emilia On Block Ordering of Variables in Graphical Modelling Alberto Roverato Luca La Rocca R 29-04 ottobre 2004 RESEARCH REPORT Dipartimento di Scienze Sociali,

More information

COMP538: Introduction to Bayesian Networks

COMP538: Introduction to Bayesian Networks COMP538: Introduction to Bayesian Networks Lecture 9: Optimal Structure Learning Nevin L. Zhang lzhang@cse.ust.hk Department of Computer Science and Engineering Hong Kong University of Science and Technology

More information

Instrumental Sets. Carlos Brito. 1 Introduction. 2 The Identification Problem

Instrumental Sets. Carlos Brito. 1 Introduction. 2 The Identification Problem 17 Instrumental Sets Carlos Brito 1 Introduction The research of Judea Pearl in the area of causality has been very much acclaimed. Here we highlight his contributions for the use of graphical languages

More information

Belief Update in CLG Bayesian Networks With Lazy Propagation

Belief Update in CLG Bayesian Networks With Lazy Propagation Belief Update in CLG Bayesian Networks With Lazy Propagation Anders L Madsen HUGIN Expert A/S Gasværksvej 5 9000 Aalborg, Denmark Anders.L.Madsen@hugin.com Abstract In recent years Bayesian networks (BNs)

More information

Causal Reasoning with Ancestral Graphs

Causal Reasoning with Ancestral Graphs Journal of Machine Learning Research 9 (2008) 1437-1474 Submitted 6/07; Revised 2/08; Published 7/08 Causal Reasoning with Ancestral Graphs Jiji Zhang Division of the Humanities and Social Sciences California

More information

Learning in Bayesian Networks

Learning in Bayesian Networks Learning in Bayesian Networks Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Berlin: 20.06.2002 1 Overview 1. Bayesian Networks Stochastic Networks

More information

Exchangeability and Invariance: A Causal Theory. Jiji Zhang. (Very Preliminary Draft) 1. Motivation: Lindley-Novick s Puzzle

Exchangeability and Invariance: A Causal Theory. Jiji Zhang. (Very Preliminary Draft) 1. Motivation: Lindley-Novick s Puzzle Exchangeability and Invariance: A Causal Theory Jiji Zhang (Very Preliminary Draft) 1. Motivation: Lindley-Novick s Puzzle In their seminal paper on the role of exchangeability in statistical inference,

More information

Interpreting and using CPDAGs with background knowledge

Interpreting and using CPDAGs with background knowledge Interpreting and using CPDAGs with background knowledge Emilija Perković Seminar for Statistics ETH Zurich, Switzerland perkovic@stat.math.ethz.ch Markus Kalisch Seminar for Statistics ETH Zurich, Switzerland

More information

Separable and Transitive Graphoids

Separable and Transitive Graphoids Separable and Transitive Graphoids Dan Geiger Northrop Research and Technology Center One Research Park Palos Verdes, CA 90274 David Heckerman Medical Computer Science Group Knowledge Systems Laboratory

More information

Local Characterizations of Causal Bayesian Networks

Local Characterizations of Causal Bayesian Networks In M. Croitoru, S. Rudolph, N. Wilson, J. Howse, and O. Corby (Eds.), GKR 2011, LNAI 7205, Berlin Heidelberg: Springer-Verlag, pp. 1-17, 2012. TECHNICAL REPORT R-384 May 2011 Local Characterizations of

More information

Introduction to Causal Calculus

Introduction to Causal Calculus Introduction to Causal Calculus Sanna Tyrväinen University of British Columbia August 1, 2017 1 / 1 2 / 1 Bayesian network Bayesian networks are Directed Acyclic Graphs (DAGs) whose nodes represent random

More information

Part I Qualitative Probabilistic Networks

Part I Qualitative Probabilistic Networks Part I Qualitative Probabilistic Networks In which we study enhancements of the framework of qualitative probabilistic networks. Qualitative probabilistic networks allow for studying the reasoning behaviour

More information

Monotone DAG Faithfulness: A Bad Assumption

Monotone DAG Faithfulness: A Bad Assumption Monotone DAG Faithfulness: A Bad Assumption David Maxwell Chickering Christopher Meek Technical Report MSR-TR-2003-16 Microsoft Research Microsoft Corporation One Microsoft Way Redmond, WA 98052 Abstract

More information

The Specification of Causal Models with Tetrad IV: A Review

The Specification of Causal Models with Tetrad IV: A Review Structural Equation Modeling, 17:703 711, 2010 Copyright Taylor & Francis Group, LLC ISSN: 1070-5511 print/1532-8007 online DOI: 10.1080/10705511.2010.510074 SOFTWARE REVIEW The Specification of Causal

More information

Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges

Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges 1 PRELIMINARIES Two vertices X i and X j are adjacent if there is an edge between them. A path

More information

Iterative Conditional Fitting for Gaussian Ancestral Graph Models

Iterative Conditional Fitting for Gaussian Ancestral Graph Models Iterative Conditional Fitting for Gaussian Ancestral Graph Models Mathias Drton Department of Statistics University of Washington Seattle, WA 98195-322 Thomas S. Richardson Department of Statistics University

More information

6.867 Machine learning, lecture 23 (Jaakkola)

6.867 Machine learning, lecture 23 (Jaakkola) Lecture topics: Markov Random Fields Probabilistic inference Markov Random Fields We will briefly go over undirected graphical models or Markov Random Fields (MRFs) as they will be needed in the context

More information

Chris Bishop s PRML Ch. 8: Graphical Models

Chris Bishop s PRML Ch. 8: Graphical Models Chris Bishop s PRML Ch. 8: Graphical Models January 24, 2008 Introduction Visualize the structure of a probabilistic model Design and motivate new models Insights into the model s properties, in particular

More information

1 : Introduction. 1 Course Overview. 2 Notation. 3 Representing Multivariate Distributions : Probabilistic Graphical Models , Spring 2014

1 : Introduction. 1 Course Overview. 2 Notation. 3 Representing Multivariate Distributions : Probabilistic Graphical Models , Spring 2014 10-708: Probabilistic Graphical Models 10-708, Spring 2014 1 : Introduction Lecturer: Eric P. Xing Scribes: Daniel Silva and Calvin McCarter 1 Course Overview In this lecture we introduce the concept of

More information

Testable Implications of Linear Structural Equation Models

Testable Implications of Linear Structural Equation Models Testable Implications of Linear Structural Equation Models Bryant Chen University of California, Los Angeles Computer Science Department Los Angeles, CA, 90095-1596, USA bryantc@cs.ucla.edu Jin Tian Iowa

More information

CAUSAL MODELS: THE MEANINGFUL INFORMATION OF PROBABILITY DISTRIBUTIONS

CAUSAL MODELS: THE MEANINGFUL INFORMATION OF PROBABILITY DISTRIBUTIONS CAUSAL MODELS: THE MEANINGFUL INFORMATION OF PROBABILITY DISTRIBUTIONS Jan Lemeire, Erik Dirkx ETRO Dept., Vrije Universiteit Brussel Pleinlaan 2, 1050 Brussels, Belgium jan.lemeire@vub.ac.be ABSTRACT

More information

Towards an extension of the PC algorithm to local context-specific independencies detection

Towards an extension of the PC algorithm to local context-specific independencies detection Towards an extension of the PC algorithm to local context-specific independencies detection Feb-09-2016 Outline Background: Bayesian Networks The PC algorithm Context-specific independence: from DAGs to

More information

Testable Implications of Linear Structural Equation Models

Testable Implications of Linear Structural Equation Models Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Palo Alto, CA: AAAI Press, 2424--2430, 2014. TECHNICAL REPORT R-428 July 2014 Testable Implications of Linear Structural Equation

More information

Estimation of linear non-gaussian acyclic models for latent factors

Estimation of linear non-gaussian acyclic models for latent factors Estimation of linear non-gaussian acyclic models for latent factors Shohei Shimizu a Patrik O. Hoyer b Aapo Hyvärinen b,c a The Institute of Scientific and Industrial Research, Osaka University Mihogaoka

More information

On Markov Properties in Evidence Theory

On Markov Properties in Evidence Theory On Markov Properties in Evidence Theory 131 On Markov Properties in Evidence Theory Jiřina Vejnarová Institute of Information Theory and Automation of the ASCR & University of Economics, Prague vejnar@utia.cas.cz

More information

Markov properties for directed graphs

Markov properties for directed graphs Graphical Models, Lecture 7, Michaelmas Term 2009 November 2, 2009 Definitions Structural relations among Markov properties Factorization G = (V, E) simple undirected graph; σ Say σ satisfies (P) the pairwise

More information

Learning Measurement Models for Unobserved Variables

Learning Measurement Models for Unobserved Variables UAI2003 SILVA ET AL. 543 Learning Measurement Models for Unobserved Variables Ricardo Silva, Richard Scheines, Clark Glymour and Peter Spirtes* Center for Automated Learning and Discovery Carnegie Mellon

More information

Markov properties for graphical time series models

Markov properties for graphical time series models Markov properties for graphical time series models Michael Eichler Universität Heidelberg Abstract This paper deals with the Markov properties of a new class of graphical time series models which focus

More information

Acyclic Linear SEMs Obey the Nested Markov Property

Acyclic Linear SEMs Obey the Nested Markov Property Acyclic Linear SEMs Obey the Nested Markov Property Ilya Shpitser Department of Computer Science Johns Hopkins University ilyas@cs.jhu.edu Robin J. Evans Department of Statistics University of Oxford evans@stats.ox.ac.uk

More information

Causal Models with Hidden Variables

Causal Models with Hidden Variables Causal Models with Hidden Variables Robin J. Evans www.stats.ox.ac.uk/ evans Department of Statistics, University of Oxford Quantum Networks, Oxford August 2017 1 / 44 Correlation does not imply causation

More information

Causal Diagrams. By Sander Greenland and Judea Pearl

Causal Diagrams. By Sander Greenland and Judea Pearl Wiley StatsRef: Statistics Reference Online, 2014 2017 John Wiley & Sons, Ltd. 1 Previous version in M. Lovric (Ed.), International Encyclopedia of Statistical Science 2011, Part 3, pp. 208-216, DOI: 10.1007/978-3-642-04898-2_162

More information

Causal Inference for High-Dimensional Data. Atlantic Causal Conference

Causal Inference for High-Dimensional Data. Atlantic Causal Conference Causal Inference for High-Dimensional Data Atlantic Causal Conference Overview Conditional Independence Directed Acyclic Graph (DAG) Models factorization and d-separation Markov equivalence Structure Learning

More information

MULTIVARIATE STATISTICAL ANALYSIS. Nanny Wermuth 1

MULTIVARIATE STATISTICAL ANALYSIS. Nanny Wermuth 1 MULTIVARIATE STATISTICAL ANALYSIS Nanny Wermuth 1 Professor of Statistics, Department of Mathematical Sciences Chalmers Technical University/University of Gothenburg, Sweden Classical multivariate statistical

More information

Causal Directed Acyclic Graphs

Causal Directed Acyclic Graphs Causal Directed Acyclic Graphs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Causal DAGs Stat186/Gov2002 Fall 2018 1 / 15 Elements of DAGs (Pearl. 2000.

More information

Identifiability in Causal Bayesian Networks: A Sound and Complete Algorithm

Identifiability in Causal Bayesian Networks: A Sound and Complete Algorithm Identifiability in Causal Bayesian Networks: A Sound and Complete Algorithm Yimin Huang and Marco Valtorta {huang6,mgv}@cse.sc.edu Department of Computer Science and Engineering University of South Carolina

More information

Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach

Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach Stijn Meganck 1, Philippe Leray 2, and Bernard Manderick 1 1 Vrije Universiteit Brussel, Pleinlaan 2,

More information

Graphical models, causal inference, and econometric models

Graphical models, causal inference, and econometric models Journal of Economic Methodology 12:1, 1 33 March 2005 Graphical models, causal inference, and econometric models Peter Spirtes Abstract A graphical model is a graph that represents a set of conditional

More information

Rapid Introduction to Machine Learning/ Deep Learning

Rapid Introduction to Machine Learning/ Deep Learning Rapid Introduction to Machine Learning/ Deep Learning Hyeong In Choi Seoul National University 1/32 Lecture 5a Bayesian network April 14, 2016 2/32 Table of contents 1 1. Objectives of Lecture 5a 2 2.Bayesian

More information

Tutorial: Causal Model Search

Tutorial: Causal Model Search Tutorial: Causal Model Search Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Joe Ramsey, others 1 Goals 1) Convey rudiments of graphical causal models 2) Basic working knowledge

More information

Lars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), Institute BW/WI & Institute for Computer Science, University of Hildesheim

Lars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), Institute BW/WI & Institute for Computer Science, University of Hildesheim Course on Bayesian Networks, summer term 2010 0/42 Bayesian Networks Bayesian Networks 11. Structure Learning / Constrained-based Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL)

More information

Issues on the Estimation of Latent Variable and Latent Class Models with Social Science Applications

Issues on the Estimation of Latent Variable and Latent Class Models with Social Science Applications UNIVERSITÀ DEGLI STUDI DI FIRENZE DIPARTIMENTO DI STATISTICA G. PARENTI DOTTORATO IN STATISTICA APPLICATA XVI CICLO Issues on the Estimation of Latent Variable and Latent Class Models with Social Science

More information

CAUSAL INFERENCE IN TIME SERIES ANALYSIS. Michael Eichler

CAUSAL INFERENCE IN TIME SERIES ANALYSIS. Michael Eichler CAUSAL INFERENCE IN TIME SERIES ANALYSIS Michael Eichler Department of Quantitative Economics, Maastricht University P.O. Box 616, 6200 MD Maastricht, The Netherlands November 11, 2011 1. ÁÒØÖÓ ÙØ ÓÒ The

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a preprint version which may differ from the publisher's version. For additional information about this

More information

Alternative Markov and Causal Properties for Acyclic Directed Mixed Graphs

Alternative Markov and Causal Properties for Acyclic Directed Mixed Graphs Alternative Markov and Causal Properties for Acyclic Directed Mixed Graphs Jose M. Peña Department of Computer and Information Science Linköping University 58183 Linköping, Sweden Abstract We extend Andersson-Madigan-Perlman

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 7, Issue 1 2011 Article 16 A Complete Graphical Criterion for the Adjustment Formula in Mediation Analysis Ilya Shpitser, Harvard University Tyler J. VanderWeele,

More information

Causal Clustering for 1-Factor Measurement Models

Causal Clustering for 1-Factor Measurement Models Causal Clustering for -Factor Measurement Models Erich Kummerfeld University of Pittsburgh 5607 Baum Boulevard, Suite 500 Pittsburgh, PA 5206 erk57@pitt.edu Joseph Ramsey Carnegie Mellon University 39

More information

Towards Association Rules with Hidden Variables

Towards Association Rules with Hidden Variables Towards Association Rules with Hidden Variables Ricardo Silva and Richard Scheines Gatsby Unit, University College London, WCN AR London, UK Machine Learning Department, Carnegie Mellon, Pittsburgh PA,

More information

Day 3: Search Continued

Day 3: Search Continued Center for Causal Discovery Day 3: Search Continued June 15, 2015 Carnegie Mellon University 1 Outline Models Data 1) Bridge Principles: Markov Axiom and D-separation 2) Model Equivalence 3) Model Search

More information

5.3 Graphs and Identifiability

5.3 Graphs and Identifiability 218CHAPTER 5. CAUSALIT AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS.83 AFFECT.65.23 BEHAVIOR COGNITION Figure 5.5: Untestable model displaying quantitative causal information derived. method elucidates

More information

2 : Directed GMs: Bayesian Networks

2 : Directed GMs: Bayesian Networks 10-708: Probabilistic Graphical Models, Spring 2015 2 : Directed GMs: Bayesian Networks Lecturer: Eric P. Xing Scribes: Yi Cheng, Cong Lu 1 Notation Here the notations used in this course are defined:

More information

What Causality Is (stats for mathematicians)

What Causality Is (stats for mathematicians) What Causality Is (stats for mathematicians) Andrew Critch UC Berkeley August 31, 2011 Introduction Foreword: The value of examples With any hard question, it helps to start with simple, concrete versions

More information

2 : Directed GMs: Bayesian Networks

2 : Directed GMs: Bayesian Networks 10-708: Probabilistic Graphical Models 10-708, Spring 2017 2 : Directed GMs: Bayesian Networks Lecturer: Eric P. Xing Scribes: Jayanth Koushik, Hiroaki Hayashi, Christian Perez Topic: Directed GMs 1 Types

More information

CSC 412 (Lecture 4): Undirected Graphical Models

CSC 412 (Lecture 4): Undirected Graphical Models CSC 412 (Lecture 4): Undirected Graphical Models Raquel Urtasun University of Toronto Feb 2, 2016 R Urtasun (UofT) CSC 412 Feb 2, 2016 1 / 37 Today Undirected Graphical Models: Semantics of the graph:

More information

Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables

Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables Robert E. Tillman Carnegie Mellon University Pittsburgh, PA rtillman@cmu.edu

More information

Financial Econometrics

Financial Econometrics Financial Econometrics Multivariate Time Series Analysis: VAR Gerald P. Dwyer Trinity College, Dublin January 2013 GPD (TCD) VAR 01/13 1 / 25 Structural equations Suppose have simultaneous system for supply

More information

Machine Learning Summer School

Machine Learning Summer School Machine Learning Summer School Lecture 1: Introduction to Graphical Models Zoubin Ghahramani zoubin@eng.cam.ac.uk http://learning.eng.cam.ac.uk/zoubin/ epartment of ngineering University of ambridge, UK

More information