Equivalence in Non-Recursive Structural Equation Models
|
|
- Jody Ball
- 5 years ago
- Views:
Transcription
1 Equivalence in Non-Recursive Structural Equation Models Thomas Richardson 1 Philosophy Department, Carnegie-Mellon University Pittsburgh, P 15213, US thomas.richardson@andrew.cmu.edu Introduction In the last decade, there has been considerable progress in understanding a certain class of statistical models, known as directed acyclic graph (DG) models, which encode independence, and conditional independence constraints. (See Pearl, 1988). This research has had fruitful results in many areas: there is now a relatively clear causal interpretation of these models, there are efficient procedures for determining the statistical indistinguishability of DG s, reliable algorithms for generating a class of DG models from sample data and background knowledge, etc. Two important elements in these investigations were: First, a purely graphical condition for calculating the conditional independence relations entailed by a DG. Second, a local characterization of equivalence between two graphs, in the sense that all of the same conditional independencies are entailed by each graph. Such a local characterization was essential in allowing the construction of efficient algorithms which could search the whole class of DG models and to find those which fitted the given data. 1 Non-Recursive Structural Equations and Cyclic Graphs 1.1 The limitations of DG models The DG formalism is very general: gamut of more familiar constructs such as recursive linear structural equation models with independent errors, regression models, factor analytic models, path models, and discrete latent variable models can be represented as DG models. However, as you might suspect from the name, directed acyclic graphs, DG models do exclude a kind of model familiar in engineering, economics, and the social sciences: those in which variable influences variable, and at the same time, influences. Economic and physical processes are often modelled by linear systems of this sort; so-called non-recursive structural equation models. nother use for such systems is modelling time series in which feedback is present, as well as structures in which causal influences propagate in different directions in certain subpopulations. 1 I thank P. Spirtes, C. Glymour, R. Scheines and C. Meek for helpful conversations. Research for this paper was supported by the Office of Naval Research through contract number N
2 This paper will give a survey of some recent research aimed at filling this gap; developing a theory of cyclic graphical models, that would allow the generalization of acyclic techniques and methods to the cyclic case. 1.2 Linear Structural Equation Models (linear SEMs) In an SEM, the variables are divided into two disjoint sets, the error terms, and the nonerror terms. ssociated to each non-error variable V there is a unique error term ε V. linear SEM contains a set of equations in which each non-error random variable V is written as a linear function of other non-error random variables and ε V. linear SEM also specifies a joint distribution over the error terms. In our discussion we will consider only linear SEMs with error terms that are jointly independent, but as we shall see, in an important sense, at least within the context of our discussion, nothing is lost by this restriction. The following is an example of such a model: X=ε X Y=ε Y = α 1 X + α 2 + ε = β 1 Y + β 2 + ε The ε V s are jointly independent standard normal error terms. structural equation model in which, for some ordering of the variables, the matrix of coefficients is in lower triangular form, is said to be recursive. 1.3 Graphs There is a directed graph, naturally associated with a given linear SEM, by the rule that there is an edge from X to Y (X Y) if and only if the coefficient of X, in the equation for Y, is non-zero. y convention we do not include error terms in the graph. Hence the graph relating to the model above is (here the error terms are omitted, being assumed jointly independent): X Y linear SEM with a jointly independent distribution over the error terms constitutes a parameterization of its associated graph. It is easy to see that the linear SEM associated with an acyclic graph will be a recursive structural equation model. 1.4 Linear Entailment directed graph containing disjoint sets of variables X, Y, and Z, 2 linearly entails that X is independent of Y given Z if and only if X is independent of Y given Z for all values 2 We use bold face letters (X) to denote sets of variables. -2-
3 of the non-zero linear coefficients and all distributions of the exogenous variables in which they have positive variances and are jointly independent. It is important to note that in any particular SEM with directed graph, there may be conditional independencies which hold even though they are not linearly entailed by. However, if a zero-correlation holds for some but not all parameterizations of, then the set of parameterizations in which this extra conditional independence holds, is of zero Lesbesgue measure over the set of all parameter value assignments to the non-zero linear coefficents. 1.5 Conditional Independencies and Equivalence in a graph. In an acyclic graph, there is a graphical path condition which holds between disjoint vertex sets X, Y and Z in the graph if and only if linearly entails that X Y Z. 3 Similarly, the same graphical path condition holds between X, Y and a set Z, not containing X or Y, if and only if in the partial correlation between X and Y controlling for Z, vanishes: ρ XY.Z = 0. We can calculate the partial correlations that are zero in all linear parametizations of in which X and Y have correlated errors in the following way. First, form a directed graph in which X and Y are the effects of a latent common cause T. The same graphical path condition holds in iff in every parameter assignment to in which X and Y have correlated errors, ρ XY.Z = 0. This observation is central to the usefulness of the graphical method. The task of generalizing this result to the cyclic case has already been accomplished: uilding on the work of Haavelmo(1943), Spirtes (1993) showed that the same graphical condition, the Geiger-Pearl-Verma d-separation criterion (defined in the ppendix) which determines whether a particular conditional independence relation or zero partial correlation is linearly entailed by a recursive structural equation model, can also be used with linear non-recursive models. Or equivalently, that the same technique used for reading conditional independencies from an acyclic graph can be applied in the cyclic case. We say that two graphs are equivalent if they both linearly entail the same set of conditional independencies. It is important to be clear what we are establishing when we work out the conditional independencies entailed by a given model: we are calculating the conditional independence consequences of having a certain form of linear equations, i.e. having linear equations in which certain coefficients are zero. We are not trying to estimate parameters, we are not making any distinction between latent and measured variables, and we are not constructing a model from data; though the development of efficient procedures for determining the equivalence of cyclic models will facilitate the construction of computer aids for model specification and updating. 3 X Y Z means that X is independent of Y given Z. -3-
4 2 Partial Results about Equivalence for Cyclic Graphs lthough there is an O(n 3 ) algorithm which can determine when two acyclic graphs are equivalent in our sense, a feasible algorithm which will establish this equivalence for cyclic graphs has not yet been obtained. Several preliminary results indicate that there is considerable heterogeneity in the equivalence class - much more than in the acyclic case. 2.1 Contrast with the cyclic case In the acyclic case, if and are dependent conditional on any subset of the other variables, then either, or, i.e. either is a direct cause of or is a direct cause of. However, this implication fails in the cyclic case. 4 For example: There are no independencies, conditional or otherwise entailed by this graph, and yet there is no edge between and C. This marks a significant difference between the cyclic and acyclic equivalence classes. C 2.2 Non-locality property In the acyclic case, if two graphs are not equivalent then there will be some conditional independence between variables separated by at most two edges, entailed by one graph, and not by the other. This means that we need only look at the structure of triples of adjacent vertices in order to establish that two graphs are equivalent. This is not true for the cyclic case, as the following example shows: 1 X X X X X X X X and 2 are not equivalent. lthough every conditional independence implied by 2, is also implied by 1, in 1,, while in 2, \. ut and are separated by more than two edges (in both graphs). It is simple to see that we could extend these graphs so that they continued to entail the same conditional independencies, with the exception of, while and were separated by arbitrarily many edges. This result, however leaves open whether a polynomial time algorithm exists for deciding equivalence. 4 similar point is made in Whittaker (1989). -4-
5 2.3 Criteria for detecting feedback: We give a set of conditions which are sufficient, though not necessary conditions for cyclicity. The following four conditional independence conditions can only be linearly entailed (simultaneously) by a cyclic graph. For some triple of vertices, the following hold: (i) For any set S, not containing or, \ S (ii) For any set S, not containing or C, \ C S (iii) There is a set T, with T and C T (iv) There is a set T *, with T * and C T *. s an illustration of this criterion, the following model, presented by Whittaker (1989), is not equivalent to any acyclic graph: X Here <,X,> and <,Y,> satisfy the conditions given above. Y 2.4 The Orientation of cycles Our last and possibly most interesting result says that given a graph with a cycle, there is an equivalent graph, in which is replaced by another cycle, having the opposite orientation to. Thus if is clockwise, is anti-clockwise, and vice-versa. C C E E 1 D 2 D In the above example, in 1, the cycle <C,,D,E> has anti-clockwise orientation, while in the equivalent graph 2, the corresponding cycle <C,,D,E> has clockwise orientation. One important consequence of this result is that it is not possible to orient a cycle merely using conditional independence information. ppendix Definition of d-separation: If there is an arrow from to or from to is called an edge between and. Given three vertices, and C such that there is an edge between and, and between and C, then if the edges collide at, (i.e. C) then we say is a -5-
6 collider between and C, relative to these edges. Otherwise we will say that is a noncollider between and C, relative to these edges. e.g. in the following cases is a noncollider: C, C, Β C. If there is an arrow from to ( ), then we say that is a parent of, and is a child of. We define the descendant relation as the transitive reflexive closure of child, and similarly, ancestor as the transitive reflexive closure of parent. sequence of distinct edges <E 1,,E n > in is an undirected path if and only if there exists a sequences of vertices <V 1, V n+1 > such that for 1 i n either <V i+1,v i >=E i or <V i,v i+1 >=E i. We are now in a position to define d-connection: For disjoint sets, X, Y and Z, X is d-connected to Y given Z if for some X X, and Y Y, there is a path from X to Y, satisfying the following conditions: (i) if, and C are adjacent vertices on the path, and Ζ, then is a collider between and C. (ii) If is a collider between and C, then there is a descendant D, of, and D Ζ. If X and Y are not d-connected given Z then X and Y are said to be d-separated by Z. The following important theorems gives the relationship between d-separation and the linear entailment of conditional independencies, and partial correlations. It was proved for the cyclic case by Spirtes (1993). Theorem (Spirtes): In a (cyclic or acyclic) graph, for disjoint sets, X, Y, Z, X and Y are d-separated given Z, if and only if linearly entails X Y Z. Theorem (Spirtes): In a (cyclic or acyclic) graph, for any set Z, not containing X or Y, X and Y are d-separated given Z, if and only if linearly entails ρ XY.Z =
7 REFERENCES: COX, D.R., and WERMUTH, N. (1993) Linear dependencies represented by chain graphs. In Statistical Science, 1993, 8 No.3, GEIGER, D. (1990). Graphoids: a qualitative framework for probabilistic inference. PhD dissertation, Univ. California, Los ngeles. HVELMO,T.(1943). The statistical implications of a system of simultaneous equations. Econometrica, 11, HEISE D.(1975). Causal nalysis. Wiley, New York. KIIVERI, H. and SPEED, T.P. (1982). Structural nalysis of multivariate data: review. In Sociological Methodology, 1982 (S. Leinhardt, ed.) Jossey ass, San Francisco. PERL, J. (1988) Probabilistic Reasoning in Intelligent Systems. Morgan Kaufman, San Mateo, C. SPIRTES, P. (1993) Directed Cyclic graphs, Conditional Independence and Non- Recursive Linear Structural Equation Models. Philosophy, Methodology and Logic Technical Report 35, Carnegie Mellon University SPIRTES, P., GLYMOUR C. and SCHEINES R. (1993), Causation, Prediction and Search. Lecture Notes in Statistics, Springer-Verlag. WHITTKER, J. (1989) Graphical Models in pplied Multivariate Statistics, Wiley. -7-
Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables
N-1 Experiments Suffice to Determine the Causal Relations Among N Variables Frederick Eberhardt Clark Glymour 1 Richard Scheines Carnegie Mellon University Abstract By combining experimental interventions
More informationCausal Inference in the Presence of Latent Variables and Selection Bias
499 Causal Inference in the Presence of Latent Variables and Selection Bias Peter Spirtes, Christopher Meek, and Thomas Richardson Department of Philosophy Carnegie Mellon University Pittsburgh, P 15213
More informationCausality in Econometrics (3)
Graphical Causal Models References Causality in Econometrics (3) Alessio Moneta Max Planck Institute of Economics Jena moneta@econ.mpg.de 26 April 2011 GSBC Lecture Friedrich-Schiller-Universität Jena
More informationLearning Multivariate Regression Chain Graphs under Faithfulness
Sixth European Workshop on Probabilistic Graphical Models, Granada, Spain, 2012 Learning Multivariate Regression Chain Graphs under Faithfulness Dag Sonntag ADIT, IDA, Linköping University, Sweden dag.sonntag@liu.se
More informationFaithfulness of Probability Distributions and Graphs
Journal of Machine Learning Research 18 (2017) 1-29 Submitted 5/17; Revised 11/17; Published 12/17 Faithfulness of Probability Distributions and Graphs Kayvan Sadeghi Statistical Laboratory University
More informationAutomatic Causal Discovery
Automatic Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour Dept. of Philosophy & CALD Carnegie Mellon 1 Outline 1. Motivation 2. Representation 3. Discovery 4. Using Regression for Causal
More informationThe TETRAD Project: Constraint Based Aids to Causal Model Specification
The TETRAD Project: Constraint Based Aids to Causal Model Specification Richard Scheines, Peter Spirtes, Clark Glymour, Christopher Meek and Thomas Richardson Department of Philosophy, Carnegie Mellon
More informationMeasurement Error and Causal Discovery
Measurement Error and Causal Discovery Richard Scheines & Joseph Ramsey Department of Philosophy Carnegie Mellon University Pittsburgh, PA 15217, USA 1 Introduction Algorithms for causal discovery emerged
More informationCausal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon
Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour, and many others Dept. of Philosophy & CALD Carnegie Mellon Graphical Models --11/30/05 1 Outline 1. Motivation 2. Representation 3. Connecting
More informationLearning Marginal AMP Chain Graphs under Faithfulness
Learning Marginal AMP Chain Graphs under Faithfulness Jose M. Peña ADIT, IDA, Linköping University, SE-58183 Linköping, Sweden jose.m.pena@liu.se Abstract. Marginal AMP chain graphs are a recently introduced
More informationUsing Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs
Using Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs Hei Chan Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo 190-8562, Japan
More informationArrowhead completeness from minimal conditional independencies
Arrowhead completeness from minimal conditional independencies Tom Claassen, Tom Heskes Radboud University Nijmegen The Netherlands {tomc,tomh}@cs.ru.nl Abstract We present two inference rules, based on
More informationLearning causal network structure from multiple (in)dependence models
Learning causal network structure from multiple (in)dependence models Tom Claassen Radboud University, Nijmegen tomc@cs.ru.nl Abstract Tom Heskes Radboud University, Nijmegen tomh@cs.ru.nl We tackle the
More informationInferring the Causal Decomposition under the Presence of Deterministic Relations.
Inferring the Causal Decomposition under the Presence of Deterministic Relations. Jan Lemeire 1,2, Stijn Meganck 1,2, Francesco Cartella 1, Tingting Liu 1 and Alexander Statnikov 3 1-ETRO Department, Vrije
More informationCausal Inference on Data Containing Deterministic Relations.
Causal Inference on Data Containing Deterministic Relations. Jan Lemeire Kris Steenhaut Sam Maes COMO lab, ETRO Department Vrije Universiteit Brussel, Belgium {jan.lemeire, kris.steenhaut}@vub.ac.be, sammaes@gmail.com
More informationGraphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence
Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence General overview Introduction Directed acyclic graphs (DAGs) and conditional independence DAGs and causal effects
More informationCarnegie Mellon Pittsburgh, Pennsylvania 15213
A Tutorial On Causal Inference Peter Spirtes August 4, 2009 Technical Report No. CMU-PHIL-183 Philosophy Methodology Logic Carnegie Mellon Pittsburgh, Pennsylvania 15213 1. Introduction A Tutorial On Causal
More informationOn the Identification of a Class of Linear Models
On the Identification of a Class of Linear Models Jin Tian Department of Computer Science Iowa State University Ames, IA 50011 jtian@cs.iastate.edu Abstract This paper deals with the problem of identifying
More informationRespecting Markov Equivalence in Computing Posterior Probabilities of Causal Graphical Features
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10) Respecting Markov Equivalence in Computing Posterior Probabilities of Causal Graphical Features Eun Yong Kang Department
More informationIdentifiability assumptions for directed graphical models with feedback
Biometrika, pp. 1 26 C 212 Biometrika Trust Printed in Great Britain Identifiability assumptions for directed graphical models with feedback BY GUNWOONG PARK Department of Statistics, University of Wisconsin-Madison,
More informationIdentifying Linear Causal Effects
Identifying Linear Causal Effects Jin Tian Department of Computer Science Iowa State University Ames, IA 50011 jtian@cs.iastate.edu Abstract This paper concerns the assessment of linear cause-effect relationships
More informationPDF hosted at the Radboud Repository of the Radboud University Nijmegen
PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is an author's version which may differ from the publisher's version. For additional information about this
More informationGeneralized Measurement Models
Generalized Measurement Models Ricardo Silva and Richard Scheines Center for Automated Learning and Discovery rbas@cs.cmu.edu, scheines@andrew.cmu.edu May 16, 2005 CMU-CALD-04-101 School of Computer Science
More informationA Relational Knowledge System
Relational Knowledge System by Shik Kam Michael Wong, Cory James utz, and Dan Wu Technical Report CS-01-04 January 2001 Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S
More informationLearning Semi-Markovian Causal Models using Experiments
Learning Semi-Markovian Causal Models using Experiments Stijn Meganck 1, Sam Maes 2, Philippe Leray 2 and Bernard Manderick 1 1 CoMo Vrije Universiteit Brussel Brussels, Belgium 2 LITIS INSA Rouen St.
More informationCS Lecture 3. More Bayesian Networks
CS 6347 Lecture 3 More Bayesian Networks Recap Last time: Complexity challenges Representing distributions Computing probabilities/doing inference Introduction to Bayesian networks Today: D-separation,
More informationLearning the Structure of Linear Latent Variable Models
Journal (2005) - Submitted /; Published / Learning the Structure of Linear Latent Variable Models Ricardo Silva Center for Automated Learning and Discovery School of Computer Science rbas@cs.cmu.edu Richard
More informationMotivation. Bayesian Networks in Epistemology and Philosophy of Science Lecture. Overview. Organizational Issues
Bayesian Networks in Epistemology and Philosophy of Science Lecture 1: Bayesian Networks Center for Logic and Philosophy of Science Tilburg University, The Netherlands Formal Epistemology Course Northern
More informationAugmented Bayesian Networks for Representing Information Equivalences and Extensions to the PC Learning Algorithm.
Augmented Bayesian Networks for Representing Information Equivalences and Extensions to the PC Learning Algorithm. Jan Lemeire, Sam Maes, Erik Dirkx, Kris Steenhaut, Ann Nowé August 8, 2007 Abstract Data
More informationConfounding Equivalence in Causal Inference
In Proceedings of UAI, 2010. In P. Grunwald and P. Spirtes, editors, Proceedings of UAI, 433--441. AUAI, Corvallis, OR, 2010. TECHNICAL REPORT R-343 July 2010 Confounding Equivalence in Causal Inference
More informationCausal Effect Identification in Alternative Acyclic Directed Mixed Graphs
Proceedings of Machine Learning Research vol 73:21-32, 2017 AMBN 2017 Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Jose M. Peña Linköping University Linköping (Sweden) jose.m.pena@liu.se
More information1. what conditional independencies are implied by the graph. 2. whether these independecies correspond to the probability distribution
NETWORK ANALYSIS Lourens Waldorp PROBABILITY AND GRAPHS The objective is to obtain a correspondence between the intuitive pictures (graphs) of variables of interest and the probability distributions of
More informationGeneralized Do-Calculus with Testable Causal Assumptions
eneralized Do-Calculus with Testable Causal Assumptions Jiji Zhang Division of Humanities and Social Sciences California nstitute of Technology Pasadena CA 91125 jiji@hss.caltech.edu Abstract A primary
More informationMarkov properties for mixed graphs
Bernoulli 20(2), 2014, 676 696 DOI: 10.3150/12-BEJ502 Markov properties for mixed graphs KAYVAN SADEGHI 1 and STEFFEN LAURITZEN 2 1 Department of Statistics, Baker Hall, Carnegie Mellon University, Pittsburgh,
More informationIntroduction to Probabilistic Graphical Models
Introduction to Probabilistic Graphical Models Kyu-Baek Hwang and Byoung-Tak Zhang Biointelligence Lab School of Computer Science and Engineering Seoul National University Seoul 151-742 Korea E-mail: kbhwang@bi.snu.ac.kr
More informationThe Role of Assumptions in Causal Discovery
The Role of Assumptions in Causal Discovery Marek J. Druzdzel Decision Systems Laboratory, School of Information Sciences and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA 15260,
More informationIdentification and Overidentification of Linear Structural Equation Models
In D. D. Lee and M. Sugiyama and U. V. Luxburg and I. Guyon and R. Garnett (Eds.), Advances in Neural Information Processing Systems 29, pre-preceedings 2016. TECHNICAL REPORT R-444 October 2016 Identification
More informationDetecting marginal and conditional independencies between events and learning their causal structure.
Detecting marginal and conditional independencies between events and learning their causal structure. Jan Lemeire 1,4, Stijn Meganck 1,3, Albrecht Zimmermann 2, and Thomas Dhollander 3 1 ETRO Department,
More informationLecture 4 October 18th
Directed and undirected graphical models Fall 2017 Lecture 4 October 18th Lecturer: Guillaume Obozinski Scribe: In this lecture, we will assume that all random variables are discrete, to keep notations
More informationPairwise Cluster Comparison for Learning Latent Variable Models
Pairwise Cluster Comparison for Learning Latent Variable Models Nuaman Asbeh Industrial Engineering and Management Ben-Gurion University of the Negev Beer-Sheva, 84105, Israel Boaz Lerner Industrial Engineering
More informationIdentification of Conditional Interventional Distributions
Identification of Conditional Interventional Distributions Ilya Shpitser and Judea Pearl Cognitive Systems Laboratory Department of Computer Science University of California, Los Angeles Los Angeles, CA.
More informationDirected acyclic graphs and the use of linear mixed models
Directed acyclic graphs and the use of linear mixed models Siem H. Heisterkamp 1,2 1 Groningen Bioinformatics Centre, University of Groningen 2 Biostatistics and Research Decision Sciences (BARDS), MSD,
More informationε X ε Y Z 1 f 2 g 2 Z 2 f 1 g 1
Semi-Instrumental Variables: A Test for Instrument Admissibility Tianjiao Chu tchu@andrew.cmu.edu Richard Scheines Philosophy Dept. and CALD scheines@cmu.edu Peter Spirtes ps7z@andrew.cmu.edu Abstract
More informationUCLA Department of Statistics Papers
UCLA Department of Statistics Papers Title Comment on `Causal inference, probability theory, and graphical insights' (by Stuart G. Baker) Permalink https://escholarship.org/uc/item/9863n6gg Journal Statistics
More informationUniversità degli Studi di Modena e Reggio Emilia. On Block Ordering of Variables in Graphical Modelling. Alberto Roverato Luca La Rocca
Università degli Studi di Modena e Reggio Emilia On Block Ordering of Variables in Graphical Modelling Alberto Roverato Luca La Rocca R 29-04 ottobre 2004 RESEARCH REPORT Dipartimento di Scienze Sociali,
More informationCOMP538: Introduction to Bayesian Networks
COMP538: Introduction to Bayesian Networks Lecture 9: Optimal Structure Learning Nevin L. Zhang lzhang@cse.ust.hk Department of Computer Science and Engineering Hong Kong University of Science and Technology
More informationInstrumental Sets. Carlos Brito. 1 Introduction. 2 The Identification Problem
17 Instrumental Sets Carlos Brito 1 Introduction The research of Judea Pearl in the area of causality has been very much acclaimed. Here we highlight his contributions for the use of graphical languages
More informationBelief Update in CLG Bayesian Networks With Lazy Propagation
Belief Update in CLG Bayesian Networks With Lazy Propagation Anders L Madsen HUGIN Expert A/S Gasværksvej 5 9000 Aalborg, Denmark Anders.L.Madsen@hugin.com Abstract In recent years Bayesian networks (BNs)
More informationCausal Reasoning with Ancestral Graphs
Journal of Machine Learning Research 9 (2008) 1437-1474 Submitted 6/07; Revised 2/08; Published 7/08 Causal Reasoning with Ancestral Graphs Jiji Zhang Division of the Humanities and Social Sciences California
More informationLearning in Bayesian Networks
Learning in Bayesian Networks Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Berlin: 20.06.2002 1 Overview 1. Bayesian Networks Stochastic Networks
More informationExchangeability and Invariance: A Causal Theory. Jiji Zhang. (Very Preliminary Draft) 1. Motivation: Lindley-Novick s Puzzle
Exchangeability and Invariance: A Causal Theory Jiji Zhang (Very Preliminary Draft) 1. Motivation: Lindley-Novick s Puzzle In their seminal paper on the role of exchangeability in statistical inference,
More informationInterpreting and using CPDAGs with background knowledge
Interpreting and using CPDAGs with background knowledge Emilija Perković Seminar for Statistics ETH Zurich, Switzerland perkovic@stat.math.ethz.ch Markus Kalisch Seminar for Statistics ETH Zurich, Switzerland
More informationSeparable and Transitive Graphoids
Separable and Transitive Graphoids Dan Geiger Northrop Research and Technology Center One Research Park Palos Verdes, CA 90274 David Heckerman Medical Computer Science Group Knowledge Systems Laboratory
More informationLocal Characterizations of Causal Bayesian Networks
In M. Croitoru, S. Rudolph, N. Wilson, J. Howse, and O. Corby (Eds.), GKR 2011, LNAI 7205, Berlin Heidelberg: Springer-Verlag, pp. 1-17, 2012. TECHNICAL REPORT R-384 May 2011 Local Characterizations of
More informationIntroduction to Causal Calculus
Introduction to Causal Calculus Sanna Tyrväinen University of British Columbia August 1, 2017 1 / 1 2 / 1 Bayesian network Bayesian networks are Directed Acyclic Graphs (DAGs) whose nodes represent random
More informationPart I Qualitative Probabilistic Networks
Part I Qualitative Probabilistic Networks In which we study enhancements of the framework of qualitative probabilistic networks. Qualitative probabilistic networks allow for studying the reasoning behaviour
More informationMonotone DAG Faithfulness: A Bad Assumption
Monotone DAG Faithfulness: A Bad Assumption David Maxwell Chickering Christopher Meek Technical Report MSR-TR-2003-16 Microsoft Research Microsoft Corporation One Microsoft Way Redmond, WA 98052 Abstract
More informationThe Specification of Causal Models with Tetrad IV: A Review
Structural Equation Modeling, 17:703 711, 2010 Copyright Taylor & Francis Group, LLC ISSN: 1070-5511 print/1532-8007 online DOI: 10.1080/10705511.2010.510074 SOFTWARE REVIEW The Specification of Causal
More informationSupplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges
Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges 1 PRELIMINARIES Two vertices X i and X j are adjacent if there is an edge between them. A path
More informationIterative Conditional Fitting for Gaussian Ancestral Graph Models
Iterative Conditional Fitting for Gaussian Ancestral Graph Models Mathias Drton Department of Statistics University of Washington Seattle, WA 98195-322 Thomas S. Richardson Department of Statistics University
More information6.867 Machine learning, lecture 23 (Jaakkola)
Lecture topics: Markov Random Fields Probabilistic inference Markov Random Fields We will briefly go over undirected graphical models or Markov Random Fields (MRFs) as they will be needed in the context
More informationChris Bishop s PRML Ch. 8: Graphical Models
Chris Bishop s PRML Ch. 8: Graphical Models January 24, 2008 Introduction Visualize the structure of a probabilistic model Design and motivate new models Insights into the model s properties, in particular
More information1 : Introduction. 1 Course Overview. 2 Notation. 3 Representing Multivariate Distributions : Probabilistic Graphical Models , Spring 2014
10-708: Probabilistic Graphical Models 10-708, Spring 2014 1 : Introduction Lecturer: Eric P. Xing Scribes: Daniel Silva and Calvin McCarter 1 Course Overview In this lecture we introduce the concept of
More informationTestable Implications of Linear Structural Equation Models
Testable Implications of Linear Structural Equation Models Bryant Chen University of California, Los Angeles Computer Science Department Los Angeles, CA, 90095-1596, USA bryantc@cs.ucla.edu Jin Tian Iowa
More informationCAUSAL MODELS: THE MEANINGFUL INFORMATION OF PROBABILITY DISTRIBUTIONS
CAUSAL MODELS: THE MEANINGFUL INFORMATION OF PROBABILITY DISTRIBUTIONS Jan Lemeire, Erik Dirkx ETRO Dept., Vrije Universiteit Brussel Pleinlaan 2, 1050 Brussels, Belgium jan.lemeire@vub.ac.be ABSTRACT
More informationTowards an extension of the PC algorithm to local context-specific independencies detection
Towards an extension of the PC algorithm to local context-specific independencies detection Feb-09-2016 Outline Background: Bayesian Networks The PC algorithm Context-specific independence: from DAGs to
More informationTestable Implications of Linear Structural Equation Models
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Palo Alto, CA: AAAI Press, 2424--2430, 2014. TECHNICAL REPORT R-428 July 2014 Testable Implications of Linear Structural Equation
More informationEstimation of linear non-gaussian acyclic models for latent factors
Estimation of linear non-gaussian acyclic models for latent factors Shohei Shimizu a Patrik O. Hoyer b Aapo Hyvärinen b,c a The Institute of Scientific and Industrial Research, Osaka University Mihogaoka
More informationOn Markov Properties in Evidence Theory
On Markov Properties in Evidence Theory 131 On Markov Properties in Evidence Theory Jiřina Vejnarová Institute of Information Theory and Automation of the ASCR & University of Economics, Prague vejnar@utia.cas.cz
More informationMarkov properties for directed graphs
Graphical Models, Lecture 7, Michaelmas Term 2009 November 2, 2009 Definitions Structural relations among Markov properties Factorization G = (V, E) simple undirected graph; σ Say σ satisfies (P) the pairwise
More informationLearning Measurement Models for Unobserved Variables
UAI2003 SILVA ET AL. 543 Learning Measurement Models for Unobserved Variables Ricardo Silva, Richard Scheines, Clark Glymour and Peter Spirtes* Center for Automated Learning and Discovery Carnegie Mellon
More informationMarkov properties for graphical time series models
Markov properties for graphical time series models Michael Eichler Universität Heidelberg Abstract This paper deals with the Markov properties of a new class of graphical time series models which focus
More informationAcyclic Linear SEMs Obey the Nested Markov Property
Acyclic Linear SEMs Obey the Nested Markov Property Ilya Shpitser Department of Computer Science Johns Hopkins University ilyas@cs.jhu.edu Robin J. Evans Department of Statistics University of Oxford evans@stats.ox.ac.uk
More informationCausal Models with Hidden Variables
Causal Models with Hidden Variables Robin J. Evans www.stats.ox.ac.uk/ evans Department of Statistics, University of Oxford Quantum Networks, Oxford August 2017 1 / 44 Correlation does not imply causation
More informationCausal Diagrams. By Sander Greenland and Judea Pearl
Wiley StatsRef: Statistics Reference Online, 2014 2017 John Wiley & Sons, Ltd. 1 Previous version in M. Lovric (Ed.), International Encyclopedia of Statistical Science 2011, Part 3, pp. 208-216, DOI: 10.1007/978-3-642-04898-2_162
More informationCausal Inference for High-Dimensional Data. Atlantic Causal Conference
Causal Inference for High-Dimensional Data Atlantic Causal Conference Overview Conditional Independence Directed Acyclic Graph (DAG) Models factorization and d-separation Markov equivalence Structure Learning
More informationMULTIVARIATE STATISTICAL ANALYSIS. Nanny Wermuth 1
MULTIVARIATE STATISTICAL ANALYSIS Nanny Wermuth 1 Professor of Statistics, Department of Mathematical Sciences Chalmers Technical University/University of Gothenburg, Sweden Classical multivariate statistical
More informationCausal Directed Acyclic Graphs
Causal Directed Acyclic Graphs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Causal DAGs Stat186/Gov2002 Fall 2018 1 / 15 Elements of DAGs (Pearl. 2000.
More informationIdentifiability in Causal Bayesian Networks: A Sound and Complete Algorithm
Identifiability in Causal Bayesian Networks: A Sound and Complete Algorithm Yimin Huang and Marco Valtorta {huang6,mgv}@cse.sc.edu Department of Computer Science and Engineering University of South Carolina
More informationLearning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach
Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach Stijn Meganck 1, Philippe Leray 2, and Bernard Manderick 1 1 Vrije Universiteit Brussel, Pleinlaan 2,
More informationGraphical models, causal inference, and econometric models
Journal of Economic Methodology 12:1, 1 33 March 2005 Graphical models, causal inference, and econometric models Peter Spirtes Abstract A graphical model is a graph that represents a set of conditional
More informationRapid Introduction to Machine Learning/ Deep Learning
Rapid Introduction to Machine Learning/ Deep Learning Hyeong In Choi Seoul National University 1/32 Lecture 5a Bayesian network April 14, 2016 2/32 Table of contents 1 1. Objectives of Lecture 5a 2 2.Bayesian
More informationTutorial: Causal Model Search
Tutorial: Causal Model Search Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Joe Ramsey, others 1 Goals 1) Convey rudiments of graphical causal models 2) Basic working knowledge
More informationLars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), Institute BW/WI & Institute for Computer Science, University of Hildesheim
Course on Bayesian Networks, summer term 2010 0/42 Bayesian Networks Bayesian Networks 11. Structure Learning / Constrained-based Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL)
More informationIssues on the Estimation of Latent Variable and Latent Class Models with Social Science Applications
UNIVERSITÀ DEGLI STUDI DI FIRENZE DIPARTIMENTO DI STATISTICA G. PARENTI DOTTORATO IN STATISTICA APPLICATA XVI CICLO Issues on the Estimation of Latent Variable and Latent Class Models with Social Science
More informationCAUSAL INFERENCE IN TIME SERIES ANALYSIS. Michael Eichler
CAUSAL INFERENCE IN TIME SERIES ANALYSIS Michael Eichler Department of Quantitative Economics, Maastricht University P.O. Box 616, 6200 MD Maastricht, The Netherlands November 11, 2011 1. ÁÒØÖÓ ÙØ ÓÒ The
More informationPDF hosted at the Radboud Repository of the Radboud University Nijmegen
PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a preprint version which may differ from the publisher's version. For additional information about this
More informationAlternative Markov and Causal Properties for Acyclic Directed Mixed Graphs
Alternative Markov and Causal Properties for Acyclic Directed Mixed Graphs Jose M. Peña Department of Computer and Information Science Linköping University 58183 Linköping, Sweden Abstract We extend Andersson-Madigan-Perlman
More informationThe International Journal of Biostatistics
The International Journal of Biostatistics Volume 7, Issue 1 2011 Article 16 A Complete Graphical Criterion for the Adjustment Formula in Mediation Analysis Ilya Shpitser, Harvard University Tyler J. VanderWeele,
More informationCausal Clustering for 1-Factor Measurement Models
Causal Clustering for -Factor Measurement Models Erich Kummerfeld University of Pittsburgh 5607 Baum Boulevard, Suite 500 Pittsburgh, PA 5206 erk57@pitt.edu Joseph Ramsey Carnegie Mellon University 39
More informationTowards Association Rules with Hidden Variables
Towards Association Rules with Hidden Variables Ricardo Silva and Richard Scheines Gatsby Unit, University College London, WCN AR London, UK Machine Learning Department, Carnegie Mellon, Pittsburgh PA,
More informationDay 3: Search Continued
Center for Causal Discovery Day 3: Search Continued June 15, 2015 Carnegie Mellon University 1 Outline Models Data 1) Bridge Principles: Markov Axiom and D-separation 2) Model Equivalence 3) Model Search
More information5.3 Graphs and Identifiability
218CHAPTER 5. CAUSALIT AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS.83 AFFECT.65.23 BEHAVIOR COGNITION Figure 5.5: Untestable model displaying quantitative causal information derived. method elucidates
More information2 : Directed GMs: Bayesian Networks
10-708: Probabilistic Graphical Models, Spring 2015 2 : Directed GMs: Bayesian Networks Lecturer: Eric P. Xing Scribes: Yi Cheng, Cong Lu 1 Notation Here the notations used in this course are defined:
More informationWhat Causality Is (stats for mathematicians)
What Causality Is (stats for mathematicians) Andrew Critch UC Berkeley August 31, 2011 Introduction Foreword: The value of examples With any hard question, it helps to start with simple, concrete versions
More information2 : Directed GMs: Bayesian Networks
10-708: Probabilistic Graphical Models 10-708, Spring 2017 2 : Directed GMs: Bayesian Networks Lecturer: Eric P. Xing Scribes: Jayanth Koushik, Hiroaki Hayashi, Christian Perez Topic: Directed GMs 1 Types
More informationCSC 412 (Lecture 4): Undirected Graphical Models
CSC 412 (Lecture 4): Undirected Graphical Models Raquel Urtasun University of Toronto Feb 2, 2016 R Urtasun (UofT) CSC 412 Feb 2, 2016 1 / 37 Today Undirected Graphical Models: Semantics of the graph:
More informationLearning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables
Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables Robert E. Tillman Carnegie Mellon University Pittsburgh, PA rtillman@cmu.edu
More informationFinancial Econometrics
Financial Econometrics Multivariate Time Series Analysis: VAR Gerald P. Dwyer Trinity College, Dublin January 2013 GPD (TCD) VAR 01/13 1 / 25 Structural equations Suppose have simultaneous system for supply
More informationMachine Learning Summer School
Machine Learning Summer School Lecture 1: Introduction to Graphical Models Zoubin Ghahramani zoubin@eng.cam.ac.uk http://learning.eng.cam.ac.uk/zoubin/ epartment of ngineering University of ambridge, UK
More information