Causal Discovery with Latent Variables: the Measurement Problem

Size: px
Start display at page:

Download "Causal Discovery with Latent Variables: the Measurement Problem"

Transcription

1 Causal Discovery with Latent Variables: the Measurement Problem Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Ricardo Silva, Steve Riese, Erich Kummerfeld, Renjie Yang, Max Mansolf 1

2 Outline 1. Measurement Error, Coarsening and Conditional Independence 2. Conventional Strategies 3. Latent Variable Models to the Rescue 4. The Problem of Impurity 5. Strategies for Handling Impurity (Rank Constraints) 6. Application to Psychometric Models

3 Causal Structure D-separation Causal Markov Axiom Testable Statistical Predictions Causal Graphs e.g., Conditional Independence X Y Z X _ _ Z Y x,y,z P(X = x, Z=z Y=y) = P(X = x Y=y) P(Z=z Y=y) 3

4 Measurement Error and Coarsening Endanger conditional Independence X Z Y X _ _ Y Z Z Z* ε Z = Z + ε Measurement Error Coarsening: - < Z < 0 à Z* = 0 0 Z < i à Z* = 1 i Z < j à Z* = 2.. k Z < à Z* = k X _ _ Y Z (unless Var(ε ) = 0) X _ _ Y Z* (almost always) 4

5 Parameterizing Measurement Error Y Y m := Y + ε ym ε ym ~ N(0,σ 2 2 ) Y m Measurement_Error = ε Ym ε ym Var(Y m ) = Var(Y) + Var(ε Ym ) Amount of Measurement Error = Var(eYm)/Var(Ym) = Var(Ym) Var(Y)/Var(Ym) 5

6 Unstandardized Standardized Y m := Y + ε ym Y Y Y m := β* m Y + ε y m β* m ε ym ~ N(0,σ 2 ) Y m Y m ε y m ~ N(0,σ 2 ) ε ym ε y'm Var(Y) = Var(Y m ) = 1.0 Measurement Error = Var(eYm)/Var(Ym) Var(ε y m ) = σ 2 = 1 - β* m 2 β* m = ρ(y, Y m ) Var(ε y m ) = 1 - ρ(y, Y m ) 2 Measurement Error = Var(eY m)/ Var(Y m) 6 Measurement Error = 1 r(y, Y m)2/1.0

7 Measurement Error True Graph Parameterization 1: Standardized SEM IM Measurement error (negligible)

8 Measurement Error N= 1,000

9 Measurement Error Parameterization 1: Negligible Measurement Error GES Output FCI Output

10 Measurement Error Parameterization 2: Small Measurement Error GES Output FCI Output (α =.05)

11 Measurement Error Parameterization 2: Small Measurement Error GES Output FCI Output (α =.01)

12 Coarsening Parameterization 3: GES Output FCI Output (α =.05)

13 Coarsening L1 > 0.0 à L1c = 1.0 L1 0.0 à L1c = 0.0 L1 > 0.0 à L1c = 1.0 L1 0.0 à L1c = 0.0

14 Strategies X Z Y 1. Guess (the amount of measurement error): Sensitivity Analysis Bayesian Analysis Bounds Z ε Measurement Error = 1 ρ(z, Z )2 Prior over 14

15 Strategies 1. Guess the measurement error: Sensitivity Analysis Bayesian Analysis Bounds X Z Y Z ε 2. Multiple Indicators: X Z Y Z1 Z2 15

16 Strategies 1. Parameterize measurement error: Sensitivity Analysis Bayesian Analysis Bounds X Z Y Z ε 2. Multiple Indicators: Scales X Z Y Z1 Z2 X _ _ Y Z_scale εz1 Z_scale εz2 16

17 Simulated Example: Lead and IQ Parental Resources Lead Exposure IQ Psuedorandom sample: N = 2,000 Test of Lead _ _ IQ Parental Resources: Regression: Dependent Variable: IQ Independent Variables: Lead, PR Independent Variable Coefficient Estimate p-value PR Lead

18 Multiple Measures of the Confounder ε 1 ε 2 ε 3 X 1 X 2 X 3 Lead Exposure Parental Resources IQ X 1 := Parental Resources + ε 1 X 2 := Parental Resources + ε 2 X 3 := Parental Resources + ε 3 18

19 Scales don't preserve conditional independence PR_Scale ε 1 ε 2 ε 3 X 1 X 2 X 3 Lead Exposure Parental Resources IQ PR_Scale = (X 1 + X 2 + X 3 ) / 3 Independent Variable Coefficient Estimate p-value PR_scale Lead

20 Indicators Don t Preserve Conditional Independence X 1 X 2 X 3 Lead Exposure Parental Resources IQ Regress IQ on: Lead, X 1, X 2, X 3 Independent Variable Coefficient Estimate p-value X X X Lead

21 Strategies 1. Parameterize measurement error: Sensitivity Analysis Bayesian Analysis Bounds 2. Multiple Indicators: Scales SEM εx X Z Y Z ε βxy X Z Y Z1 Z2 εy εz1 εz2 E( ˆ β ) = yx 0 X _ _ Y Z 21

22 Structural Equation Models Work! True Model Estimated Model X 1 X 2 X 3 X 1 X 2 X 3 Lead Exposure Parental Resources IQ Lead Exposure Parental Resources IQ β In the Estimated Model E( ˆ) β = 0 ˆ β =.07 (p-value =.499) Lead and IQ detectibly screened off by PR 22

23 Test conditional independence relations among latents generally in a SEM Question: L 1 _ _ L 2 {Q 1, Q 2,..., Q n } β 21 E( ˆ β 21 ) = 0 L 1 _ _ L 2 {Q 1, Q 2,..., Q n }

24 Pure Measurement Models: Causal Discovery among latents feasible Independence Questions: e.g., L 1 _ _ L 2 {L 3, L 4,..., L n } Constraint Based Search over L MIMBuild (PC) Pattern, PAG over L

25 The Problem of Impurities ( Unmodeled Complexity ) Specified Model True Model εx β yx εy εx εy X Z Y X Z Y Z1 Z2 Z1 Z2 εz1 εz2 εz1 εz2 E( ˆ β ) = yx 0 X _ _ Y Z E( ˆ β yx ) β xy 25

26 Test conditional independence relations among latents generally Question: L 1 _ _ L 2 {Q 1, Q 2,..., Q n } β 21 E( ˆ β 21 ) = 0 L 1 _ _ L 2 {Q 1, Q 2,..., Q n }

27 Strategy: Purify the Measurement Model Full Model F1 F2 F3 x 1 x 2 x 3 x 4 y 1 y 2 y 3 y 4 z 1 z 2 z 3 z 4 Structural Model F1 F2 F3 Measurement Model F1 F2 F3 x 1 x 2 x 3 x 4 y 1 y 2 y 3 y 4 z 1 z 2 z 3 z 4 27

28 Local Independence / Pure Measurement Models Impure F1 F2 F3 x 1 x 2 x 3 x 4 y 1 y 2 y 3 y 4 z 1 z 2 z 3 z 4 F 1 st Order Pure F1 F2 F3 x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 10 x 11 x 12 28

29 Purify Impure F1 F2 F3 x 1 x 2 x 3 x 4 y 1 y 2 y 3 y 4 z 1 z 2 z 3 z 4 F 1 st Order Purified F1 F2 F3 x 1 x 2 X3 y 1 y 2 y 3 y 4 z 1 z 3 z 4 X 4 Z2 29

30 Testable/Observable Contraints Latent variable models that are linear below the latents entail testable rank constraints on the measured covariance matrix regardless of the structural model. Impurities selectively defeat such implications, and are thus in many circumstances detectable and localizable. 30

31 Rank 1 Constraints: Tetrad Equations Fact: given W L λ 1 λ 2 λ 3 λ 4 X Y Z W = λ 1 L + ε 1 X = λ 2 L + ε 2 Y = λ 3 L + ε 3 Z = λ 4 L + ε 4 it follows that Cov WX Cov YZ = (λ 1 λ 2 σ 2 L ) (λ 3λ 4 σ 2 L ) = = (λ 1 λ 3 σ 2 L ) (λ 2λ 4 σ 2 L ) = Cov WY Cov XZ σ WX σ YZ = σ WY σ XZ = σ WZ σ XY tetrad constraints

32 Charles Spearman (1904) Statistical Constraints à Measurement Model Structure ρ m1,m2 * ρ r1,r2 = ρ m1,r1 * ρ m2,r2 = ρ m1,r2 * ρ m2,r1 g m1 m2 r1 r2

33 Impurities defeat rank (e.g., tetrad) constraints selectively Truth F1 Truth F1 x 1 x 2 x 3 x 4 x 1 x 2 x 3 x 4 F5 ρ x1,x2 * ρ x3,x4 = ρ x1,x3 * ρ x2,x4 ρ x1,x2 * ρ x3,x4 = ρ x1,x4 * ρ x2,x3 ρ x1,x3 * ρ x2,x4 = ρ x1,x4 * ρ x2,x3 ρ x1,x2 * ρ x3,x4 = ρ x1,x3 * ρ x2,x4 ρ x1,x2 * ρ x3,x4 = ρ x1,x4 * ρ x2,x3 ρ x1,x3 * ρ x2,x4 = ρ x1,x4 * ρ x2,x3 33

34 2006: Build (1 st -Order) Pure Clusters (BPC) Input: - Covariance matrix over set of original items BPC (FOFC) 1) Cluster (complicated boolean combinations of tetrads) 2) Purify Output: Equivalence class of 1 st order pure clusters BPC: Pointwise consistent Silva, R., Glymour, C., Scheines, R. and Spirtes, P. (2006) Learning the Structure of Latent Linear Structure Models, Journal of Machine Learning Research, 7,

35 Case Study: Stress, Depression, and Religion Masters Students (N = 127) 61 - item survey (Likert Scale) Stress: St 1 - St 21 Depression: D 1 - D 20 Religious Coping: C 1 - C 20 St 1 1 St St 21 1 Specified Model Depression Stress + + Coping - Dep 1 1 Dep Dep 20 1 P(χ2) = 0.00 C 1 C 2.. C 20 35

36 Case Study: Stress, Depression, and Religion St 3 1 Build Pure Clusters St 4 1 St 16 1 St 18 1 Stress Coping Dep 9 1 Depression Dep 13 1 Dep 19 1 St 20 1 C 9 C 12 C 15 C 14 36

37 Case Study: Stress, Depression, and Religion Assume : Stress causally prior to Depression Find : Stress _ _ Coping Depression St 3 1 St 4 1 St 16 1 St 18 1 Stress + Coping Dep 9 1 Depression Dep Dep 19 1 St 20 1 C 9 C 12 C 15 C 14 P(χ2) =

38 Psychometric Models in Practice: Almost Never 1 st Order Pure or 1 st -order Purifiable

39 Psychometric Models in Practice: Almost Never 1 st Order Pure or 1 st -order Purifiable Bi-factor structure

40 Local Independence / Pure Measurement Models 1 st Order Pure F1 F2 F3 x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 10 x 11 x 12 2 nd Order Pure F1 F2 F3 x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 10 x 11 x 12 G 40

41 Beyond 1 st -Order Purity and Tetrads Drton, M., Sturmfels, B., Sullivant, S. (2007) Algebraic factor analysis: tetrads, pentads and beyond, Probability Theory and Related Fields, 138, 3-4, Sullivant, S., Talaska, K., & Draisma, J. (2010). Trek Separation for Gaussian Graphical Models. Annals of Statistics, 38(3),

42 Trek-Separation 42

43 2013 FTFC (2 nd -order pure measurement clusters) Input: Covariance Matrix of measured items: Output: Subset of items and clusters that are 2 nd Order Pure Kummerfeld and Ramsey

44 2 nd Order Impure F1 F2 F3 x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8 x 9 x 10 x 11 x 12 G Σ(X) FTFC 2 nd -Order Pure Clusters: {X 1, X 2, X 3, X 4, X 5, X 6 } {X 8, X 9, X 10, X 11, X 12 } 44

45 Psychometric Models in Practice:Bi-factor vs. Higher-Order Bi-factor Higher-order

46 Model Comparison: 2 nd -Order vs. Bi-factor 2 nd -Order entails more constraints (simpler, more degrees of freedom, nesting not simple) Statistical comparison: fit - complexity fit to data (good) complexity (bad) p-value(χ 2 ), BIC, AIC, etc. Theoretically BIC desirable.

47 Model Comparison: 2 nd -Order vs. Bi-factor Empirical finding: Bi-factor usually wins (65 published studies comparing: all chose bi-factors) What if data-generating process is neither 2 nd -order nor bi-factor? What if there is unmodeled complexity?

48 Simulation Study: Truth: 2 nd -Order

49 Simulation Study: 2 Strategies Data from impure 2 nd -order model 1. Before FOFC: Fit and compare pure versions of 2 nd -order, bi-factor models with correct clustering on all items 2. After FOFC/FTFC Locate and discard impure indicators Fit and compare pure versions of 2 nd -order, bi-factor models with correct clustering on retained items

50 Truth Purified Version of Truth {X12, X13, X20} removed

51 Simulation Study Truth: 2 nd -Order dbic > 0: 2 nd -Order Model dbic < 0: Bi-factor Model

52 Size of the impurities dbic > 0: 2 nd -Order Model Truth: 2 nd -Order dbic < 0: Bi-factor Model

53 Level of Inhomogeneity Inhomogeneiity: the degree to which the impurities are concentrated on a small number of indicators Low Inhomogeneiity High Inhomogeneiity

54 Level of Inhomogeneity Truth: 2 nd -Order dbic > 0: 2 nd -Order Model dbic < 0: Bi-factor Model

55 Sample Size Truth: 2 nd -Order dbic > 0: 2 nd -Order Model dbic < 0: Bi-factor Model

56 Thanks 56

57 Measurement Model Misspecification à Structural Parameter Bias 57

58 Measurement Model Misspecification à Structural Parameter Bias Specified Model True Model Parametric Measure of Difference: Explained Common Variance (ECV) 58

59 Measurement Models Unidimensional Correlated Trait

60 Psychometric Models in Practice: Bi-factor vs. 2 nd (Higher)-Order Models 2 nd Order Bi-factor

Day 3: Search Continued

Day 3: Search Continued Center for Causal Discovery Day 3: Search Continued June 15, 2015 Carnegie Mellon University 1 Outline Models Data 1) Bridge Principles: Markov Axiom and D-separation 2) Model Equivalence 3) Model Search

More information

Automatic Causal Discovery

Automatic Causal Discovery Automatic Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour Dept. of Philosophy & CALD Carnegie Mellon 1 Outline 1. Motivation 2. Representation 3. Discovery 4. Using Regression for Causal

More information

Measurement Error and Causal Discovery

Measurement Error and Causal Discovery Measurement Error and Causal Discovery Richard Scheines & Joseph Ramsey Department of Philosophy Carnegie Mellon University Pittsburgh, PA 15217, USA 1 Introduction Algorithms for causal discovery emerged

More information

Tutorial: Causal Model Search

Tutorial: Causal Model Search Tutorial: Causal Model Search Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Joe Ramsey, others 1 Goals 1) Convey rudiments of graphical causal models 2) Basic working knowledge

More information

Learning the Structure of Linear Latent Variable Models

Learning the Structure of Linear Latent Variable Models Journal (2005) - Submitted /; Published / Learning the Structure of Linear Latent Variable Models Ricardo Silva Center for Automated Learning and Discovery School of Computer Science rbas@cs.cmu.edu Richard

More information

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour, and many others Dept. of Philosophy & CALD Carnegie Mellon Graphical Models --11/30/05 1 Outline 1. Motivation 2. Representation 3. Connecting

More information

Challenges (& Some Solutions) and Making Connections

Challenges (& Some Solutions) and Making Connections Challenges (& Some Solutions) and Making Connections Real-life Search All search algorithm theorems have form: If the world behaves like, then (probability 1) the algorithm will recover the true structure

More information

Using Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs

Using Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs Using Descendants as Instrumental Variables for the Identification of Direct Causal Effects in Linear SEMs Hei Chan Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo 190-8562, Japan

More information

Causal Clustering for 1-Factor Measurement Models

Causal Clustering for 1-Factor Measurement Models Causal Clustering for -Factor Measurement Models Erich Kummerfeld University of Pittsburgh 5607 Baum Boulevard, Suite 500 Pittsburgh, PA 5206 erk57@pitt.edu Joseph Ramsey Carnegie Mellon University 39

More information

Generalized Measurement Models

Generalized Measurement Models Generalized Measurement Models Ricardo Silva and Richard Scheines Center for Automated Learning and Discovery rbas@cs.cmu.edu, scheines@andrew.cmu.edu May 16, 2005 CMU-CALD-04-101 School of Computer Science

More information

Center for Causal Discovery: Summer Workshop

Center for Causal Discovery: Summer Workshop Center for Causal Discovery: Summer Workshop - 2015 June 8-11, 2015 Carnegie Mellon University 1 Goals 1) Working knowledge of graphical causal models 2) Basic working knowledge of Tetrad V 3) Basic understanding

More information

Causality in Econometrics (3)

Causality in Econometrics (3) Graphical Causal Models References Causality in Econometrics (3) Alessio Moneta Max Planck Institute of Economics Jena moneta@econ.mpg.de 26 April 2011 GSBC Lecture Friedrich-Schiller-Universität Jena

More information

Causal Models with Hidden Variables

Causal Models with Hidden Variables Causal Models with Hidden Variables Robin J. Evans www.stats.ox.ac.uk/ evans Department of Statistics, University of Oxford Quantum Networks, Oxford August 2017 1 / 44 Correlation does not imply causation

More information

Carnegie Mellon Pittsburgh, Pennsylvania 15213

Carnegie Mellon Pittsburgh, Pennsylvania 15213 A Tutorial On Causal Inference Peter Spirtes August 4, 2009 Technical Report No. CMU-PHIL-183 Philosophy Methodology Logic Carnegie Mellon Pittsburgh, Pennsylvania 15213 1. Introduction A Tutorial On Causal

More information

Introduction to Probabilistic Graphical Models

Introduction to Probabilistic Graphical Models Introduction to Probabilistic Graphical Models Sargur Srihari srihari@cedar.buffalo.edu 1 Topics 1. What are probabilistic graphical models (PGMs) 2. Use of PGMs Engineering and AI 3. Directionality in

More information

The intersection axiom of

The intersection axiom of The intersection axiom of conditional independence: some new [?] results Richard D. Gill Mathematical Institute, University Leiden This version: 26 March, 2019 (X Y Z) & (X Z Y) X (Y, Z) Presented at Algebraic

More information

LEARNING CAUSAL EFFECTS: BRIDGING INSTRUMENTS AND BACKDOORS

LEARNING CAUSAL EFFECTS: BRIDGING INSTRUMENTS AND BACKDOORS LEARNING CAUSAL EFFECTS: BRIDGING INSTRUMENTS AND BACKDOORS Ricardo Silva Department of Statistical Science and Centre for Computational Statistics and Machine Learning, UCL Fellow, Alan Turing Institute

More information

Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

Causal Discovery in the Presence of Measurement Error: Identifiability Conditions Causal Discovery in the Presence of Measurement Error: Identifiability Conditions Kun Zhang,MingmingGong?,JosephRamsey,KayhanBatmanghelich?, Peter Spirtes, Clark Glymour Department of philosophy, Carnegie

More information

Causal Inference & Reasoning with Causal Bayesian Networks

Causal Inference & Reasoning with Causal Bayesian Networks Causal Inference & Reasoning with Causal Bayesian Networks Neyman-Rubin Framework Potential Outcome Framework: for each unit k and each treatment i, there is a potential outcome on an attribute U, U ik,

More information

Towards Association Rules with Hidden Variables

Towards Association Rules with Hidden Variables Towards Association Rules with Hidden Variables Ricardo Silva and Richard Scheines Gatsby Unit, University College London, WCN AR London, UK Machine Learning Department, Carnegie Mellon, Pittsburgh PA,

More information

Latent Variable and Network Model Implications for Partial Correlation Structures. Riet van Bork University of Amsterdam

Latent Variable and Network Model Implications for Partial Correlation Structures. Riet van Bork University of Amsterdam Latent Variable and Network Model Implications for Partial Correlation Structures Riet van Bork University of Amsterdam The latent variable model They share variance manifest variable 1 manifest variable

More information

Causal Reasoning with Ancestral Graphs

Causal Reasoning with Ancestral Graphs Journal of Machine Learning Research 9 (2008) 1437-1474 Submitted 6/07; Revised 2/08; Published 7/08 Causal Reasoning with Ancestral Graphs Jiji Zhang Division of the Humanities and Social Sciences California

More information

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea earl Department of Computer Science UCLA OUTLINE Statistical vs. Causal Modeling: distinction and mental barriers N-R vs. structural model: strengths

More information

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables N-1 Experiments Suffice to Determine the Causal Relations Among N Variables Frederick Eberhardt Clark Glymour 1 Richard Scheines Carnegie Mellon University Abstract By combining experimental interventions

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Equivalence in Non-Recursive Structural Equation Models

Equivalence in Non-Recursive Structural Equation Models Equivalence in Non-Recursive Structural Equation Models Thomas Richardson 1 Philosophy Department, Carnegie-Mellon University Pittsburgh, P 15213, US thomas.richardson@andrew.cmu.edu Introduction In the

More information

CHAPTER 3 BOOLEAN ALGEBRA

CHAPTER 3 BOOLEAN ALGEBRA CHAPTER 3 BOOLEAN ALGEBRA (continued) This chapter in the book includes: Objectives Study Guide 3.1 Multiplying Out and Factoring Expressions 3.2 Exclusive-OR and Equivalence Operations 3.3 The Consensus

More information

A DARK GREY P O N T, with a Switch Tail, and a small Star on the Forehead. Any

A DARK GREY P O N T, with a Switch Tail, and a small Star on the Forehead. Any Y Y Y X X «/ YY Y Y ««Y x ) & \ & & } # Y \#$& / Y Y X» \\ / X X X x & Y Y X «q «z \x» = q Y # % \ & [ & Z \ & { + % ) / / «q zy» / & / / / & x x X / % % ) Y x X Y $ Z % Y Y x x } / % «] «] # z» & Y X»

More information

BN Semantics 3 Now it s personal!

BN Semantics 3 Now it s personal! Readings: K&F: 3.3, 3.4 BN Semantics 3 Now it s personal! Graphical Models 10708 Carlos Guestrin Carnegie Mellon University September 22 nd, 2008 10-708 Carlos Guestrin 2006-2008 1 Independencies encoded

More information

STAT 518 Intro Student Presentation

STAT 518 Intro Student Presentation STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible

More information

Learning causal network structure from multiple (in)dependence models

Learning causal network structure from multiple (in)dependence models Learning causal network structure from multiple (in)dependence models Tom Claassen Radboud University, Nijmegen tomc@cs.ru.nl Abstract Tom Heskes Radboud University, Nijmegen tomh@cs.ru.nl We tackle the

More information

Learning Instrumental Variables with Structural and Non-Gaussianity Assumptions

Learning Instrumental Variables with Structural and Non-Gaussianity Assumptions Journal of Machine Learning Research 18 (2017) 1-49 Submitted 1/17; Revised 8/17; Published 11/17 Learning Instrumental Variables with Structural and Non-Gaussianity Assumptions Ricardo Silva ricardo@stats.ucl.ac.u

More information

Acyclic Linear SEMs Obey the Nested Markov Property

Acyclic Linear SEMs Obey the Nested Markov Property Acyclic Linear SEMs Obey the Nested Markov Property Ilya Shpitser Department of Computer Science Johns Hopkins University ilyas@cs.jhu.edu Robin J. Evans Department of Statistics University of Oxford evans@stats.ox.ac.uk

More information

The Specification of Causal Models with Tetrad IV: A Review

The Specification of Causal Models with Tetrad IV: A Review Structural Equation Modeling, 17:703 711, 2010 Copyright Taylor & Francis Group, LLC ISSN: 1070-5511 print/1532-8007 online DOI: 10.1080/10705511.2010.510074 SOFTWARE REVIEW The Specification of Causal

More information

Machine Learning Tom M. Mitchell Machine Learning Department Carnegie Mellon University. September 20, 2012

Machine Learning Tom M. Mitchell Machine Learning Department Carnegie Mellon University. September 20, 2012 Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University September 20, 2012 Today: Logistic regression Generative/Discriminative classifiers Readings: (see class website)

More information

Review: Directed Models (Bayes Nets)

Review: Directed Models (Bayes Nets) X Review: Directed Models (Bayes Nets) Lecture 3: Undirected Graphical Models Sam Roweis January 2, 24 Semantics: x y z if z d-separates x and y d-separation: z d-separates x from y if along every undirected

More information

E&CE 223 Digital Circuits & Systems. Lecture Transparencies (Boolean Algebra & Logic Gates) M. Sachdev

E&CE 223 Digital Circuits & Systems. Lecture Transparencies (Boolean Algebra & Logic Gates) M. Sachdev E&CE 223 Digital Circuits & Systems Lecture Transparencies (Boolean Algebra & Logic Gates) M. Sachdev 4 of 92 Section 2: Boolean Algebra & Logic Gates Major topics Boolean algebra NAND & NOR gates Boolean

More information

Arrowhead completeness from minimal conditional independencies

Arrowhead completeness from minimal conditional independencies Arrowhead completeness from minimal conditional independencies Tom Claassen, Tom Heskes Radboud University Nijmegen The Netherlands {tomc,tomh}@cs.ru.nl Abstract We present two inference rules, based on

More information

The TETRAD Project: Constraint Based Aids to Causal Model Specification

The TETRAD Project: Constraint Based Aids to Causal Model Specification The TETRAD Project: Constraint Based Aids to Causal Model Specification Richard Scheines, Peter Spirtes, Clark Glymour, Christopher Meek and Thomas Richardson Department of Philosophy, Carnegie Mellon

More information

E&CE 223 Digital Circuits & Systems. Lecture Transparencies (Boolean Algebra & Logic Gates) M. Sachdev. Section 2: Boolean Algebra & Logic Gates

E&CE 223 Digital Circuits & Systems. Lecture Transparencies (Boolean Algebra & Logic Gates) M. Sachdev. Section 2: Boolean Algebra & Logic Gates Digital Circuits & Systems Lecture Transparencies (Boolean lgebra & Logic Gates) M. Sachdev 4 of 92 Section 2: Boolean lgebra & Logic Gates Major topics Boolean algebra NND & NOR gates Boolean algebra

More information

Linear regression methods

Linear regression methods Linear regression methods Most of our intuition about statistical methods stem from linear regression. For observations i = 1,..., n, the model is Y i = p X ij β j + ε i, j=1 where Y i is the response

More information

Learning Measurement Models for Unobserved Variables

Learning Measurement Models for Unobserved Variables UAI2003 SILVA ET AL. 543 Learning Measurement Models for Unobserved Variables Ricardo Silva, Richard Scheines, Clark Glymour and Peter Spirtes* Center for Automated Learning and Discovery Carnegie Mellon

More information

Causal Discovery by Computer

Causal Discovery by Computer Causal Discovery by Computer Clark Glymour Carnegie Mellon University 1 Outline 1. A century of mistakes about causation and discovery: 1. Fisher 2. Yule 3. Spearman/Thurstone 2. Search for causes is statistical

More information

A MCMC Approach for Learning the Structure of Gaussian Acyclic Directed Mixed Graphs

A MCMC Approach for Learning the Structure of Gaussian Acyclic Directed Mixed Graphs A MCMC Approach for Learning the Structure of Gaussian Acyclic Directed Mixed Graphs Ricardo Silva Abstract Graphical models are widely used to encode conditional independence constraints and causal assumptions,

More information

Causal Discovery with Linear Non-Gaussian Models under Measurement Error: Structural Identifiability Results

Causal Discovery with Linear Non-Gaussian Models under Measurement Error: Structural Identifiability Results Causal Discovery with Linear Non-Gaussian Models under Measurement Error: Structural Identifiability Results Kun Zhang,MingmingGong?, Joseph Ramsey,KayhanBatmanghelich?, Peter Spirtes,ClarkGlymour Department

More information

SEM with observed variables: parameterization and identification. Psychology 588: Covariance structure and factor models

SEM with observed variables: parameterization and identification. Psychology 588: Covariance structure and factor models SEM with observed variables: parameterization and identification Psychology 588: Covariance structure and factor models Limitations of SEM as a causal modeling 2 If an SEM model reflects the reality, the

More information

Causal Inference in the Presence of Latent Variables and Selection Bias

Causal Inference in the Presence of Latent Variables and Selection Bias 499 Causal Inference in the Presence of Latent Variables and Selection Bias Peter Spirtes, Christopher Meek, and Thomas Richardson Department of Philosophy Carnegie Mellon University Pittsburgh, P 15213

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea)

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

Estimation of linear non-gaussian acyclic models for latent factors

Estimation of linear non-gaussian acyclic models for latent factors Estimation of linear non-gaussian acyclic models for latent factors Shohei Shimizu a Patrik O. Hoyer b Aapo Hyvärinen b,c a The Institute of Scientific and Industrial Research, Osaka University Mihogaoka

More information

Consequences of measurement error. Psychology 588: Covariance structure and factor models

Consequences of measurement error. Psychology 588: Covariance structure and factor models Consequences of measurement error Psychology 588: Covariance structure and factor models Scaling indeterminacy of latent variables Scale of a latent variable is arbitrary and determined by a convention

More information

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism

More information

Multivariate Gaussians. Sargur Srihari

Multivariate Gaussians. Sargur Srihari Multivariate Gaussians Sargur srihari@cedar.buffalo.edu 1 Topics 1. Multivariate Gaussian: Basic Parameterization 2. Covariance and Information Form 3. Operations on Gaussians 4. Independencies in Gaussians

More information

Machine Learning

Machine Learning Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University February 2, 2015 Today: Logistic regression Generative/Discriminative classifiers Readings: (see class website)

More information

Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges

Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges Supplementary material to Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges 1 PRELIMINARIES Two vertices X i and X j are adjacent if there is an edge between them. A path

More information

Tutorial: Gaussian conditional independence and graphical models. Thomas Kahle Otto-von-Guericke Universität Magdeburg

Tutorial: Gaussian conditional independence and graphical models. Thomas Kahle Otto-von-Guericke Universität Magdeburg Tutorial: Gaussian conditional independence and graphical models Thomas Kahle Otto-von-Guericke Universität Magdeburg The central dogma of algebraic statistics Statistical models are varieties The central

More information

Why write proofs? Why not just test and repeat enough examples to confirm a theory?

Why write proofs? Why not just test and repeat enough examples to confirm a theory? P R E F A C E T O T H E S T U D E N T Welcome to the study of mathematical reasoning. The authors know that many students approach this material with some apprehension and uncertainty. Some students feel

More information

LOWELL JOURNAL. MUST APOLOGIZE. such communication with the shore as Is m i Boimhle, noewwary and proper for the comfort

LOWELL JOURNAL. MUST APOLOGIZE. such communication with the shore as Is m i Boimhle, noewwary and proper for the comfort - 7 7 Z 8 q ) V x - X > q - < Y Y X V - z - - - - V - V - q \ - q q < -- V - - - x - - V q > x - x q - x q - x - - - 7 -» - - - - 6 q x - > - - x - - - x- - - q q - V - x - - ( Y q Y7 - >»> - x Y - ] [

More information

Two Posts to Fill On School Board

Two Posts to Fill On School Board Y Y 9 86 4 4 qz 86 x : ( ) z 7 854 Y x 4 z z x x 4 87 88 Y 5 x q x 8 Y 8 x x : 6 ; : 5 x ; 4 ( z ; ( ) ) x ; z 94 ; x 3 3 3 5 94 ; ; ; ; 3 x : 5 89 q ; ; x ; x ; ; x : ; ; ; ; ; ; 87 47% : () : / : 83

More information

Based on slides by Richard Zemel

Based on slides by Richard Zemel CSC 412/2506 Winter 2018 Probabilistic Learning and Reasoning Lecture 3: Directed Graphical Models and Latent Variables Based on slides by Richard Zemel Learning outcomes What aspects of a model can we

More information

COMP538: Introduction to Bayesian Networks

COMP538: Introduction to Bayesian Networks COMP538: Introduction to Bayesian Networks Lecture 9: Optimal Structure Learning Nevin L. Zhang lzhang@cse.ust.hk Department of Computer Science and Engineering Hong Kong University of Science and Technology

More information

Machine Learning Gaussian Naïve Bayes Big Picture

Machine Learning Gaussian Naïve Bayes Big Picture Machine Learning 10-701 Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 27, 2011 Today: Naïve Bayes Big Picture Logistic regression Gradient ascent Generative discriminative

More information

Course 2BA1: Hilary Term 2007 Section 8: Quaternions and Rotations

Course 2BA1: Hilary Term 2007 Section 8: Quaternions and Rotations Course BA1: Hilary Term 007 Section 8: Quaternions and Rotations David R. Wilkins Copyright c David R. Wilkins 005 Contents 8 Quaternions and Rotations 1 8.1 Quaternions............................ 1 8.

More information

ECON 3150/4150, Spring term Lecture 7

ECON 3150/4150, Spring term Lecture 7 ECON 3150/4150, Spring term 2014. Lecture 7 The multivariate regression model (I) Ragnar Nymoen University of Oslo 4 February 2014 1 / 23 References to Lecture 7 and 8 SW Ch. 6 BN Kap 7.1-7.8 2 / 23 Omitted

More information

High-dimensional regression modeling

High-dimensional regression modeling High-dimensional regression modeling David Causeur Department of Statistics and Computer Science Agrocampus Ouest IRMAR CNRS UMR 6625 http://www.agrocampus-ouest.fr/math/causeur/ Course objectives Making

More information

Strategies for Discovering Mechanisms of Mind using fmri: 6 NUMBERS. Joseph Ramsey, Ruben Sanchez Romero and Clark Glymour

Strategies for Discovering Mechanisms of Mind using fmri: 6 NUMBERS. Joseph Ramsey, Ruben Sanchez Romero and Clark Glymour 1 Strategies for Discovering Mechanisms of Mind using fmri: 6 NUMBERS Joseph Ramsey, Ruben Sanchez Romero and Clark Glymour 2 The Numbers 20 50 5024 9205 8388 500 3 fmri and Mechanism From indirect signals

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a preprint version which may differ from the publisher's version. For additional information about this

More information

arxiv: v1 [stat.ml] 9 Nov 2015

arxiv: v1 [stat.ml] 9 Nov 2015 Learning Instrumental Variables with Non-Gaussianity Assumptions: Theoretical Limitations and Practical Algorithms arxiv:1511.02722v1 [stat.ml] 9 Nov 2015 Ricardo Silva Department of Statistical Science

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

Probabilistic Graphical Models. Guest Lecture by Narges Razavian Machine Learning Class April

Probabilistic Graphical Models. Guest Lecture by Narges Razavian Machine Learning Class April Probabilistic Graphical Models Guest Lecture by Narges Razavian Machine Learning Class April 14 2017 Today What is probabilistic graphical model and why it is useful? Bayesian Networks Basic Inference

More information

Correlation: Relationships between Variables

Correlation: Relationships between Variables Correlation Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means However, researchers are

More information

Towards an extension of the PC algorithm to local context-specific independencies detection

Towards an extension of the PC algorithm to local context-specific independencies detection Towards an extension of the PC algorithm to local context-specific independencies detection Feb-09-2016 Outline Background: Bayesian Networks The PC algorithm Context-specific independence: from DAGs to

More information

Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables

Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables Robert E. Tillman Carnegie Mellon University Pittsburgh, PA rtillman@cmu.edu

More information

Signals and Systems Digital Logic System

Signals and Systems Digital Logic System Signals and Systems Digital Logic System Prof. Wonhee Kim Chapter 2 Design Process for Combinational Systems Step 1: Represent each of the inputs and outputs in binary Step 1.5: If necessary, break the

More information

Unit 2 Boolean Algebra

Unit 2 Boolean Algebra Unit 2 Boolean Algebra 1. Developed by George Boole in 1847 2. Applied to the Design of Switching Circuit by Claude Shannon in 1939 Department of Communication Engineering, NCTU 1 2.1 Basic Operations

More information

CORRELATION. compiled by Dr Kunal Pathak

CORRELATION. compiled by Dr Kunal Pathak CORRELATION compiled by Dr Kunal Pathak Flow of Presentation Definition Types of correlation Method of studying correlation a) Scatter diagram b) Karl Pearson s coefficient of correlation c) Spearman s

More information

ECE 238L Boolean Algebra - Part I

ECE 238L Boolean Algebra - Part I ECE 238L Boolean Algebra - Part I August 29, 2008 Typeset by FoilTEX Understand basic Boolean Algebra Boolean Algebra Objectives Relate Boolean Algebra to Logic Networks Prove Laws using Truth Tables Understand

More information

Bayesian Networks Representation

Bayesian Networks Representation Bayesian Networks Representation Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University March 19 th, 2007 Handwriting recognition Character recognition, e.g., kernel SVMs a c z rr r r

More information

Cross-Validation with Confidence

Cross-Validation with Confidence Cross-Validation with Confidence Jing Lei Department of Statistics, Carnegie Mellon University UMN Statistics Seminar, Mar 30, 2017 Overview Parameter est. Model selection Point est. MLE, M-est.,... Cross-validation

More information

Pairwise Cluster Comparison for Learning Latent Variable Models

Pairwise Cluster Comparison for Learning Latent Variable Models Pairwise Cluster Comparison for Learning Latent Variable Models Nuaman Asbeh Industrial Engineering and Management Ben-Gurion University of the Negev Beer-Sheva, 84105, Israel Boaz Lerner Industrial Engineering

More information

Machine Learning

Machine Learning Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University February 4, 2015 Today: Generative discriminative classifiers Linear regression Decomposition of error into

More information

CHAPTER 2 BOOLEAN ALGEBRA

CHAPTER 2 BOOLEAN ALGEBRA CHAPTER 2 BOOLEAN ALGEBRA This chapter in the book includes: Objectives Study Guide 2.1 Introduction 2.2 Basic Operations 2.3 Boolean Expressions and Truth Tables 2.4 Basic Theorems 2.5 Commutative, Associative,

More information

Chapter 2 Combinational Logic Circuits

Chapter 2 Combinational Logic Circuits Logic and Computer Design Fundamentals Chapter 2 Combinational Logic Circuits Part 1 Gate Circuits and Boolean Equations Chapter 2 - Part 1 2 Chapter 2 - Part 1 3 Chapter 2 - Part 1 4 Chapter 2 - Part

More information

Key Concepts. Correlation (Pearson & Spearman) & Linear Regression. Assumptions. Correlation parametric & non-para. Correlation

Key Concepts. Correlation (Pearson & Spearman) & Linear Regression. Assumptions. Correlation parametric & non-para. Correlation Correlation (Pearson & Spearman) & Linear Regression Azmi Mohd Tamil Key Concepts Correlation as a statistic Positive and Negative Bivariate Correlation Range Effects Outliers Regression & Prediction Directionality

More information

arxiv: v3 [stat.me] 3 Jun 2015

arxiv: v3 [stat.me] 3 Jun 2015 The Annals of Statistics 2015, Vol. 43, No. 3, 1060 1088 DOI: 10.1214/14-AOS1295 c Institute of Mathematical Statistics, 2015 A GENERALIZED BACK-DOOR CRITERION 1 arxiv:1307.5636v3 [stat.me] 3 Jun 2015

More information

Machine Learning Srihari. Probability Theory. Sargur N. Srihari

Machine Learning Srihari. Probability Theory. Sargur N. Srihari Probability Theory Sargur N. Srihari srihari@cedar.buffalo.edu 1 Probability Theory with Several Variables Key concept is dealing with uncertainty Due to noise and finite data sets Framework for quantification

More information

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46 BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics

More information

arxiv: v2 [stat.ml] 21 Sep 2009

arxiv: v2 [stat.ml] 21 Sep 2009 Submitted to the Annals of Statistics TREK SEPARATION FOR GAUSSIAN GRAPHICAL MODELS arxiv:0812.1938v2 [stat.ml] 21 Sep 2009 By Seth Sullivant, and Kelli Talaska, and Jan Draisma, North Carolina State University,

More information

PMR Learning as Inference

PMR Learning as Inference Outline PMR Learning as Inference Probabilistic Modelling and Reasoning Amos Storkey Modelling 2 The Exponential Family 3 Bayesian Sets School of Informatics, University of Edinburgh Amos Storkey PMR Learning

More information

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline. Practitioner Course: Portfolio Optimization September 10, 2008 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y ) (x,

More information

Bayesian Networks Structure Learning (cont.)

Bayesian Networks Structure Learning (cont.) Koller & Friedman Chapters (handed out): Chapter 11 (short) Chapter 1: 1.1, 1., 1.3 (covered in the beginning of semester) 1.4 (Learning parameters for BNs) Chapter 13: 13.1, 13.3.1, 13.4.1, 13.4.3 (basic

More information

Learning Marginal AMP Chain Graphs under Faithfulness

Learning Marginal AMP Chain Graphs under Faithfulness Learning Marginal AMP Chain Graphs under Faithfulness Jose M. Peña ADIT, IDA, Linköping University, SE-58183 Linköping, Sweden jose.m.pena@liu.se Abstract. Marginal AMP chain graphs are a recently introduced

More information

Directed and Undirected Graphical Models

Directed and Undirected Graphical Models Directed and Undirected Davide Bacciu Dipartimento di Informatica Università di Pisa bacciu@di.unipi.it Machine Learning: Neural Networks and Advanced Models (AA2) Last Lecture Refresher Lecture Plan Directed

More information

Digital Circuit And Logic Design I. Lecture 3

Digital Circuit And Logic Design I. Lecture 3 Digital Circuit And Logic Design I Lecture 3 Outline Combinational Logic Design Principles (). Introduction 2. Switching algebra 3. Combinational-circuit analysis 4. Combinational-circuit synthesis Panupong

More information

Learning Semi-Markovian Causal Models using Experiments

Learning Semi-Markovian Causal Models using Experiments Learning Semi-Markovian Causal Models using Experiments Stijn Meganck 1, Sam Maes 2, Philippe Leray 2 and Bernard Manderick 1 1 CoMo Vrije Universiteit Brussel Brussels, Belgium 2 LITIS INSA Rouen St.

More information

Correlation. Quantitative Aptitude & Business Statistics

Correlation. Quantitative Aptitude & Business Statistics Correlation Statistics Correlation Correlation is the relationship that exists between two or more variables. If two variables are related to each other in such a way that change increases a corresponding

More information

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011 Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011 Outline Ordinary Least Squares (OLS) Regression Generalized Linear Models

More information

Inferring the Causal Decomposition under the Presence of Deterministic Relations.

Inferring the Causal Decomposition under the Presence of Deterministic Relations. Inferring the Causal Decomposition under the Presence of Deterministic Relations. Jan Lemeire 1,2, Stijn Meganck 1,2, Francesco Cartella 1, Tingting Liu 1 and Alexander Statnikov 3 1-ETRO Department, Vrije

More information

Machine Learning

Machine Learning Machine Learning 10-701 Tom M. Mitchell Machine Learning Department Carnegie Mellon University February 1, 2011 Today: Generative discriminative classifiers Linear regression Decomposition of error into

More information