Linear Non-Gaussian Structural Equation Models

Similar documents
Data Assimilation 1. Alan O Neill National Centre for Earth Observation UK

Applied Statistics II - Categorical Data Analysis Data analysis using Genstat - Exercise 2 Logistic regression

Strongly Connected Components

Chapter 13 GMM for Linear Factor Models in Discount Factor form. GMM on the pricing errors gives a crosssectional

Addition of angular momentum

EXST Regression Techniques Page 1

Basic Polyhedral theory

Estimation of apparent fraction defective: A mathematical approach

Higher order derivatives

Addition of angular momentum

Outline. Image processing includes. Edge detection. Advanced Multimedia Signal Processing #8:Image Processing 2 processing

What are those βs anyway? Understanding Design Matrix & Odds ratios

Dealing with quantitative data and problem solving life is a story problem! Attacking Quantitative Problems

Supplementary Materials

u r du = ur+1 r + 1 du = ln u + C u sin u du = cos u + C cos u du = sin u + C sec u tan u du = sec u + C e u du = e u + C

u x v x dx u x v x v x u x dx d u x v x u x v x dx u x v x dx Integration by Parts Formula

Estimation of odds ratios in Logistic Regression models under different parameterizations and Design matrices

Self-Adjointness and Its Relationship to Quantum Mechanics. Ronald I. Frank 2016

This test is for two independent Populations. The test is sometimes called the Mann-Whitney U test or the Rank Sum Wilcoxon. They are equivalent.

Finite element discretization of Laplace and Poisson equations

Propositional Logic. Combinatorial Problem Solving (CPS) Albert Oliveras Enric Rodríguez-Carbonell. May 17, 2018

Discovery of Linear Acyclic Models Using Independent Component Analysis

NEW APPLICATIONS OF THE ABEL-LIOUVILLE FORMULA

10. The Discrete-Time Fourier Transform (DTFT)

Network Congestion Games

Solution of Assignment #2

ARIMA Methods of Detecting Outliers in Time Series Periodic Processes

AS 5850 Finite Element Analysis

Status of LAr TPC R&D (2) 2014/Dec./23 Neutrino frontier workshop 2014 Ryosuke Sasaki (Iwate U.)

Computing and Communications -- Network Coding

Estimation of linear non-gaussian acyclic models for latent factors

CS 361 Meeting 12 10/3/18

Recursive Estimation of Dynamic Time-Varying Demand Models

Differential Equations

Difference -Analytical Method of The One-Dimensional Convection-Diffusion Equation

MCE503: Modeling and Simulation of Mechatronic Systems Discussion on Bond Graph Sign Conventions for Electrical Systems

ECE 650 1/8. Homework Set 4 - Solutions

Derangements and Applications

Random Process Part 1

COHORT MBA. Exponential function. MATH review (part2) by Lucian Mitroiu. The LOG and EXP functions. Properties: e e. lim.

Review Statistics review 14: Logistic regression Viv Bewick 1, Liz Cheek 1 and Jonathan Ball 2

Where k is either given or determined from the data and c is an arbitrary constant.

ME 321 Kinematics and Dynamics of Machines S. Lambert Winter 2002

SCHUR S THEOREM REU SUMMER 2005

Text: WMM, Chapter 5. Sections , ,

1 Minimum Cut Problem

What is the product of an integer multiplied by zero? and divided by zero?

Slide 1. Slide 2. Slide 3 DIGITAL SIGNAL PROCESSING CLASSIFICATION OF SIGNALS

Math 34A. Final Review

Thus, because if either [G : H] or [H : K] is infinite, then [G : K] is infinite, then [G : K] = [G : H][H : K] for all infinite cases.

Transitional Probability Model for a Serial Phases in Production

Sara Godoy del Olmo Calculation of contaminated soil volumes : Geostatistics applied to a hydrocarbons spill Lac Megantic Case

Function Spaces. a x 3. (Letting x = 1 =)) a(0) + b + c (1) = 0. Row reducing the matrix. b 1. e 4 3. e 9. >: (x = 1 =)) a(0) + b + c (1) = 0

2008 AP Calculus BC Multiple Choice Exam

Inference Methods for Stochastic Volatility Models

Square of Hamilton cycle in a random graph

The graph of y = x (or y = ) consists of two branches, As x 0, y + ; as x 0, y +. x = 0 is the

Multiple-Choice Test Introduction to Partial Differential Equations COMPLETE SOLUTION SET

DIFFERENTIAL EQUATION

Note If the candidate believes that e x = 0 solves to x = 0 or gives an extra solution of x = 0, then withhold the final accuracy mark.

ECE602 Exam 1 April 5, You must show ALL of your work for full credit.

Panel Data Analysis Introduction

4.2 Design of Sections for Flexure

3-2-1 ANN Architecture

Observer Bias and Reliability By Xunchi Pu

Rational Approximation for the one-dimensional Bratu Equation

Bifurcation Theory. , a stationary point, depends on the value of α. At certain values

That is, we start with a general matrix: And end with a simpler matrix:

2. Laser physics - basics

Numbering Systems Basic Building Blocks Scaling and Round-off Noise. Number Representation. Floating vs. Fixed point. DSP Design.

Week 3: Connected Subgraphs

ANALYSIS IN THE FREQUENCY DOMAIN

SUMMER 17 EXAMINATION

Two Products Manufacturer s Production Decisions with Carbon Constraint

Principles of Humidity Dalton s law

ON RIGHT(LEFT) DUO PO-SEMIGROUPS. S. K. Lee and K. Y. Park

ME311 Machine Design

dx equation it is called a second order differential equation.

Generalized Estimating Equations for Zero-Inflated Spatial Count Data

perm4 A cnt 0 for for if A i 1 A i cnt cnt 1 cnt i j. j k. k l. i k. j l. i l

The Equitable Dominating Graph

Section 6.1. Question: 2. Let H be a subgroup of a group G. Then H operates on G by left multiplication. Describe the orbits for this operation.

First derivative analysis


Finite Element Analysis

L 1 = L G 1 F-matrix: too many F ij s even at quadratic-only level

Section 11.6: Directional Derivatives and the Gradient Vector

Calculus II (MAC )

10. Limits involving infinity

Fourier Transforms and the Wave Equation. Key Mathematics: More Fourier transform theory, especially as applied to solving the wave equation.

MCB137: Physical Biology of the Cell Spring 2017 Homework 6: Ligand binding and the MWC model of allostery (Due 3/23/17)

CS 6353 Compiler Construction, Homework #1. 1. Write regular expressions for the following informally described languages:

Construction of asymmetric orthogonal arrays of strength three via a replacement method

MATHEMATICS (B) 2 log (D) ( 1) = where z =

Einstein Equations for Tetrad Fields

4037 ADDITIONAL MATHEMATICS

Pair (and Triplet) Production Effect:

CPSC 665 : An Algorithmist s Toolkit Lecture 4 : 21 Jan Linear Programming

INC 693, 481 Dynamics System and Modelling: The Language of Bound Graphs Dr.-Ing. Sudchai Boonto Assistant Professor

CE 530 Molecular Simulation

Transcription:

IMPS 8, Durham, NH Linar Non-Gaussian Structural Equation Modls Shohi Shimizu, Patrik Hoyr and Aapo Hyvarinn Osaka Univrsity, Japan Univrsity of Hlsinki, Finland

Abstract Linar Structural Equation Modling (linar SEM) Analyzs causal rlations Covarianc-basd SEM Uss covarianc structur alon for modl idntification A numbr of indistinguishabl modls Linar non-gaussian SEM Uss non-gaussian structurs for modl idntification Maks many modls distinguishabl

SEM and causal analysis SEM is oftn usd for causal analysis basd on non-primntal data Assumption: th data gnrating procss is rprsntd by a SEM modl If th assumption is rasonabl, SEM provids causal information 3

Limitations of covarianc-basd SEM Covarianc-basd SEM cannot distinguish btwn many modls Eampl 4

Linar non-gaussian SEM Many obsrvd data ar considrably non- Gaussian (Miccri, 989; Hyvarinn t al. ) Non-Gaussian structurs of data ar usful (Bntlr 983; Mooijaart 985) Non-Gaussianity distinguish btwn th two modls (Shimizu t al. 6) : 5

Indpndnt componnt analysis (ICA) Obsrvd random vctor is modld as As s i ar indpndnt and non-gaussian Zro mans and unit variancs A is a constant matri Typically squar, # variabls # indpndnt componnts Idntifiabl up to prmutation of th columns (Mooijaart 985; Comon, 994) 6

ICA stimation An altrnativ prssion of ICA (As): ~ whr W A s calld a ~, W rcovring Find such W that maimizs indpndnc of componnts of s ˆ W Many proposals (Hyvarinn t al. ) matri W ~ is stimatd up to prmutation of th rows: W PW ~ 7

ICA stimation An altrnativ prssion of ICA (As): ~ whr W A s calld a ~, W rcovring Find such W that maimizs indpndnc of componnts of s ˆ W Many proposals (Hyvarinn t al. ) matri W ~ is stimatd up to prmutation of th rows: W PW ~ 8

ICA stimation An altrnativ prssion of ICA (As): ~ whr W A s calld a ~, W rcovring Find such W that maimizs indpndnc of componnts of s ˆ W Many proposals (Hyvarinn t al. ) matri W ~ is stimatd up to prmutation of th rows: W PW ~ 9

Discovry of linar non-gaussian acyclic modls Shimizu, Hoyr, Hyvarinn and Krminn (6)

Linar non-gaussian acyclic modl (LiNGAM) Dirctd acyclic graphs (DAG) i can b arrangd in a ordr k(i) Assumptions: Linarity Etrnal influncs ar indpndnt and ar non-gaussian i i b j + k ( ij j) < k ( i) i or B +

Goal W know Data X is gnratd by W do NOT know Path cofficints: bij Ordrs k(i) Etrnal influncs: i B + What w obsrv is data X only Goal Estimat B and k(i) using data X only!

Ky ida First, rlat LiNGAM with ICA as follows: B + ( I B) A - ICA! ~ quivalntly ( I B) W Du to th prmutation indtrminacy, ICA givs: W PW ~ Can find a corrct P Th corrct prmutation is th only on that has no zros in th diagonal 3

Ky ida First, rlat LiNGAM with ICA as follows: B + ( I B) A - ICA! ~ quivalntly ( I B) W Du to th prmutation indtrminacy, ICA givs: W PW ~ Can find a corrct P Th corrct prmutation is th only on that has no zros in th diagonal 4

Ky ida First, rlat LiNGAM with ICA as follows: B + ( I B) A - ICA! ~ quivalntly ( I B) W Du to th prmutation indtrminacy, ICA givs: W PW ~ Can find a corrct P Th corrct prmutation is th only on that has no zros in th diagonal 5

Ky ida First, rlat LiNGAM with ICA as follows: B + ( I B) A - ICA! ~ quivalntly ( I B) W Du to th prmutation indtrminacy, ICA givs: W PW ~ Can find th corrct P Th corrct prmutation is th only on that has no zros in th diagonal 6

Illustrativ ampl Considr th modl:.6 443 B +.6 Goal Estimat th path dirction btwn and obsrving only and 7

Prform ICA Rlation of th LiNGAM modl with ICA: ~ W.6 443 W ~ Du to th prmutation indtrminacy, ICA might giv: ( ~ ) W PW.8 8

Prform ICA Rlation of th LiNGAM modl with ICA: ~ W.6 443 W ~ Du to th prmutation indtrminacy, ICA might giv: W ( ~ ) PW.6 9

Find th corrct P Find a prmutation of th rows of W so that it has no zros in th diagonal In th ampl.8 443.6 443 Prmut th rows W W ~

Find th corrct P Find a prmutation of th rows of W so that it has no zros in th diagonal In th ampl.8 443.6 443 Prmut th rows W W ~

Find th corrct P Find a prmutation of th rows of W so that it has no zros in th diagonal In th ampl.6 443.6 443 Prmut th rows W W ~

Find th corrct P In practic, Pˆ ma P ( ) P T W ii Havily pnalizs small absolut valus in th diagonal 3

Simulations: Estimation of B Both supr- and sub-gaussian trnal influncs tstd 5 datasts cratd for ach scattrplot B randomly gnratd at ach trial 3 3 3 - - - Numbr of variabls 5 Estimatd bij - -3-3 - - 3 3 - - -3-3 - - 3 3 - -3-3 - - 3 3 - - -3-3 - - 3 3 - -3-3 - - 3 3 - - -3-3 - - 3 3 - - - - - - -3-3 - - 3-3 -3 - - 3-3 -3 - - 3 Gnrating bij, 3, Numbr of obsrvations 4

5 Prun B () In practic, du to stimation rrors, w would gt: Nd to find which path cofficints ar actually zros +.5.65 4 43 4 4 B

Find a prmutation that givs a lowr triangular matri Th LiNGAM modl is acyclic Th matri B can b prmutd to b lowr triangular for som prmutation of variabls (Bolln, 989) First, find a simultanous prmutation of rows and columns of B that givs a lowr-triangular B In practic, find a prmutation matri Q that minimizs th sum of th lmnts in its uppr T Qˆ ma QBQ triangular part: ( ) ij Q i j 6

Find a prmutation that givs a lowr triangular matri Th LiNGAM modl is acyclic Th matri B can b prmutd to b lowr triangular for som prmutation of variabls (Bolln, 989) First, find a simultanous prmutation of rows and columns of B that givs a lowr-triangular B In practic, find a prmutation matri Q that minimizs th sum of th lmnts in its uppr T Qˆ ma QBQ triangular part: ( ) ij Q i j 7

Find a prmutation that givs a lowr triangular matri Th LiNGAM modl is acyclic Th matri B can b prmutd to b lowr triangular for som prmutation of variabls (Bolln, 989) First, find a simultanous prmutation of rows and columns of B that givs a lowr-triangular B In practic, find a prmutation matri Q that minimizs th sum of th lmnts in its uppr triangular part: ( T Qˆ min QBQ ) ij Q i j 8

9 Gt a lowr-triangular B +.5.65 +.6.5 B Applying such a simultanous prmutation of th rows and columns, w gt a prmutd B that is as lowr-triangular as possibl St th uppr-triangular lmnts to b zros T QBQ

3 Gt a lowr-triangular B +.5.65 +.65.5 B Applying such a simultanous prmutation of th rows and columns, w gt a prmutd B that is as lowr-triangular as possibl St th uppr-triangular lmnts to b zros T QBQ

3 Gt a lowr-triangular B +.5.65 +.65.5 B Applying such a simultanous prmutation of th rows and columns, w gt a prmutd B that is as lowr-triangular as possibl St th uppr-triangular lmnts to b zros T QBQ -.5

3 Gt a lowr-triangular B +.5.65 +.65.5 B Applying such a simultanous prmutation of th rows and columns, w gt a prmutd B that is as lowr-triangular as possibl St th uppr-triangular lmnts to b zros T QBQ

33 Pruning B () Onc w gt a lowr-triangular B, th modl is idntifiabl using covarianc-basd SEM Many isting mthods can b usd for pruning th rmaining path cofficints Wald tst, Bootstrapping, Modl fit Lasso-typ stimators (Tibshirani 996; Zou, 6) tc. +.65

To summariz th procdur. Estimat B ICA + finding th corrct row prmutation. Prun stimatd B. Find a row-and-column prmutation that maks stimatd B lowr triangular. Prun rmaining paths using a covarianc-basd mthod. Estimat B. Prun stimatd B 4 4 4 3 3 3 34

Summary of th rgular LiNGAM A linar acyclic modl is idntifiabl basd on non-gaussianity ICA-basd stimation works wll Confidnc intrvals (Konya t al., in progrss) Bttr pruning mthods might b dvlopd Imposing sparsnss in th ICA stag (Zhang & Chang, 6; Hayashi t al. in progrss) lik Lasso (Tibshirani 996) 35

Som tnsions

Latnt factors (Shimizu t al., 7) A non-gaussian multipl indicator modl: f Bf + d Gf + Suppos that G is idntifid, thn B is idntifid Could idntify G in a data drivn way using a ttradconstraint-basd mthod (Silva t al., 6) 37

Latnt classs (Shimizu & Hyvarinn, 8) LiNGAM modl for ach class q: B + ( I B ) ì + ì + A - ICA! q q q q q q q ICA miturs (L t al., ; Mollah t al., 6) Class :.9 Class : 6. 5 38

Unobsrvd confoundrs (Hoyr t al., in prss Can idntify and distinguish btwn mor modls.. 3. 4. 5. 6. u u u 39

Tim structurs (Hyvarinn t al., 8) Combining LiNGAM and autorgrssiv modl: k ( t) B ( t ) + ( t) In conomtrics: Structural vctor autorgrssion (Swanson & Grangr, 997) Changs ordinary AR cofficints basd on instantanous ffcts: B ( I B ) for ( : AR matri) M > M 4

Som variabls ar Gaussian (Hoyr t al., 8) Considr th modl:.6 Can idntify th path dirction if ithr of or is non-gaussian In gnral, thr ist svral quivalnt modls that ntail th sam distribution if som ar Gaussian 4

Som othr tnsions Cyclic modls (Lacrda t al., 8) Fwr quivalnt modls than covariancbasd approach Nonlinarity (Zhang & Chan, 7; Sun t al., 7) Modl fit statistics ar undr dvlopmnt Non-Gaussian structurs 4

Conclusion Us of non-gaussianity in SEM is usful for modl idntification Many obsrvd data ar considrably non-gaussian Th non-gaussian approach can b a good option 43

Most of our paprs and Matlab/Octav cod ar availabl on our wbpags Googl will find us! 44