Causation and Prediction (2007) Neural Connectomics (2014) Pot-luck challenge (2008) Causality challenges Isabelle Guyon

Size: px
Start display at page:

Download "Causation and Prediction (2007) Neural Connectomics (2014) Pot-luck challenge (2008) Causality challenges Isabelle Guyon"

Transcription

1

2 Isabelle Guyon

3 Causation and Prediction (2007) Fast Cause-Effect causation coefficient Pairs (2013) (2014) Pot-luck challenge (2008) Neural Connectomics (2014) Causality challenges Isabelle Guyon

4 Initial impulse: Joris Mooij, Dominik Janzing, and Bernhard Schölkopf, from the Max Planck. Examples of algorithms and data: Povilas Daniušis, Arthur Gretton, Patrik O. Hoyer, Dominik Janzing, Antti Kerminen, Joris Mooij, Jonas Peters, Bernhard Schölkopf, Shohei Shimizu, Oliver Stegle, and Kun Zhang, Jakob Zscheischler. Datasets and result analysis: Isabelle Guyon + Mehreen Saeed + {Mikael Henaff, Sisi Ma, and Alexander Statnikov}, from NYU. Website and sample code: Isabelle Guyon + Phase 1: Ben Hamner (Kaggle) Phase 2: Ivan Judson, Christophe Poulain, Evelyne Viegas, Michael Zyskowski Review, testing: Marc Boullé, Hugo Jair Escalante, Frederick Eberhardt, Seth Flaxman, Patrik Hoyer, Dominik Janzing, Richard Kennaway, Vincent Lemaire, Joris Mooij, Jonas Peters, Florin, Peter Spirtes, Ioannis Tsamardinos, Jianxin Yin, Kun Zhang.

5 Phase 1: 1 st place: Diogo Moitinho de Almeida 2 nd place : José Adrián Rodríguez Fonollosa 3 rd place : Spyridon Samothrakis Phase 2: José Adrián Rodríguez Fonollosa David Lopez-Paz

6 What affects your health? the economy? climate changes? actions

7

8 Causality challenges Isabelle Guyon Thanks to Jonas Peters for this example

9 Causality challenges Isabelle Guyon Thanks to Jonas Peters for this example

10 Causality challenges Isabelle Guyon Thanks to Jonas Peters for this example

11

12 Random i.i.d. samples of A and B A -> B B = f(a, Z) A B A <- C -> B A = f(c, Z 1 ) B = f(c, Z 2 ) B -> A A = f(b, Z) A -> B B = f(a, Z) A B A <- C -> B A = f(c, Z 1 ) B = f(c, Z 2 ) B -> A A = f(b, Z)

13

14

15 Demographics: Sex -> Height Age -> Wages Native country -> Education Latitude -> Infant mortality Ecology: City elevation -> Temperature Water level -> Algal frequency Elevation -> Vegetation type Distance to hydrology -> Fire Econometrics: Mileage -> Car resell price Number of rooms -> House price Trade price last day -> Trade price Medicine: Cancer volume -> Recurrence Metastasis -> Prognosis Age -> Blood pressure Genomics (mrna level): transcription factor -> protein induced A B A B A B A -> B Z Z A A B Z A A <- B 20% 80% Engineering: Car model year -> Horsepower Number of cylinders -> MPG Cache memory -> Compute power Roof area -> Heating load Cement used -> Compressive strength B A <- Z -> B Z Z B

16

17 A B A B

18 Z Z A Z B A B A <- Z -> B

19 B Z A A <- B A Z B A -> B

20 - Causality - Counfounding - Dependency

21 Denis Boigelot, Université libre de Bruxelles, Belgium, Wikipedia Causality challenges Isabelle Guyon

22 Causality challenges Isabelle Guyon

23 Causality challenges Isabelle Guyon

24

25 Correlation: Mutual information: HSIC: Residual: res(b) 2 =(1/n) i (f(a i ) B i ) 2 Coefficient of determination: R B 2 = 1 res(b) 2 / B 2 Independence(Input, Residual): IR B =I(A,res(B))

26

27 A -> B

28 B -> A

29 A <- C -> B A = 2 C+ noisea B = 2 C+ noiseb

30

31

32

33 C(A,B) = cov(a,b)/( A B ) MI(A,B) = H(A) + H(B) - H(A,B) = KL[p(A,B) p(a)p(b)] I(A,B) = pval( C AB HS2 ) res(b) 2 =(1/n) i (f(a i ) B i ) 2 R B 2 = 1 res(b) 2 / B 2 IR B =I(A,res(B))

34

35

36 Restrict setting Study a particular model Propose a statistical test Only continuous variables Non Gaussian noise Non linear, no noise ANM PNL GP Identifiability Consistency, bias, variance, power Computational properties chalearn.org

37 Preprocess data Compute a lot of features Use non-linear classifier (ensemble of decision trees). Standardize Discretize Re-label categorical variables ProtoML: 20,000 (324 after selection) Jarfo: 43 features FirfID: 30 features RF GTB chalearn.org

38 Independence tests: Pearson correlation, Spearman correlation, HSIC, Mutual Information. Curve fitting: Residuals of fits with various models and loss functions; model complexity; independence of residual and input. Discretized conditional distribution: Variance, and other moments and statistics. Information theoretic features: Entropy, KL divergence to Gaussian or Uniform distrib. conditional entropy, IGCI. Data statistics: Num. data points, num. unique values, variable type.

39 CDS(A B) > CDS(B A) no yes H(A) > H(B) H(A) > H(B) no yes no yes Max(H(A), H(B))>2.5 A->B B->A Max(H(A), H(B))>2.5 no yes no yes A->B B->A B->A A->B

40 B A chalearn.org

41

42 C(A,B) = cov(a,b)/( A B ) MI(A,B) = H(A) + H(B) - H(A,B) = KL[p(A,B) p(a)p(b)] I(A,B) = pval( C AB HS2 ) res(b) 2 =(1/n) i (f(a i ) B i ) 2 R B 2 = 1 res(b) 2 / B 2 IR B =I(A,res(B))

43

44 Challenges can be a powerful tool to advance the state of the art, but most data scientists prefer hacking than producing papers. We are preparing a paper in which we will include: - Analysis the features. - Theoretical results. - Test on other data. Our next challenges will address - causality in time series and - entire network reconstruction.

45 Google group: causalitychallenge

46 Save the planet and return your name badge before you leave (on Tuesday) Microsoft Privacy Policy statement applies to all information collected. Read at research.microsoft.com

47

arxiv: v1 [cs.lg] 3 Jan 2017

arxiv: v1 [cs.lg] 3 Jan 2017 Deep Convolutional Neural Networks for Pairwise Causality Karamjit Singh, Garima Gupta, Lovekesh Vig, Gautam Shroff, and Puneet Agarwal TCS Research, New-Delhi, India January 4, 2017 arxiv:1701.00597v1

More information

Causal Inference via Kernel Deviance Measures

Causal Inference via Kernel Deviance Measures Causal Inference via Kernel Deviance Measures Jovana Mitrovic Department of Statistics University of Oxford Dino Sejdinovic Department of Statistics University of Oxford Yee Whye Teh Department of Statistics

More information

MACHINE LEARNING FOR CAUSE-EFFECT PAIRS DETECTION. Mehreen Saeed CLE Seminar 11 February, 2014.

MACHINE LEARNING FOR CAUSE-EFFECT PAIRS DETECTION. Mehreen Saeed CLE Seminar 11 February, 2014. MACHINE LEARNING FOR CAUSE-EFFECT PAIRS DETECTION Mehreen Saeed CLE Seminar 11 February, 214. WHY CAUSALITY. Polio drops can cause polio epidemics (The Nation, January 214) A supernova explosion causes

More information

Deep Convolutional Neural Networks for Pairwise Causality

Deep Convolutional Neural Networks for Pairwise Causality Deep Convolutional Neural Networks for Pairwise Causality Karamjit Singh, Garima Gupta, Lovekesh Vig, Gautam Shroff, and Puneet Agarwal TCS Research, Delhi Tata Consultancy Services Ltd. {karamjit.singh,

More information

Distinguishing between cause and effect

Distinguishing between cause and effect JMLR Workshop and Conference Proceedings 6:147 156 NIPS 28 workshop on causality Distinguishing between cause and effect Joris Mooij Max Planck Institute for Biological Cybernetics, 7276 Tübingen, Germany

More information

Distinguishing Cause from Effect Using Observational Data: Methods and Benchmarks

Distinguishing Cause from Effect Using Observational Data: Methods and Benchmarks Journal of Machine Learning Research 17 (2016) 1-102 Submitted 12/14; Revised 12/15; Published 4/16 Distinguishing Cause from Effect Using Observational Data: Methods and Benchmarks Joris M. Mooij Institute

More information

Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models

Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models Kun Zhang Dept of Computer Science and HIIT University of Helsinki 14 Helsinki, Finland kun.zhang@cs.helsinki.fi Aapo Hyvärinen

More information

Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models

Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models JMLR Workshop and Conference Proceedings 6:17 164 NIPS 28 workshop on causality Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models Kun Zhang Dept of Computer Science and HIIT University

More information

Causal Inference on Discrete Data via Estimating Distance Correlations

Causal Inference on Discrete Data via Estimating Distance Correlations ARTICLE CommunicatedbyAapoHyvärinen Causal Inference on Discrete Data via Estimating Distance Correlations Furui Liu frliu@cse.cuhk.edu.hk Laiwan Chan lwchan@cse.euhk.edu.hk Department of Computer Science

More information

Causal Modeling with Generative Neural Networks

Causal Modeling with Generative Neural Networks Causal Modeling with Generative Neural Networks Michele Sebag TAO, CNRS INRIA LRI Université Paris-Sud Joint work: D. Kalainathan, O. Goudet, I. Guyon, M. Hajaiej, A. Decelle, C. Furtlehner https://arxiv.org/abs/1709.05321

More information

Distinguishing between Cause and Effect: Estimation of Causal Graphs with two Variables

Distinguishing between Cause and Effect: Estimation of Causal Graphs with two Variables Distinguishing between Cause and Effect: Estimation of Causal Graphs with two Variables Jonas Peters ETH Zürich Tutorial NIPS 2013 Workshop on Causality 9th December 2013 F. H. Messerli: Chocolate Consumption,

More information

Inference of Cause and Effect with Unsupervised Inverse Regression

Inference of Cause and Effect with Unsupervised Inverse Regression Inference of Cause and Effect with Unsupervised Inverse Regression Eleni Sgouritsa Dominik Janzing Philipp Hennig Bernhard Schölkopf Max Planck Institute for Intelligent Systems, Tübingen, Germany {eleni.sgouritsa,

More information

Probabilistic latent variable models for distinguishing between cause and effect

Probabilistic latent variable models for distinguishing between cause and effect Probabilistic latent variable models for distinguishing between cause and effect Joris M. Mooij joris.mooij@tuebingen.mpg.de Oliver Stegle oliver.stegle@tuebingen.mpg.de Dominik Janzing dominik.janzing@tuebingen.mpg.de

More information

Inferring deterministic causal relations

Inferring deterministic causal relations Inferring deterministic causal relations Povilas Daniušis 1,2, Dominik Janzing 1, Joris Mooij 1, Jakob Zscheischler 1, Bastian Steudel 3, Kun Zhang 1, Bernhard Schölkopf 1 1 Max Planck Institute for Biological

More information

arxiv: v1 [cs.lg] 22 Feb 2017

arxiv: v1 [cs.lg] 22 Feb 2017 Causal Inference by Stochastic Complexity KAILASH BUDHATHOKI, Max Planck Institute for Informatics and Saarland University, Germany JILLES VREEKEN, Max Planck Institute for Informatics and Saarland University,

More information

Causality. Bernhard Schölkopf and Jonas Peters MPI for Intelligent Systems, Tübingen. MLSS, Tübingen 21st July 2015

Causality. Bernhard Schölkopf and Jonas Peters MPI for Intelligent Systems, Tübingen. MLSS, Tübingen 21st July 2015 Causality Bernhard Schölkopf and Jonas Peters MPI for Intelligent Systems, Tübingen MLSS, Tübingen 21st July 2015 Charig et al.: Comparison of treatment of renal calculi by open surgery, (...), British

More information

arxiv: v1 [stat.ml] 24 Dec 2018

arxiv: v1 [stat.ml] 24 Dec 2018 Journal of Machine Learning Research 0 (0000) 0-00 Submitted 2/8; Published 00/00 Bayesian Causal Inference arxiv:82.09895v [stat.ml] 24 Dec 208 Maximilian Kurthen Max-Planck-Institut für Astrophysik Karl-Schwarzschildstr.,

More information

arxiv: v1 [cs.lg] 11 Dec 2014

arxiv: v1 [cs.lg] 11 Dec 2014 Distinguishing cause from effect using observational data: methods and benchmarks Distinguishing cause from effect using observational data: methods and benchmarks Joris M. Mooij Institute for Informatics,

More information

Cause-Effect Inference by Comparing Regression Errors

Cause-Effect Inference by Comparing Regression Errors Cause-Effect Inference by Comparing Regression Errors Patrick Blöbaum Dominik Janzing Takashi Washio Osaka University MPI for Intelligent Systems Osaka University Japan Tübingen, Germany Japan Shohei Shimizu

More information

Simplicity of Additive Noise Models

Simplicity of Additive Noise Models Simplicity of Additive Noise Models Jonas Peters ETH Zürich - Marie Curie (IEF) Workshop on Simplicity and Causal Discovery Carnegie Mellon University 7th June 2014 contains joint work with... ETH Zürich:

More information

Discovery of Linear Acyclic Models Using Independent Component Analysis

Discovery of Linear Acyclic Models Using Independent Component Analysis Created by S.S. in Jan 2008 Discovery of Linear Acyclic Models Using Independent Component Analysis Shohei Shimizu, Patrik Hoyer, Aapo Hyvarinen and Antti Kerminen LiNGAM homepage: http://www.cs.helsinki.fi/group/neuroinf/lingam/

More information

arxiv: v2 [cs.ai] 15 Nov 2016

arxiv: v2 [cs.ai] 15 Nov 2016 Entropic Causal Inference Murat Kocaoglu 1,*, Alexandros G. Dimakis 1,, Sriram Vishwanath 1, and Babak Hassibi 2, arxiv:1611.04035v2 [cs.ai] 15 Nov 2016 1 Department of Electrical and Computer Engineering,

More information

Learning of Causal Relations

Learning of Causal Relations Learning of Causal Relations John A. Quinn 1 and Joris Mooij 2 and Tom Heskes 2 and Michael Biehl 3 1 Faculty of Computing & IT, Makerere University P.O. Box 7062, Kampala, Uganda 2 Institute for Computing

More information

CAUSAL DISCOVERY USING PROXY VARIABLES

CAUSAL DISCOVERY USING PROXY VARIABLES CAUSAL DISCOVERY USING PROXY VARIABLES Mateo Rojas-Carulla Facebook AI Research, Paris University of Cambridge Max Plank Institute for Intelligent Systems, Tübingen mateo.rojas-carulla@tuebingen.mpg.de

More information

Identification of Time-Dependent Causal Model: A Gaussian Process Treatment

Identification of Time-Dependent Causal Model: A Gaussian Process Treatment Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 25) Identification of Time-Dependent Causal Model: A Gaussian Process Treatment Biwei Huang Kun Zhang,2

More information

arxiv: v2 [stat.ml] 16 Oct 2017

arxiv: v2 [stat.ml] 16 Oct 2017 Causal Inference on Multivariate and Mixed-Type Data Alexander Marx Jilles Vreeken arxiv:702.06385v2 [stat.ml] 6 Oct 207 Abstract Given data over the joint distribution of two random variables X and Y,

More information

Nonlinear causal discovery with additive noise models

Nonlinear causal discovery with additive noise models Nonlinear causal discovery with additive noise models Patrik O. Hoyer University of Helsinki Finland Dominik Janzing MPI for Biological Cybernetics Tübingen, Germany Joris Mooij MPI for Biological Cybernetics

More information

arxiv: v1 [cs.ai] 22 Apr 2015

arxiv: v1 [cs.ai] 22 Apr 2015 Distinguishing Cause from Effect Based on Exogeneity [Extended Abstract] Kun Zhang MPI for Intelligent Systems 7276 Tübingen, Germany & Info. Sci. Inst., USC 4676 Admiralty Way, CA 9292 kzhang@tuebingen.mpg.de

More information

Detecting the Direction of Causal Time Series

Detecting the Direction of Causal Time Series Jonas Peters jonas.peters@tuebingen.mpg.de Dominik Janzing dominik.janzing@tuebingen.mpg.de Arthur Gretton arthur.gretton@tuebingen.mpg.de Bernhard Schölkopf bernhard.schoelkopf@tuebingen.mpg.de Max-Planck-Institute

More information

39 On Estimation of Functional Causal Models: General Results and Application to Post-Nonlinear Causal Model

39 On Estimation of Functional Causal Models: General Results and Application to Post-Nonlinear Causal Model 39 On Estimation of Functional Causal Models: General Results and Application to Post-Nonlinear Causal Model KUN ZHANG, Max-Planck Institute for Intelligent Systems, 776 Tübingen, Germany ZHIKUN WANG,

More information

Dependence Minimizing Regression with Model Selection for Non-Linear Causal Inference under Non-Gaussian Noise

Dependence Minimizing Regression with Model Selection for Non-Linear Causal Inference under Non-Gaussian Noise Dependence Minimizing Regression with Model Selection for Non-Linear Causal Inference under Non-Gaussian Noise Makoto Yamada and Masashi Sugiyama Department of Computer Science, Tokyo Institute of Technology

More information

Causal Inference on Multivariate and Mixed-Type Data

Causal Inference on Multivariate and Mixed-Type Data Causal Inference on Multivariate and Mixed-Type Data Alexander Marx and Jilles Vreeken Max Planck Institute for Informatics and Saarland University, Saarbrücken, Germany {amarx,jilles}@mpi-inf.mpg.de Abstract.

More information

Advances in Cyclic Structural Causal Models

Advances in Cyclic Structural Causal Models Advances in Cyclic Structural Causal Models Joris Mooij j.m.mooij@uva.nl June 1st, 2018 Joris Mooij (UvA) Rotterdam 2018 2018-06-01 1 / 41 Part I Introduction to Causality Joris Mooij (UvA) Rotterdam 2018

More information

Estimating Causal Direction and Confounding Of Two Discrete Variables

Estimating Causal Direction and Confounding Of Two Discrete Variables Estimating Causal Direction an Confouning Of Two Discrete Variables This inspire further work on the so calle aitive noise moels. Hoyer et al. (2009) extene Shimizu s ientifiaarxiv:1611.01504v1 [stat.ml]

More information

arxiv: v1 [stat.ml] 13 Mar 2018

arxiv: v1 [stat.ml] 13 Mar 2018 SAM: Structural Agnostic Model, Causal Discovery and Penalized Adversarial Learning arxiv:1803.04929v1 [stat.ml] 13 Mar 2018 Diviyan Kalainathan 1, Olivier Goudet 1, Isabelle Guyon 1, David Lopez-Paz 2,

More information

Identifying confounders using additive noise models

Identifying confounders using additive noise models Identifying confounders using additive noise models Domini Janzing MPI for Biol. Cybernetics Spemannstr. 38 7276 Tübingen Germany Jonas Peters MPI for Biol. Cybernetics Spemannstr. 38 7276 Tübingen Germany

More information

Lecture 24, Causal Discovery

Lecture 24, Causal Discovery Lecture 24, Causal Discovery 36-402, Advanced Data Analysis 21 April 2011 Contents 1 Testing DAGs 1 2 Causal Discovery with Known Variables 3 2.1 Causal Discovery with Hidden Variables.............. 5

More information

arxiv: v1 [stat.ml] 8 May 2018

arxiv: v1 [stat.ml] 8 May 2018 Learning Generalized Hypergeometric Distribution GHD) DAG models Gunwoong Park 1 1 Department of Statistics, University of Seoul Abstract arxiv:1805.02848v1 [stat.ml] 8 May 2018 In this paper, we introduce

More information

Causal Inference in Time Series via Supervised Learning

Causal Inference in Time Series via Supervised Learning Causal Inference in Time Series via Supervised Learning Yoichi Chikahara and Akinori Fujino NTT Communication Science Laboratories, Kyoto 619-0237, Japan chikahara.yoichi@lab.ntt.co.jp, fujino.akinori@lab.ntt.co.jp

More information

Recovering the Graph Structure of Restricted Structural Equation Models

Recovering the Graph Structure of Restricted Structural Equation Models Recovering the Graph Structure of Restricted Structural Equation Models Workshop on Statistics for Complex Networks, Eindhoven Jonas Peters 1 J. Mooij 3, D. Janzing 2, B. Schölkopf 2, P. Bühlmann 1 1 Seminar

More information

Estimation of linear non-gaussian acyclic models for latent factors

Estimation of linear non-gaussian acyclic models for latent factors Estimation of linear non-gaussian acyclic models for latent factors Shohei Shimizu a Patrik O. Hoyer b Aapo Hyvärinen b,c a The Institute of Scientific and Industrial Research, Osaka University Mihogaoka

More information

Foundations of Causal Inference

Foundations of Causal Inference Foundations of Causal Inference Causal inference is usually concerned with exploring causal relations among random variables X 1,..., X n after observing sufficiently many samples drawn from the joint

More information

arxiv: v1 [stat.ml] 26 May 2016

arxiv: v1 [stat.ml] 26 May 2016 Discovering Causal Signals in Images David Lopez-Paz Facebook AI Research dlp@fb.com Robert Nishihara UC Berkeley rkn@eecs.berkeley.edu Soumith Chintala Facebook AI Research soumith@fb.com arxiv:605.0879v

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a preprint version which may differ from the publisher's version. For additional information about this

More information

Accurate Causal Inference on Discrete Data

Accurate Causal Inference on Discrete Data Accurate Causal Inference on Discrete Data Kailash Budhathoki and Jilles Vreeken Max Planck Institute for Informatics and Saarland University Saarland Informatics Campus, Saarbrücken, Germany {kbudhath,jilles}@mpi-inf.mpg.de

More information

arxiv: v1 [cs.ai] 16 Aug 2016

arxiv: v1 [cs.ai] 16 Aug 2016 Evaluating Causal Models by Comparing Interventional Distributions arxiv:1608.04698v1 [cs.ai] 16 Aug 2016 Dan Garant dgarant@cs.umass.edu College of Information and Computer Sciences University of Massachusetts

More information

UAI 2014 Workshop Causal Inference: Learning and Prediction

UAI 2014 Workshop Causal Inference: Learning and Prediction Joris M. Mooij, Dominik Janzing, Jonas Peters, Tom Claassen, Antti Hyttinen (Eds.) Proceedings of the UAI 2014 Workshop Causal Inference: Learning and Prediction Quebec City, Quebec, Canada July 27, 2014

More information

Joint estimation of linear non-gaussian acyclic models

Joint estimation of linear non-gaussian acyclic models Joint estimation of linear non-gaussian acyclic models Shohei Shimizu The Institute of Scientific and Industrial Research, Osaka University Mihogaoka 8-1, Ibaraki, Osaka 567-0047, Japan Abstract A linear

More information

Artificial Intelligence

Artificial Intelligence Artificial Intelligence 182 183 212) 1 31 Contents lists available at SciVerse ScienceDirect Artificial Intelligence www.elsevier.com/locate/artint Information-geometric approach to inferring causal directions

More information

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Bayesian learning: Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Let y be the true label and y be the predicted

More information

Causality: Objectives and Assessment Challenges in Machine Learning, Volume 4

Causality: Objectives and Assessment Challenges in Machine Learning, Volume 4 Causality: Objectives and Assessment Challenges in Machine Learning, Volume 4 Causality: Objectives and Assessment Challenges in Machine Learning, Volume 4 Isabelle Guyon, Dominik Janzing, and Bernhard

More information

FA Homework 5 Recitation 1. Easwaran Ramamurthy Guoquan (GQ) Zhao Logan Brooks

FA Homework 5 Recitation 1. Easwaran Ramamurthy Guoquan (GQ) Zhao Logan Brooks FA17 10-701 Homework 5 Recitation 1 Easwaran Ramamurthy Guoquan (GQ) Zhao Logan Brooks Note Remember that there is no problem set covering some of the lecture material; you may need to study these topics

More information

Introduction to Three Paradigms in Machine Learning. Julien Mairal

Introduction to Three Paradigms in Machine Learning. Julien Mairal Introduction to Three Paradigms in Machine Learning Julien Mairal Inria Grenoble Yerevan, 208 Julien Mairal Introduction to Three Paradigms in Machine Learning /25 Optimization is central to machine learning

More information

CSC2515 Winter 2015 Introduction to Machine Learning. Lecture 2: Linear regression

CSC2515 Winter 2015 Introduction to Machine Learning. Lecture 2: Linear regression CSC2515 Winter 2015 Introduction to Machine Learning Lecture 2: Linear regression All lecture slides will be available as.pdf on the course website: http://www.cs.toronto.edu/~urtasun/courses/csc2515/csc2515_winter15.html

More information

Efficient combination of pairwise feature networks

Efficient combination of pairwise feature networks JMLR: Workshop and Conference Proceedings 46:77 84, 2015 ECML 2014 Efficient combination of pairwise feature networks Pau Bellot pau.bellot@upc.edu Department of Signal Theory and Communications, Technical

More information

Bayesian Discovery of Linear Acyclic Causal Models

Bayesian Discovery of Linear Acyclic Causal Models Bayesian Discovery of Linear Acyclic Causal Models Patrik O. Hoyer Helsinki Institute for Information Technology & Department of Computer Science University of Helsinki Finland Antti Hyttinen Helsinki

More information

Dimension Reduction and Classification Using PCA and Factor. Overview

Dimension Reduction and Classification Using PCA and Factor. Overview Dimension Reduction and Classification Using PCA and - A Short Overview Laboratory for Interdisciplinary Statistical Analysis Department of Statistics Virginia Tech http://www.stat.vt.edu/consult/ March

More information

Telling Cause from Effect using MDL-based Local and Global Regression

Telling Cause from Effect using MDL-based Local and Global Regression Telling Cause from Effect using MDL-based Local and Global Regression Alexander Marx and Jilles Vreeken Max Planck Institute for Informatics and Saarland University Saarland Informatics Campus, Saarbrücken,

More information

NEURAL CAUSAL REGULARIZATION UNDER

NEURAL CAUSAL REGULARIZATION UNDER NEURAL CAUSAL REGULARIZATION UNDER THE INDEPENDENCE OF MECHANISMS ASSUMPTION Mohammad Taha Bahadori 1, Krzysztof Chalupka 2, Edward Choi 1, Robert Chen 1, Walter F. Stewart 3, & Jimeng Sun 1 1 Georgia

More information

Technical Track Session I: Causal Inference

Technical Track Session I: Causal Inference Impact Evaluation Technical Track Session I: Causal Inference Human Development Human Network Development Network Middle East and North Africa Region World Bank Institute Spanish Impact Evaluation Fund

More information

Inferring Causal Direction from Relational Data

Inferring Causal Direction from Relational Data Inferring Causal Direction from Relational Data David Arbour darbour@cs.umass.edu Katerina Marazopoulou kmarazo@cs.umass.edu College of Information and Computer Sciences University of Massachusetts Amherst

More information

Data-driven causal path discovery without prior knowledge - a benchmark study

Data-driven causal path discovery without prior knowledge - a benchmark study Data-driven causal path discovery without prior knowledge - a benchmark study Marcel M lyńczak arxiv:1807.02348v2 [stat.ap] 17 Jul 2018 July 18, 2018 Abstract Causal discovery broadens the inference possibilities,

More information

Content Preview. Multivariate Methods. There are several theoretical stumbling blocks to overcome to develop rating relativities

Content Preview. Multivariate Methods. There are several theoretical stumbling blocks to overcome to develop rating relativities Introduction to Ratemaking Multivariate Methods March 15, 2010 Jonathan Fox, FCAS, GuideOne Mutual Insurance Group Content Preview 1. Theoretical Issues 2. One-way Analysis Shortfalls 3. Multivariate Methods

More information

Causality: Objectives and Assessment

Causality: Objectives and Assessment JMLR Workshop and Conference Proceedings 6:1 38 NIPS 2008 workshop on causality Isabelle Guyon Clopinet, California, USA Causality: Objectives and Assessment ISABELLE@CLOPINET.COM Dominik Janzing DOMINIK.JANZING@TUEBINGEN.MPG.DE

More information

Analysis of Cause-Effect Inference via Regression Errors

Analysis of Cause-Effect Inference via Regression Errors arxiv:82.6698v [cs.ai] 9 Feb 28 Analysis of Cause-Effect Inference via Regression Errors Patrick Blöbaum, Dominik Janzing 2, Takashi Washio, Shohei Shimizu,3, and Bernhard Schölkopf 2 Osaka University,

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

Identifiability of Non-Gaussian Structural VAR Models for Subsampled and Mixed Frequency Time Series

Identifiability of Non-Gaussian Structural VAR Models for Subsampled and Mixed Frequency Time Series Identifiability of Non-Gaussian Structural VAR Models for Subsampled and Mixed Frequency Time Series Alex Tank Department of Statistics University of Washington alextank@uw.edu Emily Fox Department of

More information

Classification with Perceptrons. Reading:

Classification with Perceptrons. Reading: Classification with Perceptrons Reading: Chapters 1-3 of Michael Nielsen's online book on neural networks covers the basics of perceptrons and multilayer neural networks We will cover material in Chapters

More information

AP HUG REVIEW WELCOME TO 2 ND SEMESTER! Annette Parkhurst, M.Ed. January, 2015

AP HUG REVIEW WELCOME TO 2 ND SEMESTER! Annette Parkhurst, M.Ed. January, 2015 AP HUG REVIEW WELCOME TO 2 ND SEMESTER! Annette Parkhurst, M.Ed. January, 2015 Movement Globalization Latitude & Elevation Levels of Economic Activities CONNECTIONS Human Geography Human is the geography

More information

Mathematical Tools for Neuroscience (NEU 314) Princeton University, Spring 2016 Jonathan Pillow. Homework 8: Logistic Regression & Information Theory

Mathematical Tools for Neuroscience (NEU 314) Princeton University, Spring 2016 Jonathan Pillow. Homework 8: Logistic Regression & Information Theory Mathematical Tools for Neuroscience (NEU 34) Princeton University, Spring 206 Jonathan Pillow Homework 8: Logistic Regression & Information Theory Due: Tuesday, April 26, 9:59am Optimization Toolbox One

More information

From Deterministic ODEs to Dynamic Structural Causal Models

From Deterministic ODEs to Dynamic Structural Causal Models From Deterministic ODEs to Dynamic Structural Causal Models Paul K. Rubenstein Department of Engineering University of Cambridge pkr23@cam.ac.uk Stephan Bongers Informatics Institute University of Amsterdam

More information

Chart types and when to use them

Chart types and when to use them APPENDIX A Chart types and when to use them Pie chart Figure illustration of pie chart 2.3 % 4.5 % Browser Usage for April 2012 18.3 % 38.3 % Internet Explorer Firefox Chrome Safari Opera 35.8 % Pie chart

More information

Detecting Statistical Interactions from Neural Network Weights

Detecting Statistical Interactions from Neural Network Weights Detecting Statistical Interactions from Neural Network Weights Michael Tsang Joint work with Dehua Cheng, Yan Liu 1/17 Motivation: We seek assurance that a neural network learned the longitude x latitude

More information

Jessica Wehner. Summer Fellow Bioengineering and Bioinformatics Summer Institute University of Pittsburgh 29 May 2008

Jessica Wehner. Summer Fellow Bioengineering and Bioinformatics Summer Institute University of Pittsburgh 29 May 2008 Journal Club Jessica Wehner Summer Fellow Bioengineering and Bioinformatics Summer Institute University of Pittsburgh 29 May 2008 Comparison of Probabilistic Combination Methods for Protein Secondary Structure

More information

Arrowhead completeness from minimal conditional independencies

Arrowhead completeness from minimal conditional independencies Arrowhead completeness from minimal conditional independencies Tom Claassen, Tom Heskes Radboud University Nijmegen The Netherlands {tomc,tomh}@cs.ru.nl Abstract We present two inference rules, based on

More information

Telling cause from effect based on high-dimensional observations

Telling cause from effect based on high-dimensional observations Telling cause from effect based on high-dimensional observations Dominik Janzing dominik.janzing@tuebingen.mpg.de Max Planck Institute for Biological Cybernetics, Tübingen, Germany Patrik O. Hoyer patrik.hoyer@helsinki.fi

More information

Tensor Product Kernels: Characteristic Property, Universality

Tensor Product Kernels: Characteristic Property, Universality Tensor Product Kernels: Characteristic Property, Universality Zolta n Szabo CMAP, E cole Polytechnique Joint work with: Bharath K. Sriperumbudur Hangzhou International Conference on Frontiers of Data Science

More information

HB Methods for Combining Estimates from Multiple Surveys

HB Methods for Combining Estimates from Multiple Surveys Hierarchical Bayesian Methods for Combining Estimates from Multiple Surveys Adrijo Chakraborty NORC at the University of Chicago January 30, 2015 Joint work with Gauri Sankar Datta and Yang Cheng Outline

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

Computational Genomics

Computational Genomics Computational Genomics http://www.cs.cmu.edu/~02710 Introduction to probability, statistics and algorithms (brief) intro to probability Basic notations Random variable - referring to an element / event

More information

Spatial Decision Tree: A Novel Approach to Land-Cover Classification

Spatial Decision Tree: A Novel Approach to Land-Cover Classification Spatial Decision Tree: A Novel Approach to Land-Cover Classification Zhe Jiang 1, Shashi Shekhar 1, Xun Zhou 1, Joseph Knight 2, Jennifer Corcoran 2 1 Department of Computer Science & Engineering 2 Department

More information

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 8. Chapter 8. Classification: Basic Concepts

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 8. Chapter 8. Classification: Basic Concepts Data Mining: Concepts and Techniques (3 rd ed.) Chapter 8 Chapter 8. Classification: Basic Concepts Classification: Basic Concepts Decision Tree Induction Bayes Classification Methods Rule-Based Classification

More information

From Deterministic ODEs to Dynamic Structural Causal Models

From Deterministic ODEs to Dynamic Structural Causal Models From Deterministic ODEs to Dynamic Structural Causal Models Paul K. Rubenstein Department of Engineering University of Cambridge pkr23@cam.ac.uk Stephan Bongers Informatics Institute University of Amsterdam

More information

Causal Inference by Compression

Causal Inference by Compression Causal Inference by Compression Kailash Budhathoki and Jilles Vreeken Max Planck Institute for Informatics and Saarland University, Saarbrücken, Germany {kbudhath,jilles}@mpi-inf.mpg.de Abstract Causal

More information

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton. 1/17 Research Methods Carlos Noton Term 2-2012 Outline 2/17 1 Econometrics in a nutshell: Variation and Identification 2 Main Assumptions 3/17 Dependent variable or outcome Y is the result of two forces:

More information

A Decision Stump. Decision Trees, cont. Boosting. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. October 1 st, 2007

A Decision Stump. Decision Trees, cont. Boosting. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. October 1 st, 2007 Decision Trees, cont. Boosting Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University October 1 st, 2007 1 A Decision Stump 2 1 The final tree 3 Basic Decision Tree Building Summarized

More information

arxiv: v1 [cs.lg] 26 May 2017

arxiv: v1 [cs.lg] 26 May 2017 Learning Causal tructures Using Regression Invariance ariv:1705.09644v1 [cs.lg] 26 May 2017 AmirEmad Ghassami, aber alehkaleybar, Negar Kiyavash, Kun Zhang Department of ECE, University of Illinois at

More information

Predictive Analytics on Accident Data Using Rule Based and Discriminative Classifiers

Predictive Analytics on Accident Data Using Rule Based and Discriminative Classifiers Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 3 (2017) pp. 461-469 Research India Publications http://www.ripublication.com Predictive Analytics on Accident Data Using

More information

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs

Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Proceedings of Machine Learning Research vol 73:21-32, 2017 AMBN 2017 Causal Effect Identification in Alternative Acyclic Directed Mixed Graphs Jose M. Peña Linköping University Linköping (Sweden) jose.m.pena@liu.se

More information

Experimental Design for Learning Causal Graphs with Latent Variables

Experimental Design for Learning Causal Graphs with Latent Variables Experimental Design for Learning Causal Graphs with Latent Variables Murat Kocaoglu Department of Electrical and Computer Engineering The University of Texas at Austin, USA mkocaoglu@utexas.edu Karthikeyan

More information

Assignment 1: Probabilistic Reasoning, Maximum Likelihood, Classification

Assignment 1: Probabilistic Reasoning, Maximum Likelihood, Classification Assignment 1: Probabilistic Reasoning, Maximum Likelihood, Classification For due date see https://courses.cs.sfu.ca This assignment is to be done individually. Important Note: The university policy on

More information

Kernel Methods. Lecture 4: Maximum Mean Discrepancy Thanks to Karsten Borgwardt, Malte Rasch, Bernhard Schölkopf, Jiayuan Huang, Arthur Gretton

Kernel Methods. Lecture 4: Maximum Mean Discrepancy Thanks to Karsten Borgwardt, Malte Rasch, Bernhard Schölkopf, Jiayuan Huang, Arthur Gretton Kernel Methods Lecture 4: Maximum Mean Discrepancy Thanks to Karsten Borgwardt, Malte Rasch, Bernhard Schölkopf, Jiayuan Huang, Arthur Gretton Alexander J. Smola Statistical Machine Learning Program Canberra,

More information

Decision Trees. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. February 5 th, Carlos Guestrin 1

Decision Trees. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. February 5 th, Carlos Guestrin 1 Decision Trees Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University February 5 th, 2007 2005-2007 Carlos Guestrin 1 Linear separability A dataset is linearly separable iff 9 a separating

More information

Classification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees

Classification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees Classification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees Rafdord M. Neal and Jianguo Zhang Presented by Jiwen Li Feb 2, 2006 Outline Bayesian view of feature

More information

Hilbert Space Representations of Probability Distributions

Hilbert Space Representations of Probability Distributions Hilbert Space Representations of Probability Distributions Arthur Gretton joint work with Karsten Borgwardt, Kenji Fukumizu, Malte Rasch, Bernhard Schölkopf, Alex Smola, Le Song, Choon Hui Teo Max Planck

More information

Maximum Mean Discrepancy

Maximum Mean Discrepancy Maximum Mean Discrepancy Thanks to Karsten Borgwardt, Malte Rasch, Bernhard Schölkopf, Jiayuan Huang, Arthur Gretton Alexander J. Smola Statistical Machine Learning Program Canberra, ACT 0200 Australia

More information

Decision Trees Lecture 12

Decision Trees Lecture 12 Decision Trees Lecture 12 David Sontag New York University Slides adapted from Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore Machine Learning in the ER Physician documentation Triage Information

More information

Importance Reweighting Using Adversarial-Collaborative Training

Importance Reweighting Using Adversarial-Collaborative Training Importance Reweighting Using Adversarial-Collaborative Training Yifan Wu yw4@andrew.cmu.edu Tianshu Ren tren@andrew.cmu.edu Lidan Mu lmu@andrew.cmu.edu Abstract We consider the problem of reweighting a

More information

PAC-learning, VC Dimension and Margin-based Bounds

PAC-learning, VC Dimension and Margin-based Bounds More details: General: http://www.learning-with-kernels.org/ Example of more complex bounds: http://www.research.ibm.com/people/t/tzhang/papers/jmlr02_cover.ps.gz PAC-learning, VC Dimension and Margin-based

More information

Variance Reduction and Ensemble Methods

Variance Reduction and Ensemble Methods Variance Reduction and Ensemble Methods Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Last Time PAC learning Bias/variance tradeoff small hypothesis

More information