Preliminary Causal Analysis Results with Software Cost Estimation Data. Anandi Hira, Bob Stoddard, Mike Konrad, Barry Boehm

Size: px
Start display at page:

Download "Preliminary Causal Analysis Results with Software Cost Estimation Data. Anandi Hira, Bob Stoddard, Mike Konrad, Barry Boehm"

Transcription

1 Preliminary Causal Analysis Results with Software Cost Estimation Data Anandi Hira, Bob Stoddard, Mike Konrad, Barry Boehm

2 Parametric Cost Models COCOMO II Effort =2.94 Size E 17 i=1 EM i u Input: size, product and personnel attributes u Effort in Person-Months (PM) u Size in KSLOC (1000 SLOC) u Domain Experts u Data calibration u No causal analysis

3 Causal Inference Causal Learning/Inference Causal Discovery Causal Estimation Algorithms and Domain Knowledge on Data Algorithms to quantify causal influence

4 Past Causal-Type Analyses Dr. Boehm COCOMO 81 u In-depth behavioral analyses for effort factors Evidence-Based SE u Experiments u Cause precede effect u Cause covaries with effect u Alternative explanations are implausible Cuoto et al u Granger s causality test for software defect predictability u Doesn t get to heart of causality Hu et al u Bayesian networks with causality constraints for software risk factors

5 PC Search u Named after Peter Spirtes and Clark Glymour u First scalable discovery algorithm X 1 X 2 Change in X 1 causes change in X 2 X 1 X 2 Insufficient data to select orientation X 1 X 2 May be common confounder of both variables, missing from dataset

6 Tetrad

7 Dataset: Unified Code Count (UCC) Project Description u Maintained at USC u Code metrics tool (logical SLOC, cyclomatic complexity) u Implemented in C++ u 45 to 1425 logical SLOC u 2010 to 2014 u Modularized architecture u 4-month time-boxed increments Project Types u Add Functions u New language parsers u New features, such as GUI u Modify Functions u Cyclomatic complexity support (modify existing language parsers with mathematical operation and algorithms)

8 Dataset Attributes 1. Equivalent SLOC 2. IFPUG Function Points 3. IFPUG Software Non-functional Assessment Process 4. COSMIC Function Points 5. Total Effort 6. Applications Experience 7. Platform Experience 8. Use of Software Tools 9. Personnel Continuity 10. Documentation Match to Needs 11. Analyst Capability 12. Programmer Capability 13. Product Complexity

9 All Data Points

10 ESLOC Normalized Effort (hours) UCC Projects Calibrated Model Equivalent SLOC

11 IFPUG FPs Normalized Effort (hours) Modified Functions Add Functions Enhancement Function Points

12 IFPUG SNAP Normalized Effort (hours) Modified Functions Add Functions Enhancement SNAP Points

13 COSMIC FPs Normalized Effort (hours) Modified Functions Add Functions COSMIC Function Points

14 Add Functions

15 ESLOC Normalized Effort (hrs) Equivalent SLOC

16 IFPUG FPs Normalized Effort (hours) Enhancement Function Points

17 IFPUG SNAP Normalized Effort (hours) Enhancement SNAP Points

18 COSMIC FPs Normalized Effort (hours) COSMIC Function Points

19 Modify Functions

20 ESLOC Normalized Effort (hours) Equivalent SLOC

21 IFPUG FPs Normalized Effort (hours) Enhancement Function Points

22 IFPUG SNAP Normalized Effort (hours) Enhancement SNAP Points

23 COSMIC FPs Normalized Effort (hours) COSMIC Function Points

24 Conclusion

25 Conclusions General Conclusions u u u All Data Points u SNAP -> Total Effort u CFPs -> Total Effort u PCAP Total Effort u ACAP PCAP Add Functions u PCAP Total Effort Modify Functions u ESLOC Total Effort u SNAP Total Effort u ACAP PCAP Interesting Results u All Data Points u CFPs -> DOCU u Modify Functions u CFPs -> PCAP u ACAP -> PCAP

Preliminary Causal Analysis Results with Software Cost Estimation Data

Preliminary Causal Analysis Results with Software Cost Estimation Data Preliminary Causal Analysis Results with Software Cost Estimation Data Anandi Hira, Barry Boehm University of Southern California Robert Stoddard, Michael Konrad Software Engineering Institute Motivation

More information

Improve Forecasts: Use Defect Signals

Improve Forecasts: Use Defect Signals Improve Forecasts: Use Defect Signals Paul Below paul.below@qsm.com Quantitative Software Management, Inc. Introduction Large development and integration project testing phases can extend over many months

More information

Fuzzy Expert-COCOMO Risk Assessment and Effort Contingency Model in Software Project Management

Fuzzy Expert-COCOMO Risk Assessment and Effort Contingency Model in Software Project Management Western University Scholarship@Western Electronic Thesis and Dissertation Repository April 2013 Fuzzy Expert-COCOMO Assessment and Effort Contingency Model in Software Project Management Ekananta Manalif

More information

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization

More information

Joining Effort and Duration in a Probabilistic Method for Predicting Software Cost and Schedule

Joining Effort and Duration in a Probabilistic Method for Predicting Software Cost and Schedule Joining Effort and Duration in a Probabilistic Method for Predicting Software Cost and Schedule Mike Ross President & CEO ESTIMATING, LLC 325 Woodyard Road Lakeside, Montana 59922 (o) 480.710.4678 (f)

More information

Strategies for Discovering Mechanisms of Mind using fmri: 6 NUMBERS. Joseph Ramsey, Ruben Sanchez Romero and Clark Glymour

Strategies for Discovering Mechanisms of Mind using fmri: 6 NUMBERS. Joseph Ramsey, Ruben Sanchez Romero and Clark Glymour 1 Strategies for Discovering Mechanisms of Mind using fmri: 6 NUMBERS Joseph Ramsey, Ruben Sanchez Romero and Clark Glymour 2 The Numbers 20 50 5024 9205 8388 500 3 fmri and Mechanism From indirect signals

More information

MATHEMATICS OF DATA FUSION

MATHEMATICS OF DATA FUSION MATHEMATICS OF DATA FUSION by I. R. GOODMAN NCCOSC RDTE DTV, San Diego, California, U.S.A. RONALD P. S. MAHLER Lockheed Martin Tactical Defences Systems, Saint Paul, Minnesota, U.S.A. and HUNG T. NGUYEN

More information

Inferring the Causal Decomposition under the Presence of Deterministic Relations.

Inferring the Causal Decomposition under the Presence of Deterministic Relations. Inferring the Causal Decomposition under the Presence of Deterministic Relations. Jan Lemeire 1,2, Stijn Meganck 1,2, Francesco Cartella 1, Tingting Liu 1 and Alexander Statnikov 3 1-ETRO Department, Vrije

More information

The Specification of Causal Models with Tetrad IV: A Review

The Specification of Causal Models with Tetrad IV: A Review Structural Equation Modeling, 17:703 711, 2010 Copyright Taylor & Francis Group, LLC ISSN: 1070-5511 print/1532-8007 online DOI: 10.1080/10705511.2010.510074 SOFTWARE REVIEW The Specification of Causal

More information

Case Study on Software Effort Estimation

Case Study on Software Effort Estimation International Journal of Information and Electronics Engineering, Vol. 7, No. 3, May 217 Case Study on Software Estimation Tülin Erçelebi Ayyıldız and Hasan Can Terzi Abstract Since most of the projects

More information

Modeling the Relationship between Software Effort and Size Using Deming Regression

Modeling the Relationship between Software Effort and Size Using Deming Regression Modeling the Relationship between Software Effort and Size Using Deming Regression Nikolaos Mittas Makrina Viola Kosti Basiliki Argyropoulou Lefteris Angelis Programming Languages & Software Engineering

More information

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables N-1 Experiments Suffice to Determine the Causal Relations Among N Variables Frederick Eberhardt Clark Glymour 1 Richard Scheines Carnegie Mellon University Abstract By combining experimental interventions

More information

A Value-Added Predictive Defect Type Distribution Model based on Project Characteristics

A Value-Added Predictive Defect Type Distribution Model based on Project Characteristics Seventh IEEE/ACIS International Conference on Computer and Information Science A Value-Added Predictive Defect Type Distribution Model based on Project Characteristics Youngki Hong, Jongmoon Baik, In-Young

More information

Causal Discovery with Latent Variables: the Measurement Problem

Causal Discovery with Latent Variables: the Measurement Problem Causal Discovery with Latent Variables: the Measurement Problem Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Ricardo Silva, Steve Riese, Erich Kummerfeld, Renjie Yang, Max

More information

Challenges (& Some Solutions) and Making Connections

Challenges (& Some Solutions) and Making Connections Challenges (& Some Solutions) and Making Connections Real-life Search All search algorithm theorems have form: If the world behaves like, then (probability 1) the algorithm will recover the true structure

More information

Advising on Research Methods: A consultant's companion. Herman J. Ader Gideon J. Mellenbergh with contributions by David J. Hand

Advising on Research Methods: A consultant's companion. Herman J. Ader Gideon J. Mellenbergh with contributions by David J. Hand Advising on Research Methods: A consultant's companion Herman J. Ader Gideon J. Mellenbergh with contributions by David J. Hand Contents Preface 13 I Preliminaries 19 1 Giving advice on research methods

More information

Study of the effectiveness of cost-estimation models and complexity metrics on small projects

Study of the effectiveness of cost-estimation models and complexity metrics on small projects Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 12-12-1986 Study of the effectiveness of cost-estimation models and complexity metrics

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach

Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach Stijn Meganck 1, Philippe Leray 2, and Bernard Manderick 1 1 Vrije Universiteit Brussel, Pleinlaan 2,

More information

Sensor Network Information Analytical Methods: Analysis of Similarities and Differences

Sensor Network Information Analytical Methods: Analysis of Similarities and Differences Sensors & Transducers 2014 by IFSA Publishing, S. L. http://www.sensorsportal.com Sensor Network Information Analytical Methods: Analysis of Similarities and Differences Chen Jian Computer Engineering

More information

Chapter 8 Mining Additional Perspectives

Chapter 8 Mining Additional Perspectives Chapter 8 Mining Additional Perspectives prof.dr.ir. Wil van der Aalst www.processmining.org Overview Chapter 1 Introduction Part I: Preliminaries Chapter 2 Process Modeling and Analysis Chapter 3 Data

More information

Measurement Error and Causal Discovery

Measurement Error and Causal Discovery Measurement Error and Causal Discovery Richard Scheines & Joseph Ramsey Department of Philosophy Carnegie Mellon University Pittsburgh, PA 15217, USA 1 Introduction Algorithms for causal discovery emerged

More information

JOINT PROBABILISTIC INFERENCE OF CAUSAL STRUCTURE

JOINT PROBABILISTIC INFERENCE OF CAUSAL STRUCTURE JOINT PROBABILISTIC INFERENCE OF CAUSAL STRUCTURE Dhanya Sridhar Lise Getoor U.C. Santa Cruz KDD Workshop on Causal Discovery August 14 th, 2016 1 Outline Motivation Problem Formulation Our Approach Preliminary

More information

Adopting Curvilinear Component Analysis to Improve Software Cost Estimation Accuracy Model, Application Strategy, and an Experimental Verification

Adopting Curvilinear Component Analysis to Improve Software Cost Estimation Accuracy Model, Application Strategy, and an Experimental Verification Adopting Curvilinear Component Analysis to Improve Software Cost Estimation Accuracy Model, Application Strategy, and an Experimental Verification Salvatore A. Sarcia 1, Giovanni Cantone 1 and Victor R.

More information

Forecasting Conflict Lecture 4 Models and Metrics

Forecasting Conflict Lecture 4 Models and Metrics Forecasting Conflict Lecture 4 Models and Metrics Philip A. Schrodt Parus Analytical Systems schrodt735@gmail.com Graduate School of Decision Sciences University of Konstanz 14-17 October 2013 Overview

More information

Fast and Accurate Causal Inference from Time Series Data

Fast and Accurate Causal Inference from Time Series Data Fast and Accurate Causal Inference from Time Series Data Yuxiao Huang and Samantha Kleinberg Stevens Institute of Technology Hoboken, NJ {yuxiao.huang, samantha.kleinberg}@stevens.edu Abstract Causal inference

More information

Lars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), Institute BW/WI & Institute for Computer Science, University of Hildesheim

Lars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), Institute BW/WI & Institute for Computer Science, University of Hildesheim Course on Bayesian Networks, summer term 2010 0/42 Bayesian Networks Bayesian Networks 11. Structure Learning / Constrained-based Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL)

More information

UNIVERSITY OF THE PHILIPPINES LOS BAÑOS INSTITUTE OF STATISTICS BS Statistics - Course Description

UNIVERSITY OF THE PHILIPPINES LOS BAÑOS INSTITUTE OF STATISTICS BS Statistics - Course Description UNIVERSITY OF THE PHILIPPINES LOS BAÑOS INSTITUTE OF STATISTICS BS Statistics - Course Description COURSE COURSE TITLE UNITS NO. OF HOURS PREREQUISITES DESCRIPTION Elementary Statistics STATISTICS 3 1,2,s

More information

Weighted Stability Index (WSI) Metric Model Mike Libassi Intel Corp 8/11/99

Weighted Stability Index (WSI) Metric Model Mike Libassi Intel Corp 8/11/99 Weighted Stability Index (WSI) Metric Model Mike Libassi Intel Corp 8/11/99 Abstract Methods, such as McCabe's Cyclomatic Complexity, have proven that complexity is a reliable predictor of defects. Although

More information

QCA: Strengths, Weaknesses, Policy Relevance

QCA: Strengths, Weaknesses, Policy Relevance QCA: Strengths, Weaknesses, Policy Relevance SVUF Conference Stockholm, 19 October 2017 Barbara Befani Presentation Outline Clearing out some misconceptions on causality and causal inference Impact Evaluation

More information

BEYOND ASSORTATIVITY PROCLIVITY INDEX FOR ATTRIBUTED NETWORKS (PRONE) Reihaneh Rabbany Dhivya Eswaran*

BEYOND ASSORTATIVITY PROCLIVITY INDEX FOR ATTRIBUTED NETWORKS (PRONE) Reihaneh Rabbany Dhivya Eswaran* PROCLIVITY INDEX FOR ATTRIBUTED NETWORKS (PRONE) Reihaneh Rabbany rrabbany@andrew.cmu.edu Artur W. Dubrawski awd@andrew.cmu.edu Dhivya Eswaran* deswaran@cs.cmu.edu Christos Faloutsos christos@cs.cmu.edu

More information

Noisy-OR Models with Latent Confounding

Noisy-OR Models with Latent Confounding Noisy-OR Models with Latent Confounding Antti Hyttinen HIIT & Dept. of Computer Science University of Helsinki Finland Frederick Eberhardt Dept. of Philosophy Washington University in St. Louis Missouri,

More information

Automatic Causal Discovery

Automatic Causal Discovery Automatic Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour Dept. of Philosophy & CALD Carnegie Mellon 1 Outline 1. Motivation 2. Representation 3. Discovery 4. Using Regression for Causal

More information

Predicting causal effects in large-scale systems from observational data

Predicting causal effects in large-scale systems from observational data nature methods Predicting causal effects in large-scale systems from observational data Marloes H Maathuis 1, Diego Colombo 1, Markus Kalisch 1 & Peter Bühlmann 1,2 Supplementary figures and text: Supplementary

More information

MEASURING THE COMPLEXITY AND IMPACT OF DESIGN CHANGES

MEASURING THE COMPLEXITY AND IMPACT OF DESIGN CHANGES Presentation Paper Bio Return to Main Menu P R E S E N T A T I O N W4 Wednesday, March 8, 2000 11:00AM MEASURING THE COMPLEXITY AND IMPACT OF DESIGN CHANGES Mike Libassi Intel Corporation International

More information

An Introduction to Reversible Jump MCMC for Bayesian Networks, with Application

An Introduction to Reversible Jump MCMC for Bayesian Networks, with Application An Introduction to Reversible Jump MCMC for Bayesian Networks, with Application, CleverSet, Inc. STARMAP/DAMARS Conference Page 1 The research described in this presentation has been funded by the U.S.

More information

Causality in Econometrics (3)

Causality in Econometrics (3) Graphical Causal Models References Causality in Econometrics (3) Alessio Moneta Max Planck Institute of Economics Jena moneta@econ.mpg.de 26 April 2011 GSBC Lecture Friedrich-Schiller-Universität Jena

More information

Bayesian Discovery of Linear Acyclic Causal Models

Bayesian Discovery of Linear Acyclic Causal Models Bayesian Discovery of Linear Acyclic Causal Models Patrik O. Hoyer Helsinki Institute for Information Technology & Department of Computer Science University of Helsinki Finland Antti Hyttinen Helsinki

More information

Causal Inference on Data Containing Deterministic Relations.

Causal Inference on Data Containing Deterministic Relations. Causal Inference on Data Containing Deterministic Relations. Jan Lemeire Kris Steenhaut Sam Maes COMO lab, ETRO Department Vrije Universiteit Brussel, Belgium {jan.lemeire, kris.steenhaut}@vub.ac.be, sammaes@gmail.com

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1 Topic -2 Probability Larson & Farber, Elementary Statistics: Picturing the World, 3e 1 Probability Experiments Experiment : An experiment is an act that can be repeated under given condition. Rolling a

More information

Introduction to ecosystem modelling Stages of the modelling process

Introduction to ecosystem modelling Stages of the modelling process NGEN02 Ecosystem Modelling 2018 Introduction to ecosystem modelling Stages of the modelling process Recommended reading: Smith & Smith Environmental Modelling, Chapter 2 Models in science and research

More information

Revisiting Software Development Effort Estimation Based on Early Phase Development Activities

Revisiting Software Development Effort Estimation Based on Early Phase Development Activities Revisiting Software Development Effort Estimation Based on Early Phase Development Activities Masateru Tsunoda Toyo University Saitama, Japan tsunoda@ieee.org Koji Toda Fukuoka Institute of Technology

More information

Bayesian causal forests: dealing with regularization induced confounding and shrinking towards homogeneous effects

Bayesian causal forests: dealing with regularization induced confounding and shrinking towards homogeneous effects Bayesian causal forests: dealing with regularization induced confounding and shrinking towards homogeneous effects P. Richard Hahn, Jared Murray, and Carlos Carvalho July 29, 2018 Regularization induced

More information

AN APPROACH TO FIND THE TRANSITION PROBABILITIES IN MARKOV CHAIN FOR EARLY PREDICTION OF SOFTWARE RELIABILITY

AN APPROACH TO FIND THE TRANSITION PROBABILITIES IN MARKOV CHAIN FOR EARLY PREDICTION OF SOFTWARE RELIABILITY International Journal of Latest Research in Science and Technology Volume 2, Issue 6: Page No.111-115,November-December 2013 http://www.mnkjournals.com/ijlrst.htm ISSN (Online):2278-5299 AN APPROACH TO

More information

Bayesian network modeling. 1

Bayesian network modeling.  1 Bayesian network modeling http://springuniversity.bc3research.org/ 1 Probabilistic vs. deterministic modeling approaches Probabilistic Explanatory power (e.g., r 2 ) Explanation why Based on inductive

More information

diluted treatment effect estimation for trigger analysis in online controlled experiments

diluted treatment effect estimation for trigger analysis in online controlled experiments diluted treatment effect estimation for trigger analysis in online controlled experiments Alex Deng and Victor Hu February 2, 2015 Microsoft outline Trigger Analysis and The Dilution Problem Traditional

More information

Time Series: Theory and Methods

Time Series: Theory and Methods Peter J. Brockwell Richard A. Davis Time Series: Theory and Methods Second Edition With 124 Illustrations Springer Contents Preface to the Second Edition Preface to the First Edition vn ix CHAPTER 1 Stationary

More information

Tutorial: Causal Model Search

Tutorial: Causal Model Search Tutorial: Causal Model Search Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Joe Ramsey, others 1 Goals 1) Convey rudiments of graphical causal models 2) Basic working knowledge

More information

A523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011

A523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011 A523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011 Lecture 1 Organization:» Syllabus (text, requirements, topics)» Course approach (goals, themes) Book: Gregory, Bayesian

More information

Causal Graphical Models in Systems Genetics

Causal Graphical Models in Systems Genetics 1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation

More information

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009 Statistics for Particle Physics Kyle Cranmer New York University 91 Remaining Lectures Lecture 3:! Compound hypotheses, nuisance parameters, & similar tests! The Neyman-Construction (illustrated)! Inverted

More information

Towards an extension of the PC algorithm to local context-specific independencies detection

Towards an extension of the PC algorithm to local context-specific independencies detection Towards an extension of the PC algorithm to local context-specific independencies detection Feb-09-2016 Outline Background: Bayesian Networks The PC algorithm Context-specific independence: from DAGs to

More information

Learning the Structure of Linear Latent Variable Models

Learning the Structure of Linear Latent Variable Models Journal (2005) - Submitted /; Published / Learning the Structure of Linear Latent Variable Models Ricardo Silva Center for Automated Learning and Discovery School of Computer Science rbas@cs.cmu.edu Richard

More information

Bioinformatics: Biology X

Bioinformatics: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA Model Building/Checking, Reverse Engineering, Causality Outline 1 2 Main theses Outline There are seven main propositions in the

More information

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology FE670 Algorithmic Trading Strategies Lecture 3. Factor Models and Their Estimation Steve Yang Stevens Institute of Technology 09/12/2012 Outline 1 The Notion of Factors 2 Factor Analysis via Maximum Likelihood

More information

Discovering Geographical Topics in Twitter

Discovering Geographical Topics in Twitter Discovering Geographical Topics in Twitter Liangjie Hong, Lehigh University Amr Ahmed, Yahoo! Research Alexander J. Smola, Yahoo! Research Siva Gurumurthy, Twitter Kostas Tsioutsiouliklis, Twitter Overview

More information

Discussion on Fygenson (2007, Statistica Sinica): a DS Perspective

Discussion on Fygenson (2007, Statistica Sinica): a DS Perspective 1 Discussion on Fygenson (2007, Statistica Sinica): a DS Perspective Chuanhai Liu Purdue University 1. Introduction In statistical analysis, it is important to discuss both uncertainty due to model choice

More information

Causal Discovery by Computer

Causal Discovery by Computer Causal Discovery by Computer Clark Glymour Carnegie Mellon University 1 Outline 1. A century of mistakes about causation and discovery: 1. Fisher 2. Yule 3. Spearman/Thurstone 2. Search for causes is statistical

More information

Nonparametric Inference and the Dark Energy Equation of State

Nonparametric Inference and the Dark Energy Equation of State Nonparametric Inference and the Dark Energy Equation of State Christopher R. Genovese Peter E. Freeman Larry Wasserman Department of Statistics Carnegie Mellon University http://www.stat.cmu.edu/ ~ genovese/

More information

Residential Showering Time-of-Day Analysis

Residential Showering Time-of-Day Analysis Residential Showering Time-of-Day Analysis An Independent MaP Research Report January 2017 Authors Bill Gauley, P.Eng., Principal, Gauley Associates Ltd. John Koeller, P.E., Principal, Koeller & Company

More information

Treatment of Expert Opinion Diversity in Bayesian Belief Network Model for Nuclear Digital I&C Safety Software Reliability Assessment

Treatment of Expert Opinion Diversity in Bayesian Belief Network Model for Nuclear Digital I&C Safety Software Reliability Assessment Treatment of Expert Opinion Diversity in Bayesian Belief Network Model for Nuclear Digital I&C Safety Software Reliability Assessment 20 International Topical Meeting on Probabilistic Safety Assessment

More information

A historical perspective on fine tuning: lessons from causal set cosmology

A historical perspective on fine tuning: lessons from causal set cosmology A historical perspective on fine tuning: lessons from causal set cosmology Fay Dowker Oxford th December 01 Rafael Sorkin, Graham Brightwell, Djamel Dou, Fay Dowker Raquel Garcia, Joe Henson, Xavier Martin,

More information

Human-level concept learning through probabilistic program induction

Human-level concept learning through probabilistic program induction B.M Lake, R. Salakhutdinov, J.B. Tenenbaum Human-level concept learning through probabilistic program induction journal club at two aspects in which machine learning spectacularly lags behind human learning

More information

PEARL VS RUBIN (GELMAN)

PEARL VS RUBIN (GELMAN) PEARL VS RUBIN (GELMAN) AN EPIC battle between the Rubin Causal Model school (Gelman et al) AND the Structural Causal Model school (Pearl et al) a cursory overview Dokyun Lee WHO ARE THEY? Judea Pearl

More information

Predicting Faults Using the Complexity of Code Change

Predicting Faults Using the Complexity of Code Change Predicting Faults Using the Complexity of Code Change International Conference on Software Engineering (2009) Ahmed E. Hassan 2013-07-09 Yoo Jin Lim Introduction Background Code Change Models Case Study

More information

Crowdsourcing via Tensor Augmentation and Completion (TAC)

Crowdsourcing via Tensor Augmentation and Completion (TAC) Crowdsourcing via Tensor Augmentation and Completion (TAC) Presenter: Yao Zhou joint work with: Dr. Jingrui He - 1 - Roadmap Background Related work Crowdsourcing based on TAC Experimental results Conclusion

More information

CSCI 5822 Probabilistic Model of Human and Machine Learning. Mike Mozer University of Colorado

CSCI 5822 Probabilistic Model of Human and Machine Learning. Mike Mozer University of Colorado CSCI 5822 Probabilistic Model of Human and Machine Learning Mike Mozer University of Colorado Topics Language modeling Hierarchical processes Pitman-Yor processes Based on work of Teh (2006), A hierarchical

More information

Un nouvel algorithme de génération des itemsets fermés fréquents

Un nouvel algorithme de génération des itemsets fermés fréquents Un nouvel algorithme de génération des itemsets fermés fréquents Huaiguo Fu CRIL-CNRS FRE2499, Université d Artois - IUT de Lens Rue de l université SP 16, 62307 Lens cedex. France. E-mail: fu@cril.univ-artois.fr

More information

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla. OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) Statistical vs. Causal vs. Counterfactual inference: syntax and semantics

More information

Job Training Partnership Act (JTPA)

Job Training Partnership Act (JTPA) Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training

More information

British American Tobacco Group Research & Development. Method - Determination of phenols in mainstream cigarette smoke

British American Tobacco Group Research & Development. Method - Determination of phenols in mainstream cigarette smoke British American Tobacco Group Research & Development Method - Determination of phenols in mainstream cigarette smoke 1 SCOPE OF APPLICATION The method is applicable to the quantitative determination of

More information

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching Mayya Komisarchik Harvard University April 13, 2016 1 Heartfelt thanks to all of the Gov 2001 TFs of yesteryear; this section draws heavily

More information

Generative Techniques: Bayes Rule and the Axioms of Probability

Generative Techniques: Bayes Rule and the Axioms of Probability Intelligent Systems: Reasoning and Recognition James L. Crowley ENSIMAG 2 / MoSIG M1 Second Semester 2016/2017 Lesson 8 3 March 2017 Generative Techniques: Bayes Rule and the Axioms of Probability Generative

More information

MTAT Software Engineering

MTAT Software Engineering MTAT.03.094 Software Engineering Lecture 14: Measurement Dietmar Pfahl Fall 2015 email: dietmar.pfahl@ut.ee Schedule of Lectures Week 01: Introduction to SE Week 02: Requirements Engineering I Week 03:

More information

CAUSALITY AND EXOGENEITY IN ECONOMETRICS

CAUSALITY AND EXOGENEITY IN ECONOMETRICS EDITORS INTRODUCTION CAUSALITY AND EXOGENEITY IN ECONOMETRICS Luc Bauwens CORE and Department of Economics, Université catholique de Louvain bauwens@core.ucl.ac.be H. Peter Boswijk Department of Quantitative

More information

MATCHING FOR EE AND DR IMPACTS

MATCHING FOR EE AND DR IMPACTS MATCHING FOR EE AND DR IMPACTS Seth Wayland, Opinion Dynamics August 12, 2015 A Proposal Always use matching Non-parametric preprocessing to reduce model dependence Decrease bias and variance Better understand

More information

From Causality, Second edition, Contents

From Causality, Second edition, Contents From Causality, Second edition, 2009. Preface to the First Edition Preface to the Second Edition page xv xix 1 Introduction to Probabilities, Graphs, and Causal Models 1 1.1 Introduction to Probability

More information

CAUSAL GAN: LEARNING CAUSAL IMPLICIT GENERATIVE MODELS WITH ADVERSARIAL TRAINING

CAUSAL GAN: LEARNING CAUSAL IMPLICIT GENERATIVE MODELS WITH ADVERSARIAL TRAINING CAUSAL GAN: LEARNING CAUSAL IMPLICIT GENERATIVE MODELS WITH ADVERSARIAL TRAINING (Murat Kocaoglu, Christopher Snyder, Alexandros G. Dimakis & Sriram Vishwanath, 2017) Summer Term 2018 Created for the Seminar

More information

Lecture 24, Causal Discovery

Lecture 24, Causal Discovery Lecture 24, Causal Discovery 36-402, Advanced Data Analysis 21 April 2011 Contents 1 Testing DAGs 1 2 Causal Discovery with Known Variables 3 2.1 Causal Discovery with Hidden Variables.............. 5

More information

ENVIRONMENTAL DATA ANALYSIS WILLIAM MENKE JOSHUA MENKE WITH MATLAB COPYRIGHT 2011 BY ELSEVIER, INC. ALL RIGHTS RESERVED.

ENVIRONMENTAL DATA ANALYSIS WILLIAM MENKE JOSHUA MENKE WITH MATLAB COPYRIGHT 2011 BY ELSEVIER, INC. ALL RIGHTS RESERVED. ENVIRONMENTAL DATA ANALYSIS WITH MATLAB WILLIAM MENKE PROFESSOR OF EARTH AND ENVIRONMENTAL SCIENCE COLUMBIA UNIVERSITY JOSHUA MENKE SOFTWARE ENGINEER JOM ASSOCIATES COPYRIGHT 2011 BY ELSEVIER, INC. ALL

More information

Sensitivity Analysis with Several Unmeasured Confounders

Sensitivity Analysis with Several Unmeasured Confounders Sensitivity Analysis with Several Unmeasured Confounders Lawrence McCandless lmccandl@sfu.ca Faculty of Health Sciences, Simon Fraser University, Vancouver Canada Spring 2015 Outline The problem of several

More information

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf 1 Introduction to Machine Learning Maximum Likelihood and Bayesian Inference Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf 2013-14 We know that X ~ B(n,p), but we do not know p. We get a random sample

More information

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon

Causal Discovery. Richard Scheines. Peter Spirtes, Clark Glymour, and many others. Dept. of Philosophy & CALD Carnegie Mellon Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour, and many others Dept. of Philosophy & CALD Carnegie Mellon Graphical Models --11/30/05 1 Outline 1. Motivation 2. Representation 3. Connecting

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 3 Probability Contents 1. Events, Sample Spaces, and Probability 2. Unions and Intersections 3. Complementary Events 4. The Additive Rule and Mutually Exclusive

More information

Chapter 4 Multi-factor Treatment Designs with Multiple Error Terms 93

Chapter 4 Multi-factor Treatment Designs with Multiple Error Terms 93 Contents Preface ix Chapter 1 Introduction 1 1.1 Types of Models That Produce Data 1 1.2 Statistical Models 2 1.3 Fixed and Random Effects 4 1.4 Mixed Models 6 1.5 Typical Studies and the Modeling Issues

More information

Design and implementation of a new meteorology geographic information system

Design and implementation of a new meteorology geographic information system Design and implementation of a new meteorology geographic information system WeiJiang Zheng, Bing. Luo, Zhengguang. Hu, Zhongliang. Lv National Meteorological Center, China Meteorological Administration,

More information

The Helicoidal Life Cycle A Tool for Software Development and Enhancement

The Helicoidal Life Cycle A Tool for Software Development and Enhancement The Helicoidal Life Cycle A Tool for Software Development and Enhancement Antonio Carlos Pinto Dias Alves Unidade Gestão de Riscos Banco do Brasil S.A. Rua Senador Dantas, 105, sala 1704 Centro - CEP 20031-201

More information

Construction of Quality Prediction Model based on Peer Review Performance Data

Construction of Quality Prediction Model based on Peer Review Performance Data Construction of Quality Prediction Model based on Peer Review Performance Data M. Komuro 1 and N. Komoda 2 1 Engineering Support Center, Hitachi Solutions, Ltd., Tokyo, Japan 2 Graduate School of Information

More information

COURSE CONTENT for Computer Science & Engineering [CSE]

COURSE CONTENT for Computer Science & Engineering [CSE] COURSE CONTENT for Computer Science & Engineering [CSE] 1st Semester 1 HU 101 English Language & Communication 2 1 0 3 3 2 PH 101 Engineering Physics 3 1 0 4 4 3 M 101 Mathematics 3 1 0 4 4 4 ME 101 Mechanical

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Contents. Part I: Fundamentals of Bayesian Inference 1

Contents. Part I: Fundamentals of Bayesian Inference 1 Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian

More information

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016 Bayesian Networks: Construction, Inference, Learning and Causal Interpretation Volker Tresp Summer 2016 1 Introduction So far we were mostly concerned with supervised learning: we predicted one or several

More information

K. Nishijima. Definition and use of Bayesian probabilistic networks 1/32

K. Nishijima. Definition and use of Bayesian probabilistic networks 1/32 The Probabilistic Analysis of Systems in Engineering 1/32 Bayesian probabilistic bili networks Definition and use of Bayesian probabilistic networks K. Nishijima nishijima@ibk.baug.ethz.ch 2/32 Today s

More information

Causal Inference. Prediction and causation are very different. Typical questions are:

Causal Inference. Prediction and causation are very different. Typical questions are: Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Yuriy Sverchkov Intelligent Systems Program University of Pittsburgh October 6, 2011 Outline Latent Semantic Analysis (LSA) A quick review Probabilistic LSA (plsa)

More information

Knowledge-based and Expert Systems - 1. Knowledge-based and Expert Systems - 2. Knowledge-based and Expert Systems - 4

Knowledge-based and Expert Systems - 1. Knowledge-based and Expert Systems - 2. Knowledge-based and Expert Systems - 4 Knowledge-based and Expert Systems - 1 Knowledge-based and Expert Systems - 2 Expert systems (ES) provide expert quality advice, diagnosis or recommendations. ES solve real problems which normally would

More information

At A Glance. UQ16 Mobile App.

At A Glance. UQ16 Mobile App. At A Glance UQ16 Mobile App Scan the QR code with any QR reader and download the TripBuilder EventMobile app to your iphone, ipad, itouch or Android mobile device. To access the app or the HTML 5 version,

More information

Proofs Propositions and Calculuses

Proofs Propositions and Calculuses Lecture 2 CS 1813 Discrete Mathematics Proofs Propositions and Calculuses 1 City of Königsberg (Kaliningrad) 2 Bridges of Königsberg Problem Find a route that crosses each bridge exactly once Must the

More information