Understanding DWAs in First Nations Systems A Data Mining Approach

Size: px
Start display at page:

Download "Understanding DWAs in First Nations Systems A Data Mining Approach"

Transcription

1 Understanding DWAs in First Nations Systems A Data Mining Approach Emma Thompson E. McBean, Y. Post

2 65 percent of First Nations experienced a Drinking Water Advisory (DWA) between

3 The federal government has committed to end long-term boil water advisories on reserves within five years by investing $1.8 billion

4 Overview Objectives Methods Data Mining Overview of Data Methodology Models Occurrence of DWAs Duration of DWAs Frequency of DWAs Conclusions Next steps

5 Objectives To analyze historical data using data mining techniques to provide insight and inform decision makers Aim is to identify key attributes associated with the occurrence, duration and frequency of DWAs using decision tree analysis

6 Methods

7 Data Compilation Data for 800 drinking water systems from AANDC Roll-Up reports Data for 1,500 individual DWA events from Health Canada and provincial records System Info Source Type Population Certification Remoteness etc. Province Band Name System Name Community DWA Events Start Date End Date Reason

8 Data Mining An anlytical tool used to learn new information from a large database Find correlations or patterns This research used Decision Tree Analysis in RapidMiner Studio Basic

9 Decision Trees Generated by repeatedly splitting the data according to input attributes (recursive partitioning) Creates a classification model that predicts the value of the target attribute based on input attributes Target Attributes: Occurrence Frequency Duration Will a DWA occur? Will a DWA re-occur? How long will a DWA last?

10 Number of Systems Frequency Data Frequency of DWAs in 1 system ranged from 0 44 times Half of all systems have DWA 1 time Frequency (# times)

11 Number of DWA Events Duration Data Duration ranges from 1 3,277 days (9 years) Half of all events last <14 days Duration of DWA (days)

12 Input attributes Attribute Province Water Source Type System Age Treatment Class Operator Certification Environmental Class Geographic/Remoteness Zone Distribution Pipe Length Number of Homes Trucked Description Location Surface water (SW), groundwater (GW), GUDI Based on construction date Based on complexity (Level I, II, III, Small system) Operator class meets treatment class requirement Based on latitude Distance and access to nearest service centre Total length in metres Number of homes with water delivered

13 Validation Decision trees were validated using random subsets of the data to test prediction accuracy. Model parameters were optimized to obtain trees with highest prediction accuracy, in this case 60-70%. Frequency Tree: Accuracy = 70.08% +/- 5.55% True (No) True (Yes) Prediction (No) Prediction (Yes)

14 Models

15 Occurrence Will a DWA occur in a given facility? YES NO AB # Homes Trucked? 82 > 82 # Homes Piped? 10 > 10 Province? Atlantic BC MB ON QC SK Source? Pipe Length? Pipe Length? GW Population? GUDI # Homes SW Piped? MTA Unknown 400m > 400m Population? Unknown > 6,544m 6,544m 205 > > > 72 Yukon

16 Frequency Will a DWA occur more than once? >1 time 1 time AB Atlantic Max Daily Volume? 254m 3 /d > 254m 3 /d System 23 Age? > 23 BC Max Daily Volume? Unknown 175m 3 /d > 175m 3 /d # Homes Piped? 61 > 61 Province? MB Source? SW GW # Homes Trucked? 54 > 54 ON QC SK Yukon Max Daily Volume? Population? Unknown 71m 3 /d > 71m 3 /d 322 > 322 Max Daily Volume? 579m 3 /d > 579m 3 /d

17 Number of Systems Occurrence & Frequency Trends in occurrence and frequency vary by province Distribution properties including pipe length, maximum daily volume and population served are key attributes DWA = No Yes BC AB SK MB ON QC Atl. YK Province

18 Number of Systems Occurrence & Frequency Source Type is also important Surface water and GUDI systems and are more likely to have re-occurring DWAs than groundwater systems time > 1 time GW GUDI SW Source Type

19 Duration Will a DWA last longer than 2 weeks? >2 wks 2 wks Not Required Province? Alberta Atlantic BC 1 st Operator Treatment Certified? No No Operator Not req. Level Treatment Class? # Homes Piped? None Level I Level II Level III Small System 233 > 233 Pipe Length? Population? # Homes Trucked? 4199m > 4199m 495 > > 3 Yes Pipe Length/ Connection? 30m > 30m Pipe Length? 2 nd Op. Treatment Certified? 430m > 430m Not req. No 2 nd Op. No Not req. level Yes

20 Number of DWA Events Duration Operator certification influences duration of DWAs DWAs in systems without a trained operator are more likely to last longer than 2 weeks weeks > 2 weeks No Operator Not Required Not Certified Not Required Level Fully Certified Primary Operator Treatment Certification

21 Conclusions Province and operator certification are key attributes associated with the occurrence, duration and frequency of DWAs Drinking water problems are typically resolved more quickly when operators are trained. Decision trees are a powerful tool that can be used to investigate historical trends and predict likely outcomes.

22 Next Steps Further investigation is required to understand provincial differences, as First Nations are federally regulated. Data gaps and inconsistencies exist in system information at the time of each DWA event. An up-to-date nation-wide database would provide more accurate results

23 Questions?

24 References Government of Canada Budget Health Canada Information request by CBC. Neegan Burnside National Assessment of First Nations Water and Wastewater Systems - National Roll-Up Report. RapidMiner GmbH Studio Basic Ed

Getting Biodiversity Data

Getting Biodiversity Data Getting Biodiversity Data NatureServe Canada Douglas Hyde Executive Director Value of biodiversity data to business? Reasons vary depending on the business Reduce development uncertainty Integrated views

More information

Decision T ree Tree Algorithm Week 4 1

Decision T ree Tree Algorithm Week 4 1 Decision Tree Algorithm Week 4 1 Team Homework Assignment #5 Read pp. 105 117 of the text book. Do Examples 3.1, 3.2, 3.3 and Exercise 3.4 (a). Prepare for the results of the homework assignment. Due date

More information

An overview of the applications for early warning and mapping of the flood events in New Brunswick

An overview of the applications for early warning and mapping of the flood events in New Brunswick Flood Recovery, Innovation and Reponse IV 239 An overview of the applications for early warning and mapping of the flood events in New Brunswick D. Mioc 1, E. McGillivray 2, F. Anton 1, M. Mezouaghi 2,

More information

Geographic Locations Survey of Clinical Psychologists in Canada

Geographic Locations Survey of Clinical Psychologists in Canada Geographic Locations Survey of Clinical Psychologists in Canada A publication of the Canadian Psychological Association. 1999 To order print copies, click here! Print copies of this CPA document are available

More information

SOUTH DAKOTA BOARD OF REGENTS. Academic and Student Affairs ******************************************************************************

SOUTH DAKOTA BOARD OF REGENTS. Academic and Student Affairs ****************************************************************************** SOUTH DAKOTA BOARD OF REGENTS Academic and Student Affairs AGENDA ITEM: 7 C (4) DATE: June 28-30, 2016 ****************************************************************************** SUBJECT: New Minor:

More information

Project Plan for the City of Philadelphia Pole and Pole Attachment Geodatabase Design Project

Project Plan for the City of Philadelphia Pole and Pole Attachment Geodatabase Design Project Project Plan for the City of Philadelphia Pole and Pole Attachment Geodatabase Design Project Project Overview: The City of Philadelphia is experiencing data integrity problems caused by data format issues.

More information

Climate Trends and Variations Bulletin Winter

Climate Trends and Variations Bulletin Winter Climate Trends and Variations Bulletin Winter 2014 2015 This bulletin summarizes recent climate data and presents it in a historical context. It first examines the national average temperature for the

More information

The Quadratic Entropy Approach to Implement the Id3 Decision Tree Algorithm

The Quadratic Entropy Approach to Implement the Id3 Decision Tree Algorithm Journal of Computer Science and Information Technology December 2018, Vol. 6, No. 2, pp. 23-29 ISSN: 2334-2366 (Print), 2334-2374 (Online) Copyright The Author(s). All Rights Reserved. Published by American

More information

Chemistry Provincial Level CHEM 090 Adult Education/Adult Upgrading Program. Course Outline

Chemistry Provincial Level CHEM 090 Adult Education/Adult Upgrading Program. Course Outline Chemistry Provincial Level CHEM 090 Adult Education/Adult Upgrading Program Course Outline COURSE IMPLEMENTATION DATE: Pre 1998 OUTLINE EFFECTIVE DATE: January 2017 COURSE OUTLINE REVIEW DATE: September

More information

EECS 349:Machine Learning Bryan Pardo

EECS 349:Machine Learning Bryan Pardo EECS 349:Machine Learning Bryan Pardo Topic 2: Decision Trees (Includes content provided by: Russel & Norvig, D. Downie, P. Domingos) 1 General Learning Task There is a set of possible examples Each example

More information

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag Decision Trees Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Supervised Learning Input: labelled training data i.e., data plus desired output Assumption:

More information

GEOMATICS SURVEYING AND MAPPING EXPERTS FOR OVER 35 YEARS

GEOMATICS SURVEYING AND MAPPING EXPERTS FOR OVER 35 YEARS GEOMATICS SURVEYING AND MAPPING EXPERTS FOR OVER 35 YEARS 2 GEOMATICS AND SURVEYING SOLUTIONS SPANNING THE ENTIRE PROJECT LIFECYCLE 1,000+ 250+ 24 Surveying professionals Active field crews Geomatics offices

More information

Chemistry Advanced Level - CHEM 080 Access Education/Upgrading for Academic and Career Entry. Course Outline

Chemistry Advanced Level - CHEM 080 Access Education/Upgrading for Academic and Career Entry. Course Outline Chemistry Advanced Level - CHEM 080 Access Education/Upgrading for Academic and Career Entry Course Outline COURSE IMPLEMENTATION DATE: Pre 1998 OUTLINE EFFECTIVE DATE: September 2017 COURSE OUTLINE REVIEW

More information

Data Mining Project. C4.5 Algorithm. Saber Salah. Naji Sami Abduljalil Abdulhak

Data Mining Project. C4.5 Algorithm. Saber Salah. Naji Sami Abduljalil Abdulhak Data Mining Project C4.5 Algorithm Saber Salah Naji Sami Abduljalil Abdulhak Decembre 9, 2010 1.0 Introduction Before start talking about C4.5 algorithm let s see first what is machine learning? Human

More information

Classification Using Decision Trees

Classification Using Decision Trees Classification Using Decision Trees 1. Introduction Data mining term is mainly used for the specific set of six activities namely Classification, Estimation, Prediction, Affinity grouping or Association

More information

Bringing Earth Science to Life

Bringing Earth Science to Life Bringing Earth Science to Life Earth History Geomorphology Surface Processes Soils Rocks Minerals Tectonics Using Natural Resources Careers www.edgeo.org In partnership with: Students investigate the

More information

Rule Generation using Decision Trees

Rule Generation using Decision Trees Rule Generation using Decision Trees Dr. Rajni Jain 1. Introduction A DT is a classification scheme which generates a tree and a set of rules, representing the model of different classes, from a given

More information

Decision Trees. Gavin Brown

Decision Trees. Gavin Brown Decision Trees Gavin Brown Every Learning Method has Limitations Linear model? KNN? SVM? Explain your decisions Sometimes we need interpretable results from our techniques. How do you explain the above

More information

Canada s Experience with Chemicals Assessment and Management and its Application to Nanomaterials

Canada s Experience with Chemicals Assessment and Management and its Application to Nanomaterials Canada s Experience with Chemicals Assessment and Management and its Application to Nanomaterials European Chemicals Agency (ECHA) Topical Scientific Workshop: Regulatory Challenges in Risk Assessment

More information

The Canadian Ceoscience Knowledge Network. - A Collaborative Effort for Unified Access to Ceoscience Data

The Canadian Ceoscience Knowledge Network. - A Collaborative Effort for Unified Access to Ceoscience Data The Canadian Ceoscience Knowledge Network - A Collaborative Effort for Unified Access to Ceoscience Data The Canadian Geoscience Knowledge Network A Collaborative Effort for Unified Access to Geoscience

More information

CS 543 Page 1 John E. Boon, Jr.

CS 543 Page 1 John E. Boon, Jr. CS 543 Machine Learning Spring 2010 Lecture 05 Evaluating Hypotheses I. Overview A. Given observed accuracy of a hypothesis over a limited sample of data, how well does this estimate its accuracy over

More information

Decision Trees. CS57300 Data Mining Fall Instructor: Bruno Ribeiro

Decision Trees. CS57300 Data Mining Fall Instructor: Bruno Ribeiro Decision Trees CS57300 Data Mining Fall 2016 Instructor: Bruno Ribeiro Goal } Classification without Models Well, partially without a model } Today: Decision Trees 2015 Bruno Ribeiro 2 3 Why Trees? } interpretable/intuitive,

More information

Enhancing Parcel Data In Colleton County. February 10, 2009

Enhancing Parcel Data In Colleton County. February 10, 2009 Enhancing Parcel Data In Colleton County GIS & CAMA Conference February 10, 2009 Introductions Bruce T. Harper Technology Director Colleton County, SC Bill Wetzel National GIS Account Manager The Sidwell

More information

CS6375: Machine Learning Gautam Kunapuli. Decision Trees

CS6375: Machine Learning Gautam Kunapuli. Decision Trees Gautam Kunapuli Example: Restaurant Recommendation Example: Develop a model to recommend restaurants to users depending on their past dining experiences. Here, the features are cost (x ) and the user s

More information

Public Awareness and Pipeline Safety

Public Awareness and Pipeline Safety Public Awareness and Pipeline Safety Pipeline Safety Trust November 15, 2007 Dan Kirschner Executive Director 5335 SW Meadows Rd., #220 Lake Oswego, OR 97035 (503) 624-2160 www.nwga.org NWGA Members: Avista

More information

International Journal of Computing and Business Research (IJCBR) ISSN (Online) : APPLICATION OF GIS IN HEALTHCARE MANAGEMENT

International Journal of Computing and Business Research (IJCBR) ISSN (Online) : APPLICATION OF GIS IN HEALTHCARE MANAGEMENT International Journal of Computing and Business Research (IJCBR) ISSN (Online) : 2229-6166 Volume 3 Issue 2 May 2012 APPLICATION OF GIS IN HEALTHCARE MANAGEMENT Dr. Ram Shukla, Faculty (Operations Area),

More information

Accessibility-Remoteness (A-R) Index Summary Paper

Accessibility-Remoteness (A-R) Index Summary Paper Accessibility-Remoteness (A-R) Index Summary Paper Newfoundland & Labrador Statistics Agency February 2014 The Accessibility-Remoteness (A-R) index was developed by the Newfoundland and Labrador Statistics

More information

Data Mining Prof. Pabitra Mitra Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur

Data Mining Prof. Pabitra Mitra Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Data Mining Prof. Pabitra Mitra Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Lecture 21 K - Nearest Neighbor V In this lecture we discuss; how do we evaluate the

More information

Impact Policies Enabling Value Enhancement of Geospatial Information in Canadian Economy and Society

Impact Policies Enabling Value Enhancement of Geospatial Information in Canadian Economy and Society 1 Impact Policies Enabling Value Enhancement of Geospatial Information in Canadian Economy and Society May 26, 2015 Prashant Shukle, Director General Canada Centre for Mapping and Earth Observation Increased

More information

ROLLING RIVER SCHOOL DIVISION POLICY

ROLLING RIVER SCHOOL DIVISION POLICY ROLLING RIVER SCHOOL DIVISION POLICY Transportation Storm EBCD/P The School Division is responsible for the safe transportation of students on school buses. The Division shall exercise due care and caution

More information

Data Mining Classification: Basic Concepts and Techniques. Lecture Notes for Chapter 3. Introduction to Data Mining, 2nd Edition

Data Mining Classification: Basic Concepts and Techniques. Lecture Notes for Chapter 3. Introduction to Data Mining, 2nd Edition Data Mining Classification: Basic Concepts and Techniques Lecture Notes for Chapter 3 by Tan, Steinbach, Karpatne, Kumar 1 Classification: Definition Given a collection of records (training set ) Each

More information

BIM-GIS Oriented Inteligente Knowledge Discovery

BIM-GIS Oriented Inteligente Knowledge Discovery BIM-GIS Oriented Inteligente Knowledge Discovery H. Kiavarz 1, M. Jadidi 1, A. Rajabifard 2, G. Sohn 1 1 Geomatics Engineering, Department of Earth & Space Science & Engineering, York University, Toronto,

More information

RETA 6422: Mainstreaming Environment for Poverty Reduction Category 2 Subproject

RETA 6422: Mainstreaming Environment for Poverty Reduction Category 2 Subproject RETA 6422: Mainstreaming Environment for Poverty Reduction Category 2 Subproject A. Basic Data 1. Subproject Title: Poverty-Environment Mapping to Support Decision Making 2. Country Director: Adrian Ruthenberg

More information

Predictive Modeling: Classification. KSE 521 Topic 6 Mun Yi

Predictive Modeling: Classification. KSE 521 Topic 6 Mun Yi Predictive Modeling: Classification Topic 6 Mun Yi Agenda Models and Induction Entropy and Information Gain Tree-Based Classifier Probability Estimation 2 Introduction Key concept of BI: Predictive modeling

More information

Putting the U.S. Geospatial Services Industry On the Map

Putting the U.S. Geospatial Services Industry On the Map Putting the U.S. Geospatial Services Industry On the Map December 2012 Definition of geospatial services and the focus of this economic study Geospatial services Geospatial services industry Allow consumers,

More information

Decision Trees. Each internal node : an attribute Branch: Outcome of the test Leaf node or terminal node: class label.

Decision Trees. Each internal node : an attribute Branch: Outcome of the test Leaf node or terminal node: class label. Decision Trees Supervised approach Used for Classification (Categorical values) or regression (continuous values). The learning of decision trees is from class-labeled training tuples. Flowchart like structure.

More information

American Chemical Society Fall 2011 National Meeting & Exposition

American Chemical Society Fall 2011 National Meeting & Exposition American Chemical Society Fall 2011 National Meeting & Exposition Overview of Issues in Aquatic Exposure Modeling in the US EPA Office of Pesticide Programs, Environmental Fate and Effects Division Donald

More information

Water Information Portal User Guide. Updated July 2014

Water Information Portal User Guide. Updated July 2014 Water Information Portal User Guide Updated July 2014 1. ENTER THE WATER INFORMATION PORTAL Launch the Water Information Portal in your internet browser via http://www.bcogc.ca/public-zone/water-information

More information

Is the laboratory s pledge or declaration of the quality of the results produced. to produce data compliant with the Safe Drinking Water Act (SDWA)

Is the laboratory s pledge or declaration of the quality of the results produced. to produce data compliant with the Safe Drinking Water Act (SDWA) QA/QC Is the laboratory s pledge or declaration of the quality of the results produced. to produce data compliant with the Safe Drinking Water Act (SDWA) Is a description of the policies, procedures, techniques

More information

WMO Aeronautical Meteorology Scientific Conference 2017

WMO Aeronautical Meteorology Scientific Conference 2017 Session 1 Science underpinning meteorological observations, forecasts, advisories and warnings 1.6 Observation, nowcast and forecast of future needs 1.6.1 Advances in observing methods and use of observations

More information

Exercises NP-completeness

Exercises NP-completeness Exercises NP-completeness Exercise 1 Knapsack problem Consider the Knapsack problem. We have n items, each with weight a j (j = 1,..., n) and value c j (j = 1,..., n) and an integer B. All a j and c j

More information

Data Mining. CS57300 Purdue University. Bruno Ribeiro. February 8, 2018

Data Mining. CS57300 Purdue University. Bruno Ribeiro. February 8, 2018 Data Mining CS57300 Purdue University Bruno Ribeiro February 8, 2018 Decision trees Why Trees? interpretable/intuitive, popular in medical applications because they mimic the way a doctor thinks model

More information

Costs and Benefits of Geological Mapping Contributions of Subhash Bhagwat. Illinois, Kentucky, Spain, and Nevada

Costs and Benefits of Geological Mapping Contributions of Subhash Bhagwat. Illinois, Kentucky, Spain, and Nevada Costs and Benefits of Geological Mapping Contributions of Subhash Bhagwat Illinois, Kentucky, Spain, and Nevada Geology-for- Planning Boone and Winnebago Counties Seasoned Mapping and Derived Benefits

More information

Volcanic Sulphur Dioxide

Volcanic Sulphur Dioxide Volcanic Sulphur Dioxide Overview Background & context Claire Witham VAAC SO 2 forecast demonstration Dov Bensimon Rolls Royce work on SO2 Rory Clarkson New capabilities to remotely sense SO2 - Marcel

More information

DUNNEDIN VENTURES INC.

DUNNEDIN VENTURES INC. . PST Kimberlite First Diamond Results November 2015 1 FORWARD LOOKING STATEMENT Except for historical information contained herein, this presentation may contain forward-looking statements including but

More information

PVT Course for Oil and Gas Professionals

PVT Course for Oil and Gas Professionals PVT Course for Oil and Gas Professionals The Instructor Overview Avada Oil and Gas is commitment to raising the bar for postgraduate learning. A student receiving a certificate of completion from us, has

More information

QualiMET 2.0. The new Quality Control System of Deutscher Wetterdienst

QualiMET 2.0. The new Quality Control System of Deutscher Wetterdienst QualiMET 2.0 The new Quality Control System of Deutscher Wetterdienst Reinhard Spengler Deutscher Wetterdienst Department Observing Networks and Data Quality Assurance of Meteorological Data Michendorfer

More information

Probability Basics. Part 3: Types of Probability. INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder

Probability Basics. Part 3: Types of Probability. INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder Probability Basics Part 3: Types of Probability INFO-1301, Quantitative Reasoning 1 University of Colorado Boulder September 30, 2016 Prof. Michael Paul Prof. William Aspray Example A large government

More information

Terms of Reference for the Comparative Environmental Review (CER) of. Options for the Mactaquac Project, Mactaquac, New Brunswick

Terms of Reference for the Comparative Environmental Review (CER) of. Options for the Mactaquac Project, Mactaquac, New Brunswick Terms of Reference for the Comparative Environmental Review (CER) of Options for the Mactaquac Project, Mactaquac, New Brunswick Preamble The New Brunswick Power Corporation ( NB Power ) operates the Mactaquac

More information

STORAGE, HANDLING & SAFE USE OF CHEMICALS AND HAZARDOUS MATERIALS

STORAGE, HANDLING & SAFE USE OF CHEMICALS AND HAZARDOUS MATERIALS Training Title STORAGE, HANDLING & SAFE USE OF CHEMICALS AND HAZARDOUS MATERIALS Training Duration 5 days Training Venue and Dates REF Storage, Handling and Safe Use of Chemicals HS041 and Hazardous Materials

More information

USING HYPERSPECTRAL IMAGERY

USING HYPERSPECTRAL IMAGERY USING HYPERSPECTRAL IMAGERY AND LIDAR DATA TO DETECT PLANT INVASIONS 2016 ESRI CANADA SCHOLARSHIP APPLICATION CURTIS CHANCE M.SC. CANDIDATE FACULTY OF FORESTRY UNIVERSITY OF BRITISH COLUMBIA CURTIS.CHANCE@ALUMNI.UBC.CA

More information

Summary of Seasonal Normal Review Investigations. DESC 31 st March 2009

Summary of Seasonal Normal Review Investigations. DESC 31 st March 2009 Summary of Seasonal Normal Review Investigations DESC 31 st March 9 1 Introduction to the Seasonal Normal Review The relationship between weather and NDM demand is key to a number of critical processes

More information

Community and Infrastructure Services Committee

Community and Infrastructure Services Committee REPORT TO: DATE OF MEETING: November 7, 2016 Community and Infrastructure Services Committee SUBMITTED BY: Cynthia Fletcher, Interim Executive Director INS 519-741- PREPARED BY: WARD(S) INVOLVED: 2600

More information

PIOTR LUTYNSKI VANCOUVER, BRITISH COLUMBIA

PIOTR LUTYNSKI VANCOUVER, BRITISH COLUMBIA AN ASSESSMENT REPORT ON GROUND MAGNETIC SURVEYING CHUCHI PROPERTY FORT ST. JAMES AREA, BRITISH COLUMBIA OMINECA M.D. 55 17 N, 124 31 W NTS 93N/ 7&8 Claims Surveyed: 597976, 597878-597880 Survey Dates:

More information

Chapter 18. Decision Trees and Ensemble Learning. Recall: Learning Decision Trees

Chapter 18. Decision Trees and Ensemble Learning. Recall: Learning Decision Trees CSE 473 Chapter 18 Decision Trees and Ensemble Learning Recall: Learning Decision Trees Example: When should I wait for a table at a restaurant? Attributes (features) relevant to Wait? decision: 1. Alternate:

More information

Empirical Risk Minimization, Model Selection, and Model Assessment

Empirical Risk Minimization, Model Selection, and Model Assessment Empirical Risk Minimization, Model Selection, and Model Assessment CS6780 Advanced Machine Learning Spring 2015 Thorsten Joachims Cornell University Reading: Murphy 5.7-5.7.2.4, 6.5-6.5.3.1 Dietterich,

More information

} It is non-zero, and maximized given a uniform distribution } Thus, for any distribution possible, we have:

} It is non-zero, and maximized given a uniform distribution } Thus, for any distribution possible, we have: Review: Entropy and Information H(P) = X i p i log p i Class #04: Mutual Information & Decision rees Machine Learning (CS 419/519): M. Allen, 1 Sept. 18 } Entropy is the information gained on average when

More information

White-tailed Deer Winter Severity Index Volunteer Winter Weather Monitors Required

White-tailed Deer Winter Severity Index Volunteer Winter Weather Monitors Required Weather Monitoring White-tailed Deer Winter Severity Index Volunteer Winter Weather Monitors Required The Manitoba Wildlife Federation, in partnership with Manitoba Sustainable Development - Wildlife and

More information

Northern Alberta Institute of Technology

Northern Alberta Institute of Technology Northern Alberta Institute of Technology Alternative Energy Program Solar Photovoltaic Reference Array Report March 31, 2016 Goals Provide solar energy system educators, installers and adopters with real

More information

the tree till a class assignment is reached

the tree till a class assignment is reached Decision Trees Decision Tree for Playing Tennis Prediction is done by sending the example down Prediction is done by sending the example down the tree till a class assignment is reached Definitions Internal

More information

Parks Canada s Geomatics Infrastructure

Parks Canada s Geomatics Infrastructure Parks Canada s Geomatics Infrastructure Esri Canada User Conference Ottawa, Canada -- October 15, 2014 Presented By: Brock Fraser National Geomatics Coordinator, Parks Canada. Parks Canada Protects and

More information

Advanced Techniques for Mining Structured Data: Process Mining

Advanced Techniques for Mining Structured Data: Process Mining Advanced Techniques for Mining Structured Data: Process Mining Frequent Pattern Discovery /Event Forecasting Dr A. Appice Scuola di Dottorato in Informatica e Matematica XXXII Problem definition 1. Given

More information

RS Metrics CME Group Copper Futures Price Predictive Analysis Explained

RS Metrics CME Group Copper Futures Price Predictive Analysis Explained RS Metrics CME Group Copper Futures Price Predictive Analysis Explained Disclaimer ALL DATA REFERENCED IN THIS WHITE PAPER IS PROVIDED AS IS, AND RS METRICS DOES NOT MAKE AND HEREBY EXPRESSLY DISCLAIMS,

More information

Norm Hann Performance Manager, Performance Management Department Hydro One Networks Inc.

Norm Hann Performance Manager, Performance Management Department Hydro One Networks Inc. Norm Hann Performance Manager, Performance Management Department Hydro One Networks Inc. Phone: (416)-345-5407 Email: Norm.Hann@HydroOne.com Closing the Crevice Achieving A Process For Asset Performance

More information

HGP 470 GIS and Advanced Cartography for Social Science

HGP 470 GIS and Advanced Cartography for Social Science HGP 470 GIS and Advanced Cartography for Social Science Winter 2014 Instructor: Office: Tory 3-115 Telephone: 780-248-5758 E-mail: vukicevi@ualberta.ca Office hours: By appointment LECTURES AND LABS Lectures/Labs:

More information

This year s conference theme: Mining Geology through the value chain

This year s conference theme: Mining Geology through the value chain Benchmarking: Does it have a role in improving the performance of mining geology? AMC Consultants Pty Ltd Mark Berry, Principal Geologist This year s conference theme: Mining Geology through the value

More information

Using Winter Severity Indices for Winter Maintenance Performance Management. TRB Winter Maintenance Committee AHD65

Using Winter Severity Indices for Winter Maintenance Performance Management. TRB Winter Maintenance Committee AHD65 Using Winter Severity Indices for Winter Maintenance Performance Management TRB Winter Maintenance Committee AHD65 https://sites.google.com/site/trbcommitteeahd65/ Max Perchanok July 29 2015 Winter Maintenance

More information

GIS Geographic Information Systems

GIS Geographic Information Systems GIS Geographic Information Systems Connecting your Community Ruekert Mielke WAUKESHA WHO WE ARE DATE ESTABLISHED 1946 SERVING LOCAL PEOPLE. SOLVING LOCAL PROBLEMS. TYPE OF ORGANIZATION Ruekert & Mielke,

More information

Predicting MTA Bus Arrival Times in New York City

Predicting MTA Bus Arrival Times in New York City Predicting MTA Bus Arrival Times in New York City Man Geen Harold Li, CS 229 Final Project ABSTRACT This final project sought to outperform the Metropolitan Transportation Authority s estimated time of

More information

The North American Drought Monitor - The Canadian Perspective -

The North American Drought Monitor - The Canadian Perspective - The North American Drought Monitor - The Canadian Perspective - Trevor Hadwen National Agroclimate Information Service AAFC-PFRA, Regina Canmore, Alberta March 16-18, 2008 Background The NADM is a cooperative

More information

GIS Needs Assessment. for. The City of East Lansing

GIS Needs Assessment. for. The City of East Lansing GIS Needs Assessment for The City of East Lansing Prepared by: Jessica Moy and Richard Groop Center for Remote Sensing and GIS, Michigan State University February 24, 2000 Executive Summary At the request

More information

May 14, MRC Capacity Gap Analysis Preliminary Results

May 14, MRC Capacity Gap Analysis Preliminary Results May 14, 2018 MRC Capacity Gap Analysis Preliminary Results Overview Determine current perceptions of the MRC program in Massachusetts Examine desired outcomes (by region) of the MRC program Supplement

More information

2013 Weather Normalization Survey. Itron, Inc El Camino Real San Diego, CA

2013 Weather Normalization Survey. Itron, Inc El Camino Real San Diego, CA Itron, Inc. 11236 El Camino Real San Diego, CA 92130 2650 858 724 2620 March 2014 Weather normalization is the process of reconstructing historical energy consumption assuming that normal weather occurred

More information

Northern Dynasty Minerals Ltd.

Northern Dynasty Minerals Ltd. Northern Dynasty Minerals Ltd. 1020 800 W Pender St. Vancouver BC Canada V6C 2V6 Tel 604 684-6365 Fax 604 684-8092 Toll Free 1 800 667-2114 http://www.northerndynasty.com RESOURCE ESTIMATE FOR PEBBLE EAST

More information

General Ophthalmic Services Activity Statistics - England

General Ophthalmic Services Activity Statistics - England General Ophthalmic Services Activity Statistics - England Analysis of sight test patient eligibility data Copyright 2012, The Health and Social Care Information Centre, Dental and Eye Care Team. All Rights

More information

Lecture 7: DecisionTrees

Lecture 7: DecisionTrees Lecture 7: DecisionTrees What are decision trees? Brief interlude on information theory Decision tree construction Overfitting avoidance Regression trees COMP-652, Lecture 7 - September 28, 2009 1 Recall:

More information

Will it rain tomorrow?

Will it rain tomorrow? Will it rain tomorrow? Bilal Ahmed - 561539 Department of Computing and Information Systems, The University of Melbourne, Victoria, Australia bahmad@student.unimelb.edu.au Abstract With the availability

More information

Tutorial 2. Fall /21. CPSC 340: Machine Learning and Data Mining

Tutorial 2. Fall /21. CPSC 340: Machine Learning and Data Mining 1/21 Tutorial 2 CPSC 340: Machine Learning and Data Mining Fall 2016 Overview 2/21 1 Decision Tree Decision Stump Decision Tree 2 Training, Testing, and Validation Set 3 Naive Bayes Classifier Decision

More information

Standards in Action: The Canadian Geospatial Data Infrastructure (CGDI)

Standards in Action: The Canadian Geospatial Data Infrastructure (CGDI) Standards in Action: The Canadian Geospatial Data Infrastructure (CGDI) Craig Stewart ISO/TC211 Standards in Action Workshop, September 14, 2005 Presentation Outline Overview of SDIs Overview of Canada

More information

Land Accounts - The Canadian Experience

Land Accounts - The Canadian Experience Land Accounts - The Canadian Experience Development of a Geospatial database to measure the effect of human activity on the environment Who is doing Land Accounts Statistics Canada (national) Component

More information

Exercise Brunswick ALPHA 2018

Exercise Brunswick ALPHA 2018 ALPHA Exercise Brunswick ALPHA 2018 Who we are (our structure) What we do (our forecasts) How you can access the information Tropical cyclone information (basic) Overview of the products used for Exercise

More information

DEPARTMENT OF GEOLOGY AND MINERAL INDUSTRIES WAYS & MEANS SUBCOMMITTEE ON NATURAL RESOURCES MARCH 2, 2017

DEPARTMENT OF GEOLOGY AND MINERAL INDUSTRIES WAYS & MEANS SUBCOMMITTEE ON NATURAL RESOURCES MARCH 2, 2017 DEPARTMENT OF GEOLOGY AND MINERAL INDUSTRIES WAYS & MEANS SUBCOMMITTEE ON NATURAL RESOURCES MARCH 2, 2017 1 ABOUT DOGAMI AGENCY MISSION, VISION & GOALS 2 Lidar image of a stream network along the Umpqua

More information

QUANTIFYING RESILIENCE-BASED IMPORTANCE MEASURES USING BAYESIAN KERNEL METHODS

QUANTIFYING RESILIENCE-BASED IMPORTANCE MEASURES USING BAYESIAN KERNEL METHODS QUANTIFYING RESILIENCE-BASED IMPORTANCE MEASURES USING BAYESIAN KERNEL METHODS Hiba Baroud, Ph.D. Civil and Environmental Engineering Vanderbilt University Thursday, May 19, 2016 WHAT IS RESILIENCE? Photo:

More information

brainlinksystem.com $25+ / hr AI Decision Tree Learning Part I Outline Learning 11/9/2010 Carnegie Mellon

brainlinksystem.com $25+ / hr AI Decision Tree Learning Part I Outline Learning 11/9/2010 Carnegie Mellon I Decision Tree Learning Part I brainlinksystem.com $25+ / hr Illah Nourbakhsh s version Chapter 8, Russell and Norvig Thanks to all past instructors Carnegie Mellon Outline Learning and philosophy Induction

More information

MISSISSIPPI VALLEY STATE UNIVERSITY Department of Natural Science Chemistry Program Course Number: CH 320 Course Name: Introduction to Biochemistry

MISSISSIPPI VALLEY STATE UNIVERSITY Department of Natural Science Chemistry Program Course Number: CH 320 Course Name: Introduction to Biochemistry MISSISSIPPI VALLEY STATE UNIVERSITY Department of Natural Science Chemistry Program Course Number: CH 320 Course Name: Introduction to Biochemistry Instructor: Matthewos Eshete, PhD Office location: FLW

More information

Operational Perspectives on Hydrologic Model Data Assimilation

Operational Perspectives on Hydrologic Model Data Assimilation Operational Perspectives on Hydrologic Model Data Assimilation Rob Hartman Hydrologist in Charge NOAA / National Weather Service California-Nevada River Forecast Center Sacramento, CA USA Outline Operational

More information

Decision Trees. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. February 5 th, Carlos Guestrin 1

Decision Trees. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University. February 5 th, Carlos Guestrin 1 Decision Trees Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University February 5 th, 2007 2005-2007 Carlos Guestrin 1 Linear separability A dataset is linearly separable iff 9 a separating

More information

Machine Learning and Data Mining. Decision Trees. Prof. Alexander Ihler

Machine Learning and Data Mining. Decision Trees. Prof. Alexander Ihler + Machine Learning and Data Mining Decision Trees Prof. Alexander Ihler Decision trees Func-onal form f(x;µ): nested if-then-else statements Discrete features: fully expressive (any func-on) Structure:

More information

Learning from Examples

Learning from Examples Learning from Examples Data fitting Decision trees Cross validation Computational learning theory Linear classifiers Neural networks Nonparametric methods: nearest neighbor Support vector machines Ensemble

More information

NUCLEAR SAFETY AND RELIABILITY WEEK 3

NUCLEAR SAFETY AND RELIABILITY WEEK 3 Nuclear Safety and Reliability Dan Meneley Page 1 of 10 NUCLEAR SAFETY AND RELIABILITY WEEK 3 TABLE OF CONTENTS - WEEK 1 1. Introduction to Risk Analysis...1 Conditional Probability Matrices for Each Step

More information

Data Mining. 3.6 Regression Analysis. Fall Instructor: Dr. Masoud Yaghini. Numeric Prediction

Data Mining. 3.6 Regression Analysis. Fall Instructor: Dr. Masoud Yaghini. Numeric Prediction Data Mining 3.6 Regression Analysis Fall 2008 Instructor: Dr. Masoud Yaghini Outline Introduction Straight-Line Linear Regression Multiple Linear Regression Other Regression Models References Introduction

More information

Regional Centre for Mapping of Resources for Development (RCMRD), Nairobi, Kenya

Regional Centre for Mapping of Resources for Development (RCMRD), Nairobi, Kenya Regional Centre for Mapping of Resources for Development (RCMRD), Nairobi, Kenya Introduction GIS ( 2 weeks: 10 days) Intakes: 7 th Jan, 4 th Feb,4 th March, 1 st April 6 th May, 3 rd June, 1 st July,

More information

Geohazard risk assessment and asset management along railway corridors

Geohazard risk assessment and asset management along railway corridors Geohazard risk assessment and asset management along railway corridors BGC: Matt Lato, Pete Quinn, Mark Pritchard, Mike Porter and Sarah Newton IOC: Dominique Sirois BGC supports risk-based geohazard management

More information

ESTIMATING THE SOCIAL & ENVIRONMENTAL EFFECTS OF ADVENTURE TOURISM AND RECREATION ON CROWN LAND IN BRITISH COLUMBIA

ESTIMATING THE SOCIAL & ENVIRONMENTAL EFFECTS OF ADVENTURE TOURISM AND RECREATION ON CROWN LAND IN BRITISH COLUMBIA ESTIMATING THE SOCIAL & ENVIRONMENTAL EFFECTS OF ADVENTURE TOURISM AND RECREATION ON CROWN LAND IN BRITISH COLUMBIA Wolfgang Haider School of Resource and Environmental Mgt. Simon Fraser University Burnaby,

More information

Classification: Decision Trees

Classification: Decision Trees Classification: Decision Trees Outline Top-Down Decision Tree Construction Choosing the Splitting Attribute Information Gain and Gain Ratio 2 DECISION TREE An internal node is a test on an attribute. A

More information

Calculating Land Values by Using Advanced Statistical Approaches in Pendik

Calculating Land Values by Using Advanced Statistical Approaches in Pendik Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey Calculating Land Values by Using Advanced Statistical Approaches in Pendik Prof. Dr. Arif Cagdas AYDINOGLU Ress. Asst. Rabia BOVKIR

More information

Decision Trees. Lewis Fishgold. (Material in these slides adapted from Ray Mooney's slides on Decision Trees)

Decision Trees. Lewis Fishgold. (Material in these slides adapted from Ray Mooney's slides on Decision Trees) Decision Trees Lewis Fishgold (Material in these slides adapted from Ray Mooney's slides on Decision Trees) Classification using Decision Trees Nodes test features, there is one branch for each value of

More information

Project Development in Argentina. For Wind Energy and Minerals Using Spatial Data Modelling

Project Development in Argentina. For Wind Energy and Minerals Using Spatial Data Modelling Project Development in Argentina For Wind Energy and Minerals Using Spatial Data Modelling Introduction Development of New Business Opportunities in Argentina Key Project for Kenex Since Mining 2010. Based

More information

Press Release BACTERIA'S KEY INNOVATION HELPS UNDERSTAND EVOLUTION

Press Release BACTERIA'S KEY INNOVATION HELPS UNDERSTAND EVOLUTION Press Release 12 172 BACTERIA'S KEY INNOVATION HELPS UNDERSTAND EVOLUTION Genomic analysis of E. coli shows multiple steps to evolve new trait View Video Postdoc researcher Zachary Blount discusses discovering

More information

Classification and Regression Trees

Classification and Regression Trees Classification and Regression Trees Ryan P Adams So far, we have primarily examined linear classifiers and regressors, and considered several different ways to train them When we ve found the linearity

More information