Database Data Mining: Practical R Enterprise and Oracle Advanced Analytics
|
|
- Hope Oliver
- 6 years ago
- Views:
Transcription
1 Database Data Mining: Practical R Enterprise and Oracle Advanced Analytics Husnu Sensoy husnu.sensoy@globalmaksimum.com October 2, 2012
2 Content 1 Introduction 2 Oracle Enterprise R in Practice Data Visualization A Bit of Probability and Information Theory Optimization Text Analysis & Decision Trees 3 Wrap up
3 Who am I? Founder at Global Maksimum Data & Information Technologies in BI Domain Oracle Magazine DBA of the Year in 2009
4 A bunch of people who know what they are doing mainly focused on data and the transformation of data into information.
5 A bunch of people who know what they are doing mainly focused on data and the transformation of data into information. Complex Event Processing
6 A bunch of people who know what they are doing mainly focused on data and the transformation of data into information. Complex Event Processing 1.2 Million Event in a second on 2x2 Socket Nehalem Blades Data Mining
7 A bunch of people who know what they are doing mainly focused on data and the transformation of data into information. Complex Event Processing 1.2 Million Event in a second on 2x2 Socket Nehalem Blades Data Mining Churn Prediction Models for Telcos Marketing Target Selection Models Large Scale Database Management System Projects
8 A bunch of people who know what they are doing mainly focused on data and the transformation of data into information. Complex Event Processing 1.2 Million Event in a second on 2x2 Socket Nehalem Blades Data Mining Churn Prediction Models for Telcos Marketing Target Selection Models Large Scale Database Management System Projects 120+ TB Exadata migration from UNIX systems. Exadata Master Class all over the EMEA region for Exadata customers, Oracle partners, and Oracle staff at the region.
9 Advanced Analytics For the first version of BI we just filter rows, project columns, aggragate them using some functions, and give only what customer asks for.
10 Advanced Analytics For the first version of BI we just filter rows, project columns, aggragate them using some functions, and give only what customer asks for. After we have focused on machine generated data, or Big Data dealing the data as we did before becomes more and more fruitless.
11 Advanced Analytics For the first version of BI we just filter rows, project columns, aggragate them using some functions, and give only what customer asks for. After we have focused on machine generated data, or Big Data dealing the data as we did before becomes more and more fruitless. That s mainly because of the fact that there is only tiny amount of information available in this pile of data.
12 Advanced Analytics For the first version of BI we just filter rows, project columns, aggragate them using some functions, and give only what customer asks for. After we have focused on machine generated data, or Big Data dealing the data as we did before becomes more and more fruitless. That s mainly because of the fact that there is only tiny amount of information available in this pile of data. So it requires better tricks, automation, and post-analysis capabilities.
13 In-database Advanced Analytics 80% of data mining activity for enterprise means feature engineering.
14 In-database Advanced Analytics 80% of data mining activity for enterprise means feature engineering. Feature Engineering requires an iterative process of
15 In-database Advanced Analytics 80% of data mining activity for enterprise means feature engineering. Feature Engineering requires an iterative process of Filtering data (WHERE) Aggragating data (GROUP BY) Transforming data (CASE, DECODE, COALESCE, etc.)
16 In-database Advanced Analytics 80% of data mining activity for enterprise means feature engineering. Feature Engineering requires an iterative process of Filtering data (WHERE) Aggragating data (GROUP BY) Transforming data (CASE, DECODE, COALESCE, etc.) It is almost impossible to maintain an integrated mining environment (Scripts, files, metafiles,etc.) out of database
17 Oracle Advanced Analytics Toolkit SQL-2003 & Extensions Oracle Data Mining Oracle Spatial Extensions Flow based mining with SQL Developer Oracle Enterprise R
18 R is a free software environment for statistical computing and graphics. Majority of newbies (young data scientists) recently graduate or to be graduated from top universities use R. Batteries are included. Runs on all modern platforms.
19 Oracle R Enterprise Data you can process with standard R is limited with the amount of memory available on the server running R.
20 Oracle R Enterprise Data you can process with standard R is limited with the amount of memory available on the server running R. In order to bypass this problem people implement their own solutions in order to off-store data or utilize data sampling techniques.
21 Oracle R Enterprise Data you can process with standard R is limited with the amount of memory available on the server running R. In order to bypass this problem people implement their own solutions in order to off-store data or utilize data sampling techniques. ORE is an extension to standard R adding Oracle steroids into it.
22 Oracle R Enterprise Data you can process with standard R is limited with the amount of memory available on the server running R. In order to bypass this problem people implement their own solutions in order to off-store data or utilize data sampling techniques. ORE is an extension to standard R adding Oracle steroids into it. The basic idea is to off-load R commands seemless to Oracle Database or Oracle Big Data Appliance.
23 This session This session is not a R tutorial session but rather a fly over some possible solutions to real life scenarios using R. If you need some R tutorial please refer to Rob Kabacoff. R in Action. Manning, 2010 Oracle R Enterprise Training 2 - Introduction to R R Studio
24 Data Visualization Data Visualization Advance data analysis usually starts and ends with data visualization. Before modeling anything data scientists use graphs & charts to figure out behaviour of data After modeling in order to report the results they again refer to charts. R supports tens of different charting & graphing packages. Just to mention two of them lattice is used to generate conditioned graphs (a.k.a. trellis graphs) ggplot2 is used to make graph generation more consistent in R.
25 Data Visualization Histogram Do you see any significant pattern in distribution? Do you like the way histogram is represented? s o u r c e ( / r s n i p p l e t s /oow2012/mydata. r, l o c a l=true) d a t a s e t = generatecustomer ( ) h=h i s t ( d a t a s e t $ B i l l p e r P e r i o d, f r e q=true, y l a b= Number o f Customers, x l a b= B i l l Amount, main= B i l l Amount D i s t r i b u t i o n )
26 Data Visualization Remove the Outliers Do you see any significant pattern in distribution? s o u r c e ( / r s n i p p l e t s /oow2012/mydata. r, l o c a l=true) d a t a s e t = generatecustomer ( ) n o o u t l i e r = f u n c t i o n ( data, column, q =0.99, i n c=true){ q = q u a n t i l e ( data [, column ], na. rm=true, p r o b s = q u a n t i l e, names=false) i f ( i n c l u s i v e ){ pruned = s u b s e t ( data, data [, column ] <= q ) } e l s e { pruned = s u b s e t ( data, data [, column ] < q ) } } pruned pruned = n o o u t l i e r ( d a t a s e t, B i l l p e r P e r i o d, ) h=h i s t ( pruned $ B i l l p e r P e r i o d, f r e q=true, y l a b= Number o f Customers, x l a b= B i l l Amount, main= B i l l Amount D i s t r i b u t i o n )
27 Data Visualization Conditional Histograms s o u r c e ( / r s n i p p l e t s /oow2012/mydata. r, l o c a l=true) s o u r c e ( / r s n i p p l e t s /oow2012/commons. r, l o c a l=true) d a t a s e t = generatecustomer ( ) pruned = n o o u t l i e r ( d a t a s e t, B i l l p e r P e r i o d, ) l i b r a r y ( l a t t i c e ) h i s t o g r a m ( B i l l p e r P e r i o d U s i n g S e r v i c e X, data=pruned )
28 Data Visualization Too Many Columns to Visualize s o u r c e ( / r s n i p p l e t s /oow2012/mydata. r, l o c a l=true) s o u r c e ( / r s n i p p l e t s /oow2012/commons. r, l o c a l=true) d a t a s e t = generatecustomer ( ) head ( d a t a s e t ) pruned = n o o u t l i e r ( d a t a s e t, B i l l p e r P e r i o d, ) l i b r a r y ( l a t t i c e ) h i s t o g r a m ( B i l l p e r P e r i o d CarBrand, data=pruned )
29 A Bit of Probability and Information Theory Comparing Histograms We need a way to calculate similarity between those histograms. A strong tool from information theory Kullback Leibler Divergence allows us to define a distance metric between two distributions. equiwidth = function ( data, col, n=10, s f=1e 6){ q l i s t = q u a n t i l e ( data [, c o l ], na. rm=true, p r o b s = seq ( 0. 1, 1. 0, by =1./n ), names=false) } r e s u l t=c ( ) f o r ( q u a n t i l e i n q l i s t ){ r e s u l t = c ( r e s u l t, ( nrow ( s u b s e t ( data, data [, c o l ] <= q u a n t i l e ) ) / nrow ( data ) ) ) } r e s u l t [ 1 : n] c ( 0, r e s u l t [ 1 : ( n 1) ] ) + r e p ( s f, n )
30 A Bit of Probability and Information Theory KL Divergence & Symmetry D KL (P Q) = i Notice that D KL (P Q) D KL (Q P) P(i)ln P(i) Q(i) So we simply take the average of two to obtain a symmetric metric. k l d i s t a n c e = f u n c t i o n ( d i s t 1, d i s t 2 ){ k l 1 = 0. 0 f o r ( i i n 1 : l e n g t h ( d i s t 1 ) ){ k l 1 = k l 1 + d i s t 1 [ i ] l o g ( d i s t 1 [ i ] / d i s t 2 [ i ], 2 ) } k l 2 = 0. 0 f o r ( i i n 1 : l e n g t h ( d i s t 1 ) ){ k l 2 = k l 2 + d i s t 2 [ i ] l o g ( d i s t 2 [ i ] / d i s t 1 [ i ], 2 ) } ( k l 1+k l 2 ) /2 }
31 A Bit of Probability and Information Theory Top 5 Car Brands whose Owners Diverge from Baseline Brand KL Lancia Lincoln Proton Daewoo Pontiac ddf = NULL b a s e l i n e = e q u i w i d t h ( pruned, B i l l p e r P e r i o d ) f o r ( brand i n d a t a s e t [! d u p l i c a t e d ( d a t a s e t [, c ( CarBrand ) ] ), 1 ] ) { } b r a n d D i s t = e q u i w i d t h ( s u b s e t ( pruned, pruned [, CarBrand ] == brand ), B i l l p e r P e r i o d ) ddf = r b i n d ( ddf, data. frame ( carbrand=brand, k l=k l d i s t a n c e ( b a s e l i n e, b r a n d D i s t ) ) ) head ( ddf [ o r d e r ( ddf $ k l, d e c r e a s i n g=true), ] )
32 Optimization Problem Definition We a have a terrain covered by several stations and each point on the terrain has one of the following status GREEN Region is in the LoS of at least one station. YELLOW Region is in the LoS of at least on station but far away. RED Region is out of LoS. For a fixed number of stations we need to cover as much as we can.
33 Optimization Model Sketch Up 1 1 Define a function to calculate the ratio of green zones on terrain. 2 Give this function to one of optimization modules of R (Nelder Mead Technique) which can handle non-smooth target functions. 3 Get the optimal station distribution. t a r g e t f u n c=f u n c t i o n ( o b s e r v e r ){ m = matrix ( data=observer, ncol =2, byrow=true) # Compute merged s t a t u s o f a l l o b s e r v e r s m e r g e d s t a t u s < r e p ( r e d, l e n g t h ( t e r r $ h e i g h t ) ) f o r ( i i n seq ( 1 : dim (m) [ 1 ] ) ){ t e r r $ d i s t 2 o b s e r v e r = d i s t a n c e ( t e r r, c (m[ i, ], 7 ) ) s t a t u s = LoS ( t e r r, c (m[ i, ], 7 ), maxdist ) mergedstatus = updatestatus ( mergedstatus, s t a t u s ) } sum ( mergedstatus== green ) } optim < optim ( observers, targetfunc, c o n t r o l= l i s t ( f n s c a l e = 1, t r a c e =5, REPORT=1) ) 1 Refer to LoS Analysis (Part 4)
34 Optimization 1 Station (54%)
35 Optimization 3 Stations (83%)
36 Optimization 6 Stations (99%)
37 Text Analysis & Decision Trees Problem Definition For a given string which is written intentionally or erroneously wrong by subscribers, how can we build a model which can deduce the most probable string among 3 possibilities (or chose to not making any decision).
38 Text Analysis & Decision Trees Problem Definition For a given string which is written intentionally or erroneously wrong by subscribers, how can we build a model which can deduce the most probable string among 3 possibilities (or chose to not making any decision). Our legitamate strings are mom, dad, and brother. And we have brothe brother bro brother brother1 brother p? 1234? mom.i.came.home mom mmomyy mom dad[atwork] dad dod dad
39 Text Analysis & Decision Trees Model Sketch Up 1 Do some feature engineering
40 Text Analysis & Decision Trees Model Sketch Up 1 Do some feature engineering Length of the string
41 Text Analysis & Decision Trees Model Sketch Up 1 Do some feature engineering Length of the string Prefix flag (3 attributes for each)
42 Text Analysis & Decision Trees Model Sketch Up 1 Do some feature engineering Length of the string Prefix flag (3 attributes for each) Contains flag (3 attributes for each)
43 Text Analysis & Decision Trees Model Sketch Up 1 Do some feature engineering Length of the string Prefix flag (3 attributes for each) Contains flag (3 attributes for each) Anything else? 2 Build a classifier to classify those texts based on those features. 3 Evaluate your classifier
44 Text Analysis & Decision Trees First Model s o u r c e ( / r s n i p p l e t s /oow2012/mydata. r, l o c a l=true) df = generatetext ( ) l i b r a r y ( r p a r t ) # grow t r e e f i t < r p a r t ( c o r r e c t e d l e n g t h+p r e f i x B r o t h e r+p r e f i x D a d+prefixmom+i n s t r B r o t h e r+ instrdad+instrmom, method= c l a s s, data=d f ) t a b l e ( pred = p r e d i c t ( f i t, df, t y p e= c l a s s ), true = df $ corrected ) true pred? brother dad mom? brother dad mom
45 Text Analysis & Decision Trees More Feature Engineering using Jaro-Winkler Algorithm Jaro-Winkler distance is a distance metric between strings which can be used as a fuzzy string matching algorithm resilient to typo errors. l i b r a r y ( R e c o r d L i n k a g e ) enhanced = data. frame ( df, momscore = j a r o w i n k l e r ( mom, d f $ o r g i n a l ), dadscore = j a r o w i n k l e r ( dad, d f $ o r g i n a l ), b r o t h e r S c o r e = j a r o w i n k l e r ( b r o t h e r, d f $ o r g i n a l ) ) true pred? brother dad mom? brother dad mom
46 Conclusion R contains lots of libraries to help you model a physical phenomenon in anyway you like and visualize it.
47 Conclusion R contains lots of libraries to help you model a physical phenomenon in anyway you like and visualize it. Oracle Enterprise R makes it possible to handle large volumes of data without changing your R environment basics.
48 Conclusion R contains lots of libraries to help you model a physical phenomenon in anyway you like and visualize it. Oracle Enterprise R makes it possible to handle large volumes of data without changing your R environment basics. Don t take ODM and Oracle Enterprise R as alternatives of each other but rather complimentary solutions of the same problem.
49 Question & Answer
50 Stay in Touch
ST-Links. SpatialKit. Version 3.0.x. For ArcMap. ArcMap Extension for Directly Connecting to Spatial Databases. ST-Links Corporation.
ST-Links SpatialKit For ArcMap Version 3.0.x ArcMap Extension for Directly Connecting to Spatial Databases ST-Links Corporation www.st-links.com 2012 Contents Introduction... 3 Installation... 3 Database
More informationChuck Cartledge, PhD. 21 January 2018
Big Data: Data Analysis Boot Camp Non-SQL and R Chuck Cartledge, PhD 21 January 2018 1/19 Table of contents (1 of 1) 1 Intro. 2 Non-SQL DBMS Classic Non-SQL databases 3 Hands-on Airport connections as
More informationFEEG6017 lecture: Akaike's information criterion; model reduction. Brendan Neville
FEEG6017 lecture: Akaike's information criterion; model reduction Brendan Neville bjn1c13@ecs.soton.ac.uk Occam's razor William of Occam, 1288-1348. All else being equal, the simplest explanation is the
More informationAdministering your Enterprise Geodatabase using Python. Jill Penney
Administering your Enterprise Geodatabase using Python Jill Penney Assumptions Basic knowledge of python Basic knowledge enterprise geodatabases and workflows You want code Please turn off or silence cell
More informationMidterm. You may use a calculator, but not any device that can access the Internet or store large amounts of data.
INST 737 April 1, 2013 Midterm Name: }{{} by writing my name I swear by the honor code Read all of the following information before starting the exam: For free response questions, show all work, clearly
More informationMath 381 Midterm Practice Problem Solutions
Math 381 Midterm Practice Problem Solutions Notes: -Many of the exercises below are adapted from Operations Research: Applications and Algorithms by Winston. -I have included a list of topics covered on
More informationArcGIS Deployment Pattern. Azlina Mahad
ArcGIS Deployment Pattern Azlina Mahad Agenda Deployment Options Cloud Portal ArcGIS Server Data Publication Mobile System Management Desktop Web Device ArcGIS An Integrated Web GIS Platform Portal Providing
More informationA Summary of State DOT GIS Activities. Presented at the 2004 AASHTO GIS-T Symposium Rapid City, SD
A Summary of State DOT GIS Activities Presented at the 2004 AASHTO GIS-T Symposium Rapid City, SD Information Sources E-mail survey of state DOT GIS managers All 51 State DOTs (and DC) responded in 2004
More informationAmong various open-source GIS programs, QGIS can be the best suitable option which can be used across partners for reasons outlined below.
Comparison of Geographic Information Systems (GIS) software As of January 2018, WHO has reached an agreement with ESRI (an international supplier of GIS software) for an unlimited use of ArcGIS Desktop
More informationGeodatabase Programming with Python
DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with Python Craig Gillgrass Assumptions Basic knowledge of python Basic knowledge enterprise geodatabases and workflows Please turn
More informationGRASS GIS Development APIs
GRASS GIS Development APIs Lifting the fog on the different ways to develop with and for GRASS Moritz Lennert Member of the GRASS GIS Project Steering Comittee PostGISomics Over 30 years of development
More informationPredictive Analytics on Accident Data Using Rule Based and Discriminative Classifiers
Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 3 (2017) pp. 461-469 Research India Publications http://www.ripublication.com Predictive Analytics on Accident Data Using
More informationD2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS
D2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS EDWARD GAUSE, GISP DIRECTOR OF INFORMATION SERVICES (ENGINEERING APPS) HTC (HORRY TELEPHONE COOP.) EDWARD GAUSE, GISP DIRECTOR OF INFORMATION
More informationApproaches to Spatial Analysis. Flora Vale, Linda Beale, Mark Harrower, Clint Brown Esri Redlands
Approaches to Spatial Analysis Flora Vale, Linda Beale, Mark Harrower, Clint Brown Esri Redlands Analysis (noun) Detailed examination of the elements or structure of something, as a basis for discussion,
More informationGenerative Models for Discrete Data
Generative Models for Discrete Data ddebarr@uw.edu 2016-04-21 Agenda Bayesian Concept Learning Beta-Binomial Model Dirichlet-Multinomial Model Naïve Bayes Classifiers Bayesian Concept Learning Numbers
More informationGeodatabase Programming with Python John Yaist
Geodatabase Programming with Python John Yaist DevSummit DC February 26, 2016 Washington, DC Target Audience: Assumptions Basic knowledge of Python Basic knowledge of Enterprise Geodatabase and workflows
More informationPerforming Advanced Cartography with Esri Production Mapping
Esri International User Conference San Diego, California Technical Workshops July 25, 2012 Performing Advanced Cartography with Esri Production Mapping Tania Pal & Madhura Phaterpekar Agenda Outline generic
More informationThe File Geodatabase API. Craig Gillgrass Lance Shipman
The File Geodatabase API Craig Gillgrass Lance Shipman Schedule Cell phones and pagers Please complete the session survey we take your feedback very seriously! Overview File Geodatabase API - Introduction
More informationStatistics Toolbox 6. Apply statistical algorithms and probability models
Statistics Toolbox 6 Apply statistical algorithms and probability models Statistics Toolbox provides engineers, scientists, researchers, financial analysts, and statisticians with a comprehensive set of
More informationData Mining Decision-Trees for Comparative Models and Possibilities for Uniting Texts and Coded Data
Data Mining Decision-Trees for Comparative Models and Possibilities for Uniting Texts and Coded Data Michael D. Fischer, (U Kent) m.d.fischer@kent.ac.uk Comparative Research There are two basic approaches
More informationLeslie matrices and Markov chains.
Leslie matrices and Markov chains. Example. Suppose a certain species of insect can be divided into 2 classes, eggs and adults. 10% of eggs survive for 1 week to become adults, each adult yields an average
More informationECLT 5810 Data Preprocessing. Prof. Wai Lam
ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate
More informationInformation Theory, Statistics, and Decision Trees
Information Theory, Statistics, and Decision Trees Léon Bottou COS 424 4/6/2010 Summary 1. Basic information theory. 2. Decision trees. 3. Information theory and statistics. Léon Bottou 2/31 COS 424 4/6/2010
More informationA GUI FOR EVOLVE ZAMS
A GUI FOR EVOLVE ZAMS D. R. Schlegel Computer Science Department Here the early work on a new user interface for the Evolve ZAMS stellar evolution code is presented. The initial goal of this project is
More informationGalaxy Zoo. Materials Computer Internet connection
Name: Date: Galaxy Zoo Objectives: Distinguish between different types of galaxies Identify the various features of each subclass Contribute data that will be used by astronomers in their work Learn to
More informationWeighted Stability Index (WSI) Metric Model Mike Libassi Intel Corp 8/11/99
Weighted Stability Index (WSI) Metric Model Mike Libassi Intel Corp 8/11/99 Abstract Methods, such as McCabe's Cyclomatic Complexity, have proven that complexity is a reliable predictor of defects. Although
More informationPortal for ArcGIS: An Introduction
Portal for ArcGIS: An Introduction Derek Law Esri Product Management Esri UC 2014 Technical Workshop Agenda Web GIS pattern Product overview Installation and deployment Security and groups Configuration
More informationSTRUCTURAL BIOINFORMATICS I. Fall 2015
STRUCTURAL BIOINFORMATICS I Fall 2015 Info Course Number - Classification: Biology 5411 Class Schedule: Monday 5:30-7:50 PM, SERC Room 456 (4 th floor) Instructors: Vincenzo Carnevale - SERC, Room 704C;
More informationComplexity and NP-completeness
Lecture 17 Complexity and NP-completeness Supplemental reading in CLRS: Chapter 34 As an engineer or computer scientist, it is important not only to be able to solve problems, but also to know which problems
More informationBayesian Classifiers and Probability Estimation. Vassilis Athitsos CSE 4308/5360: Artificial Intelligence I University of Texas at Arlington
Bayesian Classifiers and Probability Estimation Vassilis Athitsos CSE 4308/5360: Artificial Intelligence I University of Texas at Arlington 1 Data Space Suppose that we have a classification problem The
More informationRoad Ahead: Linear Referencing and UPDM
Road Ahead: Linear Referencing and UPDM Esri European Petroleum GIS Conference November 7, 2014 Congress Centre, London Your Work Making a Difference ArcGIS Is Evolving Your GIS Is Becoming Part of an
More informationFeatures and Benefits
Autodesk LandXplorer Features and Benefits Use the Autodesk LandXplorer software family to help improve decision making, lower costs, view and share changes, and avoid the expense of creating physical
More informationReference: 4880(DOP.ADA)1136 Subject: Survey on the integration of geographic information systems into postal address development
International Bureau Weltpoststrasse 4 P.O. Box 312 3000 BERNE 15 SWITZERLAND To: Union member countries Regulators Designated operators T +41 31 350 31 11 F +41 31 350 31 10 www.upu.int For information
More informationPerforming Map Cartography. using Esri Production Mapping
AGENDA Performing Map Cartography Presentation Title using Esri Production Mapping Name of Speaker Company Name Kannan Jayaraman Agenda Introduction What s New in ArcGIS 10.1 ESRI Production Mapping Mapping
More informationBetaZi SCIENCE AN OVERVIEW OF PHYSIO-STATISTICS FOR PRODUCTION FORECASTING. EVOLVE, ALWAYS That s geologic. betazi.com. geologic.
BetaZi SCIENCE AN OVERVIEW OF PHYSIO-STATISTICS FOR PRODUCTION FORECASTING EVOLVE, ALWAYS That s geologic betazi.com geologic.com Introduction Predictive Analytics is the science of using past facts to
More informationGIS-BASED DISASTER WARNING SYSTEM OF LOW TEMPERATURE AND SPARE SUNLIGHT IN GREENHOUSE
GIS-BASED DISASTER WARNING SYSTEM OF LOW TEMPERATURE AND SPARE SUNLIGHT IN GREENHOUSE 1,2,* 1,2 Ruijiang Wei, Chunqiang Li, Xin Wang 1, 2 1 Hebei Provincial Institute of Meteorology, Shijiazhuang, Hebei
More informationIntroduction to Portal for ArcGIS. Hao LEE November 12, 2015
Introduction to Portal for ArcGIS Hao LEE November 12, 2015 Agenda Web GIS pattern Product overview Installation and deployment Security and groups Configuration options Portal for ArcGIS + ArcGIS for
More informationArcGIS. for Server. Understanding our World
ArcGIS for Server Understanding our World ArcGIS for Server Create, Distribute, and Manage GIS Services You can use ArcGIS for Server to create services from your mapping and geographic information system
More informationIntroduction to Google Mapping Tools
Introduction to Google Mapping Tools Google s Mapping Tools Explore geographic data. Organize your own geographic data. Visualize complex data. Share your data with the world. Tell your story and educate
More informationPerforming. Geospatial Analysis. Using Latitude and Longitude Data. Ginger Ni - Software Engineer
Performing Geospatial Analysis Using Latitude and Longitude Data Ginger Ni - Software Engineer Hewlett Packard Enterprise confidential information This is a rolling (up to three year) roadmap and is subject
More informationFrom Geographics Stella to Bentley Map Stella Map. Kimmo Soukki, Account Manager Bentley Finland
From Geographics Stella to Bentley Map Stella Map Kimmo Soukki, Account Manager Bentley Finland This presentation Stella? Why migrate? New Bentley Stella Map - How it is done? Data migration Bonuses? What
More informationGeodatabase Essentials Part One - Intro to the Geodatabase. Jonathan Murphy Colin Zwicker
Geodatabase Essentials Part One - Intro to the Geodatabase Jonathan Murphy Colin Zwicker Session Path The Geodatabase - What is it? - Why use it? - What types are there? Inside the Geodatabase Advanced
More informationAxisymmetric Modeling. This tutorial gives an overview of axisymmetric modeling. Learn how to:
Axisymmetric Modeling I-DEAS Tutorials: Simulation Projects This tutorial gives an overview of axisymmetric modeling. Learn how to: sketch on the XZ plane apply boundary conditions mesh axisymmetric elements
More informationCPSC 340: Machine Learning and Data Mining. Sparse Matrix Factorization Fall 2017
CPSC 340: Machine Learning and Data Mining Sparse Matrix Factorization Fall 2017 Admin Assignment 4: Due Friday. Assignment 5: Posted, due Monday of last week of classes Last Time: PCA with Orthogonal/Sequential
More information(Get a) Half Life. Date Per. mathematical. But it can also be modeled for us to see with simple materials.
Unit 5 (Get a) Half Life Name The theory of how half life break-down works is very Date Per. mathematical. But it can also be modeled for us to see with simple materials. As you do this lab, try to ask
More informationIntroduction to Portal for ArcGIS
Introduction to Portal for ArcGIS Derek Law Product Management March 10 th, 2015 Esri Developer Summit 2015 Agenda Web GIS pattern Product overview Installation and deployment Security and groups Configuration
More informationGeodatabase An Introduction
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Geodatabase An Introduction David Crawford and Jonathan Murphy Session Path The Geodatabase What is it?
More informationToday s Agenda: 1) Why Do We Need To Measure The Memory Component? 2) Machine Pool Memory / Best Practice Guidelines
Today s Agenda: 1) Why Do We Need To Measure The Memory Component? 2) Machine Pool Memory / Best Practice Guidelines 3) Techniques To Measure The Memory Component a) Understanding Your Current Environment
More informationIntroduction to RStudio
Introduction to RStudio Carl Tony Fakhry Jie Chen April 4, 2015 Introduction R is a powerful language and environment for statistical computing and graphics. R is freeware and there is lot of help available
More information6.001 Recitation 22: Streams
6.001 Recitation 22: Streams RI: Gerald Dalley, dalleyg@mit.edu, 4 May 2007 http://people.csail.mit.edu/dalleyg/6.001/sp2007/ The three chief virtues of a programmer are: Laziness, Impatience and Hubris
More informationHow GIS can be used for improvement of literacy and CE programmes
How GIS can be used for improvement of literacy and CE programmes Training Workshop for Myanmar Literacy Resource Center (MLRC) ( Yangon, Myanmar, 11 20 October 2000 ) Presented by U THEIN HTUT GEOCOMP
More informationSection Notes 8. Integer Programming II. Applied Math 121. Week of April 5, expand your knowledge of big M s and logical constraints.
Section Notes 8 Integer Programming II Applied Math 121 Week of April 5, 2010 Goals for the week understand IP relaxations be able to determine the relative strength of formulations understand the branch
More informationEEOS 381 -Spatial Databases and GIS Applications
EEOS 381 -Spatial Databases and GIS Applications Lecture 5 Geodatabases What is a Geodatabase? Geographic Database ESRI-coined term A standard RDBMS that stores and manages geographic data A modern object-relational
More informationSpatial Analytics Workshop
Spatial Analytics Workshop Pete Skomoroch, LinkedIn (@peteskomoroch) Kevin Weil, Twitter (@kevinweil) Sean Gorman, FortiusOne (@seangorman) #spatialanalytics Introduction The Rise of Spatial Analytics
More informationEngineering for Compatibility
W17 Compatibility Testing Wednesday, October 3rd, 2018 3:00 PM Engineering for Compatibility Presented by: Melissa Benua mparticle Brought to you by: 350 Corporate Way, Suite 400, Orange Park, FL 32073
More informationCLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition
CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition Ad Feelders Universiteit Utrecht Department of Information and Computing Sciences Algorithmic Data
More informationJMP Regression Model to Predict Tire Performance: Demo and Tutorial
JMP Regression Model to Predict Tire Performance: Demo and Tutorial Leighton Spadone, DAAS Inc. Tire Technology and Statistical Consulting Avrohom.Spadone@Gmail.com Introduction: The International Mining
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:
More informationTerje Pedersen Product Manager Software / Hydrography Hydroacoustics Division
Terje Pedersen Product Manager Software / Hydrography Hydroacoustics Division WORLD CLASS through people, technology and dedication WORLD CLASS through people, technology and dedication 1 2 HYDROFEST 2005
More informationCWMS Modeling for Real-Time Water Management
Hydrologic Engineering Center Training Course on CWMS Modeling for Real-Time Water Management August 2018 Davis, California The Corps Water Management System (CWMS) is a software and hardware system to
More informationComputation of Large Sparse Aggregated Areas for Analytic Database Queries
Computation of Large Sparse Aggregated Areas for Analytic Database Queries Steffen Wittmer Tobias Lauer Jedox AG Collaborators: Zurab Khadikov Alexander Haberstroh Peter Strohm Business Intelligence and
More informationMEASURING THE COMPLEXITY AND IMPACT OF DESIGN CHANGES
Presentation Paper Bio Return to Main Menu P R E S E N T A T I O N W4 Wednesday, March 8, 2000 11:00AM MEASURING THE COMPLEXITY AND IMPACT OF DESIGN CHANGES Mike Libassi Intel Corporation International
More informationRapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value
Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Anthony Arvanites Daylight User Group Meeting March 10, 2005 Outline 1. Company Introduction
More informationI N T R O D U C T I O N : G R O W I N G I T C O M P L E X I T Y
Global Headquarters: 5 Speen Street Framingham, MA 01701 USA P.508.872.8200 F.508.935.4015 www.idc.com W H I T E P A P E R I n v a r i a n t A n a l y z e r : A n A u t o m a t e d A p p r o a c h t o
More informationFundamentals of Computational Science
Fundamentals of Computational Science Dr. Hyrum D. Carroll August 23, 2016 Introductions Each student: Name Undergraduate school & major Masters & major Previous research (if any) Why Computational Science
More informationOBEUS. (Object-Based Environment for Urban Simulation) Shareware Version. Itzhak Benenson 1,2, Slava Birfur 1, Vlad Kharbash 1
OBEUS (Object-Based Environment for Urban Simulation) Shareware Version Yaffo model is based on partition of the area into Voronoi polygons, which correspond to real-world houses; neighborhood relationship
More informationInternet Engineering Jacek Mazurkiewicz, PhD
Internet Engineering Jacek Mazurkiewicz, PhD Softcomputing Part 11: SoftComputing Used for Big Data Problems Agenda Climate Changes Prediction System Based on Weather Big Data Visualisation Natural Language
More informationHunting for Anomalies in PMU Data
Hunting for Anomalies in PMU Data BRETT AMIDAN JAMES FOLLUM JEFFERY DAGLE Pacific Northwest National Laboratory NASPI Presentation (October 23, 2014) November 3, 2014 b.amidan@pnnl.gov 1 Big Picture Objective
More informationSpatial Analysis using Vector GIS THE GOAL: PREPARATION:
PLAN 512 GIS FOR PLANNERS Department of Urban and Environmental Planning University of Virginia Fall 2006 Prof. David L. Phillips Spatial Analysis using Vector GIS THE GOAL: This tutorial explores some
More informationCoordination of Infrastructure Projects with GIS and SharePoint
Coordination of Infrastructure Projects with GIS and SharePoint Markus Schenardi Geocom Informatik AG Munich, October 24, 2013 2013 Esri Europe, Middle East and Africa User Conference October 23-25, 2013
More informationPUBLIC SAFETY POWER SHUTOFF POLICIES AND PROCEDURES
PACIFIC GAS AND ELECTRIC COMPANY PUBLIC SAFETY POWER SHUTOFF POLICIES AND PROCEDURES SEPTEMBER 2018 1 PACIFIC GAS AND ELECTRIC COMPANY PUBLIC SAFETY POWER SHUTOFF POLICIES AND PROCEDURES SEPTEMBER 2018
More informationESPRIT Feature. Innovation with Integrity. Particle detection and chemical classification EDS
ESPRIT Feature Particle detection and chemical classification Innovation with Integrity EDS Fast and Comprehensive Feature Analysis Based on the speed and accuracy of the QUANTAX EDS system with its powerful
More informationThe Next Generation GIS/LIS A Surveys Information System Integrated within a GIS
The Next Generation GIS/LIS A Surveys Information System Integrated within a GIS Michael F. WEIR, USA Key words: Linking, Snapping, GIS, Measurements, Measurement Data, GIS Features, Spatial Quality. SUMMARY
More informationChapter 1. GIS Fundamentals
1. GIS Overview Chapter 1. GIS Fundamentals GIS refers to three integrated parts. Geographic: Of the real world; the spatial realities, the geography. Information: Data and information; their meaning and
More informationPerformance Evaluation
Performance Evaluation David S. Rosenberg Bloomberg ML EDU October 26, 2017 David S. Rosenberg (Bloomberg ML EDU) October 26, 2017 1 / 36 Baseline Models David S. Rosenberg (Bloomberg ML EDU) October 26,
More informationGeodatabase Best Practices. Dave Crawford Erik Hoel
Geodatabase Best Practices Dave Crawford Erik Hoel Geodatabase best practices - outline Geodatabase creation Data ownership Data model Data configuration Geodatabase behaviors Data integrity and validation
More informationDemand forecasting. Mehmet Yurdal May White paper
Microsoft Dynamics AX 2012 R3 White paper Roxana Cristina Diaconu and Mehmet Yurdal May 2014 This white paper explains the value proposition of the tool in Microsoft Dynamics AX 2012 R3 and gives an overview
More informationMap Application Progression
Map Application Progression Application Migration with Latest ArcGIS by Dean Chiang California Department of Fish and Wildlife What we do at CDFW Hunting and fishing licensing and regulation Conservation
More informationIntroduction to ArcGIS Server Development
Introduction to ArcGIS Server Development Kevin Deege,, Rob Burke, Kelly Hutchins, and Sathya Prasad ESRI Developer Summit 2008 1 Schedule Introduction to ArcGIS Server Rob and Kevin Questions Break 2:15
More informationMachine Learning Applications in Astronomy
Machine Learning Applications in Astronomy Umaa Rebbapragada, Ph.D. Machine Learning and Instrument Autonomy Group Big Data Task Force November 1, 2017 Research described in this presentation was carried
More informationIntroduction to ArcMap
Introduction to ArcMap ArcMap ArcMap is a Map-centric GUI tool used to perform map-based tasks Mapping Create maps by working geographically and interactively Display and present Export or print Publish
More informationAUTOMATIC AND INTERACTIVE ANALYSIS SOFTWARE FOR BETA- GAMMA COINCIDENCE SYSTEMS USED IN CTBT MONITORING
ABSTRACT AUTOMATIC AND INTERACTIVE ANALYSIS SOFTWARE FOR BETA- GAMMA COINCIDENCE SYSTEMS USED IN CTBT MONITORING J. Rynes, K.M.F. Biegalski, P. Donohoe, and S. Biegalski Veridian Pacific-Sierra Research
More information7 Distances. 7.1 Metrics. 7.2 Distances L p Distances
7 Distances We have mainly been focusing on similarities so far, since it is easiest to explain locality sensitive hashing that way, and in particular the Jaccard similarity is easy to define in regards
More informationLecture 15: Exploding and Vanishing Gradients
Lecture 15: Exploding and Vanishing Gradients Roger Grosse 1 Introduction Last lecture, we introduced RNNs and saw how to derive the gradients using backprop through time. In principle, this lets us train
More informationNeural Networks for Machine Learning. Lecture 2a An overview of the main types of neural network architecture
Neural Networks for Machine Learning Lecture 2a An overview of the main types of neural network architecture Geoffrey Hinton with Nitish Srivastava Kevin Swersky Feed-forward neural networks These are
More informationGeodatabase Management Pathway
Geodatabase Management Pathway Table of Contents ArcGIS Desktop II: Tools and Functionality 3 ArcGIS Desktop III: GIS Workflows and Analysis 6 Building Geodatabases 8 Data Management in the Multiuser Geodatabase
More informationWelcome! Power BI User Group (PUG) Copenhagen
Welcome! Power BI User Group (PUG) Copenhagen Making Maps in Power BI Andrea Martorana Tusa BI Specialist Welcome to Making maps in Power BI Who am I? First name: Andrea. Last name: Martorana Tusa. Italian,
More information(S1) (S2) = =8.16
Formulae for Attributes As described in the manuscript, the first step of our method to create a predictive model of material properties is to compute attributes based on the composition of materials.
More informationSampling Distributions of the Sample Mean Pocket Pennies
You will need 25 pennies collected from recent day-today change Some of the distributions of data that you have studied have had a roughly normal shape, but many others were not normal at all. What kind
More informationThis tutorial is intended to familiarize you with the Geomatica Toolbar and describe the basics of viewing data using Geomatica Focus.
PCI GEOMATICS GEOMATICA QUICKSTART 1. Introduction This tutorial is intended to familiarize you with the Geomatica Toolbar and describe the basics of viewing data using Geomatica Focus. All data used in
More informationWorking with Temporal Data in ArcGIS
Working with Temporal Data in ArcGIS By Aileen Buckley, Esri Research Cartographer Time is an important dimension in many types of geospatial visualizations and analyses. The temporal aspect adds when
More informationEvaluating Physical, Chemical, and Biological Impacts from the Savannah Harbor Expansion Project Cooperative Agreement Number W912HZ
Evaluating Physical, Chemical, and Biological Impacts from the Savannah Harbor Expansion Project Cooperative Agreement Number W912HZ-13-2-0013 Annual Report FY 2018 Submitted by Sergio Bernardes and Marguerite
More informationOverview. Everywhere. Over everything.
Cadenza Desktop Cadenza Web Cadenza Mobile Cadenza Overview. Everywhere. Over everything. The ultimate GIS and reporting suite. Provide, analyze and report data efficiently. For desktop, web and mobile.
More informationOne platform for desktop, web and mobile
One platform for desktop, web and mobile Search and filter Get access to all data thematically filter data in context factually and spatially as well as display it dynamically. Export a selection or send
More informationCreation and modification of a geological model Program: Stratigraphy
Engineering manual No. 39 Updated: 11/2018 Creation and modification of a geological model Program: Stratigraphy File: Demo_manual_39.gsg Introduction The aim of this engineering manual is to explain the
More informationPredictive Modeling: Classification. KSE 521 Topic 6 Mun Yi
Predictive Modeling: Classification Topic 6 Mun Yi Agenda Models and Induction Entropy and Information Gain Tree-Based Classifier Probability Estimation 2 Introduction Key concept of BI: Predictive modeling
More informationDidacticiel - Études de cas
1 Theme Comparing the results of the Partial Least Squares Regression from various data mining tools (free: Tanagra, R; commercial: SIMCA-P, SPAD, and SAS). Comparing the behavior of tools is always a
More informationElectric Power * OpenStax HS Physics. : By the end of this section, you will be able to:
OpenStax-CNX module: m54446 1 Electric Power * OpenStax HS Physics This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 4.0 1 : By the end of this section,
More informationData Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning
CI125230 Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning Stephen Brockwell Brockwell IT Consulting Inc. Sean Kinahan Brockwell IT Consulting Inc. Learning Objectives
More informationMachine Learning 3. week
Machine Learning 3. week Entropy Decision Trees ID3 C4.5 Classification and Regression Trees (CART) 1 What is Decision Tree As a short description, decision tree is a data classification procedure which
More information