Environmental Chemistry through Intelligent Atmospheric Data Analysis (EnChIlADA): A Platform for Mining ATOFMS and Other Atmospheric Data

Size: px
Start display at page:

Download "Environmental Chemistry through Intelligent Atmospheric Data Analysis (EnChIlADA): A Platform for Mining ATOFMS and Other Atmospheric Data"

Transcription

1 Environmental Chemistry through Intelligent Atmospheric Data Analysis (EnChIlADA): A Platform for Mining ATOFMS and Other Atmospheric Data Katie Barton, John Choiniere, Melanie Yuen, and Deborah Gross Department of Chemistry, Anna Ritz, Thomas Smith, Leah Steinberg, and David Musicant Department of Mathematics and Computer Science, Jamie Schauer Environmental Chemistry and Technology, University of Wisconsin Madison Lei Chen, Greg Cipriano, and Raghu Ramakrishnan Department of Computer Sciences, University of Wisconsin Madison

2 Today s Menu Goals of Enchilada project Discussion of capabilities (current and future) Data organization User Interface and Analysis Tools Clustering Algorithms Time-series Analysis Labeling ATOFMS Spectra Automatically Summary and Future Outlook

3 Project Description NSF-ITR funding to develop data mining techniques for complex aerosol data sets. Collaboration between aerosol scientists and computer scientists: Address the right questions. Integrate domain knowledge into the data mining techniques. Use real data to test techniques. Answer interesting questions along the way!

4 The Aerosol Scientist s View of Enchilada: Particles Enchilada Real-time measurements A new picture of complex aerosol data sets! Particle Source

5 How Enchilada Fits In ATOFMS MS-Analyze (MS-Access) Time Series Data Other Instrument #1 MS-Control Enchilada (SQL Server) Other Instrument #2... A new picture of complex aerosol data sets!

6 What is Data Mining? The non-trivial discovery of novel, valid, comprehensible and potentially useful patterns from data (Fayyad et al) Computer! Learn something from this data, and explain it to me. Includes Clustering Classification Time series analysis

7 Enchilada Open Source Software Implements data mining tools to analyze atmospheric data (in real time) General Will work with any relevant data, not just ATOFMS data Scalable Will work with data that contains millions of particles Easy to use, intuitive

8 Enchilada Basics Import ATOFMS datasets Organize datasets into collections Display particle spectra Query (time, size, count) Synchronize, Cluster, Label Export collections to MS-Analyze

9 Particle Organization in Enchilada Atoms Not atoms in a chemistry sense! Smallest units of imported data (for our purposes, particles) Dense Atom Info Time = 08:03:03 12/03/03 Size = 0.21 microns Filename = particle1.amz Atom Sparse Atom Info

10 Dataset Organization in Enchilada Datasets Groups of particles Gathered from ATOFMS instrument in one session Each atom belongs to one dataset Collections Virtual datasets Collections can be made from other collections Each atom can belong to many collections Original Datasets Datasets imported as Collections Collections are split or merged to form new Collections

11 (Demo of Enchilada Tools)

12 The Collection Hierarchy Provides a way to view all imported datasets at once Can be quite complex! Contains all the atoms from their subcollections Has the ability to create empty collections Has the ability to copy, cut, and paste collections

13 Datatypes in the Database datatype: the kind of data the program is working with Examples: ATOFMS, TimeSeries, AMS, Mercury data Multiple datatypes give Enchilada power Easy comparisons between instruments or over time Different datatypes in one database Database needs to be detailed and generalized

14 Database Representation Tables in the database: General tables for information that is consistent across datatypes Datatype-specific tables for information specific to each datatype Only one set of general tables in the entire database One set of datatype-specific tables for each datatype in the database

15 Time-Series Analysis Observe trends over many datasets. Provide a broader picture. Elucidate dynamics in data series. Compare different types of time series data. Most other data of interest are time series. Time steps need to be synchronized.

16 East St. Louis Time-Series x10 8 ATOFMS Scaled Number of Particles 6.0x10 8 9x10 7 5x /28/ /01/ /04/ /08/ /12/ /16/ /20/ /24/2003 PM2.5 Concentration (µg/m 3 ) PM-2.5 Total Mass and PM-2.5 OC Mass vs. ATOFMS Total and OC Particles PM-2.5 (MetOne BAM) ATOFMS All Particles PM-2.5 OC (Sunset Labs) ATOFMS OC Particles

17 ATOFMS Scaled Number of Particles 2x10 7 East St. Louis Time-Series 7.0 PM-2.5 BC (Aethalometer) ATOFMS EC Particles PM-2.5 EC (Sunset Labs) ATOFMS EC Particles 2.0 1x x10 7 1x PM-2.5 Real-time BC and PM-2.5 Real-time EC Mass vs. ATOFMS EC Particles 12/01/ /04/ /08/ /12/ /16/ /20/ /24/ /28/2003 PM2.5 Concentration (µg/m 3 )

18 EC vs. BC by Chemical Composition? Goals of this analysis are to use data mining to: Determine the relationship(s) between EC and BC, using composition information from ATOFMS. Understand which chemical components contribute to light-absorbing properties of particles. We want to understand these questions without using prior knowledge. Enchilada will search for the relationship(s).

19 Flexible Importation How do we get all these different datatypes into Enchilada? Customized XML file formats for importing datatypes and data XML (Extensible Markup Language) is an industry standard for exchanging data. Any datatype that can be represented in these file formats could be imported.

20 MetaData Files Contain information about a datatype Used to create new datatype-specific tables in the database Import once, use many times Database MetaData File Mercury Datatype NEW! NEW! General Tables MercuryDataSetInfo Table MercuryAtomInfoDense Table NEW! MercuryAtomInfoSparse Table

21 EnchiladaData Files Contain the actual data to import into the program XML example: <atominfodense> <field>exampleparticle</field> <field> :03:03</field> <atominfosparse table= Peaks > <field>12</field> <field> </field> </atominfosparse> </atominfodense>

22 (Demo of Enchilada Importer & Time Series)

23 Clustering: A Data Mining Technique Answers the question What are the main groups of data points in this database? For chemists: a fast way to find out about the major types of particles present in a sample. With time series aggregation, we can explore simultaneously clustering different datatypes.

24 Clustering The purpose of clustering is to find groups of similar objects. How many groups are in this graph? 4 clusters 3 clusters 2 clusters 1 cluster Or lots of clusters

25 k-means Demo in One Dimension Step 1. Choose initial cluster centers somehow (here, the first two points are chosen arbitrarily). There are better methods of choosing initial cluster centers in the Data Mining literature.

26 k-means Demo in One Dimension Step 2. Assign each point to the centroid to which it is closest. Each group of points is now a cluster.

27 k-means Demo in One Dimension Step 3. Average all the points in each cluster to find the centroid.

28 k-means Demo in One Dimension Step 2 again. Assign each point to its nearest centroid. Note that three points change which cluster they belong to.

29 k-means Demo in One Dimension Step 3 again. Average all the points in each cluster to find the new centroid.

30 k-means Demo in One Dimension Step 2 again. Assign each point to its nearest centroid.

31 k-means Demo in One Dimension Step 3 again. Average all the points in each cluster to find the new centroid.

32 k-means Demo in One Dimension Step 4. Notice that the centroids in the last two averaging steps were the same. This means that the algorithm is done.

33 Clustering Spectra Treat each m/z value as a dimension (so that we store information on the relative concentrations of each ion) Over 600 dimensions Apply the same algorithms we used for 2D data Cluster centers still summarize the particles Positive Spectrum for Typical k-means Centroid

34 (Demo of Clustering in Enchilada)

35 Different Clustering Approaches k-means/k-medians Cluster # parameter, use mean or median to find centers ART-2a Vigilance parameter: cluster radius Considers particles one at a time, rather than in a group. Has been used with ATOFMS before k = 3 vigilance =

36 Clustering Experiment #1 How good is a clustering algorithm? Synthetic Dataset 2000-particle dataset with 7 real particles We know what the clusters should look like Error The average distance from every point in a given cluster to its center Error =

37 Quantitative Analysis of Clustering Homogeneity Graphs: k-means, k = # of particles A B C D E F G A B C D E F G A B C D E F G A B C D E F G A B C D E F G Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 # of particles Error Error Graph A B C D E F G A B C D E F G Cluster 6 Cluster Iteration

38 Clustering Experiment #2 ART-2a versus k-means ART-2a has been used before for ATOFMS data k-means is a more standard clustering algorithm More user-friendly than ART-2a This experiment: ~2000 particles sampled in East St. Louis Same number of clusters from each algorithm Compare cluster centroids to evaluate algorithm performance

39 Chemical Comparison of Clustering ART-2a and k-means are comparable Clusters make chemical sense k-means calculations are more efficient Other algorithms will also be compared m/z of the 9 Cluster Centroids from 2000 East St. Louis ATOFMS Particles Cluster a ART-2a centroid k-means centroid Cluster b m/z

40 Clustering: An EC Case Study C 3 C 3 Relative Peak Area C 4 C C 5 C 6 C -C HSO 4 - k-means centroid EC particle type Relative Intensity C C 2 C 4 C 5 C 2 - C 4 - C 6 C 8 Sample EC particle From YSNP C 10 C 12 C 2 - C 5 - C 6 - C 5 - C 6 - C m/z m/z

41 Timeline: Query vs. Cluster EC Cluster contains 2,212 particles EC Query found 11,745 particles MS-Analyze Enchilada /2/ :00 9/3/ :00 9/4/ :00 9/5/ :00 9/6/ :00 9/7/ :00 9/8/ :00 9/9/ :00 9/10/ :00 9/11/ :00 Number of Particles

42 What are the other particles? 1.0 C C K C 4 C 3 C 5 Fraction of Particles Containing m/z C C 2 4 C 3 - C 2 - C 4 - C 5 C 6 C 6 - Histogram of k-means EC cluster particles C 7 HSO 4 - Fraction of Particles Containing m/z C C 2 C 2 - C 4 - C - C 6 Histogram of query results from YSNP C 10 C 8 HSO 4 - C 12 K 3 SO m/z m/z

43 Cluster and Query Results What are EC particles? Clusters produce a narrow definition of EC and queries use a broad definition of EC Are the queries finding too many particles? Are EC particles distributed among many clusters? Is there an EC particle type that the clustering algorithm is not isolating? YES! Other clusters contain EC and K peaks! Explore integrating domain knowledge for improved clustering. Combine query and cluster perspectives!

44 Future Work in Clustering Finding and implementing algorithms to cluster very large datasets Real-time clustering Finding clusters that change over time Adjustable weighting for data features Clustering multiple datatypes together

45 Labeling Mass Spectra Provide to program: List of common ions Natural isotope distributions Common ion combinations Raw ATOFMS spectra Obtain from program: Spectra with ion composition options labeled above all possible peaks

46 Conclusions and Future Endeavors Enchilada development will continue, and new algorithms will be developed and tested. New features will be implemented: Real-time data acquisition and analysis Enchilada will capture data as it is saved. Support for a variety of data sources, especially other mass spectral data types. Many datasets for analysis: Yellowstone National Park (Summer 2003): Natural Geothermal East St. Louis (Winter 2003/04): Industrial Emissions Switzerland (Fall/Winter 2005): Wood Smoke and Vehicles

47 Acknowledgements The National Science Foundation The University of Wisconsin-Madison TSI, Inc. Others who have contributed: Ben Anderson (Carleton CS) Andy Ault (Carleton Chemistry) Bee-Chung Chen (UW-Madison CS) Zheng (Colin) Huang (UW-Madison CS) Kate Nelson (Carleton CS) Jon Sulman (Carleton CS)

A Stochastic Simulation of Natural Organic Matter and Microbes in the Environment

A Stochastic Simulation of Natural Organic Matter and Microbes in the Environment A Stochastic Simulation of Natural Organic Matter and Microbes in the Environment Xiaorong Xiang Gregory Madey Yingping Huang Steve Cabaniss (University of New Mexico) Department of Computer Science and

More information

for XPS surface analysis

for XPS surface analysis Thermo Scientific Avantage XPS Software Powerful instrument operation and data processing for XPS surface analysis Avantage Software Atomic Concentration (%) 100 The premier software for surface analysis

More information

Administering your Enterprise Geodatabase using Python. Jill Penney

Administering your Enterprise Geodatabase using Python. Jill Penney Administering your Enterprise Geodatabase using Python Jill Penney Assumptions Basic knowledge of python Basic knowledge enterprise geodatabases and workflows You want code Please turn off or silence cell

More information

The shortest path to chemistry data and literature

The shortest path to chemistry data and literature R&D SOLUTIONS Reaxys Fact Sheet The shortest path to chemistry data and literature Designed to support the full range of chemistry research, including pharmaceutical development, environmental health &

More information

Data Structures & Database Queries in GIS

Data Structures & Database Queries in GIS Data Structures & Database Queries in GIS Objective In this lab we will show you how to use ArcGIS for analysis of digital elevation models (DEM s), in relationship to Rocky Mountain bighorn sheep (Ovis

More information

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning Stephen Brockwell President, Brockwell IT Consulting, Inc. Join the conversation #AU2017 KEYWORD Class Summary Silos

More information

Large Scale Structure of the Universe Lab

Large Scale Structure of the Universe Lab Large Scale Structure of the Universe Lab Introduction: Since the mid-1980 s astronomers have gathered data allowing, for the first time, a view of the structure of the Universe in three-dimensions. You

More information

Geodatabase Programming with Python John Yaist

Geodatabase Programming with Python John Yaist Geodatabase Programming with Python John Yaist DevSummit DC February 26, 2016 Washington, DC Target Audience: Assumptions Basic knowledge of Python Basic knowledge of Enterprise Geodatabase and workflows

More information

Troubleshooting Replication and Geodata Services. Liz Parrish & Ben Lin

Troubleshooting Replication and Geodata Services. Liz Parrish & Ben Lin Troubleshooting Replication and Geodata Services Liz Parrish & Ben Lin AGENDA: Troubleshooting Replication and Geodata Services Overview Demo Troubleshooting Q & A Overview of Replication Liz Parrish What

More information

Last updated: Copyright

Last updated: Copyright Last updated: 2012-08-20 Copyright 2004-2012 plabel (v2.4) User s Manual by Bioinformatics Group, Institute of Computing Technology, Chinese Academy of Sciences Tel: 86-10-62601016 Email: zhangkun01@ict.ac.cn,

More information

Geodatabase Programming with Python

Geodatabase Programming with Python DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with Python Craig Gillgrass Assumptions Basic knowledge of python Basic knowledge enterprise geodatabases and workflows Please turn

More information

NMR Data workup using NUTS

NMR Data workup using NUTS omework 1 Chem 636, Fall 2008 due at the beginning of the 2 nd week lab (week of Sept 9) NMR Data workup using NUTS This laboratory and homework introduces the basic processing of one dimensional NMR data

More information

Handling Human Interpreted Analytical Data. Workflows for Pharmaceutical R&D. Presented by Peter Russell

Handling Human Interpreted Analytical Data. Workflows for Pharmaceutical R&D. Presented by Peter Russell Handling Human Interpreted Analytical Data Workflows for Pharmaceutical R&D Presented by Peter Russell 2011 Survey 88% of R&D organizations lack adequate systems to automatically collect data for reporting,

More information

DOCUMENT HISTORY. Initials Section/s Modified Brief Description of Modifications

DOCUMENT HISTORY. Initials Section/s Modified Brief Description of Modifications Page 2 of 13 DOCUMENT HISTORY Date Modified Initials Section/s Modified Brief Description of Modifications Page 3 of 13 Table of Contents 1. Purpose and Applicability... 4 2. Definitions... 4 3. Procedures...

More information

ArcGIS Enterprise: What s New. Philip Heede Shannon Kalisky Melanie Summers Shreyas Shinde

ArcGIS Enterprise: What s New. Philip Heede Shannon Kalisky Melanie Summers Shreyas Shinde ArcGIS Enterprise: What s New Philip Heede Shannon Kalisky Melanie Summers Shreyas Shinde ArcGIS Enterprise is the new name for ArcGIS for Server ArcGIS Enterprise Software Components ArcGIS Server Portal

More information

GEOGRAPHY 350/550 Final Exam Fall 2005 NAME:

GEOGRAPHY 350/550 Final Exam Fall 2005 NAME: 1) A GIS data model using an array of cells to store spatial data is termed: a) Topology b) Vector c) Object d) Raster 2) Metadata a) Usually includes map projection, scale, data types and origin, resolution

More information

The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands.

The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands. GIS LAB 7 The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands. This lab will ask you to work with the Spatial Analyst extension.

More information

Web Visualization of Geo-Spatial Data using SVG and VRML/X3D

Web Visualization of Geo-Spatial Data using SVG and VRML/X3D Web Visualization of Geo-Spatial Data using SVG and VRML/X3D Jianghui Ying Falls Church, VA 22043, USA jying@vt.edu Denis Gračanin Blacksburg, VA 24061, USA gracanin@vt.edu Chang-Tien Lu Falls Church,

More information

SPATIAL INFORMATION GRID AND ITS APPLICATION IN GEOLOGICAL SURVEY

SPATIAL INFORMATION GRID AND ITS APPLICATION IN GEOLOGICAL SURVEY SPATIAL INFORMATION GRID AND ITS APPLICATION IN GEOLOGICAL SURVEY K. T. He a, b, Y. Tang a, W. X. Yu a a School of Electronic Science and Engineering, National University of Defense Technology, Changsha,

More information

MassHunter Software Overview

MassHunter Software Overview MassHunter Software Overview 1 Qualitative Analysis Workflows Workflows in Qualitative Analysis allow the user to only see and work with the areas and dialog boxes they need for their specific tasks A

More information

ACD/Labs Software Impurity Resolution Management. Presented by Peter Russell

ACD/Labs Software Impurity Resolution Management. Presented by Peter Russell ACD/Labs Software Impurity Resolution Management Presented by Peter Russell Impurity Resolution Process Chemists Method Development Specialists Toxicology Groups Stability Groups Analytical Chemists 7/8/2016

More information

Reaxys The Highlights

Reaxys The Highlights Reaxys The Highlights What is Reaxys? A brand new workflow solution for research chemists and scientists from related disciplines An extensive repository of reaction and substance property data A resource

More information

Kalexsyn Overview Kalexsyn, Inc Campus Drive Kalamazoo, MI Phone: (269) Fax: (269)

Kalexsyn Overview Kalexsyn, Inc Campus Drive Kalamazoo, MI Phone: (269) Fax: (269) Kalexsyn verview 2017 Kalexsyn, Inc. 4502 Campus Drive Kalamazoo, MI 49008 Phone: (269) 488-8488 Fax: (269) 488-8490 info@kalexsyn.com What we are Experienced medicinal chemists Conception and synthesis

More information

CityGML XFM Application Template Documentation. Bentley Map V8i (SELECTseries 2)

CityGML XFM Application Template Documentation. Bentley Map V8i (SELECTseries 2) CityGML XFM Application Template Documentation Bentley Map V8i (SELECTseries 2) Table of Contents Introduction to CityGML 1 CityGML XFM Application Template 2 Requirements 2 Finding Documentation 2 To

More information

CSD. Unlock value from crystal structure information in the CSD

CSD. Unlock value from crystal structure information in the CSD CSD CSD-System Unlock value from crystal structure information in the CSD The Cambridge Structural Database (CSD) is the world s most comprehensive and up-todate knowledge base of crystal structure data,

More information

Agilent MassHunter Profinder: Solving the Challenge of Isotopologue Extraction for Qualitative Flux Analysis

Agilent MassHunter Profinder: Solving the Challenge of Isotopologue Extraction for Qualitative Flux Analysis Agilent MassHunter Profinder: Solving the Challenge of Isotopologue Extraction for Qualitative Flux Analysis Technical Overview Introduction Metabolomics studies measure the relative abundance of metabolites

More information

From BASIS DD to Barista Application in Five Easy Steps

From BASIS DD to Barista Application in Five Easy Steps Y The steps are: From BASIS DD to Barista Application in Five Easy Steps By Jim Douglas our current BASIS Data Dictionary is perfect raw material for your first Barista-brewed application. Barista facilitates

More information

Accurate 3D-Modeling of User Inputted Molecules Using a Nelder Mead Algorithm Computer Systems Lab,

Accurate 3D-Modeling of User Inputted Molecules Using a Nelder Mead Algorithm Computer Systems Lab, Accurate 3D-Modeling of User Inputted Molecules Using a Nelder Mead Algorithm Computer Systems Lab, 2007-2008 Ben Parr May 12, 2008 Abstract In order to better understand chemistry, chemists create 3 dimensional

More information

Geodatabase Essentials Part One - Intro to the Geodatabase. Jonathan Murphy Colin Zwicker

Geodatabase Essentials Part One - Intro to the Geodatabase. Jonathan Murphy Colin Zwicker Geodatabase Essentials Part One - Intro to the Geodatabase Jonathan Murphy Colin Zwicker Session Path The Geodatabase - What is it? - Why use it? - What types are there? Inside the Geodatabase Advanced

More information

CS425: Algorithms for Web Scale Data

CS425: Algorithms for Web Scale Data CS425: Algorithms for Web Scale Data Most of the slides are from the Mining of Massive Datasets book. These slides have been modified for CS425. The original slides can be accessed at: www.mmds.org Challenges

More information

What are the Spatial Data Standards?

What are the Spatial Data Standards? What is SDSFIE? 1992 Army, Navy, Air Force and Marine Corps established the Tri- Service CADD/GIS Technology Center at the US Army Engineer Waterways Experiment Station in Vicksburg, Miss. 1999 name was

More information

Integrated Electricity Demand and Price Forecasting

Integrated Electricity Demand and Price Forecasting Integrated Electricity Demand and Price Forecasting Create and Evaluate Forecasting Models The many interrelated factors which influence demand for electricity cannot be directly modeled by closed-form

More information

Geodatabase An Introduction

Geodatabase An Introduction 2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Geodatabase An Introduction David Crawford and Jonathan Murphy Session Path The Geodatabase What is it?

More information

Analytical data, the web, and standards for unified laboratory informatics databases

Analytical data, the web, and standards for unified laboratory informatics databases Analytical data, the web, and standards for unified laboratory informatics databases Presented By Patrick D. Wheeler & Graham A. McGibbon ACS San Diego 16 March, 2016 Sources Process, Analyze Interfaces,

More information

A Reconfigurable Quantum Computer

A Reconfigurable Quantum Computer A Reconfigurable Quantum Computer David Moehring CEO, IonQ, Inc. College Park, MD Quantum Computing for Business 4-6 December 2017, Mountain View, CA IonQ Highlights Full Stack Quantum Computing Company

More information

GIS CONCEPTS ARCGIS METHODS AND. 2 nd Edition, July David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University

GIS CONCEPTS ARCGIS METHODS AND. 2 nd Edition, July David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University GIS CONCEPTS AND ARCGIS METHODS 2 nd Edition, July 2005 David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University Copyright Copyright 2005 by David M. Theobald. All rights

More information

Arboretum Explorer: Using GIS to map the Arnold Arboretum

Arboretum Explorer: Using GIS to map the Arnold Arboretum Arboretum Explorer: Using GIS to map the Arnold Arboretum Donna Tremonte, Arnold Arboretum of Harvard University 2015 Esri User Conference (UC), July 22, 2015 http://arboretum.harvard.edu/explorer Mission

More information

Outline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System

Outline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System Outline I Data Preparation Introduction to SpaceStat and ESTDA II Introduction to ESTDA and SpaceStat III Introduction to time-dynamic regression ESTDA ESTDA & SpaceStat Learning Objectives Activities

More information

Introduction to ArcGIS Server - Creating and Using GIS Services. Mark Ho Instructor Washington, DC

Introduction to ArcGIS Server - Creating and Using GIS Services. Mark Ho Instructor Washington, DC Introduction to ArcGIS Server - Creating and Using GIS Services Mark Ho Instructor Washington, DC Technical Workshop Road Map Product overview Building server applications GIS services Developer Help resources

More information

It s about time... The only timeline tool you ll ever need!

It s about time... The only timeline tool you ll ever need! It s about time... The only timeline tool you ll ever need! Introduction about me Jon Tomczak Senior Consultant Crypsis Game Dev turned Forensicator Past: Started TZWorks in 2006 Consultant at Mandiant

More information

GIS FOR PLANNING. Course Overview. Schedule. Instructor. Prerequisites. Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158

GIS FOR PLANNING. Course Overview. Schedule. Instructor. Prerequisites. Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158 GIS FOR PLANNING Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158 Schedule Class/Lab - SARUP 158 Thursdays 5:30pm - 8:10pm Office Hours - By Appointment Project Ideas - Week 4 Final - 5/10/2018 Instructor

More information

Bentley Map Advancing GIS for the World s Infrastructure

Bentley Map Advancing GIS for the World s Infrastructure Bentley Map Advancing GIS for the World s Infrastructure Presentation Overview Why would you need Bentley Map? What is Bentley Map? Where is Bentley Map Used? Why would you need Bentley Map? Because your

More information

Gridded Traffic Density Estimates for Southern

Gridded Traffic Density Estimates for Southern Gridded Traffic Density Estimates for Southern California, 1995-2014 User Notes authored by Beau, 11/28/2017 METADATA: Each raster file contains Traffic Density data for one year (1995, 2000, 2005, 2010,

More information

Python Raster Analysis. Kevin M. Johnston Nawajish Noman

Python Raster Analysis. Kevin M. Johnston Nawajish Noman Python Raster Analysis Kevin M. Johnston Nawajish Noman Outline Managing rasters and performing analysis with Map Algebra How to access the analysis capability - Demonstration Complex expressions and optimization

More information

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University Text Mining Dr. Yanjun Li Associate Professor Department of Computer and Information Sciences Fordham University Outline Introduction: Data Mining Part One: Text Mining Part Two: Preprocessing Text Data

More information

Statistics Toolbox 6. Apply statistical algorithms and probability models

Statistics Toolbox 6. Apply statistical algorithms and probability models Statistics Toolbox 6 Apply statistical algorithms and probability models Statistics Toolbox provides engineers, scientists, researchers, financial analysts, and statisticians with a comprehensive set of

More information

Creation of an Internet Based Indiana Water Quality Atlas (IWQA)

Creation of an Internet Based Indiana Water Quality Atlas (IWQA) Department of Environmental Management Creation of an Internet Based Water Quality Atlas (IWQA) May 4, 2005 IUPUI 1200 Waterway Blvd., Suite 100 polis, 46202-5140 Water Quality Atlas John Buechler, Neil

More information

Working with the Geodatabase

Working with the Geodatabase Working with the Geodatabase Agenda What is the geodatabase? Benefits of the geodatabase Inside the geodatabase Geodatabase rules Demos Additional resources and training Q & A The Geodatabase is the foundation

More information

GIS-based Smart Campus System using 3D Modeling

GIS-based Smart Campus System using 3D Modeling GIS-based Smart Campus System using 3D Modeling Smita Sengupta GISE Advance Research Lab. IIT Bombay, Powai Mumbai 400 076, India smitas@cse.iitb.ac.in Concept of Smart Campus System Overview of IITB Campus

More information

A Study to Measure the Chemical Characteristics of Particle Emissions from Biomass Burning Stoves

A Study to Measure the Chemical Characteristics of Particle Emissions from Biomass Burning Stoves A Study to Measure the Chemical Characteristics of Particle Emissions from Biomass Burning Stoves End of Summer Workshop, 2006 Dabrina Dutcher School of Public Health and Department of Mechanical Engineering

More information

Tutorial 2: Analysis of DIA data in Skyline

Tutorial 2: Analysis of DIA data in Skyline Tutorial 2: Analysis of DIA data in Skyline In this tutorial we will learn how to use Skyline to perform targeted post-acquisition analysis for peptide and inferred protein detection and quantitation using

More information

GENERALIZATION IN THE NEW GENERATION OF GIS. Dan Lee ESRI, Inc. 380 New York Street Redlands, CA USA Fax:

GENERALIZATION IN THE NEW GENERATION OF GIS. Dan Lee ESRI, Inc. 380 New York Street Redlands, CA USA Fax: GENERALIZATION IN THE NEW GENERATION OF GIS Dan Lee ESRI, Inc. 380 New York Street Redlands, CA 92373 USA dlee@esri.com Fax: 909-793-5953 Abstract In the research and development of automated map generalization,

More information

Time Series Analysis with SAR & Optical Satellite Data

Time Series Analysis with SAR & Optical Satellite Data Time Series Analysis with SAR & Optical Satellite Data Thomas Bahr ESRI European User Conference Thursday October 2015 harris.com Motivation Changes in land surface characteristics mirror a multitude of

More information

From BASIS DD to Barista Application in Five Easy Steps

From BASIS DD to Barista Application in Five Easy Steps Y The steps are: From BASIS DD to Barista Application in Five Easy Steps By Jim Douglas our current BASIS Data Dictionary is perfect raw material for your first Barista-brewed application. Barista facilitates

More information

Skyline Small Molecule Targets

Skyline Small Molecule Targets Skyline Small Molecule Targets The Skyline Targeted Proteomics Environment provides informative visual displays of the raw mass spectrometer data you import into your Skyline documents. Originally developed

More information

Reaxys Medicinal Chemistry Fact Sheet

Reaxys Medicinal Chemistry Fact Sheet R&D SOLUTIONS FOR PHARMA & LIFE SCIENCES Reaxys Medicinal Chemistry Fact Sheet Essential data for lead identification and optimization Reaxys Medicinal Chemistry empowers early discovery in drug development

More information

Building Inflation Tables and CER Libraries

Building Inflation Tables and CER Libraries Building Inflation Tables and CER Libraries January 2007 Presented by James K. Johnson Tecolote Research, Inc. Copyright Tecolote Research, Inc. September 2006 Abstract Building Inflation Tables and CER

More information

Gridded Ambient Air Pollutant Concentrations for Southern California, User Notes authored by Beau MacDonald, 11/28/2017

Gridded Ambient Air Pollutant Concentrations for Southern California, User Notes authored by Beau MacDonald, 11/28/2017 Gridded Ambient Air Pollutant Concentrations for Southern California, 1995-2014 User Notes authored by Beau, 11/28/2017 METADATA: Each raster file contains data for one pollutant (NO2, O3, PM2.5, and PM10)

More information

Integrated Cheminformatics to Guide Drug Discovery

Integrated Cheminformatics to Guide Drug Discovery Integrated Cheminformatics to Guide Drug Discovery Matthew Segall, Ed Champness, Peter Hunt, Tamsin Mansley CINF Drug Discovery Cheminformatics Approaches August 23 rd 2017 Optibrium, StarDrop, Auto-Modeller,

More information

OECD QSAR Toolbox v.4.1. Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals

OECD QSAR Toolbox v.4.1. Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals OECD QSAR Toolbox v.4.1 Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals Background Outlook Objectives The exercise Workflow 2 Background This is a

More information

1. Prepare the MALDI sample plate by spotting an angiotensin standard and the test sample(s).

1. Prepare the MALDI sample plate by spotting an angiotensin standard and the test sample(s). Analysis of a Peptide Sequence from a Proteolytic Digest by MALDI-TOF Post-Source Decay (PSD) and Collision-Induced Dissociation (CID) Standard Operating Procedure Purpose: The following procedure may

More information

OECD QSAR Toolbox v.4.1. Step-by-step example for building QSAR model

OECD QSAR Toolbox v.4.1. Step-by-step example for building QSAR model OECD QSAR Toolbox v.4.1 Step-by-step example for building QSAR model Background Objectives The exercise Workflow of the exercise Outlook 2 Background This is a step-by-step presentation designed to take

More information

Are You Maximizing The Value Of All Your Data?

Are You Maximizing The Value Of All Your Data? Are You Maximizing The Value Of All Your Data? Using The SAS Bridge for ESRI With ArcGIS Business Analyst In A Retail Market Analysis SAS and ESRI: Bringing GIS Mapping and SAS Data Together Presented

More information

In Silico Investigation of Off-Target Effects

In Silico Investigation of Off-Target Effects PHARMA & LIFE SCIENCES WHITEPAPER In Silico Investigation of Off-Target Effects STREAMLINING IN SILICO PROFILING In silico techniques require exhaustive data and sophisticated, well-structured informatics

More information

Geodatabase Best Practices. Dave Crawford Erik Hoel

Geodatabase Best Practices. Dave Crawford Erik Hoel Geodatabase Best Practices Dave Crawford Erik Hoel Geodatabase best practices - outline Geodatabase creation Data ownership Data model Data configuration Geodatabase behaviors Data integrity and validation

More information

Free and Open Source Software for Cadastre and Land Registration : A Hidden Treasure? Gertrude Pieper Espada. Overview

Free and Open Source Software for Cadastre and Land Registration : A Hidden Treasure? Gertrude Pieper Espada. Overview Free and Open Source Software for Cadastre and Land Registration : A Hidden Treasure? Gertrude Pieper Espada Overview FLOSS concepts Digital Land Administration systems FLOSS Database alternatives FLOSS

More information

Lab Activity: The Central Limit Theorem

Lab Activity: The Central Limit Theorem Lab Activity: The Central Limit Theorem In this lab activity, you will explore the properties of the Central Limit Theorem. Student Learning Outcomes By the end of this chapter, you should be able to do

More information

Introduction to Machine Learning. PCA and Spectral Clustering. Introduction to Machine Learning, Slides: Eran Halperin

Introduction to Machine Learning. PCA and Spectral Clustering. Introduction to Machine Learning, Slides: Eran Halperin 1 Introduction to Machine Learning PCA and Spectral Clustering Introduction to Machine Learning, 2013-14 Slides: Eran Halperin Singular Value Decomposition (SVD) The singular value decomposition (SVD)

More information

Lecture 2. Introduction to ESRI s ArcGIS Desktop and ArcMap

Lecture 2. Introduction to ESRI s ArcGIS Desktop and ArcMap Lecture 2 Introduction to ESRI s ArcGIS Desktop and ArcMap Outline ESRI What is ArcGIS? ArcGIS Desktop ArcMap Overview Views Layers Attribute Tables Help! Scale Tips and Tricks ESRI Environmental Systems

More information

Introduction to Google Mapping Tools

Introduction to Google Mapping Tools Introduction to Google Mapping Tools Google s Mapping Tools Explore geographic data. Organize your own geographic data. Visualize complex data. Share your data with the world. Tell your story and educate

More information

GIS at UCAR. The evolution of NCAR s GIS Initiative. Olga Wilhelmi ESIG-NCAR Unidata Workshop 24 June, 2003

GIS at UCAR. The evolution of NCAR s GIS Initiative. Olga Wilhelmi ESIG-NCAR Unidata Workshop 24 June, 2003 GIS at UCAR The evolution of NCAR s GIS Initiative Olga Wilhelmi ESIG-NCAR Unidata Workshop 24 June, 2003 Why GIS? z z z z More questions about various climatological, meteorological, hydrological and

More information

Application Note 12: Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova

Application Note 12: Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova Application Note : Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova Paul Bowyer, Magritek, Inc. and Mark Dixon, Mestrelab Sample screening to verify the identity or integrity

More information

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30 NED Mission: Provide a comprehensive, reliable and easy-to-use synthesis of multi-wavelength data from NASA missions, published catalogs, and the refereed literature, to enhance and enable astrophysical

More information

Impression Store: Compressive Sensing-based Storage for. Big Data Analytics

Impression Store: Compressive Sensing-based Storage for. Big Data Analytics Impression Store: Compressive Sensing-based Storage for Big Data Analytics Jiaxing Zhang, Ying Yan, Liang Jeff Chen, Minjie Wang, Thomas Moscibroda & Zheng Zhang Microsoft Research The Curse of O(N) in

More information

Framework for on an open 3D urban analysis platform based on OGC Web Services

Framework for on an open 3D urban analysis platform based on OGC Web Services Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Framework for on an open 3D urban analysis platform based on OGC Web Services Marc-O. Löwner & Thomas Adolphi (née Becker) Technische

More information

EEOS 381 -Spatial Databases and GIS Applications

EEOS 381 -Spatial Databases and GIS Applications EEOS 381 -Spatial Databases and GIS Applications Lecture 5 Geodatabases What is a Geodatabase? Geographic Database ESRI-coined term A standard RDBMS that stores and manages geographic data A modern object-relational

More information

D2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS

D2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS D2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS EDWARD GAUSE, GISP DIRECTOR OF INFORMATION SERVICES (ENGINEERING APPS) HTC (HORRY TELEPHONE COOP.) EDWARD GAUSE, GISP DIRECTOR OF INFORMATION

More information

A Model of GIS Interoperability Based on JavaRMI

A Model of GIS Interoperability Based on JavaRMI A Model of GIS Interoperability Based on Java Gao Gang-yi 1 Chen Hai-bo 2 1 Zhejiang University of Finance & Economics, Hangzhou 310018, China 2 College of Computer Science and Technology, Zhejiang UniversityHangzhou

More information

Geodatabase Management Pathway

Geodatabase Management Pathway Geodatabase Management Pathway Table of Contents ArcGIS Desktop II: Tools and Functionality 3 ArcGIS Desktop III: GIS Workflows and Analysis 6 Building Geodatabases 8 Data Management in the Multiuser Geodatabase

More information

These modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS.

These modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS. Online GIS Training and training modules covered are: 1. ArcGIS, Analysis, Fundamentals and Implementation 2. ArcGIS Web Data Sharing 3. ArcGIS for Desktop 4. ArcGIS for Server These modules are covered

More information

Detecting the Unexpected

Detecting the Unexpected Detecting the Unexpected Discovery in the Era of Astronomically Big Data Insights from Space Telescope Science Institute s first Big Data conference Josh Peek, Chair Sarah Kendrew, C0-Chair SOC: Erik Tollerud,

More information

Tutorial 8 Raster Data Analysis

Tutorial 8 Raster Data Analysis Objectives Tutorial 8 Raster Data Analysis This tutorial is designed to introduce you to a basic set of raster-based analyses including: 1. Displaying Digital Elevation Model (DEM) 2. Slope calculations

More information

DEVELOPMENT OF GPS PHOTOS DATABASE FOR LAND USE AND LAND COVER APPLICATIONS

DEVELOPMENT OF GPS PHOTOS DATABASE FOR LAND USE AND LAND COVER APPLICATIONS DEVELOPMENT OF GPS PHOTOS DATABASE FOR LAND USE AND LAND COVER APPLICATIONS An Ngoc VAN and Wataru TAKEUCHI Institute of Industrial Science University of Tokyo 4-6-1 Komaba, Meguro-ku, Tokyo 153-8505 Japan

More information

CSE 241 Class 1. Jeremy Buhler. August 24,

CSE 241 Class 1. Jeremy Buhler. August 24, CSE 41 Class 1 Jeremy Buhler August 4, 015 Before class, write URL on board: http://classes.engineering.wustl.edu/cse41/. Also: Jeremy Buhler, Office: Jolley 506, 314-935-6180 1 Welcome and Introduction

More information

HYPERGRAPH BASED SEMI-SUPERVISED LEARNING ALGORITHMS APPLIED TO SPEECH RECOGNITION PROBLEM: A NOVEL APPROACH

HYPERGRAPH BASED SEMI-SUPERVISED LEARNING ALGORITHMS APPLIED TO SPEECH RECOGNITION PROBLEM: A NOVEL APPROACH HYPERGRAPH BASED SEMI-SUPERVISED LEARNING ALGORITHMS APPLIED TO SPEECH RECOGNITION PROBLEM: A NOVEL APPROACH Hoang Trang 1, Tran Hoang Loc 1 1 Ho Chi Minh City University of Technology-VNU HCM, Ho Chi

More information

Patent Searching using Bayesian Statistics

Patent Searching using Bayesian Statistics Patent Searching using Bayesian Statistics Willem van Hoorn, Exscientia Ltd Biovia European Forum, London, June 2017 Contents Who are we? Searching molecules in patents What can Pipeline Pilot do for you?

More information

GIS CONCEPTS ARCGIS METHODS AND. 3 rd Edition, July David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University

GIS CONCEPTS ARCGIS METHODS AND. 3 rd Edition, July David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University GIS CONCEPTS AND ARCGIS METHODS 3 rd Edition, July 2007 David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University Copyright Copyright 2007 by David M. Theobald. All rights

More information

CS 350 A Computing Perspective on GIS

CS 350 A Computing Perspective on GIS CS 350 A Computing Perspective on GIS What is GIS? Definitions A powerful set of tools for collecting, storing, retrieving at will, transforming and displaying spatial data from the real world (Burrough,

More information

Marvin 5.4 A new generation of structure indexing at Elsevier. Dr. Michael Maier, Dr. Heike Nau, Elsevier

Marvin 5.4 A new generation of structure indexing at Elsevier. Dr. Michael Maier, Dr. Heike Nau, Elsevier Marvin 5.4 A new generation of structure indexing at Elsevier Dr. Michael Maier, Dr. Heike Nau, Elsevier Agenda Elsevier: Reaxys database Compound classes Structure requirements Marvin 5.4 Decision process

More information

Bentley Map V8i (SELECTseries 3)

Bentley Map V8i (SELECTseries 3) Bentley Map V8i (SELECTseries 3) A quick overview Why Bentley Map Viewing and editing of geospatial data from file based GIS formats, spatial databases and raster Assembling geospatial/non-geospatial data

More information

Traffic accidents and the road network in SAS/GIS

Traffic accidents and the road network in SAS/GIS Traffic accidents and the road network in SAS/GIS Frank Poppe SWOV Institute for Road Safety Research, the Netherlands Introduction The first figure shows a screen snapshot of SAS/GIS with part of the

More information

an accessible interface to marine environmental data Russell Moffitt

an accessible interface to marine environmental data Russell Moffitt an accessible interface to marine environmental data Russell Moffitt The Atlas Project GOAL: To provide a single point of access to oceanographic and environmental data for use by marine resource researchers,

More information

GIS Functions and Integration. Tyler Pauley Associate Consultant

GIS Functions and Integration. Tyler Pauley Associate Consultant GIS Functions and Integration Tyler Pauley Associate Consultant Contents GIS in AgileAssets products Displaying data within AMS Symbolizing the map display Display on Bing Maps Demo- Displaying a map in

More information

Classification Semi-supervised learning based on network. Speakers: Hanwen Wang, Xinxin Huang, and Zeyu Li CS Winter

Classification Semi-supervised learning based on network. Speakers: Hanwen Wang, Xinxin Huang, and Zeyu Li CS Winter Classification Semi-supervised learning based on network Speakers: Hanwen Wang, Xinxin Huang, and Zeyu Li CS 249-2 2017 Winter Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions Xiaojin

More information

CityBES: A Data and Computing Platform for City Buildings

CityBES: A Data and Computing Platform for City Buildings CityBES: A Data and Computing Platform for City Buildings Yixing Chen, Tianzhen Hong, Mary Ann Piette, Xuan Luo Building Technology and Urban Systems Division Urban Infrastructures NSF Workshop New York

More information

Fog Monitor 100 (FM 100) Extinction Module. Operator Manual

Fog Monitor 100 (FM 100) Extinction Module. Operator Manual Particle Analysis and Display System (PADS): Fog Monitor 100 (FM 100) Extinction Module Operator Manual DOC-0217 Rev A-1 PADS 2.7.3, FM 100 Extinction Module 2.7.0 5710 Flatiron Parkway, Unit B Boulder,

More information

ADDRESSING A HOW TO LOOK AT GIS ADDRESSING 9/13/2017

ADDRESSING A HOW TO LOOK AT GIS ADDRESSING 9/13/2017 ADDRESSING A Look at Creating & Updating Point Files A HOW TO LOOK AT GIS ADDRESSING Creating points using LAT/LONG fields from WINGAP Creating addressing location (GPS/Latitude & Longitude) points using

More information

A Modular NMF Matching Algorithm for Radiation Spectra

A Modular NMF Matching Algorithm for Radiation Spectra A Modular NMF Matching Algorithm for Radiation Spectra Melissa L. Koudelka Sensor Exploitation Applications Sandia National Laboratories mlkoude@sandia.gov Daniel J. Dorsey Systems Technologies Sandia

More information

PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE

PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE 1 SERVING THE LIFE SCIENCES SPACE ADDRESSING KEY CHALLENGES ACROSS THE R&D VALUE CHAIN Characterize targets & analyze disease

More information

The Case for Use Cases

The Case for Use Cases The Case for Use Cases The integration of internal and external chemical information is a vital and complex activity for the pharmaceutical industry. David Walsh, Grail Entropix Ltd Costs of Integrating

More information