Environmental Chemistry through Intelligent Atmospheric Data Analysis (EnChIlADA): A Platform for Mining ATOFMS and Other Atmospheric Data
|
|
- Berenice Berry
- 5 years ago
- Views:
Transcription
1 Environmental Chemistry through Intelligent Atmospheric Data Analysis (EnChIlADA): A Platform for Mining ATOFMS and Other Atmospheric Data Katie Barton, John Choiniere, Melanie Yuen, and Deborah Gross Department of Chemistry, Anna Ritz, Thomas Smith, Leah Steinberg, and David Musicant Department of Mathematics and Computer Science, Jamie Schauer Environmental Chemistry and Technology, University of Wisconsin Madison Lei Chen, Greg Cipriano, and Raghu Ramakrishnan Department of Computer Sciences, University of Wisconsin Madison
2 Today s Menu Goals of Enchilada project Discussion of capabilities (current and future) Data organization User Interface and Analysis Tools Clustering Algorithms Time-series Analysis Labeling ATOFMS Spectra Automatically Summary and Future Outlook
3 Project Description NSF-ITR funding to develop data mining techniques for complex aerosol data sets. Collaboration between aerosol scientists and computer scientists: Address the right questions. Integrate domain knowledge into the data mining techniques. Use real data to test techniques. Answer interesting questions along the way!
4 The Aerosol Scientist s View of Enchilada: Particles Enchilada Real-time measurements A new picture of complex aerosol data sets! Particle Source
5 How Enchilada Fits In ATOFMS MS-Analyze (MS-Access) Time Series Data Other Instrument #1 MS-Control Enchilada (SQL Server) Other Instrument #2... A new picture of complex aerosol data sets!
6 What is Data Mining? The non-trivial discovery of novel, valid, comprehensible and potentially useful patterns from data (Fayyad et al) Computer! Learn something from this data, and explain it to me. Includes Clustering Classification Time series analysis
7 Enchilada Open Source Software Implements data mining tools to analyze atmospheric data (in real time) General Will work with any relevant data, not just ATOFMS data Scalable Will work with data that contains millions of particles Easy to use, intuitive
8 Enchilada Basics Import ATOFMS datasets Organize datasets into collections Display particle spectra Query (time, size, count) Synchronize, Cluster, Label Export collections to MS-Analyze
9 Particle Organization in Enchilada Atoms Not atoms in a chemistry sense! Smallest units of imported data (for our purposes, particles) Dense Atom Info Time = 08:03:03 12/03/03 Size = 0.21 microns Filename = particle1.amz Atom Sparse Atom Info
10 Dataset Organization in Enchilada Datasets Groups of particles Gathered from ATOFMS instrument in one session Each atom belongs to one dataset Collections Virtual datasets Collections can be made from other collections Each atom can belong to many collections Original Datasets Datasets imported as Collections Collections are split or merged to form new Collections
11 (Demo of Enchilada Tools)
12 The Collection Hierarchy Provides a way to view all imported datasets at once Can be quite complex! Contains all the atoms from their subcollections Has the ability to create empty collections Has the ability to copy, cut, and paste collections
13 Datatypes in the Database datatype: the kind of data the program is working with Examples: ATOFMS, TimeSeries, AMS, Mercury data Multiple datatypes give Enchilada power Easy comparisons between instruments or over time Different datatypes in one database Database needs to be detailed and generalized
14 Database Representation Tables in the database: General tables for information that is consistent across datatypes Datatype-specific tables for information specific to each datatype Only one set of general tables in the entire database One set of datatype-specific tables for each datatype in the database
15 Time-Series Analysis Observe trends over many datasets. Provide a broader picture. Elucidate dynamics in data series. Compare different types of time series data. Most other data of interest are time series. Time steps need to be synchronized.
16 East St. Louis Time-Series x10 8 ATOFMS Scaled Number of Particles 6.0x10 8 9x10 7 5x /28/ /01/ /04/ /08/ /12/ /16/ /20/ /24/2003 PM2.5 Concentration (µg/m 3 ) PM-2.5 Total Mass and PM-2.5 OC Mass vs. ATOFMS Total and OC Particles PM-2.5 (MetOne BAM) ATOFMS All Particles PM-2.5 OC (Sunset Labs) ATOFMS OC Particles
17 ATOFMS Scaled Number of Particles 2x10 7 East St. Louis Time-Series 7.0 PM-2.5 BC (Aethalometer) ATOFMS EC Particles PM-2.5 EC (Sunset Labs) ATOFMS EC Particles 2.0 1x x10 7 1x PM-2.5 Real-time BC and PM-2.5 Real-time EC Mass vs. ATOFMS EC Particles 12/01/ /04/ /08/ /12/ /16/ /20/ /24/ /28/2003 PM2.5 Concentration (µg/m 3 )
18 EC vs. BC by Chemical Composition? Goals of this analysis are to use data mining to: Determine the relationship(s) between EC and BC, using composition information from ATOFMS. Understand which chemical components contribute to light-absorbing properties of particles. We want to understand these questions without using prior knowledge. Enchilada will search for the relationship(s).
19 Flexible Importation How do we get all these different datatypes into Enchilada? Customized XML file formats for importing datatypes and data XML (Extensible Markup Language) is an industry standard for exchanging data. Any datatype that can be represented in these file formats could be imported.
20 MetaData Files Contain information about a datatype Used to create new datatype-specific tables in the database Import once, use many times Database MetaData File Mercury Datatype NEW! NEW! General Tables MercuryDataSetInfo Table MercuryAtomInfoDense Table NEW! MercuryAtomInfoSparse Table
21 EnchiladaData Files Contain the actual data to import into the program XML example: <atominfodense> <field>exampleparticle</field> <field> :03:03</field> <atominfosparse table= Peaks > <field>12</field> <field> </field> </atominfosparse> </atominfodense>
22 (Demo of Enchilada Importer & Time Series)
23 Clustering: A Data Mining Technique Answers the question What are the main groups of data points in this database? For chemists: a fast way to find out about the major types of particles present in a sample. With time series aggregation, we can explore simultaneously clustering different datatypes.
24 Clustering The purpose of clustering is to find groups of similar objects. How many groups are in this graph? 4 clusters 3 clusters 2 clusters 1 cluster Or lots of clusters
25 k-means Demo in One Dimension Step 1. Choose initial cluster centers somehow (here, the first two points are chosen arbitrarily). There are better methods of choosing initial cluster centers in the Data Mining literature.
26 k-means Demo in One Dimension Step 2. Assign each point to the centroid to which it is closest. Each group of points is now a cluster.
27 k-means Demo in One Dimension Step 3. Average all the points in each cluster to find the centroid.
28 k-means Demo in One Dimension Step 2 again. Assign each point to its nearest centroid. Note that three points change which cluster they belong to.
29 k-means Demo in One Dimension Step 3 again. Average all the points in each cluster to find the new centroid.
30 k-means Demo in One Dimension Step 2 again. Assign each point to its nearest centroid.
31 k-means Demo in One Dimension Step 3 again. Average all the points in each cluster to find the new centroid.
32 k-means Demo in One Dimension Step 4. Notice that the centroids in the last two averaging steps were the same. This means that the algorithm is done.
33 Clustering Spectra Treat each m/z value as a dimension (so that we store information on the relative concentrations of each ion) Over 600 dimensions Apply the same algorithms we used for 2D data Cluster centers still summarize the particles Positive Spectrum for Typical k-means Centroid
34 (Demo of Clustering in Enchilada)
35 Different Clustering Approaches k-means/k-medians Cluster # parameter, use mean or median to find centers ART-2a Vigilance parameter: cluster radius Considers particles one at a time, rather than in a group. Has been used with ATOFMS before k = 3 vigilance =
36 Clustering Experiment #1 How good is a clustering algorithm? Synthetic Dataset 2000-particle dataset with 7 real particles We know what the clusters should look like Error The average distance from every point in a given cluster to its center Error =
37 Quantitative Analysis of Clustering Homogeneity Graphs: k-means, k = # of particles A B C D E F G A B C D E F G A B C D E F G A B C D E F G A B C D E F G Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 # of particles Error Error Graph A B C D E F G A B C D E F G Cluster 6 Cluster Iteration
38 Clustering Experiment #2 ART-2a versus k-means ART-2a has been used before for ATOFMS data k-means is a more standard clustering algorithm More user-friendly than ART-2a This experiment: ~2000 particles sampled in East St. Louis Same number of clusters from each algorithm Compare cluster centroids to evaluate algorithm performance
39 Chemical Comparison of Clustering ART-2a and k-means are comparable Clusters make chemical sense k-means calculations are more efficient Other algorithms will also be compared m/z of the 9 Cluster Centroids from 2000 East St. Louis ATOFMS Particles Cluster a ART-2a centroid k-means centroid Cluster b m/z
40 Clustering: An EC Case Study C 3 C 3 Relative Peak Area C 4 C C 5 C 6 C -C HSO 4 - k-means centroid EC particle type Relative Intensity C C 2 C 4 C 5 C 2 - C 4 - C 6 C 8 Sample EC particle From YSNP C 10 C 12 C 2 - C 5 - C 6 - C 5 - C 6 - C m/z m/z
41 Timeline: Query vs. Cluster EC Cluster contains 2,212 particles EC Query found 11,745 particles MS-Analyze Enchilada /2/ :00 9/3/ :00 9/4/ :00 9/5/ :00 9/6/ :00 9/7/ :00 9/8/ :00 9/9/ :00 9/10/ :00 9/11/ :00 Number of Particles
42 What are the other particles? 1.0 C C K C 4 C 3 C 5 Fraction of Particles Containing m/z C C 2 4 C 3 - C 2 - C 4 - C 5 C 6 C 6 - Histogram of k-means EC cluster particles C 7 HSO 4 - Fraction of Particles Containing m/z C C 2 C 2 - C 4 - C - C 6 Histogram of query results from YSNP C 10 C 8 HSO 4 - C 12 K 3 SO m/z m/z
43 Cluster and Query Results What are EC particles? Clusters produce a narrow definition of EC and queries use a broad definition of EC Are the queries finding too many particles? Are EC particles distributed among many clusters? Is there an EC particle type that the clustering algorithm is not isolating? YES! Other clusters contain EC and K peaks! Explore integrating domain knowledge for improved clustering. Combine query and cluster perspectives!
44 Future Work in Clustering Finding and implementing algorithms to cluster very large datasets Real-time clustering Finding clusters that change over time Adjustable weighting for data features Clustering multiple datatypes together
45 Labeling Mass Spectra Provide to program: List of common ions Natural isotope distributions Common ion combinations Raw ATOFMS spectra Obtain from program: Spectra with ion composition options labeled above all possible peaks
46 Conclusions and Future Endeavors Enchilada development will continue, and new algorithms will be developed and tested. New features will be implemented: Real-time data acquisition and analysis Enchilada will capture data as it is saved. Support for a variety of data sources, especially other mass spectral data types. Many datasets for analysis: Yellowstone National Park (Summer 2003): Natural Geothermal East St. Louis (Winter 2003/04): Industrial Emissions Switzerland (Fall/Winter 2005): Wood Smoke and Vehicles
47 Acknowledgements The National Science Foundation The University of Wisconsin-Madison TSI, Inc. Others who have contributed: Ben Anderson (Carleton CS) Andy Ault (Carleton Chemistry) Bee-Chung Chen (UW-Madison CS) Zheng (Colin) Huang (UW-Madison CS) Kate Nelson (Carleton CS) Jon Sulman (Carleton CS)
A Stochastic Simulation of Natural Organic Matter and Microbes in the Environment
A Stochastic Simulation of Natural Organic Matter and Microbes in the Environment Xiaorong Xiang Gregory Madey Yingping Huang Steve Cabaniss (University of New Mexico) Department of Computer Science and
More informationfor XPS surface analysis
Thermo Scientific Avantage XPS Software Powerful instrument operation and data processing for XPS surface analysis Avantage Software Atomic Concentration (%) 100 The premier software for surface analysis
More informationAdministering your Enterprise Geodatabase using Python. Jill Penney
Administering your Enterprise Geodatabase using Python Jill Penney Assumptions Basic knowledge of python Basic knowledge enterprise geodatabases and workflows You want code Please turn off or silence cell
More informationThe shortest path to chemistry data and literature
R&D SOLUTIONS Reaxys Fact Sheet The shortest path to chemistry data and literature Designed to support the full range of chemistry research, including pharmaceutical development, environmental health &
More informationData Structures & Database Queries in GIS
Data Structures & Database Queries in GIS Objective In this lab we will show you how to use ArcGIS for analysis of digital elevation models (DEM s), in relationship to Rocky Mountain bighorn sheep (Ovis
More informationData Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning
Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning Stephen Brockwell President, Brockwell IT Consulting, Inc. Join the conversation #AU2017 KEYWORD Class Summary Silos
More informationLarge Scale Structure of the Universe Lab
Large Scale Structure of the Universe Lab Introduction: Since the mid-1980 s astronomers have gathered data allowing, for the first time, a view of the structure of the Universe in three-dimensions. You
More informationGeodatabase Programming with Python John Yaist
Geodatabase Programming with Python John Yaist DevSummit DC February 26, 2016 Washington, DC Target Audience: Assumptions Basic knowledge of Python Basic knowledge of Enterprise Geodatabase and workflows
More informationTroubleshooting Replication and Geodata Services. Liz Parrish & Ben Lin
Troubleshooting Replication and Geodata Services Liz Parrish & Ben Lin AGENDA: Troubleshooting Replication and Geodata Services Overview Demo Troubleshooting Q & A Overview of Replication Liz Parrish What
More informationLast updated: Copyright
Last updated: 2012-08-20 Copyright 2004-2012 plabel (v2.4) User s Manual by Bioinformatics Group, Institute of Computing Technology, Chinese Academy of Sciences Tel: 86-10-62601016 Email: zhangkun01@ict.ac.cn,
More informationGeodatabase Programming with Python
DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with Python Craig Gillgrass Assumptions Basic knowledge of python Basic knowledge enterprise geodatabases and workflows Please turn
More informationNMR Data workup using NUTS
omework 1 Chem 636, Fall 2008 due at the beginning of the 2 nd week lab (week of Sept 9) NMR Data workup using NUTS This laboratory and homework introduces the basic processing of one dimensional NMR data
More informationHandling Human Interpreted Analytical Data. Workflows for Pharmaceutical R&D. Presented by Peter Russell
Handling Human Interpreted Analytical Data Workflows for Pharmaceutical R&D Presented by Peter Russell 2011 Survey 88% of R&D organizations lack adequate systems to automatically collect data for reporting,
More informationDOCUMENT HISTORY. Initials Section/s Modified Brief Description of Modifications
Page 2 of 13 DOCUMENT HISTORY Date Modified Initials Section/s Modified Brief Description of Modifications Page 3 of 13 Table of Contents 1. Purpose and Applicability... 4 2. Definitions... 4 3. Procedures...
More informationArcGIS Enterprise: What s New. Philip Heede Shannon Kalisky Melanie Summers Shreyas Shinde
ArcGIS Enterprise: What s New Philip Heede Shannon Kalisky Melanie Summers Shreyas Shinde ArcGIS Enterprise is the new name for ArcGIS for Server ArcGIS Enterprise Software Components ArcGIS Server Portal
More informationGEOGRAPHY 350/550 Final Exam Fall 2005 NAME:
1) A GIS data model using an array of cells to store spatial data is termed: a) Topology b) Vector c) Object d) Raster 2) Metadata a) Usually includes map projection, scale, data types and origin, resolution
More informationThe Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands.
GIS LAB 7 The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands. This lab will ask you to work with the Spatial Analyst extension.
More informationWeb Visualization of Geo-Spatial Data using SVG and VRML/X3D
Web Visualization of Geo-Spatial Data using SVG and VRML/X3D Jianghui Ying Falls Church, VA 22043, USA jying@vt.edu Denis Gračanin Blacksburg, VA 24061, USA gracanin@vt.edu Chang-Tien Lu Falls Church,
More informationSPATIAL INFORMATION GRID AND ITS APPLICATION IN GEOLOGICAL SURVEY
SPATIAL INFORMATION GRID AND ITS APPLICATION IN GEOLOGICAL SURVEY K. T. He a, b, Y. Tang a, W. X. Yu a a School of Electronic Science and Engineering, National University of Defense Technology, Changsha,
More informationMassHunter Software Overview
MassHunter Software Overview 1 Qualitative Analysis Workflows Workflows in Qualitative Analysis allow the user to only see and work with the areas and dialog boxes they need for their specific tasks A
More informationACD/Labs Software Impurity Resolution Management. Presented by Peter Russell
ACD/Labs Software Impurity Resolution Management Presented by Peter Russell Impurity Resolution Process Chemists Method Development Specialists Toxicology Groups Stability Groups Analytical Chemists 7/8/2016
More informationReaxys The Highlights
Reaxys The Highlights What is Reaxys? A brand new workflow solution for research chemists and scientists from related disciplines An extensive repository of reaction and substance property data A resource
More informationKalexsyn Overview Kalexsyn, Inc Campus Drive Kalamazoo, MI Phone: (269) Fax: (269)
Kalexsyn verview 2017 Kalexsyn, Inc. 4502 Campus Drive Kalamazoo, MI 49008 Phone: (269) 488-8488 Fax: (269) 488-8490 info@kalexsyn.com What we are Experienced medicinal chemists Conception and synthesis
More informationCityGML XFM Application Template Documentation. Bentley Map V8i (SELECTseries 2)
CityGML XFM Application Template Documentation Bentley Map V8i (SELECTseries 2) Table of Contents Introduction to CityGML 1 CityGML XFM Application Template 2 Requirements 2 Finding Documentation 2 To
More informationCSD. Unlock value from crystal structure information in the CSD
CSD CSD-System Unlock value from crystal structure information in the CSD The Cambridge Structural Database (CSD) is the world s most comprehensive and up-todate knowledge base of crystal structure data,
More informationAgilent MassHunter Profinder: Solving the Challenge of Isotopologue Extraction for Qualitative Flux Analysis
Agilent MassHunter Profinder: Solving the Challenge of Isotopologue Extraction for Qualitative Flux Analysis Technical Overview Introduction Metabolomics studies measure the relative abundance of metabolites
More informationFrom BASIS DD to Barista Application in Five Easy Steps
Y The steps are: From BASIS DD to Barista Application in Five Easy Steps By Jim Douglas our current BASIS Data Dictionary is perfect raw material for your first Barista-brewed application. Barista facilitates
More informationAccurate 3D-Modeling of User Inputted Molecules Using a Nelder Mead Algorithm Computer Systems Lab,
Accurate 3D-Modeling of User Inputted Molecules Using a Nelder Mead Algorithm Computer Systems Lab, 2007-2008 Ben Parr May 12, 2008 Abstract In order to better understand chemistry, chemists create 3 dimensional
More informationGeodatabase Essentials Part One - Intro to the Geodatabase. Jonathan Murphy Colin Zwicker
Geodatabase Essentials Part One - Intro to the Geodatabase Jonathan Murphy Colin Zwicker Session Path The Geodatabase - What is it? - Why use it? - What types are there? Inside the Geodatabase Advanced
More informationCS425: Algorithms for Web Scale Data
CS425: Algorithms for Web Scale Data Most of the slides are from the Mining of Massive Datasets book. These slides have been modified for CS425. The original slides can be accessed at: www.mmds.org Challenges
More informationWhat are the Spatial Data Standards?
What is SDSFIE? 1992 Army, Navy, Air Force and Marine Corps established the Tri- Service CADD/GIS Technology Center at the US Army Engineer Waterways Experiment Station in Vicksburg, Miss. 1999 name was
More informationIntegrated Electricity Demand and Price Forecasting
Integrated Electricity Demand and Price Forecasting Create and Evaluate Forecasting Models The many interrelated factors which influence demand for electricity cannot be directly modeled by closed-form
More informationGeodatabase An Introduction
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Geodatabase An Introduction David Crawford and Jonathan Murphy Session Path The Geodatabase What is it?
More informationAnalytical data, the web, and standards for unified laboratory informatics databases
Analytical data, the web, and standards for unified laboratory informatics databases Presented By Patrick D. Wheeler & Graham A. McGibbon ACS San Diego 16 March, 2016 Sources Process, Analyze Interfaces,
More informationA Reconfigurable Quantum Computer
A Reconfigurable Quantum Computer David Moehring CEO, IonQ, Inc. College Park, MD Quantum Computing for Business 4-6 December 2017, Mountain View, CA IonQ Highlights Full Stack Quantum Computing Company
More informationGIS CONCEPTS ARCGIS METHODS AND. 2 nd Edition, July David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University
GIS CONCEPTS AND ARCGIS METHODS 2 nd Edition, July 2005 David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University Copyright Copyright 2005 by David M. Theobald. All rights
More informationArboretum Explorer: Using GIS to map the Arnold Arboretum
Arboretum Explorer: Using GIS to map the Arnold Arboretum Donna Tremonte, Arnold Arboretum of Harvard University 2015 Esri User Conference (UC), July 22, 2015 http://arboretum.harvard.edu/explorer Mission
More informationOutline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System
Outline I Data Preparation Introduction to SpaceStat and ESTDA II Introduction to ESTDA and SpaceStat III Introduction to time-dynamic regression ESTDA ESTDA & SpaceStat Learning Objectives Activities
More informationIntroduction to ArcGIS Server - Creating and Using GIS Services. Mark Ho Instructor Washington, DC
Introduction to ArcGIS Server - Creating and Using GIS Services Mark Ho Instructor Washington, DC Technical Workshop Road Map Product overview Building server applications GIS services Developer Help resources
More informationIt s about time... The only timeline tool you ll ever need!
It s about time... The only timeline tool you ll ever need! Introduction about me Jon Tomczak Senior Consultant Crypsis Game Dev turned Forensicator Past: Started TZWorks in 2006 Consultant at Mandiant
More informationGIS FOR PLANNING. Course Overview. Schedule. Instructor. Prerequisites. Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158
GIS FOR PLANNING Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158 Schedule Class/Lab - SARUP 158 Thursdays 5:30pm - 8:10pm Office Hours - By Appointment Project Ideas - Week 4 Final - 5/10/2018 Instructor
More informationBentley Map Advancing GIS for the World s Infrastructure
Bentley Map Advancing GIS for the World s Infrastructure Presentation Overview Why would you need Bentley Map? What is Bentley Map? Where is Bentley Map Used? Why would you need Bentley Map? Because your
More informationGridded Traffic Density Estimates for Southern
Gridded Traffic Density Estimates for Southern California, 1995-2014 User Notes authored by Beau, 11/28/2017 METADATA: Each raster file contains Traffic Density data for one year (1995, 2000, 2005, 2010,
More informationPython Raster Analysis. Kevin M. Johnston Nawajish Noman
Python Raster Analysis Kevin M. Johnston Nawajish Noman Outline Managing rasters and performing analysis with Map Algebra How to access the analysis capability - Demonstration Complex expressions and optimization
More informationText Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University
Text Mining Dr. Yanjun Li Associate Professor Department of Computer and Information Sciences Fordham University Outline Introduction: Data Mining Part One: Text Mining Part Two: Preprocessing Text Data
More informationStatistics Toolbox 6. Apply statistical algorithms and probability models
Statistics Toolbox 6 Apply statistical algorithms and probability models Statistics Toolbox provides engineers, scientists, researchers, financial analysts, and statisticians with a comprehensive set of
More informationCreation of an Internet Based Indiana Water Quality Atlas (IWQA)
Department of Environmental Management Creation of an Internet Based Water Quality Atlas (IWQA) May 4, 2005 IUPUI 1200 Waterway Blvd., Suite 100 polis, 46202-5140 Water Quality Atlas John Buechler, Neil
More informationWorking with the Geodatabase
Working with the Geodatabase Agenda What is the geodatabase? Benefits of the geodatabase Inside the geodatabase Geodatabase rules Demos Additional resources and training Q & A The Geodatabase is the foundation
More informationGIS-based Smart Campus System using 3D Modeling
GIS-based Smart Campus System using 3D Modeling Smita Sengupta GISE Advance Research Lab. IIT Bombay, Powai Mumbai 400 076, India smitas@cse.iitb.ac.in Concept of Smart Campus System Overview of IITB Campus
More informationA Study to Measure the Chemical Characteristics of Particle Emissions from Biomass Burning Stoves
A Study to Measure the Chemical Characteristics of Particle Emissions from Biomass Burning Stoves End of Summer Workshop, 2006 Dabrina Dutcher School of Public Health and Department of Mechanical Engineering
More informationTutorial 2: Analysis of DIA data in Skyline
Tutorial 2: Analysis of DIA data in Skyline In this tutorial we will learn how to use Skyline to perform targeted post-acquisition analysis for peptide and inferred protein detection and quantitation using
More informationGENERALIZATION IN THE NEW GENERATION OF GIS. Dan Lee ESRI, Inc. 380 New York Street Redlands, CA USA Fax:
GENERALIZATION IN THE NEW GENERATION OF GIS Dan Lee ESRI, Inc. 380 New York Street Redlands, CA 92373 USA dlee@esri.com Fax: 909-793-5953 Abstract In the research and development of automated map generalization,
More informationTime Series Analysis with SAR & Optical Satellite Data
Time Series Analysis with SAR & Optical Satellite Data Thomas Bahr ESRI European User Conference Thursday October 2015 harris.com Motivation Changes in land surface characteristics mirror a multitude of
More informationFrom BASIS DD to Barista Application in Five Easy Steps
Y The steps are: From BASIS DD to Barista Application in Five Easy Steps By Jim Douglas our current BASIS Data Dictionary is perfect raw material for your first Barista-brewed application. Barista facilitates
More informationSkyline Small Molecule Targets
Skyline Small Molecule Targets The Skyline Targeted Proteomics Environment provides informative visual displays of the raw mass spectrometer data you import into your Skyline documents. Originally developed
More informationReaxys Medicinal Chemistry Fact Sheet
R&D SOLUTIONS FOR PHARMA & LIFE SCIENCES Reaxys Medicinal Chemistry Fact Sheet Essential data for lead identification and optimization Reaxys Medicinal Chemistry empowers early discovery in drug development
More informationBuilding Inflation Tables and CER Libraries
Building Inflation Tables and CER Libraries January 2007 Presented by James K. Johnson Tecolote Research, Inc. Copyright Tecolote Research, Inc. September 2006 Abstract Building Inflation Tables and CER
More informationGridded Ambient Air Pollutant Concentrations for Southern California, User Notes authored by Beau MacDonald, 11/28/2017
Gridded Ambient Air Pollutant Concentrations for Southern California, 1995-2014 User Notes authored by Beau, 11/28/2017 METADATA: Each raster file contains data for one pollutant (NO2, O3, PM2.5, and PM10)
More informationIntegrated Cheminformatics to Guide Drug Discovery
Integrated Cheminformatics to Guide Drug Discovery Matthew Segall, Ed Champness, Peter Hunt, Tamsin Mansley CINF Drug Discovery Cheminformatics Approaches August 23 rd 2017 Optibrium, StarDrop, Auto-Modeller,
More informationOECD QSAR Toolbox v.4.1. Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals
OECD QSAR Toolbox v.4.1 Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals Background Outlook Objectives The exercise Workflow 2 Background This is a
More information1. Prepare the MALDI sample plate by spotting an angiotensin standard and the test sample(s).
Analysis of a Peptide Sequence from a Proteolytic Digest by MALDI-TOF Post-Source Decay (PSD) and Collision-Induced Dissociation (CID) Standard Operating Procedure Purpose: The following procedure may
More informationOECD QSAR Toolbox v.4.1. Step-by-step example for building QSAR model
OECD QSAR Toolbox v.4.1 Step-by-step example for building QSAR model Background Objectives The exercise Workflow of the exercise Outlook 2 Background This is a step-by-step presentation designed to take
More informationAre You Maximizing The Value Of All Your Data?
Are You Maximizing The Value Of All Your Data? Using The SAS Bridge for ESRI With ArcGIS Business Analyst In A Retail Market Analysis SAS and ESRI: Bringing GIS Mapping and SAS Data Together Presented
More informationIn Silico Investigation of Off-Target Effects
PHARMA & LIFE SCIENCES WHITEPAPER In Silico Investigation of Off-Target Effects STREAMLINING IN SILICO PROFILING In silico techniques require exhaustive data and sophisticated, well-structured informatics
More informationGeodatabase Best Practices. Dave Crawford Erik Hoel
Geodatabase Best Practices Dave Crawford Erik Hoel Geodatabase best practices - outline Geodatabase creation Data ownership Data model Data configuration Geodatabase behaviors Data integrity and validation
More informationFree and Open Source Software for Cadastre and Land Registration : A Hidden Treasure? Gertrude Pieper Espada. Overview
Free and Open Source Software for Cadastre and Land Registration : A Hidden Treasure? Gertrude Pieper Espada Overview FLOSS concepts Digital Land Administration systems FLOSS Database alternatives FLOSS
More informationLab Activity: The Central Limit Theorem
Lab Activity: The Central Limit Theorem In this lab activity, you will explore the properties of the Central Limit Theorem. Student Learning Outcomes By the end of this chapter, you should be able to do
More informationIntroduction to Machine Learning. PCA and Spectral Clustering. Introduction to Machine Learning, Slides: Eran Halperin
1 Introduction to Machine Learning PCA and Spectral Clustering Introduction to Machine Learning, 2013-14 Slides: Eran Halperin Singular Value Decomposition (SVD) The singular value decomposition (SVD)
More informationLecture 2. Introduction to ESRI s ArcGIS Desktop and ArcMap
Lecture 2 Introduction to ESRI s ArcGIS Desktop and ArcMap Outline ESRI What is ArcGIS? ArcGIS Desktop ArcMap Overview Views Layers Attribute Tables Help! Scale Tips and Tricks ESRI Environmental Systems
More informationIntroduction to Google Mapping Tools
Introduction to Google Mapping Tools Google s Mapping Tools Explore geographic data. Organize your own geographic data. Visualize complex data. Share your data with the world. Tell your story and educate
More informationGIS at UCAR. The evolution of NCAR s GIS Initiative. Olga Wilhelmi ESIG-NCAR Unidata Workshop 24 June, 2003
GIS at UCAR The evolution of NCAR s GIS Initiative Olga Wilhelmi ESIG-NCAR Unidata Workshop 24 June, 2003 Why GIS? z z z z More questions about various climatological, meteorological, hydrological and
More informationApplication Note 12: Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova
Application Note : Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova Paul Bowyer, Magritek, Inc. and Mark Dixon, Mestrelab Sample screening to verify the identity or integrity
More informationRick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30
NED Mission: Provide a comprehensive, reliable and easy-to-use synthesis of multi-wavelength data from NASA missions, published catalogs, and the refereed literature, to enhance and enable astrophysical
More informationImpression Store: Compressive Sensing-based Storage for. Big Data Analytics
Impression Store: Compressive Sensing-based Storage for Big Data Analytics Jiaxing Zhang, Ying Yan, Liang Jeff Chen, Minjie Wang, Thomas Moscibroda & Zheng Zhang Microsoft Research The Curse of O(N) in
More informationFramework for on an open 3D urban analysis platform based on OGC Web Services
Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Framework for on an open 3D urban analysis platform based on OGC Web Services Marc-O. Löwner & Thomas Adolphi (née Becker) Technische
More informationEEOS 381 -Spatial Databases and GIS Applications
EEOS 381 -Spatial Databases and GIS Applications Lecture 5 Geodatabases What is a Geodatabase? Geographic Database ESRI-coined term A standard RDBMS that stores and manages geographic data A modern object-relational
More informationD2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS
D2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS EDWARD GAUSE, GISP DIRECTOR OF INFORMATION SERVICES (ENGINEERING APPS) HTC (HORRY TELEPHONE COOP.) EDWARD GAUSE, GISP DIRECTOR OF INFORMATION
More informationA Model of GIS Interoperability Based on JavaRMI
A Model of GIS Interoperability Based on Java Gao Gang-yi 1 Chen Hai-bo 2 1 Zhejiang University of Finance & Economics, Hangzhou 310018, China 2 College of Computer Science and Technology, Zhejiang UniversityHangzhou
More informationGeodatabase Management Pathway
Geodatabase Management Pathway Table of Contents ArcGIS Desktop II: Tools and Functionality 3 ArcGIS Desktop III: GIS Workflows and Analysis 6 Building Geodatabases 8 Data Management in the Multiuser Geodatabase
More informationThese modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS.
Online GIS Training and training modules covered are: 1. ArcGIS, Analysis, Fundamentals and Implementation 2. ArcGIS Web Data Sharing 3. ArcGIS for Desktop 4. ArcGIS for Server These modules are covered
More informationDetecting the Unexpected
Detecting the Unexpected Discovery in the Era of Astronomically Big Data Insights from Space Telescope Science Institute s first Big Data conference Josh Peek, Chair Sarah Kendrew, C0-Chair SOC: Erik Tollerud,
More informationTutorial 8 Raster Data Analysis
Objectives Tutorial 8 Raster Data Analysis This tutorial is designed to introduce you to a basic set of raster-based analyses including: 1. Displaying Digital Elevation Model (DEM) 2. Slope calculations
More informationDEVELOPMENT OF GPS PHOTOS DATABASE FOR LAND USE AND LAND COVER APPLICATIONS
DEVELOPMENT OF GPS PHOTOS DATABASE FOR LAND USE AND LAND COVER APPLICATIONS An Ngoc VAN and Wataru TAKEUCHI Institute of Industrial Science University of Tokyo 4-6-1 Komaba, Meguro-ku, Tokyo 153-8505 Japan
More informationCSE 241 Class 1. Jeremy Buhler. August 24,
CSE 41 Class 1 Jeremy Buhler August 4, 015 Before class, write URL on board: http://classes.engineering.wustl.edu/cse41/. Also: Jeremy Buhler, Office: Jolley 506, 314-935-6180 1 Welcome and Introduction
More informationHYPERGRAPH BASED SEMI-SUPERVISED LEARNING ALGORITHMS APPLIED TO SPEECH RECOGNITION PROBLEM: A NOVEL APPROACH
HYPERGRAPH BASED SEMI-SUPERVISED LEARNING ALGORITHMS APPLIED TO SPEECH RECOGNITION PROBLEM: A NOVEL APPROACH Hoang Trang 1, Tran Hoang Loc 1 1 Ho Chi Minh City University of Technology-VNU HCM, Ho Chi
More informationPatent Searching using Bayesian Statistics
Patent Searching using Bayesian Statistics Willem van Hoorn, Exscientia Ltd Biovia European Forum, London, June 2017 Contents Who are we? Searching molecules in patents What can Pipeline Pilot do for you?
More informationGIS CONCEPTS ARCGIS METHODS AND. 3 rd Edition, July David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University
GIS CONCEPTS AND ARCGIS METHODS 3 rd Edition, July 2007 David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University Copyright Copyright 2007 by David M. Theobald. All rights
More informationCS 350 A Computing Perspective on GIS
CS 350 A Computing Perspective on GIS What is GIS? Definitions A powerful set of tools for collecting, storing, retrieving at will, transforming and displaying spatial data from the real world (Burrough,
More informationMarvin 5.4 A new generation of structure indexing at Elsevier. Dr. Michael Maier, Dr. Heike Nau, Elsevier
Marvin 5.4 A new generation of structure indexing at Elsevier Dr. Michael Maier, Dr. Heike Nau, Elsevier Agenda Elsevier: Reaxys database Compound classes Structure requirements Marvin 5.4 Decision process
More informationBentley Map V8i (SELECTseries 3)
Bentley Map V8i (SELECTseries 3) A quick overview Why Bentley Map Viewing and editing of geospatial data from file based GIS formats, spatial databases and raster Assembling geospatial/non-geospatial data
More informationTraffic accidents and the road network in SAS/GIS
Traffic accidents and the road network in SAS/GIS Frank Poppe SWOV Institute for Road Safety Research, the Netherlands Introduction The first figure shows a screen snapshot of SAS/GIS with part of the
More informationan accessible interface to marine environmental data Russell Moffitt
an accessible interface to marine environmental data Russell Moffitt The Atlas Project GOAL: To provide a single point of access to oceanographic and environmental data for use by marine resource researchers,
More informationGIS Functions and Integration. Tyler Pauley Associate Consultant
GIS Functions and Integration Tyler Pauley Associate Consultant Contents GIS in AgileAssets products Displaying data within AMS Symbolizing the map display Display on Bing Maps Demo- Displaying a map in
More informationClassification Semi-supervised learning based on network. Speakers: Hanwen Wang, Xinxin Huang, and Zeyu Li CS Winter
Classification Semi-supervised learning based on network Speakers: Hanwen Wang, Xinxin Huang, and Zeyu Li CS 249-2 2017 Winter Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions Xiaojin
More informationCityBES: A Data and Computing Platform for City Buildings
CityBES: A Data and Computing Platform for City Buildings Yixing Chen, Tianzhen Hong, Mary Ann Piette, Xuan Luo Building Technology and Urban Systems Division Urban Infrastructures NSF Workshop New York
More informationFog Monitor 100 (FM 100) Extinction Module. Operator Manual
Particle Analysis and Display System (PADS): Fog Monitor 100 (FM 100) Extinction Module Operator Manual DOC-0217 Rev A-1 PADS 2.7.3, FM 100 Extinction Module 2.7.0 5710 Flatiron Parkway, Unit B Boulder,
More informationADDRESSING A HOW TO LOOK AT GIS ADDRESSING 9/13/2017
ADDRESSING A Look at Creating & Updating Point Files A HOW TO LOOK AT GIS ADDRESSING Creating points using LAT/LONG fields from WINGAP Creating addressing location (GPS/Latitude & Longitude) points using
More informationA Modular NMF Matching Algorithm for Radiation Spectra
A Modular NMF Matching Algorithm for Radiation Spectra Melissa L. Koudelka Sensor Exploitation Applications Sandia National Laboratories mlkoude@sandia.gov Daniel J. Dorsey Systems Technologies Sandia
More informationPIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE
PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE 1 SERVING THE LIFE SCIENCES SPACE ADDRESSING KEY CHALLENGES ACROSS THE R&D VALUE CHAIN Characterize targets & analyze disease
More informationThe Case for Use Cases
The Case for Use Cases The integration of internal and external chemical information is a vital and complex activity for the pharmaceutical industry. David Walsh, Grail Entropix Ltd Costs of Integrating
More information