Michael L. Norman Chief Scientific Officer, SDSC Distinguished Professor of Physics, UCSD

Similar documents
Data Intensive Computing meets High Performance Computing

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

Baryon Acoustic Oscillations in the Lyman Alpha Forest

Virtual Observatory: Observational and Theoretical data

ArcGIS Enterprise: What s New. Philip Heede Shannon Kalisky Melanie Summers Sam Williamson

Enabling ENVI. ArcGIS for Server

Analytical data, the web, and standards for unified laboratory informatics databases

Data Management Plan Extended Baryon Oscillation Spectroscopic Survey

Open-Source Astrophysics with the Enzo Community Code. Brian W. O Shea Michigan State University

ArcGIS Enterprise: What s New. Philip Heede Shannon Kalisky Melanie Summers Shreyas Shinde

Harvard Center for Geographic Analysis Geospatial on the MOC

Geog 469 GIS Workshop. Managing Enterprise GIS Geodatabases

The File Geodatabase API. Craig Gillgrass Lance Shipman

The iplant Collaborative Semantic Web Platform

The Current Status of EarthCube with an EarthScope Perspective. Tim Ahern IRIS Director of Data Services

Scientific Data Flood. Large Science Project. Pipeline

Cosmology with Wide Field Astronomy

Spatial Data Infrastructure Concepts and Components. Douglas Nebert U.S. Federal Geographic Data Committee Secretariat

North Atlantic Coast Comprehensive Study Storm Simulation and Statistical Analysis Part II Production System

ESRI Survey Summit August Clint Brown Director of ESRI Software Products

Integrating Globus into a Science Gateway for Cryo-EM

Inflation - a solution to the uniformity problem. include more dark matter. count up all mass in galaxies include massive dark galaxy halos

SPATIAL INFORMATION GRID AND ITS APPLICATION IN GEOLOGICAL SURVEY

Advanced Topics on Astrophysics: Lectures on dark matter

Astronomy of the Next Decade: From Photons to Petabytes. R. Chris Smith AURA Observatory in Chile CTIO/Gemini/SOAR/LSST

CLICK HERE TO KNOW MORE

Administering your Enterprise Geodatabase using Python. Jill Penney

FIRE DEPARMENT SANTA CLARA COUNTY

Cosmological N-Body Simulations and Galaxy Surveys

Introduction to Portal for ArcGIS

CHAPTER 22 GEOGRAPHIC INFORMATION SYSTEMS

FYST17 Lecture 13 The cosmic connection. Thanks to R. Durrer, L. Covi, S. Sakar

2 The Sloan Digital Sky Survey (2000-) is a terapixel visualization of the Universe.

(Slides for Tue start here.)

Real Astronomy from Virtual Observatories

Database and Knowledge Base developments at IAEA A+M Unit

Current Status of Chinese Virtual Observatory

Outline Oxana Smirnova, Particle Physics 2

The Millennium Simulation: cosmic evolution in a supercomputer. Simon White Max Planck Institute for Astrophysics

The Next Generation GIS/LIS A Surveys Information System Integrated within a GIS

NOKIS - Information Infrastructure for the North and Baltic Sea

Welcome back to PHY 3305

You are Building Your Organization s Geographic Knowledge

ArcGIS Platform For NSOs

Cosmology with the Sloan Digital Sky Survey Supernova Search. David Cinabro

Cosmology with the Sloan Digital Sky Survey Supernova Search. David Cinabro

THIS IS NOT A PRESENTATION

Introduction to the Sloan Survey

Big Bang, Big Iron: CMB Data Analysis at the Petascale and Beyond

The SDSS-III Baryon Acoustic Oscillation Survey (BOSS)

Introduction to Portal for ArcGIS. Hao LEE November 12, 2015

Table of Contents and Executive Summary Final Report, ReSTAR Committee Renewing Small Telescopes for Astronomical Research (ReSTAR)

Challenges of Predictive 3D Astrophysical Simulations of Accreting Systems. John F. Hawley Department of Astronomy, University of Virginia

Yuji Shirasaki. National Astronomical Observatory of Japan. 2010/10/13 Shanghai, China

BARYON ACOUSTIC OSCILLATIONS. Cosmological Parameters and You

Overview of Geospatial Open Source Software which is Robust, Feature Rich and Standards Compliant

These modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS.

Astro 32 - Galactic and Extragalactic Astrophysics/Spring 2016

ESASky, ESA s new open-science portal for ESA space astronomy missions

Open Data meets Big Data

Chapter 26: Cosmology

Astronomy 162, Week 10 Cosmology Patrick S. Osmer Spring, 2006

GIS ADMINISTRATOR / WEB DEVELOPER EVANSVILLE-VANDERBURGH COUNTY AREA PLAN COMMISSION

Dark Energy and Massive Neutrino Universe Covariances

Spatial data interoperability and INSPIRE compliance the platform approach BAGIS

Francisco Prada Campus de Excelencia Internacional UAM+CSIC Instituto de Física Teórica (UAM/CSIC) Instituto de Astrofísica de Andalucía (CSIC)

Topics and questions for astro presentations

Semantic Evolution of Geospatial Web Services: Use Cases and Experiments in the Geospatial Semantic Web

The Growth of Structure Read [CO 30.2] The Simplest Picture of Galaxy Formation and Why It Fails (chapter title from Longair, Galaxy Formation )

NASA/IPAC EXTRAGALACTIC DATABASE

Web-enabled GIS Services

Using OGC standards to improve the common

Baryon Acoustic Oscillations Part I

Dark Energy and the Accelerating Universe

Photometric Redshifts with DAME

The Tools of Cosmology. Andrew Zentner The University of Pittsburgh

Modern Image Processing Techniques in Astronomical Sky Surveys

Joint MISTRAL/CESI lunch workshop 15 th November 2017

Galaxy Clusters. Giants of the Universe. Maciej Soltynski. ASSA Symposium Virgo cluster around M86

ArcGIS is Advancing. Both Contributing and Integrating many new Innovations. IoT. Smart Mapping. Smart Devices Advanced Analytics

Formation of the Universe. What evidence supports current scientific theory?

Innovation. The Push and Pull at ESRI. September Kevin Daugherty Cadastral/Land Records Industry Solutions Manager

SDSS-IV and eboss Science. Hyunmi Song (KIAS)

Journal of e-media Studies Volume 3, Issue 1, 2013 Dartmouth College

The Early Universe: A Journey into the Past

The Early Universe: A Journey into the Past

BAO & RSD. Nikhil Padmanabhan Essential Cosmology for the Next Generation VII December 2017

AstroPortal: A Science Gateway for Large-scale Astronomy Data Analysis

A5682: Introduction to Cosmology Course Notes. 11. CMB Anisotropy

BSEE REQUIREMENTS

Geo-enabling a Transactional Real Estate Management System A case study from the Minnesota Dept. of Transportation

ArcGIS for Local Government

Extending a Plotting Application and Finding Hardware Injections for the LIGO Open Science Center

Military Geographic Institute

Spatial Data Management of Bio Regional Assessments Phase 1 for Coal Seam Gas Challenges and Opportunities

Demystifying ArcGIS Online. Karen Lizcano Esri

Planetary Science Big Data

What Would John Snow Do (Today)? Part 1

Observing the BitTorrent Universe Through Telescopes

One Optimized I/O Configuration per HPC Application

Transcription:

Michael L. Norman Chief Scientific Officer, SDSC Distinguished Professor of Physics, UCSD

Parsing the Title OptiPortals Scalable display walls based on commodity LCD panels Developed by EVL and CalIT2 Petascale Simulations Numerical simulations carried out on petaflop/s supercomputers Remote resources End Stations DOE-speak for where the science measurements get done e.g., detector at a particle accelerator

Project in a Nutshell.. Visualizations done here Science done here (end station) Simulations done here

Talk Outline State of cosmological research (motivation) Laboratory for Computational Astrophysics (who we are) Petascale cosmological simulations (the agony and the ecstasy) Building a computational end station (design, issues, progress)

Evolution of the Universe

A Bounty of Precision Cosmological Data Cosmic Microwave Background Anisotropies Initial Density Fluctuations Expansion of the Universe Galaxy Large Scale Structure

What the Universe is Made Of The part of the Universe we can see Causes matter to clump and form galaxies Causes expansion rate to accelerate

Cosmology s Big Mysteries What is the dark matter? Some undetected elementary particle? What is the dark energy? Why does it exist? Does it change with time, or stay constant? How do galaxies form? What are the first galaxies like? How do they evolve into spirals and ellipticals?

http://lca.ucsd.edu What we do Develop and distribute simulation software and tools for computational astrophysics and cosmology for HPC platforms ZEUS (astrophysical MHD and RHD) ENZO (cosmology) Apply our codes to problems requiring extreme levels of computing power (and data storage) Moving into building theory data digital collections for others to use

The Fate of the Universe Depends on DARK ENERGY How to measure expansion accurately?

Standard candles at various distances

Standard rulers to measure dark energy

Baryon Acoustic Oscillations (BAO) in the Cosmic Microwave Background

Standard rulers and cosmic structure BAO standard yardstick

The Cosmology Simulation We Need To Do: 4096 3 cells/particles in 600 Mpc Box Large volume to sample BOA wavelength High resolution to resolve cosmic structure 2048 3 simulation of cosmic structure ( 1/20 th volume shown) 2048 core Kraken Cray XT4 Physics: 6-species gas dynamics, N-body dark matter, self-gravity, non-eq. ionization

The Agony and the Ecstasy of Petascale Simulations Ecstasy Remarkable simulations are now possible 4096 3 simulations feasible today Agony 50,000 cores 200 TB of data generated intermittently over many months of execution time How to monitor, harvest and store? Complex data analysis task best done on other, geographically-distributed resources How to look at this much data? How to publish the data?

Lifecycle in Words Simulations are run on the largest of the NSF resources (e.g., ORNL Kraken) Raw output is reduced on various TeraGrid systems (e.g., TACC Ranger) Large-scale visualization is done on specialized machines (e.g., ANL Eureka) Reduced data and renderings are sent to the Triton Resource at UCSD for display and analysis on our OptIPortal At every step, metadata is added to a central database (SimDB, Simulation Database) The end result is a queryable repository of publicly available data, tied to publications

That s Point B We re still at Point A Here s what we need to get there: Internal processes for routinely adding metadata as data is produced Robust networks between large resources Analysis tools with performance comparable to the simulation code Storage for persistent data collections Standardized interfaces to simulation data and metadata

OptIPortal @ LCA/UCSD 80 megapixels, 65,000 2 image

UCSD 10G Research Network SDSC 10G Distribution switch fabric SDSC OptIPortal Schematic UCSD RN Switch Cisco SDSC/UCSD Sdnap SDSC TG Router Juniper UCSD/SDSC joint network operations NSF TeraGrid 10 Gb/s optical 10 Gb/s copper 10G optical switch Cisco File server OP Head node 10G copper switch HP OP server nodes 20 x 4 MegaPixel LCD panels Norman lab @ SDSC

A collection of resources for data-intensive HPC applications

Publishing Simulation Data: Goals Provide mechanisms for discovering and accessing simulation data Define standards for publishing simulation data and metadata to promote interoperability

Building Blocks (0/3) IVOA Theory Interest Group (TIG) The IVOA defines standards for publishing astronomical data online Standards describe web service interfaces and metadata formats TIG is focusing on 3 coupled standards for simulations: Simulation Data Model Simulation Database Simulation Data Access Protocol

Building Blocks (1/3) Simulation Data Model (SimDM) Set of related classes defining simulation metadata Key terms: Protocols (simulation and analysis codes) Experiments (simulations and post-processing) Snapshots (results, datasets) Rich--includes parameters, purpose and characterization of simulations Development led by Gerard Lemson (MPE)

Building Blocks (2/3) Simulation Database (SimDB) Relational database schema derived from the Simulation Data Model Intended to be common way for publishing simulation metadata Service interface defined by upcoming IVOA Table Access Protocol (TAP) Also led by Gerard Lemson (MPE)

Building Blocks (3/3) Simulation Data Access Protocol (SimDAP) Service interface for accessing actual simulation data (e.g., downloads) Services may also provide subsets of raw data (particle cutouts, extracted fields) Results described using SimDM + metadata for files and contents Development led by Rick Wagner (SDSC) and Claudio Gheller (CINECA)

Implementation - SimCat SimCat: Simulation Catalog Web service extending the SimDM RESTful API for managing the catalog Foundation for building SimDB and SimDAP services Uses Django (Python web framework) http://code.google.com/p/simcat

Status SimCat: Development service running Building catalog of existing simulations Needs human-friendly interface SimDAP service interface in development SimDB service interface when TAP standard closer to completion IVOA Standards SimDAP Note due in March 2009 SimDAP Working Draft out by next InterOp (May 2009)

http://volute.googlecode.com/svn/trunk/projects/theory/snap/simdap.ht ml

Next Steps Evaluate workflow software for automating forwarding of petascale data files over the network (from ORNL to ANL to UCSD) Evaluate workflow software for automating forwarding of file metadata to SimCAT Explore use of switched fiber optic circuits on Internet2/NLR for aiding data transfers Get analysis software running on Triton Resource Push the 4096 3 run off the cliff

Collaboration Opportunities Workflow automation Data modeling/publishing Scientific visualization Parallel analysis algorithms Data mining If you are interested, could use MURPA to engage. email me at mlnorman@ucsd.edu