Scientific Data Flood. Large Science Project. Pipeline

Similar documents
Astroinformatics: massive data research in Astronomy Kirk Borne Dept of Computational & Data Sciences George Mason University

Surprise Detection in Science Data Streams Kirk Borne Dept of Computational & Data Sciences George Mason University

Astronomy of the Next Decade: From Photons to Petabytes. R. Chris Smith AURA Observatory in Chile CTIO/Gemini/SOAR/LSST

Surprise Detection in Multivariate Astronomical Data Kirk Borne George Mason University

The Large Synoptic Survey Telescope

Real Astronomy from Virtual Observatories

Large Synoptic Survey Telescope

An end-to-end simulation framework for the Large Synoptic Survey Telescope Andrew Connolly University of Washington

LSST. Pierre Antilogus LPNHE-IN2P3, Paris. ESO in the 2020s January 19-22, LSST ESO in the 2020 s 1

Transient Alerts in LSST. 1 Introduction. 2 LSST Transient Science. Jeffrey Kantor Large Synoptic Survey Telescope

Target and Observation Managers. TOMs

Dark Energy Research at SLAC

Time Domain Astronomy in the 2020s:

LSST, Euclid, and WFIRST

Astronomical Notes. Astronomische Nachrichten. A machine learning classification broker for the LSST transient database

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

LSST Science. Željko Ivezić, LSST Project Scientist University of Washington

Cosmology with Wide Field Astronomy

Machine Learning Applications in Astronomy

Examining Dark Energy With Dark Matter Lenses: The Large Synoptic Survey Telescope. Andrew R. Zentner University of Pittsburgh

13 June Board on Research Data & Information (BRDI)

Astroinformatics in the data-driven Astronomy

From DES to LSST. Transient Processing Goes from Hours to Seconds. Eric Morganson, NCSA LSST Time Domain Meeting Tucson, AZ May 22, 2017

Dark Energy. Cluster counts, weak lensing & Supernovae Ia all in one survey. Survey (DES)

Francisco Prada Campus de Excelencia Internacional UAM+CSIC Instituto de Física Teórica (UAM/CSIC) Instituto de Astrofísica de Andalucía (CSIC)

Heidi B. Hammel. AURA Executive Vice President. Presented to the NRC OIR System Committee 13 October 2014

JINA Observations, Now and in the Near Future

What shall we learn about the Milky Way using Gaia and LSST?

Real-time Variability Studies with the NOAO Mosaic Imagers

Exascale IT Requirements in Astronomy Joel Primack, University of California, Santa Cruz

A Vision for the US Virtual Astronomical Observatory. October D. De Young (NOAO), VAO Project Scientist

Crurent and Future Massive Redshift Surveys

LARGE SYNOPTIC SURVEY TELESCOPE AZ AAPT March 2011

Welcome to AURA Observatory in Chile

An Introduction to the Dark Energy Survey

The Dark Energy Survey Public Data Release 1

Énergie noire Formation des structures. N. Regnault C. Yèche

New Astronomy With a Virtual Observatory

Table of Contents and Executive Summary Final Report, ReSTAR Committee Renewing Small Telescopes for Astronomical Research (ReSTAR)

The New LSST Informatics and Statistical Sciences Collaboration Team. Kirk Borne Dept of Computational & Data Sciences GMU

Weak Gravitational Lensing. Gary Bernstein, University of Pennsylvania KICP Inaugural Symposium December 10, 2005

Cosmic acceleration. Questions: What is causing cosmic acceleration? Vacuum energy (Λ) or something else? Dark Energy (DE) or Modification of GR (MG)?

Optical Synoptic Telescopes: New Science Frontiers *

Cosmological Galaxy Surveys: Future Directions at cm/m Wavelengths

ANTARES: The Arizona-NOAO Temporal Analysis and Response to Events System

Introduction to SDSS -instruments, survey strategy, etc

The shapes of faint galaxies: A window unto mass in the universe

Data Intensive Computing meets High Performance Computing

Recent BAO observations and plans for the future. David Parkinson University of Sussex, UK

THE DARK ENERGY SURVEY: 3 YEARS OF SUPERNOVA

The Sloan Digital Sky Survey. Sebastian Jester Experimental Astrophysics Group Fermilab

NASA/IPAC EXTRAGALACTIC DATABASE

Lecture 25: Cosmology: The end of the Universe, Dark Matter, and Dark Energy. Astronomy 111 Wednesday November 29, 2017

The Dark Energy Survey

Cosmology with the ESA Euclid Mission

The Future of Cosmology

From the Big Bang to Big Data. Ofer Lahav (UCL)

How do telescopes work? Simple refracting telescope like Fuertes- uses lenses. Typical telescope used by a serious amateur uses a mirror

Using Authentic Astronomical Data in Investigations & Activities. Version 1.1, presented at CONASTA 59, UTS, Monday 5 July 2010.

Unveiling the misteries of our Galaxy with Intersystems Caché

(Present and) Future Surveys for Metal-Poor Stars

arxiv:astro-ph/ v1 3 Aug 2004

Tesla Jeltema. Assistant Professor, Department of Physics. Observational Cosmology and Astroparticle Physics

A Look Back: Galaxies at Cosmic Dawn Revealed in the First Year of the Hubble Frontier Fields Initiative

Large Synoptic Survey Telescope, Computational-science Requirements of. George Beckett, 2 nd September 2016

DataONE: Enabling Data-Intensive Biological and Environmental Research through Cyberinfrastructure

New Directions in Observational Cosmology: A New View of our Universe Tony Tyson UC Davis

Supernovae Observations of the Expanding Universe. Kevin Twedt PHYS798G April 17, 2007

PISCO Progress. Antony A. Stark, Christopher Stubbs Smithsonian Astrophysical Observatory 20 March 2009

APLUS: A Data Reduction Pipeline for HST/ACS and WFC3 Images

The Era of Synoptic Surveys. Peter Nugent (LBNL)

Skyalert: Real-time Astronomy for You and Your Robots

Lensing with KIDS. 1. Weak gravitational lensing

Cyber-infrastructure to Support Science and Data Management for the Dark Energy Survey

THE ROAD TO DARK ENERGY

AURA MEMBER INSTITUTIONS

Ultra-compact binaries in the Catalina Real-time Transient Survey. The Catalina Real-time Transient Survey. A photometric study of CRTS dwarf novae

Supernovae Observations of the Accelerating Universe. K Twedt Department of Physics, University of Maryland, College Park, MD, 20740, USA

The Hunt for Dark Energy

Dark Energy Survey. Josh Frieman DES Project Director Fermilab and the University of Chicago

Moment of beginning of space-time about 13.7 billion years ago. The time at which all the material and energy in the expanding Universe was coincident

The Large Synoptic Survey Telescope Project. Steven M. Kahn SLAC Summer Institute August 4, 2005

Galaxy Clusters in Stage 4 and Beyond

Astro-Informatics: Computation in the study of the Universe

Introductory Review on BAO

Euclid and MSE. Y. Mellier IAP and CEA/SAp.

The Square Kilometre Array and Data Intensive Radio Astronomy 1

ESASky, ESA s new open-science portal for ESA space astronomy missions

Mapping Dark Matter with the Dark Energy Survey

Igor Soszyński. Warsaw University Astronomical Observatory

Physics Lab #10: Citizen Science - The Galaxy Zoo

Space Telescopes Asteroid Mining Space Manufacturing. Wefunder.com/SpaceFab

Spatial Data Science. Soumya K Ghosh

High Precision Exoplanet Observations with Amateur Telescopes

A Cosmic Perspective. Scott Fisher, Ph.D. - Director of Undergraduate Studies - UO Department of Physics

An Industry Perspective. Bryn Fosburgh Vice President Trimble

(Slides for Tue start here.)

Data analysis of massive data sets a Planck example

Foreword. Taken from: Hubble 2009: Science Year in Review. Produced by NASA Goddard Space Flight Center and the Space Telescope Science Institute.

Local Volume, Milky Way, Stars, Planets, Solar System: L3 Requirements

Transcription:

The Scientific Data Flood Scientific Data Flood Large Science Project Pipeline 1 1

The Dark Energy Survey Study Dark Energy using four complementary* techniques: I. Cluster Counts II. Weak Lensing III. Baryon Acoustic Oscillations IV. Supernovae Blanco 4-meter at CTIO Two multiband surveys: 5000 deg 2 g, r, i, Z,Y to i~24 9 deg 2 repeat (SNe) Build new 3 deg 2 camera and Data Management system DES 30% of 5 years of telescope Response to NOAO AO DES Forecast: FoM up by 4.6 *in systematics & in cosmological parameter degeneracies *geometric+structure growth: test Dark Energy vs. Gravity 2

DES Data Management Expect ~2Pb final dataset, with ~100Tb in a relational database LSST precursor Transients Supernovae EPO also drives data service needs Commission late in CY2011 3

DES Data Management DES itself runs in the southern spring, September to March, taking one third of the CTIO 4m Blanco telescope nights for five years 4

LSST (Large Synoptic Survey Telescope): Ten year time series imaging of the night sky mapping the Universe 100,000 events each night anything that goes bump in the night Cosmic Cinematography The New Sky at http://www.lsst.org/ Observing Strategy: One pair of images every 40 seconds for each spot on the sky, then continue across the sky continuously every night for 10 years, with time domain sampling in log(time) intervals (to capture dynamic range of transients). Education and Public Outreach have been an integral and key feature of the project since the beginning the EPO program includes formal l& informal education, Citizen Science projects, Science Centers & Planetaria. 6

Data Challenges It s a large camera 201 CCDs @ 4096x4096 pixels each = 3 Gigapixels = 6 GB per image, covering 10 sq.degrees = ~3000 times the area of one Hubble Telescope image Process Obtain one 6-GB sky image in 15 seconds Process that image in 5 seconds Obtain & process another co-located image within 20s (= 15-second exposure + 5-second processing & slew) Process the 100 million sources in each image pair, catalog all sources, and generate worldwide alerts within 60 seconds (e.g., incoming killer asteroid) Generate 50-100,000 alerts per night (VOEvent messages) Obtain 2000 images per night Produce ~40 Terabytes per night Move the data from South America to US daily Repeat this every day for 10 years Provide rapid DB access to worldwide community: 100-200 Petabyte final 10-year image archive 10-20 Petabyte final 10-year database catalog 7

Data Management System Science Requirements that drive Data Management Very deep, time-dependent, wide field survey unprecedented data volumes (40Tb/night) Controlled systematics advanced photometry, astrometry Rapid transient alerting high performance, low latency Required system capabilities and performance Highly distributed, scalable, and reliable Evolvable scientifically and technically over the survey useful life Derived design and implementation features Advanced scientific algorithms that scale to LSST volumes, rates Layered architecture for extensibility and technology evolution High-performance computing, storage, networking technologies Rigorous formal development process Involves computer scientists, industry analysts, software developers, project managers, and others, from academic, commercial, and government enterprises. 8

The LSST Data Challenge Massive M i data stream: ~2 Terabytes of image data per hour that must be mined in real time (for 10 years). Massive 20-Petabyte database: more than 50 billion objects need to be classified, and most will be monitored for important variations in real time. Massive event stream: knowledge extraction in real time for 100,000 events each night. 9

NSF/NASA - Virtual Astronomical Observatory Move beyond the very successful NVO (international standards, software, data mining support, user interfaces, etc.) to a real astronomical observatory Just another observatory where a wealth of detail is covered (hidden?) by the word just VAO, LLC preliminary certification to receive federal funding agreed by NSF business office; NSF award made, May 2010; NASA contribution funded through existing centers Strong link with Microsoft Research, World Wide Telescope product Research is being enabled notable publications at http://www.us-vo.org/pubs/notablepubs.cfm Problem of marketing, branding 10

It takes a human to interpret a complex image 11

Citizen Science Exploit the cognitive abilities of Human Computation Novel mode of data collection: Citizen Science = Volunteer Science e.g., VGI = Volunteer Geographic Information e.g., Galaxy Zoo @ http://www.galaxyzoo.org/ Citizen science refers to the involvement of volunteer non-professionals in the research enterprise. The experience must be engaging, must work with real scientific data and information, must address authentic science research questions that are beyond the capacity of science teams and enterprises, and must involve the scientists. 12

Examples of Volunteer Science Audubon Bird Counts Project Budburst Stardust@Home VGI (Volunteer Geographic Information) Galaxy Zoo (many refereed publications) Zooniverse (buffet of Zoos) U-Science (semantic science 2.0) includes Biodas.org, Wikiproteins, HPKB, AstroDAS Ubiquitous, User-oriented, User-led, Universal, Untethered, You-centric Science 13