From DES to LSST. Transient Processing Goes from Hours to Seconds. Eric Morganson, NCSA LSST Time Domain Meeting Tucson, AZ May 22, 2017

Similar documents
Astronomy of the Next Decade: From Photons to Petabytes. R. Chris Smith AURA Observatory in Chile CTIO/Gemini/SOAR/LSST

The Large Synoptic Survey Telescope

The Dark Energy Survey Public Data Release 1

Transient Alerts in LSST. 1 Introduction. 2 LSST Transient Science. Jeffrey Kantor Large Synoptic Survey Telescope

Scientific Data Flood. Large Science Project. Pipeline

Data Processing in DES

LSST Pipelines and Data Products. Jim Bosch / LSST PST / January 30, 2018

Machine Learning Applications in Astronomy

Image Processing in Astronomy: Current Practice & Challenges Going Forward

BigBOSS Data Reduction Software

arxiv:astro-ph/ v1 3 Aug 2004

CFHT Large Area U-Band Deep Survey

Astroinformatics: massive data research in Astronomy Kirk Borne Dept of Computational & Data Sciences George Mason University

Cosmology with Wide Field Astronomy

2-D Images in Astronomy

Large Synoptic Survey Telescope

Astronomical image reduction using the Tractor

1 A photometric probe for Pan-STARRS

Big-Data as a Challenge for Astrophysics

Introduction to SDSS -instruments, survey strategy, etc

GALEX GR1 Instrument Performance and Calibration Review

Data Reduction - Optical / NIR Imaging. Chian-Chou Chen Ph319

Tim Axelrod, Jacek Becla, Gregory Dubois-Felsmann, Jeff Kantor, Kian- Tat Lim, Robert Lupton LDM-151

ABSTRACT 1. INTRODUCTION

The SDSS Data. Processing the Data

Real-time Variability Studies with the NOAO Mosaic Imagers

APLUS: A Data Reduction Pipeline for HST/ACS and WFC3 Images

Surprise Detection in Multivariate Astronomical Data Kirk Borne George Mason University

An end-to-end simulation framework for the Large Synoptic Survey Telescope Andrew Connolly University of Washington

Astrometric Observations of Double Stars Using Altimira Observatory

Relative Astrometry Within ACS Visits

Dark Energy. Cluster counts, weak lensing & Supernovae Ia all in one survey. Survey (DES)

Tim Axelrod, Jacek Becla, Gregory Dubois-Felsmann, Jeff Kantor, Kian- Tat Lim, Robert Lupton LDM-151

arxiv: v1 [astro-ph.im] 2 Aug 2016

Strong gravitational lenses in the 2020s

Project for Observational Astronomy 2018/2019: Colour-magnitude diagram of an open cluster

LCO Global Telescope Network: Operations and policies for a time-domain facility. Todd Boroson

Large Imaging Surveys for Cosmology:

SDSS Data Management and Photometric Quality Assessment

(Present and) Future Surveys for Metal-Poor Stars

Gaia Data Processing - Overview and Status

OBSERVATIONS OF SATELLITES AND ASTEROIDS IN DERENIVKA, UKRIANE. V.Kudak, V.Perig, Y.Motrunych, I.Nojbauer

Processing Wide Field Imaging data Mike Irwin

Cyber-infrastructure to Support Science and Data Management for the Dark Energy Survey

A new method to search for Supernova Progenitors in the PTF Archive. Frank Masci & the PTF Collaboration

Photometric Products. Robert Lupton, Princeton University LSST Pipeline/Calibration Scientist PST/SAC PST/SAC,

LSST Science. Željko Ivezić, LSST Project Scientist University of Washington

Real Astronomy from Virtual Observatories

How do telescopes work? Simple refracting telescope like Fuertes- uses lenses. Typical telescope used by a serious amateur uses a mirror

Reduced data products in the ESO Phase 3 archive (Status: 02 August 2017)

Parametrization and Classification of 20 Billion LSST Objects: Lessons from SDSS

LSST. Pierre Antilogus LPNHE-IN2P3, Paris. ESO in the 2020s January 19-22, LSST ESO in the 2020 s 1

Exploring Data. Keck LRIS spectra. Handbook of CCD Astronomy by Steve Howell Chap. 4, parts of 6

Shape Measurement: An introduction to KSB

A Library of the X-ray Universe: Generating the XMM-Newton Source Catalogues

The Large Synoptic Survey Telescope Project. Steven M. Kahn SLAC Summer Institute August 4, 2005

Time Domain Astronomy in the 2020s:

Weak Lensing: Status and Prospects

BAYESIAN CROSS-IDENTIFICATION IN ASTRONOMY

Introduction to the Sloan Survey

Public ESO

Monitor of All Sky X Ray X (MAXI) on the International Space Station

Data Management Plan Extended Baryon Oscillation Spectroscopic Survey

The Pan-STARRS Moving Object Pipeline

ALMA Development Program

Probabilistic photometric redshifts in the era of Petascale Astronomy

Surprise Detection in Science Data Streams Kirk Borne Dept of Computational & Data Sciences George Mason University

Cosmology with the ESA Euclid Mission

Examining Dark Energy With Dark Matter Lenses: The Large Synoptic Survey Telescope. Andrew R. Zentner University of Pittsburgh

LSST, Euclid, and WFIRST

STUDY OF ORPHAN (OFF-AXIS) AND PARENT (ON-AXIS) GAMMA-RAY BURSTS WITH LSST (A SMALL DROP IN THE TRANSIENT SKY)

Dark Energy Research at SLAC

The Large Synoptic Survey Telescope. Steven M. Kahn Presentation to the SLAC Program Review Committee - June 2, 2004

Local Volume, Milky Way, Stars, Planets, Solar System: L3 Requirements

THE DARK ENERGY SURVEY IMAGE PROCESSING PIPELINE

The Baryon Acoustic Peak (BAP) in the correlation function clustering of the SDSS LRG 47k galaxy sample, and is sensitive to the matter density

Modern Image Processing Techniques in Astronomical Sky Surveys

Science Alerts from GAIA. Simon Hodgkin Institute of Astronomy, Cambridge

Fedora Astronomy. Integration of astronomical software into a Linux distribution. Christian Dersch. AG Astronomie, Philipps-Universität Marburg

Introducing DENET, HSC, & WFMOS

Hubble Legacy Archive and Hubble Source Catalog

DECAM SEARCHES FOR OPTICAL SIGNATURES OF GW Soares-Santos et al arxiv:

Robust Machine Learning Applied to Terascale Astronomical Datasets

Pysynphot: A Python Re Implementation of a Legacy App in Astronomy

CLASH MCT Program Progress Report. Progress Report on the CLASH Multi-Cycle Treasury Program By Marc Postman

Transient Astronomy with the Gaia Satellite

RLW paper titles:

Heidi B. Hammel. AURA Executive Vice President. Presented to the NRC OIR System Committee 13 October 2014

LA SILLA PARANAL OBSERVATORY. ESO Calibration Workshop Wide Field Imaging (Surveys?)

ANTARES: The Arizona-NOAO Temporal Analysis and Response to Events System

The shapes of faint galaxies: A window unto mass in the universe

Baryon acoustic oscillations A standard ruler method to constrain dark energy

CONFIRMATION OF A SUPERNOVA IN THE GALAXY NGC6946

Exploring the Depths of the Universe

Future radio galaxy surveys

Laser Interferometer Gravitational-Wave Observatory (LIGO)! A Brief Overview!

A Hunt for Tidal Features in a Nearby Ultra Diffuse Galaxy

arxiv: v1 [astro-ph.sr] 17 Oct 2012

The Zadko Telescope: the Australian Node of a Global Network of Fully Robotic Follow-up Telescopes

Supernova Discovery in the Era of Data-Intensive Science

Transcription:

From DES to LSST Transient Processing Goes from Hours to Seconds Eric Morganson, NCSA LSST Time Domain Meeting Tucson, AZ May 22, 2017

Hi, I m Eric Dr. Eric Morganson, Research Scientist, Nation Center for Supercomputing Applications (NCSA) Astronomer from Pan-STARRS1, SDSS and Dark Energy Survey Run DES transient pipeline at NCSA (with other research interests) Will be production scientist for LSST transient processing My first LSST meeting Inexplicably resemble Tycho Brahe

NCSA: LSST Data Facility NCSA Astronomy Services Group Runs DES and LSST Data Processing Data Processing includes File transfer Database administration High performance computing coordination Software maintenance Archiving and serving the data LSST Data Processing requires Daily interaction (transients, L1) Major development for yearly (L2) releases Coordination between astronomers, programmers and system administrators The National Center for Supercomputing Applications (NCSA) in Urbana, IL

Incidents Response: When Things go Wrong Pan-STARRS and DES required 2-3 years to run smoothly Random things go wrong Pan-STARRS: DES: LSST: Shutter broke Storm wiped out powerline Power surge caused massive data loss (no raw data lost) Lightning caused DB outage Network outages Hardware/software problems Operations team recover and mitigate

Dark Energy Survey Mini-LSST Like SDSS, PS1, ZTF, HSC 4m telescope, 3 Deg 2 FoV 5,000 Deg 2 Survey Includes 10 SN fields Transient detection Taken every 5 days Pre-generated templates 3 hour processing time Widefield transients LIGO events Not processed regularly Templates hand made

DES Software: Not Reinventing the Wheel SExtractor: source extraction, catalog making PSFEx: PSF extraction and modeling SCAMP: Astrometric calibration SWarp: Image regridding for coaddition Emmanuel Bertin (Astromatic author) is collaboration member HOTPANTS (Andrew Becker): difference imaging DES-developed algorithms and software Flat fielding Sky background modeling Calibration Multi-Object Fitting Weak lensing analysis

How is LSST different? Wide field transients Generating templates on the fly Devoted software stack Transient response time: 1 minute (not 3 hours) Where does 200x come from?

DES Transient Pipeline: 3 Hours DES wants Type 1a Supernovae Follow-up within 1 week desirable Can obtain redshift from galaxy Need image evaluation in 10 hours Transient pipeline takes 3 hours, because it can

Transfer/Alert Speedups DES file transfer: 20 minutes Transfer time: slow Data archived before processing Waiting for 10 minutes cron jobs Auxiliary files loaded as needed LSST transfer will be seconds Devoted 100 GBPS line Parallel archiving and processing Working live no cron Auxiliary files prestaged DES alert system: 10 minutes Literally waiting for a cron job LSST alerts instantaneous

Whole Exposure Processing Crosstalk correction, astrometry, sky fitting best done on whole exposure Some tasks run 60x serially Once per CCD Dominated by flat fielding Parallelization possible Serial 130 minutes Parallel 8 minutes LSST Crosstalk Correction performed at telescope Without crosstalk, DES is 5.5 minutes

Whole Exposure Processing Some tasks run 60x serially Once per CCD Dominated by flat fielding Sky fitting/astrometry best done on whole exposure Parallelization possible Serial 130 minutes Parallel 8 minutes LSST Crosstalk Correction performed at telescope Without crosstalk, DES is 5.5 minutes

Real Reasons for Not Parallelizing Crosstalk Correction: whole exposure necessary LSST does it on the mountain LSST only corrects neighboring CCDs DES has crosstalk of up to 10-5 for non-neighboring CCDs Astrometry: best done on whole exposure Better constrained with more stars DES difference imaging fails ~15% of the time But SN taking with worst seeing Single CCD astrometry increases failure rate in marginal weather Sky (Background) Fitting: best done on whole exposure Limit L1 calibration L2 processing will perform full exposure analysis

DES Transient Pipeline: CCD Parallelized Last 8 minutes: parallelized by CCD Dominated by difference imaging HOTPANTS PSF Modeling next SExtractor, PSFEx Fully parallelized, post-crosstalk: 13.5 minutes One more way to speed things up...

Keep data in memory DES reads and writes files each of 46 processes Usually a large fits file Most not returned Rough estimate: 25% of processing time file i/o Keeping all data in memory takes us down to 10 minutes Increased operations difficulties Debugging easier with distinct processes and output Even moreso with batch operations Processes killed externally

From 10 Minutes to 1 Minute: Magic Processors still getting faster Optimizing code for LSST Algorithmic magic I am excited to learn about the magic

Conclusions Previous surveys show: operations team essential LSST detect transients 200X faster than DES 20X from parallelization, algorithms, working in memory Real data sacrifices and operations challenges 10X from faster code Interested in understanding how that works