SNLS supernovae photometric classification with machine learning

Similar documents
Photometric classification of SNe Ia in the SuperNova Legacy Survey with supervised learning

Supernovae photometric classification of SNLS data with supervised learning

Supernovae photometric classification of SNLS data with supervised learning

Cosmological constraints from the 3rd year SNLS dataset

R. Pain CSP 07 June 17,

Supernova Surveys. Julien GUY PPC2010, Torino, July LPNHE - IN2P3 - CNRS - Universités Paris 6 et Paris 7

Constraining Dark Energy: First Results from the SDSS-II Supernova Survey

Cosmology of Photometrically- Classified Type Ia Supernovae

Precision cosmology with Type Ia Supernovae?

Supernovae with Euclid

THE DARK ENERGY SURVEY: 3 YEARS OF SUPERNOVA

arxiv: v3 [astro-ph.co] 24 Jan 2018

SDSS-II Supernova Survey (continued) David Cinabro Wayne State University AIPW, Dubrovnik, Croatia 4-9 September 2006

SNIa detection in the SNLS photometric analysis using Morphological Component Analysis

Cosmology with type Ia Supernovae

Present and Future Large Optical Transient Surveys. Supernovae Rates and Expectations

The Dark Energy Survey - Supernova Survey

arxiv: v2 [astro-ph] 21 Aug 2007

Dependence of low redshift Type Ia Supernovae luminosities on host galaxies

MEASURING TYPE IA SUPERNOVA POPULATIONS OF STRETCH AND COLOR AND PREDICTING DISTANCE BIASES

Challenges of low and intermediate redshift supernova surveys

HSC Supernova Cosmology Legacy Survey with HST

The universe is static

Supernovae and the Accelerating Universe

Dark Energy. Cluster counts, weak lensing & Supernovae Ia all in one survey. Survey (DES)

Building a Photometric Hubble Diagram from DES-SN Data

A new method to search for Supernova Progenitors in the PTF Archive. Frank Masci & the PTF Collaboration

Bayesian Modeling for Type Ia Supernova Data, Dust, and Distances

arxiv: v2 [astro-ph.co] 4 Jun 2014

arxiv: v2 [astro-ph.co] 2 Oct 2014

arxiv: v1 [astro-ph.co] 4 Jun 2012

SNLS3: Constraints on Dark Energy Combining the Supernova Legacy Survey Three Year Data with Other Probes

Supplementary Information for SNLS-03D3bb a super- Chandrasekhar mass Type Ia supernova

The Supernova Legacy Survey (SNLS)

Type Ia Supernovae meet Maximum Likelihood Estimators

Stages of a Big Project ***

Making Cosmologyʼs Best Standard Candles Even Better

The Old Stellar Population Studies with Subaru Young-Wook Lee Yonsei Univ., Seoul, Korea

Learning algorithms at the service of WISE survey

Photometric Redshifts for the NSLS

Results from the Nearby Supernova Factory

PESSTO The Public ESO Spectroscopic Survey for Transient Objects

Type II Supernovae as Standardized Candles

BAYESIAN SINGLE-EPOCH PHOTOMETRIC CLASSIFICATION OF SUPERNOVAE

The Keck/SDSS SN Sample

Big Data Inference. Combining Hierarchical Bayes and Machine Learning to Improve Photometric Redshifts. Josh Speagle 1,

BAO and Lyman-α with BOSS

A SALT Spectropolarimetric Survey of Supernovae ( S 4 ) K. Nordsieck Univ of Wisconsin

First results from the Stockholm VIMOS Supernova Survey

EUCLID Spectroscopy. Andrea Cimatti. & the EUCLID-NIS Team. University of Bologna Department of Astronomy

Classifying Galaxy Morphology using Machine Learning

Improved Astronomical Inferences via Nonparametric Density Estimation

arxiv: v1 [astro-ph.co] 4 Oct 2010

Astronomy. Astrophysics

Introduction to SDSS -instruments, survey strategy, etc

arxiv: v1 [astro-ph.co] 22 Oct 2010

Supernova Legacy Survey 3-years data sample Delphine HARDIN, on behalf of the Supernova Legacy Survey (SNLS)

Carnegie Supernova Project: Detailed Observations of Supernovae

SN-MCT Science Goals

Testing the mutual consistency of different supernovae surveys

Infrared Light Curves of Type Ia Supernovae

Data Analysis Methods

Supernova cosmology. The quest to measure the equation of state of dark energy. Bruno Leibundgut European Southern Observatory

Supernovae Observations of the Expanding Universe. Kevin Twedt PHYS798G April 17, 2007

The SDSS is Two Surveys

SDSS-II: DETERMINATION OF SHAPE AND COLOR PARAMETER COEFFICIENTS FOR SALT-II FIT MODEL

Addressing the Challenges of Luminosity Function Estimation via Likelihood-Free Inference

Part 3: The Dark Energy

Optical/NIR Spectroscopy A3130. John Wilson Univ of Virginia

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

Physics 24: The Big Bang, Week 3: Hubble s Law David Schlegel, LBNL

Measurements of Dark Energy

Type Ia Supernovae from Near-IR Photometry

ABSTRACT. 1. Introduction

Astronomical image reduction using the Tractor

Automated Classification of HETDEX Spectra. Ted von Hippel (U Texas, Siena, ERAU)

THE RISE TIME OF TYPE Ia SUPERNOVAE FROM THE SUPERNOVA LEGACY SURVEY 1

Observational cosmology and Type Ia Supernovae, Part II

Optically Selected GRB Afterglows, a Real Time Analysis System at the CFHT

Quantifying correlations between galaxy emission lines and stellar continua

Semi-supervised learning for photometric supernova classification

arxiv: v2 [astro-ph.co] 18 Apr 2016

Data Release 5. Sky coverage of imaging data in the DR5

Énergie noire Formation des structures. N. Regnault C. Yèche

An Introduction to the Dark Energy Survey

Late time cosmology with GWs

Type Ia SNe standardization accounting for the environment. Lluís Galbany CENTRA-IST, UTL, Lisbon

Exploiting Sparse Non-Linear Structure in Astronomical Data

Cosmology with Peculiar Velocity Surveys

Transient Astronomy with the Gaia Satellite

SALT: a Spectral Adaptive Light curve Template for Type Ia Supernovae

Probabilistic photometric redshifts in the era of Petascale Astronomy

Photometric Redshifts, DES, and DESpec

Science Alerts from GAIA. Simon Hodgkin Institute of Astronomy, Cambridge

VISTA HEMISPHERE SURVEY DATA RELEASE 1

A. KIM, M. KIM, J. LEE, R. PAIN 2, C. PENNYPACKER, S. PERLMUTTER AND B. BOYLE, P. BUNCLARK, D. CARTER, K. GLAZEBROOK 3 AND

2. Observations and data reduction

Cosmology with Wide Field Astronomy

Multi-wavelength Surveys for AGN & AGN Variability. Vicki Sarajedini University of Florida

Some ML and AI challenges in current and future optical and near infra imaging datasets

Transcription:

SNLS supernovae photometric classification with machine learning Courtesy of D. Campbell-Wilson Anais Möller CAASTRO, Australian National University Statistical Challenges in Astronomy, Carnegie Mellon University June 2016

SNe Ia cosmology Type Ia SNe very luminous homogeneous spectral and photometric properties characteristic light curves -> we assume reproducible luminosities SNIa cosmology advantages: Betoule et al. 2014 orthogonal constraints to CMB and BAO sensitive to dark energy properties 2

SNIa cosmology challenges we want : - more SNe Ia - precise measurements e.g. improving systematics. - host galaxy - modelling / templates - e.g. JLA Betoule et al. 2014 more: not enough resources for spectroscopy (DES,LSST) -> photometric classification

SN photometric classification challenges how to classify?: - features: some type of fitter - redshift: photometric/ host-galaxy spectroscopic - classification algorithm we need to trust our sample: - high purity sample - without sacrificing stats (efficiency) from simulations to data from a classified sample to cosmology the unexpected

SUPERNOVA PHOTOMETRIC CLASSIFICATION CHALLENGE Richard Kessler, 1,2 Alex Conley, 3 Saurabh Jha, 4 Stephen Kuhlmann 5 Challenge Released on Jan 29, 2010. Last update: September 24, 2013 ABSTRACT We have publicly released a blinded mix of simulated SNe, with types (Ia, Ib, Ic, II) selected in proportion to their expected rate. The simulation is realized in the griz filters of the Dark Energy Survey (DES) with realistic observing conditions (sky noise, point spread function and atmospheric Participants Abbreviation a Classified +Z b /noz c SN z ph d CPU e Description (strategy class) f P. Belov and S. Glazov.... Belov & Glazov Yes/no No 90 Light-curve χ 2 test against Nugent templates (2) S. Gonzalez... Gonzalez Yes/yes No 120 Cuts on SiFTO fit χ 2 and fit parameters (1) InCA group g... InCA No/yes No 1 Spline fit and nonlinear dimensionality reduction (4) JEDI group h... JEDI-KDE Yes/yes No 10 Kernel density evaluation with 21 parameters (4) JEDI group... JEDI boost Yes/yes No 10 Boosted decision trees (4) JEDI group... JEDI-Hubble Yes/no No 10 Hubble-diagram KDE (3) JEDI group... JEDI combo Yes/no No 10 Boosted decision trees and Hubble KDE (3 & 4) MGU+DU group I... MGU+DU-1 No/yes No <1 Light-curve slopes and neural network (2) MGU+DU group... MGU+DU-2 No/yes No <1 Light-curve slopes and random forests (2) Portsmouth group j... Portsmouth χ 2 Yes/no No 1 SALT-II χ 2 r and false discovery rate statistic (1) Portsmouth group..... Portsmouth-Hubble Yes/no No 1 Deviation from parametrized Hubble diagram (3) D. Poznanski..... Poz2007 RAW Yes/no Yes 2 SN automated Bayesian classifier (SN ABC) (2) D. Poznanski..... Poz2007 OPT Yes/no Yes 2 SN-ABC with cuts to optimize C FoM-Ia (2) S. Rodney... Rodney Yes/yes Yes 230 SN ontology with fuzzy templates (2) Asimpleandrobustmethodforautomatedphotometric classification of supernovae using neural networks N.V. Karpenka 1,F.Feroz 2 and M.P. Hobson 2 1 PHOTOMETRIC SUPERNOVA CLASSIFICATION WITH MACHINE LEARNING Michelle Lochner 1,JasonD.McEwen 2,HiranyaV.Peiris 1,OferLahav 1, Max K. Winter 1 1

SNLS based on the Canada France Hawaii Telescope MegaCam : 36 CCD mosaic 4 broadband filters g,r,i,z 4 fields of 1 square degree rolling search mode spectroscopic follow up (Keck, Gemini and VLT) observations: 2003-2008 the SuperNova Legacy Survey - SNLS3 analysed and published - SNLS5 currently being processed (complete SNLS data set)

SNLS3 contains large spectroscopically and photometrically identified SNe samples Entries 45 40 35 Entries Mean RMS Entries Mean RMS 310 0.86 0.22 175 0.60 0.21 30 25 20 Unidentified sample Identified sample 15 10 5 0 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 Host photometric redshift

SNLS pipelines standard SNLS approach e.g. Astier et al. 2005, Guy et al. 2010, used in JLA Betoule et al. 2015 real time detection, uses spectroscopy for typing and redshift 0.15 < z <1.1 deferred photometric pipeline developed in the SNLS Saclay group (France) no spectroscopy required differed detection of all kinds of transient events larger number of detections larger redshift coverage sensitive to other SN types G. Bazin et al. A&A 534, A43 (2011) Photometric selection of Type Ia supernovae in the Supernova Legacy Survey. G.Bazin et al. A&A 499, Issue 3,(2009) The core-collapse rate from the Supernova Legacy Survey.

SNLS photometric pipeline pre-processed CFHT images photometry SNe Ia transient events photometry SN-like photometry core-collapse SNe Preselection cuts (SN-like): -One significant flux variation SN-like variation Star rejection -Quality cuts on light curves SNLS3 original pipeline 300,000 -> 1,500

SN classification SN-like SNe Ia good z bad z How? 1. extract information: A. light-curve features B. redshift 2. classification strategy core-collapse SNe I. sequential cuts with host-galaxy photometric z & SALT2 G. Bazin et al. A&A 534, A43 (2011) II. supervised learning with SN z & general fitter Möller et al. in preparation (2016) III. supervised learning with host-galaxy photometric z & SALT2

method I G. Bazin et al. A&A 534, A43 (2011) How: 1. extract information: A. light-curve features SALT2: C, x1, magnitudes,chi 2. B -19-18 -17-16 -10 0 10 20 30 40 phase B. redshift host-galaxy photometric: Ilbert et al. 2006: 2. selection cuts: 520,000 galaxies with an AB magnitude brighter than 25 in im. ~4% resolution, catastrophic assignment ( z/(1 + z) > 0.15 ): 3.7% V R -19-18 -17-16 -19-18 -17-16 -10 0 10 20 30 40 phase -10 0 10 20 30 40 phase Guy et al. 2008 - sampling - poor fit 3. classification strategy: sequential cuts

method I photometric host galaxy z + SALT2 + sequential cuts G. Bazin et al. A&A 534, A43 (2011) SNLS3 data spectroscopic SNe Ia spectroscopic CC simulated SNe Ia

method I photometric host galaxy z + SALT2 + sequential cuts Simulation purity contamination e ciency SNe Ia bad redshift SNe Ia core-collapse SNIa 94.4 ± 0.5% 0.65 ± 0.08% 4.9 ± 0.5% 29.9 ± 0.3% SNLS3 data # events # spectroscopic SNe Ia # spectroscopic CC # photometric CC 486 175 0 0 the core-collapse photometric sample was determined by Bazin et al. 2009 in a specific analysis.

supervised learning decision trees make successive rectangular cuts in the parameter space supervised classification methods simulations to train improving model: boosting: making the model more complex AdaBoost, XGBoost averaging methods: Random Forest, Bagging scikit-learn implementation

supervised learning redshift + features N-dimensional problem BDT response 1-dimensional problem

method II How: 1. extract information: A. light-curve features SALT2: C, x1, magnitudes and chi 2 of the fit. B. redshift host-galaxy photometric 2. selection cuts: - sampling - poor fit 3. classification strategy: XGB DT

method II host-galaxy photometric z, SALT2 fit, XGB efficiency 38.2 +- 0.7 purity 98.0 +- 0.3 Bazin et al. 2011 Gupta et al. 2016 BUT host-galaxy redshift assignment (zgal) efficiency is 83%

method III How: 1. extract information: A. redshift SN-redshift B. light-curve features general fitter Möller et al. (2016) in preparation 2. selection cuts (adapted to our analysis): - sampling - poor fit 3. classification strategy: supervised learning

SN redshift Palanque-Delabrouille et al. 2010 estimated directly from SN light curves. light curves are fitted using SALT2 searching minimisation of the reduced χ 2 1. for successive values of the redshift separated by δz = 0.1 with color and stretch set with gaussian priors. 2. setting the color and stretch parameters free and performing a new redshift scan around first step s redshift with the same step δz as above. set up on SNLS3 data ~2% resolution, catastrophic assignment ( z/(1 + z) > 0.15 ): 1.4%

general SN fitter SN photometric redshift algorithm uses SALT2. Advisable to use a light curve fitter which is: independent from SALT2 able to fit other types of SNe. e.g. the empirical model used for SNlike selection. f k = A k e (t tk 0 )/ k fall 1+e (t tk 0 )/ k rise + c k k= filter t k max = t k 0 + k riseln( k fall/ k rise 1)

method III How: 1. extract information: A. light-curve features general fitter B. redshift SN-redshift 2. selection cuts (adapted to our analysis): - sampling - poor fit 3. classification strategy: supervised learning

method III data preliminary all events spectroscopic SNe Ia spectroscopic CC visually non SN-like visually long extreme z

method III How: 1. extract information: A. light-curve features general fitter B. redshift SN-redshift 2. selection cuts (adapted to our analysis): - sampling - poor fit 3. classification strategy: supervised learning - setting parameters - selecting features - crossvalidation using scikit-learn packages -Random Forest -AdaBoost -XGBoost (extreme gradient boosting). http://xgboost.readthedocs.org/

method III preliminary simulations

method III simulations preliminary we want to compare classified samples -> 95% purity

method III preliminary simulations data 95% purity AdaBoost Random Forest XGBoost total e ciency Ia 36.9 ± 0.6 32.4 ± 0.7 41.1 ± 0.7 purity Ia 95.6 ± 0.5 95.6 ± 0.4 95.3 ± 0.4 contamination Ia bad z 0.53 ± 0.09 0.29 ± 0.07 0.60 ± 0.09 contamination CC 3.9 ± 0.4 4.1 ± 0.4 4.0 ± 0.4 AdaBoost Random Forest XGBoost photometric sample 478 549 670 spectroscopic Ia 166 198 223 photometric Ia 318 364 444 spectroscopic CC 2 2 3 photometric CC 1 1 6

method III simulations

method III SN-z, general fit, XGB 98% purity data cut SNLS3 events spectroscopic photometric simulated in sample Ia CC Ia CC Ia% SN-like 1483 246 42 486 109 50 selected 1193 238 30 481 77 47 classified 529 205 1 374 1 35 simulations preliminary

method III preliminary simulations simulations data

take away how to classify? 3 DT algorithms different features purity and efficiency high purity from simulations to data 1st application to high-z data 95-98% purity, high efficiency - selection cuts necessary - unexpected sim->data - rely on training from a classified sample to cosmology - SNLS3 V. Ruhlmann-Kleider - and stats ML new questions SNIa diversity method to be applied for SNLS5