Sparse Composite Models of Galaxy Spectra (or Anything) using L1 Norm Regularization. Benjamin Weiner Steward Observatory University of Arizona

Similar documents
IRS Spectroscopy of z~2 Galaxies

Star Formation Indicators

The relation between cold dust and star formation in nearby galaxies

Exploring star formation in cluster galaxies. the Herschel Space Observatory. Tim Rawle. Steward Observatory, University of Arizona

Ultra Luminous Infared Galaxies. Yanling Wu Feb 22 nd,2005

Dusty star-forming galaxies at high redshift (part 5)

List of Changes and Response to Referee Report

Aromatic Features in M101 HII Regions and Starburst Galaxies

Exploiting Sparse Non-Linear Structure in Astronomical Data

Far-infrared Herschel SPIRE spectroscopy reveals physical conditions of ionised gas in high-redshift lensed starbursts

Stellar Mass Estimates of Galaxies in Superclusters at z~1

Margherita Talia 2015 A&A, 582, 80 ELG2017. University of Bologna. m g 1

LoCuSS: A Legacy Survey of Galaxy Clusters at z=0.2

Deriving stellar masses from SDSS

Lecture 11: SDSS Sources at Other Wavelengths: From X rays to radio. Astr 598: Astronomy with SDSS

The Ṁass- loss of Red Supergiants

THE GAS MASS AND STAR FORMATION RATE

Quantifying correlations between galaxy emission lines and stellar continua

Dusty star-forming galaxies at high redshift (part 5)

Extragalactic Astronomy

Hunting for Infrared Signatures of Supermassive Black Hole Activity in Dwarf Galaxies

A. Chen (INAF-IASF Milano) On behalf of the Fermi collaboration

From theory to observations

Star Formation in Brightest Cluster Galaxies in Cool Cores:

Lasso, Ridge, and Elastic Net

STA414/2104 Statistical Methods for Machine Learning II

The PRIsm MUlti-object Survey (PRIMUS)

Chien-Ting Chen! Dartmouth College

Radio infrared correlation for galaxies: from today's instruments to SKA

Gas Accretion & Outflows from Redshift z~1 Galaxies

The Far-Infrared Radio Correlation in Galaxies at High Redshifts

The HII Regions of Sextans A

9. Evolution with redshift - z > 1.5. Selection in the rest-frame UV

LePhare Download Install Syntax Examples Acknowledgement Le Phare

POLYCYCLIC AROMATIC HYDROCARBON CONTRIBUTION TO THE INFRARED OUTPUT ENERGY OF THE UNIVERSE AT Z 2

Introduction and Motivation

Chapter 7: From theory to observations

Galaxy Formation in the Epoch of Reionization: Semi-Analytic Model Predictions for JWST Observations

Galaxies 626. Lecture 9 Metals (2) and the history of star formation from optical/uv observations

STAR FORMATION ALONG A CLUSTER-FEEDING FILAMENT

EBL Studies with the Fermi Gamma-ray Space Telescope

Evolution of Star Formation Activity of Galaxies as seen by Herschel

The Measurement of the Expansion Rate of the Universe from γ-ray Attenuation*

Today. Kirchoff s Laws. Emission and Absorption. Stellar Spectra & Composition. Doppler Effect & Motion. Extrasolar Planets

Measurement of the stellar irradiance

VY Canis Majoris: The Astrophysical Basis of Its Luminosity

Dust. The four letter word in astrophysics. Interstellar Emission

Spectral Energy Distributions as probes of star formation in the distant Universe

The Infrared Properties of Sources in the H-ATLAS and WISE Surveys

Sunrise: Patrik Jonsson. Panchromatic SED Models of Simulated Galaxies. Lecture 2: Working with Sunrise. Harvard-Smithsonian Center for Astrophysics

arxiv:astro-ph/ v1 17 Oct 2002

A Calibration Method for Wide Field Multicolor. Photometric System 1

A Monster at any other Epoch:

Hierarchical Bayesian Modeling

aka Light Properties of Light are simultaneously

Big Data Inference. Combining Hierarchical Bayes and Machine Learning to Improve Photometric Redshifts. Josh Speagle 1,

Cosmological Background Radiation and Extragalactic Gamma-ray Opacity

A mid and far-ir view of the star formation activity in galaxy systems and their surroundings

A Far-ultraviolet Fluorescent Molecular Hydrogen Emission Map of the Milky Way Galaxy

A tool to test galaxy formation theories. Joe Wolf (UC Irvine)

Closing in on the NIRB & γ-ray Opacities

Today. Spectra. Thermal Radiation. Wien s Law. Stefan-Boltzmann Law. Kirchoff s Laws. Emission and Absorption. Spectra & Composition

Hierarchical Bayesian Modeling

Multi-wavelength Astronomy

A Least Squares Formulation for Canonical Correlation Analysis

Jens Melinder. With financial support from CNES/CNRS convention #

Interstellar Dust Clouds UROP Spring 2013 Chris Nolting Faculty Mentor: Professor Terry Jones 5/17/13

The Cosmic History of Star Formation. James Dunlop Institute for Astronomy, University of Edinburgh

Paul Broberg Ast 4001 Dec. 10, 2007

Treatment and analysis of data Applied statistics Lecture 4: Estimation

G. M. BERNSTEIN Department of Astronomy, University of Michigan, 830 Dennison Building, Ann Arbor, MI 48109

From theory to observations

Regularization Parameter Selection for a Bayesian Multi-Level Group Lasso Regression Model with Application to Imaging Genomics

Survey of dusty AGNs based on the mid-infrared all-sky survey catalog

Spectroscopy of Blue Supergiants in the Disks of Spiral Galaxies: Metallicities and Distances. Rolf Kudritzki

A Unified Theory of Photometric Redshifts. Tamás Budavári The Johns Hopkins University

Supernovae Observations of the Expanding Universe. Kevin Twedt PHYS798G April 17, 2007

VOSA. A VO Spectral Energy Distribution Analyzer. Carlos Rodrigo Blanco 1,2 Amelia Bayo Arán 3 Enrique Solano 1,2 David Barrado y Navascués 1

Removing systematic noise from lightcurves

THE CONNECTION BETWEEN STAR FORMATION AND DARK MATTER HALOS AS SEEN IN THE INFRARED

Or How to select variables Using Bayesian LASSO

Lecture 14: Shrinkage

Regression, Ridge Regression, Lasso

Physics 224 The Interstellar Medium

Measuring stellar distances.

The bimodal spiral galaxy surface brightness distribution

A tool to test galaxy formation arxiv: Joe Wolf (UC Irvine)

Astrophysics of Gaseous Nebulae and Active Galactic Nuclei

Stellar Population Mass Estimates. Roelof de Jong (STScI AIP) Eric Bell (MPIA Univ. of Michigan)

PoS(LCDU 2013)010. The extinction law at high redshift. Simona Gallerani Scuola Normale Superiore

Source plane reconstruction of the giant gravitational arc in Abell 2667: a condidate Wolf-Rayet galaxy at z 1

Studies of diffuse UV radiation

Type Ia Supernovae: Standardizable Candles and Crayons

Lecture 2: Galaxy types, spectra

Lecture 2: Galaxy types, spectra. Galaxies AS

Type II Supernovae as Standardized Candles

Gas Masses and Gas Fractions: Applications of the Kennicutt- Schmidt Law at High Redshift

Feeding the Beast. Chris Impey (University of Arizona)

Analysis Methods for Supersaturated Design: Some Comparisons

Bayesian Modeling for Type Ia Supernova Data, Dust, and Distances

Transcription:

Sparse Composite Models of Galaxy Spectra (or Anything) using L1 Norm Regularization Benjamin Weiner Steward Observatory University of Arizona

Inference of mass, redshift, etc. from physical models Many astrophysical problems require fitting complex physical models to a limited set of observables, where the model parameters can be either the object of study or nuisance parameters. The parameter space is large and the generating process is typically non-linear and expensive. For a galaxy s stellar mass, redshift, star formation history, dust mass, etc., the usual approach is to take a very large set of possible models, e.g. star formation histories, generate stellar populations and thence colors from each model, and fit the models one at a time to the observed flux data, scaling in brightness (and sometimes dust extinction) only. This process has several problems: - galaxies are intrinsically composite, - systematics due to template mismatch, - limited model space due to imposed forms of SF history, dust models, etc.

Galaxy dust mass from modeling far-infrared emission Data: Empirical galaxy far-ir spectra (Rieke et al 2009), which we observe at 8, 24, 70, 100, 160, 250, 350, 500 um Models: 240 physically motivated dust model spectra, generated by different mixtures of radiation intensity, dust composition, etc (Draine & Li 2007)

Fitting dust models one-at-a-time to simulated observations log L_IR=10.5 log L_IR=11 Red line is best fit of the 240 Draine & Li dust models (themselves composites of a diffuse and hot component) to black points with 10% errors. Normalization -> dust mass (eg Magdis et al 2012, Magnelli et al 2012). Problems: - Best fit still isn t very good. - Marginalizing over probability of each model is thus bogus. - Marginalization and Monte Carlo realizations of error stick to a few models, and so underestimate the parameter uncertainties.

Lack of diversity in best-fit single model SEDs Despite the seeming variety of 240 model SEDs, in the Monte Carlo realizations of 14 different far-ir spectra with 10% errors, nearly all of the best fits use only the 10 most common models.

How to fit linear combinations of models without going crazy c2 L1 penalty Fit likelihood Minimize: j (y j i c ix i,j ) 2 2 y,j c1 + i c i y j are data x i,j are i th model Penalize by sum of c s Real galaxies are composites. A linear combination of models spans observable space better than one-at-a-time models, but combinations of 10s to 100s of models are highly degenerate. Applying an L1-norm penalty makes the fit sparse by keeping many coefficients at zero. (see LASSO, e.g. R. Tibshirani 1996, M. Schmidt 2005)

Fitting composite dust models with L1 regularization log L_IR=10.5 log L_IR=11 Blue lines are the fitted components. From an input set of the 9 most popular Draine & Li models, the L1-penalized fit left 6 at zero, and fit the data with only 3 components. Remaining issues: - Overfitting: shallow minimum / fits are degenerate. - This causes MC realizations to again underestimate the error on mass. - Solution can be sensitive to the minimization algorithm; need to explore the acceptable region, e.g. with MCMC.

Composite models fit the data better, but their real value is showing the larger parameter space allowed by the data The best-single-fit and composite models find similar trends of dust mass with far-ir luminosity, but with an offset. The offset is not huge, but is grossly larger than the conventional method of error estimate on the bestfits, because galaxies are not well described by single models (even though these DL models are already mixtures of diffuse+hot components). The L1 regularization method is generally applicable to any data that is the sum of complex, physically meaningful models.

Thanks to David Hogg for suggesting the L1-norm as a technique to enforce sparsity, and George Rieke for discussions on infrared spectra and dust models. References: B. Draine & A. Li. 2007, ApJ, 657, 810 G. Rieke et al. 2009, ApJ, 692, 556 G. Magdis et al. 2012, ApJ, 760, 6 B. Magnelli et al. 2012, A&A, 548, 22 M. Schmidt. 2005, Least squares optimization with L1-norm regularization, https:// www.cs.ubc.ca/~schmidtm/documents/2005_notes_lasso.pdf R. Tibshirani. 1996, Regression shrinkage and selection via the lasso, J. Roy. Stat. Soc. B, 58, 267