Celeste: Variational inference for a generative model of astronomical images

Size: px

Start display at page:

Download "Celeste: Variational inference for a generative model of astronomical images"

Asher Spencer
5 years ago
Views:

1 Celeste: Variational inference for a generative model of astronomical images Jerey Regier Statistics Department UC Berkeley July 9, 2015 Joint work with Jon McAulie (UCB Statistics), Andrew Miller, Ryan Adams, Matt Homan, Dustin Lang, David Schlegel, and Prabhat

2 An astronomical image An image from the Sloan Digital Sky Survey, showing a galaxy from the constellation Serpens, 100 million light years from Earth, along with several other galaxies and many stars from our own galaxy. 2 / 20

3 Project goals Catalog all galaxies and stars that are visible through the next generation of telescopes. The Large Synoptic Survey Telescope, for example, will house a 3200-megapixel camera producing 8 terabytes of images nightly. Identify promising galaxies for spectrograph targeting. Better understand dark energy and the geometry of the universe. Develop an extensible model and inference procedure, for use by the astronomical community. Future applications might include nding supernovae and detecting killer asteroids. 3 / 20

4 The Celeste graphical model 4 / 20

5 Brightness and colors latent random variable r s models brightness of source s in the reference band the arriving quantity of energy in band r ( rs (as = i) Gamma Υ (i), Φ (i)). latent random vector c s models the B 1 colors of source s color = log ratio of brightness in adjacent bands ( ks (as = i) Categorical cs (ks = d, as = i) MvNormal Ξ (i) 1,..., Ξ(i) D ), ( Ω (i,d), Λ (i,d)). Then, the brightness l sb of each light source s in each band b is a deterministic function of r s and c s. 5 / 20

6 Colors: scientic priors 6 / 20

7 Galaxies: morphology 7 / 20

8 Galaxies: light-density model [ ] cos ϕs sin ϕ R s = s, sin ϕ s cos ϕ s W s = R s [ σ 2 s 0 0 σ 2 s ρ 2 s ] R s, J h si (w ) = η ij φ (w ; µ s, ν ij W s ), i = 0 or 1 j=1 h s (w ) = θ s h s1 (w ) + (1 θ s )h s0 (w ). 8 / 20

9 Idealized sky view Let δ µs denote the Dirac delta functionthe prole of a star. Then brightness for sky position w is S G(w ) = l sb g sas (w ) s=1 where g si (w ) = { δ µs (w ), if i = 0 (star") h s (w ), if i = 1 (galaxy"). 9 / 20

10 Point spread function Credit: Sloan Digital Sky Survey 10 / 20

11 Point spread function PSF mixture of K Gaussians: f nbm (w ) = K ᾱ nbk φ ( w m ; w + ξ ) nbk, τ nbk. k=1 φ is the bivariate normal density. K and the (ᾱ nb, ξ nb, τ nb ) are the parameters of the PSF, determined a priori. 11 / 20

12 Likelihood: idealized sky view + PSF Convolve intermediate sky view w/psf photon arrival rate for pixel m: G nbm = G(w )f (w ) dw S = l sb g sas (w )f (w ) dw. s=1 These normal-normal convolutions are analytic. For stars, f s0 (m) := g s0 (w )f (w ) dw K = ᾱ nbk φ ( m; µ s + ξ ) nbk, τ nbk. k=1 Let θ s1 = θ s and θ s2 = 1 θ s. For galaxies, f s1 (m) := g s1 (w )f (w ) dw = K ᾱ nk 2 θ si k=1 i=1 j=1 J η 1j φ ( m; µ s + ξ ) nbk, τ nbk + ν ij W s. 12 / 20

13 Likelihood Let a = (a s ) S, s=1 r = (r s) S, and s=1 c = (c s) S s=1. Then the expected number of photons landing in pixel m is F nbm (a, r, c) = ι nb [ɛ nb + G nbm ]. For n = 1,..., N, b = 1,..., B, and m = 1,..., M, we model x nbm a, r, c Poisson (F nbm (a, r, c)). 13 / 20

14 Intractable posterior Let Θ = (a, r, c). The posterior on Θ is intractable because of coupling between the sources: p(θ x) = p(x Θ)p(Θ) p(x) and p(x) = = p(x Θ)p(Θ) d Θ N B M p(x nbm Θ)p(Θ) dθ. n=1 b=1 m=1 14 / 20

15 Variational inference Let Q be a family of distributions on latent variables Θ. For q Q, log p(x) E q [log p(x, Θ)] E q [log q(θ)] = log p(x) D KL (q(θ), p(θ x)) =: L(q). Variational inference approximates the exact posterior with a simpler distribution q = arg max q Q L(q). 15 / 20

16 Variational inference: advantages and limitations Advantages: potentially orders of magnitude faster the MCMC no post-processing of samplescan compute statistics of the approximating distribution nearly instantaneously Limitations: biasand few error bounds are known for statistics based on an approximating distribution rather than the true posterior may require modeling changes, to avoid intractable expectations may necessitate solving dicult optimization problems, even if all expectations are tractable 16 / 20

17 Variational optimization...isn't easy L (χ, µ, κ, γ, ζ, β, λ, θ, ρ, σ, ϕ) { N B M = C + n=1 b=1 m=1 x nbm log [ S a {0,1} S s=1 ɛ nb + S s=1 χ a s s (1 χ s) 1 a s { r 1 b r b 1 r s exp {c sb } exp {c sb } j=b j=b r 3 ᾱ nk φ ( ] ) m w; ξ nbk, Σ nbk gsi (w) dw c 1 k 1 k=1 S b r b 1 3 ι nb r s exp {c sb } exp {c sb } ᾱ nk φ ( m w; ξ nbk, Σ ) nbk gsi (w) dw s=1 j=b j=b r k=1 } } dr 1 dc 1 dk 1... dr S dc S dk S { S 2 D KL (q(a s), p(a s)) + χ a s s s=1 i=1 (1 χ s) 1 a s [D KL (q(r s a s = i), p(r s a s = i)) + D KL (q (k s, c s a s = i), p s (k s, c s a s = i))] r S c S k S }. 17 / 20

18 However, A structured mean-eld assumption that factorizes across light sources makes most expectations tractable: q(θ) = S q(θ s ). s=1 The delta method for variational inference approximates the remainining expectations. Existing catalogs provide good initial settings for the variational parameters. Light sources are unlikely to contribute photons to distant pixels. 18 / 20

19 Posterior predictive check on a stamp Left is a 51 pixel 51 pixel sub-region of an astronomical image, captured through the r band lter. Each pixel's value corresponds to the number of photons that hit it. The right panel shows E q [F nbm ], the mean of x nbm with respect to our posterior approximation. 19 / 20

20 Results dataset real synthetic model primary celeste improve celeste improve n position (.00) (.01) 654 missed gals 28 / / (.01) 14 / (.01) 654 missed stars 8 / / (.01) 6 / (.01) 654 brightness (.12) (.08) 654 color u-g (.04) (.05) 582 color g-r (.01) (.02) 654 color r-i (.00) (.01) 654 color i-z (.01) (.02) 654 prole (.02) (.02) 237 eccentricity (.01) (.01) 237 scale (.17) (.04) 237 angle (.80) (.80) / 20

Learning an astronomical catalog of the visible universe through scalable Bayesian inference

Learning an astronomical catalog of the visible universe through scalable Bayesian inference Jeffrey Regier with the DESI Collaboration Department of Electrical Engineering and Computer Science UC Berkeley