Fully Bayesian Analysis of Low-Count Astronomical Images

Similar documents
How to Win With Poisson Data: Imaging Gamma-Rays. A. Connors, Eureka Scientific, CHASC, SAMSI06 SaFDe

Accounting for Calibration Uncertainty in Spectral Analysis. David A. van Dyk

Accounting for Calibration Uncertainty in Spectral Analysis. David A. van Dyk

Accounting for Calibration Uncertainty

High Energy Astrophysics:

pyblocxs Bayesian Low-Counts X-ray Spectral Analysis in Sherpa

1 Statistics Aneta Siemiginowska a chapter for X-ray Astronomy Handbook October 2008

Challenges and Methods for Massive Astronomical Data

Upper Limits. David A. van Dyk. Joint work with Vinay Kashyap 2 and Members of the California-Boston AstroStatistics Collaboration.

Embedding Astronomical Computer Models into Complex Statistical Models. David A. van Dyk

Embedding Astronomical Computer Models into Complex Statistical Models. David A. van Dyk

arxiv: v1 [astro-ph.im] 15 Oct 2015

Overlapping Astronomical Sources: Utilizing Spectral Information

New Results of Fully Bayesian

LAT Automated Science Processing for Gamma-Ray Bursts

Doing Right By Massive Data: How To Bring Probability Modeling To The Analysis Of Huge Datasets Without Taking Over The Datacenter

Statistical Tools and Techniques for Solar Astronomers

Embedding Astronomical Computer Models into Principled Statistical Analyses. David A. van Dyk

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics)

Two Statistical Problems in X-ray Astronomy

Outline Challenges of Massive Data Combining approaches Application: Event Detection for Astronomical Data Conclusion. Abstract

X-ray observations of neutron stars and black holes in nearby galaxies

arxiv: v1 [astro-ph.he] 29 Jul 2015

Bayesian Estimation of log N log S

SEARCH FOR AXION- LIKE PARTICLE SIGNATURES IN GAMMA-RAY DATA

Statistical Methods for Astronomy

Analysis Challenges. Lynx Data: From Chandra to Lynx : 2017 Aug 8-10, Cambridge MA

Massive Event Detection. Abstract

X-ray Dark Sources Detection

The γ-ray Milky Way above 10 GeV: Distinguishing Sources from Diffuse Emission

SEARCHING FOR NARROW EMISSION LINES IN X-RAY SPECTRA: COMPUTATION AND METHODS

Astronomical image reduction using the Tractor

Application of deconvolution to images from the EGRET gamma-ray telescope

Fitting Narrow Emission Lines in X-ray Spectra

Fitting Narrow Emission Lines in X-ray Spectra

OVERVIEW OF A 3C279 s FLARE and SIMULATION OF AGNs

IMPROVING THE DECONVOLUTION METHOD FOR ASTEROID IMAGES: OBSERVING 511 DAVIDA, 52 EUROPA, AND 12 VICTORIA

Astronomical Imaging with Maximum Entropy. Retrospective and Outlook. Andy Strong MPE

Model Selection for Galaxy Shapes

The Challenges of Heteroscedastic Measurement Error in Astronomical Data

Signal Model vs. Observed γ-ray Sky

New Multiscale Methods for 2D and 3D Astronomical Data Set

The H.E.S.S. Standard Analysis Technique

A New Method for Characterizing Unresolved Point Sources: applications to Fermi Gamma-Ray Data

NuSTAR observation of the Arches cluster: X-ray spectrum extraction from a 2D image

Discovery of TeV Gamma-ray Emission Towards Supernova Remnant SNR G Last Updated Tuesday, 30 July :01

Iterations

MONTE CARLO METHODS FOR TREATING AND UNDERSTANDING HIGHLY-CORRELATED INSTRUMENT CALIBRATION UNCERTAINTIES

The Impact of Positional Uncertainty on Gamma-Ray Burst Environment Studies

Hunting for Dark Matter in Anisotropies of Gamma-ray Sky: Theory and First Observational Results from Fermi-LAT

Statistical Applications in the Astronomy Literature II Jogesh Babu. Center for Astrostatistics PennState University, USA

Use of the likelihood principle in physics. Statistics II

Incorporating Spectra Into Periodic Timing: Bayesian Energy Quantiles

James Theiler, NIS-2 Jeff Bloch, NIS-2

Summer School in Statistics for Astronomers & Physicists June 5-10, 2005

Search for Point-like. Neutrino Telescope

Detection of hot gas in multi-wavelength datasets. Loïc Verdier DDAYS 2015

Testing the MCS Deconvolution Algorithm on Infrared Data. Michael P. Egan National Geospatial-Intelligence Agency Basic and Applied Research Office

Lecture : Probabilistic Machine Learning

Indirect Dark Matter Searches in the Milky Way Center with the LAT on board Fermi

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA

Setting Limits, Computing Intervals, and Detection

MeV measurements of γ-ray bursts by CGRO- COMPTEL: Revised catalog

The microwave sky as seen by Planck

Constraining Galactic dark matter in the Fermi-LAT sky with photon counts statistics

e e-03. which is distributed with standard chandra processing. visual inspection are included.

A. Chen (INAF-IASF Milano) On behalf of the Fermi collaboration

Fully Bayesian Analysis of Calibration Uncertainty In High Energy Spectral Analysis

ctools" tutorial Ulisses Barres de Almeida FAPESP High-Energy IAG/USP CTA Advanced School

Statistical and Learning Techniques in Computer Vision Lecture 2: Maximum Likelihood and Bayesian Estimation Jens Rittscher and Chuck Stewart

Machine Learning Applications in Astronomy

Using the Fermi-LAT to Search for Indirect Signals from Dark Matter Annihilation

Celeste: Scalable variational inference for a generative model of astronomical images

Gamma-ray variability of radio-loud narrow-line Seyfert 1 galaxies

Primer on statistics:

Probability Distributions

Sequence Obs ID Instrument Exposure uf Exposure f Date Observed Aimpoint (J2000) (ks) (ks) (α, δ)

A Search for Point Sources of High Energy Neutrinos with AMANDA-B10

Construction and Preliminary Application of the Variability Luminosity Estimator

AGN Feedback in the Hot Halo of NGC 4649

Photometric Techniques II Data analysis, errors, completeness

ACIS ( , ) Total e e e e-10 4.

Central image detection with VLBI

New physics is learnt from extreme or fundamental things

The Large Area Telescope on-board of the Fermi Gamma-Ray Space Telescope Mission

AS750 Observational Astronomy

CTB 37A & CTB 37B - The fake twins SNRs

Detection ASTR ASTR509 Jasper Wall Fall term. William Sealey Gosset

Hypothesis Testing, Bayes Theorem, & Parameter Estimation

Computational Challenges from Imaging X-ray Polarimetry. Herman L. Marshall (MIT) and the IXPE Team

Principles of Interferometry. Hans-Rainer Klöckner IMPRS Black Board Lectures 2014

Learning to Classify Galaxy Shapes using the EM Algorithm

An IDL Based Image Deconvolution Software Package

A Modular NMF Matching Algorithm for Radiation Spectra

Statistics notes. A clear statistical framework formulates the logic of what we are doing and why. It allows us to make precise statements.

The Case of the 300 kpc Long X-ray Jet in PKS at z=1.18

Statistics for the LHC Lecture 2: Discovery

COMPTEL OBSERVATIONS OF CYGNUS X-1. M. McConnell, A. Connors, D. Forrest, J. Ryan Space Science Center, University of New Hampshire, Durham, NH 03824

Data Processing in DES

Data Analysis Methods

Transcription:

Fully Bayesian Analysis of Low-Count Astronomical Images 1 Alanna Connors 2 1 Department of Statistics University of California, Irvine 2 Eurika Scientific Thanks to James Chiang, Adam Roy, and The California Harvard AstroStatistics Collaboration. 2007 Joint Statistics Meetings

Outline Image Analysis 1 Image Analysis 2 A Statistical Model Advantages of 3

Outline Image Analysis 1 Image Analysis 2 A Statistical Model Advantages of 3

Data Generation in High-Energy Astrophysics Low Counts Imaging X-ray and γ-ray detectors typically count a small number of photons in each of a large number of pixels. Instrumentation Point Spread Functions can vary with energy and location Exposure Maps can vary across an image Background Contamination Sample Chandra psf s (Karovska et al., ADASS X) EGERT exposure map (area time) EGERT γ-ray counts >1GeV (entire sky and mission life).

Data Generation in High-Energy Astrophysics Low Counts Imaging X-ray and γ-ray detectors typically count a small number of photons in each of a large number of pixels. Instrumentation Point Spread Functions can vary with energy and location Exposure Maps can vary across an image Background Contamination EGERT exposure map (area time) EGERT γ-ray counts >1GeV (entire sky and mission life). Sample Chandra psf s (Karovska et al., ADASS X)

Data Generation in High-Energy Astrophysics Low Counts Imaging X-ray and γ-ray detectors typically count a small number of photons in each of a large number of pixels. Instrumentation Point Spread Functions can vary with energy and location Exposure Maps can vary across an image Background Contamination EGERT γ-ray counts >1GeV (entire sky and mission life). Sample Chandra psf s (Karovska et al., ADASS X) EGERT exposure map (area time)

Data Generation in High-Energy Astrophysics Low Counts Imaging X-ray and γ-ray detectors typically count a small number of photons in each of a large number of pixels. Instrumentation Point Spread Functions can vary with energy and location Exposure Maps can vary across an image Background Contamination EGERT γ-ray counts >1GeV (entire sky and mission life). Sample Chandra psf s (Karovska et al., ADASS X) EGERT exposure map (area time)

Outline Image Analysis 1 Image Analysis 2 A Statistical Model Advantages of 3

Scientific Goals Image Analysis Given our blurry, low-count, inhomogeneous, contaminated data we would like to learn about the structure and unexpected features of an astronomical source. What does the source look like? Are there interesting features? Are these features statistically significant? Are these features an indication of something beyond our current physical understanding of the source? Is our physical model sufficient to explain the data?

Example: Searching for the γ-ray halo. physical model data Is there excess emission/structure in the data?

Example: Residual Emission Dixon et al. fit a model of the form Physical Model Multi-Scale Residual to the data, using Haar wavelets for the residual. Thresholding wavelet coefficient led to the following fit:

Example: But is it Real?? Dixon et al. wondered... The immediate question arises as to the statistical significance of this feature. Though we are able to make rigorous statements about the coefficient-wise and level-wise FDR, similar quantification of object-wise significance (e.g., this blob is significant at the n sigma level ) are difficult.

Outline Image Analysis 1 Image Analysis 2 A Statistical Model Advantages of 3

Three for Low-Count Image Analysis Automate: We would like to automate 1 model fitting to avoid subjective stopping rules used to control reconstruction quality, and 2 the search for structure to avoid choosing parameters to enhance supposed structure. Formulate: We would like to formulate low-count image analysis in the terms of statistical theory to better understand the characteristics of the results. Evaluate: We would like to evaluate 1 the statistical error in the fitted reconstruction under the assumed model, 2 the likelihood that supposed structures exist in the astronomical source, and 3 the plausibility of the model assumptions.

Outline Image Analysis A Statistical Model Advantages of 1 Image Analysis 2 A Statistical Model Advantages of 3

A Statistical Model Advantages of A Statistical Model for the Data Generation Process Smooth Extended Source Flexible non-parametric model, e.g., MRF or Multi-Scale Point Sources Model the location, intensity, and perhaps extent and shape.

Outline Image Analysis A Statistical Model Advantages of 1 Image Analysis 2 A Statistical Model Advantages of 3

A Statistical Model Advantages of Advantages of A Model-Based Formulation 1 The use of well defined statistical estimates such as ML estimates, MAP estimates, or posterior means, eliminates the need for ad-hoc stopping rules (Esch et al., ApJ, 2004). 2 Statistical theory allows computation of statistical errors with Bayesian / frequency properties (Esch et al., 2004). 3 Allows us to incorporate knowledge from other data. 4 Principled methods for comparing / evaluating models. 5 Quantify evidence for supposed structure under a flexible model.

Outline Image Analysis A Statistical Model Advantages of 1 Image Analysis 2 A Statistical Model Advantages of 3

A Statistical Model Advantages of Outside information can be critical with low-count data. Lucky, such information is often available as high-count high-resolution data from a different energy band (e.g., Optical or Radio). Incorporating Information Through Model Components The number of and location of point sources. Smoothing parameters for extended source. Characterize spatial variation of smoothing parameters. Incorporating Information Through Bayesian Prior Distributions Include a region where a point source is likely to exist. Encourage param values similar to those from better data. Use of prior distributions offers a more flexible approach than setting parameters.

Outline Image Analysis 1 Image Analysis 2 A Statistical Model Advantages of 3

Is The Baseline Model Sufficient? Start with known parameterizied physical model (null). Residual is fit with a flexible multi-scale model. Is there structure in residual? We fit a finite mixture distribution with an unknown number of components: Physical Model Multi-Scale Residual If we fit the two-component model, we can look for structure in the fitted residual. Tests are technically and computationally challenging.

A Simulation Study We simulated data under the supposed physical model: Physical Model We fit the two component model: Physical Model Multi-Scale Residual Known Physical Model Simulated Data Posterior Mean of Residual

Simulation Under the Alternative We simulated data under a model with the supposed physical model plus a physically possible feature: Physical Model Multi-Scale Residual We fit the same two component model. Known Physical Model Simulated Data Posterior Mean of Residual

Evidence For The Added Component We examine the joint posterior distribution of 1 Baseline Scale Factor: α 2 Expected Total MS Counts: β in α Physical Model β Multi-Scale Residual Multi-Scale Residual. 3.34 3.36 3.38 3.40 3.42 0.005 0.005 0.015 0.025 Bright Discontinuous Unknown Log10(Expected Total MS Counts) Log10(Baseline Scale Factor) 3 2 1 0 1 2 3 0.01 0.01 0.02 0.03 Null (.) vs Bright Unknown () Log10(Expected Total MS Counts) Log10(Baseline Scale Factor)

Using a Bayesian prior to formulate frequentist test A procedure: Simulation Result 1 Construct a prior distribution that favors a null hypothesis H 0 : object is a point source 2 Compute the posterior and evaluate the propensity of the alternative hypothesis H A : an extended source 3 Using a test statistic, prior parameters can be used to set level (and power). data and PSF result w/ prior I prior II

Outline Image Analysis 1 Image Analysis 2 A Statistical Model Advantages of 3

Posterior Predictive P-values Is the deviation form the baseline model significant? Is the difference in the previous slide typical? For data generated under the null model, what is the sampling distribution of ˆβ, the expected residual count under the two-component model? We can answer these questions computationally: Sample replication datasets under the null model. Sample unknown parameters from their null posterior. Fit the two-component model to each replicate dataset. Compare the resulting distribution of ˆβ with the value fit to the actual data. This strategy is computationally demanding!

Image Analysis The search for highly irregular and unexpected structure in astronomical images posses many statistical challenges. Model-based methods allow us to make progress on formalizing an answering scientific questions. More Sophisticated computational methods and methods for summarizing high dimensional posterior distributions are yet to be explored.

For I Connors, A. and van Dyk, D. A.. How To Win With Non-Gaussian Data: Poisson Goodness-of-Fit. In Statistical Challenges in Modern Astronomy IV. to appear. van Dyk, D. A., Connors, A., Esch, D. N., Freeman, P., Kang, H., Karovska, M., and Kashyap, V. Deconvolution in High Energy Astrophysics: Science, Instrumentation, and Methods (with discussion). Bayesian Analysis, 1, 189 236, 2006. Esch, D. N., Connors, A., Karovska, M., and van Dyk, D. A. An Image Reconstruction Technique with Error Estimates. The Astrophysical Journal, 610, 1213 1227, 2004. Protassov, R., van Dyk, D. A., Connors, A., Kashyap, V. L. and Siemiginowska, A. Statistics: Handle with Care, Detecting Multiple Model Components with the Likelihood Ratio Test. The Astrophysical Journal, 571, 545 559, 2002.