Statistical Tools and Techniques for Solar Astronomers

Similar documents
Two Statistical Problems in X-ray Astronomy

Ages of stellar populations from color-magnitude diagrams. Paul Baines. September 30, 2008

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Introduction to Probabilistic Machine Learning

Accounting for Calibration Uncertainty in Spectral Analysis. David A. van Dyk

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over

Statistical Methods in Particle Physics

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

X-ray Dark Sources Detection

Bayesian Regression Linear and Logistic Regression

Statistical Methods for Astronomy

Parametric Models. Dr. Shuang LIANG. School of Software Engineering TongJi University Fall, 2012

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics)

1 Statistics Aneta Siemiginowska a chapter for X-ray Astronomy Handbook October 2008

Accounting for Calibration Uncertainty in Spectral Analysis. David A. van Dyk

Treatment and analysis of data Applied statistics Lecture 4: Estimation

Introduction into Bayesian statistics

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)

Probability and Estimation. Alan Moses

Bayesian Methods. David S. Rosenberg. New York University. March 20, 2018

Fully Bayesian Analysis of Low-Count Astronomical Images

Density Estimation. Seungjin Choi

MCMC for big data. Geir Storvik. BigInsight lunch - May Geir Storvik MCMC for big data BigInsight lunch - May / 17

E. Santovetti lesson 4 Maximum likelihood Interval estimation

Fitting Narrow Emission Lines in X-ray Spectra

Using Bayes Factors for Model Selection in High-Energy Astrophysics

Statistical and Learning Techniques in Computer Vision Lecture 2: Maximum Likelihood and Bayesian Estimation Jens Rittscher and Chuck Stewart

CLASS NOTES Models, Algorithms and Data: Introduction to computing 2018

Luminosity Functions

Introduction to Bayesian Inference

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

Lecture 23 Maximum Likelihood Estimation and Bayesian Inference

Bayesian inference. Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark. April 10, 2017

Bayesian Inference and MCMC

Bayesian inference. Fredrik Ronquist and Peter Beerli. October 3, 2007

COS513 LECTURE 8 STATISTICAL CONCEPTS

New Bayesian methods for model comparison

Bayesian Estimation of log N log S

Measuring nuisance parameter effects in Bayesian inference

Hierarchical Bayesian Modeling of Planet Populations. Angie Wolfgang NSF Postdoctoral Fellow, Penn State

g-priors for Linear Regression

Introduction to Machine Learning

Neutral Bayesian reference models for incidence rates of (rare) clinical events

Bayesian Machine Learning

Parameter estimation! and! forecasting! Cristiano Porciani! AIfA, Uni-Bonn!

Parametric Techniques Lecture 3

Frequentist-Bayesian Model Comparisons: A Simple Example

Error analysis for efficiency

Lecture : Probabilistic Machine Learning

STAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01

Statistics notes. A clear statistical framework formulates the logic of what we are doing and why. It allows us to make precise statements.

Embedding Astronomical Computer Models into Complex Statistical Models. David A. van Dyk

pyblocxs Bayesian Low-Counts X-ray Spectral Analysis in Sherpa

Advanced Statistical Methods. Lecture 6

STAT J535: Introduction

Introduction: MLE, MAP, Bayesian reasoning (28/8/13)

A8824: Statistics Notes David Weinberg, Astronomy 8824: Statistics Notes 6 Estimating Errors From Data

Parametric Techniques

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

ECE521 week 3: 23/26 January 2017

Lecture 2: From Linear Regression to Kalman Filter and Beyond

F & B Approaches to a simple model

ebay/google short course: Problem set 2

Embedding Astronomical Computer Models into Complex Statistical Models. David A. van Dyk

Embedding Computer Models for Stellar Evolution into a Coherent Statistical Analysis

Time Series and Dynamic Models

Bayesian RL Seminar. Chris Mansley September 9, 2008

Photometric Techniques II Data analysis, errors, completeness

Mathematical statistics

Hypothesis Testing, Bayes Theorem, & Parameter Estimation

Introduction to Bayesian Methods

Counting Statistics and Error Propagation!

Localization of Radioactive Sources Zhifei Zhang

Measurement And Uncertainty

New Results of Fully Bayesian

Treatment and analysis of data Applied statistics Lecture 6: Bayesian estimation

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Bayesian Inference in Astronomy & Astrophysics A Short Course

Supplementary Note on Bayesian analysis

Linear Models A linear model is defined by the expression

IEOR E4570: Machine Learning for OR&FE Spring 2015 c 2015 by Martin Haugh. The EM Algorithm

Bayesian Methods for Machine Learning

Statistical Data Analysis Stat 3: p-values, parameter estimation

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

Bayesian Phylogenetics:

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Introduction to Machine Learning

Introduction to Probability and Statistics (Continued)

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Notes on pseudo-marginal methods, variational Bayes and ABC

Approximate Bayesian computation: an application to weak-lensing peak counts

Lecture 6: Model Checking and Selection

Accounting for Calibration Uncertainty

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment

Bayesian Estimation of DSGE Models 1 Chapter 3: A Crash Course in Bayesian Inference

Hierarchical Bayesian Modeling

Lecture 4: Probabilistic Learning. Estimation Theory. Classification with Probability Distributions

an introduction to bayesian inference

Lecture 4: Probabilistic Learning

Embedding Astronomical Computer Models into Principled Statistical Analyses. David A. van Dyk

Transcription:

Statistical Tools and Techniques for Solar Astronomers Alexander W Blocker Nathan Stein SolarStat 2012

Outline Outline 1 Introduction & Objectives 2 Statistical issues with astronomical data 3 Example: filter / hardness ratios 4 Standard approach 5 Building a statistical model 6 Using the model

Introduction & Objectives Outline 1 Introduction & Objectives 2 Statistical issues with astronomical data 3 Example: filter / hardness ratios 4 Standard approach 5 Building a statistical model 6 Using the model

Introduction & Objectives Introductions Alex Blocker, Harvard Statistics Model-based stacking for low-count observations Event detection for massive light curve databases Nathan Stein, Harvard Statistics Analysis of CMDs in stellar clusters Robust clustering methods for astronomical datas

Introduction & Objectives Objectives for the day Disclaimer: Will not make statisticians in two hours Goals: Awareness of statistical issues and concepts Understanding of probability modeling approach Familiarity with computational tools Basically, want informed statistical consumers

Introduction & Objectives Background Assuming little statistical background However, should have basic understanding of probability Not assuming knowledge of Bayesian modeling, MCMC, etc.

Statistical issues with astronomical data Outline 1 Introduction & Objectives 2 Statistical issues with astronomical data 3 Example: filter / hardness ratios 4 Standard approach 5 Building a statistical model 6 Using the model

Statistical issues with astronomical data Sources of error Raw data typically consist of photon counts Measurement noise and intrinsic variation Contamination from background Inhomogeneous instrumental sensitivities

Statistical issues with astronomical data Forms of data Starts with images, but many ways to extract numerical data (increasing order of structure and complexity): Point and area measurements (predefined) Light curves Spatiotemporal on predefined regions (local) Global spatiotemporal patterns

Statistical issues with astronomical data Focus Focusing today on simplest structure measurements on predefined regions Core modeling is shared across settings Computational strategies are similar, but greater sophistication is needed for more complex settings Analyses of light curves, whole images, etc. add layers of structure

Example: filter / hardness ratios Outline 1 Introduction & Objectives 2 Statistical issues with astronomical data 3 Example: filter / hardness ratios 4 Standard approach 5 Building a statistical model 6 Using the model

Example: filter / hardness ratios Problem definition Interested in relative flux of source or region between two energies Filter ratios in solar Hardness ratios in high-energy DEMs preferred to filter ratios for solar (e.g. Weber et al. 2005) However, ratios provide a straightforward setting to work with; DEM analysis is an extension

Example: filter / hardness ratios Definitions Denoting fluxes in hard and soft passbands as λ H and λ S Simple ratio Color R = λ S λ H C = log 10 ( λs λ H )

Example: filter / hardness ratios Data Observations Photon counts from region of interest in hard (H) and soft (S) passbands extracted from images Similar counts from area with background only in each passband (B H and B S ) Calibration Sensitivity of instrument to each band e H and e S (effective area) Relative effective area for background region r

Standard approach Outline 1 Introduction & Objectives 2 Statistical issues with astronomical data 3 Example: filter / hardness ratios 4 Standard approach 5 Building a statistical model 6 Using the model

Standard approach Simple case Without background or other corrections, standard approach just substitutes counts for fluxes: R = S H ) C = log 10 ( S H

Standard approach Corrections Adjusting for background, standard approach would use: R = S B S/r H B H /r ( ) S BS /r C = log 10 H B H /r

Standard approach Error estimates Standard errors of these are usually propagated Gaussian approximation (linear approximation): σ R = S B S/r σs 2 + σ2 B S /r 2 H B H /r (S B S /r) 2 + σ2 H + σ2 B H /r 2 (H B H /r) 2 1 σs 2 σ C = + σ2 B S /r 2 ln(10) (S B S /r) 2 + σ2 H + σ2 B H /r 2 (H B H /r) 2 where σ S, σ H, σ BS, and σ BH are typically approximated with the Gehrels prescription (Gehrels 1986) σ X X + 0.75 + 1

Standard approach From sigma to intervals Typically not interested in σ for its own sake Want to summarize uncertainty about R or C Standard statistical approach is to use intervals Often constructed as ˆθ ± k σ Confidence interpretation: want interval to include true value at least as often as stated e.g., for Gaussian data, X ± σ is a 68% interval; X ± 1.96σ is 95%

Standard approach Flaws Sigma does not summarize errors on R or C; actual uncertainty can be highly asymmetric Gaussian assumption is flawed for low-count observations; intervals are not valid Background subtraction leads to bias and inefficiency (van Dyk et al. 2001) Not accounting for differences in detector sensitivity effectively

Building a statistical model Outline 1 Introduction & Objectives 2 Statistical issues with astronomical data 3 Example: filter / hardness ratios 4 Standard approach 5 Building a statistical model 6 Using the model

Building a statistical model Models vs. procedures Classical approach is set of procedures; not derived from deeper framework Model-based approach starts from description of data-generating process Model is realistic (though not necessarily physical) description of underlying mechanisms Why model? Efficient use of information, consistency, and incorporation of complex error structure

Building a statistical model Parameters vs. observations Parameters regulate underlying processes (source and observation) Ideally invariant to detail of observation structure (e.g. flux, not expected counts) Target of inference For hardness ratios, parameters are source fluxes (λ H and λ S ) and background fluxes (ξ H and ξ S ) Observations are noisy outputs of parameters S, H, B S, and B H in ratio problem Input for, not target of, inference

Building a statistical model Distributions as connections Connect parameters to observations through distributions Background counts depend only on background flux and exposure B S Poisson(r e S ξ S ) B H Poisson(r e H ξ H ) where e is the effective area for the source region Source counts depend on both source and background fluxes S Poisson(e S (λ S + ξ S )) H Poisson(e H (λ H + ξ H ))

Building a statistical model Augmentation Sometimes useful to expand model by expanding observations Looks like adding complication, but can simplify computation and help with interpretation Usually ask what observations would make this problem easy? For ratio case, would be easy if we knew which parts for S and H came from source vs. background So, augment with background counts η S Poisson(e S λ S ) and η H Poisson(e H λ H ), β S Poisson(e S ξ S ) and β H Poisson(e H ξ H ), S = η S + β S and H = η H + β H

Using the model Outline 1 Introduction & Objectives 2 Statistical issues with astronomical data 3 Example: filter / hardness ratios 4 Standard approach 5 Building a statistical model 6 Using the model

Using the model Likelihood Likelihood is at the core of model-based inference Definition Likelihood is the probability of observing your (fixed) sample, as a function of the parameters. If your sample is Y and parameters are θ, likelihood is L(θ) Pr(Y θ). Likelihood is not the probability of your parameters taking on a particular value Higher values of likelihood indicate more support from data for given parameter value Likelihood function contains all information for inference with given model

Using the model Likelihood, in particular For independent observations, likelihood is just the product of their probabilities. So, for the ratio problem, the likelihood is: L(λ S, λ H, ξ S, ξ H ) P(B S ξ S ) P(S λ S, ξ S ) P(B H ξ H ) P(H λ H, ξ H ) Here, all of these probabilities take the form of the Poisson PMF

Using the model MLE One way to use the likelihood is to find the parameter values that maximize it; known as maximum likelihood estimation Resembles χ 2 fitting, but error measures need not be squared Has some desirable properties in large samples (efficiency, known approximate errors, etc.) Requires numerical maximization for most realistic models Can be badly misleading for small samples and settings with highly asymmetric uncertainty

Using the model Bayesian inference Uses Bayes Theorem to quantify uncertainty and perform estimation P(Y θ)p(θ) P(θ Y ) = P(Y ) P(θ Y ) is posterior distribution of parameters Estimates from mean, median, etc. of this distribution Intervals, standard errors, etc. from its quantiles and spread Can derive or simulate posterior of any function of θ using this posterior Drawback: need prior P(θ) Typically aim to choose prior that has little effect on results; check through sensitivity analysis

Using the model Implementation On to Nathan for computation!