Peak Detection for Images

Similar documents
Neuroimaging for Machine Learners Validation and inference

The General Linear Model. Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London

New Procedures for False Discovery Control

Lecture 7 April 16, 2018

False Discovery Control in Spatial Multiple Testing

Looking at the Other Side of Bonferroni

Lecture 28. Ingo Ruczinski. December 3, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Statistical Inference

Hunting for significance with multiple testing

False Discovery Rate

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

Statistical Inference

Statistical inference for MEG

Non-specific filtering and control of false positives

Multiple Testing. Hoang Tran. Department of Statistics, Florida State University

Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. T=number of type 2 errors

New Approaches to False Discovery Control

Lecture 27. December 13, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

High-throughput Testing

Extracting fmri features

Multiple testing: Intro & FWER 1

Bumpbars: Inference for region detection. Yuval Benjamini, Hebrew University

Confidence Thresholds and False Discovery Control

Step-down FDR Procedures for Large Numbers of Hypotheses

The General Linear Model (GLM)

Statistical Applications in Genetics and Molecular Biology

Exam: high-dimensional data analysis January 20, 2014

Doing Cosmology with Balls and Envelopes

Statistical testing. Samantha Kleinberg. October 20, 2009

Announcements. Proposals graded

PROCEDURES CONTROLLING THE k-fdr USING. BIVARIATE DISTRIBUTIONS OF THE NULL p-values. Sanat K. Sarkar and Wenge Guo

Aliaksandr Hubin University of Oslo Aliaksandr Hubin (UIO) Bayesian FDR / 25

Department of Statistics University of Central Florida. Technical Report TR APR2007 Revised 25NOV2007

arxiv: v2 [stat.me] 31 Aug 2017

Estimation of a Two-component Mixture Model

Advanced Statistical Methods: Beyond Linear Regression

False Discovery Rates for Random Fields

False discovery rate and related concepts in multiple comparisons problems, with applications to microarray data

Heterogeneity and False Discovery Rate Control

On Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses

arxiv: v1 [math.st] 24 Apr 2015

Miscellanea False discovery rate for scanning statistics

Statistical Emerging Pattern Mining with Multiple Testing Correction

Alpha-Investing. Sequential Control of Expected False Discoveries

Week 5 Video 1 Relationship Mining Correlation Mining

Sta$s$cs for Genomics ( )

Sanat Sarkar Department of Statistics, Temple University Philadelphia, PA 19122, U.S.A. September 11, Abstract

Multiple Change-Point Detection and Analysis of Chromosome Copy Number Variations

Massive MIMO: Signal Structure, Efficient Processing, and Open Problems II

Signal Processing for Functional Brain Imaging: General Linear Model (2)

Controlling Bayes Directional False Discovery Rate in Random Effects Model 1

Optional Stopping Theorem Let X be a martingale and T be a stopping time such

Adaptive Filtering Multiple Testing Procedures for Partial Conjunction Hypotheses

Post-selection Inference for Changepoint Detection

Biostatistics Advanced Methods in Biostatistics IV

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Family-wise Error Rate Control in QTL Mapping and Gene Ontology Graphs

Resampling-Based Control of the FDR

A. Motivation To motivate the analysis of variance framework, we consider the following example.

Incorporation of Sparsity Information in Large-scale Multiple Two-sample t Tests

Topic 3: Hypothesis Testing

Stat 206: Estimation and testing for a mean vector,

Specific Differences. Lukas Meier, Seminar für Statistik

Bayesian inference J. Daunizeau

Journal Club: Higher Criticism

Model Selection Tutorial 2: Problems With Using AIC to Select a Subset of Exposures in a Regression Model

Generalized likelihood ratio tests for complex fmri data

2. What are the tradeoffs among different measures of error (e.g. probability of false alarm, probability of miss, etc.)?

Exceedance Control of the False Discovery Proportion Christopher Genovese 1 and Larry Wasserman 2 Carnegie Mellon University July 10, 2004

arxiv: v3 [stat.me] 29 Oct 2016

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Applying the Benjamini Hochberg procedure to a set of generalized p-values

Lecture 12: Multiple Hypothesis Testing

Controlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method

Improving the Performance of the FDR Procedure Using an Estimator for the Number of True Null Hypotheses

ROI ANALYSIS OF PHARMAFMRI DATA:

Bayesian inference J. Daunizeau

University of California, Berkeley

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided

Asymptotic Results on Adaptive False Discovery Rate Controlling Procedures Based on Kernel Estimators

REPRODUCIBLE ANALYSIS OF HIGH-THROUGHPUT EXPERIMENTS

Understanding Regressions with Observations Collected at High Frequency over Long Span

A General Framework for High-Dimensional Inference and Multiple Testing

Association studies and regression

A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data

The General Linear Model (GLM)

Chapter 2 Signal Processing at Receivers: Detection Theory

More powerful control of the false discovery rate under dependence

Post-Selection Inference

Minitab Project Report Assignment 3

Piotr Majer Risk Patterns and Correlated Brain Activities

Contents. Introduction The General Linear Model. General Linear Linear Model Model. The General Linear Model, Part I. «Take home» message

Journal of Statistical Software

Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics. 1 Executive summary

Some General Types of Tests

Large-Scale Multiple Testing of Correlations

A Large-Sample Approach to Controlling the False Discovery Rate

Modelling temporal structure (in noise and signal)

Experimental design of fmri studies & Resting-State fmri

Two simple sufficient conditions for FDR control

Transcription:

Peak Detection for Images Armin Schwartzman Division of Biostatistics, UC San Diego June 016

Overview How can we improve detection power? Use a less conservative error criterion Take advantage of prior knowledge on the signal How do we know the algorithm works? Show theoretical error control and power consistency Show practical error control and power consistency simulations /39

Error Criteria in Multiple Testing Family-wise error rate FWER: Prob. of getting at least one false positive Computed under the complete null hypothesis of no signal anywhere Does not consider the signal False discovery rate FDR: Expected proportion of false positives among positives Benjamini and Hochberg Computed under the actual distribution of the data, including the signal 3/39

False Discovery Rate FDR False discovery proportion FDP Proportion of false discoveries: FDP = V maxr,1 False discovery rate FDR # false discoveries # discoveries Expected false discovery proportion: FDR = E maxr,1 V [ FDP] = E 4/39

Benjamini-Hochberg Procedure The BH procedure: Given m ordered p-values p 1 < < p m Reject the first k null hypotheses where k mpi = max i : < α i Theorem: If the p-values are independent or weakly positively dependent then m0 FDR m α α 5/39

Fluorescence Nanoscopy 1 μm Courtesy: Alex Egner, Gottingen 6/33

Background Adjusted 1 μm 7/33

Pixelwise BH 1 μm 8/33

Kernel Smoothing 1 μm =1 m ~ = 338 Local maxima 9/33

STEM Algorithm BH 1 μm FDR Level α = 0.001 Intensity Threshold R = 3 Sig. peaks 10/33

Inference for Images Approach Image Processing Statistical Control Familywise Error Rate FWER Control False Discovery Rate FDR Regularized estimation Widely used in neuroimaging Random field theory Widely used in genomics Less conservative Sparse estimates Dimension reduction Peak detection Inference at level of regions Too conservative false negatives Discrete tests Inference about voxels Driven by fit, not error control Threshold is often heuristic 11/39

Statistical Peak Detection Yields inference about spatial clusters, not voxels: 1/39

Statistical Peak Detection Significant clusters correspond to significant local maxima or minima 13/39

N-Dimensional Model Signal: unimodal peaks µ t J = a jh j j= 1 t, h t dt = 1, t [ 0, L] N j L = 00 h = Gaussian J = 6 14/39

N-Dimensional Model Add stationary ergodic Gaussian noise 0, y t = µ t + z t, z t ~ N σ E [ z t z t + s ] = c s σ = 1 cs = Gaussian 15/39

Pointwise Multiple Testing Every pixel has a p-value Number of tests is fixed = L Benjamini- Hochberg BH FDR Level α = 0.1 m = 4 10 tests 4 16/39

FWER Using the Supremum FWER = Prob. supremum exceeds thresh. Expected Euler Characteristic FWER Level α = 0.05 17/39

Pointwise Thresholding Every pixel has a p-value Number of tests is fixed = L FDR Level α = 0.1 m = 4 10 tests 4 18/39

The STEM Algorithm 1. Smooth with a unimodal kernel. Find all the local maxima 3. Compute the p-value of each local maximum 4. Correct for multiple testing STEM = Smooth and TEst Maxima 19/39

1. Kernel Smoothing Unimodal kernel best if same as signal = = = t w t w ds s y s t w t y t w t x N 1, = 6 0/39

1. Kernel Smoothing Unimodal kernel best if same as signal = = = t w t w ds s y s t w t y t w t x N 1, = 6 1/39

1. Kernel Smoothing Unimodal kernel best if same as signal = = = t w t w ds s y s t w t y t w t x N 1, = 6 /39

. Find Local Maxima Local maxima are candidate peaks ~ = T { t [ 0, L] : x t 0, x t 0} N = Dimension reduction! L = 4 10 4 ~ = 56 m Random! 3/39

3. Compute p-values Test at each local maximum the hypothesis H t : µ t = 0 vs. H t : µ t > 0, t T 0 A Define the distribution of the height of a local max. under the complete null hypothesis μt 0 ~ [ z t u is local max] F u P > t = Palm distribution For each observed local maximum, its p-value is computed as p t F x t = ~ [ ], t T 4/39

Isotropic Gaussian Fields In the case N =, the density of the height of a local maximum is In practice, κ is estimated from the data. Φ + + Φ = 3 3 3 3 3 3 1 3 κ κ σ κ κ σ ϕ κ σ κ σ ϕ κ πσ κ κ σ κ σ ϕ σ σ κ u u u u u u u u f 5/39

4. Multiple Testing Get threshold for p-values Number of tests is random! Benjamini- Hochberg BH Level α = 0.1 ~ = 56 m tests 6/39

4. Multiple Testing Get threshold for p-values Number of tests is random! Benjamini- Hochberg BH Level α = 0.1 ~ = 56 m tests 7/39

The problem: Summary So Far Find local significant regions in continuous domains What we have: Test local maxima after smoothing STEM What we are still missing: Distribution of local maxima Theoretical justification Simulations Data example 8/39

Theory: Assumptions True peaks are unimodal, finite support twice-differentiable after kernel smoothing Noise is Gaussian stationary ergodic trice-differentiable after kernel smoothing Kernel is unimodal, finite support 9/39

Theory: Error Definitions FDR = Expected fraction of false positive local max. among sig. local max. True peak Power = Expected fraction of detected peaks Smoothed peak False discovery True discovery False discovery 30/39

Theory: Main Result Let L, a = inf j a j, log L/a 0 # peaks Signal volume J / L N = A 1 + Oa -, 0 < A 1. S 1 / L N = A + Oa -, 0 < A < 1. Let u BH be the BH threshold at level α. 1. FDR u BH α + O p L N Error control + Op a. Power u BH 1 O p a Consistency In repeated samples: a = n so a = n. 31/39

L = 00 Simulation: Setup J = 9 peaks, regularly spaced h b t is isotropic Gaussian kernel with std. dev. b = 3, truncated at ±b zt is zero-mean Gaussian noise, Gaussian isotropic autocorrelation function with bandwidth ν = 1,, 3 Smoothing Gaussian kernel w t = h t,000 repetitions 3/39

Simulation: Performance ν = 0 ν = 1 ν = FDR a = 30 a = 40 Power a = 50 33/3

Optimal Bandwidth How to choose the kernel bandwidth? Small : High noise variance Large : low signal Optimal bandwidth maximizes SNR: arg max Power u arg max P [ x τ > u] arg max h σ 0 Gaussian peaks, Gaussian ACF, Gaussian kernel, std. dev. std. dev. std. dev. b ν opt = b ν 34/39

Functional MRI fmri 35/33

fmri Experiment: Social Thinking time off on off on off Social Social Nonsocial Nonsocial Nonsocial on Social Where in the brain is social information processed? Find differences between on and off states Moran et al. 009, OpenfMRI.org 36/33

Data: fmri: Voxelwise Analysis 71 ˣ 7 ˣ 36 ˣ 179 array Space Time At each voxel s: 1. Fit a linear model: Time Y s = Xβ s + ε s ˆ β s = X T X 1 X T Y s Obs. Stimulus. Compute a test statistic: ~ β s = se ˆ β s [ ˆ ] β s 37/33

STEM Algorithm BH =1.6 ~ = 334 m Local maxima FDR Level α = 0.05 R = 55 Sig. peaks 38/33

The problem: Summary Find local significant regions in continuous domains What we did: Test local maxima after smoothing STEM Algorithm searches for peaks and measures error in terms of peaks Conclusion: Inference should be about the right features 39/39