Beyond Univariate Analyses: Multivariate Modeling of Functional Neuroimaging Data

Similar documents
Bayesian Analysis. Bayesian Analysis: Bayesian methods concern one s belief about θ. [Current Belief (Posterior)] (Prior Belief) x (Data) Outline

Extracting fmri features

Neuroimaging for Machine Learners Validation and inference

The General Linear Model (GLM)

The General Linear Model. Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London

The General Linear Model (GLM)

Integrative Methods for Functional and Structural Connectivity

A hierarchical group ICA model for assessing covariate effects on brain functional networks

MIXED EFFECTS MODELS FOR TIME SERIES

Data Analysis I: Single Subject

Group Analysis. Lexicon. Hierarchical models Mixed effect models Random effect (RFX) models Components of variance

Jean-Baptiste Poline

Modelling temporal structure (in noise and signal)

Group analysis. Jean Daunizeau Wellcome Trust Centre for Neuroimaging University College London. SPM Course Edinburgh, April 2010

Statistical Analysis Aspects of Resting State Functional Connectivity

Overview of SPM. Overview. Making the group inferences we want. Non-sphericity Beyond Ordinary Least Squares. Model estimation A word on power

What is NIRS? First-Level Statistical Models 5/18/18

Experimental design of fmri studies

Experimental design of fmri studies & Resting-State fmri

Experimental design of fmri studies

Bayesian inference J. Daunizeau

Dynamic Causal Modelling for fmri

Statistical Inference

Experimental design of fmri studies

The General Linear Model Ivo Dinov

Contents. Introduction The General Linear Model. General Linear Linear Model Model. The General Linear Model, Part I. «Take home» message

Optimization of Designs for fmri

Contrasts and Classical Inference

Overview of Spatial Statistics with Applications to fmri

General linear model: basic

Statistical Inference

Signal Processing for Functional Brain Imaging: General Linear Model (2)

Detecting fmri activation allowing for unknown latency of the hemodynamic response

Bayesian inference J. Daunizeau

Neuroimage Processing

Wellcome Trust Centre for Neuroimaging, UCL, UK.

Experimental Design. Rik Henson. With thanks to: Karl Friston, Andrew Holmes

Testing for group differences in brain functional connectivity

Experimental design of fmri studies

Event-related fmri. Christian Ruff. Laboratory for Social and Neural Systems Research Department of Economics University of Zurich

ESTIMATION OF THE ERROR AUTOCORRELATION MATRIX IN SEMIPARAMETRIC MODEL FOR FMRI DATA

Effective Connectivity & Dynamic Causal Modelling

Mixed effects and Group Modeling for fmri data

Functional Connectivity and Network Methods

arxiv: v3 [stat.ap] 30 Apr 2015

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation

Part 2: Multivariate fmri analysis using a sparsifying spatio-temporal prior

Granger Mediation Analysis of Functional Magnetic Resonance Imaging Time Series

Contents. Data. Introduction & recap Variance components Hierarchical model RFX and summary statistics Variance/covariance matrix «Take home» message

Bayesian modelling of fmri time series

An introduction to Bayesian inference and model comparison J. Daunizeau

Piotr Majer Risk Patterns and Correlated Brain Activities

Resampling Based Design, Evaluation and Interpretation of Neuroimaging Models Lars Kai Hansen DTU Compute Technical University of Denmark

Analysis of longitudinal neuroimaging data with OLS & Sandwich Estimator of variance

First Technical Course, European Centre for Soft Computing, Mieres, Spain. 4th July 2011

Machine learning strategies for fmri analysis

A. Motivation To motivate the analysis of variance framework, we consider the following example.

HST 583 FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA ANALYSIS AND ACQUISITION A REVIEW OF STATISTICS FOR FMRI DATA ANALYSIS

Introduction to General Linear Models

Overview. Experimental Design. A categorical analysis. A taxonomy of design. A taxonomy of design. A taxonomy of design. 1. A Taxonomy of Designs

Morphometrics with SPM12

Statistical inference for MEG

Searching for Nested Oscillations in Frequency and Sensor Space. Will Penny. Wellcome Trust Centre for Neuroimaging. University College London.

Computational Brain Anatomy

Analyses of Variance. Block 2b

ROI ANALYSIS OF PHARMAFMRI DATA:

Statistical Analysis of fmrl Data

Contents. design. Experimental design Introduction & recap Experimental design «Take home» message. N εˆ. DISCOS SPM course, CRC, Liège, 2009

Approaches to Modeling Menstrual Cycle Function

Scaling in Neurosciences: State-of-the-art

Bayesian Treatments of. Neuroimaging Data Will Penny and Karl Friston. 5.1 Introduction

Power Comparisons of the Rician and Gaussian Random Fields Tests for Detecting Signal from Functional Magnetic Resonance Images

Large-scale VAR modeling based on functional MRI data by applying a Bayes shrinkage estimation and a modified local FDR procedure

General multilevel linear modeling for group analysis in FMRI

Bayesian Inference Course, WTCN, UCL, March 2013

Model-free Functional Data Analysis

Morphometry. John Ashburner. Wellcome Trust Centre for Neuroimaging, 12 Queen Square, London, UK.

Morphometry. John Ashburner. Wellcome Trust Centre for Neuroimaging, 12 Queen Square, London, UK. Voxel-Based Morphometry

A test for a conjunction

Hierarchy. Will Penny. 24th March Hierarchy. Will Penny. Linear Models. Convergence. Nonlinear Models. References

Models of effective connectivity & Dynamic Causal Modelling (DCM)

New Procedures for False Discovery Control

Statistical Analysis of Functional ASL Images

Peak Detection for Images

Optimal design for event-related functional magnetic resonance imaging considering both individual stimulus effects and pairwise contrasts

Course topics (tentative) The role of random effects

Decoding conceptual representations

Introduction to Simple Linear Regression

Computing FMRI Activations: Coefficients and t-statistics by Detrending and Multiple Regression

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

Dynamic Causal Models

Selected Topics in Statistics for fmri Data Analysis

Notes on the Multivariate Normal and Related Topics

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Generative classifiers: The Gaussian classifier. Ata Kaban School of Computer Science University of Birmingham

High-dimensional Statistics

PILCO: A Model-Based and Data-Efficient Approach to Policy Search

Appendix A: Review of the General Linear Model

A Unified Estimation Framework for State- Related Changes in Effective Brain Connectivity

Weighted Network Analysis for Groups:

Transcription:

Beyond Univariate Analyses: Multivariate Modeling of Functional Neuroimaging Data F. DuBois Bowman Department of Biostatistics and Bioinformatics Center for Biomedical Imaging Statistics Emory University, Atlanta, GA, 30322 How Not to Analyze Your Data: A Skeptical Introduction to Modeling Methods OHBM 2013 Seattle, WA June 16, 2013 F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 1 / 21

Outline 1 Background 2 General Linear Model 3 Multivariate Linear Model 4 Results 5 Summary F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 2 / 21

The Problem Neuroactivation Studies Task-related designs Seek group-level inferences relating stimuli to neural response Contrasts specify task-related changes (and possibly group differences) in neural activity Estimation and hypothesis testing about group-level contrasts Multiple contrasts for each subject, derived from multiple tasks/effects Linear model framework (linear in parameters) F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 3 / 21

Univariate versus Multivariate Linear Models Univariate Linear Models Involve a single dependent variable May involve one or more independent variables Multiple regression Multivariate Linear Models Involve multiple dependent variables Dependent variables are possibly correlated Over voxels Over time Related stimuli/tasks May involve one or more independent variables F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 4 / 21

Common Univariate Analysis Framework Two-stage Model: Mass Univariate Approach First, fit a linear model separately for each subject (at each voxel) Convolution with a HRF Temporal correlations between scans: AR models (+ white noise) Linear covariance structure Pre-coloring/temporal smoothing [Worsley and Friston, 1995] Pre-whitening [Bullmore et al, 1996; Purdon and Weisskoff, 1998] Alternative structures available for PET [Bowman and Kilts, 2003] Second, fit linear model that combines subject-specific estimates A two-stage (random effects) model Simplifies computations* Sacrifices efficiency For Inference: Compute t-statistics at each voxel and threshold Consider a multiple testing adjustment (Bonferonni-type, FDR, RFT) F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 5 / 21

Common Univariate Analysis Framework Properties Two-stage (random effects) model Simplifies computations Sacrifices efficiency May assume independence between different regression coefficients Assumes independence between different brain locations F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 6 / 21

Data Example Working Memory in Schizophrenia Patients N=28 subjects: 15 schizophrenia patients and 13 healthy controls fmri Tasks: Serial Item Recognition Paradigm (SIRP) Encoding set: Memorize 1, 3, or 5 target digits. Probing set: Shown single digit probes and asked to press a button: with their index finger, if the probe matched with their middle finger, if not. Between conditions, subjects fixated on a flashing cross. 6 runs per subject: (177 scans per run for each subject) 3 runs of working memory tasks on each of 2 days Objective: Compare working memory-related brain activity between patients and controls Data from the Biomedical Informatics Research Network (BIRN) [1]: Potkin et al. (2002). F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 7 / 21

Statistical Modeling General Linear Model: Stage I Y i (v) = X iv β i (v) + H iv γ i (v) + ε i (v) Y i (v) S 1 serial BOLD activity at voxel v. X iv S q design matrix reflecting fixation and WM tasks. β i (v) q 1 parameter vector linking experimental tasks. ε i (v) S 1 random error about ith subject s mean. H iv S m contains other covariates, e.g. high-pass filtering. ε i (v) Normal(0, τv 2 V). F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 8 / 21

Statistical Modeling: Univariate General Linear Model: Stage II (Contrast of Interest) Cβ ij (v) = µ j (v) + e ij (v) β ij (v) stage I fixation and WM parameters; subject i, group j. C contrast matrix (linear combinations of elements in β ij (v)). µ j (v) group-level mean (for group j). e ij (v) random error. e ij (v) Normal(0, σ 2 (v)). F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 9 / 21

Statistical Modeling: Univariate Working Memory Data: n 1 = n b [ µ1 µ 2 b 1 ] + n 1 General Linear Model: Stage II (Matrix Model) Cβ 11 (v) 1 0... [ ] Cβ nc1(v) Cβ 12 (v) = 1 0 µ1 (v) 0 1 µ 2 (v)... Cβ np2(v) 0 1 + e 11 (v). e nc1(v) e 12 (v). e np2(v) (I C)β(v) = Xµ(v) + e(v) F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 10 / 21

Statistical Modeling Mass Univariate Approach May not fully acknowledge the correlations between Multiple effects/contrasts Effects/contrasts at different voxels Separately models contrasts of interest Does not yield information on correlations between contrasts. Does not enable comparisons or linear combinations of contrasts. F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 11 / 21

Statistical Modeling Working Memory Data: = n p n q General Linear Multivariate Model: Stage II [ ] µ11 µ 12 µ 13 + µ 21 µ 22 µ 23 q p n p β(v) = Xµ(v) + e(v) Multiple summary statistics (or contrasts) included for each subject E.g. working memory load contrasts Rows contain data from different subjects Each row assumed to have variance covariance matrix Σ reflecting correlations between summary statistics/contrasts Define θ(v) = Cµ(v)U, e.g. (µ 13 µ 11 ) (µ 23 µ 21 ). F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 12 / 21

Statistical Modeling Contrast Variance: Define θ(v) = Cµ(v)U, e.g. (µ 13 µ 11 ) (µ 23 µ 21 ) C = [ 1 1 ] Var(ˆθ) U = 1 0 1 Var(ˆθ(v)) = Var(ˆθ(v) ) = Var [ vec((cˆµ(v)u) ) ] = C(X X) 1 C U Σ(v)U = ( 1 n c + 1 n p )(σ 2 1 + σ 2 3 2σ 13 ), for WM data. F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 13 / 21

Application to Working Memory Data Stage I analysis produces estimates of visual fixation, WM load 1, WM load 3, and WM load 5 for each subject (FSL, SPM, etc). Compute contrasts of each WM load versus fixation for each subject. (1) Load 1 vs. Fixation, (2) Load 3 vs. Fixation, and (3) Load 5 vs. Fixation. Fit second-stage univariate model (GLM) to estimate the group-level effects and associated variances. Estimate final contrast to compare Load 3 vs Load 1 between controls and schizophrenia patients Calculate test-statistic Fit second-stage multivariate model to estimate the group-level effects and associated variances. Estimate final contrast to compare Load 3 vs Load 1 between controls and schizophrenia patients Calculate test-statistic F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 14 / 21

Estimation Contrast estimates: GLMM GLUM Both methods produce unbiased estimates of regression coefficients and associated contrasts. θ = [task3 task1] Controls [task3 task1] Patients F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 15 / 21

Test Statistics F-statistics: GLMM GLUM The GLMM often produces larger test statistics than the GLUM. θ = [task3 task1] Controls [task3 task1] Patients F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 16 / 21

Test Statistics This figure clearly reveals increased statistical power in GLMM relative to GLUM. F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 17 / 21

Variances Contrast Variances: GLMM GLUM The GLUM produces larger variances and will thus sacrifice statistical power. θ = [task3 task1] Controls [task3 task1] Patients F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 18 / 21

Task Correlations Correlation Matrix: 1-Digit 3-Digit 5-Digit 1-Digit 3-Digit 5-Digit The GLMM yields estimates of correlations between the three working memory loads (stage I contrasts). F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 19 / 21

Summary Mass univariate and multivariate linear models produce identical estimates of task-related changes. Multivariate modeling approaches consider dependencies between multiple dependent variables Multiple effects/contrasts Multiple voxels [Bowman et al., 2008; Zhang et al., 2012] Multiple time points (e.g. longitudinal study) By accounting for correlations, multivariate methods generally Increase efficiency (reduces variability) Increase statistical power. Univariate approaches may have a deleterious effect on inference. F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 20 / 21

Acknowledgements Special Thanks* Phebe Brenne Kemmer, Emory, Biostatistics and Bioinformatics Anthony Pileggi, Emory, Biostatistics and Bioinformatics Lijun Zhang, PhD, Emory, Biostatistics and Bioinformatics Research Team Shuo Chen, PhD, Univ. of Maryland, Epidemiology and Biostatistics Gordana Derado, PhD, CDC Ying Guo, PhD, Emory, Biostatistics and Bioinformatics Jian Kang, PhD, Emory, Biostatistics and Bioinformatics Wenqiong Xue, PhD, Emory, Biostatistics and Bioinformatics Qing He, Emory, Biostatistics and Bioinformatics Yize Zhao, Emory, Biostatistics and Bioinformatics F. D. Bowman (Emory University) OHBM Educational Course Seattle, WA 21 / 21