Modelling temporal structure (in noise and signal)

Similar documents
Model-free Functional Data Analysis

Statistical Analysis of fmrl Data

Statistical Analysis Aspects of Resting State Functional Connectivity

The General Linear Model (GLM)

Modelling with Independent Components

Extracting fmri features

Neuroimaging for Machine Learners Validation and inference

Data Analysis I: Single Subject

Dynamic Causal Modelling for fmri

Effective Connectivity & Dynamic Causal Modelling

The General Linear Model. Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London

Functional Connectivity and Network Methods

Experimental design of fmri studies

Experimental design of fmri studies

Experimental design of fmri studies & Resting-State fmri

New Machine Learning Methods for Neuroimaging

Mixed effects and Group Modeling for fmri data

Overview of SPM. Overview. Making the group inferences we want. Non-sphericity Beyond Ordinary Least Squares. Model estimation A word on power

Contents. Introduction The General Linear Model. General Linear Linear Model Model. The General Linear Model, Part I. «Take home» message

Experimental design of fmri studies

Experimental design of fmri studies

Bayesian inference J. Daunizeau

Event-related fmri. Christian Ruff. Laboratory for Social and Neural Systems Research Department of Economics University of Zurich

Piotr Majer Risk Patterns and Correlated Brain Activities

Model Comparison. Course on Bayesian Inference, WTCN, UCL, February Model Comparison. Bayes rule for models. Linear Models. AIC and BIC.

Signal Processing for Functional Brain Imaging: General Linear Model (2)

Bayesian Analysis. Bayesian Analysis: Bayesian methods concern one s belief about θ. [Current Belief (Posterior)] (Prior Belief) x (Data) Outline

Bayesian inference J. Daunizeau

The General Linear Model (GLM)

Dynamic Causal Models

First Technical Course, European Centre for Soft Computing, Mieres, Spain. 4th July 2011

Computing FMRI Activations: Coefficients and t-statistics by Detrending and Multiple Regression

A MULTIVARIATE MODEL FOR COMPARISON OF TWO DATASETS AND ITS APPLICATION TO FMRI ANALYSIS

Multilevel linear modelling for FMRI group analysis using Bayesian inference

Group analysis. Jean Daunizeau Wellcome Trust Centre for Neuroimaging University College London. SPM Course Edinburgh, April 2010

Statistical models for neural encoding

Group Analysis. Lexicon. Hierarchical models Mixed effect models Random effect (RFX) models Components of variance

Stochastic Dynamic Causal Modelling for resting-state fmri

What is NIRS? First-Level Statistical Models 5/18/18

MIXED EFFECTS MODELS FOR TIME SERIES

Jean-Baptiste Poline

Beyond Univariate Analyses: Multivariate Modeling of Functional Neuroimaging Data

STA 4273H: Statistical Machine Learning

Wellcome Trust Centre for Neuroimaging, UCL, UK.

Causal modeling of fmri: temporal precedence and spatial exploration

An introduction to Bayesian inference and model comparison J. Daunizeau

+ + ( + ) = Linear recurrent networks. Simpler, much more amenable to analytic treatment E.g. by choosing

HST 583 FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA ANALYSIS AND ACQUISITION A REVIEW OF STATISTICS FOR FMRI DATA ANALYSIS

New Procedures for False Discovery Control

Nonlinear reverse-correlation with synthesized naturalistic noise

Course in Data Science

Machine Learning. Lecture 4: Regularization and Bayesian Statistics. Feng Li.

Detecting fmri activation allowing for unknown latency of the hemodynamic response

A prior distribution over model space p(m) (or hypothesis space ) can be updated to a posterior distribution after observing data y.

Hierarchy. Will Penny. 24th March Hierarchy. Will Penny. Linear Models. Convergence. Nonlinear Models. References

Bayesian modelling of fmri time series

Part 2: Multivariate fmri analysis using a sparsifying spatio-temporal prior

Principal Component Analysis

Contrasts and Classical Inference

arxiv: v2 [q-bio.qm] 21 Aug 2017

! Dimension Reduction

Granger Mediation Analysis of Functional Magnetic Resonance Imaging Time Series

Machine learning strategies for fmri analysis

1/12/2017. Computational neuroscience. Neurotechnology.

Optimization of Designs for fmri

A hierarchical group ICA model for assessing covariate effects on brain functional networks

Log Gaussian Cox Processes. Chi Group Meeting February 23, 2016

EEG- Signal Processing

Overview of Spatial Statistics with Applications to fmri

Statistical Inference

SPIKE TRIGGERED APPROACHES. Odelia Schwartz Computational Neuroscience Course 2017

Hierarchical Dirichlet Processes with Random Effects

Nonparametric Bayesian Methods (Gaussian Processes)

Spatial Source Filtering. Outline EEG/ERP. ERPs) Event-related Potentials (ERPs( EEG

Bayesian Inference. Chris Mathys Wellcome Trust Centre for Neuroimaging UCL. London SPM Course

ECE521 week 3: 23/26 January 2017

Strategies for Discovering Mechanisms of Mind using fmri: 6 NUMBERS. Joseph Ramsey, Ruben Sanchez Romero and Clark Glymour

Dynamic causal modeling for fmri

Experimental Design. Rik Henson. With thanks to: Karl Friston, Andrew Holmes

Independent Component Analysis. Contents

Models of effective connectivity & Dynamic Causal Modelling (DCM)

FIL. Event-related. fmri. Rik Henson. With thanks to: Karl Friston, Oliver Josephs

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation

Statistical Inference

General linear model: basic

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Annealed Importance Sampling for Neural Mass Models

Probabilistic Graphical Models

Variational solution to hemodynamic and perfusion response estimation from ASL fmri data

Chart types and when to use them

The ASL signal. Parenchy mal signal. Venous signal. Arterial signal. Input Function (Label) Dispersion: (t e -kt ) Relaxation: (e -t/t1a )

Contents. design. Experimental design Introduction & recap Experimental design «Take home» message. N εˆ. DISCOS SPM course, CRC, Liège, 2009

Supplementary Note on Bayesian analysis

Homework #2 Due date: 2/19/2013 (Tuesday) Translate the slice Two Main Paths in Lecture Part II: Neurophysiology and BOLD to words.

Comparison of stochastic and variational solutions to ASL fmri data analysis

Pattern Recognition and Machine Learning

What is the neural code? Sekuler lab, Brandeis

MultiDimensional Signal Processing Master Degree in Ingegneria delle Telecomunicazioni A.A

Dynamic causal modeling for fmri

Observed Brain Dynamics

Transcription:

Modelling temporal structure (in noise and signal) Mark Woolrich, Christian Beckmann*, Salima Makni & Steve Smith FMRIB, Oxford *Imperial/FMRIB temporal noise: modelling temporal autocorrelation temporal signal: FLOBS HRF optimal basis functions temporal signal: HRF deconvolution spatiotemporally structured signal / noise: ICA functional grand-plan : integrating ICA+GLM

Non-independent/Autocorrelation/ Coloured FMRI noise power Coloured / autocorrelated White / independent frequency Uncorrected, this causes: Even after high-pass filtering, FMRI noise has extra power at low frequencies (positive autocorrelation or temporal smoothness) - biased stats ( increased false positives) - decreased sensitivity

FMRIB s Improved Linear Modelling (FILM) FILM is used to fit the GLM voxel-wise in FEAT Deals with the autocorrelation locally and uses prewhitening FILM estimates autocorrelation by looking at the residuals of the GLM fit: Y = Xβ + ɛ residuals = Y X β

FMRIB s Improved Linear Modelling (FILM) Power vs. freq in the residuals 1) Fit the GLM and estimate the autocorrelation on the residuals power frequency

FMRIB s Improved Linear Modelling (FILM) Power vs. freq in the residuals 1) Fit the GLM and estimate the autocorrelation on the residuals power frequency power 2) Spatially and spectrally smooth the data frequency

FMRIB s Improved Linear Modelling (FILM) Power vs. freq in the residuals 1) Fit the GLM and estimate the autocorrelation on the residuals power frequency power 2) Spatially and spectrally smooth the data power frequency 3) Construct prewhitening filter to undo autocorrelation frequency

FMRIB s Improved Linear Modelling (FILM) 1) Fit the GLM and estimate the autocorrelation on the residuals power frequency power 2) Spatially and spectrally smooth the data power frequency 3) Construct prewhitening filter to undo autocorrelation frequency 4) Apply filter to data and design matrix and refit power frequency

FMRIB s Improved Linear Modelling (FILM)

Dealing with Variations in Haemodynamics The haemodynamic responses vary between subjects and areas of the brain How do we allow haemodynamics to be flexible but remain plausible? Reminder: the haemodynamic response function (HRF) describes the BOLD response to a short burst of neural activity

Temporal Derivatives A very simple approach to providing HRF variability is to include (alongside each EV) the EV temporal derivative Including the temporal derivative of an EV allows for a small shift in time of that EV This is based upon a first order Taylor series expansion EV Temporal derivative data model fit without derivative model fit with derivative

Using Parameterised HRFs We need to allow flexibility in the shape of the fitted HRF Parameterise HRF shape and fit shape parameters to the data Needs nonlinear fitting - HARD

Using Basis Sets We need to allow flexibility in the shape of the fitted HRF Parameterise HRF shape and fit shape parameters to the data We can use linear basis sets to span the space of expected HRF shapes Needs nonlinear fitting - HARD Linear fitting (use GLM) - EASY

How do HRF Basis Sets Work? Different linear combinations of the basis functions can be used to create different HRF shapes basis fn 1 basis fn 2 basis fn 3 HRF 1.0* + 0.3* + -0.1* =

How do HRF Basis Sets Work? Different linear combinations of the basis functions can be used to create different HRF shapes basis fn 1 basis fn 2 basis fn 3 HRF 1.0* + 0.3* + -0.1* = 0.7* + -0.2* + 0.5* =

FMRIB s Linear Optimal Basis Set (FLOBS) Using FLOBS we can: Specify a priori expectations of parameterised HRF shapes Generate an appropriate basis set

Generating FLOBs (1) Take samples of the HRF

Generating FLOBs (2) Perform SVD (3) Select the top eigenvectors as the optimal basis set Canonical HRF

Generating FLOBs (2) Perform SVD (3) Select the top eigenvectors as the optimal basis set Canonical HRF temporal derivative

Generating FLOBs (2) Perform SVD (3) Select the top eigenvectors as the optimal basis set dispersion derivative Canonical HRF temporal derivative The resulting basis set can then be used in FEAT

Bayesian Inference Priors on HRF parameters HRF Joint posterior distribution p(a, c, m 1, m 2... Y ) BOLD FMRI data Neural Activity *A + Gaussian noise Woolrich et al., TMI, 2004

Bayesian Inference Priors on HRF parameters HRF Marginal posterior distribution p(a Y ) = p(a, c, m 1, m 2... Y )dcdm 1 dm 2... BOLD FMRI data Neural Activity *A + Gaussian noise Infer using MCMC Woolrich et al., TMI, 2004

Temporal deconvolution of FMRI timecourses Inputs are raw paradigm (stimulation and modulation ) timecourses Model based on Bilinear Dynamical Systems (Penny 2005), where modulatory input changes neural response to stimulation What s new: estimate HRF from data full Bayesian inference on model, using VB Makni, NeuroImage 2008

CORRECTED PROOF

To do the Bayes, either: MCMC (computer takes ages) Variational Bayes (maths takes ages) CORRECTED PROOF

To do the Bayes, either: MCMC (computer takes ages) Variational Bayes (maths takes ages) CORRECTED PROOF

Model-free Functional Data Analysis MELODIC Multivariate Exploratory Linear Optimised Decomposition into Independent Components decomposes data into a set of statistically independent spatial component maps and associated time courses can perform multi-subject/ multi-session analysis fully automated (incl. estimation of the number of components) inference on IC maps using alternative hypothesis testing

Model-free Functional Data Analysis MELODIC Multivariate Exploratory Linear Optimised Decomposition into Independent Components decomposes data into a set of statistically independent spatial component maps and associated time courses can perform multi-subject/ multi-session analysis fully automated (incl. estimation of the number of components) inference on IC maps using alternative hypothesis testing

EDA techniques for FMRI are mostly multivariate often provide a multivariate linear decomposition: space # maps space time Scan #k FMRI data = time # maps spatial maps Data is represented as a 2D matrix and decomposed into factor matrices (or modes)

Model Order Selection can estimate the model order from the Eigenspectrum of the data covariance matrix (corrected using Wishart random matrix theory) approximate the Bayesian evidence for the model order for a probabilistic PCA model (PPCA) AR 16 noise + signal 1 1 0.9 0.8 0.95 15 20 25 30 0.7 0.6 0 30 60 90 120 150 Figure Laplace approximation 3: Bayesian BIC AIC Minka, TR 514 MIT Media Lab 2000

Probabilistic ICA GLM analysis standard ICA (unconstrained)

Probabilistic ICA GLM analysis probabilistic ICA

Probabilistic ICA GLM analysis probabilistic ICA designed to address the overfitting problem : tries to avoid generation of spurious results high spatial sensitivity and specificity

Applications EDA techniques can be useful to investigate the BOLD response estimate artefacts in the data find areas of activation which respond in a nonstandard way analyse data for which no model of the BOLD response is available

Investigate BOLD response estimated signal time course standard hrf model

Applications EDA techniques can be useful to investigate the BOLD response estimate artefacts in the data find areas of activation which respond in a nonstandard way analyse data for which no model of the BOLD response is available

slice drop-outs

gradient instability

EPI ghost

high-frequency noise

head motion

field inhomogeneity

eye-related artefacts

eye-related artefacts

eye-related artefacts Wrap around

Structured Noise and the GLM structured noise appears: in the GLM residuals and inflate variance estimates (more false negatives) in the parameter estimates (more false positives and/or false negatives) In either case lead to suboptimal estimates and wrong inference!

Structured noise and GLM Z-stats bias Correlations of the noise time courses with typical FMRI regressors can cause a shift in the histogram of the Z-statistics Thresholded maps will have wrong false-positive rate

Denoising FMRI before denoising 500 PE raw 300 Z raw 400 250 Example: left vs right hand finger tapping 300 200 100 0!1000!500 0 500 1000 500 PE clean 200 150 100 50 RIGHT 0!10!5 0 5 10 300 Z clean 400 300 200 100 250 200 150 100 50 Johansen-Berg et al. PNAS 2002 0!1000!500 0 500 1000 after denoising 0!10!5 0 5 10

Denoising FMRI before denoising 600 PE raw 350 Z raw Example: left vs right hand finger tapping 500 400 300 200 100 0!1000!500 0 500 1000 600 PE clean 300 250 200 150 100 50 LEFT 0!10!5 0 5 10 350 Z clean 500 400 300 200 100 300 250 200 150 100 50 Johansen-Berg et al. PNAS 2002 0!1000!500 0 500 1000 after denoising 0!10!5 0 5 10

Denoising FMRI before denoising Example: left vs right hand finger tapping LEFT - RIGHT contrast Johansen-Berg et al. PNAS 2002 after denoising

Apparent variability McGonigle et al.: 33 Sessions under motor paradigm de-noising data by regressing out noise: reduced apparent session variability

Applications EDA techniques can be useful to investigate the BOLD response estimate artefacts in the data find areas of activation which respond in a nonstandard way analyse data for which no model of the BOLD response is available

PICA on resting data perform ICA on null data and compare spatial maps between subjects/ scans ICA maps depict spatially localised and temporally coherent signal changes Example: ICA maps - 1 subject at 3 different sessions

Spatial characteristics (a) x=17 y=-73 z=-12 (b) x=-13 y=-61 z=6 (a) x=17 y=-73 z=-12 (b) x=-13 y=-61 z=6 R (c) L Medial visual cortex y=-17 x=3 x=-4 y=-29 x=45 R (d) z=33 (f) y=-29 x=1 x=5 (f) L (h) z=27 L Sensori-motor system y=6 x=-45 z=27 L R z=47 z=51 y=6 x=5 L y=-42 y=-21 R z=33 z=51 L R Auditory system x=-4 Lateral Visual Cortex L R (g) z=1.5 L y=-21 x=1 L R (e) (d) L R (e) z=1.5 y=-17 x=3 R (c) R y=-42 z=47

Spatial characteristics (e) x=-4 y=-29 z=33 (f) x=5 y=6 z=27 R L R L Visuospatial system Executive control (g) x=45 y=-42 z=47 (h) x=-45 y=-42 z=47 R L Visual Stream R L

Temporal deconvolution of ICA timecourses What are the task-related components? Use explicit time series model on the ICgenerated temporal modes (using BDS)

Example: BDS/ICA integration (b) (c) Estimate the mean of the posterior distribution over the neuronal response Estimate the mean of the Gaussian posterior distribution of the HRF 1 (d) 1.5 (e) 0.5 0!1.5 0 0 50 100 150 200 250 300 350 400 450 500 posterior probability of neuronal response > 0 for the IC 0 50 100 150 200 250 300 350 400 450 500 Time (s) Assess the total model fit

Integrating GLM and ICA for FMRI analysis GLM β i = + Noise Data Predicted response Estimated effect

Integrating GLM and ICA for FMRI analysis GLM β i = + Noise Data Predicted response Estimated effect + clear conclusions on a particular question - results depend on the model

Integrating GLM and ICA for FMRI analysis PICA space # maps space time Scan #k FMRI data = time # maps spatial maps + Noise Data Estimated artefact time courses Estimated spatially independent maps

Integrating GLM and ICA for FMRI analysis PICA space # maps space time Scan #k FMRI data = time # maps spatial maps + Noise Data Estimated artefact time courses Estimated spatially independent maps + data driven and multivariate approach - no knowledge about the fmri paradigm is used - can be hard to interpret activation results

Integrating GLM and ICA for FMRI analysis GLM + ICA space # maps space time Scan #k FMRI data = time # maps spatial maps + Noise Data Predicted response Estimated artefact time courses Estimated spatially independent maps

Integrating GLM and ICA for FMRI analysis GLM + ICA space # maps space time Scan #k FMRI data = time # maps spatial maps + Noise Data Predicted response Estimated artefact time courses Estimated spatially independent maps Stimulus model-based hypothesis testing Adaptive model-free artefact modelling - e.g. stimulus correlated motion, physiological noise, networks of spontaneous neuronal activity

0 5 10 15 20 25 30 35 40 45 Modelled ICs Estimated HRFs 10 auditory stimulus visual stimulus 5 2 0 Time (s) Spatial map Time course Example model free IC # maps space time # maps spatial maps Spatial map Time course