Piotr Majer Risk Patterns and Correlated Brain Activities

Similar documents
The General Linear Model (GLM)

Modelling temporal structure (in noise and signal)

Principal components in an asymmetric norm

Contents. Introduction The General Linear Model. General Linear Linear Model Model. The General Linear Model, Part I. «Take home» message

Extracting fmri features

Smooth Common Principal Component Analysis

HST 583 FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA ANALYSIS AND ACQUISITION A REVIEW OF STATISTICS FOR FMRI DATA ANALYSIS

Neuroimaging for Machine Learners Validation and inference

MIXED EFFECTS MODELS FOR TIME SERIES

The General Linear Model. Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London


Recovering Copulae from Conditional Quantiles

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.

A Semi-Parametric Measure for Systemic Risk

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

Overview of Spatial Statistics with Applications to fmri

High Dimensional Nonstationary Time Series Modelling with Generalized Dynamic Semiparametric Factor Model

Event-related fmri. Christian Ruff. Laboratory for Social and Neural Systems Research Department of Economics University of Zurich

Time Varying Hierarchical Archimedean Copulae (HALOC)

Time Series Modelling with Semiparametric Factor Dynamics

Experimental design of fmri studies & Resting-State fmri

New Machine Learning Methods for Neuroimaging

Machine learning strategies for fmri analysis

The General Linear Model (GLM)

Dynamic Causal Modelling for fmri

Integrative Methods for Functional and Structural Connectivity

The idiosyncratic nature of confidence

Semi-supervised subspace analysis of human functional magnetic resonance imaging data

Experimental design of fmri studies

ROI analysis of pharmafmri data: an adaptive approach for global testing

Peak Detection for Images

Statistical Analysis of fmrl Data

Model-free Functional Data Analysis

Principal components in an asymmetric norm

Optimization of Designs for fmri

Experimental design of fmri studies

Econometrics I. Introduction to Regression Analysis (Outline of the course) Marijus Radavi ius

Machine Learning. Lecture 3: Logistic Regression. Feng Li.

SUPPLEMENTARY APPENDICES FOR WAVELET-DOMAIN REGRESSION AND PREDICTIVE INFERENCE IN PSYCHIATRIC NEUROIMAGING

General linear model: basic

Jean-Baptiste Poline

Least Absolute Shrinkage is Equivalent to Quadratic Penalization

Modeling of salmonella-prevalence in the case of chicken

Finding a Basis for the Neural State

Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization

ROI ANALYSIS OF PHARMAFMRI DATA:

Experimental design of fmri studies

Testing Monotonicity of Pricing Kernels

Support Vector Machines (SVM) in bioinformatics. Day 1: Introduction to SVM

A. Motivation To motivate the analysis of variance framework, we consider the following example.

Discriminative Learning and Big Data

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Lecture 3: Statistical Decision Theory (Part II)

Beyond Univariate Analyses: Multivariate Modeling of Functional Neuroimaging Data

Towards a Regression using Tensors

Kernel Methods and Support Vector Machines

Support Vector Machines

CS 195-5: Machine Learning Problem Set 1

Dynamic Causal Models

10-701/ Machine Learning - Midterm Exam, Fall 2010

Generative v. Discriminative classifiers Intuition

Linear Models in Machine Learning

Linear Regression and Its Applications

Experimental design of fmri studies

A review of some semiparametric regression models with application to scoring

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Bayesian Learning. Tobias Scheffer, Niels Landwehr

Machine Learning for Biomedical Engineering. Enrico Grisan

Introduction to Linear Regression Analysis

Multivariate Lineare Modelle

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation

Group analysis. Jean Daunizeau Wellcome Trust Centre for Neuroimaging University College London. SPM Course Edinburgh, April 2010

Volume 03, Issue 6. Comparison of Panel Cointegration Tests

Max Margin-Classifier

Bayesian Learning (II)

Discriminative Models

CMU-Q Lecture 24:

Generative v. Discriminative classifiers Intuition

Granger Mediation Analysis of Functional Magnetic Resonance Imaging Time Series

Decoding conceptual representations

Managing Uncertainty

Time Series Analysis

Classification. Chapter Introduction. 6.2 The Bayes classifier

Data Analysis I: Single Subject

Linear classifiers: Logistic regression

Linear classifiers: Overfitting and regularization

First Technical Course, European Centre for Soft Computing, Mieres, Spain. 4th July 2011

Group Analysis. Lexicon. Hierarchical models Mixed effect models Random effect (RFX) models Components of variance

Mixed-effects Maximum Likelihood Difference Scaling

Outlines: (June 11, 1996) Instructor:

RS-fMRI analysis in healthy subjects confirms gender based differences

SRNDNA Model Fitting in RL Workshop

Machine Learning Basics

Statistical Inference

Averaging Estimators for Regressions with a Possible Structural Break

Solving Classification Problems By Knowledge Sets

Dynamic Modeling of Brain Activity

Fractal functional regression for classification of gene expression data by wavelets

Support Vector Machines

Linear Models for Regression

Transcription:

Alena My²i ková Piotr Majer Song Song Alena Myšičková Peter N. C. Mohr Peter N. C. Mohr Wolfgang K. Härdle Song Song Hauke R. Heekeren Wolfgang K. Härdle Hauke R. Heekeren C.A.S.E. Centre C.A.S.E. for Applied Centre for Statistics Applied Statistics and and Economics Economics Humboldt-Universität zu Berlin Humboldt-Universität Freie Universität zu Berlin Berlin Freie Universität MaxBerlin Planck Institute for Molecular Max Planck Institute Genetics for Molecular Genetics http://lvb.wiwi.hu-berlin.de http://lvb.wiwi.hu-berlin.de http://www.languages-of-emotion.de http://www.languages-of-emotion.de http://www.molgen.mpg.de http://www.molgen.mpg.de

Motivation 1-1 Risk Perception Which part is activated during risk related decisions? Can statistical analysis help to detect this area? Response curve (to stimuli)? classify risky people?

Motivation 1-2 Risk Perception Survey conducted by Max Planck Institute 22 young, native German, right-handed and healthy volunteers 3 subjects with extensive head movements (> 5mm) 2 subjects with dierent stimulus frequency n = 22 (3 + 2) = 17 Experiment Risk Perception and Investment Decision (RPID) task ( 45) fmri images every 2.5 sec.

Motivation 1-3 Risk Perception Returns Pause Decision

Motivation 1-4 Risk Perception Thermodynamics Theoretical framework Risk-return model Mohr et al., 2010 Mechanical Equivalent of Heat the rst law of thermodynamics Mayer, 1841 Empirical evidence fmri analysis Experiments Joule apparatus Joule, 1843

Motivation 1-5 Risk Perception functional Magnetic Resonance Imaging Measuring Blood Oxygenation Level Dependent (BOLD) eect every 2-3 sec High-dimensional, high frequency & large data set

Motivation 1-6 Risk Perception Figure 1: Example of a fmri image at xed time point, 12 horizontal slices of the brain's scan. fmri

Motivation 1-7 fmri Is there a signicant reaction to specic stimuli in the hemodynamic response? Voxel X

Motivation 1-8 fmri methods Voxel-wise GLM Voxel-wise GLM linear model for each voxel separately strong a priori hypothesis necessary Dynamic Semiparametric Factor Model (DSFM) Use a time & space dynamic approach Separate low dim time dynamics from space functions Low dim time series exploratory analysis

Outline 1. Motivation 2. DSFM 3. Results vs. Subject's Behaviour 4. Conclusion 5. Future Perspectives

DSFM 2-1 Notation (X 1,1, Y 1,1 ),..., (X J,1, Y J,1),..., (X 1,T, Y 1,T ),..., (X J,T, Y J,T ), }{{}}{{} t=1 t=t X j,t R d, Y j,t R T - the number of observed time periods J - the number of the observations in a period t E(Y t X t ) = F t (X t ) What is F t (X t )? How does it move?

DSFM 2-2 Dynamic Semiparametric Factor Model E(Y t X t ) = L Z t,l m l (X t ) = Z t m(x t) = Z t A Ψ l=0 Z t = (1, Z t,1,..., Z t,l) low dim (stationary) time series m = (m 0, m 1,..., m L ), tuple of functions Ψ = {ψ 1 (X t ),..., ψ K (X t )}, ψ k (x) space basis A : (L + 1) K coecient matrix

DSFM 2-3 DSFM Estimation Y t,j = L Z t,l m l (X t,j) + ε t,j = Z t l=0 A ψ(x t,j) + ε t,j ψ(x) = {ψ 1 (x),..., ψ K (x)} tensor B-spline basis (Ẑt, Â ) = arg min Z t,a T J t=1 j=1 { Y t,j Z t A ψ(x t,j)} 2 (1) Minimization by Newton-Raphson algorithm

DSFM 2-4 B-Splines Figure 2: B-splines basis functions; order of B-splines: quadratic; number of knots: 36

DSFM 2-5 DSFM Estimation Selection of L by explained variance T J t=1 j=1 {Y t,j L l=0 Z t,l m l (X t,j) EV (L) = 1 J { } 2 j=1 Y t,j Ȳ T t=1 number of B-splines (equally spaced) knots: 12 14 14 } 2 L = 2 L = 4 L = 5 L = 10 L = 20 92.07 92.25 92.29 93.66 95.19 Table 1: EV in percent of the model with dierent numbers of factors L, averaged over all 17 analyzed subjects.

DSFM 2-6 Panel DSFM Y i t,j = L (Z i + t,l αi t,l )m l(x t,j) + ε t,j, 1 j J, 1 t T, l=0 n = 17 weakly/strongly risk-averse subjects Y t,j - BOLD signal; X j voxel's index α i - xed individual eect t,l Identication condition: E { n i=1 l=0 } L α i t,l m l(x t,j) X t,j = 0

DSFM 2-7 Panel DSFM Estimation 1. Average Y i over subjects i to obtain Ȳ t,j t,j 2. Estimate factors m l for the average brain (via one step of 1) 3. Given m l, for i, estimate Z i t,l Y i t,j = L Z i t,l m l(x t,j) + ε i t,j l=0 26h - estimation time; CPU - 2 2.8GHz; data set of size 24.31 GB

Results vs. Subject's Behaviour 3-1 Estimated constant factor m 0 = K k=1 â0,kψ(x ) with L = 20

Results vs. Subject's Behaviour 3-2 x 1 3 2.5 == Z => 2 = X => <= Y == 1.5 1 0.5 == Y => 0 Estimated factor m 5 = K k=1 â5,kψ(x ) with L = 20 (MOFC = Medial orbitofrontal cortex)

Results vs. Subject's Behaviour 3-3 x 1 3 2.5 == Z => 2 = X => <= Y == 1.5 1 0.5 == Y => 0 Estimated factor m 9 = K k=1 â9,kψ(x ) with L = 20

Results vs. Subject's Behaviour 3-4 x 1 3 2.5 == Z => 2 = X => <= Y == 1.5 1 0.5 == Y => 0 Estimated factor m 12 = K k=1 â12,kψ(x ) with L = 20

Results vs. Subject's Behaviour 3-5 x 1 3 2.5 == Z => 2 = X => <= Y == 1.5 1 0.5 == Y => 0 Estimated factor m 16 = K k=1 â16,kψ(x ) with L = 20

Results vs. Subject's Behaviour 3-6 x 1 3 2.5 == Z => 2 = X => <= Y == 1.5 1 0.5 == Y => 0 Estimated factor m 17 = K k=1 â17,kψ(x ) with L = 20

Results vs. Subject's Behaviour 3-7 x 1 3 2.5 == Z => 2 = X => <= Y == 1.5 1 0.5 == Y => 0 Estimated factor m 18 = K k=1 â18,kψ(x ) with L = 20

Results vs. Subject's Behaviour 3-8 Estimated Factor Loading Ẑ5 3 x 10 6 0 0 5 10 15 20 25 30 2 x 106 1 0 1 0 5 10 15 20 25 30 Figure 3: Estimated factor loading Ẑ5 for subjects within 30 minutes: 12 (lower panel) and 19 (upper panel) with L = 20; red dots denote stimulus

Results vs. Subject's Behaviour 3-9 Estimated Factor Loading Ẑ9 x 10 6 1 0 1 0 5 10 15 20 25 30 x 10 6 3 4 0 5 10 15 20 25 30 Figure 4: Estimated factor loading Ẑ9 for subjects within 30 minutes: 12 (lower panel) and 19 (upper panel) with L = 20; red dots denote stimulus

Results vs. Subject's Behaviour 3-10 Estimated Factor Loading Ẑ12 x 10 6 1 2 0 5 10 15 20 25 30 8 x 106 6 0 5 10 15 20 25 30 Figure 5: Estimated factor loading Ẑ12 for subjects within 30 minutes: 12 (lower panel) and 19 (upper panel) with L = 20; red dots denote stimulus

Results vs. Subject's Behaviour 3-11 Estimated Factor Loading Ẑ16 4 x 106 2 0 5 10 15 20 25 30 x 10 7 2 2.4 0 5 10 15 20 25 30 Figure 6: Estimated factor loading Ẑ16 for subjects within 30 minutes: 12 (lower panel) and 19 (upper panel) with L = 20; red dots denote stimulus

Results vs. Subject's Behaviour 3-12 Estimated Factor Loading Ẑ17 11 x 10 6 8 5 0 5 10 15 20 25 30 x 10 6 1 0 5 10 15 20 25 30 Figure 7: Estimated factor loading Ẑ17 for subjects within 30 minutes: 12 (lower panel) and 19 (upper panel) with L = 20; red dots denote stimulus

Results vs. Subject's Behaviour 3-13 Estimated Factor Loading Ẑ18 x 10 6 8 4 0 5 10 15 20 25 30 x 10 7 3.2 2.8 0 5 10 15 20 25 30 Figure 8: Estimated factor loading Ẑ18 for subjects within 30 minutes: 12 (lower panel) and 19 (upper panel) with L = 20; red dots denote stimulus

Results vs. Subject's Behaviour 3-14 Reaction to the stimulus 6 Subject 12 x 10 6 Subject 19 x 10 1 1 0.5 0.5 Z 1 2 0 Z 1 2 0 0.5 0.5 1 1 100 200 300 400 500 600 700 100 200 300 400 500 600 700 Time in scan units Time in scan units Figure 9: Reaction to stimulus for factors loadings Ẑt,12 for subjects 12 (left) and 19 (right) during the whole experiment (45 stimuli).

Results vs. Subject's Behaviour 3-15 Risk attitude Subject's risk perception - risk metrics standard deviation empirical frequency of loss (negative return) dierence between highest an lowest return (range) coecient of range (range/mean) empirical frequency of ending below 5% coecient of variation (standard deviation/mean) Dierent subject - dierent risk perception tted by correlation between risk metrics of return streams and answers for 1 task, N = 27

Results vs. Subject's Behaviour 3-16 Risk attitude Subjective expected return - return ratings recency (higher weights on later returns) primacy (higher weights on earlier returns) below 0% (higher weights on returns below 0%) below 5% (higher weights on returns below 5%) mean Selecting return ratings for each subject individually best model by one-leave-out cross validation procedure, N = 27

Results vs. Subject's Behaviour 3-17 Risk attitude Risk-return choice model V i = m i β i R i, 1 i n, m i -subjective expected return, R i - perceived risk, V i - subjective value, 5% - risk free return β Risk attitude parameter

Results vs. Subject's Behaviour 3-18 Risk attitude Estimation of individual risk attitude by logistic regression P {risky choice (m, R)} = 1 1 + exp(m βr 5) P {sure choice (m, R)} = 1 1 1 + exp(m βr 5) risky choice - unknown return, sure choice - xed, 5% return β derived by maximum likelihood method

Results vs. Subject's Behaviour 3-19 20 Risk Attitude of Subjects 15 18 19 Risk Attitude 10 5 1 3 2 4 5 6 8 9 10 11 15 16 17 12 0 0 5 10 15 20 Subjects Figure 10: Risk attitude for 16 subjects; modeled by the softmax function from individuals' decisions, estimated by ML method Mohr et. al.

Results vs. Subject's Behaviour 3-20 SVM Classication Analysis Support Vector Machines (SVM) 17 subjects, 20 factor loadings per subject Leave-one-out method to train and estimate classication rate SVM with Gaussian kernel; (R, C) chosen to maximize classication rate Weakly/strongly risk-averse subjects have larger/smaller dierences of Ẑ i inside each trial t,l

Results vs. Subject's Behaviour 3-21 SVM Classication Analysis Reaction to the RPID corresponds to dynamics of Ẑ i t,l, l = 5, 9, 12, 16, 17, 18 First 3 observations (7.5 sec.) after stimulus Decison time - 7 sec. Ẑt,2 x 106 10 5 0 5 20 21 22 0.5 1 1.5 x Ẑ 107 t,2 0 2 20 21 22

Results vs. Subject's Behaviour 3-22 SVM Classication Analysis 1. factors attributed to risk patterns: l = 5, 9, 12, 16, 17, 18 2. only Decision under Risk (Q3) stimulus 3. Ẑ i t,l def = Ẑ i Ẑs,l, s is the time of stimulus s+t,l 4. average reaction to s stimulus Ẑ i s,l = 1 3 3 τ=1 Ẑ i s+τ,l SVM input data: volatility of Ẑ i s,l over all Q3 Std Estimated Strongly Weakly Data Strongly 1.00 0.00 Weakly 0.14 0.86 Table 2: Classication rates of the SVM method, without knowing the subject's estimated risk attitude.

Conclusion 4-1 Conclusion Factors m identify activated areas, neurological reasonable Estimated factor loadings show dierences for individuals with dierent risk attitudes (e.g. 12 vs. 19) SVM classication analysis of measurements in Ẑt,l, l = 5, 9, 12, 16, 17, 18 after stimulus, can distinguish weakly/strongly risk-averse individuals with high classication rate, without knowing the subject's answers

Future Perspectives 5-1 Future Perspectives Comparison with the PCA/ICA (PARAFAC) approach Analysis of the second part of the experiment (under assumption of independency) to "generate" larger number of subjects Improvement of the classication criterion Penalized DSFM with seasonal eects

Alena My²i ková Piotr Majer Song Song Peter N. C. Mohr Wolfgang K. Härdle Hauke R. Heekeren Alena Myšičková Peter N. C. Mohr Song Song Wolfgang K. Härdle C.A.S.E. Centre for Applied Statistics and Hauke R. Heekeren Economics Humboldt-Universität C.A.S.E. Centre zu Berlin for Applied Statistics and Economics Freie Universität Berlin Humboldt-Universität zu Berlin Max Planck Institute Freie Universität for Molecular Berlin Genetics http://lvb.wiwi.hu-berlin.de Max Planck Institute for Molecular http://www.languages-of-emotion.de Genetics http://www.molgen.mpg.de http://lvb.wiwi.hu-berlin.de http://www.languages-of-emotion.de

Future Perspectives 5-3 References Mayer, R. Remarks on the Forces of Nature The Benjamin/Cummings Publishing Company, London, 1841. Mohr, P., Biele G., Krugel, L., Li S., Heekeren, H. Neural foundations of risk-return trade-o in investment decisions NeuroImage, 49: 2556-2563 Risk attitudes Park, B., Mammen, E., Härdle, W. and Borak, S. Time Series Modelling with Semiparametric Factor Dynamics J. Amer. Stat. Assoc., 104(485): 284-298, 2009.

Future Perspectives 5-4 References Ramsay, J. O. and Silverman, B. W. Functional Data Analysis New York: Springer. Woolrich, M., Ripley, B., Brady, M., Smith, S. Temporal Autocorrelation in Univariate Linear Modelling of FMRI Data NeuroImage, 21: 2245-2278

Appendix 6-1 Voxel-wise GLM fmri methods FEAT - FMRI Expert Analysis Tool by Department of Clinical Neurology, University of Oxford GLM framework Y = XB + η, Y - single voxel BOLD time series, X - design matrix (regressors, i.e. visual, auditory) Signicant, active areas (B) selected by z-scores