Big Data and the Brain G A L L A NT L A B

Size: px

Start display at page:

Download "Big Data and the Brain G A L L A NT L A B"

Sherman Moore
6 years ago
Views:

1 Big Data and the Brain MICHAEL O L IVER G A L L A NT L A B UC BERKELEY

2 The brain is gigantic The human brain has ~100 billion neurons connected by ~100 trillion synapses Multiple levels of organization

3 But our data is only Big Electrophysiology experiments can record from ~100 neurons simultaneously fmri experiments we can record from ~90,000 voxels of about 20 mm 3 There are over 2,000,000 neurons per voxel

4 The visual brain

5 System identification and neuroscience Model each neuron based on relationship between stimulus and response Evaluate models based on their ability to predict responses to novel stimuli

6 Why system identification in visual cortex is hard Car Eiffel Tower Non-linear High dimensional Interpretability is important!

7 Linearized regression Nonlinear Feature Mapping p 2 f 2 Encoding Model a 2 p 1 f 1 Decoding Model a 1 p 3 Pixel Space f 3 Feature Space a 3 Brain Activity Space

8 Nonlinearity Feature spaces Stimuli Model neuron/voxel Responses Feature Mapping Linear Weighting W1 W2 + Wn

9 Movie reconstruction from fmri data Nishimoto S, et al. Curr Biol Oct 11;21(19):1641-6

10 Van Essen DC, Gallant JL. Neuron Jul;13(1):1-10.

11 Tuning in V4

12 We need big data! V4 receptive fields are moderately large Potential stimulus space is very large Natural images span the relevant space Response is highly nonlinear

13 How to get big data Implantable electrodes allow us to record from the same cell over many days We used over 1 million frames of natural movies, the largest ever stimulus set in V4

14 General nonlinear modeling The Volterra Series: x 1 x 2 Can control model flexibility by order choice Parameter space grows quickly with order h 11 h 12 h 21 h 22 h 111 h 121 h 111 h 121 h 111 h 121 h 211 h 221

15 The scale of the problem If we have 1000 pixels 2 nd Order: 500,000+ coefficients 3 rd Order: 160 million+ coefficients 4 th Order: 40 billion+ coefficients But we actually have about 196,608 pixels 2 nd Order: 19 billion+ coefficients 3 rd Order: 1.2 quadrillion+ coefficients 4 th Order: 62 quintillion+ coefficients Pixels 1.2x10 15

16 Taming dimensionality Pixels 1.2x10 15 PCA 1.6x10 8

kernel regression model Kernel function equivalent to dot

17 Kernel regression For all i,j =1:n in training data Use weights to make predictions for new x Calculate weights for kernel regression model Kernel function equivalent to dot product in feature space Pixels 1.2x10 15 PCA 1.6x10 8 Kernel 1x10 6

18 The Inhomogeneous Polynomial Kernel 2 nd Order IHP: Implicitly Maps to feature space containing all first and second order terms

19 Neural networks as kernel machines Input Hidden Units Output In a standard NN: From NN perspective, kernel regression with a tanh kernel function is equivalent to a NN with hidden units = training samples Pixels 1.2x10 15 PCA 1.6x10 8 Kernel 1x10 6 IPKN 1.5x10 4

20 Data Stochastic gradient boosting Volterra Space Error Surface

21 Prediction performance by model order

22 Color constancy in V4

23 V4 Tuning for Color Kusunoki M et al. J Neurophysiol 2006;95:

24 Color Tuning of V4 Cells First Order Color Coefficients

25 V4 tuning to curvature Pasupathy A, and Connor C E J Neurophysiol 2001;86:

26 V4 tuning to Non-Cartesian Gratings Gallant JL, Connor CE, Rakshit S, Lewis JW, Van Essen DC. J Neurophysiol Oct;76(4):

27 Eigenvectors of second order V4 receptive field model

28 Eigenvectors of second order V4 receptive field model

29 Shape tuning of a V4 cell s Volterra model

30 Shape tuning of a V4 cell s Volterra model

31 Embracing the Complexity Demonstrated a way to make this big problem tractable Shown many reported features of V4 tuning can exist in a single cell Interpretation of large models is still a major problem Need tensor libraries that exploit symmetry to decompose large models

32 Thank you!

33 V4 High Response Movie Frames

34 Stochastic Gradient Boosted IPKNs Use IPKNs as the weak learners Fit to sample of data using backprop w/ a stopping set Perform line search to determine step size that minimizes error on sample Multiply step size by learning rate and update function Ensemble is equivalent to a single Volterra model!

35 Some Important Unanswered Questions What information are our models missing in V2, V4 and beyond? Do we need nonlinear combinations of basis functions? Can we derive new basis functions?

Extracting Coefficients from Model Create a design matrix or the desired order of interactions from the support vectors/input weights Multiply by the

36 Extracting Coefficients from Model Create a design matrix or the desired order of interactions from the support vectors/input weights Multiply by the output weights and weight by correction factor Extract coefficients from each iteration s network, weight by step size and sum to get final set of coefficients

Natural movies Feature Spaces Movies were shown to subjects Movies

found using regularized linear regression BOLD responses were

Building Category labels 2.8-0.4-1.5 x 0.1 0.1-2.7 = 1.7-0.8-0.9-2.

37 Natural movies Feature Spaces Movies were shown to subjects Movies were labeled with 1705 nouns and verbs Response to each category was found using regularized linear regression BOLD responses were recorded from the whole brain using fmri Woman Talking Text Car Building Category labels x = minutes 120 minutes Category model weights BOLD responses

Constructing a Semantic Space A B vehicle Coefficient on 1 st PC + + 4 th PC (Blue channel) mammal structure person plant organ athlete location bodypart city sky atmos. phenom.

38 Constructing a Semantic Space A B vehicle Coefficient on 1 st PC th PC (Blue channel) mammal structure person plant organ athlete location bodypart city sky atmos. phenom. RGB colormap for the group semantic space C bloom gallop change walk furniture weapon boat lean travel breathe fasten spin rappel gas pump device vehicle turn jump hit equipment move touch laptop bottle container tool drag move (transitive) bird ungulate reptile mammal fish animal consume plant way artifact structure carnivore rodent road arthropod athlete person organism plant organ clothing talk communicate hill location rodeo city herd rub geol. formation group mist matter text communication attribute dirt measure event atmospheric phenomenon wave food bodypart sky eye leg underwater 3 rd PC + (Green channel) + 2 nd PC (Red channel) car wheeled vehicle kettle ball shop door room building grassland material bamboo

39 Semantic Decoding

40 Flattening the Brain

ordnet anterior!!! Corre LSA anterior - D!

41 superior superior LH CoS! RH Visualizing Semantic Space! ordnet anterior!!! Corre LSA anterior - D! CiSmr SMFA RSC SMHA M1F S1F PrCu SEF! Speech! CeS CiS IPS Vper IPS! M1H SFS V7 FEF V3a S1H M1M TOS V3b V1 LO V2 V3! FO V1 V2 Vper V4!! psts FFA Speech MTS PPA STS A1 Sylvian Fissure ITS CoS IFS EBA V3 V4 Vper S S1M TPJ MT+! PoCeS! CoS anterior superior LH superior PPA AC! RH anterior!!!! Corre -

Nonlinear reverse-correlation with synthesized naturalistic noise

Cognitive Science Online, Vol1, pp1 7, 2003 http://cogsci-onlineucsdedu Nonlinear reverse-correlation with synthesized naturalistic noise Hsin-Hao Yu Department of Cognitive Science University of California