Analyzing large-scale spike trains data with spatio-temporal constraints

Similar documents
Analyzing large-scale spike trains data with spatio-temporal constraints

Methylation-associated PHOX2B gene silencing is a rare event in human neuroblastoma.

A new simple recursive algorithm for finding prime numbers using Rosser s theorem

Dispersion relation results for VCS at JLab

Two-step centered spatio-temporal auto-logistic regression model

Smart Bolometer: Toward Monolithic Bolometer with Smart Functions

Easter bracelets for years

Parameter Estimation for Spatio-Temporal Maximum Entropy Distributions: Application to Neural Spike Trains

Can we reduce health inequalities? An analysis of the English strategy ( )

Completeness of the Tree System for Propositional Classical Logic

Case report on the article Water nanoelectrolysis: A simple model, Journal of Applied Physics (2017) 122,

Analysis of Boyer and Moore s MJRTY algorithm

Near-Earth Asteroids Orbit Propagation with Gaia Observations

Gaia astrometric accuracy in the past

On Symmetric Norm Inequalities And Hermitian Block-Matrices

On size, radius and minimum degree

Passerelle entre les arts : la sculpture sonore

Evolution of the cooperation and consequences of a decrease in plant diversity on the root symbiont diversity

Thomas Lugand. To cite this version: HAL Id: tel

Vibro-acoustic simulation of a car window

Exogenous input estimation in Electronic Power Steering (EPS) systems

L institution sportive : rêve et illusion

Soundness of the System of Semantic Trees for Classical Logic based on Fitting and Smullyan

A Context free language associated with interval maps

Territorial Intelligence and Innovation for the Socio-Ecological Transition

Full-order observers for linear systems with unknown inputs

Hook lengths and shifted parts of partitions

FORMAL TREATMENT OF RADIATION FIELD FLUCTUATIONS IN VACUUM

Non Linear Observation Equation For Motion Estimation

Entropies and fractal dimensions

Prompt Photon Production in p-a Collisions at LHC and the Extraction of Gluon Shadowing

3D-CE5.h: Merge candidate list for disparity compensated prediction

From Unstructured 3D Point Clouds to Structured Knowledge - A Semantics Approach

Question order experimental constraints on quantum-like models of judgement

Parallel Repetition of entangled games on the uniform distribution

RHEOLOGICAL INTERPRETATION OF RAYLEIGH DAMPING

The status of VIRGO. To cite this version: HAL Id: in2p

The Windy Postman Problem on Series-Parallel Graphs

On the longest path in a recursively partitionable graph

A Simple Proof of P versus NP

Finite Volume for Fusion Simulations

Reduced Models (and control) of in-situ decontamination of large water resources

Spatial representativeness of an air quality monitoring station. Application to NO2 in urban areas

b-chromatic number of cacti

The core of voting games: a partition approach

On Symmetric Norm Inequalities And Hermitian Block-Matrices

The FLRW cosmological model revisited: relation of the local time with th e local curvature and consequences on the Heisenberg uncertainty principle

Learning an Adaptive Dictionary Structure for Efficient Image Sparse Coding

Multiple sensor fault detection in heat exchanger system

On Solving Aircraft Conflict Avoidance Using Deterministic Global Optimization (sbb) Codes

Some explanations about the IWLS algorithm to fit generalized linear models

Best linear unbiased prediction when error vector is correlated with other random vectors in the model

Impedance Transmission Conditions for the Electric Potential across a Highly Conductive Casing

Particle-in-cell simulations of high energy electron production by intense laser pulses in underdense plasmas

On infinite permutations

Comparison of Harmonic, Geometric and Arithmetic means for change detection in SAR time series

Exact Comparison of Quadratic Irrationals

Quantum efficiency and metastable lifetime measurements in ruby ( Cr 3+ : Al2O3) via lock-in rate-window photothermal radiometry

A non-commutative algorithm for multiplying (7 7) matrices using 250 multiplications

Control of an offshore wind turbine modeled as discrete system

Sound intensity as a function of sound insulation partition

Classification of high dimensional data: High Dimensional Discriminant Analysis

Diurnal variation of tropospheric temperature at a tropical station

SOLAR RADIATION ESTIMATION AND PREDICTION USING MEASURED AND PREDICTED AEROSOL OPTICAL DEPTH

A new approach of the concept of prime number

Tropical Graph Signal Processing

Fast Computation of Moore-Penrose Inverse Matrices

Impulse response measurement of ultrasonic transducers

A Slice Based 3-D Schur-Cohn Stability Criterion

Solution to Sylvester equation associated to linear descriptor systems

A simple kinetic equation of swarm formation: blow up and global existence

How to make R, PostGIS and QGis cooperate for statistical modelling duties: a case study on hedonic regressions

A Study of the Regular Pentagon with a Classic Geometric Approach

STATISTICAL ENERGY ANALYSIS: CORRELATION BETWEEN DIFFUSE FIELD AND ENERGY EQUIPARTITION

Towards an active anechoic room

Stickelberger s congruences for absolute norms of relative discriminants

Stochastic invariances and Lamperti transformations for Stochastic Processes

On path partitions of the divisor graph

Negative results on acyclic improper colorings

Higgs searches at L3

Posterior Covariance vs. Analysis Error Covariance in Data Assimilation

Approximation SEM-DG pour les problèmes d ondes elasto-acoustiques

Call Detail Records to Characterize Usages and Mobility Events of Phone Users

ON THE UNIQUENESS IN THE 3D NAVIER-STOKES EQUATIONS

Climbing discrepancy search for flowshop and jobshop scheduling with time-lags

Axiom of infinity and construction of N

Electromagnetic characterization of magnetic steel alloys with respect to the temperature

Radio-detection of UHECR by the CODALEMA experiment

Determination of absorption characteristic of materials on basis of sound intensity measurement

On Poincare-Wirtinger inequalities in spaces of functions of bounded variation

Lorentz force velocimetry using small-size permanent magnet systems and a multi-degree-of-freedom force/torque sensor

Widely Linear Estimation with Complex Data

The beam-gas method for luminosity measurement at LHCb

On one class of permutation polynomials over finite fields of characteristic two *

Numerical Modeling of Eddy Current Nondestructive Evaluation of Ferromagnetic Tubes via an Integral. Equation Approach

On Politics and Argumentation

Impairment of Shooting Performance by Background Complexity and Motion

Influence of a Rough Thin Layer on the Potential

Basic concepts and models in continuum damage mechanics

Finite element computation of leaky modes in straight and helical elastic waveguides

Transcription:

Analyzing large-scale spike trains data with spatio-temporal constraints Hassan Nasser, Olivier Marre, Selim Kraria, Thierry Viéville, Bruno Cessac To cite this version: Hassan Nasser, Olivier Marre, Selim Kraria, Thierry Viéville, Bruno Cessac. Analyzing large-scale spike trains data with spatio-temporal constraints. NeuroComp/KEOpS 12 workshop beyond the retina: from computational models to outcomes in bioengineering. Focus on architecture and dynamics sustaining information flows in the visuomotor system., Oct 2012, Bordeaux, France. 2012. <hal- 00756467> HAL Id: hal-00756467 https://hal.inria.fr/hal-00756467 Submitted on 23 Nov 2012 HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Analyzing large-scale spike trains data with spatio-temporal constraints Hassan Nasser, O. Marre, Selim Kraria, Thierry Vieville, Bruno Cessac, October 5, 2012 1 Context Recent experimental advances have made it possible to record several hundred neurons simultaneously in the retina as well as in the cortex. Analyzing such a huge amount of data requires to elaborate statistical, mathematical and numerical methods, to describe both the spatio-temporal structure of the population activity and its relevance to sensory coding. Among these methods, the maximum entropy principle has been used to describe the statistics of spike trains. Recall that the maximum entropy principle consists of fixing a set of constraints, determined with the empirical average of quantities ( observables ) measured on the raster: for example average firing rate of neurons, or pairwise correlations. Maximising the statistical entropy given those constraints provides a probability distribution, called a Gibbs distribution, that provides a statistical model to fit the data and extrapolate phenomenological laws. Most approaches were restricted to instantaneous observables i.e. quantities corresponding to spikes occurring at the same time (singlets, pairs, triplets and so on. See [6],[5] and [4]). The corresponding Gibbs distributions are only successful in predicting the probability of patterns lying within one time bin [2] but fail to predict the temporal statistics of the neural activity. 2 Extension of the maximum entropy to spatiotemporal statistics We first define the Gibbs potential as: H β = K β k O k, (1) k=1 where the β k s are real numbers and free parameters. O k s are the observables that defines the constraints. Fitting the data to this Gibbs potential model aims NeuroMathComp team, INRIA, 2004 Route des Lucioles, 06902 Sophia-Antipolis, France. Institut de la vision, 17 rue Moreau, 75012, Paris, France. Dream team, INRIA, 2004 Route des Lucioles, 06902 Sophia-Antipolis, France. Cortex team, INRIA, 2004 Route des Lucioles, 06902 Sophia-Antipolis, France. NeuroMathComp, INRIA, 2004 Route des Lucioles, 06902 Sophia-Antipolis, France. 1

to reproduce the probability distribution of the data, i.e, finding a theoretical distribution whose observables probabilities are close to the empirical probabilities measured on the raster, using the maximum entropy principle. We have extended this principle considering spatio temporal constraints (i.e. neurons interactions with time lags) and allowing the characterization of spatio-temporal patterns statistics. This approach, based on a transfer matrix technique, avoids the computation of the normalization factor (partition function) and directly provides the moment generating function (free energy density). We first developed a numerical method to learn the parameters of a maximum entropy model with spatio-temporal constraints on real spike train data [7]. However, this exact method could only be applied to small subsets of neurons, since it required construction of huge matrices whose size increases exponentially with the number of neurons. To circumvent this combinatorial explosion, we then adopted a Monte-Carlo approach to compute the observable averages for a given set of model parameters (see [3] for more details). We first tested the efficiency of this new algorithm in the case where a raster plot is generated from a known Gibbs distribution [Fig. 1]. We then make the algorithm find back this Gibbs distribution from that raster. This allowed us to quantify both convergence rate and model accuracy as a function of the number of neurons. The main advantage of this new technique is its computational speed. Fig. 2 shows the difference between the computation time taken using transfer matrix technique and the Monte-Carlo technique: The computation time with Monte-Carlo increases linearly with the number of neurons while it increases exponentially with transfer matrix based technique. 3 Learning the parameters β k We have also developed an efficient algorithm to compute the parameters β k defining the Gibbs distribution, even for large data sets. We adopted the method used in [1] and we extended it to the spatio-temporal case. Their method is based on the minimization a criterion, the negative likelihood, given by (Eq. 2) L (T) π (β) = π ω (T) [log µ β ], (2) ω where β k s are the parameters to be inferred, µ β is the probability distribution estimated using Monte-Carlo technique, π ω (T) is the empirical distribution and finally. L (T) π (β) represent the likelihood between the data and the model, ω to be minimized. We have tested the method considering large spatio temporal data sets where the number N of neurons is large (N 100) as well as the model range R, where the exact transfer matrix computation becomes intractable. In Fig. 3 we present the maximal distance (Eq. 3) between the exact value of the coefficient and the estimated value, averaged on a set of 10 random Gibbs potentials. The error is given by the following formula: Error = 1 β k L ˆβ k (3) k=1...l where β k is the exact value of coefficient k, ˆβk is the estimated value and L is the number of terms in the potential. 2

1 0.9 0.8 0.7 0.6 N=3 N=4 N=5 N=6 N=7 y=x 0.5 0.4 0.3 0.2 0.1 0-0.1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Figure 1: Comparison between the real (x axis) and estimated (y axis) values of observable averages (Points represent the mean value of the observable averages and segments represents the standard deviation, over 10 trials).given a random Gibbs potential, we computed the probability distribution of observables with the transfer-matrix based technique and with the Monte-Carlo based technique. Since the transfer matrix cannot handle for large sets, we were limited to a small number of neurons in order compare the empirical and theoretical averages. The values obtained with Monte-Carlo are closed to those obtained with the transfer matrix technique since they are all aligned to the y = x axis. 4 Conclusion We have extended the maximum entropy technique to handle spatio-temporal temporal constraint and developped a Monte-Carlo based method which allows to infer statistical properties for large data set, in less time than the transfer matrix based technique. This work will help to analyse spike train sorted from large MEA (Multi-Electrode array) recordings and discover the structure of the hidden statistics behind the data. The choice of the Gibbs potential form (what are the terms that we put in the potential) remains a challenge and will be subject for future studies. 3

1e+06 100000 10000 MC Av. 12.4*N^1.32 Th. Av. 0.00015*exp(2.24*N) CPU Time (s) 1000 100 10 1 0.1 0.01 2 4 8 N Figure 2: The CPU time necessary to obtain the observable averages presented in Fig. 1, for the Monte-Carlo Average (MC Av.) and for the theoretical (Th. Av.), as a function of N (Here, the theoretical average is exact since the potential is generated synthetically with known probability distribution). The full lines correspond to fit. The computation time with Monte-Carlo increases linearly with the number of neurons while it increases exponentially with transfer matrix based technique. 0.14 0.12 0.1 N=5, R=3 N=4,R=4 N=20,R=3 N=30,R=4 N=60,R=5 convergence bound Error 0.08 0.06 0.04 0.02 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 Number L of Monomials Figure 3: Max distance between the exact values of coefficients and the estimated values, averaged on a set of 10 random potentials for several values of N, R. Since for each (N, R) value the potential is not the same, then the maximum number of monomials (L) will not be the same and by consequence we observe that the curves do not have all the same number of points. References [1] M. Dudík, S. Phillips, and R. Schapire. Performance guarantees for regularized maximum entropy density estimation. In Proceedings of the 17th 4

Annual Conference on Computational Learning Theory, 2004. [2] O. Marre, S. El Boustani, Y. Frégnac, and A. Destexhe. Prediction of spatiotemporal patterns of neural activity from pairwise correlations. Phys. rev. Let., 102:138101, 2009. [3] H. Nasser, O. Marre, and B. Cessac. Spatio-temporal spike trains analysis for large scale networks using maximum entropy principle and monte-carlo method. Journal Of Statistical Mechanics, to appear, 2012. [4] E. Schneidman, M.J. Berry, R. Segev, and W. Bialek. Weak pairwise correlations imply strongly correlated network states in a neural population. Nature, 440(7087):1007 1012, 2006. [5] J. Shlens, G.D. Field, J.L. Gauthier, M.I. Grivich, D. Petrusca, A. Sher, A.M. Litke, and E.J. Chichilnisky. The structure of multi-neuron firing patterns in primate retina. Journal of Neuroscience, 26(32):8254, 2006. [6] G. Tkacik, Elad Schneidman, Michael J. Berry II, and William Bialek. Spin glass models for a network of real neurons. arxiv: 0912.5409v1, 2009. [7] J-C Vasquez, T. Viéville, and B. Cessac. Entropy-based parametric estimation of spike train statistics. INRIA Research Report, 2010. Acknowledgements: This work was supported by the INRIA, ERC-NERVI number 227747, KEOPS ANR-CONICYT and BrainScales projects. 5