An Inverse-Gamma Source Variance Prior with Factorized Parameterization for Audio Source Separation

Size: px
Start display at page:

Download "An Inverse-Gamma Source Variance Prior with Factorized Parameterization for Audio Source Separation"

Transcription

1 An Inverse-Gamma Source Variance Prior with Factorized Parameterization or Audio Source Separation Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud To cite this version: Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud. An Inverse-Gamma Source Variance Prior with Factorized Parameterization or Audio Source Separation. 4st IEEE International Conerence on Acoustics, Speech and SIgnal Processing ICASSP 206, Mar 206, Shanghai, China. ICASSP Proceedings, pp.36-40, <0.09/ICASSP >. <hal > HAL Id: hal Submitted on 8 Jan 206 HAL is a multi-disciplinary open access archive or the deposit and dissemination o scientiic research documents, whether they are published or not. The documents may come rom teaching and research institutions in France or abroad, or rom public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diusion de documents scientiiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche rançais ou étrangers, des laboratoires publics ou privés.

2 AN INVERSE-GAMMA SOURCE VARIANCE PRIOR WITH FACTORIZED PARAMETERIZATION FOR AUDIO SOURCE SEPARATION Dionyssos Kounades-Bastian, Laurent Girin,2, Xavier Alameda-Pineda 3, Sharon Gannot 4, Radu Horaud INRIA Grenoble Rhône-Alpes, 2 GIPSA-Lab, Univ. Grenoble Alpes 3 University o Trento, 4 Faculty o Engineering, Bar-Ilan University ABSTRACT In this paper we present a new statistical model or the power spectral density PSD o an audio signal and its application to multichannel audio source separation MASS. The source signal is modeled with the local Gaussian model LGM and we propose to model its variance with an inverse-gamma distribution, whose scale parameter is actorized as a ran- model. We discuss the interest o this approach and evaluate it in a MASS tas with underdetermined convolutive mixtures. For this aim, we derive a variational EM algorithm or parameter estimation and source inerence. The proposed model shows a beneit in source separation perormance compared to a state-o-the-art LGM NMF-based technique. Index Terms Audio modeling, local Gaussian model, PSD model, audio source separation.. INTRODUCTION For the past decade, the statistical modeling o audio signals in the time-requency TF domain has been thoroughly investigated. Among the proposed models, the local Gaussian model LGM [] has become very popular because, among other reasons, it can be naturally coupled with models o the signal power spectral density PSD which identiies with the signal variance at each TF bin or a zero-mean signal. An important example is the use o non-negative matrix actorization NMF, which imposes a low-ran structure on the PSD matrix, namely the product o a spectral pattern matrix and a temporal activation matrix [2]. Intuitively, NMF is meant to eiciently represent the structure o the audio signal power in the TF domain with a reduced number o parameters. The NMF actors were irst treated as parameters [3, 4, 5, 2], and then as latent variables within a Bayesian ramewor [6, 2, 7, 8, 9]. These models have been successully applied to audio source separation. In MASS conigurations, the source signal models are combined with a mixing model, accounting or the source-to-sensor channels, e.g. [0,, 2, 3, 4]. In general, an EM algorithm is derived to estimate the source This research has received unding rom the EU-FP7 STREP project EARS # and ERC Advanced Grant VHIA #3403. and channel parameters, which are then used to construct demixing Wiener ilters. Besides, a general ramewor or inserting prior inormation about the sources in TF-domain MASS has been proposed in [5]. In current Bayesian NMF PSD models, the source PSD is irst modeled with NMF and, second, the NMF actors are assigned a prior distribution. In this paper, we propose to change the order o things: The source is still seen as a sum o components, but we irst assign a prior distribution to the component PSD and then assume a actorized model, reminiscent o NMF, on the parameters o this distribution. More precisely, we model the component PSD with an inverse-gamma IG distribution and assume that the scale parameter o this IG ollows a ran- NMF. As explained in Section 2, this enables to add exibility in the modeling o source PSD matrix, compared to conventional NMF, while preserving the ability to model structured source PSD. We apply the proposed model to MASS rom underdetermined convolutive mixtures. The proposed model is presented in Section 2. The associated variational EM VEM algorithm that we derived to estimate the model parameters and iner the source signals is described in Section 3. Experimental evaluation reported in Section 4 shows competitive perormances in comparison to the state-o-the-art LGM-NMF MASS method o [0]. 2.. The mixing model 2. MODELS As usually done in the MASS literature, we wor under the narrow-band assumption, which allows us to write a timeinvariant convolutive mixture in the short-term Fourier transorm STFT domain as: x = A s + b, where x = [x,,..., x I, ] C I is the I channel observation vector, s = [s,,..., s J, ] C J is the source vector to be inerred, A C I J is a mixing matrix parameter to be estimated, and b C I is the sensor noise. We assume pb = N c b ; 0, v I I, with v R + N cx; µ, Σ = πσ exp [x µ] H Σ [x µ] is the proper complex Gaussian distribution with x C I, µ C I and Σ C I I.

3 being a variance parameter to be estimated I I is the identity matrix o dimension I. The above assumption implies px s = N c x ; A s, v I I. Note that the above mixture can be underdetermined, i.e. we can have I < J The source model We embrace the LGM ramewor [, 0] where s j, C is assumed to ollow a zero-mean proper complex Gaussian distribution. Moreover, s j, is assumed to be the sum o elementary components c, C, also zero-mean proper complex Gaussian: s j, = c, s = Gc, 2 K j where K j is a subset o a nontrivial partition K = {K j } J j= nown in advance o the K components into the J sources, c = [c,,..., c K, ] C K is the vector o component coeicients, and G N J K is a binary matrix with entries G j = i K j and G j = 0 otherwise. Finally, as in [0], we assume that all {c, } F,L,K,l,= are independent, with: pc, u, = N c c, ; 0, u,. 3 In particular, the source PSD at each TF bin is the sum o individual component PSDs at that bin The component PSD model Traditionally, the component PSD or component variance u, is typically assumed to actorise over and l, i.e. an NMF model is applied on the source PSD directly. In a Bayesian ramewor, the NMF actors are assigned a prior distribution. The main contribution o this paper is to reverse the traditional order: We irst assume a prior distribution or the component variance and then impose a nonnegative actorized structure on its parameters. More precisely, we assume that each entry u, o the component PSD matrix ollows an inverse Gamma IG distribution 2 : pu, = IG u, ; γ, δ, with δ, = w h l, 4 where γ, w, h l R +. The choice or the IG distribution emerges naturally, as it is the conjugate prior o the variance o a Gaussian. The actorization o the scale parameter δ, into a ran- model is a ey point o our model. Indeed, modeling the parameters o the component PSD prior or instance δ, with a ran- model instead o the component PSD u, itsel allows the latter not to be constrained to have a low-ran structure. Thereore, with the proposed model, both the component PSD and the source PSD can be ull-ran, 2 The Inverse Gamma distribution is deined as IGu; γ, δ = δ γ Γγ u γ+ exp δ, with support u R u +, shape parameter γ R +, scale parameter δ R +, and Γ being the Gamma unction. u, γ, w, h l c, x v, A Fig.. Graphical reprsentation o the probabilisitc model. Latent variables are represented with circles, observations with double circles, deterministic parameters with rectangles. as opposed to conventional NMF. In the meantime, the proposed model eeps a limited number o parameters and an ability to represent structured signals, in the spirit o conventional NMF. Finally, we postulate that the proposed model has the potential to better represent natural audio signals, such as speech. As or the IG shape parameter γ, it intuitively acts as a measure o the relevance o the -th component: high resp. low values o γ decrease resp. increase the contribution o the -th component. 3. VARIATIONAL INFERENCE We propose an EM algorithm to perorm inerence o the hidden variables H = {c, u, } F,L,K,l,= and estimation o the parameters θ = {A, v, γ, w, h l } F,L,K,l,=. As the E-step does not admit a closed orm solution, we use variational inerence: Let qh 0 = p H 0 {x } F,L, ; θ denote the posterior distribution o a variable H 0 H. First qh is imposed to actorise as qh F,L, qc F,L,K Then qh 0 = ph 0 {x } F,L, qh 0 exp E qh/h0 ; θ is inered with:,l,= qu,. [ log ph, {x } F,L, ; θ], 5 where E qz [z] is the expectation o unctional z w.r.t. the distribution qz over the support o the variable z, and where qh/h 0 is the joint posterior distribution o all hidden variables except H 0. The resulting E-step is the alternating inerence o qc E-C, and qu, E-U,, l,. 3.. E-step Let the superscript r denote the VEM iteration index, i.e. θ r are the parameters computed at the r th iteration. E-U-step: First we consider the inerence o qu,. Using 5, one can easily identiy qu, to be also an IG: qu, pu, exp E qc [log pc, u, ] = IG u, ; g r, dr,, 6 with posterior parameters g r, dr, R + calculated as: g r =γr +, d r, = δr, where Σ cr, ĉ r, below. + Σ cr, + ĉ r 2, 7, R + is the th diagonal entry o Σ cr, and, both being calculated C is the th entry o ĉ r

4 E-C-step: Using 5, qc can be identiied to be complex-gaussian: K qc px Gc exp E qu, [log pc, u, ] = = N c c ; ĉ r, Σcr. 8 The posterior covariance matrix Σ cr C K K and component vector estimate ĉ r CK are given by: Σ cr = ĉ r diag K = Σcr g r d r, A r + H G A r x /v r HA r G G v r, 9 where diag K x is the K K diagonal matrix with entries x,..., x K. Eq. 9 corresponds to the Wiener iltering o the component, thus a similar result as in [0] except or the construction o the component posterior covariance matrix. Estimating the source coeicients: Now, using 2, it is easy to calculate the source posterior distribution, which as one expects, is a complex Gaussian with mean ŝ r C J, and 2 nd -order moment R sr ŝ r = Gĉr, 3.2. M-step Rsr C J J calculated as: = GΣ cr G + ŝ r ŝ r H. 0 As or the M step, the parameters maximizing the expected complete-data log-lielihood are computed. M-A step: The optimal value or the ilters is: A r = L x ŝ r H L R sr,, which is a standard orm o least square estimator [0]. M-v step: The optimal noise variance is: v r = { } x H LI x 2Re x H A r ŝr + { } HA tr R sr A r r, 2 where tr{.} is the trace operator. M-IG step: The IG parameters γ, w, h l are coupled in the objective unction and thus an alternating optimization strategy is required, i.e. ixing two parameters to estimate the third. The updates or w, h l are: w r = g r Lγ r h r l d r,, h r l = F γ r F g r = w r d r,. 3 Algorithm Separation o J static sound sources input {x } F,L,, binary matrix G, initial parameters θ0. { } F,L,K initialise IG parameters:, set r =. g 0, d0,,l,= repeat E-C step: Compute Σ cr and ĉ r with 9. Compute ŝ r and Rsr with 0. E-U step: Calculate g r and d r,, with 7. M-A step: Update A r with. M-v step: Update v r with 2. M-IG step: δ r, = wr hr l Update w r, hr l with 3. Calculate. Update γr with 5. set r = r +. until convergence return the estimated source images. Then we set δ r, = wr solution w.r.t γ r to: ψ g r ψ γ r hr l = F L, and the update or γr F = log is the r d,, 4 δ r, where ψ. is the digamma unction. Since 4 has no closed-orm solution, we propose to approximate g r with γ r +, relying on 7, and use the recurrence relation o the digamma unction ψx + = ψx + x. This leads to the ollowing update rule: γ r = F L F = log 3.3. Estimation o source images. 5 d r, δ r, Considering the inherent scale indeterminacy o the source separation problem, we rather measure the separation perormance using the time domain source images, i.e. the estimates o the source signals as recorded at the microphones [, 6]. These are calculated by applying inverse STFT with overlap-add on {a j, ŝ j, } F,L, a j, is the j-th column o A. The complete VEM procedure can be ound in Algorithm. 4. EXPERIMENTS To asses the perormance o the proposed algorithm, we simulated the challenging tas o separating J = 3 sources rom a convolutive stereo mixture I = 2. Source signals were 2s-speech signals randomly chosen rom the TIMIT database [7]. As mixing ilters, we used binaural room

5 Mix- Mix-2 Table. Average SDR and SIR scores. SDR db SIR db R db Algo. s s2 s3 s s2 s Prop Base Prop Base Prop Base Prop Base Fig. 2. Average SDR score as a unction o VEM iterations Mix-,R = 0dB. impulse responses BRIR rom [8] truncated to 52 taps with reverberation time o RT s. Two sets o BRIRs were used, corresponding respectively to azimuths 85, 20, 60 Mix-, and azimuths 45, 75, 0 Mix-2. Standard sound separation measures, namely signalto-distortion ratio SDR and signal-to-intererence-ratio SIR [9] were computed. All reported results are average measures over 8 sets o utterances or each mix. To ensure a air comparison with the baseline method [0], we provided both algorithms with the same initial inormation. The mixing ilters were blindly initialized to A 0 = a matrix illed with ones and we set v 0 = 03 F LI x H x,. As or the NMF parameters, each source was corrupted with the sum o the two other sources at two dierent SNRs R = 0dB or 0dB. An initial NMF decomposition {w init, hinit l },l, was then computed or each corrupted source PSD using the KL-NMF algorithm [2], with K j = 20 components per source. {w init, hinit l },l, were also used to initialize the NMF parameters o the baseline method. We set δ 0, = winit hinit l, γ = and d 0, = δ0, thus g 0 = 2 and E IGu, ;g 0,d0 [u,] = w inithinit l. We, run 00 iterations. Fig. 2 shows the average SDR obtained at each iteration or Mix- with R = 0dB. We observe a quite regular evolution o the SDR scores, which are quite stabilized ater 00 iterations ater some possible decrease, since the VEM does not guarantee monotonic evolution o the separation scores as opposed to the lielihood. For this mix, the proposed method shows a notable improvement over the baseline: up to 3.dB or s 2 remind that these scores are averaged over 8 experiments with same ilters but dierent sources. Final perormance at iteration 00 or other conigurations, and or SIR scores, are reported in Table. There we see that or Mix- and R = 0dB coniguration o Fig. 2, the SIR improvement is in line with the SDR improvement: the proposed model outperorms the baseline by 5.8dB or s 2, while the results or the two other sources are less impressive. Such quite substantial improvement o SDR and SIR may be due to the added exibility o the proposed PSD model compared with NMF see Section 2.3. As or Mix- with R = 0dB, all scores are higher because the NMF initialization is closer to true source PSDs. Here, the proposed method also notably and systematically outperorms the baseline method. The results are more mitigated with the Mix-2 coniguration. Here both the SDR and SIR scores o the two methods are more intricate. Note that the scores o the proposed method are remarably similar across the two mixes, as opposed to the scores o the baseline method. This seems to indicate that the proposed method is robust to the mixing coniguration, but urther investigation must be conducted to conclude on this. Globally, the overall results encourage us to urther investigate the potential o this ull-ran PSD modeling, or source separation and beyond. 5. CONCLUSION MASS experiments have shown the potential o the proposed model or TF-domain statistical signal modeling. Future research will concern an in-depth analysis o the proposed PSD model per se, i.e. to model audio signals independently o the MASS context. This should include a comparative study with conventional parametric and Bayesian NMF. Also, we will investigate the characterization o component relevance rom the estimated shape parameter, and its use within a model selection tas when the number o source components is unnown, see, e.g. [20]. As or the MASS tas, we will explore more realistic initialization techniques, e.g. using the output o existing source separation techniques, leading to a more realistic sound separation algorithm and, again, more systematic comparison with other source PSD models plugged into the LGM-based MASS ramewor.

6 6. REFERENCES [] A. Liutus, B. Badeau, and G. Richard, Gaussian processes or underdetermined source separation, IEEE Transactions on Signal Processing, vol. 59, no. 7, pp , 20. [2] C. Févotte, N. Bertin, and J.-L. Durrieu, Nonnegative matrix actorization with the Itaura-Saito divergence. With application to music analysis, Neural Computation, vol. 2, no. 3, pp , [3] D. Lee and H. Seung, Learning the parts o objects by non-negative matrix actorization, Nature, vol. 40, pp , 999. [4], Algorithms or non-negative matrix actorization, in Advances in neural inormation processing systems, 200. [5] P. Smaragdis and J. Brown, Non-negative matrix actorization or polyphonic music transcription, in IEEE Worshop on the Applications o Signal Processing to Audio and Acoustics, [6] T. Virtanen, S. Godsill et al., Bayesian extensions to non-negative matrix actorisation or audio signal modelling, in IEEE International Conerence on Acoustics, Speech and Signal Processing, 2008, pp [7] N. Bertin, R. Badeau, and E. Vincent, Enorcing harmonicity and smoothness in Bayesian non-negative matrix actorization applied to polyphonic music transcription, IEEE Transactions on Audio, Speech, and Language Processing, vol. 8, no. 3, pp , 200. [8] M. Homan, D. Blei, and P. Coo, Bayesian nonparametric matrix actorization or recorded music, in International Conerence on Machine Learning, 200, pp [9] N. Mohammadiha, J. Taghia, and A. Leijon, Single channel speech enhancement using bayesian NMF with recursive temporal updates o prior distributions, in IEEE Int. Con. Acoustics, Speech, Signal Processing, Kyoto, Japan, 202. [0] A. Ozerov and C. Févotte, Multichannel nonnegative matrix actorization in convolutive mixtures or audio source separation, IEEE Transactions on Audio, Speech and Language Processing, vol. 8, no. 3, pp , 200. [] N. Duong, E. Vincent, and R. Gribonval, Underdetermined reverberant audio source separation using a ull-ran spatial covariance model, IEEE Trans. on Audio, Speech, and Language Proc., vol. 8, no. 7, pp , 200. [2] S. Arberet, A. Ozerov, N. Q. K. Duong, E. Vincent, R. Gribonval, F. Bimbot, and P. Vandergheynst, Nonnegative matrix actorization and spatial covariance model or under-determined reverberant audio source separation, in International Conerence on Inormation Sciences, Signal Processing, and their Applications, 200. [3] T. Higuchi, N. Taamune, N. Tomohio, and H. Kameoa, Underdetermined blind separation and tracing o moving sources based on DOA-HMM, in IEEE International Conerence on Audio, Speech and Signal Processing, 204. [4] D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, and R. Horaud, A variational EM algorithm or the separation o moving sound sources, in IEEE Worshop on the Applications o Signal Processing to Audio and Acoustics, 205. [5] A. Ozerov, E. Vincent, and F. Bimbot, A general exible ramewor or the handling o prior inormation in audio source separation, IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 4, pp. 8 33, 202. [6] N. Sturmel, A. Liutus, J. Pinel, L. Girin, S. Marchand, G. Richard, R. Badeau, and L. Daudet, Linear mixing models or active listening o music productions in realistic studio conditions, in Convention o the Audio Engineering Society AES, Budapest, Hungary, 202. [7] J. S. Garoolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, N. L. Dahlgren, and V. Zue, Timit acoustic-phonetic continuous speech corpus, 993, linguistic Data Consortium, Philadelphia. [8] C. Hummersone, R. Mason, and T. Brooes, A comparison o computational precedence models or source separation in reverberant environments, Journal o the Audio Engineering Society, vol. 6, no. 7/8, pp , 203. [9] E. Vincent, R. Gribonval, and C. Févotte, Perormance measurement in blind audio source separation, IEEE Transactions on Audio, Speech and Language Processing, vol. 4, no. 4, pp , [20] V. Y. Tan and C. Fevotte, Automatic relevance determination in nonnegative matrix actorization with the β-divergence, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 7, pp , 203.

Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization

Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization Xiaofei Li, Laurent Girin, Radu Horaud To cite this version: Xiaofei Li, Laurent Girin, Radu Horaud.

More information

Non-Stationary Noise Power Spectral Density Estimation Based on Regional Statistics

Non-Stationary Noise Power Spectral Density Estimation Based on Regional Statistics Non-Stationary Noise Power Spectral Density Estimation Based on Regional Statistics Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud To cite this version: Xiaofei Li, Laurent Girin, Sharon Gannot,

More information

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,

More information

A diagonal plus low-rank covariance model for computationally efficient source separation

A diagonal plus low-rank covariance model for computationally efficient source separation A diagonal plus low-ran covariance model for computationally efficient source separation Antoine Liutus, Kazuyoshi Yoshii To cite this version: Antoine Liutus, Kazuyoshi Yoshii. A diagonal plus low-ran

More information

AN EM ALGORITHM FOR AUDIO SOURCE SEPARATION BASED ON THE CONVOLUTIVE TRANSFER FUNCTION

AN EM ALGORITHM FOR AUDIO SOURCE SEPARATION BASED ON THE CONVOLUTIVE TRANSFER FUNCTION AN EM ALGORITHM FOR AUDIO SOURCE SEPARATION BASED ON THE CONVOLUTIVE TRANSFER FUNCTION Xiaofei Li, 1 Laurent Girin, 2,1 Radu Horaud, 1 1 INRIA Grenoble Rhône-Alpes, France 2 Univ. Grenoble Alpes, GIPSA-Lab,

More information

Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation

Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation Simon Arberet 1, Alexey Ozerov 2, Rémi Gribonval 1, and Frédéric Bimbot 1 1 METISS Group, IRISA-INRIA Campus de Beaulieu,

More information

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS Emad M. Grais and Hakan Erdogan Faculty of Engineering and Natural Sciences, Sabanci University, Orhanli

More information

Expectation-maximization algorithms for Itakura-Saito nonnegative matrix factorization

Expectation-maximization algorithms for Itakura-Saito nonnegative matrix factorization Expectation-maximization algorithms for Itakura-Saito nonnegative matrix factorization Paul Magron, Tuomas Virtanen To cite this version: Paul Magron, Tuomas Virtanen. Expectation-maximization algorithms

More information

Scalable audio separation with light Kernel Additive Modelling

Scalable audio separation with light Kernel Additive Modelling Scalable audio separation with light Kernel Additive Modelling Antoine Liutkus 1, Derry Fitzgerald 2, Zafar Rafii 3 1 Inria, Université de Lorraine, LORIA, UMR 7503, France 2 NIMBUS Centre, Cork Institute

More information

ESTIMATION OF RELATIVE TRANSFER FUNCTION IN THE PRESENCE OF STATIONARY NOISE BASED ON SEGMENTAL POWER SPECTRAL DENSITY MATRIX SUBTRACTION

ESTIMATION OF RELATIVE TRANSFER FUNCTION IN THE PRESENCE OF STATIONARY NOISE BASED ON SEGMENTAL POWER SPECTRAL DENSITY MATRIX SUBTRACTION ESTIMATION OF RELATIVE TRANSFER FUNCTION IN THE PRESENCE OF STATIONARY NOISE BASED ON SEGMENTAL POWER SPECTRAL DENSITY MATRIX SUBTRACTION Xiaofei Li 1, Laurent Girin 1,, Radu Horaud 1 1 INRIA Grenoble

More information

Informed Audio Source Separation: A Comparative Study

Informed Audio Source Separation: A Comparative Study Informed Audio Source Separation: A Comparative Study Antoine Liutkus, Stanislaw Gorlow, Nicolas Sturmel, Shuhua Zhang, Laurent Girin, Roland Badeau, Laurent Daudet, Sylvain Marchand, Gaël Richard To cite

More information

Sparse filter models for solving permutation indeterminacy in convolutive blind source separation

Sparse filter models for solving permutation indeterminacy in convolutive blind source separation Sparse filter models for solving permutation indeterminacy in convolutive blind source separation Prasad Sudhakar, Rémi Gribonval To cite this version: Prasad Sudhakar, Rémi Gribonval. Sparse filter models

More information

Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF)

Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF) Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain HR-NMF Roland Badeau, Angélique Dremeau To cite this version: Roland Badeau, Angélique Dremeau.

More information

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION Hiroazu Kameoa The University of Toyo / Nippon Telegraph and Telephone Corporation ABSTRACT This paper proposes a novel

More information

Rectified binaural ratio: A complex T-distributed feature for robust sound localization

Rectified binaural ratio: A complex T-distributed feature for robust sound localization Rectified binaural ratio: A complex T-distributed feature for robust sound localization Antoine Deleforge, Florence Forbes To cite this version: Antoine Deleforge, Florence Forbes. Rectified binaural ratio:

More information

Speech enhancement based on nonnegative matrix factorization with mixed group sparsity constraint

Speech enhancement based on nonnegative matrix factorization with mixed group sparsity constraint Speech enhancement based on nonnegative matrix factorization with mixed group sparsity constraint Hien-Thanh Duong, Quoc-Cuong Nguyen, Cong-Phuong Nguyen, Thanh-Huan Tran, Ngoc Q. K. Duong To cite this

More information

Cauchy Nonnegative Matrix Factorization

Cauchy Nonnegative Matrix Factorization Cauchy Nonnegative Matrix Factorization Antoine Liutkus, Derry Fitzgerald, Roland Badeau To cite this version: Antoine Liutkus, Derry Fitzgerald, Roland Badeau. Cauchy Nonnegative Matrix Factorization.

More information

LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION.

LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION. LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION. Romain Hennequin Deezer R&D 10 rue d Athènes, 75009 Paris, France rhennequin@deezer.com

More information

Entropy-constrained quantization of exponentially damped sinusoids parameters

Entropy-constrained quantization of exponentially damped sinusoids parameters Entropy-constrained quantization of exponentially damped sinusoids parameters Olivier Derrien, Roland Badeau, Gaël Richard To cite this version: Olivier Derrien, Roland Badeau, Gaël Richard. Entropy-constrained

More information

NONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING. Dylan Fagot, Herwig Wendt and Cédric Févotte

NONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING. Dylan Fagot, Herwig Wendt and Cédric Févotte NONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING Dylan Fagot, Herwig Wendt and Cédric Févotte IRIT, Université de Toulouse, CNRS, Toulouse, France firstname.lastname@irit.fr ABSTRACT Traditional

More information

COLLABORATIVE SPEECH DEREVERBERATION: REGULARIZED TENSOR FACTORIZATION FOR CROWDSOURCED MULTI-CHANNEL RECORDINGS. Sanna Wager, Minje Kim

COLLABORATIVE SPEECH DEREVERBERATION: REGULARIZED TENSOR FACTORIZATION FOR CROWDSOURCED MULTI-CHANNEL RECORDINGS. Sanna Wager, Minje Kim COLLABORATIVE SPEECH DEREVERBERATION: REGULARIZED TENSOR FACTORIZATION FOR CROWDSOURCED MULTI-CHANNEL RECORDINGS Sanna Wager, Minje Kim Indiana University School of Informatics, Computing, and Engineering

More information

Music transcription with ISA and HMM

Music transcription with ISA and HMM Music transcription with ISA and HMM Emmanuel Vincent, Xavier Rodet To cite this version: Emmanuel Vincent, Xavier Rodet. Music transcription with ISA and HMM. 5th Int. Conf. on Independent Component Analysis

More information

Deep clustering-based beamforming for separation with unknown number of sources

Deep clustering-based beamforming for separation with unknown number of sources INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden Deep clustering-based beamorming or separation with unknown number o sources Takuya Higuchi, Keisuke Kinoshita, Marc Delcroix, Kateřina Žmolíková,

More information

Audio Source Separation With a Single Sensor

Audio Source Separation With a Single Sensor Audio Source Separation With a Single Sensor Laurent Benaroya, Frédéric Bimbot, Rémi Gribonval To cite this version: Laurent Benaroya, Frédéric Bimbot, Rémi Gribonval. Audio Source Separation With a Single

More information

PROJET - Spatial Audio Separation Using Projections

PROJET - Spatial Audio Separation Using Projections PROJET - Spatial Audio Separation Using Proections Derry Fitzgerald, Antoine Liutkus, Roland Badeau To cite this version: Derry Fitzgerald, Antoine Liutkus, Roland Badeau. PROJET - Spatial Audio Separation

More information

Music separation guided by cover tracks: designing the joint NMF model

Music separation guided by cover tracks: designing the joint NMF model Music separation guided by cover tracks: designing the joint NMF model Nathan Souviraà-Labastie, Emmanuel Vincent, Frédéric Bimbot To cite this version: Nathan Souviraà-Labastie, Emmanuel Vincent, Frédéric

More information

Analysis of Boyer and Moore s MJRTY algorithm

Analysis of Boyer and Moore s MJRTY algorithm Analysis of Boyer and Moore s MJRTY algorithm Laurent Alonso, Edward M. Reingold To cite this version: Laurent Alonso, Edward M. Reingold. Analysis of Boyer and Moore s MJRTY algorithm. Information Processing

More information

Covariance smoothing and consistent Wiener filtering for artifact reduction in audio source separation

Covariance smoothing and consistent Wiener filtering for artifact reduction in audio source separation Covariance smoothing and consistent Wiener filtering for artifact reduction in audio source separation Emmanuel Vincent METISS Team Inria Rennes - Bretagne Atlantique E. Vincent (Inria) Artifact reduction

More information

Phase Unmixing : Multichannel Source Separation with Magnitude Constraints

Phase Unmixing : Multichannel Source Separation with Magnitude Constraints Phase Unmixing : Multichannel Source Separation with Magnitude Constraints Antoine Deleforge, Yann Traonmilin To cite this version: Antoine Deleforge, Yann Traonmilin. Phase Unmixing : Multichannel Source

More information

Automatic Relevance Determination in Nonnegative Matrix Factorization

Automatic Relevance Determination in Nonnegative Matrix Factorization Automatic Relevance Determination in Nonnegative Matrix Factorization Vincent Y. F. Tan, Cédric Févotte To cite this version: Vincent Y. F. Tan, Cédric Févotte. Automatic Relevance Determination in Nonnegative

More information

Sound intensity as a function of sound insulation partition

Sound intensity as a function of sound insulation partition Sound intensity as a function of sound insulation partition S. Cvetkovic, R. Prascevic To cite this version: S. Cvetkovic, R. Prascevic. Sound intensity as a function of sound insulation partition. Journal

More information

Deep NMF for Speech Separation

Deep NMF for Speech Separation MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Deep NMF for Speech Separation Le Roux, J.; Hershey, J.R.; Weninger, F.J. TR2015-029 April 2015 Abstract Non-negative matrix factorization

More information

Fast adaptive ESPRIT algorithm

Fast adaptive ESPRIT algorithm Fast adaptive ESPRIT algorithm Roland Badeau, Gaël Richard, Bertrand David To cite this version: Roland Badeau, Gaël Richard, Bertrand David. Fast adaptive ESPRIT algorithm. Proc. of IEEE Workshop on Statistical

More information

Vibro-acoustic simulation of a car window

Vibro-acoustic simulation of a car window Vibro-acoustic simulation of a car window Christophe Barras To cite this version: Christophe Barras. Vibro-acoustic simulation of a car window. Société Française d Acoustique. Acoustics 12, Apr 12, Nantes,

More information

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology Non-Negative Matrix Factorization And Its Application to Audio Tuomas Virtanen Tampere University of Technology tuomas.virtanen@tut.fi 2 Contents Introduction to audio signals Spectrogram representation

More information

NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION. Julian M. Becker, Christian Sohn and Christian Rohlfing

NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION. Julian M. Becker, Christian Sohn and Christian Rohlfing NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION Julian M. ecker, Christian Sohn Christian Rohlfing Institut für Nachrichtentechnik RWTH Aachen University D-52056

More information

On monotonicity of multiplicative update rules for weighted nonnegative tensor factorization

On monotonicity of multiplicative update rules for weighted nonnegative tensor factorization On monotonicity of multiplicative update rules for weighted nonnegative tensor factorization Alexey Ozerov, Ngoc Duong, Louis Chevallier To cite this version: Alexey Ozerov, Ngoc Duong, Louis Chevallier.

More information

Independent Component Analysis and Unsupervised Learning. Jen-Tzung Chien

Independent Component Analysis and Unsupervised Learning. Jen-Tzung Chien Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent voices Nonparametric likelihood

More information

The constrained stochastic matched filter subspace tracking

The constrained stochastic matched filter subspace tracking The constrained stochastic matched filter subspace tracking Maissa Chagmani, Bernard Xerri, Bruno Borloz, Claude Jauffret To cite this version: Maissa Chagmani, Bernard Xerri, Bruno Borloz, Claude Jauffret.

More information

Automatic detection of the number of Raypaths

Automatic detection of the number of Raypaths Automatic detection of the number of Raypaths Longyu Jiang, Jerome Mars To cite this version: Longyu Jiang, Jerome Mars. Automatic detection of the number of Raypaths. OCEANS MTS/IEEE Kona - Oceans of

More information

EUSIPCO

EUSIPCO EUSIPCO 213 1569744273 GAMMA HIDDEN MARKOV MODEL AS A PROBABILISTIC NONNEGATIVE MATRIX FACTORIZATION Nasser Mohammadiha, W. Bastiaan Kleijn, Arne Leijon KTH Royal Institute of Technology, Department of

More information

Sparsity Measure and the Detection of Significant Data

Sparsity Measure and the Detection of Significant Data Sparsity Measure and the Detection of Significant Data Abdourrahmane Atto, Dominique Pastor, Grégoire Mercier To cite this version: Abdourrahmane Atto, Dominique Pastor, Grégoire Mercier. Sparsity Measure

More information

Unbiased minimum variance estimation for systems with unknown exogenous inputs

Unbiased minimum variance estimation for systems with unknown exogenous inputs Unbiased minimum variance estimation for systems with unknown exogenous inputs Mohamed Darouach, Michel Zasadzinski To cite this version: Mohamed Darouach, Michel Zasadzinski. Unbiased minimum variance

More information

Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition

Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, Roland Badeau To cite this version: Mathieu

More information

Classification of high dimensional data: High Dimensional Discriminant Analysis

Classification of high dimensional data: High Dimensional Discriminant Analysis Classification of high dimensional data: High Dimensional Discriminant Analysis Charles Bouveyron, Stephane Girard, Cordelia Schmid To cite this version: Charles Bouveyron, Stephane Girard, Cordelia Schmid.

More information

Completeness of the Tree System for Propositional Classical Logic

Completeness of the Tree System for Propositional Classical Logic Completeness of the Tree System for Propositional Classical Logic Shahid Rahman To cite this version: Shahid Rahman. Completeness of the Tree System for Propositional Classical Logic. Licence. France.

More information

On Symmetric Norm Inequalities And Hermitian Block-Matrices

On Symmetric Norm Inequalities And Hermitian Block-Matrices On Symmetric Norm Inequalities And Hermitian lock-matrices Antoine Mhanna To cite this version: Antoine Mhanna On Symmetric Norm Inequalities And Hermitian lock-matrices 015 HAL Id: hal-0131860

More information

Independent Component Analysis and Unsupervised Learning

Independent Component Analysis and Unsupervised Learning Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien National Cheng Kung University TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent

More information

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms Masahiro Nakano 1, Jonathan Le Roux 2, Hirokazu Kameoka 2,YuKitano 1, Nobutaka Ono 1,

More information

Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification

Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification Hafiz Mustafa and Wenwu Wang Centre for Vision, Speech and Signal Processing (CVSSP) University of Surrey,

More information

Determination of absorption characteristic of materials on basis of sound intensity measurement

Determination of absorption characteristic of materials on basis of sound intensity measurement Determination of absorption characteristic of materials on basis of sound intensity measurement R. Prascevic, A. Milosevic, S. Cvetkovic To cite this version: R. Prascevic, A. Milosevic, S. Cvetkovic.

More information

Environmental Sound Classification in Realistic Situations

Environmental Sound Classification in Realistic Situations Environmental Sound Classification in Realistic Situations K. Haddad, W. Song Brüel & Kjær Sound and Vibration Measurement A/S, Skodsborgvej 307, 2850 Nærum, Denmark. X. Valero La Salle, Universistat Ramon

More information

Dept. Electronics and Electrical Engineering, Keio University, Japan. NTT Communication Science Laboratories, NTT Corporation, Japan.

Dept. Electronics and Electrical Engineering, Keio University, Japan. NTT Communication Science Laboratories, NTT Corporation, Japan. JOINT SEPARATION AND DEREVERBERATION OF REVERBERANT MIXTURES WITH DETERMINED MULTICHANNEL NON-NEGATIVE MATRIX FACTORIZATION Hideaki Kagami, Hirokazu Kameoka, Masahiro Yukawa Dept. Electronics and Electrical

More information

Estimation and detection of a periodic signal

Estimation and detection of a periodic signal Estimation and detection o a periodic signal Daniel Aronsson, Erik Björnemo, Mathias Johansson Signals and Systems Group, Uppsala University, Sweden, e-mail: Daniel.Aronsson,Erik.Bjornemo,Mathias.Johansson}@Angstrom.uu.se

More information

Methylation-associated PHOX2B gene silencing is a rare event in human neuroblastoma.

Methylation-associated PHOX2B gene silencing is a rare event in human neuroblastoma. Methylation-associated PHOX2B gene silencing is a rare event in human neuroblastoma. Loïc De Pontual, Delphine Trochet, Franck Bourdeaut, Sophie Thomas, Heather Etchevers, Agnes Chompret, Véronique Minard,

More information

Estimating Correlation Coefficient Between Two Complex Signals Without Phase Observation

Estimating Correlation Coefficient Between Two Complex Signals Without Phase Observation Estimating Correlation Coefficient Between Two Complex Signals Without Phase Observation Shigeki Miyabe 1B, Notubaka Ono 2, and Shoji Makino 1 1 University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki

More information

A simple test to check the optimality of sparse signal approximations

A simple test to check the optimality of sparse signal approximations A simple test to check the optimality of sparse signal approximations Rémi Gribonval, Rosa Maria Figueras I Ventura, Pierre Vergheynst To cite this version: Rémi Gribonval, Rosa Maria Figueras I Ventura,

More information

Thomas Lugand. To cite this version: HAL Id: tel

Thomas Lugand. To cite this version: HAL Id: tel Contribution à la Modélisation et à l Optimisation de la Machine Asynchrone Double Alimentation pour des Applications Hydrauliques de Pompage Turbinage Thomas Lugand To cite this version: Thomas Lugand.

More information

THE BAYESIAN ABEL BOUND ON THE MEAN SQUARE ERROR

THE BAYESIAN ABEL BOUND ON THE MEAN SQUARE ERROR THE BAYESIAN ABEL BOUND ON THE MEAN SQUARE ERROR Alexandre Renaux, Philippe Forster, Pascal Larzabal, Christ Richmond To cite this version: Alexandre Renaux, Philippe Forster, Pascal Larzabal, Christ Richmond

More information

Full-order observers for linear systems with unknown inputs

Full-order observers for linear systems with unknown inputs Full-order observers for linear systems with unknown inputs Mohamed Darouach, Michel Zasadzinski, Shi Jie Xu To cite this version: Mohamed Darouach, Michel Zasadzinski, Shi Jie Xu. Full-order observers

More information

Analyzing large-scale spike trains data with spatio-temporal constraints

Analyzing large-scale spike trains data with spatio-temporal constraints Analyzing large-scale spike trains data with spatio-temporal constraints Hassan Nasser, Olivier Marre, Selim Kraria, Thierry Viéville, Bruno Cessac To cite this version: Hassan Nasser, Olivier Marre, Selim

More information

Accounting for Missing Data in Sparse Wavelet Representation of Observation Error Correlations

Accounting for Missing Data in Sparse Wavelet Representation of Observation Error Correlations Accounting for Missing Data in Sparse Wavelet Representation of Observation Error Correlations Vincent Chabot, Maëlle Nodet, Arthur Vidard To cite this version: Vincent Chabot, Maëlle Nodet, Arthur Vidard.

More information

Best linear unbiased prediction when error vector is correlated with other random vectors in the model

Best linear unbiased prediction when error vector is correlated with other random vectors in the model Best linear unbiased prediction when error vector is correlated with other random vectors in the model L.R. Schaeffer, C.R. Henderson To cite this version: L.R. Schaeffer, C.R. Henderson. Best linear unbiased

More information

Learning an Adaptive Dictionary Structure for Efficient Image Sparse Coding

Learning an Adaptive Dictionary Structure for Efficient Image Sparse Coding Learning an Adaptive Dictionary Structure for Efficient Image Sparse Coding Jérémy Aghaei Mazaheri, Christine Guillemot, Claude Labit To cite this version: Jérémy Aghaei Mazaheri, Christine Guillemot,

More information

A new simple recursive algorithm for finding prime numbers using Rosser s theorem

A new simple recursive algorithm for finding prime numbers using Rosser s theorem A new simple recursive algorithm for finding prime numbers using Rosser s theorem Rédoane Daoudi To cite this version: Rédoane Daoudi. A new simple recursive algorithm for finding prime numbers using Rosser

More information

Acoustic Vector Sensor based Speech Source Separation with Mixed Gaussian-Laplacian Distributions

Acoustic Vector Sensor based Speech Source Separation with Mixed Gaussian-Laplacian Distributions Acoustic Vector Sensor based Speech Source Separation with Mixed Gaussian-Laplacian Distributions Xiaoyi Chen, Atiyeh Alinaghi, Xionghu Zhong and Wenwu Wang Department of Acoustic Engineering, School of

More information

Matrix Factorization for Speech Enhancement

Matrix Factorization for Speech Enhancement Matrix Factorization for Speech Enhancement Peter Li Peter.Li@nyu.edu Yijun Xiao ryjxiao@nyu.edu 1 Introduction In this report, we explore techniques for speech enhancement using matrix factorization.

More information

(3) where the mixing vector is the Fourier transform of are the STFT coefficients of the sources I. INTRODUCTION

(3) where the mixing vector is the Fourier transform of are the STFT coefficients of the sources I. INTRODUCTION 1830 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 7, SEPTEMBER 2010 Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model Ngoc Q.

More information

Tropical Graph Signal Processing

Tropical Graph Signal Processing Tropical Graph Signal Processing Vincent Gripon To cite this version: Vincent Gripon. Tropical Graph Signal Processing. 2017. HAL Id: hal-01527695 https://hal.archives-ouvertes.fr/hal-01527695v2

More information

Some explanations about the IWLS algorithm to fit generalized linear models

Some explanations about the IWLS algorithm to fit generalized linear models Some explanations about the IWLS algorithm to fit generalized linear models Christophe Dutang To cite this version: Christophe Dutang. Some explanations about the IWLS algorithm to fit generalized linear

More information

JOINT ACOUSTIC AND SPECTRAL MODELING FOR SPEECH DEREVERBERATION USING NON-NEGATIVE REPRESENTATIONS

JOINT ACOUSTIC AND SPECTRAL MODELING FOR SPEECH DEREVERBERATION USING NON-NEGATIVE REPRESENTATIONS JOINT ACOUSTIC AND SPECTRAL MODELING FOR SPEECH DEREVERBERATION USING NON-NEGATIVE REPRESENTATIONS Nasser Mohammadiha Paris Smaragdis Simon Doclo Dept. of Medical Physics and Acoustics and Cluster of Excellence

More information

From Unstructured 3D Point Clouds to Structured Knowledge - A Semantics Approach

From Unstructured 3D Point Clouds to Structured Knowledge - A Semantics Approach From Unstructured 3D Point Clouds to Structured Knowledge - A Semantics Approach Christophe Cruz, Helmi Ben Hmida, Frank Boochs, Christophe Nicolle To cite this version: Christophe Cruz, Helmi Ben Hmida,

More information

Multiple sensor fault detection in heat exchanger system

Multiple sensor fault detection in heat exchanger system Multiple sensor fault detection in heat exchanger system Abdel Aïtouche, Didier Maquin, Frédéric Busson To cite this version: Abdel Aïtouche, Didier Maquin, Frédéric Busson. Multiple sensor fault detection

More information

Source counting in real-time sound source localization using a circular microphone array

Source counting in real-time sound source localization using a circular microphone array Source counting in real-time sound source localization using a circular microphone array Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris To cite this version: Despoina Pavlidi,

More information

Author manuscript, published in "7th International Symposium on Computer Music Modeling and Retrieval (CMMR 2010), Málaga : Spain (2010)"

Author manuscript, published in 7th International Symposium on Computer Music Modeling and Retrieval (CMMR 2010), Málaga : Spain (2010) Author manuscript, published in "7th International Symposium on Computer Music Modeling and Retrieval (CMMR 2010), Málaga : Spain (2010)" Notes on nonnegative tensor factorization of the spectrogram for

More information

Underdetermined Instantaneous Audio Source Separation via Local Gaussian Modeling

Underdetermined Instantaneous Audio Source Separation via Local Gaussian Modeling Underdetermined Instantaneous Audio Source Separation via Local Gaussian Modeling Emmanuel Vincent, Simon Arberet, and Rémi Gribonval METISS Group, IRISA-INRIA Campus de Beaulieu, 35042 Rennes Cedex, France

More information

Phase-dependent anisotropic Gaussian model for audio source separation

Phase-dependent anisotropic Gaussian model for audio source separation Phase-dependent anisotropic Gaussian model for audio source separation Paul Magron, Roland Badeau, Bertrand David To cite this version: Paul Magron, Roland Badeau, Bertrand David. Phase-dependent anisotropic

More information

Smart Bolometer: Toward Monolithic Bolometer with Smart Functions

Smart Bolometer: Toward Monolithic Bolometer with Smart Functions Smart Bolometer: Toward Monolithic Bolometer with Smart Functions Matthieu Denoual, Gilles Allègre, Patrick Attia, Olivier De Sagazan To cite this version: Matthieu Denoual, Gilles Allègre, Patrick Attia,

More information

Finite Volume for Fusion Simulations

Finite Volume for Fusion Simulations Finite Volume for Fusion Simulations Elise Estibals, Hervé Guillard, Afeintou Sangam To cite this version: Elise Estibals, Hervé Guillard, Afeintou Sangam. Finite Volume for Fusion Simulations. Jorek Meeting

More information

Posterior Covariance vs. Analysis Error Covariance in Data Assimilation

Posterior Covariance vs. Analysis Error Covariance in Data Assimilation Posterior Covariance vs. Analysis Error Covariance in Data Assimilation François-Xavier Le Dimet, Victor Shutyaev, Igor Gejadze To cite this version: François-Xavier Le Dimet, Victor Shutyaev, Igor Gejadze.

More information

On the Griesmer bound for nonlinear codes

On the Griesmer bound for nonlinear codes On the Griesmer bound for nonlinear codes Emanuele Bellini, Alessio Meneghetti To cite this version: Emanuele Bellini, Alessio Meneghetti. On the Griesmer bound for nonlinear codes. Pascale Charpin, Nicolas

More information

Hardware Operator for Simultaneous Sine and Cosine Evaluation

Hardware Operator for Simultaneous Sine and Cosine Evaluation Hardware Operator for Simultaneous Sine and Cosine Evaluation Arnaud Tisserand To cite this version: Arnaud Tisserand. Hardware Operator for Simultaneous Sine and Cosine Evaluation. ICASSP 6: International

More information

FORMAL TREATMENT OF RADIATION FIELD FLUCTUATIONS IN VACUUM

FORMAL TREATMENT OF RADIATION FIELD FLUCTUATIONS IN VACUUM FORMAL TREATMENT OF RADIATION FIELD FLUCTUATIONS IN VACUUM Frederic Schuller, Renaud Savalle, Michael Neumann-Spallart To cite this version: Frederic Schuller, Renaud Savalle, Michael Neumann-Spallart.

More information

Comparison of Harmonic, Geometric and Arithmetic means for change detection in SAR time series

Comparison of Harmonic, Geometric and Arithmetic means for change detection in SAR time series Comparison of Harmonic, Geometric and Arithmetic means for change detection in SAR time series Guillaume Quin, Béatrice Pinel-Puysségur, Jean-Marie Nicolas To cite this version: Guillaume Quin, Béatrice

More information

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY Jean-Rémy Gloaguen, Arnaud Can Ifsttar - LAE Route de Bouaye - CS4 44344, Bouguenais, FR jean-remy.gloaguen@ifsttar.fr Mathieu

More information

Enhancing Fetal ECG Using Gaussian Process

Enhancing Fetal ECG Using Gaussian Process Enhancing Fetal ECG Using Gaussian Process Saman Noorzadeh, Bertrand Rivet, Pierre-Yves Guméry To cite this version: Saman Noorzadeh, Bertrand Rivet, Pierre-Yves Guméry. Enhancing Fetal ECG Using Gaussian

More information

Dispersion relation results for VCS at JLab

Dispersion relation results for VCS at JLab Dispersion relation results for VCS at JLab G. Laveissiere To cite this version: G. Laveissiere. Dispersion relation results for VCS at JLab. Compton Scattering from Low to High Momentum Transfer, Mar

More information

GMM-based classification from noisy features

GMM-based classification from noisy features GMM-based classification from noisy features Alexey Ozerov, Mathieu Lagrange and Emmanuel Vincent INRIA, Centre de Rennes - Bretagne Atlantique STMS Lab IRCAM - CNRS - UPMC alexey.ozerov@inria.fr, mathieu.lagrange@ircam.fr,

More information

Widely Linear Estimation with Complex Data

Widely Linear Estimation with Complex Data Widely Linear Estimation with Complex Data Bernard Picinbono, Pascal Chevalier To cite this version: Bernard Picinbono, Pascal Chevalier. Widely Linear Estimation with Complex Data. IEEE Transactions on

More information

Blind Source Separation in Nonlinear Mixture for Colored Sources Using Signal Derivatives

Blind Source Separation in Nonlinear Mixture for Colored Sources Using Signal Derivatives Blind Source Separation in Nonlinear Mixture for Colored Sources Using Signal Derivatives Bahram Ehsandoust, Masoud Babaie-Zadeh, Christian Jutten To cite this version: Bahram Ehsandoust, Masoud Babaie-Zadeh,

More information

Easter bracelets for years

Easter bracelets for years Easter bracelets for 5700000 years Denis Roegel To cite this version: Denis Roegel. Easter bracelets for 5700000 years. [Research Report] 2014. HAL Id: hal-01009457 https://hal.inria.fr/hal-01009457

More information

Diffuse noise suppression with asynchronous microphone array based on amplitude additivity model

Diffuse noise suppression with asynchronous microphone array based on amplitude additivity model Diffuse noise suppression with asynchronous microphone array based on amplitude additivity model Yoshikazu Murase, Hironobu Chiba, Nobutaka Ono, Shigeki Miyabe, Takeshi Yamada, and Shoji Makino University

More information

Generalized Constraints for NMF with Application to Informed Source Separation

Generalized Constraints for NMF with Application to Informed Source Separation Generalized Constraints for with Application to Informed Source Separation Christian Rohlfing and Julian M. Becker Institut für Nachrichtentechnik RWTH Aachen University, D2056 Aachen, Germany Email: rohlfing@ient.rwth-aachen.de

More information

Impulse response measurement of ultrasonic transducers

Impulse response measurement of ultrasonic transducers Impulse response measurement of ultrasonic transducers F. Kadlec To cite this version: F. Kadlec. Impulse response measurement of ultrasonic transducers. Journal de Physique IV Colloque, 1994, 04 (C5),

More information

Exogenous input estimation in Electronic Power Steering (EPS) systems

Exogenous input estimation in Electronic Power Steering (EPS) systems Exogenous input estimation in Electronic Power Steering (EPS) systems Valentina Ciarla, Carlos Canudas de Wit, Franck Quaine, Violaine Cahouet To cite this version: Valentina Ciarla, Carlos Canudas de

More information

SOUND SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION INCORPORATING SPATIAL CUE AS PRIOR KNOWLEDGE

SOUND SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION INCORPORATING SPATIAL CUE AS PRIOR KNOWLEDGE SOUND SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION INCORPORATING SPATIAL CUE AS PRIOR KNOWLEDGE Yuki Mitsufuji Sony Corporation, Tokyo, Japan Axel Roebel 1 IRCAM-CNRS-UPMC UMR 9912, 75004,

More information

Passerelle entre les arts : la sculpture sonore

Passerelle entre les arts : la sculpture sonore Passerelle entre les arts : la sculpture sonore Anaïs Rolez To cite this version: Anaïs Rolez. Passerelle entre les arts : la sculpture sonore. Article destiné à l origine à la Revue de l Institut National

More information

The achievable limits of operational modal analysis. * Siu-Kui Au 1)

The achievable limits of operational modal analysis. * Siu-Kui Au 1) The achievable limits o operational modal analysis * Siu-Kui Au 1) 1) Center or Engineering Dynamics and Institute or Risk and Uncertainty, University o Liverpool, Liverpool L69 3GH, United Kingdom 1)

More information

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Mikkel N. Schmidt and Morten Mørup Technical University of Denmark Informatics and Mathematical Modelling Richard

More information

On Symmetric Norm Inequalities And Hermitian Block-Matrices

On Symmetric Norm Inequalities And Hermitian Block-Matrices On Symmetric Norm Inequalities And Hermitian lock-matrices Antoine Mhanna To cite this version: Antoine Mhanna On Symmetric Norm Inequalities And Hermitian lock-matrices 016 HAL Id: hal-0131860

More information