SUPPLEMENTARY MATERIAL FOR THE PAPER "A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO"

Size: px
Start display at page:

Download "SUPPLEMENTARY MATERIAL FOR THE PAPER "A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO""

Transcription

1 SUPPLEMENTARY MATERIAL FOR THE PAPER "A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO" François Rigaud and Bertrand David Institut Telecom; Telecom ParisTech; CNRS LTCI; Paris, France Laurent Daudet Institut Langevin; Paris Diderot Univ.; ESPCI ParisTech; CNRS; Paris, France and Institut Universitaire de France This document contains supplementary material for [ and is not intended to be read separately. It provides numerical results and curves for the estimation of the inharmonicity and tuning models on the full dataset pianos), mathematical demonstrations of the update rules of the proposed inharmonicity estimation algorithm and performance comparisons with a state-of-the-art algorithm.. PARAMETRIC MODEL OF PIANO TUNING.. Model reminder The beat cancellation realized when tuning an octave interval between notes indexed by m + and m) is formulated as: F ξ,φ m + ) = F ξ,φ + B ξ ρ φ + B ξ m + ) ρ φ, where B ξ and ρ φ are respectively a model on the whole compass m [, 8 from A to C8) for the inharmonicity coefficient related to the string design) and the choices of the tuner related to the octave type choice). B ξ is defined as follows: B ξ = e b B + e b T, where b B and b T are asymptotes referring to the string design on the bass and treble bridges respectively. { bt = s T m + y T, b B = s B m + y B. Since the design of piano strings that belong to the treble bridge is standardised, {s T, y T } is fixed for every kind of pianos. It has been estimated, in [, from the data of different pianos s T 9. and y T.. ξ = {s B, y B } is the set of free parameters that are dependent on the piano model. The model related to the octave type choice is given by: ρ φ = κ )) m m erf +, α where φ = {κ, m, α} and erf corresponds to the error function. κ + sets the value of the low bass asymptote, m is a This research was partially funded by the French Agence Nationale de la Recherche, PAFI project. parameter of translation along m and α rules the slope of the decrease. By setting the tuning of a reference note as for instance f m = 9) = Hz for the A) the octave interval tuning model allows to tune all the A notes of the keyboard. The rest of the compass is then deduced by a Lagrange polynomial interpolation. Finally, an extra parameter d g corresponding to a global deviation is introduced to take into account the use of tuning forks different than the A at Hz... Results on the full dataset The presented results are obtained from separate databases, representing a total of pianos: Iowa grand piano), RWC [ grand pianos) and MAPS upright piano and grand piano synthesizer using high quality samples). Among the complete dataset, the tuning model was computed for pianos that were assumed to be well-tuned by looking at the shape of their deviation from Equal Temperament ET). The other pianos were found strongly out-of-tune; for them retuning curves are proposed.... Modelling the tuning of well-tuned pianos The results for the well-tuned pianos are presented in figures,, and. The experimental data obtained from a beforehand estimation of B, F ) on isolated note recordings) are depicted as + grey markers and the model as black lines. Subfigures, and, respectively correspond to the inharmonicity coefficient B, the octave type parameter ρ, and the deviation from ET along the compass. The values of the parameters are given in table.... Tuning pianos Because the model of octave type choice ρ φ is defined for well-tuned pianos, it cannot be used to transcribe the tuning of strongly out-of-tune pianos. In this case, it is proposed to generate retuning curves deduced from a mean model of octave type choice. The model is obtained by averaging the curves ρ over three pianos RWC #, # and Iowa grand pianos, c.f. figures, and respectively) that were assumed to be well-tuned by looking at the shape of their deviation from ET curves. From this averaged data, a mean model

2 reference data: B, m [:8 data used for the estimation of the model: B, m M reference data: B, m [:8 data used for the estimation of the model: B, m M 8 data, m [,9 model reference data d model d, +d g Fig. : RWC grand piano #. inharmonicity coefficient B, octave type parameter ρ, deviation from ET along the whole compass. The data are depicted as grey + markers and the model as black lines. : 8: : : : 8 reference data d model d, +d g data, m [,9 model : Fig. : RWC grand piano #. inharmonicity coefficient B, octave type parameter ρ, deviation from ET along the whole compass. The data are depicted as grey + markers and the model as black lines. 8: : : : ρ φ is estimated, and with the purpose to give a range in which we suppose the pianos could be retuned, we define arbitrarily a high respectively low) octave type choice as ρ φ,h = ρ φ + resp. ρ φ,l = min ρ φ, )). These curves are shown in figure. Retuning curves are then given for the pianos on figures and. Subfigures and, respectively correspond to the inharmonicity coefficient B, and the deviation from ET along the compass. The values of the parameters are given in table. RWC RWC Iowa MAPS synt. s B y B κ m α d g Table : Values of the parameters for the well-tuned pianos. RWC MAPS upright s B y B κ. m.9 ᾱ. d g fixed) Table : Values of the parameters for the detuned pianos.

3 reference data: B, m [:8 data used for the estimation of the model: B, m M reference data: B, m [:8 data used for the estimation of the model: B, m M data, m [,9 model reference data d model d, +d g : 8: : : : data, m [,9 model reference data d model d, +d g Fig. : MAPS grand piano synthesizer. inharmonicity coefficient B, octave type parameter ρ, deviation from ET along the whole compass. The data are depicted as grey + markers and the model as black lines. : 8: : : : Fig. : Iowa grand piano. inharmonicity coefficient B, octave type parameter ρ, deviation from ET along the whole compass. The data are depicted as grey + markers and the model as black lines. data ρ model ρφ model ρφ,h model ρφ,l : 8: : : : Fig. : Mean octave type choice for retuning application. Plus grey markers correspond to an average of ρ over different pianos. In black line the estimated model. With circle markers resp. in dashed line) the high resp. low) octave type choice model.

4 reference data: B, m [:8 data used for the estimation of the model: B, m M reference data d tuning model for ρφ tuning model for ρφ,h tuning model for ρφ,l Fig. : RWC grand piano #. Inharmonicity curves along the compass. Actual tuning and proposed retuning. Plus grey markers correspond to the data. The model corresponding to the octave type choice ρ φ resp. ρ φ,l, ρ φ,h ) is depicted as black line resp. black dashed line, black dashed line with circle markers). AUTOMATIC ESTIMATION OF B, F ).. Optimization problem reminder It is recalled that the goal of the algorithm is to approximate an observation matrix V K T ), composed of T magnitude spectra computed on K frequency bins), by the product of two non-negative matrices W and H. W is representing a dictionary of R spectra/atoms K R), and H an activation matrix R T ). W is based on an additive model the spectrum of a note is considered as being composed by a sum of partials), in which the partial frequencies are constrained to follow an inharmonicity relation. Each spectrum of a note, indexed by r [, R, is composed of the sum of N r partials. The partial rank is denoted by n [, N r. Each partial is parametrized by its amplitude a nr and its frequency f nr. Thus, the set of parameters for a single atom is denoted by θ r = {a nr, f nr n [, N r } and the set of parameters for the dictionary is denoted by θ = {θ r r [, R}. Finally, the expression of a parametric atom is given by: Nr W θr kr = a nr g τ ), n= where f k is the frequency of the bin with index k and g τ f k ) the magnitude of the Fourier transform of the analysis window of size τ. The spectral support of g τ f k ) is limited to its main lobe to obtain a simple expression of the update rules [ and a faster optimization. For a Hanning window, the main lobe magnitude spectrum normalized to a maximal magnitude of ) is given by: g τ f k ) =, for f k [ /τ, /τ. πτ. sinπf kτ) f k τ fk The proposed cost function is: reference data: B, m [:8 data used for the estimation of the model: B, m M reference data d tuning model for ρφ tuning model for ρφ,h tuning model for ρφ,l Fig. : MAPS upright Disklavier. Inharmonicity curves along the compass. Actual tuning and proposed retuning. Plus grey markers correspond to the data. The model corresponding to the octave type choice ρ φ resp. ρ φ,l, ρ φ,h ) is depicted as black line resp. black dashed line, black dashed line with circle markers) where Cθ, γ, H) = C θ, H) + λ C f nr, γ), C θ, H) = C f nr, γ) = K k= t= R r= n= T d β V R r= W θr kr H rt ) f nr nf r + Br n ). C corresponds to a reconstruction cost function which measures the β-divergence between the observation V and the model W θ H. C is a cost function corresponding to a regularization term) that constraint the partial frequencies of the model to follow an inharmonicity relation γ = {F r, B r r [, R}). The empirical parameter λ sets the weight of the inharmonicity constraint in the global cost function... Multiplicative algorithm Multiplicative algorithms aim at decomposing the partial derivatives of a cost function, with respect to a given parameter θ, as a difference of two positive terms: Cθ ) = P θ ) Qθ ), P θ ), Qθ ) A) and at iteratively updating the corresponding parameter according to: θ θ Qθ )/P θ ) B) Since P θ ) and Qθ ) are positives, it guarantees that parameters initialized with positive values stay positives during,

5 the optimization, and that the update is performed in the steepness direction, along the parameter axis. Indeed, if the partial derivative of the cost function is positive respectively negative), then Qθ )/P θ ) is smaller resp. bigger) than and the value of the parameter is decreased resp. increased). At a stationary point, the derivative of the cost function is null so Qθ )/P θ ) =. In the general case, no proof is given for the convergence of the algorithm or even for the decrease of the criterion. However, multiplicative update rules are widely used for solving NMF minimizations because in practice it is observed that they give satisfactory results for a reasonable number of iterations, while preserving the positiveness of the decomposition... Demonstration of the multiplicative update rules The decompositions of the global cost function, with respect to all the parameters a nr, f nr, F r, B r, r [, R, n [, N r ), that lead to the update rules given in section III.A.. of [ are demonstrated in this section.... C θ, H) partial derivative decompositions The family of β-divergences is defined as: d β x y) = ββ ) xβ + β )y β βxy β ) β R \ {, }, xlog x log y) + y x) β =, x y log x y β =. C) It s derivative with relation to y is continuous in β, and given by: d β x y) y = y β y x) D) In our case x = V a time-frequency bin of the magnitude spectrogram), and y = ˆV = R W θr kr H rt is parametrized r= by θ. Thus, the partial derivative of the reconstruction cost function with relation to a specific parameter θ is given by: with C θ, H) = K T k= t= ˆV = R ˆV r= β ˆV ˆV V ), E) W θr kr H rt, F) which can be decomposed as a difference of two positive terms ˆV R = W θr R kr W θr kr H rt H rt. G) r= r= }{{}}{{} ˆV ˆV Finally, the quantity C θ, H)/ can also be expressed as a difference of two positive quantities: C θ, H) = [ ˆV β ˆV + ˆV β ˆV V }{{} [ ˆV β ˆV P θ ) V + ˆV β ˆV } {{ } Q θ ) Derivative with respect to a nr : r [, R and n [, N r ˆV = g τ ) H rt > a nr It is then chosen, ˆV = g τ ) H rt, a nr ˆV a nr =.. H) Finally, by replacing in equation H), equations ) and ) from [ are obtained: P a nr ) = [ g τ ).H rt ). V β, Q a nr ) = [ g τ ).H rt ). V β.v. Derivative with respect to f nr : r [, R and n [, N r ˆV f nr = a nr g τ ) H rt I) J) K) Regardless the analysis window that has been used, the quantity g τ ) changes its sign since g τ f k ) has several lobes. In order to obtain a satisfying expression i.e. a difference of two positive terms), the spectral support of g τ ) is limited to its main lobe so the sign of its derivative is changing once) and its derivative is expressed as: g τ ) = f nr f k ) g τ ). L) The quantity g τ f k f nr) f k f nr stays positive on the main lobe, for every kind of analysis window an illustration can be found in [). Thus, ˆV = a nr f k g τ ) H rt, f nr ˆV M) = a nr f nr g τ ) H rt. f nr And finally, by replacing in equation H), equations 8) and 9) from [ are obtained: P f nr ) = [ f k.g ) τ ) a nr.h rt. V β f nr.g ) τ ) + a nr.h rt. V β.v, Q f nr ) = [ f k.g ) τ ) a nr.h rt. V β.v f nr.g ) τ f f nr ) + a nr.h rt. V β.

6 For the Hanning window, the derivative of g τ f) for f [ /τ, /τ is given by: g τ f) = τ f ) sinπτf) + πτf τ f ) cosπτf) πτ f τ f )... C f nr, γ) partial derivative decompositions Derivative with respect to f nr : r [, R, n [, N r C ) = f nr nf r + Br n f nr Then equations ) and ) from [ are directly obtained: { P f nr ) = f nr, Q f nr ) = nf r + Br n. N) f n / n. x Peaks Estimated inh. relation 8 9 n Derivative with respect to B r : r [, R: C B r = n= = F r ) f nr nf r + Br n n= n F r ) n f nr + Br n n F r + B r n Thus, equations ) and ) from [ are obtained: P B r ) = F r n, Q B r ) = n= n= Derivative with respect to F r : r [, R: C F r = n= n f nr + Br n. O) ) f nr nf r + Br n n ) + B r n P) An exact analytic solution allows to cancel the partial derivative corresponding to the update rule ) from [): F r = n= N r f nr n + B r n n + B r n ) n=.. Performance evaluations The results for the estimation of the inharmonicity coefficient have been compared to the state-of-the-art PFD Partial Frequencies Deviation) algorithm [,, on both synthetic and real piano tones corresponding to these used in [). As suggested in [, the evaluation is performed on A-G range m [, ). Fig. 8: Inharmonicity coefficient ground truth extraction on note G of Iowa grand piano. Magnitude spectrum in grey), and selection of the peaks corresponding to transverse vibration of the strings + markers). Results of the estimation of B and F reference values in the plane f n /n) as a function of n. + markers correspond to manually extracted peaks, and grey line to the estimated inharmonicity relation.... Reference extraction As the ground truth i.e. the inharmonicity coefficient of each tone) is not directly available the string dimensions and properties are not known for the real piano, and the synthesis parameters are not available anymore for the synthetic tones), the partial frequencies corresponding to transverse vibrations of the strings have been manually extracted, up to a rank of, from isolated note magnitude spectra. The spectra have been computed from seconds of decaying sound on frequency bins. In case multiple partials are produced by the coupling of doublet/triplet of strings, several peaks have been considered as illustrated on figure 8). In the following equation, the frequencies of these peaks are denoted by f r,n,p, where r corresponds to the index of the note, p [, P r,n to the index of the peak within n [, N r the rank of the partial. Then, the reference parameters B r, F r) have been estimated by minimizing the least absolute deviation between the inharmonicity relation and the extracted partial frequencies: P r,n Br, Fr) = argmin B r,f r) n= p= P r,n f r,n,p nf r + Br n. Q) The /P r,n factor corresponds to a weighting that allows considering multiple partials as one theoretical partials in the regression. The result of the estimation of B, F ) for G note of Iowa grand piano is depicted in figure 8 by a graphic f n /n) as a function of n.

7 Relative error in %).... PFD NMF B log. scale) PFD NMF Relative error in %) 8 PFD NMF Fig. 9: Relative error in %) of the inharmonicity coefficient estimation along the range A-G m [, ) for PFD black) and NMF grey) algorithms. The evaluation is performed on synthetic and Iowa real piano tones. PFD %) NMF %) Synthetic.8. Iowa.. Table : Relative error in %) of the inharmonicity coefficient estimation, averaged on the range A-G, for PFD and NMF algorithms on synthetic and Iowa real piano tones. 8 9 Fig. : Inharmonicity coefficient estimation along the whole compass of Iowa grand piano. Black curve corresponds to PFD estimation and grey one to NMF. monicity and tuning of the piano. Journal of the Acoustical Society of America, ), May. [ M. Goto, T. Nishimura, H. Hashiguchi, and R. Oka. Rwc music database: Music genre database and musical instrument sound database. In ISMIR, pages 9,. [ Romain Hennequin, Roland Badeau, and Bertrand David. Time-dependent parametric and harmonic templates in non-negative matrix factorization. In Proc. of the th Int. Conf. on Digital Audio Effects DAFx-), September. [ J. Rauhala, H. M. Lehtonen, and V. Välimäki. Fast automatic inharmonicity estimation algorithm. JASA Express Letters, :8 89, May. [ J. Rauhala and V. Välimäki. F estimation of inharmonic piano tones using partial frequencies deviation method. In Proc. of International Computer Music Conference ICMC ), pages, August.... Evaluation results Finally, the inharmonicity coefficient estimation results for PFD and NMF algorithms are evaluated in terms of relative error with respect to the reference: B r B r /B r. These are presented on figure 9 for synthetic and real piano Iowa grand piano with mf dynamics) sample analysis on the range A-G m [, ). Table returns these relative errors averaged for each piano and algorithm and exhibits highest performances for the NMF algorithm. On G -C8 range m [, 88) results are not quantified because of the lack of ground truth and data for the synthetic signals. However, it can be observed graphically c.f. figure ) that NMF estimates seem more consistent with typical values than these obtained by using PFD not optimized there). It is worth noting that in the presented experiments, F r was initialized to equal temperament for PFD algorithm proposed as an optional input in the code). In a second time, in order to study the influence of the initialization, we modified the PFD code so that it can take into account the same initialization of B r as the one we used for the NMF algorithm. No improvement was found for the high pitch range.. REFERENCES [ François Rigaud, Bertrand David, and Laurent Daudet. A parametric model and estimation techniques for the inhar-

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION 13 th International Conference on Digital Audio Effects Romain Hennequin, Roland Badeau and Bertrand David Telecom

More information

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION Romain Hennequin, Roland Badeau and Bertrand David, Institut Telecom; Telecom ParisTech; CNRS LTCI Paris France romainhennequin@telecom-paristechfr

More information

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms Masahiro Nakano 1, Jonathan Le Roux 2, Hirokazu Kameoka 2,YuKitano 1, Nobutaka Ono 1,

More information

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION Hiroazu Kameoa The University of Toyo / Nippon Telegraph and Telephone Corporation ABSTRACT This paper proposes a novel

More information

QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME

QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME Proc of the 12 th Int Conference on Digital Audio Effects (DAFx-09 Como Italy September 1-4 2009 QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME Katsutoshi

More information

LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION.

LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION. LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION. Romain Hennequin Deezer R&D 10 rue d Athènes, 75009 Paris, France rhennequin@deezer.com

More information

STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS

STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS 19th European Signal Processing Conference (EUSIPCO 2011) Barcelona, Spain, August 29 - September 2, 2011 STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS Ken O Hanlon and Mark D.Plumbley Queen

More information

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Mikkel N. Schmidt and Morten Mørup Technical University of Denmark Informatics and Mathematical Modelling Richard

More information

10ème Congrès Français d Acoustique

10ème Congrès Français d Acoustique 1ème Congrès Français d Acoustique Lyon, 1-16 Avril 1 Spectral similarity measure invariant to pitch shifting and amplitude scaling Romain Hennequin 1, Roland Badeau 1, Bertrand David 1 1 Institut TELECOM,

More information

Physical Modeling Synthesis

Physical Modeling Synthesis Physical Modeling Synthesis ECE 272/472 Audio Signal Processing Yujia Yan University of Rochester Table of contents 1. Introduction 2. Basics of Digital Waveguide Synthesis 3. Fractional Delay Line 4.

More information

744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011

744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011 744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011 NMF With Time Frequency Activations to Model Nonstationary Audio Events Romain Hennequin, Student Member, IEEE,

More information

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Tuomas Virtanen and Anssi Klapuri Tampere University of Technology, Institute of Signal Processing Korkeakoulunkatu

More information

Entropy-constrained quantization of exponentially damped sinusoids parameters

Entropy-constrained quantization of exponentially damped sinusoids parameters Entropy-constrained quantization of exponentially damped sinusoids parameters Olivier Derrien, Roland Badeau, Gaël Richard To cite this version: Olivier Derrien, Roland Badeau, Gaël Richard. Entropy-constrained

More information

Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs

Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs Paris Smaragdis TR2004-104 September

More information

ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES. Sebastian Ewert Mark D. Plumbley Mark Sandler

ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES. Sebastian Ewert Mark D. Plumbley Mark Sandler ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES Sebastian Ewert Mark D. Plumbley Mark Sandler Queen Mary University of London, London, United Kingdom ABSTRACT

More information

STATISTICAL APPROACH FOR SOUND MODELING

STATISTICAL APPROACH FOR SOUND MODELING STATISTICAL APPROACH FOR SOUND MODELING Myriam DESAINTE-CATHERINE SCRIME - LaBRI Université de Bordeaux 1 F-33405 Talence Cedex, France myriam@labri.u-bordeaux.fr Pierre HANNA SCRIME - LaBRI Université

More information

Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription

Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription Benoît Fuentes, Roland Badeau, Gaël Richard To cite this version: Benoît Fuentes, Roland Badeau, Gaël Richard.

More information

Oracle Analysis of Sparse Automatic Music Transcription

Oracle Analysis of Sparse Automatic Music Transcription Oracle Analysis of Sparse Automatic Music Transcription Ken O Hanlon, Hidehisa Nagano, and Mark D. Plumbley Queen Mary University of London NTT Communication Science Laboratories, NTT Corporation {keno,nagano,mark.plumbley}@eecs.qmul.ac.uk

More information

PHY 103: Standing Waves and Harmonics. Segev BenZvi Department of Physics and Astronomy University of Rochester

PHY 103: Standing Waves and Harmonics. Segev BenZvi Department of Physics and Astronomy University of Rochester PHY 103: Standing Waves and Harmonics Segev BenZvi Department of Physics and Astronomy University of Rochester Sounds of the Universe NASA/JPL, September 2016 2 Properties of Waves Wavelength: λ, length

More information

Octaves and inharmonicity in piano tuning

Octaves and inharmonicity in piano tuning Octaves and inharmonicity in piano tuning Kees van den Doel January 11, 2013 Abstract I show that equal beating 4:2/6:3 octaves (called 4:2+ octaves) results in an approximately equal beating 2:1 octave

More information

Jacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning

Jacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning Jacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning Herwig Wt, Dylan Fagot and Cédric Févotte IRIT, Université de Toulouse, CNRS Toulouse, France firstname.lastname@irit.fr Abstract

More information

GAUSSIANIZATION METHOD FOR IDENTIFICATION OF MEMORYLESS NONLINEAR AUDIO SYSTEMS

GAUSSIANIZATION METHOD FOR IDENTIFICATION OF MEMORYLESS NONLINEAR AUDIO SYSTEMS GAUSSIANIATION METHOD FOR IDENTIFICATION OF MEMORYLESS NONLINEAR AUDIO SYSTEMS I. Marrakchi-Mezghani (1),G. Mahé (2), M. Jaïdane-Saïdane (1), S. Djaziri-Larbi (1), M. Turki-Hadj Alouane (1) (1) Unité Signaux

More information

Fast adaptive ESPRIT algorithm

Fast adaptive ESPRIT algorithm Fast adaptive ESPRIT algorithm Roland Badeau, Gaël Richard, Bertrand David To cite this version: Roland Badeau, Gaël Richard, Bertrand David. Fast adaptive ESPRIT algorithm. Proc. of IEEE Workshop on Statistical

More information

THE task of identifying the environment in which a sound

THE task of identifying the environment in which a sound 1 Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Victor Bisot, Romain Serizel, Slim Essid, and Gaël Richard Abstract In this paper, we study the usefulness of various

More information

Variational inference

Variational inference Simon Leglaive Télécom ParisTech, CNRS LTCI, Université Paris Saclay November 18, 2016, Télécom ParisTech, Paris, France. Outline Introduction Probabilistic model Problem Log-likelihood decomposition EM

More information

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,

More information

Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation

Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation CENTER FOR COMPUTER RESEARCH IN MUSIC AND ACOUSTICS DEPARTMENT OF MUSIC, STANFORD UNIVERSITY REPORT NO. STAN-M-4 Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation

More information

Scalable audio separation with light Kernel Additive Modelling

Scalable audio separation with light Kernel Additive Modelling Scalable audio separation with light Kernel Additive Modelling Antoine Liutkus 1, Derry Fitzgerald 2, Zafar Rafii 3 1 Inria, Université de Lorraine, LORIA, UMR 7503, France 2 NIMBUS Centre, Cork Institute

More information

A PIANO MODEL INCLUDING LONGITUDINAL STRING VIBRATIONS

A PIANO MODEL INCLUDING LONGITUDINAL STRING VIBRATIONS 04 DAFx A PIANO MODEL INCLUDING LONGITUDINAL STRING VIBRATIONS Balázs Bank and László Sujbert Department of Measurement and Information Systems Budapest University of Technology and Economics {bank sujbert}@mit.bme.hu

More information

CONVOLUTIVE NON-NEGATIVE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT

CONVOLUTIVE NON-NEGATIVE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT CONOLUTIE NON-NEGATIE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT Paul D. O Grady Barak A. Pearlmutter Hamilton Institute National University of Ireland, Maynooth Co. Kildare, Ireland. ABSTRACT Discovering

More information

Non-Negative Tensor Factorisation for Sound Source Separation

Non-Negative Tensor Factorisation for Sound Source Separation ISSC 2005, Dublin, Sept. -2 Non-Negative Tensor Factorisation for Sound Source Separation Derry FitzGerald, Matt Cranitch φ and Eugene Coyle* φ Dept. of Electronic Engineering, Cor Institute of Technology

More information

Superposition and Standing Waves

Superposition and Standing Waves Physics 1051 Lecture 9 Superposition and Standing Waves Lecture 09 - Contents 14.5 Standing Waves in Air Columns 14.6 Beats: Interference in Time 14.7 Non-sinusoidal Waves Trivia Questions 1 How many wavelengths

More information

Sound Recognition in Mixtures

Sound Recognition in Mixtures Sound Recognition in Mixtures Juhan Nam, Gautham J. Mysore 2, and Paris Smaragdis 2,3 Center for Computer Research in Music and Acoustics, Stanford University, 2 Advanced Technology Labs, Adobe Systems

More information

CS229 Project: Musical Alignment Discovery

CS229 Project: Musical Alignment Discovery S A A V S N N R R S CS229 Project: Musical Alignment iscovery Woodley Packard ecember 16, 2005 Introduction Logical representations of musical data are widely available in varying forms (for instance,

More information

16 SUPERPOSITION & STANDING WAVES

16 SUPERPOSITION & STANDING WAVES Chapter 6 SUPERPOSITION & STANDING WAVES 6. Superposition of waves Principle of superposition: When two or more waves overlap, the resultant wave is the algebraic sum of the individual waves. Illustration:

More information

FACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION?

FACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION? FACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION? Tiago Fernandes Tavares, George Tzanetakis, Peter Driessen University of Victoria Department of

More information

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS Emad M. Grais and Hakan Erdogan Faculty of Engineering and Natural Sciences, Sabanci University, Orhanli

More information

Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution

Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution Clément Laroche, Hélène Papadopoulos, Matthieu Kowalski, Gaël Richard To cite this version: Clément

More information

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY Jean-Rémy Gloaguen, Arnaud Can Ifsttar - LAE Route de Bouaye - CS4 44344, Bouguenais, FR jean-remy.gloaguen@ifsttar.fr Mathieu

More information

Introduction Basic Audio Feature Extraction

Introduction Basic Audio Feature Extraction Introduction Basic Audio Feature Extraction Vincent Koops (with slides by Meinhard Müller) Sound and Music Technology, December 6th, 2016 1 28 November 2017 Today g Main modules A. Sound and music for

More information

Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence

Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence Arnaud Dessein, Arshia Cont, Guillaume Lemaitre To cite this version: Arnaud Dessein, Arshia Cont, Guillaume

More information

Wave Phenomena Physics 15c. Lecture 9 Wave Reflection Standing Waves

Wave Phenomena Physics 15c. Lecture 9 Wave Reflection Standing Waves Wave Phenomena Physics 15c Lecture 9 Wave Reflection Standing Waves What We Did Last Time Energy and momentum in LC transmission lines Transfer rates for normal modes: and The energy is carried by the

More information

PHY-2464 Physical Basis of Music

PHY-2464 Physical Basis of Music Physical Basis of Music Presentation 21 Percussion Instruments II Adapted from Sam Matteson s Unit 3 Session 34 Sam Trickey Mar. 27, 2005 Percussion = striking Percussion instruments divide nicely into

More information

On Spectral Basis Selection for Single Channel Polyphonic Music Separation

On Spectral Basis Selection for Single Channel Polyphonic Music Separation On Spectral Basis Selection for Single Channel Polyphonic Music Separation Minje Kim and Seungjin Choi Department of Computer Science Pohang University of Science and Technology San 31 Hyoja-dong, Nam-gu

More information

PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals

PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals Masataka Goto National Institute of Advanced Industrial Science and Technology (AIST). IT, AIST, 1-1-1 Umezono, Tsukuba,

More information

OBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS

OBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS OBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS Grégory Cornuz 1, Emmanuel Ravelli 1,2, 1 Institut Jean Le Rond d Alembert, LAM team Université Pierre et Marie Curie - Paris

More information

MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION. Ikuo Degawa, Kei Sato, Masaaki Ikehara

MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION. Ikuo Degawa, Kei Sato, Masaaki Ikehara MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION Ikuo Degawa, Kei Sato, Masaaki Ikehara EEE Dept. Keio University Yokohama, Kanagawa 223-8522 Japan E-mail:{degawa,

More information

Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification

Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard To cite this version: Victor Bisot, Romain Serizel, Slim Essid,

More information

SPEECH ANALYSIS AND SYNTHESIS

SPEECH ANALYSIS AND SYNTHESIS 16 Chapter 2 SPEECH ANALYSIS AND SYNTHESIS 2.1 INTRODUCTION: Speech signal analysis is used to characterize the spectral information of an input speech signal. Speech signal analysis [52-53] techniques

More information

Chapter 17. Superposition & Standing Waves

Chapter 17. Superposition & Standing Waves Chapter 17 Superposition & Standing Waves Superposition & Standing Waves Superposition of Waves Standing Waves MFMcGraw-PHY 2425 Chap 17Ha - Superposition - Revised: 10/13/2012 2 Wave Interference MFMcGraw-PHY

More information

Lecture 14 1/38 Phys 220. Final Exam. Wednesday, August 6 th 10:30 am 12:30 pm Phys multiple choice problems (15 points each 300 total)

Lecture 14 1/38 Phys 220. Final Exam. Wednesday, August 6 th 10:30 am 12:30 pm Phys multiple choice problems (15 points each 300 total) Lecture 14 1/38 Phys 220 Final Exam Wednesday, August 6 th 10:30 am 12:30 pm Phys 114 20 multiple choice problems (15 points each 300 total) 75% will be from Chapters 10-16 25% from Chapters 1-9 Students

More information

Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis

Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis Youngmin Cho yoc@cs.ucsd.edu Lawrence K. Saul saul@cs.ucsd.edu CSE Department, University of California, San Diego, 95 Gilman

More information

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015 University of Colorado at Boulder ECEN 4/5532 Lab 2 Lab report due on February 16, 2015 This is a MATLAB only lab, and therefore each student needs to turn in her/his own lab report and own programs. 1

More information

Phase-dependent anisotropic Gaussian model for audio source separation

Phase-dependent anisotropic Gaussian model for audio source separation Phase-dependent anisotropic Gaussian model for audio source separation Paul Magron, Roland Badeau, Bertrand David To cite this version: Paul Magron, Roland Badeau, Bertrand David. Phase-dependent anisotropic

More information

A FINITE DIFFERENCE METHOD FOR THE EXCITATION OF A DIGITAL WAVEGUIDE STRING MODEL

A FINITE DIFFERENCE METHOD FOR THE EXCITATION OF A DIGITAL WAVEGUIDE STRING MODEL A FINITE DIFFERENCE METHOD FOR THE EXCITATION OF A DIGITAL WAVEGUIDE STRING MODEL Leonardo Gabrielli 1, Luca Remaggi 1, Stefano Squartini 1 and Vesa Välimäki 2 1 Universitá Politecnica delle Marche, Ancona,

More information

Uncertainty principles for far field patterns and applications to inverse source problems

Uncertainty principles for far field patterns and applications to inverse source problems for far field patterns and applications to inverse source problems roland.griesmaier@uni-wuerzburg.de (joint work with J. Sylvester) Paris, September 17 Outline Source problems and far field patterns AregularizedPicardcriterion

More information

Optimal spectral transportation with application to music transcription

Optimal spectral transportation with application to music transcription Optimal spectral transportation with application to music transcription Rémi Flamary, Cédric Févotte, Nicolas Courty, Valentin Emiya To cite this version: Rémi Flamary, Cédric Févotte, Nicolas Courty,

More information

t Tao Group Limited, Reading, RG6 IAZ, U.K. wenwu.wang(ieee.org t Samsung Electronics Research Institute, Staines, TW1 8 4QE, U.K.

t Tao Group Limited, Reading, RG6 IAZ, U.K.   wenwu.wang(ieee.org t Samsung Electronics Research Institute, Staines, TW1 8 4QE, U.K. NON-NEGATIVE MATRIX FACTORIZATION FOR NOTE ONSET DETECTION OF AUDIO SIGNALS Wenwu Wangt, Yuhui Luol, Jonathon A. Chambers', and Saeid Sanei t Tao Group Limited, Reading, RG6 IAZ, U.K. Email: wenwu.wang(ieee.org

More information

CEPSTRAL ANALYSIS SYNTHESIS ON THE MEL FREQUENCY SCALE, AND AN ADAPTATIVE ALGORITHM FOR IT.

CEPSTRAL ANALYSIS SYNTHESIS ON THE MEL FREQUENCY SCALE, AND AN ADAPTATIVE ALGORITHM FOR IT. CEPSTRAL ANALYSIS SYNTHESIS ON THE EL FREQUENCY SCALE, AND AN ADAPTATIVE ALGORITH FOR IT. Summarized overview of the IEEE-publicated papers Cepstral analysis synthesis on the mel frequency scale by Satochi

More information

THEORY AND DESIGN OF HIGH ORDER SOUND FIELD MICROPHONES USING SPHERICAL MICROPHONE ARRAY

THEORY AND DESIGN OF HIGH ORDER SOUND FIELD MICROPHONES USING SPHERICAL MICROPHONE ARRAY THEORY AND DESIGN OF HIGH ORDER SOUND FIELD MICROPHONES USING SPHERICAL MICROPHONE ARRAY Thushara D. Abhayapala, Department of Engineering, FEIT & Department of Telecom Eng, RSISE The Australian National

More information

Lecture 3: Acoustics

Lecture 3: Acoustics CSC 83060: Speech & Audio Understanding Lecture 3: Acoustics Michael Mandel mim@sci.brooklyn.cuny.edu CUNY Graduate Center, Computer Science Program http://mr-pc.org/t/csc83060 With much content from Dan

More information

Musical Genre Classication

Musical Genre Classication Musical Genre Classication Jan Müllers RWTH Aachen, 2015 Jan Müllers Finding Disjoint Paths 1 / 15 Musical Genres The Problem Musical Genres History Automatic Speech Regocnition categorical labels created

More information

Chapter 16: Oscillatory Motion and Waves. Simple Harmonic Motion (SHM)

Chapter 16: Oscillatory Motion and Waves. Simple Harmonic Motion (SHM) Chapter 6: Oscillatory Motion and Waves Hooke s Law (revisited) F = - k x Tthe elastic potential energy of a stretched or compressed spring is PE elastic = kx / Spring-block Note: To consider the potential

More information

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology Non-Negative Matrix Factorization And Its Application to Audio Tuomas Virtanen Tampere University of Technology tuomas.virtanen@tut.fi 2 Contents Introduction to audio signals Spectrogram representation

More information

Numerical simulation of a piano soundboard under downbearing. Adrien Mamou-Mani, Joël Frelat, and Charles Besnainou. Institut Jean Le Rond d Alembert,

Numerical simulation of a piano soundboard under downbearing. Adrien Mamou-Mani, Joël Frelat, and Charles Besnainou. Institut Jean Le Rond d Alembert, Numerical simulation of a piano soundboard under downbearing Adrien Mamou-Mani, Joël Frelat, and Charles Besnainou Institut Jean Le Rond d Alembert, UPMC/CNRS (Dated: December 27, 2007) 1 Abstract A finite

More information

Linear and Non-Linear Responses to Dynamic Broad-Band Spectra in Primary Auditory Cortex

Linear and Non-Linear Responses to Dynamic Broad-Band Spectra in Primary Auditory Cortex Linear and Non-Linear Responses to Dynamic Broad-Band Spectra in Primary Auditory Cortex D. J. Klein S. A. Shamma J. Z. Simon D. A. Depireux,2,2 2 Department of Electrical Engineering Supported in part

More information

IMPROVEMENTS IN ACTIVE NOISE CONTROL OF HELICOPTER NOISE IN A MOCK CABIN ABSTRACT

IMPROVEMENTS IN ACTIVE NOISE CONTROL OF HELICOPTER NOISE IN A MOCK CABIN ABSTRACT IMPROVEMENTS IN ACTIVE NOISE CONTROL OF HELICOPTER NOISE IN A MOCK CABIN Jared K. Thomas Brigham Young University Department of Mechanical Engineering ABSTRACT The application of active noise control (ANC)

More information

Chapter 4 Discrete Fourier Transform (DFT) And Signal Spectrum

Chapter 4 Discrete Fourier Transform (DFT) And Signal Spectrum Chapter 4 Discrete Fourier Transform (DFT) And Signal Spectrum CEN352, DR. Nassim Ammour, King Saud University 1 Fourier Transform History Born 21 March 1768 ( Auxerre ). Died 16 May 1830 ( Paris ) French

More information

On the comparison of symmetric and unsymmetric formulations for experimental vibro-acoustic modal analysis

On the comparison of symmetric and unsymmetric formulations for experimental vibro-acoustic modal analysis Acoustics 8 Paris On the comparison of symmetric and unsymmetric formulations for experimental vibro-acoustic modal analysis M. Ouisse a and E. Foltete b a FEMO-S UMR CNRS 24 chemin de l Epitaphe 25 Besançon

More information

Vibrations of string. Henna Tahvanainen. November 8, ELEC-E5610 Acoustics and the Physics of Sound, Lecture 4

Vibrations of string. Henna Tahvanainen. November 8, ELEC-E5610 Acoustics and the Physics of Sound, Lecture 4 Vibrations of string EEC-E5610 Acoustics and the Physics of Sound, ecture 4 Henna Tahvanainen Department of Signal Processing and Acoustics Aalto University School of Electrical Engineering November 8,

More information

Chapter 15 Mechanical Waves

Chapter 15 Mechanical Waves Chapter 15 Mechanical Waves 1 Types of Mechanical Waves This chapter and the next are about mechanical waves waves that travel within some material called a medium. Waves play an important role in how

More information

Time-Frequency Analysis

Time-Frequency Analysis Time-Frequency Analysis Basics of Fourier Series Philippe B. aval KSU Fall 015 Philippe B. aval (KSU) Fourier Series Fall 015 1 / 0 Introduction We first review how to derive the Fourier series of a function.

More information

AN EFFICIENT MULTI-RESOLUTION SPECTRAL TRANSFORM FOR MUSIC ANALYSIS

AN EFFICIENT MULTI-RESOLUTION SPECTRAL TRANSFORM FOR MUSIC ANALYSIS 10th International Society for Music Information Retrieval Conference (ISMIR 2009) AN EFFICIENT MULTI-RESOLUTION SPECTRAL TRANSFORM FOR MUSIC ANALYSIS Pablo Cancela Martín Rocamora Ernesto López Universidad

More information

NON-NEGATIVE MATRIX FACTORIZATION WITH SELECTIVE SPARSITY CONSTRAINTS FOR TRANSCRIPTION OF BELL CHIMING RECORDINGS

NON-NEGATIVE MATRIX FACTORIZATION WITH SELECTIVE SPARSITY CONSTRAINTS FOR TRANSCRIPTION OF BELL CHIMING RECORDINGS NON-NEGAIVE MARIX ACORIZAION WIH SELECIVE SPARSIY CONSRAINS OR RANSCRIPION O BELL CHIMING RECORDINGS ABSRAC he paper presents a method for automatic transcription of recordings of bell chiming performances.

More information

Lecture 2: Acoustics. Acoustics & sound

Lecture 2: Acoustics. Acoustics & sound EE E680: Speech & Audio Processing & Recognition Lecture : Acoustics 1 3 4 The wave equation Acoustic tubes: reflections & resonance Oscillations & musical acoustics Spherical waves & room acoustics Dan

More information

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES Kedar Patki University of Rochester Dept. of Electrical and Computer Engineering kedar.patki@rochester.edu ABSTRACT The paper reviews the problem of

More information

Patch similarity under non Gaussian noise

Patch similarity under non Gaussian noise The 18th IEEE International Conference on Image Processing Brussels, Belgium, September 11 14, 011 Patch similarity under non Gaussian noise Charles Deledalle 1, Florence Tupin 1, Loïc Denis 1 Institut

More information

Bayesian harmonic models for musical signal analysis. Simon Godsill and Manuel Davy

Bayesian harmonic models for musical signal analysis. Simon Godsill and Manuel Davy Bayesian harmonic models for musical signal analysis Simon Godsill and Manuel Davy June 2, 2002 Cambridge University Engineering Department and IRCCyN UMR CNRS 6597 The work of both authors was partially

More information

The Role of Continued Fractions in Rediscovering a Xenharmonic Tuning

The Role of Continued Fractions in Rediscovering a Xenharmonic Tuning The Role of Continued Fractions in Rediscovering a Xenharmonic Tuning Jordan Schettler University of California, Santa Barbara 10/11/2012 Outline 1 Motivation 2 Physics 3 Circle of Fifths 4 Continued Fractions

More information

USING STATISTICAL ROOM ACOUSTICS FOR ANALYSING THE OUTPUT SNR OF THE MWF IN ACOUSTIC SENSOR NETWORKS. Toby Christian Lawin-Ore, Simon Doclo

USING STATISTICAL ROOM ACOUSTICS FOR ANALYSING THE OUTPUT SNR OF THE MWF IN ACOUSTIC SENSOR NETWORKS. Toby Christian Lawin-Ore, Simon Doclo th European Signal Processing Conference (EUSIPCO 1 Bucharest, Romania, August 7-31, 1 USING STATISTICAL ROOM ACOUSTICS FOR ANALYSING THE OUTPUT SNR OF THE MWF IN ACOUSTIC SENSOR NETWORKS Toby Christian

More information

Audio Features. Fourier Transform. Fourier Transform. Fourier Transform. Short Time Fourier Transform. Fourier Transform.

Audio Features. Fourier Transform. Fourier Transform. Fourier Transform. Short Time Fourier Transform. Fourier Transform. Advanced Course Computer Science Music Processing Summer Term 2010 Fourier Transform Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Audio Features Fourier Transform Fourier

More information

Synthetic description of the piano soundboard mechanical mobility

Synthetic description of the piano soundboard mechanical mobility Proceedings of 0th International Symposium on Music Acoustics (Associated Meeting of the International Congress on Acoustics) 5-31 August 010, Sydney and Katoomba, Australia Synthetic description of the

More information

The Spectral Differences in Sounds and Harmonics. Through chemical and biological processes, we, as humans, can detect the movement and

The Spectral Differences in Sounds and Harmonics. Through chemical and biological processes, we, as humans, can detect the movement and John Chambers Semester Project 5/2/16 The Spectral Differences in Sounds and Harmonics Through chemical and biological processes, we, as humans, can detect the movement and vibrations of molecules through

More information

BILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION

BILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION BILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION Tal Ben Yakar Tel Aviv University talby0@gmail.com Roee Litman Tel Aviv University roeelitman@gmail.com Alex Bronstein Tel Aviv University bron@eng.tau.ac.il

More information

Minimum bow force and the impact of torsion for the bowed string

Minimum bow force and the impact of torsion for the bowed string Minimum bow force and the impact of torsion for the bowed string 1 / 9 Robert Mores University of Applied Sciences Hamburg 1 Minimum bow force - definition Definition: transition between normal Helmholtz

More information

Audio Features. Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform

Audio Features. Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform Advanced Course Computer Science Music Processing Summer Term 2009 Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Audio Features Fourier Transform Tells which notes (frequencies)

More information

Waves Part 3A: Standing Waves

Waves Part 3A: Standing Waves Waves Part 3A: Standing Waves Last modified: 24/01/2018 Contents Links Contents Superposition Standing Waves Definition Nodes Anti-Nodes Standing Waves Summary Standing Waves on a String Standing Waves

More information

Timbral, Scale, Pitch modifications

Timbral, Scale, Pitch modifications Introduction Timbral, Scale, Pitch modifications M2 Mathématiques / Vision / Apprentissage Audio signal analysis, indexing and transformation Page 1 / 40 Page 2 / 40 Modification of playback speed Modifications

More information

An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms

An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms Sylvain Marchand To cite this version: Sylvain Marchand. An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier

More information

A Family of Nyquist Filters Based on Generalized Raised-Cosine Spectra

A Family of Nyquist Filters Based on Generalized Raised-Cosine Spectra Proc. Biennial Symp. Commun. (Kingston, Ont.), pp. 3-35, June 99 A Family of Nyquist Filters Based on Generalized Raised-Cosine Spectra Nader Sheikholeslami Peter Kabal Department of Electrical Engineering

More information

Physics 221 First Hourly Examination Prepared August 2006 Porter Johnson

Physics 221 First Hourly Examination Prepared August 2006 Porter Johnson Physics 221 First Hourly Examination Prepared August 2006 Porter Johnson PHYS 221-003/004; TEST 1; 22 September 2003 1. [25 points] The G string of a guitar has a length of 0.6 meters, with a fundamental

More information

Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF)

Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF) Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain HR-NMF Roland Badeau, Angélique Dremeau To cite this version: Roland Badeau, Angélique Dremeau.

More information

Sound 2: frequency analysis

Sound 2: frequency analysis COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation Pressure

More information

Publication V. c 2012 Copyright Holder. Reprinted with permission.

Publication V. c 2012 Copyright Holder. Reprinted with permission. Publication V R. C. D. Paiva, J. Pakarinen, and V. Välimäki. Reduced-complexity modeling of high-order nonlinear audio systems using swept-sine and principal component analysis. In Proc. AES 45th Conf.

More information

Recovery of Compactly Supported Functions from Spectrogram Measurements via Lifting

Recovery of Compactly Supported Functions from Spectrogram Measurements via Lifting Recovery of Compactly Supported Functions from Spectrogram Measurements via Lifting Mark Iwen markiwen@math.msu.edu 2017 Friday, July 7 th, 2017 Joint work with... Sami Merhi (Michigan State University)

More information

Chapter 18 Solutions

Chapter 18 Solutions Chapter 18 Solutions 18.1 he resultant wave function has the form y A 0 cos φ sin kx ω t + φ (a) A A 0 cos φ (5.00) cos (π /4) 9.4 m f ω π 100π π 600 Hz *18. We write the second wave function as hen y

More information

Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization

Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization Fei Sha and Lawrence K. Saul Dept. of Computer and Information Science University of Pennsylvania, Philadelphia,

More information

Part 2 Introduction to Microlocal Analysis

Part 2 Introduction to Microlocal Analysis Part 2 Introduction to Microlocal Analysis Birsen Yazıcı & Venky Krishnan Rensselaer Polytechnic Institute Electrical, Computer and Systems Engineering August 2 nd, 2010 Outline PART II Pseudodifferential

More information

ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING. Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard

ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING. Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard LTCI, CNRS, Télćom ParisTech, Université Paris-Saclay, 75013,

More information

SPRING 2004 Final Exam, Part A

SPRING 2004 Final Exam, Part A Physics 151 SPRING 2004 Final Exam, Part A Roster No.: Score: 23 pts. possible Exam time limit: 120 minutes. You may use calculators and both sides of 2 pages of notes, handwritten only. Closed book; no

More information