TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION

Size: px
Start display at page:

Download "TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION"

Transcription

1 TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION 13 th International Conference on Digital Audio Effects Romain Hennequin, Roland Badeau and Bertrand David Telecom ParisTech September 8, 2010 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 1/26

2 Introduction Musical spectrograms decomposition (on a basis of notes) Decomposition based on Non-negative Matrix Factorization (NMF) s are introduced into decomposition methods: parametric harmonic atoms makes it possible to model slight pitch variations Potential applications: Multipitch estimation/transcription Source separation Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 2/26

3 Sommaire Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 3/26

4 Contents Introduction Principle Issues Proposed solution 1 Principle Issues Proposed solution 2 3 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 4/26

5 Principle of NMF Introduction Principle Issues Proposed solution Low-rank approximation: R V ˆV = WH ˆV ft = W fr H rt r=1 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 5/26

6 Issues with NMF Introduction Principle Issues Proposed solution Pitch variations Low-rank approximation does not permit to model variations over time, such as slight pitch variations (vibrato...). Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 6/26

7 Issues with NMF Introduction Principle Issues Proposed solution Original spectrogram NMF spectrogram R = frequency (khz) 3 2 frequency (khz) time (frames) time (frames) Note with vibrato: Decomposition with a single atom. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 7/26

8 Issues with NMF Introduction Principle Issues Proposed solution Original spectrogram NMF spectrogram R = frequency (khz) 3 2 frequency (khz) time (frames) time (frames) Note with vibrato: Decomposition with 3 atoms. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 8/26

9 Proposed solution Introduction Principle Issues Proposed solution What does an atom look like in a musical spectrogram? In a musical spectrogram most of the (non-percussive) elements are instruments notes which are generally harmonic tones. Parameters of interest are generally the fundamental frequency of these tones, and the shape of the amplitudes of the harmonics. Proposed method: parametric model of spectrogram with harmonic atoms. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 9/26

10 Contents Introduction Parametric spectrogram Parametric atoms Algorithm 1 2 Parametric spectrogram Parametric atoms Algorithm 3 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 10/26

11 Parametric spectrogram Parametric spectrogram Parametric atoms Algorithm Time-varying atoms in NMF: ˆV ft = R W fr H rt ˆV ft = r=1 R r=1 W θrt fr H rt θ rt is a time-varying parameter associated to each atom. In this paper, θ rt is the fundamental frequency f rt 0 of each atom. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 11/26

12 Parametric atoms Introduction Parametric spectrogram Parametric atoms Algorithm Parametric harmonic atom construction n h (f rt W f 0 rt fr = 0 ) k=1 a k g(f kf rt 0 ) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 12/26

13 Parametric spectrogram Parametric spectrogram Parametric atoms Algorithm Hypotheses of the model The harmonic part of notes is supposed to be stationary within an analysis frame. Interferences between harmonics are supposed to be negligible. Classical hypothesis of NMF about positive summation of parts. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 13/26

14 Algorithm Introduction Parametric spectrogram Learnt parameters ˆV ft = Parametric spectrogram Parametric atoms Algorithm R n h a k g(f kf rt r=1 k=1 0 ) h rt }{{} W f 0 rt fr A divergence between V and ˆV is to be minimized w.r.t.: f0 rt : the fundamental frequency of each atom at each frame a k : the amplitudes of harmonics (Atoms share the same set of amplitudes) h rt : the activation of each atom at each frame Cost function: C(f rt 0, a k, h rt ) = D(V ft ˆV ft ) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 14/26

15 Algorithm Introduction Parametric spectrogram Parametric atoms Algorithm Minimization Global optimization w.r.t. f rt 0 is impossible (numerous local minima in C). one atom is introduced for each MIDI note. Optimization thus becomes local (fine estimate of f rt 0 ). Minimization achieved with multiplicative update rules. Remark The proposed method is no longer a rank-reduction method but still reduces the data dimension. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 15/26

16 Contents Introduction Decomposition Improvement Estimated frequency Real signals Decomposition Improvement Estimated frequency Real signals Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 16/26

17 Decomposition Improvement Estimated frequency Real signals Decomposition of a synthetic spectrogram Original power spectrogram Frequency (khz) Time (frame) Spectrogram of the first bars of JS Bach s first prelude played by a synthesizer. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 17/26

18 Obtained decomposition Decomposition Improvement Estimated frequency Real signals Semitones Frames 35 Activations for each MIDI note. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 18/26

19 Obtained decomposition Decomposition Improvement Estimated frequency Real signals Decomposition Notes appear at the right place with decreasing amplitudes Numerous atoms activated at onset time Notes activated at octave, twelfth and double octave of the right note (note with many common partials). Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 19/26

20 Improvement Introduction Decomposition Improvement Estimated frequency Real signals Onset A few standard NMF atoms can be used to model onsets: ˆV ft = R r=1 W θrt fr H rt + K A fk B kt k=1 Octaves, twelfths... Add constraints to the cost function: Sparsity constraints on activations Decorrelation constraints (between activations of octaves...) Smoothness constraints on amplitudes Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 20/26

21 Obtained decomposition Decomposition Improvement Estimated frequency Real signals Semitones Frames Activations for each MIDI note. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 21/26

22 Time/frequency representation Decomposition Improvement Estimated frequency Real signals Semitones Frames Activations centered on estimated frequency for each MIDI note: vibrato appears. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 22/26

23 Issues with real signals Decomposition Improvement Estimated frequency Real signals Semitones Frames Activations for each MIDI note. (Piano sound) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 23/26

24 Issues with real signals Decomposition Improvement Estimated frequency Real signals Issues The model of amplitudes of harmonics is quite rough Issues with onsets and octaves are more important Noisy components (breath... ) Some instruments are not perfectly harmonic (piano... ) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 24/26

25 Summary New way of decomposing musical spectrograms with slight pitch variations in constituting elements. Parametric thus flexible model. Perspectives Improve decomposition to make it more adapted to real data: Better modeling of harmonic amplitudes Supervised learning of amplitudes Better onset and noise modeling Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 25/26

26 Any questions? Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 26/26

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION Romain Hennequin, Roland Badeau and Bertrand David, Institut Telecom; Telecom ParisTech; CNRS LTCI Paris France romainhennequin@telecom-paristechfr

More information

SUPPLEMENTARY MATERIAL FOR THE PAPER "A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO"

SUPPLEMENTARY MATERIAL FOR THE PAPER A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO SUPPLEMENTARY MATERIAL FOR THE PAPER "A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO" François Rigaud and Bertrand David Institut Telecom; Telecom ParisTech;

More information

10ème Congrès Français d Acoustique

10ème Congrès Français d Acoustique 1ème Congrès Français d Acoustique Lyon, 1-16 Avril 1 Spectral similarity measure invariant to pitch shifting and amplitude scaling Romain Hennequin 1, Roland Badeau 1, Bertrand David 1 1 Institut TELECOM,

More information

LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION.

LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION. LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION. Romain Hennequin Deezer R&D 10 rue d Athènes, 75009 Paris, France rhennequin@deezer.com

More information

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms Masahiro Nakano 1, Jonathan Le Roux 2, Hirokazu Kameoka 2,YuKitano 1, Nobutaka Ono 1,

More information

Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription

Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription Benoît Fuentes, Roland Badeau, Gaël Richard To cite this version: Benoît Fuentes, Roland Badeau, Gaël Richard.

More information

Oracle Analysis of Sparse Automatic Music Transcription

Oracle Analysis of Sparse Automatic Music Transcription Oracle Analysis of Sparse Automatic Music Transcription Ken O Hanlon, Hidehisa Nagano, and Mark D. Plumbley Queen Mary University of London NTT Communication Science Laboratories, NTT Corporation {keno,nagano,mark.plumbley}@eecs.qmul.ac.uk

More information

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology Non-Negative Matrix Factorization And Its Application to Audio Tuomas Virtanen Tampere University of Technology tuomas.virtanen@tut.fi 2 Contents Introduction to audio signals Spectrogram representation

More information

ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES. Sebastian Ewert Mark D. Plumbley Mark Sandler

ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES. Sebastian Ewert Mark D. Plumbley Mark Sandler ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES Sebastian Ewert Mark D. Plumbley Mark Sandler Queen Mary University of London, London, United Kingdom ABSTRACT

More information

744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011

744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011 744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011 NMF With Time Frequency Activations to Model Nonstationary Audio Events Romain Hennequin, Student Member, IEEE,

More information

MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION. Ikuo Degawa, Kei Sato, Masaaki Ikehara

MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION. Ikuo Degawa, Kei Sato, Masaaki Ikehara MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION Ikuo Degawa, Kei Sato, Masaaki Ikehara EEE Dept. Keio University Yokohama, Kanagawa 223-8522 Japan E-mail:{degawa,

More information

Jacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning

Jacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning Jacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning Herwig Wt, Dylan Fagot and Cédric Févotte IRIT, Université de Toulouse, CNRS Toulouse, France firstname.lastname@irit.fr Abstract

More information

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Mikkel N. Schmidt and Morten Mørup Technical University of Denmark Informatics and Mathematical Modelling Richard

More information

STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS

STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS 19th European Signal Processing Conference (EUSIPCO 2011) Barcelona, Spain, August 29 - September 2, 2011 STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS Ken O Hanlon and Mark D.Plumbley Queen

More information

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION Hiroazu Kameoa The University of Toyo / Nippon Telegraph and Telephone Corporation ABSTRACT This paper proposes a novel

More information

FACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION?

FACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION? FACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION? Tiago Fernandes Tavares, George Tzanetakis, Peter Driessen University of Victoria Department of

More information

Non-negative Matrix Factorization: Algorithms, Extensions and Applications

Non-negative Matrix Factorization: Algorithms, Extensions and Applications Non-negative Matrix Factorization: Algorithms, Extensions and Applications Emmanouil Benetos www.soi.city.ac.uk/ sbbj660/ March 2013 Emmanouil Benetos Non-negative Matrix Factorization March 2013 1 / 25

More information

Audio Features. Fourier Transform. Fourier Transform. Fourier Transform. Short Time Fourier Transform. Fourier Transform.

Audio Features. Fourier Transform. Fourier Transform. Fourier Transform. Short Time Fourier Transform. Fourier Transform. Advanced Course Computer Science Music Processing Summer Term 2010 Fourier Transform Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Audio Features Fourier Transform Fourier

More information

Audio Features. Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform

Audio Features. Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform Advanced Course Computer Science Music Processing Summer Term 2009 Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Audio Features Fourier Transform Tells which notes (frequencies)

More information

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015 University of Colorado at Boulder ECEN 4/5532 Lab 2 Lab report due on February 16, 2015 This is a MATLAB only lab, and therefore each student needs to turn in her/his own lab report and own programs. 1

More information

Monaural Music Separation via Supervised Non-negative Matrix Factor with Side-information

Monaural Music Separation via Supervised Non-negative Matrix Factor with Side-information Monaural Music Separation via Supervised Non-negative Matrix Factor with Side-information by Ce Peng, M.A.Sc A dissertation submitted to the Faculty of Graduate and Postdoctoral Affairs in partial fulfillment

More information

Introduction Basic Audio Feature Extraction

Introduction Basic Audio Feature Extraction Introduction Basic Audio Feature Extraction Vincent Koops (with slides by Meinhard Müller) Sound and Music Technology, December 6th, 2016 1 28 November 2017 Today g Main modules A. Sound and music for

More information

ORTHOGONALITY-REGULARIZED MASKED NMF FOR LEARNING ON WEAKLY LABELED AUDIO DATA. Iwona Sobieraj, Lucas Rencker, Mark D. Plumbley

ORTHOGONALITY-REGULARIZED MASKED NMF FOR LEARNING ON WEAKLY LABELED AUDIO DATA. Iwona Sobieraj, Lucas Rencker, Mark D. Plumbley ORTHOGONALITY-REGULARIZED MASKED NMF FOR LEARNING ON WEAKLY LABELED AUDIO DATA Iwona Sobieraj, Lucas Rencker, Mark D. Plumbley University of Surrey Centre for Vision Speech and Signal Processing Guildford,

More information

NONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING. Dylan Fagot, Herwig Wendt and Cédric Févotte

NONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING. Dylan Fagot, Herwig Wendt and Cédric Févotte NONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING Dylan Fagot, Herwig Wendt and Cédric Févotte IRIT, Université de Toulouse, CNRS, Toulouse, France firstname.lastname@irit.fr ABSTRACT Traditional

More information

PHY 103: Standing Waves and Harmonics. Segev BenZvi Department of Physics and Astronomy University of Rochester

PHY 103: Standing Waves and Harmonics. Segev BenZvi Department of Physics and Astronomy University of Rochester PHY 103: Standing Waves and Harmonics Segev BenZvi Department of Physics and Astronomy University of Rochester Sounds of the Universe NASA/JPL, September 2016 2 Properties of Waves Wavelength: λ, length

More information

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,

More information

BILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION

BILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION BILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION Tal Ben Yakar Tel Aviv University talby0@gmail.com Roee Litman Tel Aviv University roeelitman@gmail.com Alex Bronstein Tel Aviv University bron@eng.tau.ac.il

More information

Constrained Nonnegative Matrix Factorization with Applications to Music Transcription

Constrained Nonnegative Matrix Factorization with Applications to Music Transcription Constrained Nonnegative Matrix Factorization with Applications to Music Transcription by Daniel Recoskie A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the

More information

Sound 2: frequency analysis

Sound 2: frequency analysis COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation Pressure

More information

Machine Learning for Signal Processing Non-negative Matrix Factorization

Machine Learning for Signal Processing Non-negative Matrix Factorization Machine Learning for Signal Processing Non-negative Matrix Factorization Class 10. 3 Oct 2017 Instructor: Bhiksha Raj With examples and slides from Paris Smaragdis 11755/18797 1 A Quick Recap X B W D D

More information

Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis

Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis LETTER Communicated by Andrzej Cichocki Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis Cédric Févotte fevotte@telecom-paristech.fr Nancy Bertin nbertin@telecom-paristech.fr

More information

NONNEGATIVE matrix factorization (NMF) is a powerful

NONNEGATIVE matrix factorization (NMF) is a powerful IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 2, NO. 2, DECEMBER 200 869 Stability Analysis of Multiplicative Update Algorithms and Application to Nonnegative Matrix Factorization Roland Badeau, Senior Member,

More information

THE task of identifying the environment in which a sound

THE task of identifying the environment in which a sound 1 Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Victor Bisot, Romain Serizel, Slim Essid, and Gaël Richard Abstract In this paper, we study the usefulness of various

More information

Variational inference

Variational inference Simon Leglaive Télécom ParisTech, CNRS LTCI, Université Paris Saclay November 18, 2016, Télécom ParisTech, Paris, France. Outline Introduction Probabilistic model Problem Log-likelihood decomposition EM

More information

Fast adaptive ESPRIT algorithm

Fast adaptive ESPRIT algorithm Fast adaptive ESPRIT algorithm Roland Badeau, Gaël Richard, Bertrand David To cite this version: Roland Badeau, Gaël Richard, Bertrand David. Fast adaptive ESPRIT algorithm. Proc. of IEEE Workshop on Statistical

More information

Stability analysis of multiplicative update algorithms and application to non-negative matrix factorization

Stability analysis of multiplicative update algorithms and application to non-negative matrix factorization Stability analysis of multiplicative update algorithms and application to non-negative matrix factorization Analyse de la stabilité des règles de mises à jour multiplicatives et application à la factorisation

More information

Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification

Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard To cite this version: Victor Bisot, Romain Serizel, Slim Essid,

More information

SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION. Tomohiko Nakamura and Hirokazu Kameoka,

SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION. Tomohiko Nakamura and Hirokazu Kameoka, SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION Tomohiko Nakamura and Hirokazu Kameoka, Graduate School of Information Science and Technology,

More information

Unsupervised Analysis of Polyphonic Music by Sparse Coding

Unsupervised Analysis of Polyphonic Music by Sparse Coding Unsupervised Analysis of Polyphonic Music by Sparse Coding Samer A. Abdallah and Mark D. Plumbley IEEE Transactions on Neural Networks, 17(1), 179-196, January 2006 Abstract We investigate a data-driven

More information

Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence

Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence Arnaud Dessein, Arshia Cont, Guillaume Lemaitre To cite this version: Arnaud Dessein, Arshia Cont, Guillaume

More information

Sparse and Shift-Invariant Feature Extraction From Non-Negative Data

Sparse and Shift-Invariant Feature Extraction From Non-Negative Data MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Sparse and Shift-Invariant Feature Extraction From Non-Negative Data Paris Smaragdis, Bhiksha Raj, Madhusudana Shashanka TR8-3 August 8 Abstract

More information

Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs

Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs Paris Smaragdis TR2004-104 September

More information

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS Emad M. Grais and Hakan Erdogan Faculty of Engineering and Natural Sciences, Sabanci University, Orhanli

More information

Topic 3: Fourier Series (FS)

Topic 3: Fourier Series (FS) ELEC264: Signals And Systems Topic 3: Fourier Series (FS) o o o o Introduction to frequency analysis of signals CT FS Fourier series of CT periodic signals Signal Symmetry and CT Fourier Series Properties

More information

Low-Rank Time-Frequency Synthesis

Low-Rank Time-Frequency Synthesis Low-Rank - Synthesis Cédric Févotte Laboratoire Lagrange CNRS, OCA & Université de Nice Nice, France cfevotte@unice.fr Matthieu Kowalski Laboratoire des Signaux et Systèmes CNRS, Supélec & Université Paris-Sud

More information

Signal Processing COS 323

Signal Processing COS 323 Signal Processing COS 323 Digital Signals D: functions of space or time e.g., sound 2D: often functions of 2 spatial dimensions e.g. images 3D: functions of 3 spatial dimensions CAT, MRI scans or 2 space,

More information

Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification

Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification Hafiz Mustafa and Wenwu Wang Centre for Vision, Speech and Signal Processing (CVSSP) University of Surrey,

More information

VARIATIONAL BAYESIAN EM ALGORITHM FOR MODELING MIXTURES OF NON-STATIONARY SIGNALS IN THE TIME-FREQUENCY DOMAIN (HR-NMF)

VARIATIONAL BAYESIAN EM ALGORITHM FOR MODELING MIXTURES OF NON-STATIONARY SIGNALS IN THE TIME-FREQUENCY DOMAIN (HR-NMF) VARIATIONAL BAYESIAN EM ALGORITHM FOR MODELING MIXTURES OF NON-STATIONARY SIGNALS IN THE TIME-FREQUENCY DOMAIN HR-NMF Roland Badeau, Angélique Drémeau Institut Mines-Telecom, Telecom ParisTech, CNRS LTCI

More information

Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF)

Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF) Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain HR-NMF Roland Badeau, Angélique Dremeau To cite this version: Roland Badeau, Angélique Dremeau.

More information

Time-Frequency Analysis

Time-Frequency Analysis Time-Frequency Analysis Basics of Fourier Series Philippe B. aval KSU Fall 015 Philippe B. aval (KSU) Fourier Series Fall 015 1 / 0 Introduction We first review how to derive the Fourier series of a function.

More information

Optimal spectral transportation with application to music transcription

Optimal spectral transportation with application to music transcription Optimal spectral transportation with application to music transcription Rémi Flamary, Cédric Févotte, Nicolas Courty, Valentin Emiya To cite this version: Rémi Flamary, Cédric Févotte, Nicolas Courty,

More information

Power-Scaled Spectral Flux and Peak-Valley Group-Delay Methods for Robust Musical Onset Detection

Power-Scaled Spectral Flux and Peak-Valley Group-Delay Methods for Robust Musical Onset Detection Power-Scaled Spectral Flux and Peak-Valley Group-Delay Methods for Robust Musical Onset Detection Li Su CITI, Academia Sinica, Taipei, Taiwan lisu@citi.sinica.edu.tw Yi-Hsuan Yang CITI, Academia Sinica,

More information

Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution

Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution Clément Laroche, Hélène Papadopoulos, Matthieu Kowalski, Gaël Richard To cite this version: Clément

More information

Discovering Convolutive Speech Phones using Sparseness and Non-Negativity Constraints

Discovering Convolutive Speech Phones using Sparseness and Non-Negativity Constraints Discovering Convolutive Speech Phones using Sparseness and Non-Negativity Constraints Paul D. O Grady and Barak A. Pearlmutter Hamilton Institute, National University of Ireland Maynooth, Co. Kildare,

More information

Complex NMF under phase constraints based on signal modeling: application to audio source separation

Complex NMF under phase constraints based on signal modeling: application to audio source separation Complex NMF under phase constraints based on signal modeling: application to audio source separation Paul Magron, Roland Badeau, Bertrand David To cite this version: Paul Magron, Roland Badeau, Bertrand

More information

Scalable audio separation with light Kernel Additive Modelling

Scalable audio separation with light Kernel Additive Modelling Scalable audio separation with light Kernel Additive Modelling Antoine Liutkus 1, Derry Fitzgerald 2, Zafar Rafii 3 1 Inria, Université de Lorraine, LORIA, UMR 7503, France 2 NIMBUS Centre, Cork Institute

More information

On Spectral Basis Selection for Single Channel Polyphonic Music Separation

On Spectral Basis Selection for Single Channel Polyphonic Music Separation On Spectral Basis Selection for Single Channel Polyphonic Music Separation Minje Kim and Seungjin Choi Department of Computer Science Pohang University of Science and Technology San 31 Hyoja-dong, Nam-gu

More information

Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization

Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization Fei Sha and Lawrence K. Saul Dept. of Computer and Information Science University of Pennsylvania, Philadelphia,

More information

NON-NEGATIVE MATRIX FACTORIZATION WITH SELECTIVE SPARSITY CONSTRAINTS FOR TRANSCRIPTION OF BELL CHIMING RECORDINGS

NON-NEGATIVE MATRIX FACTORIZATION WITH SELECTIVE SPARSITY CONSTRAINTS FOR TRANSCRIPTION OF BELL CHIMING RECORDINGS NON-NEGAIVE MARIX ACORIZAION WIH SELECIVE SPARSIY CONSRAINS OR RANSCRIPION O BELL CHIMING RECORDINGS ABSRAC he paper presents a method for automatic transcription of recordings of bell chiming performances.

More information

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY Jean-Rémy Gloaguen, Arnaud Can Ifsttar - LAE Route de Bouaye - CS4 44344, Bouguenais, FR jean-remy.gloaguen@ifsttar.fr Mathieu

More information

Convolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence

Convolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence Convolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence Fabio Louvatti do Carmo; Evandro Ottoni Teatini Salles Abstract This paper proposes a modification of the

More information

CONVOLUTIVE NON-NEGATIVE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT

CONVOLUTIVE NON-NEGATIVE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT CONOLUTIE NON-NEGATIE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT Paul D. O Grady Barak A. Pearlmutter Hamilton Institute National University of Ireland, Maynooth Co. Kildare, Ireland. ABSTRACT Discovering

More information

Single-channel source separation using non-negative matrix factorization

Single-channel source separation using non-negative matrix factorization Single-channel source separation using non-negative matrix factorization Mikkel N. Schmidt Technical University of Denmark mns@imm.dtu.dk www.mikkelschmidt.dk DTU Informatics Department of Informatics

More information

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES Kedar Patki University of Rochester Dept. of Electrical and Computer Engineering kedar.patki@rochester.edu ABSTRACT The paper reviews the problem of

More information

Gaussian Processes for Audio Feature Extraction

Gaussian Processes for Audio Feature Extraction Gaussian Processes for Audio Feature Extraction Dr. Richard E. Turner (ret26@cam.ac.uk) Computational and Biological Learning Lab Department of Engineering University of Cambridge Machine hearing pipeline

More information

OBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS

OBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS OBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS Grégory Cornuz 1, Emmanuel Ravelli 1,2, 1 Institut Jean Le Rond d Alembert, LAM team Université Pierre et Marie Curie - Paris

More information

Environmental Sound Classification in Realistic Situations

Environmental Sound Classification in Realistic Situations Environmental Sound Classification in Realistic Situations K. Haddad, W. Song Brüel & Kjær Sound and Vibration Measurement A/S, Skodsborgvej 307, 2850 Nærum, Denmark. X. Valero La Salle, Universistat Ramon

More information

Sound Recognition in Mixtures

Sound Recognition in Mixtures Sound Recognition in Mixtures Juhan Nam, Gautham J. Mysore 2, and Paris Smaragdis 2,3 Center for Computer Research in Music and Acoustics, Stanford University, 2 Advanced Technology Labs, Adobe Systems

More information

NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION. Julian M. Becker, Christian Sohn and Christian Rohlfing

NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION. Julian M. Becker, Christian Sohn and Christian Rohlfing NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION Julian M. ecker, Christian Sohn Christian Rohlfing Institut für Nachrichtentechnik RWTH Aachen University D-52056

More information

ROBUST REALTIME POLYPHONIC PITCH DETECTION

ROBUST REALTIME POLYPHONIC PITCH DETECTION ROBUST REALTIME POLYPHONIC PITCH DETECTION by John M. Thomas A Thesis Submitted to the Graduate Faculty of George Mason University In Partial fulfillment of The Requirements for the Degree of Master of

More information

Machine Learning for Signal Processing Non-negative Matrix Factorization

Machine Learning for Signal Processing Non-negative Matrix Factorization Machine Learning for Signal Processing Non-negative Matrix Factorization Class 10. Instructor: Bhiksha Raj With examples from Paris Smaragdis 11755/18797 1 The Engineer and the Musician Once upon a time

More information

STATISTICAL APPROACH FOR SOUND MODELING

STATISTICAL APPROACH FOR SOUND MODELING STATISTICAL APPROACH FOR SOUND MODELING Myriam DESAINTE-CATHERINE SCRIME - LaBRI Université de Bordeaux 1 F-33405 Talence Cedex, France myriam@labri.u-bordeaux.fr Pierre HANNA SCRIME - LaBRI Université

More information

t Tao Group Limited, Reading, RG6 IAZ, U.K. wenwu.wang(ieee.org t Samsung Electronics Research Institute, Staines, TW1 8 4QE, U.K.

t Tao Group Limited, Reading, RG6 IAZ, U.K.   wenwu.wang(ieee.org t Samsung Electronics Research Institute, Staines, TW1 8 4QE, U.K. NON-NEGATIVE MATRIX FACTORIZATION FOR NOTE ONSET DETECTION OF AUDIO SIGNALS Wenwu Wangt, Yuhui Luol, Jonathon A. Chambers', and Saeid Sanei t Tao Group Limited, Reading, RG6 IAZ, U.K. Email: wenwu.wang(ieee.org

More information

ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING. Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard

ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING. Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard LTCI, CNRS, Télćom ParisTech, Université Paris-Saclay, 75013,

More information

Automatic Relevance Determination in Nonnegative Matrix Factorization

Automatic Relevance Determination in Nonnegative Matrix Factorization Automatic Relevance Determination in Nonnegative Matrix Factorization Vincent Y. F. Tan, Cédric Févotte To cite this version: Vincent Y. F. Tan, Cédric Févotte. Automatic Relevance Determination in Nonnegative

More information

IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL.?, NO.?, MONTH

IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL.?, NO.?, MONTH IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL.?, NO.?, MONTH 016 1 Piano Transcription in the Studio Using an Extensible Alternating Directions Framework Sebastian Ewert, Member,

More information

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Tuomas Virtanen and Anssi Klapuri Tampere University of Technology, Institute of Signal Processing Korkeakoulunkatu

More information

Music transcription with ISA and HMM

Music transcription with ISA and HMM Music transcription with ISA and HMM Emmanuel Vincent, Xavier Rodet To cite this version: Emmanuel Vincent, Xavier Rodet. Music transcription with ISA and HMM. 5th Int. Conf. on Independent Component Analysis

More information

Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis

Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis Youngmin Cho yoc@cs.ucsd.edu Lawrence K. Saul saul@cs.ucsd.edu CSE Department, University of California, San Diego, 95 Gilman

More information

Deep NMF for Speech Separation

Deep NMF for Speech Separation MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Deep NMF for Speech Separation Le Roux, J.; Hershey, J.R.; Weninger, F.J. TR2015-029 April 2015 Abstract Non-negative matrix factorization

More information

Computational Methods CMSC/AMSC/MAPL 460. Fourier transform

Computational Methods CMSC/AMSC/MAPL 460. Fourier transform Computational Methods CMSC/AMSC/MAPL 460 Fourier transform Ramani Duraiswami, Dept. of Computer Science Several slides from Prof. Healy s course at UMD Fourier Methods Fourier analysis ( harmonic analysis

More information

Temporal models with low-rank spectrograms

Temporal models with low-rank spectrograms Temporal models with low-rank spectrograms Cédric Févotte Institut de Recherche en Informatique de Toulouse (IRIT) IEEE MLSP Aalborg, Sep. 218 Outline NMF for spectral unmixing Generalities Itakura-Saito

More information

PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals

PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals Masataka Goto National Institute of Advanced Industrial Science and Technology (AIST). IT, AIST, 1-1-1 Umezono, Tsukuba,

More information

COMP 546, Winter 2018 lecture 19 - sound 2

COMP 546, Winter 2018 lecture 19 - sound 2 Sound waves Last lecture we considered sound to be a pressure function I(X, Y, Z, t). However, sound is not just any function of those four variables. Rather, sound obeys the wave equation: 2 I(X, Y, Z,

More information

CS229 Project: Musical Alignment Discovery

CS229 Project: Musical Alignment Discovery S A A V S N N R R S CS229 Project: Musical Alignment iscovery Woodley Packard ecember 16, 2005 Introduction Logical representations of musical data are widely available in varying forms (for instance,

More information

Enforcing Harmonicity and Smoothness in Bayesian Non-negative Matrix Factorization Applied to Polyphonic Music Transcription.

Enforcing Harmonicity and Smoothness in Bayesian Non-negative Matrix Factorization Applied to Polyphonic Music Transcription. Enforcing Harmonicity and Smoothness in Bayesian Non-negative Matrix Factorization Applied to Polyphonic Music Transcription. Nancy Bertin*, Roland Badeau, Member, IEEE and Emmanuel Vincent, Member, IEEE

More information

Reverberation Impulse Response Analysis

Reverberation Impulse Response Analysis MUS424: Signal Processing Techniques for Digital Audio Effects Handout #18 Jonathan Abel, David Berners April 27, 24 Lecture #9: April 27, 24 Lecture Notes 7 Reverberation Impulse Response Analysis 1 CCRMA

More information

High resolution NMF for modeling mixtures of non-stationary signals in the time-frequency domain

High resolution NMF for modeling mixtures of non-stationary signals in the time-frequency domain High resolution NMF for modeling mixtures of non-stationary signals in the time-frequency domain NMF à haute résolution pour la modélisation de mélanges de signaux non-stationnaires dans le domaine temps-fréquence

More information

POLYPHONIC MUSIC TRANSCRIPTION BY NON-NEGATIVE SPARSE CODING OF POWER SPECTRA

POLYPHONIC MUSIC TRANSCRIPTION BY NON-NEGATIVE SPARSE CODING OF POWER SPECTRA POLYPHONIC MUSIC TRANSCRIPTION BY NON-NEGATIVE SPARSE CODING OF POWER SPECTRA Samer A. Abdallah and Mark D. Plumbley Centre for Digital Music, Queen Mary, University of London ABSTRACT We present a system

More information

arxiv: v1 [stat.ml] 6 Nov 2018

arxiv: v1 [stat.ml] 6 Nov 2018 A Quasi-Newton algorithm on the orthogonal manifold for NMF with transform learning arxiv:1811.02225v1 [stat.ml] 6 Nov 2018 Pierre Ablin, Dylan Fagot, Herwig Wendt, Alexandre Gramfort, and Cédric Févotte

More information

Source Separation Tutorial Mini-Series III: Extensions and Interpretations to Non-Negative Matrix Factorization

Source Separation Tutorial Mini-Series III: Extensions and Interpretations to Non-Negative Matrix Factorization Source Separation Tutorial Mini-Series III: Extensions and Interpretations to Non-Negative Matrix Factorization Nicholas Bryan Dennis Sun Center for Computer Research in Music and Acoustics, Stanford University

More information

Functional and Structural Implications of Non-Separability of Spectral and Temporal Responses in AI

Functional and Structural Implications of Non-Separability of Spectral and Temporal Responses in AI Functional and Structural Implications of Non-Separability of Spectral and Temporal Responses in AI Jonathan Z. Simon David J. Klein Didier A. Depireux Shihab A. Shamma and Dept. of Electrical & Computer

More information

Music Synthesis. synthesis. 1. NCTU/CSIE/ DSP Copyright 1996 C.M. LIU

Music Synthesis. synthesis. 1. NCTU/CSIE/ DSP Copyright 1996 C.M. LIU Music Synthesis synthesis. 1 pintroduction pmodeling, Synthesis, and Overview padditive Synthesis psubtractive Synthesis pnonlinear Synthesis pwavetable Synthesis psummary and Conclusions 1. Introduction

More information

A NEW DISSIMILARITY METRIC FOR THE CLUSTERING OF PARTIALS USING THE COMMON VARIATION CUE

A NEW DISSIMILARITY METRIC FOR THE CLUSTERING OF PARTIALS USING THE COMMON VARIATION CUE A NEW DISSIMILARITY METRIC FOR THE CLUSTERING OF PARTIALS USING THE COMMON VARIATION CUE Mathieu Lagrange SCRIME LaBRI, Université Bordeaux 1 351, cours de la Libération, F-33405 Talence cedex, France

More information

Ken O Hanlon and Mark B. Sandler. Centre for Digital Music Queen Mary University of London

Ken O Hanlon and Mark B. Sandler. Centre for Digital Music Queen Mary University of London AN ITERATIVE HARD THRESHOLDING APPROACH TO l 0 SPARSE HELLINGER NMF Ken O Hanlon and Mark B. Sandler Centre for Digital Music Queen Mary University of London ABSTRACT Performance of Non-negative Matrix

More information

A Generative Model for Music Transcription

A Generative Model for Music Transcription 1 A Generative Model for Music Transcription Ali Taylan Cemgil, Student Member, IEEE, Bert Kappen, Senior Member, IEEE, and David Barber Abstract In this paper we present a graphical model for polyphonic

More information

General Physics I. Lecture 14: Sinusoidal Waves. Prof. WAN, Xin ( 万歆 )

General Physics I. Lecture 14: Sinusoidal Waves. Prof. WAN, Xin ( 万歆 ) General Physics I Lecture 14: Sinusoidal Waves Prof. WAN, Xin ( 万歆 ) xinwan@zju.edu.cn http://zimp.zju.edu.cn/~xinwan/ Motivation When analyzing a linear medium that is, one in which the restoring force

More information

Topic 6. Timbre Representations

Topic 6. Timbre Representations Topic 6 Timbre Representations We often say that singer s voice is magnetic the violin sounds bright this French horn sounds solid that drum sounds dull What aspect(s) of sound are these words describing?

More information

Nonlinear Losses in Electro-acoustical Transducers Wolfgang Klippel, Daniel Knobloch

Nonlinear Losses in Electro-acoustical Transducers Wolfgang Klippel, Daniel Knobloch The Association of Loudspeaker Manufacturers & Acoustics International (ALMA) Nonlinear Losses in Electro-acoustical Transducers Wolfgang Klippel, Daniel Knobloch Institute of Acoustics and Speech Communication

More information

Melody Extraction and Musical Onset Detection

Melody Extraction and Musical Onset Detection 1 Melody Extraction and Musical Onset Detection from Framewise STFT Peak Data Harvey D. Thornburg, Student Member, IEEE, Randal J. Leistikow, Student Member, IEEE, and Jonathan Berger Abstract We propose

More information