TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION
|
|
- Gervais Freeman
- 5 years ago
- Views:
Transcription
1 TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION 13 th International Conference on Digital Audio Effects Romain Hennequin, Roland Badeau and Bertrand David Telecom ParisTech September 8, 2010 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 1/26
2 Introduction Musical spectrograms decomposition (on a basis of notes) Decomposition based on Non-negative Matrix Factorization (NMF) s are introduced into decomposition methods: parametric harmonic atoms makes it possible to model slight pitch variations Potential applications: Multipitch estimation/transcription Source separation Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 2/26
3 Sommaire Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 3/26
4 Contents Introduction Principle Issues Proposed solution 1 Principle Issues Proposed solution 2 3 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 4/26
5 Principle of NMF Introduction Principle Issues Proposed solution Low-rank approximation: R V ˆV = WH ˆV ft = W fr H rt r=1 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 5/26
6 Issues with NMF Introduction Principle Issues Proposed solution Pitch variations Low-rank approximation does not permit to model variations over time, such as slight pitch variations (vibrato...). Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 6/26
7 Issues with NMF Introduction Principle Issues Proposed solution Original spectrogram NMF spectrogram R = frequency (khz) 3 2 frequency (khz) time (frames) time (frames) Note with vibrato: Decomposition with a single atom. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 7/26
8 Issues with NMF Introduction Principle Issues Proposed solution Original spectrogram NMF spectrogram R = frequency (khz) 3 2 frequency (khz) time (frames) time (frames) Note with vibrato: Decomposition with 3 atoms. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 8/26
9 Proposed solution Introduction Principle Issues Proposed solution What does an atom look like in a musical spectrogram? In a musical spectrogram most of the (non-percussive) elements are instruments notes which are generally harmonic tones. Parameters of interest are generally the fundamental frequency of these tones, and the shape of the amplitudes of the harmonics. Proposed method: parametric model of spectrogram with harmonic atoms. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 9/26
10 Contents Introduction Parametric spectrogram Parametric atoms Algorithm 1 2 Parametric spectrogram Parametric atoms Algorithm 3 Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 10/26
11 Parametric spectrogram Parametric spectrogram Parametric atoms Algorithm Time-varying atoms in NMF: ˆV ft = R W fr H rt ˆV ft = r=1 R r=1 W θrt fr H rt θ rt is a time-varying parameter associated to each atom. In this paper, θ rt is the fundamental frequency f rt 0 of each atom. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 11/26
12 Parametric atoms Introduction Parametric spectrogram Parametric atoms Algorithm Parametric harmonic atom construction n h (f rt W f 0 rt fr = 0 ) k=1 a k g(f kf rt 0 ) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 12/26
13 Parametric spectrogram Parametric spectrogram Parametric atoms Algorithm Hypotheses of the model The harmonic part of notes is supposed to be stationary within an analysis frame. Interferences between harmonics are supposed to be negligible. Classical hypothesis of NMF about positive summation of parts. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 13/26
14 Algorithm Introduction Parametric spectrogram Learnt parameters ˆV ft = Parametric spectrogram Parametric atoms Algorithm R n h a k g(f kf rt r=1 k=1 0 ) h rt }{{} W f 0 rt fr A divergence between V and ˆV is to be minimized w.r.t.: f0 rt : the fundamental frequency of each atom at each frame a k : the amplitudes of harmonics (Atoms share the same set of amplitudes) h rt : the activation of each atom at each frame Cost function: C(f rt 0, a k, h rt ) = D(V ft ˆV ft ) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 14/26
15 Algorithm Introduction Parametric spectrogram Parametric atoms Algorithm Minimization Global optimization w.r.t. f rt 0 is impossible (numerous local minima in C). one atom is introduced for each MIDI note. Optimization thus becomes local (fine estimate of f rt 0 ). Minimization achieved with multiplicative update rules. Remark The proposed method is no longer a rank-reduction method but still reduces the data dimension. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 15/26
16 Contents Introduction Decomposition Improvement Estimated frequency Real signals Decomposition Improvement Estimated frequency Real signals Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 16/26
17 Decomposition Improvement Estimated frequency Real signals Decomposition of a synthetic spectrogram Original power spectrogram Frequency (khz) Time (frame) Spectrogram of the first bars of JS Bach s first prelude played by a synthesizer. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 17/26
18 Obtained decomposition Decomposition Improvement Estimated frequency Real signals Semitones Frames 35 Activations for each MIDI note. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 18/26
19 Obtained decomposition Decomposition Improvement Estimated frequency Real signals Decomposition Notes appear at the right place with decreasing amplitudes Numerous atoms activated at onset time Notes activated at octave, twelfth and double octave of the right note (note with many common partials). Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 19/26
20 Improvement Introduction Decomposition Improvement Estimated frequency Real signals Onset A few standard NMF atoms can be used to model onsets: ˆV ft = R r=1 W θrt fr H rt + K A fk B kt k=1 Octaves, twelfths... Add constraints to the cost function: Sparsity constraints on activations Decorrelation constraints (between activations of octaves...) Smoothness constraints on amplitudes Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 20/26
21 Obtained decomposition Decomposition Improvement Estimated frequency Real signals Semitones Frames Activations for each MIDI note. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 21/26
22 Time/frequency representation Decomposition Improvement Estimated frequency Real signals Semitones Frames Activations centered on estimated frequency for each MIDI note: vibrato appears. Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 22/26
23 Issues with real signals Decomposition Improvement Estimated frequency Real signals Semitones Frames Activations for each MIDI note. (Piano sound) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 23/26
24 Issues with real signals Decomposition Improvement Estimated frequency Real signals Issues The model of amplitudes of harmonics is quite rough Issues with onsets and octaves are more important Noisy components (breath... ) Some instruments are not perfectly harmonic (piano... ) Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 24/26
25 Summary New way of decomposing musical spectrograms with slight pitch variations in constituting elements. Parametric thus flexible model. Perspectives Improve decomposition to make it more adapted to real data: Better modeling of harmonic amplitudes Supervised learning of amplitudes Better onset and noise modeling Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 25/26
26 Any questions? Romain Hennequin, Roland Badeau and Bertrand David Time-dependent parametric templates in NMF - slide 26/26
TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION
TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION Romain Hennequin, Roland Badeau and Bertrand David, Institut Telecom; Telecom ParisTech; CNRS LTCI Paris France romainhennequin@telecom-paristechfr
More informationSUPPLEMENTARY MATERIAL FOR THE PAPER "A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO"
SUPPLEMENTARY MATERIAL FOR THE PAPER "A PARAMETRIC MODEL AND ESTIMATION TECHNIQUES FOR THE INHARMONICITY AND TUNING OF THE PIANO" François Rigaud and Bertrand David Institut Telecom; Telecom ParisTech;
More information10ème Congrès Français d Acoustique
1ème Congrès Français d Acoustique Lyon, 1-16 Avril 1 Spectral similarity measure invariant to pitch shifting and amplitude scaling Romain Hennequin 1, Roland Badeau 1, Bertrand David 1 1 Institut TELECOM,
More informationLONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION.
LONG-TERM REVERBERATION MODELING FOR UNDER-DETERMINED AUDIO SOURCE SEPARATION WITH APPLICATION TO VOCAL MELODY EXTRACTION. Romain Hennequin Deezer R&D 10 rue d Athènes, 75009 Paris, France rhennequin@deezer.com
More informationNonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms
Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms Masahiro Nakano 1, Jonathan Le Roux 2, Hirokazu Kameoka 2,YuKitano 1, Nobutaka Ono 1,
More informationHarmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription
Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription Benoît Fuentes, Roland Badeau, Gaël Richard To cite this version: Benoît Fuentes, Roland Badeau, Gaël Richard.
More informationOracle Analysis of Sparse Automatic Music Transcription
Oracle Analysis of Sparse Automatic Music Transcription Ken O Hanlon, Hidehisa Nagano, and Mark D. Plumbley Queen Mary University of London NTT Communication Science Laboratories, NTT Corporation {keno,nagano,mark.plumbley}@eecs.qmul.ac.uk
More informationNon-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology
Non-Negative Matrix Factorization And Its Application to Audio Tuomas Virtanen Tampere University of Technology tuomas.virtanen@tut.fi 2 Contents Introduction to audio signals Spectrogram representation
More informationACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES. Sebastian Ewert Mark D. Plumbley Mark Sandler
ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES Sebastian Ewert Mark D. Plumbley Mark Sandler Queen Mary University of London, London, United Kingdom ABSTRACT
More information744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011
744 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 4, MAY 2011 NMF With Time Frequency Activations to Model Nonstationary Audio Events Romain Hennequin, Student Member, IEEE,
More informationMULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION. Ikuo Degawa, Kei Sato, Masaaki Ikehara
MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION Ikuo Degawa, Kei Sato, Masaaki Ikehara EEE Dept. Keio University Yokohama, Kanagawa 223-8522 Japan E-mail:{degawa,
More informationJacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning
Jacobi Algorithm For Nonnegative Matrix Factorization With Transform Learning Herwig Wt, Dylan Fagot and Cédric Févotte IRIT, Université de Toulouse, CNRS Toulouse, France firstname.lastname@irit.fr Abstract
More informationNonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation
Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Mikkel N. Schmidt and Morten Mørup Technical University of Denmark Informatics and Mathematical Modelling Richard
More informationSTRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS
19th European Signal Processing Conference (EUSIPCO 2011) Barcelona, Spain, August 29 - September 2, 2011 STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS Ken O Hanlon and Mark D.Plumbley Queen
More informationMULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka
MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION Hiroazu Kameoa The University of Toyo / Nippon Telegraph and Telephone Corporation ABSTRACT This paper proposes a novel
More informationFACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION?
FACTORS IN FACTORIZATION: DOES BETTER AUDIO SOURCE SEPARATION IMPLY BETTER POLYPHONIC MUSIC TRANSCRIPTION? Tiago Fernandes Tavares, George Tzanetakis, Peter Driessen University of Victoria Department of
More informationNon-negative Matrix Factorization: Algorithms, Extensions and Applications
Non-negative Matrix Factorization: Algorithms, Extensions and Applications Emmanouil Benetos www.soi.city.ac.uk/ sbbj660/ March 2013 Emmanouil Benetos Non-negative Matrix Factorization March 2013 1 / 25
More informationAudio Features. Fourier Transform. Fourier Transform. Fourier Transform. Short Time Fourier Transform. Fourier Transform.
Advanced Course Computer Science Music Processing Summer Term 2010 Fourier Transform Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Audio Features Fourier Transform Fourier
More informationAudio Features. Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform
Advanced Course Computer Science Music Processing Summer Term 2009 Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Audio Features Fourier Transform Tells which notes (frequencies)
More informationUniversity of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015
University of Colorado at Boulder ECEN 4/5532 Lab 2 Lab report due on February 16, 2015 This is a MATLAB only lab, and therefore each student needs to turn in her/his own lab report and own programs. 1
More informationMonaural Music Separation via Supervised Non-negative Matrix Factor with Side-information
Monaural Music Separation via Supervised Non-negative Matrix Factor with Side-information by Ce Peng, M.A.Sc A dissertation submitted to the Faculty of Graduate and Postdoctoral Affairs in partial fulfillment
More informationIntroduction Basic Audio Feature Extraction
Introduction Basic Audio Feature Extraction Vincent Koops (with slides by Meinhard Müller) Sound and Music Technology, December 6th, 2016 1 28 November 2017 Today g Main modules A. Sound and music for
More informationORTHOGONALITY-REGULARIZED MASKED NMF FOR LEARNING ON WEAKLY LABELED AUDIO DATA. Iwona Sobieraj, Lucas Rencker, Mark D. Plumbley
ORTHOGONALITY-REGULARIZED MASKED NMF FOR LEARNING ON WEAKLY LABELED AUDIO DATA Iwona Sobieraj, Lucas Rencker, Mark D. Plumbley University of Surrey Centre for Vision Speech and Signal Processing Guildford,
More informationNONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING. Dylan Fagot, Herwig Wendt and Cédric Févotte
NONNEGATIVE MATRIX FACTORIZATION WITH TRANSFORM LEARNING Dylan Fagot, Herwig Wendt and Cédric Févotte IRIT, Université de Toulouse, CNRS, Toulouse, France firstname.lastname@irit.fr ABSTRACT Traditional
More informationPHY 103: Standing Waves and Harmonics. Segev BenZvi Department of Physics and Astronomy University of Rochester
PHY 103: Standing Waves and Harmonics Segev BenZvi Department of Physics and Astronomy University of Rochester Sounds of the Universe NASA/JPL, September 2016 2 Properties of Waves Wavelength: λ, length
More informationA Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement
A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,
More informationBILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION
BILEVEL SPARSE MODELS FOR POLYPHONIC MUSIC TRANSCRIPTION Tal Ben Yakar Tel Aviv University talby0@gmail.com Roee Litman Tel Aviv University roeelitman@gmail.com Alex Bronstein Tel Aviv University bron@eng.tau.ac.il
More informationConstrained Nonnegative Matrix Factorization with Applications to Music Transcription
Constrained Nonnegative Matrix Factorization with Applications to Music Transcription by Daniel Recoskie A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the
More informationSound 2: frequency analysis
COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation Pressure
More informationMachine Learning for Signal Processing Non-negative Matrix Factorization
Machine Learning for Signal Processing Non-negative Matrix Factorization Class 10. 3 Oct 2017 Instructor: Bhiksha Raj With examples and slides from Paris Smaragdis 11755/18797 1 A Quick Recap X B W D D
More informationNonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis
LETTER Communicated by Andrzej Cichocki Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis Cédric Févotte fevotte@telecom-paristech.fr Nancy Bertin nbertin@telecom-paristech.fr
More informationNONNEGATIVE matrix factorization (NMF) is a powerful
IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 2, NO. 2, DECEMBER 200 869 Stability Analysis of Multiplicative Update Algorithms and Application to Nonnegative Matrix Factorization Roland Badeau, Senior Member,
More informationTHE task of identifying the environment in which a sound
1 Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Victor Bisot, Romain Serizel, Slim Essid, and Gaël Richard Abstract In this paper, we study the usefulness of various
More informationVariational inference
Simon Leglaive Télécom ParisTech, CNRS LTCI, Université Paris Saclay November 18, 2016, Télécom ParisTech, Paris, France. Outline Introduction Probabilistic model Problem Log-likelihood decomposition EM
More informationFast adaptive ESPRIT algorithm
Fast adaptive ESPRIT algorithm Roland Badeau, Gaël Richard, Bertrand David To cite this version: Roland Badeau, Gaël Richard, Bertrand David. Fast adaptive ESPRIT algorithm. Proc. of IEEE Workshop on Statistical
More informationStability analysis of multiplicative update algorithms and application to non-negative matrix factorization
Stability analysis of multiplicative update algorithms and application to non-negative matrix factorization Analyse de la stabilité des règles de mises à jour multiplicatives et application à la factorisation
More informationFeature Learning with Matrix Factorization Applied to Acoustic Scene Classification
Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard To cite this version: Victor Bisot, Romain Serizel, Slim Essid,
More informationSHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION. Tomohiko Nakamura and Hirokazu Kameoka,
SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION Tomohiko Nakamura and Hirokazu Kameoka, Graduate School of Information Science and Technology,
More informationUnsupervised Analysis of Polyphonic Music by Sparse Coding
Unsupervised Analysis of Polyphonic Music by Sparse Coding Samer A. Abdallah and Mark D. Plumbley IEEE Transactions on Neural Networks, 17(1), 179-196, January 2006 Abstract We investigate a data-driven
More informationReal-time polyphonic music transcription with non-negative matrix factorization and beta-divergence
Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence Arnaud Dessein, Arshia Cont, Guillaume Lemaitre To cite this version: Arnaud Dessein, Arshia Cont, Guillaume
More informationSparse and Shift-Invariant Feature Extraction From Non-Negative Data
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Sparse and Shift-Invariant Feature Extraction From Non-Negative Data Paris Smaragdis, Bhiksha Raj, Madhusudana Shashanka TR8-3 August 8 Abstract
More informationNon-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs Paris Smaragdis TR2004-104 September
More informationSINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan
SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS Emad M. Grais and Hakan Erdogan Faculty of Engineering and Natural Sciences, Sabanci University, Orhanli
More informationTopic 3: Fourier Series (FS)
ELEC264: Signals And Systems Topic 3: Fourier Series (FS) o o o o Introduction to frequency analysis of signals CT FS Fourier series of CT periodic signals Signal Symmetry and CT Fourier Series Properties
More informationLow-Rank Time-Frequency Synthesis
Low-Rank - Synthesis Cédric Févotte Laboratoire Lagrange CNRS, OCA & Université de Nice Nice, France cfevotte@unice.fr Matthieu Kowalski Laboratoire des Signaux et Systèmes CNRS, Supélec & Université Paris-Sud
More informationSignal Processing COS 323
Signal Processing COS 323 Digital Signals D: functions of space or time e.g., sound 2D: often functions of 2 spatial dimensions e.g. images 3D: functions of 3 spatial dimensions CAT, MRI scans or 2 space,
More informationSingle Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification
Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification Hafiz Mustafa and Wenwu Wang Centre for Vision, Speech and Signal Processing (CVSSP) University of Surrey,
More informationVARIATIONAL BAYESIAN EM ALGORITHM FOR MODELING MIXTURES OF NON-STATIONARY SIGNALS IN THE TIME-FREQUENCY DOMAIN (HR-NMF)
VARIATIONAL BAYESIAN EM ALGORITHM FOR MODELING MIXTURES OF NON-STATIONARY SIGNALS IN THE TIME-FREQUENCY DOMAIN HR-NMF Roland Badeau, Angélique Drémeau Institut Mines-Telecom, Telecom ParisTech, CNRS LTCI
More informationVariational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain (HR-NMF)
Variational Bayesian EM algorithm for modeling mixtures of non-stationary signals in the time-frequency domain HR-NMF Roland Badeau, Angélique Dremeau To cite this version: Roland Badeau, Angélique Dremeau.
More informationTime-Frequency Analysis
Time-Frequency Analysis Basics of Fourier Series Philippe B. aval KSU Fall 015 Philippe B. aval (KSU) Fourier Series Fall 015 1 / 0 Introduction We first review how to derive the Fourier series of a function.
More informationOptimal spectral transportation with application to music transcription
Optimal spectral transportation with application to music transcription Rémi Flamary, Cédric Févotte, Nicolas Courty, Valentin Emiya To cite this version: Rémi Flamary, Cédric Févotte, Nicolas Courty,
More informationPower-Scaled Spectral Flux and Peak-Valley Group-Delay Methods for Robust Musical Onset Detection
Power-Scaled Spectral Flux and Peak-Valley Group-Delay Methods for Robust Musical Onset Detection Li Su CITI, Academia Sinica, Taipei, Taiwan lisu@citi.sinica.edu.tw Yi-Hsuan Yang CITI, Academia Sinica,
More informationDrum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution
Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution Clément Laroche, Hélène Papadopoulos, Matthieu Kowalski, Gaël Richard To cite this version: Clément
More informationDiscovering Convolutive Speech Phones using Sparseness and Non-Negativity Constraints
Discovering Convolutive Speech Phones using Sparseness and Non-Negativity Constraints Paul D. O Grady and Barak A. Pearlmutter Hamilton Institute, National University of Ireland Maynooth, Co. Kildare,
More informationComplex NMF under phase constraints based on signal modeling: application to audio source separation
Complex NMF under phase constraints based on signal modeling: application to audio source separation Paul Magron, Roland Badeau, Bertrand David To cite this version: Paul Magron, Roland Badeau, Bertrand
More informationScalable audio separation with light Kernel Additive Modelling
Scalable audio separation with light Kernel Additive Modelling Antoine Liutkus 1, Derry Fitzgerald 2, Zafar Rafii 3 1 Inria, Université de Lorraine, LORIA, UMR 7503, France 2 NIMBUS Centre, Cork Institute
More informationOn Spectral Basis Selection for Single Channel Polyphonic Music Separation
On Spectral Basis Selection for Single Channel Polyphonic Music Separation Minje Kim and Seungjin Choi Department of Computer Science Pohang University of Science and Technology San 31 Hyoja-dong, Nam-gu
More informationReal-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization
Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization Fei Sha and Lawrence K. Saul Dept. of Computer and Information Science University of Pennsylvania, Philadelphia,
More informationNON-NEGATIVE MATRIX FACTORIZATION WITH SELECTIVE SPARSITY CONSTRAINTS FOR TRANSCRIPTION OF BELL CHIMING RECORDINGS
NON-NEGAIVE MARIX ACORIZAION WIH SELECIVE SPARSIY CONSRAINS OR RANSCRIPION O BELL CHIMING RECORDINGS ABSRAC he paper presents a method for automatic transcription of recordings of bell chiming performances.
More informationESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY
ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY Jean-Rémy Gloaguen, Arnaud Can Ifsttar - LAE Route de Bouaye - CS4 44344, Bouguenais, FR jean-remy.gloaguen@ifsttar.fr Mathieu
More informationConvolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence
Convolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence Fabio Louvatti do Carmo; Evandro Ottoni Teatini Salles Abstract This paper proposes a modification of the
More informationCONVOLUTIVE NON-NEGATIVE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT
CONOLUTIE NON-NEGATIE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT Paul D. O Grady Barak A. Pearlmutter Hamilton Institute National University of Ireland, Maynooth Co. Kildare, Ireland. ABSTRACT Discovering
More informationSingle-channel source separation using non-negative matrix factorization
Single-channel source separation using non-negative matrix factorization Mikkel N. Schmidt Technical University of Denmark mns@imm.dtu.dk www.mikkelschmidt.dk DTU Informatics Department of Informatics
More informationREVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES
REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES Kedar Patki University of Rochester Dept. of Electrical and Computer Engineering kedar.patki@rochester.edu ABSTRACT The paper reviews the problem of
More informationGaussian Processes for Audio Feature Extraction
Gaussian Processes for Audio Feature Extraction Dr. Richard E. Turner (ret26@cam.ac.uk) Computational and Biological Learning Lab Department of Engineering University of Cambridge Machine hearing pipeline
More informationOBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS
OBJECT CODING OF HARMONIC SOUNDS USING SPARSE AND STRUCTURED REPRESENTATIONS Grégory Cornuz 1, Emmanuel Ravelli 1,2, 1 Institut Jean Le Rond d Alembert, LAM team Université Pierre et Marie Curie - Paris
More informationEnvironmental Sound Classification in Realistic Situations
Environmental Sound Classification in Realistic Situations K. Haddad, W. Song Brüel & Kjær Sound and Vibration Measurement A/S, Skodsborgvej 307, 2850 Nærum, Denmark. X. Valero La Salle, Universistat Ramon
More informationSound Recognition in Mixtures
Sound Recognition in Mixtures Juhan Nam, Gautham J. Mysore 2, and Paris Smaragdis 2,3 Center for Computer Research in Music and Acoustics, Stanford University, 2 Advanced Technology Labs, Adobe Systems
More informationNMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION. Julian M. Becker, Christian Sohn and Christian Rohlfing
NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION Julian M. ecker, Christian Sohn Christian Rohlfing Institut für Nachrichtentechnik RWTH Aachen University D-52056
More informationROBUST REALTIME POLYPHONIC PITCH DETECTION
ROBUST REALTIME POLYPHONIC PITCH DETECTION by John M. Thomas A Thesis Submitted to the Graduate Faculty of George Mason University In Partial fulfillment of The Requirements for the Degree of Master of
More informationMachine Learning for Signal Processing Non-negative Matrix Factorization
Machine Learning for Signal Processing Non-negative Matrix Factorization Class 10. Instructor: Bhiksha Raj With examples from Paris Smaragdis 11755/18797 1 The Engineer and the Musician Once upon a time
More informationSTATISTICAL APPROACH FOR SOUND MODELING
STATISTICAL APPROACH FOR SOUND MODELING Myriam DESAINTE-CATHERINE SCRIME - LaBRI Université de Bordeaux 1 F-33405 Talence Cedex, France myriam@labri.u-bordeaux.fr Pierre HANNA SCRIME - LaBRI Université
More informationt Tao Group Limited, Reading, RG6 IAZ, U.K. wenwu.wang(ieee.org t Samsung Electronics Research Institute, Staines, TW1 8 4QE, U.K.
NON-NEGATIVE MATRIX FACTORIZATION FOR NOTE ONSET DETECTION OF AUDIO SIGNALS Wenwu Wangt, Yuhui Luol, Jonathon A. Chambers', and Saeid Sanei t Tao Group Limited, Reading, RG6 IAZ, U.K. Email: wenwu.wang(ieee.org
More informationACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING. Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard
ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard LTCI, CNRS, Télćom ParisTech, Université Paris-Saclay, 75013,
More informationAutomatic Relevance Determination in Nonnegative Matrix Factorization
Automatic Relevance Determination in Nonnegative Matrix Factorization Vincent Y. F. Tan, Cédric Févotte To cite this version: Vincent Y. F. Tan, Cédric Févotte. Automatic Relevance Determination in Nonnegative
More informationIEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL.?, NO.?, MONTH
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL.?, NO.?, MONTH 016 1 Piano Transcription in the Studio Using an Extensible Alternating Directions Framework Sebastian Ewert, Member,
More informationAnalysis of polyphonic audio using source-filter model and non-negative matrix factorization
Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Tuomas Virtanen and Anssi Klapuri Tampere University of Technology, Institute of Signal Processing Korkeakoulunkatu
More informationMusic transcription with ISA and HMM
Music transcription with ISA and HMM Emmanuel Vincent, Xavier Rodet To cite this version: Emmanuel Vincent, Xavier Rodet. Music transcription with ISA and HMM. 5th Int. Conf. on Independent Component Analysis
More informationLearning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis
Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis Youngmin Cho yoc@cs.ucsd.edu Lawrence K. Saul saul@cs.ucsd.edu CSE Department, University of California, San Diego, 95 Gilman
More informationDeep NMF for Speech Separation
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Deep NMF for Speech Separation Le Roux, J.; Hershey, J.R.; Weninger, F.J. TR2015-029 April 2015 Abstract Non-negative matrix factorization
More informationComputational Methods CMSC/AMSC/MAPL 460. Fourier transform
Computational Methods CMSC/AMSC/MAPL 460 Fourier transform Ramani Duraiswami, Dept. of Computer Science Several slides from Prof. Healy s course at UMD Fourier Methods Fourier analysis ( harmonic analysis
More informationTemporal models with low-rank spectrograms
Temporal models with low-rank spectrograms Cédric Févotte Institut de Recherche en Informatique de Toulouse (IRIT) IEEE MLSP Aalborg, Sep. 218 Outline NMF for spectral unmixing Generalities Itakura-Saito
More informationPreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals
PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals Masataka Goto National Institute of Advanced Industrial Science and Technology (AIST). IT, AIST, 1-1-1 Umezono, Tsukuba,
More informationCOMP 546, Winter 2018 lecture 19 - sound 2
Sound waves Last lecture we considered sound to be a pressure function I(X, Y, Z, t). However, sound is not just any function of those four variables. Rather, sound obeys the wave equation: 2 I(X, Y, Z,
More informationCS229 Project: Musical Alignment Discovery
S A A V S N N R R S CS229 Project: Musical Alignment iscovery Woodley Packard ecember 16, 2005 Introduction Logical representations of musical data are widely available in varying forms (for instance,
More informationEnforcing Harmonicity and Smoothness in Bayesian Non-negative Matrix Factorization Applied to Polyphonic Music Transcription.
Enforcing Harmonicity and Smoothness in Bayesian Non-negative Matrix Factorization Applied to Polyphonic Music Transcription. Nancy Bertin*, Roland Badeau, Member, IEEE and Emmanuel Vincent, Member, IEEE
More informationReverberation Impulse Response Analysis
MUS424: Signal Processing Techniques for Digital Audio Effects Handout #18 Jonathan Abel, David Berners April 27, 24 Lecture #9: April 27, 24 Lecture Notes 7 Reverberation Impulse Response Analysis 1 CCRMA
More informationHigh resolution NMF for modeling mixtures of non-stationary signals in the time-frequency domain
High resolution NMF for modeling mixtures of non-stationary signals in the time-frequency domain NMF à haute résolution pour la modélisation de mélanges de signaux non-stationnaires dans le domaine temps-fréquence
More informationPOLYPHONIC MUSIC TRANSCRIPTION BY NON-NEGATIVE SPARSE CODING OF POWER SPECTRA
POLYPHONIC MUSIC TRANSCRIPTION BY NON-NEGATIVE SPARSE CODING OF POWER SPECTRA Samer A. Abdallah and Mark D. Plumbley Centre for Digital Music, Queen Mary, University of London ABSTRACT We present a system
More informationarxiv: v1 [stat.ml] 6 Nov 2018
A Quasi-Newton algorithm on the orthogonal manifold for NMF with transform learning arxiv:1811.02225v1 [stat.ml] 6 Nov 2018 Pierre Ablin, Dylan Fagot, Herwig Wendt, Alexandre Gramfort, and Cédric Févotte
More informationSource Separation Tutorial Mini-Series III: Extensions and Interpretations to Non-Negative Matrix Factorization
Source Separation Tutorial Mini-Series III: Extensions and Interpretations to Non-Negative Matrix Factorization Nicholas Bryan Dennis Sun Center for Computer Research in Music and Acoustics, Stanford University
More informationFunctional and Structural Implications of Non-Separability of Spectral and Temporal Responses in AI
Functional and Structural Implications of Non-Separability of Spectral and Temporal Responses in AI Jonathan Z. Simon David J. Klein Didier A. Depireux Shihab A. Shamma and Dept. of Electrical & Computer
More informationMusic Synthesis. synthesis. 1. NCTU/CSIE/ DSP Copyright 1996 C.M. LIU
Music Synthesis synthesis. 1 pintroduction pmodeling, Synthesis, and Overview padditive Synthesis psubtractive Synthesis pnonlinear Synthesis pwavetable Synthesis psummary and Conclusions 1. Introduction
More informationA NEW DISSIMILARITY METRIC FOR THE CLUSTERING OF PARTIALS USING THE COMMON VARIATION CUE
A NEW DISSIMILARITY METRIC FOR THE CLUSTERING OF PARTIALS USING THE COMMON VARIATION CUE Mathieu Lagrange SCRIME LaBRI, Université Bordeaux 1 351, cours de la Libération, F-33405 Talence cedex, France
More informationKen O Hanlon and Mark B. Sandler. Centre for Digital Music Queen Mary University of London
AN ITERATIVE HARD THRESHOLDING APPROACH TO l 0 SPARSE HELLINGER NMF Ken O Hanlon and Mark B. Sandler Centre for Digital Music Queen Mary University of London ABSTRACT Performance of Non-negative Matrix
More informationA Generative Model for Music Transcription
1 A Generative Model for Music Transcription Ali Taylan Cemgil, Student Member, IEEE, Bert Kappen, Senior Member, IEEE, and David Barber Abstract In this paper we present a graphical model for polyphonic
More informationGeneral Physics I. Lecture 14: Sinusoidal Waves. Prof. WAN, Xin ( 万歆 )
General Physics I Lecture 14: Sinusoidal Waves Prof. WAN, Xin ( 万歆 ) xinwan@zju.edu.cn http://zimp.zju.edu.cn/~xinwan/ Motivation When analyzing a linear medium that is, one in which the restoring force
More informationTopic 6. Timbre Representations
Topic 6 Timbre Representations We often say that singer s voice is magnetic the violin sounds bright this French horn sounds solid that drum sounds dull What aspect(s) of sound are these words describing?
More informationNonlinear Losses in Electro-acoustical Transducers Wolfgang Klippel, Daniel Knobloch
The Association of Loudspeaker Manufacturers & Acoustics International (ALMA) Nonlinear Losses in Electro-acoustical Transducers Wolfgang Klippel, Daniel Knobloch Institute of Acoustics and Speech Communication
More informationMelody Extraction and Musical Onset Detection
1 Melody Extraction and Musical Onset Detection from Framewise STFT Peak Data Harvey D. Thornburg, Student Member, IEEE, Randal J. Leistikow, Student Member, IEEE, and Jonathan Berger Abstract We propose
More information