Single-channel source separation using non-negative matrix factorization

Similar documents
Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation

Non-Negative Matrix Factorization with Quasi-Newton Optimization

NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION. Julian M. Becker, Christian Sohn and Christian Rohlfing

Non-negative matrix factorization and term structure of interest rates

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan

CONVOLUTIVE NON-NEGATIVE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT

Non-negative Matrix Factorization: Algorithms, Extensions and Applications

Nonnegative Tensor Factorization with Smoothness Constraints

Probabilistic Latent Variable Models as Non-Negative Factorizations

Nonparametric Bayesian Methods (Gaussian Processes)

Regularized Alternating Least Squares Algorithms for Non-negative Matrix/Tensor Factorization

MULTIPLICATIVE ALGORITHM FOR CORRENTROPY-BASED NONNEGATIVE MATRIX FACTORIZATION

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology

CPSC 340: Machine Learning and Data Mining. Sparse Matrix Factorization Fall 2018

Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification

Bayesian Paradigm. Maximum A Posteriori Estimation

Bayesian Hierarchical Modeling for Music and Audio Processing at LabROSA

Independent Component Analysis. Contents

ORTHOGONALITY-REGULARIZED MASKED NMF FOR LEARNING ON WEAKLY LABELED AUDIO DATA. Iwona Sobieraj, Lucas Rencker, Mark D. Plumbley

Independent Component Analysis and Unsupervised Learning. Jen-Tzung Chien

Note on Algorithm Differences Between Nonnegative Matrix Factorization And Probabilistic Latent Semantic Indexing

Nonnegative Matrix Factorization

Constrained Nonnegative Matrix Factorization with Applications to Music Transcription

STA 4273H: Statistical Machine Learning

Incremental Multi-Source Recognition with Non-Negative Matrix Factorization

Nonnegative Matrix Factorization with Toeplitz Penalty

ECE521 week 3: 23/26 January 2017

Data Mining and Matrices

CORRELATED NOISE: HOW IT BREAKS NMF, AND WHAT TO DO ABOUT IT

On Spectral Basis Selection for Single Channel Polyphonic Music Separation

Parametric Models. Dr. Shuang LIANG. School of Software Engineering TongJi University Fall, 2012

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES

Independent Component Analysis and Unsupervised Learning

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement

Bayesian Regularization and Nonnegative Deconvolution for Room Impulse Response Estimation

Gaussian Processes (10/16/13)

Matrix Factorization & Latent Semantic Analysis Review. Yize Li, Lanbo Zhang

Environmental Sound Classification in Realistic Situations

c Springer, Reprinted with permission.

Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis

Non-Negative Matrix Factorization

Variational Methods in Bayesian Deconvolution

Unsupervised learning: beyond simple clustering and PCA

Nonnegative matrix factorization with α-divergence

arxiv: v4 [math.na] 10 Nov 2014

Pattern Recognition. Parameter Estimation of Probability Density Functions

Density Estimation. Seungjin Choi

CS598 Machine Learning in Computational Biology (Lecture 5: Matrix - part 2) Professor Jian Peng Teaching Assistant: Rongda Zhu

Pattern Recognition and Machine Learning

Discovering Convolutive Speech Phones using Sparseness and Non-Negativity Constraints

PILCO: A Model-Based and Data-Efficient Approach to Policy Search

Novel Multi-layer Non-negative Tensor Factorization with Sparsity Constraints

Statistical approach for dictionary learning

NON-NEGATIVE SPARSE CODING

Exemplar-based voice conversion using non-negative spectrogram deconvolution

Gaussian Models

Fast Coordinate Descent methods for Non-Negative Matrix Factorization

One Picture and a Thousand Words Using Matrix Approximtions October 2017 Oak Ridge National Lab Dianne P. O Leary c 2017

Ch 4. Linear Models for Classification

BLIND SEPARATION OF INSTANTANEOUS MIXTURES OF NON STATIONARY SOURCES

Acoustic MIMO Signal Processing

Nonnegative Matrix Factorization with Gibbs Random Field Modeling

Learning with Noisy Labels. Kate Niehaus Reading group 11-Feb-2014

Introduction PCA classic Generative models Beyond and summary. PCA, ICA and beyond

Probabilistic Machine Learning. Industrial AI Lab.

Heeyoul (Henry) Choi. Dept. of Computer Science Texas A&M University

STA 4273H: Statistical Machine Learning

Principal Component Analysis (PCA) for Sparse High-Dimensional Data

Single Channel Signal Separation Using MAP-based Subspace Decomposition

Nonnegative Tensor Factorization with Smoothness Constraints

Gaussian Processes for Audio Feature Extraction

Advanced Introduction to Machine Learning CMU-10715

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka

Single-channel source separation using non-negative matrix factorization

Improved Bayesian Compression

Computer Vision Group Prof. Daniel Cremers. 3. Regression

Machine Learning Techniques for Computer Vision

Statistícal Methods for Spatial Data Analysis

A State-Space Approach to Dynamic Nonnegative Matrix Factorization

Variational inference

CSC321 Lecture 18: Learning Probabilistic Models

SUPERVISED NON-EUCLIDEAN SPARSE NMF VIA BILEVEL OPTIMIZATION WITH APPLICATIONS TO SPEECH ENHANCEMENT

A graph contains a set of nodes (vertices) connected by links (edges or arcs)

Regularized Multiplicative Algorithms for Nonnegative Matrix Factorization

TUTORIAL PART 1 Unsupervised Learning

cross-language retrieval (by concatenate features of different language in X and find co-shared U). TOEFL/GRE synonym in the same way.

A Fast Augmented Lagrangian Algorithm for Learning Low-Rank Matrices

Non-negative mixtures

Posterior Regularization

Riemannian Metric Learning for Symmetric Positive Definite Matrices

Fast Nonnegative Matrix Factorization with Rank-one ADMM

Generalized Nonnegative Matrix Approximations with Bregman Divergences

Sound Recognition in Mixtures

Sparse and Shift-Invariant Feature Extraction From Non-Negative Data

STA 4273H: Statistical Machine Learning

ADVANCED MACHINE LEARNING ADVANCED MACHINE LEARNING. Non-linear regression techniques Part - II

Learning Bayesian network : Given structure and completely observed data

Neural networks: Unsupervised learning

The connection of dropout and Bayesian statistics

Transcription:

Single-channel source separation using non-negative matrix factorization Mikkel N. Schmidt Technical University of Denmark mns@imm.dtu.dk www.mikkelschmidt.dk DTU Informatics Department of Informatics and Mathematical Modeling Page 1 of 30

Agenda Single-channel source separation Non-negative matrix factorization NMF 2-D deconvolution (with Morten Mørup) Speech separation using sparse NMF (with Rasmus K. Olsson) NMF with Gaussian process priors (with Hans Laurberg) Page 2 of 30

Single-channel source separation Additive model Mixture Sources Noise Under-determined problem: More information required Example: Two-source noise-free Page 3 of 30

Approaches to single-channel source separation Filtering Decomposition and grouping Source modelling Page 4 of 30

Model-based source-separation mixture training data feature extraction feature extraction data representation inference separation model model reconstruction estimate Page 5 of 30

Signal representation Emphasize desired characteristics Introduce invariances Allow assumptions of independence or exchangeability Reduce dimensionality Allow signal reconstruction Page 6 of 30

Model Likelihood Mixing model Source model Model building Prior Model training Model adaptation Goals Accurately model sources and mixing process Enable efficient inference Page 7 of 30

Inference Posterior Likelihood Prior Estimate sources: Maximum a posteriori, posterior mean, marginal MAP, etc. Solve optimization or integration problem Page 8 of 30

Non-negative matrix factorization Non-negativity constraints Data Factorizing matrices Page 9 of 30

Non-negative matrix factorization Non-negative bilinear model Sum of products of non-negative variables Page 10 of 30

Why non-negativity? Many signals are naturally non-negative Pixel intensities Amplitude spectra Occurrence counts Discrete probabilities Additive combination of features No cancellations Build the whole as a sum of parts Page 11 of 30

Basic computation Constrained minimization problem Divergence measure Constraints Page 12 of 30

Probabilistic formulation Maximum likelihood Likelihood Prior Maximum a posteriori Page 13 of 30

Divergence measures (Likelihoods) Squared error (Lee and Seung, 1999) Kullback-Leibler divergence (Lee and Seung, 1999) Bregman's divergence (Dhillon and Sra, 2005) Kompass' divergence (Kompass, 2007) Csiszár's divergence (Cichocki et al., 2006) Amari's alpha divergence (Cichocki et al., 2006) Weighted versions of the above (Guillamet et al., 2001) Page 14 of 30

Distribution of the factors (Priors) Sparsity (Hoyer, 2002) Orthogonality (Ding et al., 2005) Discriminative (Wang et al., 2004) Smoothness (Virtanen, 2003) Gaussian process (Schmidt and Laurberg, 2008) Transformation invariance (Wersing et al., 2003) Convolutive (Virtanen, 2004; Smaragdis, 2004) 2-D convolutive (Schmidt and Mørup, 2006) Page 15 of 30

Optimization strategies Direct optimization Non-convex Alternating optimization Alternating descent Can have convex subproblems Page 16 of 30

Algorithms Projected least squares (Paatero, 1997) Multiplicative updates (Lee and Seung, 1999) Projected gradient descent (Lin, 2007) Logarithmic barrier method (Lu and Wu, 2005) Active set (Berry et al., 2006) Quasi Newton (Kim et al., 2007) Reparametrization (Cichocki et al., 2006) SOCP (Heiler and Schnörr, 2006) Page 17 of 30

NMF 2-D deconvolution NMF NMF 2-D Page 18 of 30

Example: Flute and guitar 3.2 1.6.8.4 Frequency [khz] 2 4 6 8 10 Time [seconds].2 Page 19 of 30

Speech separation using sparse NMF Training data Learn basis for each source Mixture Basis Decompose mixture Reconstruct each source Source estimate Page 20 of 30

Example: Two speakers Bases Mixture 3.5 3 2.5 2 1.5 1.5 Frequency [khz] 1 2 3 4 Time [seconds] Page 21 of 30

NMF with Gaussian process priors Data Gaussian processes Link functions GP: General distribution over functions Link function: Non-linear map to non-negative reals Page 22 of 30

Gaussian process Distribution over functions Characterized by Mean function Covariance function Example Page 23 of 30

Change of variable Cholesky decomposition I.I.D. standard normal Same model, different parametrization Parameters are a priori uncorrelated Empirically better optimization properties Page 24 of 30

Illustration of NMF with GP priors Page 25 of 30

Example: Toy problem Noisy data Underlying data NMF GPP-NMF Page 26 of 30

Example: Toy problem Columns of A Rows of B Underlying data NMF GPP-NMF Page 27 of 30

Example: Chemical shift brain imaging Data: 369 point spectra @ 512 points (8 8 8 grid) in human head Task: Distinguish between brain and muscle tissue Spectra prior: Smooth exponential distribution Activation prior: 3-D smooth, exponential distribution, leftto-right symmetric Random draw from prior Page 28 of 30

Example: Chemical shift brain imaging Non-negative matrix factorization NMF with Gaussian process priors Brain Muscle Page 29 of 30

Conclusions Single-channel source separation is a difficult problem that occurs in many areas Model based source separation provides a principled approach to solving the problem NMF and its extensions are useful models for singlechannel source separation Gaussian processes can be used as a general framework for incorporating prior information in NMF Page 30 of 30