On the INFOMAX algorithm for blind signal separation

Similar documents
Blind deconvolution of dynamical systems using a balanced parameterized state space approach

Blind Machine Separation Te-Won Lee

Computing running DCTs and DSTs based on their second-order shift properties

1 Introduction Independent component analysis (ICA) [10] is a statistical technique whose main applications are blind source separation, blind deconvo

An improved dynamic model for optical feedback self-mixing interferometry-based measurement and instrumentation

Recursive Generalized Eigendecomposition for Independent Component Analysis

Massoud BABAIE-ZADEH. Blind Source Separation (BSS) and Independent Componen Analysis (ICA) p.1/39

Blind signal processing algorithms

Chapter 6 Effective Blind Source Separation Based on the Adam Algorithm

NOISE ROBUST RELATIVE TRANSFER FUNCTION ESTIMATION. M. Schwab, P. Noll, and T. Sikora. Technical University Berlin, Germany Communication System Group

Recovering the absolute phase maps of two fringe patterns with selected frequencies

On Information Maximization and Blind Signal Deconvolution

Blind Extraction of Singularly Mixed Source Signals

Independent Component Analysis. Contents

Robust extraction of specific signals with temporal structure

FREQUENCY-DOMAIN IMPLEMENTATION OF BLOCK ADAPTIVE FILTERS FOR ICA-BASED MULTICHANNEL BLIND DECONVOLUTION

ON-LINE BLIND SEPARATION OF NON-STATIONARY SIGNALS

An Improved Cumulant Based Method for Independent Component Analysis

Fundamentals of Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Independent Vector Analysis (IVA)

Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models

Natural Gradient Learning for Over- and Under-Complete Bases in ICA

PROPERTIES OF THE EMPIRICAL CHARACTERISTIC FUNCTION AND ITS APPLICATION TO TESTING FOR INDEPENDENCE. Noboru Murata

Blind separation of instantaneous mixtures of dependent sources

Independent Component Analysis

A METHOD OF ICA IN TIME-FREQUENCY DOMAIN

Simultaneous Diagonalization in the Frequency Domain (SDIF) for Source Separation

Blind Source Separation with a Time-Varying Mixing Matrix

Gatsby Theoretical Neuroscience Lectures: Non-Gaussian statistics and natural images Parts I-II

Undercomplete Independent Component. Analysis for Signal Separation and. Dimension Reduction. Category: Algorithms and Architectures.

Entropy Manipulation of Arbitrary Non I inear Map pings

Independent Component Analysis

To appear in Proceedings of the ICA'99, Aussois, France, A 2 R mn is an unknown mixture matrix of full rank, v(t) is the vector of noises. The

Independent Component Analysis

Analytical solution of the blind source separation problem using derivatives

Independent Components Analysis

Independent Component Analysis of Incomplete Data

x 1 (t) Spectrogram t s

ORIENTED PCA AND BLIND SIGNAL SEPARATION

Old painting digital color restoration

ON SOME EXTENSIONS OF THE NATURAL GRADIENT ALGORITHM. Brain Science Institute, RIKEN, Wako-shi, Saitama , Japan

Independent Component Analysis on the Basis of Helmholtz Machine

One-unit Learning Rules for Independent Component Analysis

Distinguishing Causes from Effects using Nonlinear Acyclic Causal Models

CIFAR Lectures: Non-Gaussian statistics and natural images

Semi-Blind approaches to source separation: introduction to the special session

FUNDAMENTAL LIMITATION OF FREQUENCY DOMAIN BLIND SOURCE SEPARATION FOR CONVOLVED MIXTURE OF SPEECH

A Generalization of Blind Source Separation Algorithms for Convolutive Mixtures Based on Second-Order Statistics

On the Slow Convergence of EM and VBEM in Low-Noise Linear Models

MULTICHANNEL SIGNAL PROCESSING USING SPATIAL RANK COVARIANCE MATRICES

Absolute phase map recovery of two fringe patterns with flexible selection of fringe wavelengths

where A 2 IR m n is the mixing matrix, s(t) is the n-dimensional source vector (n» m), and v(t) is additive white noise that is statistically independ

ADAPTIVE LATERAL INHIBITION FOR NON-NEGATIVE ICA. Mark Plumbley

POLYNOMIAL SINGULAR VALUES FOR NUMBER OF WIDEBAND SOURCES ESTIMATION AND PRINCIPAL COMPONENT ANALYSIS

CONTROL SYSTEMS ANALYSIS VIA BLIND SOURCE DECONVOLUTION. Kenji Sugimoto and Yoshito Kikkawa

Adaptive beamforming for uniform linear arrays with unknown mutual coupling. IEEE Antennas and Wireless Propagation Letters.

Tutorial on Blind Source Separation and Independent Component Analysis

Comparative Analysis of ICA Based Features

A Convex Cauchy-Schwarz Divergence Measure for Blind Source Separation

Blind channel deconvolution of real world signals using source separation techniques

Notes on Latent Semantic Analysis

An Adaptive Sensor Array Using an Affine Combination of Two Filters

Introduction to Machine Learning

Information Theory. Coding and Information Theory. Information Theory Textbooks. Entropy

A Novel Approach For Sensor Noise Elimination In Real Time Process

A Constrained EM Algorithm for Independent Component Analysis

Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs

BLIND SEPARATION OF INSTANTANEOUS MIXTURES OF NON STATIONARY SOURCES

BLIND SOURCES SEPARATION BY SIMULTANEOUS GENERALIZED REFERENCED CONTRASTS DIAGONALIZATION

SPARSE REPRESENTATION AND BLIND DECONVOLUTION OF DYNAMICAL SYSTEMS. Liqing Zhang and Andrzej Cichocki

Advanced Introduction to Machine Learning CMU-10715

Introduction to Machine Learning Spring 2018 Note Neural Networks

Equivalence Probability and Sparsity of Two Sparse Solutions in Sparse Representation

1 Introduction Blind source separation (BSS) is a fundamental problem which is encountered in a variety of signal processing problems where multiple s

Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation

A new method for a nonlinear acoustic echo cancellation system

A Canonical Genetic Algorithm for Blind Inversion of Linear Channels

Blind Source Separation via Generalized Eigenvalue Decomposition

ICA [6] ICA) [7, 8] ICA ICA ICA [9, 10] J-F. Cardoso. [13] Matlab ICA. Comon[3], Amari & Cardoso[4] ICA ICA

TRINICON: A Versatile Framework for Multichannel Blind Signal Processing

A Fast Algorithm for. Nonstationary Delay Estimation. H. C. So. Department of Electronic Engineering, City University of Hong Kong

An Iterative Blind Source Separation Method for Convolutive Mixtures of Images

DETECTING PROCESS STATE CHANGES BY NONLINEAR BLIND SOURCE SEPARATION. Alexandre Iline, Harri Valpola and Erkki Oja

Speaker Representation and Verification Part II. by Vasileios Vasilakakis

Independent Component Analysis (ICA) Bhaskar D Rao University of California, San Diego

Blind Signal Separation: Statistical Principles

Combining EMD with ICA to analyze combined EEG-fMRI Data

Single Channel Signal Separation Using MAP-based Subspace Decomposition

Maximum Likelihood Blind Source Separation: A Context-Sensitive Generalization of ICA

Adaptive Blind Equalizer for HF Channels

Probability and Information Theory. Sargur N. Srihari

Error Empirical error. Generalization error. Time (number of iteration)

Convolutive Blind Source Separation based on Multiple Decorrelation. Lucas Parra, Clay Spence, Bert De Vries Sarno Corporation, CN-5300, Princeton, NJ

Lecture: Adaptive Filtering

CONVOLUTIVE NON-NEGATIVE MATRIX FACTORISATION WITH SPARSENESS CONSTRAINT

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

Real and Complex Independent Subspace Analysis by Generalized Variance

IN neural-network training, the most well-known online

MISEP Linear and Nonlinear ICA Based on Mutual Information

ICA. Independent Component Analysis. Zakariás Mátyás

Speed and Accuracy Enhancement of Linear ICA Techniques Using Rational Nonlinear Functions

Transcription:

University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2000 On the INFOMAX algorithm for blind signal separation Jiangtao Xi University of Wollongong, jiangtao@uow.edu.au Joe F. Chicharo University of Wollongong, chicharo@uow.edu.au Ah Chung Tsoi University of Wollongong, act@uow.edu.au W. C. Siu Hong Kong Polytechnic University Publication Details This paper originally appeared as: Xi, J, chicharo, JF, Tsoi, AC & Siu, WC, On the INFOMAX algorithm for blind signal separation, WCCC-ICSP 2000. 5th International Conference on Signal Processing Proceedings, 21-25 August 2000, vol 1, 425-428. Copyright IEEE 2000. Research Online is the open access institutional repository for the University of Wollongong. For further information contact the UOW Library: research-pubs@uow.edu.au

On the INFOMAX algorithm for blind signal separation Abstract This paper provides an analytical examination of the INFOMAX algorithm and establishes its effectiveness for blind signal separation using extensive simulation results. Results obtained show that the INFOMAX is not able to separate signal sources unless signal pre-processing is carried-out whereby the data to train the separating matrix is decorrelated. Further, results also show that if one uses the decorrelation pre-process alone it is able to effectively separate signal sources in many instances. Hence we conclude that the INFOMAX algorithm may not be a useful approach for signal separation. Keywords array signal processing, decorrelation, gradient methods, maximum entropy methods, statistical analysis Disciplines Physical Sciences and Mathematics Publication Details This paper originally appeared as: Xi, J, chicharo, JF, Tsoi, AC & Siu, WC, On the INFOMAX algorithm for blind signal separation, WCCC-ICSP 2000. 5th International Conference on Signal Processing Proceedings, 21-25 August 2000, vol 1, 425-428. Copyright IEEE 2000. This conference paper is available at Research Online: http://ro.uow.edu.au/infopapers/228

Proceedings of ICSPZOOO ON THE INFOMAX ALGORITHM FOR BLIND SIGNAL SEPARATION Jiangtao Xi, J.? Chicharo and Ah Chung Tsoi The University of Wollongong Australia jiangtao@uow.edu.au Wan-Chi Siu Hong Kong Polytechnic University Hong Kong Abstract This paper provides an analytical examination of the INFOMAX algorithm [3] and establishes its effectiveness for blind signal separation using extensive simulation results. Results obtained show that the INFOMAX is not able to separate signal sources unless signal preprocessing is carried-out whereby the data to train the separating matrix is decorrelated. Further, results also show that if one uses the decorrelation preprocess alone it is able to effectively separate signal sources in many instances.. Hence we conclude that the INFOMAX algorithm may not be a useful approach for signal separation. 1. INTRODUCTION Blind Signal Separation (BSS) has been proposed to separate or estimate the waveforms of unknown signal sources based on measurements from an array of sensors. A BSS problem can be modeled by the system depicted in Figure 1 wheren statistically independent signal sources are mixed through NxN unknown channels to produce N observations at the sensors. BSS is in effect a multiple-input multiple-output (MIMO) system referred to as a separating network which can be used to extract each of the original signal sources. BSS is a challenging problem due to its blind nature. That is, neither the original signal sources nor the transmission channels from the sources to the sensors are known apriori. There exists two main approaches for BSS, these include: Statistically based algorithms [ 11 and neural network based BSS algorithms [2]. Compared to the statistical approach, neural network methods are considered to be more computational efficient, although their performance may not be guaranteed in some instances. One of the most popular neural network-based approaches is called INFOMAX algorithm proposed by Bell and Sejnowski [3]. The INFOMAX method uses a gradient-based algorithm which leads to low complexity in terms of implementation. The theoretical basis for the claimed success of the INFOMAX algorithm is not entirely clear from existing literature. For instance the role and effectiveness of the preprocessing stage prior to using the INFOMAX algorithm needs to be established. The pre-processing usually performs DC component removal and de-correlation. In other words, we still do not know the answer to thc following questions: what is the effect of the preprocessing andcan the INFOMAX algorithm work without the pre-processing stage? This paper addresses these two important issues. For the purposes of the our study we only consider the instantaneous mixing cases, that is, where all the mixing channels are instantaneous scalars. In such cases the separating network also contains instantaneous channels only. Consider the system depicted in Figure 1 where a column vector S is used to denote N signal sources, X denotes the measurements, while the column vector U denotes the separated signals, and Y denotes the auxiliary outputs as follows: X=AS (1) U=M (2) Y = G(U) (3) where A and W are N by N matrices, representing the mapping from S to X and from X to U 0-7803-5747-7/00/$10.00(9 2000 IEEE. * 425

respectively, and G(.) is the non-linear mapping from U to Y. Figure 1. A blind separation system 2. REVISITING THE INFORMAX ALGORITHM [3] The INFORMAX method is based on concept of maximizing the entropy (ME) of the above auxiliary output Y. The rational is based on the assumption that the expectation of the joint entropy of signal vector Y is maximized when the components of Yare independent. In order to study the INFOMAX method more closely it is necessary to briefly reexamine its derivation. The joint entropy of Y is defined by: H(Y) = -jfy~y)~ogfy(y)~y (4) Where f,(y) is the joint probability density function (pdf) of Y. It can be shown that, H(Y) = H(X) + E(log JUnm/x + E(logJNon/in) (5) where fl(u) is the joint pdf of U, JNonlin is the Jacobian determinant of the transformation from U to Y, and Ju,,n,/x is the Jacobian determinant of the transformation from X to U. A gradient-based algorithm is used to maximize the entropy in Equation (5). However, it is a difficult task, because the first term, the entropy of the measurements, is unknown and therefore nothing can be done with it in the maximization. In this situation a sub-optimization approach was proposed, in which the gradient was evaluated based only on the remaining two terms [3], (7) The instantaneous gradient is given by VW(k) E [WT(k)rl +[I - 2Y(k)]XT(k) (8) It has been claimed that the above algorithm is able to separate the sources mixed by instantaneous network [3-4]. Note that the INFOMAX principle has also been extended to convolutive mixing cases [5-71. 3. PERFORMANCE EVALUATION OF THE INFOMAX ALGORITHM In this Section we examine the theoretical validity of the ME principle together with the gradient algorithm as presented in [3-71. We first question the issue where it is assumed, without theoretical proof, the claim that maximization of the entropy will lead to an independent output [4]. Secondly, we examine the gradient given in Equation (8) and look for the justification that it will indeed result in Maximization of the joint entropy. It is evident that only the last two terms are considered in determining the gradient, and t is also clear that the entropy will depend on the source pdf as well as the mixing matrix A. In other words, the optimal solution for Walso depends on the mixing matrix A. Hence it is evident that Equation (8) is not a perfect solution to the underlying ME problem. The above observations and discussion leads to the natural question: how the gradient algorithm achieves the blind separation? In other words, what is the major factor that steers the separating network toward a good solution? We examine this issue further by using extensive simulations. We evaluate the separation.performance of [3-71 by considering speech signals as an example. Consider the following matrix: - - E = WA = 1:;: (9) where e,.represents the of source si to output U,. The diagonal elements of thc above matrix can be used to establish the performance of signal separation. A. Simulations on INFOMAX algorithm Without Pre-Processing (Decorrelation) Assuming a sigmoid nonlinear function for y,, For the sake of simplicity, we only study cases where two signal sources are mixed through a 426

scalar (instantaneous) mixing network. The approach is to use two given signal sources, pass them through a artificial mixing network to get the two measurements, train the separation network using the measurement data, and pass the measurement data through the trained separation network to obtain the separated outputs. The first example used in the simulation is as follows. Two speech signal sources are mixed through the mixing matrix given by: L J The waveforms of the two original speech signals and the two mixed measurements are depicted in Figure 2. Obviously A is of full rank which guarantees the existence of a separation matrix. In our simulation, the measurement data samples are scaled into the range [-1,1], the step size is 0.01, the separation matrix is updated for every 30 data samples, and a total 16000 data samples are used for training the separation network. The resulting output waveforms are depicted in Figure 2. Note that the results obtained show poor performance in the sense that the signal are still mixed and signal sources remain largely unseperated as indicated by: E = p.3399 2.30981 1.6410 3.3379 It may be argued that the above example may be a special case and may not mean much. In order to address this we tried a variaty of mixing systems with different, initial values of separation matrix as well as various signal sources. The results were found to always be consistent with the above example. That is, the INFOMAX algorithm does not result in a solution for signal source separation, irrespective of signal sources being clean speech, noisy speech or noise and speech (?). B. Simulation Results in Literature As mentioned above signal separation could not be achieved. However, a lot of very good separation results have been reported in literature, which are seems to be contrary to the results depicted above. The question is why the published results are so good. In trying to answer this, we down loaded the original code used by [4]. Careful comparison the code used in [4] and our own code revealed that the data is pre-processed before feeding it into the adaptive learning algorithm. The pre-processing contains DC component removal and de-correlation (making the cross-correlation matrix to be 41). Simulations were carried out for the same signal sources and mixing matricies as discussed in Section B except that now we inserted the preprocessing stage. The results are shown in Figure 3, and the resulting E matrix is 23.3079-0.03 14 1 E=[ - 0.5952 23.4237 Clearly good separation has been achieved. C. The Effect of Pre-processing It is evident from the above that the pre-processing stage seems to be crucial in order for the INFOMAX algorithm [?I to provide a good solution. The next question is "what kind of role does the pre-processing (de-correlation) play in the process of separation? What is the effect of INFOMAX algorithm on the signal separation?" In trying to answer these two questions, let us see what happens to the measurements by the preprocessing stage. We consider the case where the measurements are zero mean: 2 = x - E{X} (15) and un-correlated: R=Ci (16) where C is the de-correlation matrix given by: CC = E(XXT 1 (17) Clearly the above pre-processing will lead to identy cross-covariance matrices: E(B} = E{CBC} = I (18) Computer simulations were also performed to investigate the effect of the above pre-processing. We used the same signal sources and mixing network as those in Section B. The results are shown in Figure 4. It is clear that the signal sources are separated by the pre-processing stage. The E matrix and L are given as follows. 10.3024 0.6671 1 E=[ - 0.5523 11.9845 The above results are not surprising because decorrelation itself has already been shown to capable of achieve signal separation [5]. 4 CONCLUSIONS AND CONSIDERATIONS In conclusion the following observations have been established: The de-correlation in pre-processing stage plays a very important role in the separation performance. The INFOMAX algorithm itself is not able to separate signal sources. * 427

In most cases, the de-correlation is able to separate the signal sources itself. REFERENCES [l]. S. Amari and A. Cichocki, Adaptive blind signal separation --- Neural network approaches, Proc. Of IEEE, Vo1.86, N0.10, October 1998, pp.2026-2048. [2]. J.-F. Cardoso, Bind signal separation: Statistical principles, Proc. Of IEEE, Vo1.86, No.10, October 1998, pp.2009-2047. [3]. A. J. Bell and T. J. Sejnowski, An information-maximization approach to blind separation and blind deconvolution, Neural Computation, Vo1.7, 1995, pp.1129-1159. [4]. Paris Independent Component Analysis & Blind Source Separation page http://sound.media.mit.edu/-paris/ica. html [5]. E. Weinstein, M. feder, and A. V. Oppenheim, Mult-channel signal separation by decorrelation, IEEE Trans. Speech and Audio Processing, Vol.1, no.4,oct.1993, pp.405-413. [6]. D.C.B Chan, P.J.W. Rayner and S.J.Godsil1, Multi-channel blind signal separation by decorrelation, Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Mohonk Mountain, House, New Paltz, New York, pp.15-18, Oct. 1995. - o G,. o, % >!> ; l l L * Figure 3. Simulations on INFOMAX with Decorrelation Figure 4. Separation results using decorrelation only I to 1 I* Figure 2. Simulations on INFOMAX without Decorrelation