POLYNOMIAL SINGULAR VALUES FOR NUMBER OF WIDEBAND SOURCES ESTIMATION AND PRINCIPAL COMPONENT ANALYSIS

Similar documents
Blind Source Separation with a Time-Varying Mixing Matrix

Blind Deconvolution via Maximum Kurtosis Adaptive Filtering

Applied Linear Algebra in Geoscience Using MATLAB

1 Introduction Independent component analysis (ICA) [10] is a statistical technique whose main applications are blind source separation, blind deconvo

FREQUENCY-DOMAIN IMPLEMENTATION OF BLOCK ADAPTIVE FILTERS FOR ICA-BASED MULTICHANNEL BLIND DECONVOLUTION

Contents. Preface for the Instructor. Preface for the Student. xvii. Acknowledgments. 1 Vector Spaces 1 1.A R n and C n 2

Estimation of the Optimum Rotational Parameter for the Fractional Fourier Transform Using Domain Decomposition

Chapter 3 Transformations

On frequency-domain implementations of filtered-gradient blind deconvolution algorithms

Comparative Performance Analysis of Three Algorithms for Principal Component Analysis

ORIENTED PCA AND BLIND SIGNAL SEPARATION

where A 2 IR m n is the mixing matrix, s(t) is the n-dimensional source vector (n» m), and v(t) is additive white noise that is statistically independ

Blind Identification of FIR Systems and Deconvolution of White Input Sequences

Acoustic MIMO Signal Processing

On Information Maximization and Blind Signal Deconvolution

Simultaneous Diagonalization in the Frequency Domain (SDIF) for Source Separation

/16/$ IEEE 1728

Prof. Dr.-Ing. Armin Dekorsy Department of Communications Engineering. Stochastic Processes and Linear Algebra Recap Slides

Dimension reduction, PCA & eigenanalysis Based in part on slides from textbook, slides of Susan Holmes. October 3, Statistics 202: Data Mining

Recursive Generalized Eigendecomposition for Independent Component Analysis

A Coupled Helmholtz Machine for PCA

1 Introduction Blind source separation (BSS) is a fundamental problem which is encountered in a variety of signal processing problems where multiple s

Performance Analysis for Strong Interference Remove of Fast Moving Target in Linear Array Antenna

Blind Machine Separation Te-Won Lee

GENERALIZED DEFLATION ALGORITHMS FOR THE BLIND SOURCE-FACTOR SEPARATION OF MIMO-FIR CHANNELS. Mitsuru Kawamoto 1,2 and Yujiro Inouye 1

A UNIFIED PRESENTATION OF BLIND SEPARATION METHODS FOR CONVOLUTIVE MIXTURES USING BLOCK-DIAGONALIZATION

An Iterative Blind Source Separation Method for Convolutive Mixtures of Images

Review of Some Concepts from Linear Algebra: Part 2

IS NEGATIVE STEP SIZE LMS ALGORITHM STABLE OPERATION POSSIBLE?

Wavelet de-noising for blind source separation in noisy mixtures.

NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION. Julian M. Becker, Christian Sohn and Christian Rohlfing

Independent Component Analysis

ADAPTIVE FILTER THEORY

Basic Concepts in Linear Algebra

CONTROL SYSTEMS ANALYSIS VIA BLIND SOURCE DECONVOLUTION. Kenji Sugimoto and Yoshito Kikkawa

MULTICHANNEL SIGNAL PROCESSING USING SPATIAL RANK COVARIANCE MATRICES

Lapped Unimodular Transform and Its Factorization

A NATURAL GRADIENT CONVOLUTIVE BLIND SOURCE SEPARATION ALGORITHM FOR SPEECH MIXTURES

HST.582J/6.555J/16.456J

Independent Component Analysis and Its Application on Accelerator Physics

ORDER-CONTROLLED MULTIPLE SHIFT SBR2 ALGORITHM FOR PARA-HERMITIAN POLYNOMIAL MATRICES

Review of Basic Concepts in Linear Algebra

Principal Component Analysis

Linear Algebra Review. Vectors

Eigenvalue and Eigenvector Homework

the tensor rank is equal tor. The vectorsf (r)

MULTIPLE-CHANNEL DETECTION IN ACTIVE SENSING. Kaitlyn Beaudet and Douglas Cochran

Undercomplete Blind Subspace Deconvolution via Linear Prediction

PCA & ICA. CE-717: Machine Learning Sharif University of Technology Spring Soleymani

TRINICON: A Versatile Framework for Multichannel Blind Signal Processing

Mathematical foundations - linear algebra

Statistical and Adaptive Signal Processing

Robust extraction of specific signals with temporal structure

Proposition 42. Let M be an m n matrix. Then (32) N (M M)=N (M) (33) R(MM )=R(M)

Independent Component Analysis

Lecture II: Linear Algebra Revisited

BLIND SOURCE SEPARATION ALGORITHM FOR MIMO CONVOLUTIVE MIXTURES. Kamran Rahbar and James P. Reilly

Advanced Digital Signal Processing -Introduction

We use the overhead arrow to denote a column vector, i.e., a number with a direction. For example, in three-space, we write

Blind Deconvolution by Modified Bussgang Algorithm

Least Squares with Examples in Signal Processing 1. 2 Overdetermined equations. 1 Notation. The sum of squares of x is denoted by x 2 2, i.e.

DATA MINING LECTURE 8. Dimensionality Reduction PCA -- SVD

Constrained Projection Approximation Algorithms for Principal Component Analysis

A METHOD OF ICA IN TIME-FREQUENCY DOMAIN

Vandermonde-form Preserving Matrices And The Generalized Signal Richness Preservation Problem

Short-Time ICA for Blind Separation of Noisy Speech

ONE-DIMENSIONAL (1-D) two-channel FIR perfect-reconstruction

2. Linear algebra. matrices and vectors. linear equations. range and nullspace of matrices. function of vectors, gradient and Hessian

COMPLEX CONSTRAINED CRB AND ITS APPLICATION TO SEMI-BLIND MIMO AND OFDM CHANNEL ESTIMATION. Aditya K. Jagannatham and Bhaskar D.

Math 102, Winter Final Exam Review. Chapter 1. Matrices and Gaussian Elimination

Dimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas

BLIND DECONVOLUTION ALGORITHMS FOR MIMO-FIR SYSTEMS DRIVEN BY FOURTH-ORDER COLORED SIGNALS

New Recursive-Least-Squares Algorithms for Nonlinear Active Control of Sound and Vibration Using Neural Networks

STA141C: Big Data & High Performance Statistical Computing

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

Preliminary Examination, Numerical Analysis, August 2016

Linear Algebra. and

Algebra C Numerical Linear Algebra Sample Exam Problems

VECTORIZED signals are often considered to be rich if

EIGENFILTERS FOR SIGNAL CANCELLATION. Sunil Bharitkar and Chris Kyriakakis

Lecture: Face Recognition and Feature Reduction

BLOCK-BASED MULTICHANNEL TRANSFORM-DOMAIN ADAPTIVE FILTERING

Enhancing Polynomial MUSIC Algorithm for Coherent Broadband Sources Through Spatial Smoothing

Review problems for MA 54, Fall 2004.

FUNDAMENTAL LIMITATION OF FREQUENCY DOMAIN BLIND SOURCE SEPARATION FOR CONVOLVED MIXTURE OF SPEECH

Lecture 02 Linear Algebra Basics

Wideband MIMO Channel Diagonalization in the Time Domain

1 Number Systems and Errors 1

Linear Algebra: Characteristic Value Problem

Applied Linear Algebra

The University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013.

Lecture 7: Positive Semidefinite Matrices

Independent Component Analysis. Contents

MATH 423 Linear Algebra II Lecture 33: Diagonalization of normal operators.

BASED on the minimum mean squared error, Widrow

Properties of Matrices and Operations on Matrices

Separation of the EEG Signal using Improved FastICA Based on Kurtosis Contrast Function

x 1 (t) Spectrogram t s

Statistical Geometry Processing Winter Semester 2011/2012

Quantum Computing Lecture 2. Review of Linear Algebra

Transcription:

POLYNOMIAL SINGULAR VALUES FOR NUMBER OF WIDEBAND SOURCES ESTIMATION AND PRINCIPAL COMPONENT ANALYSIS Russell H. Lambert RF and Advanced Mixed Signal Unit Broadcom Pasadena, CA USA russ@broadcom.com Marcel Joho and Heinz Mathis Signal and Information Processing Lab Swiss Federal Institute of Technology Zurich, Switzerland joho,mathis@isi.ee.ethz.ch ABSTRACT A multipath enabled singular value decomposition (SVD) algorithm is presented, which will allow computation of wideband (polynomial) singular values, and hence, the signal+noise and noise subspaces. Polynomial singular values are ordered according to total energy. The number of sources can be estimated using the scalar total energy values. Results using both simulated data on the computer and actual speech recorded in a noisy multipath environment are given to demonstrate the usefulness of the techniques shown. After number of sources estimation, only the signal+noise subspace is used to create virtual sensors which have made optimal use of all the sensors. As a final signal copy step, standard blind independent component analysis (ICA) or blind source separation algorithms can be used to recover the original data from the virtual sensors. The number of estimated sources could also be given to a blind algorithm capable of using overdetermined sources and the algorithm can adaptively make use of all sensor data. 1. INTRODUCTION We address the problem of efficiently recovering Ms sources given M sensors of noisy, multipath mixed data. A key step in the recovery process is estimation of the number of original sources. In earlier work, we have given tools to compute eigenvalues for multipath matrices. Singular value decomposition (SVD) is conceptually an eigenroutine applied to the spectrum of a matrix, A A H ; in which eigenvalues and associated eigenvectors are ordered according to energy. Herein, we show how to use total energy (over all time extent or, equivalently, over all frequencies) of the eigenvalues for ordering of eigenvalues and eigenvectors and deciding between signal+noise and noise only subspaces. Narrowband/scalar mixture processes are well described by scalar matrix algorithms, but adding a wideband/multipath dimension to the problem requires a new set of tools. We present a fundamental singular value decomposition technique which uses (finite impulse response) FIR matrices, whose elements are vectors and have time domain (or equivalently, frequency domain) extent. 2. PROBLEM DESCRIPTION The mixing process is described as x t = As t + n t (1) where s t =(s 1 ; ;sm s ) T t contains the samples of the unknown source signals at time t, x t = (x 1 ; ;xm) T t the samples of the M sensor signals at sample time t, n t the samples of the sensor noise at time t, with A M Ms L as the unknown mixing matrix. In the overdetermined case, we have more sensors than source signals (M > Ms). 2.1. Notation The following notation is used throughout: Vectors are written in lower case, matrices in upper case. The sample index is denoted by t. Ef g denotes the expectation operator. Vector or matrix dimensions are given in superscript. The Frobenius norm of a matrix is denoted by k k F.Column vectors are bold-face, lower case symbols, as in x,and column vectors with filters as elements are underlined boldface, lower case symbols, as in x: Matrix variables are in bold face, upper case, as in A, and matrices of filters (FIR matrices (see [1]) are underlined bold face, upper case symbols, as in A. FIR matrices are usually assumed to be in the frequency domain where convolution is multiplication and standard Linear Algebra notation holds. We convert the matrices to the frequency domain, perform all the computations and then convert back to the time domain at the final step for plotting the result. The time domain or multipath extent of filters and FIR matrices is L taps in length. I is the unit FIR matrix and 0 is the zero FIR matrix. Inner product, row times column matrix multiplication rules, etc., all apply in FIR matrix algebra, but the scalar matrix term by 379

scalar matrix term multiplication is replaced with elementwise (across all L) multiplication of the frequency domain representations of the corresponding FIR matrix terms. with p 1 p 2 pm s > 0 (6) 2.2. Assumptions In addition to the problem proposed above, we make the following assumptions: ffl A1 Time-invariant mixing matrix A. ffl A2 A has rank Ms. ffl A3 Source signals sm, m =1;:::;Ms, are mutually independent and iid. R ss = Efs t s H t g = ff2 s I M s (2) ffl A4 Source signals sm (save possibly one) are non- Gaussian. This assumption is required for full independent component analysis (or signal copy), but not for the PCA stage. ffl A5 All source signals are unknown and have the same power ff 2 s. ffl A6 There are more sensors than source signals M > Ms. ffl A7 All sensors have the same noise characteristics. The sensor noise is additive white Gaussian noise with power ff 2 n. The sensor noise of the sensors is mutually independent. R nn = Efn t n H t g = ff2 n I M : (3) ffl A8 The source signals and the sensor noise are mutually independent. 3. POLYNOMIAL SVD USING TOTAL ENERGY ORDERING By applying SVD on A,wehave " # A = U ± V H ± ~ = U V H (4) where U M M L and V Ms Ms L are unitary FIR matrices (U U H = I M M L and V V H = I Ms Ms L ). Unitary FIR matrices are isomorphic to the paraunitary matrices defined in [2]. Unitary matrices contain only phase information. ± M Ms L and ~ ± Ms Ms L are diagonal FIR matrices which contain the polynomial singular values of A 0 ~± = diagfff 1 ;:::;ff Ms g (5) where the last inequality comes from the assumption A2; and where the scalars pm are defined as the total summed energy (same in time or frequency domain) in the wideband singular values. pm =± all k ff2 m (k) (7) The SVD of the input correlation matrix R xx gives with (2) and (3) with R xx = Efx t x H t g (8) = A R ss A H + R nn (9) = ff 2 s A AH + ff 2 n I M (10) = ff 2 s U ± VH V ± T U H + ff 2 n I M (11) = ff 2 s U ± ±T U H + ff 2 n I M (12) = ff 2 s U μ ± 2 U H + ff 2 n U UH (13) = U(ff 2 s μ ± 2 + ff 2 n I M )UH (14) = U ± 2 xx UH (15) μ± M M L = diagfff 1 ;:::;ff Ms ; 0;:::;0g ± 2 xx = diagfff2 x 1 ;:::;ff 2 x Ms ;ff 2 x Ms +1 ;:::;ff2 x M g ± 2 xx = diagfff2 1 ff2 s + ff2 n ;:::;ff2 M s ff 2 s + ff2 n ;ff2 n ;:::;ff2 n g: 4. SIMULATION 4.1. PCA Results using Speech Recorded in a Noisy Reverberant Environment (Ms=1, M=5) Five sensors were recorded of a speech signal uttered in an automobile traveling at freeway speeds. The signal-to-noise ratio is a function of frequency, but was very poor, in the subjective range of 5 to -5 db at each sensor. The multipath correlation matrix for this data corresponding to R xx ; equation (9) is shown in figure 1. A hundred seconds of data was used, with a sample rate of 8000Hz. A time extent of 16000 samples (2 seconds) was used. The energy order eigenvalues of R xx are shown in figure 2. Note that as in scalar matrix algebra the eigenvalues of R xx will contain spectral magnitude information only (no phase). The eigenvalues shown are symmetric about time zero (completely phaseless). Also, the singular vector matrices U and V are unitary FIR matrices. We demonstrate this property in figure 3 by plotting U U H = I: 380

4.2. PCA Results using Computer Generated Data (Ms=3, M=6) Next, we perform a controlled test using simulated uniformly distributed source and white Gaussian noise. Three sources of white uniformly distributed data (from :5 to :5)were convolved with the mixture matrix A given by: 2 6 4 :4 :1 + :1z 1 :2 :1z 1 :2 + : 2z 1 :4 + :23z 1 :1 :2z 1 :1 + :5z 1 :3 :1z 1 :3 :3z 1 :4z 2 :5z 1 :1 :25z 1 + :15z 2 :45 :38z 1 :2z 2 :3z 1 + :1z 2 (19) Then, white Gaussian noise of ffn = :1 was added. Results of PCA follow as above, see figures 4, 5, 6, 7, 8. 4.3. ICA Results using MBLMS and Computer Generated Data (Ms=3, M=6) The same data from the previous PCA simulation was applied to the multichannel blind LMS algorithm ([1]) for ICA (full true-phase signal copy). The algorithm must be told the number of sources to look for (this case 3). Figure 9 shows a high quality separation using a scale invariant measure ISI. 3 7 5 6. REFERENCES [1] R. H. Lambert and C. L. Nikias, Blind deconvolution of multipath mixtures, in Unsupervised Adaptive Filtering, S. Haykin, Ed. 2000, vol. 1, pp. 377 436, John Wiley & Sons. [2] P. P. Vaidyanathan, Multirate Systems and Filter Banks, Prentice Hall, 1993. [3] S. C. Douglas, S.-I. Amari, and S.-Y. Kung, Adaptive paraunitary filter banks for spatio-temporal principal and minor subspace analysis, in ICASSP, Phoenix, AZ, March 1999, vol. 2, pp. 1089 1092. [4] L.-Q. Zhang, A. Cichocki, and S. Amari, Natural gradient algorithm for blind separation of overdetermined mixture with additive noise, IEEE Signal Processing Letters, pp. 293 295, November 1999. [5] M. Joho, H. Mathis, and R. H. Lambert, Overdetermined blind source separation: Using more sensors than source signals in a noisy mixture, in Proc. International Conference on Independent Component Analysis and Blind Signal Separation, Helsinki, Finland, June 19 22, 2000, pp. 81 86. 5. CONCLUSION AND IMPLICATION TO SOURCE SEPARATION OF OVERDETERMINED MULTIPATH MIXTURES Using FIR matrix tools, we have presented a multipath extension of SVD for signal subspace processing for estimation of number of sources and principal component analysis. As the FIR matrix singular values are vectors with time/frequency extent, we use the scalar total energy of each singular value for ordering purposes. We have demonstrated the extended SVD and number or sources estimation on noisy reverberant speech data and simulated systems. Recently in the area of source separation, there has been interest to find the best ways to use extra (overdetermined) sensors, see [3, 4, 5]. This has been especially true in light of the fact that key algorithms which use RLS-type or relative gradient updates are restricted to dealing with square (Ms = M) mixtures. Our proposal is to first estimate the number of original sources using the presented PCA/SVD technique. Then, one has the option of collapsing the data to virtual sensors containing only the principal sources and proceeding to the ICA step were one can now use algorithms limited to square mixtures. Alternatively, one can use an LMS type update algorithm (not restricted to square mixtures) which is instructed to recover the estimated number of original sources. Fig. 1. R xx of acoustic recording with one true speech source in noise. 381

Fig. 2. Energy ordered singular values of R xx for acoustic recording. Fig. 5. Energy ordered singular values of R xx of computer data with 3 sources and 6 sensors. 8 Total energy of eig(r xx ) (M s = 3, M=6) 10 12 14 p m [db] 16 18 20 22 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 m Fig. 3. The result of U U H for acoustic recording. Fig. 6. The terms pm; scalar summed energy of R xx of computer data with 3 sources and 6 sensors. Fig. 4. R xx of computer data with 3 sources and 6 sensors. Fig. 7. Unitary singular vector matrix U of computer data with 3 sources and 6 sensors. 382

Fig. 8. The result of U U H for computer data with 3 sources and 6 sensors. 5 Blind LMS method 0 5 Column ISI (db) 10 15 20 25 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 number of iterations x 10 5 Fig. 9. ICA results for (Ms=3, M =6) multichannel blind LMS adaptation. Column ISI (a scale invariant measure of convergence) taken from the three columns of the global system as convergence proceeds. 383