Sparse Time-Frequency Transforms and Applications.

Size: px
Start display at page:

Download "Sparse Time-Frequency Transforms and Applications."

Transcription

1 Sparse Time-Frequency Transforms and Applications. Bruno Torrésani LATP, Université de Provence, Marseille DAFx, Montreal, September 2006 B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

2 1 Introduction 2 Signal waveform representations Bases Frames Multiple frames More realistic time-frequency atoms? 3 Coefficient domain models Hybrid random waveform models Estimation algorithms based on observed coefficients Estimation algorithms based on synthesis coefficients 4 Conclusion 5 References B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

3 Introduction Introduction During the last twenty years (and much more than that in fact): harmonic analysis has provided many new techniques for expanding signals into elementary waveforms. Redundant Gabor wavelet systems (frames) Wavelet bases MDCT and wilson bases Matching pursuit and cognates... Most often, sparsity of the representation was a key issue. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

4 Introduction Introduction During the last twenty years (and much more than that in fact): harmonic analysis has provided many new techniques for expanding signals into elementary waveforms. Redundant Gabor wavelet systems (frames) Wavelet bases MDCT and wilson bases Matching pursuit and cognates... Most often, sparsity of the representation was a key issue. In this talk: we review a number of such approaches, in view of a few selected applications. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

5 Introduction Introduction: What is sparsity? A signal representation is sparse when most information is concentrated in a small amount of data (coefficients). For example, a sine wave is sparsely represented in the Fourier domain, not in the time domain. Sparsity is an vague concept. Ideally, the volume of data (number of coefficients for example) would be a good sparsity measure. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

6 Introduction Introduction: What is sparsity? A signal representation is sparse when most information is concentrated in a small amount of data (coefficients). For example, a sine wave is sparsely represented in the Fourier domain, not in the time domain. Sparsity is an vague concept. Ideally, the volume of data (number of coefficients for example) would be a good sparsity measure. In noisy situations, this measure is generally polluted by a large number of small coefficients, originating from noise. Other measures may be used (entropies)... but they often do not yield the same results [Jaillet & BT 2003]. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

7 Introduction Introduction: sparsity: what for? A sparse time-frequency representation concentrates the relevant information in a small amount of coefficients: the pdf of the coefficients is peaked at 0, and heavy tailed. Most popular applications Signal coding... if the cost of encoding the representation itself is not too high Signal modeling: expand signals into components that make sense. Denoising: most often, noise is not sparse. Source separation (exploiting dimension reduction).... B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

8 Introduction 1 Introduction 2 Signal waveform representations Bases Frames Multiple frames More realistic time-frequency atoms? 3 Coefficient domain models Hybrid random waveform models Estimation algorithms based on observed coefficients Estimation algorithms based on synthesis coefficients 4 Conclusion 5 References B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

9 Signal waveform representations Signal representations Signal waveform expansion: decompose a signal as a linear combination of elementary waveforms ψ λ, often generated using simple rules. x(t) = λ α λ ψ λ (t) with α λ the coefficients, and ψ λ the waveforms. Examples: Time-frequency atoms (MDCT or Wilson bases, Gabor atoms,...) Time-scale atoms (wavelets, multiwavelets,...) Chirplets,... Higher dimensional versions See [Mallat 1998], [Carmona et al. 1998] or [Wickerhauser 1994]. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

10 Signal waveform representations Bases Signal representations: bases The mathematically simplest situation: orthonormal bases. The waveform system W = {ψ λ, λ Λ} is an orthonormal basis of the signal space (inner product space, or Hilbert space) H is The atoms are mutually orthogonal and normalized: ψ λ, ψ µ = δ µν They form a complete set in H: if the signal x H is such that x, ψ λ = 0 for all λ Λ, then x = 0. Then, any signal may be written in an unique way as x(t) = λ Λ α λ ψ λ (t), with α λ = x, ψ λ Thus, analysis and synthesis involve the same atoms. In addition, the coefficient mapping x {α λ, λ Λ} preserves energy (Parseval s formula) α λ 2 = x 2. λ Λ B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

11 Signal waveform representations Bases Signal representations: bases MDCT basis: smooth windows modulated by a sinusoidal function. In the continuous-time setting, the following (infinite) family of functions forms an orthonormal basis of L 2 (R). [ ( 2 π u kn (t) = w k (t) cos n + 1 ) ] (t a k ), k Z, n = 0, 1, 2,... l k l k 2 In bounded intervals, as well as finite dimensional settings, similar bases may be constructed (Malvar, Suter,...) B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

12 Signal waveform representations Bases Signal representations: bases More precisely, the only assumption is that the window functions w k must satisfy some symmetry conditions at boundaries. In general, windows are taken as regular translates of a single one. More freedom may be introduced, as long as the symmetry conditions are fullfilled. For example, some audio coders use systems with wide and narrow windows: B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

13 Signal waveform representations Bases Signal representations: bases More precisely, the only assumption is that the window functions w k must satisfy some symmetry conditions at boundaries. In general, windows are taken as regular translates of a single one. More freedom may be introduced, as long as the symmetry conditions are fullfilled. For example, some audio coders use systems with wide and narrow windows: Simple implementations are available on the Wavelab Stanford package: B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

14 Signal waveform representations Bases Signal representations: bases MDCT basis is well adapted for audio signals: the expansion of most signals is sparse. See below: pdf (log scale) of MDCT coefficients of some organ recording. Besides signal coding/compression, sparsity also helps for several applications. Application: denoising: as noise is generally not sparse in the MDCT basis, simply threshold the MDCT coefficients of the noisy signal before reconstruction. Organ signal; Noisy organ signal; Denoised organ signal. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

15 Signal waveform representations Bases Signal representations: bases Application: source separation: Consider two mixtures (linear combinations): Mix 1; Mix 2. Below: scatter plots of the samples of mix 1 against mix 2 (left), and the mdct coefficients of mix 1 against mix 2 (right). B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

16 Signal waveform representations Bases Signal representations: bases Application: source separation: Consider two mixtures (linear combinations): Mix 1; Mix 2. Below: scatter plots of the samples of mix 1 against mix 2 (left), and the mdct coefficients of mix 1 against mix 2 (right). Method: identify the two directions, and project. Reconstructed organ; B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

17 Signal waveform representations Bases Signal representations: bases Advantages: Optimal in terms of redundancy. There exist bases for which fast algorithms have been developed (MDCT, Wilson, wavelets,...) Drawbacks: Being an orthonormal basis has a price: rigidity. Not any window function will generate a basis. Mathematically speaking, windows are not as smooth as one would like. Being a basis also imposes constraints on the sampling in time and frequency. No free access to the time-frequency domain. Question: can we make it sparser by introducing redundancy? B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

18 Signal waveform representations Frames Signal representations: frames As an alternative to orthonormal bases, frames (wavelet, or Gabor) offer more flexibility. A frame is a (generally overcomplete) system of waveforms W = {ψ λ, λ Λ} with respect to which signals may be expanded, with stable synthesis. In the case of frames, the energy conservation (Parseval s formula) is generally replaced by an inequality of the form A x 2 λ Λ x, ψ λ 2 B x 2, for some constants 0 < A B <, for all signal x. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

19 Signal waveform representations Frames Signal representations: frames In such cases, one does not have exact reconstruction as before, but an approximation x 2 2 x, ψ λ ψ λ A + B λ Λ B A B + A B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

20 Signal waveform representations Frames Signal representations: frames In such cases, one does not have exact reconstruction as before, but an approximation x 2 2 x, ψ λ ψ λ A + B λ Λ B A B + A Good news: there exists a (non unique) dual waveform system { ψ λ, λ Λ} such that for all signal: x = x, ψ λ ψ λ = ψ λ ψ λ. λ Λ λ Λ x, Hence: analysis and synthesis do not involve the same waveforms. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

21 Signal waveform representations Frames Signal representations: frames An example: as an alternative to MDCT bases, the Gabor frames ψ mn (t) = e 2iπnν 0t ψ(t mb 0 ) provide a regular sampling of the time-frequency plane: a regular grid with mesh sizes b 0 and ν 0. For b 0 ν 0 small enough, these indeed for a frame of the considered signal space, and the (canonical) dual system is a Gabor frame too. There exists a dual window ψ such that the dual atoms are of the form ψ mn. The smaller b 0 ν 0, the more redundant the system, the closer A and B, and the closer ψ and ψ. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

22 Signal waveform representations Frames Signal representations: frames Sampling grids in time-frequency domain have to be adapted to the time/frequency resolution of the atoms. Left: good frequency resolution (wide windows); Right: good time resolution (narrow window). Full circles correspond to large coefficients. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

23 Signal waveform representations Frames Signal representations: frames A Gaussian Gabor atom (blue), and its duals for low redundancy (left) and high redundancy (right) To play with Gabor atoms, dual atoms,...: the Linear Time-Frequency Analysis Toolbox (P. Söndergaard) LTFAT: B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

24 Signal waveform representations Frames Signal representations: multiple frames Gabor frames offer more flexibility than MDCT bases in the choice of the window. However, in the standard construction, the Gabor atoms are of constant size, which is not always convenient for describing all features of audio signals. The time-frequency resolution of the atoms plays a significant role. Wide windows (from 20 to 40 ms) are well adapted to tonals (partials), while shorter ones (or wavelets instead of Gabor atoms) are beter suited for transients. Example: xilophone; B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

25 Signal waveform representations Frames Signal representations: multiple frames Question: can we get the best of the two frames? B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

26 Signal waveform representations Frames Signal representations: multiple frames Question: can we get the best of the two frames? Answer: yes, provided we can select the right signal expansion (among infinitely many), and control sparsity. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

27 Signal waveform representations Multiple frames Signal representations: multiple frames Idea: expand signals with respect to a larger system, involving both wide atoms W g = {g mn, (m, n) Λ g } and narrow atoms W h = {h mn, (m, n) Λ h }: D = W g W h D is still a frame, i.e. stable signal expansions on D exist. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

28 Signal waveform representations Multiple frames Signal representations: multiple frames Idea: expand signals with respect to a larger system, involving both wide atoms W g = {g mn, (m, n) Λ g } and narrow atoms W h = {h mn, (m, n) Λ h }: D = W g W h D is still a frame, i.e. stable signal expansions on D exist. For all finite-energy signal x, there exist infinitely many expansions x = x g + x h = λ Λ g α λ g λ + µ Λ h β µ h µ The coefficients α and β provide information on the time-frequency content of x; but some expansions are more meaningful than others. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

29 Signal waveform representations Multiple frames Signal representations: multiple frames How to pick the right time-frequency atoms? Quilted frames [Dörfler 2002]: tile the time-frequency plane into domains corresponding to different time-frequency resolutions Time-frequency Jigsaw Puzzle [Jaillet & BT 2006]: let the computer choose the right atoms in time-frequency domain, using sparsity requirement. Matching Pursuit and Orthogonal Matching Pursuit [Mallat & Zhang 1993]: recursive search of atoms that correlate best with the signal. Basis Pursuit and Basis Pursuit Denoising [Chen et al 1998]. In all cases, partial synthesis from atoms of similar properties (i.e. time-frequency resolution) is possible B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

30 Signal waveform representations Multiple frames Signal representations: multiple frames Example with the TFJP algorithm [Jaillet & BT 2006] B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

31 Signal waveform representations More realistic time-frequency atoms? More realistic time-frequency atoms? In the previous approaches, sparsity was the only requirement. Can we do more? In addition, whatever the choice of the waveform system, the time-frequency atoms are generally not realistic as sound atoms. Several possible approaches for improvement Learn more realistic atoms from sound databases: dictionary learning approach (e.g. [Bluemensath & Davies 2004]). B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

32 Signal waveform representations More realistic time-frequency atoms? More realistic time-frequency atoms? In the previous approaches, sparsity was the only requirement. Can we do more? In addition, whatever the choice of the waveform system, the time-frequency atoms are generally not realistic as sound atoms. Several possible approaches for improvement Learn more realistic atoms from sound databases: dictionary learning approach (e.g. [Bluemensath & Davies 2004]). Build time-frequency molecules from atoms as compound objects (see for example [Daudet 2006]) B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

33 Signal waveform representations More realistic time-frequency atoms? More realistic time-frequency atoms? In the previous approaches, sparsity was the only requirement. Can we do more? In addition, whatever the choice of the waveform system, the time-frequency atoms are generally not realistic as sound atoms. Several possible approaches for improvement Learn more realistic atoms from sound databases: dictionary learning approach (e.g. [Bluemensath & Davies 2004]). Build time-frequency molecules from atoms as compound objects (see for example [Daudet 2006]) Model dependencies between atoms in the coefficient domain. In the rest of the lecture, we focus on this last approach, using a pair of orthonormal bases (following [Daudet & Torrésani 2005]). B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

34 Signal waveform representations More realistic time-frequency atoms? 1 Introduction 2 Signal waveform representations Bases Frames Multiple frames More realistic time-frequency atoms? 3 Coefficient domain models Hybrid random waveform models Estimation algorithms based on observed coefficients Estimation algorithms based on synthesis coefficients 4 Conclusion 5 References B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

35 Coefficient domain models Coefficient domain models Experimental observation : Interesting features in signals are often characterized by localized families of large coefficients, forming structured sets: Tonals: Horizontal lines in the short time Fourier domain. Transients: Vertical lines in the short time Fourier domain, or vertical trees in the wavelet domain. To encode separately such structures, explicit models may be introduced in the coefficient domain. Strategy: characterize the behavior of certain indicators (coefficients, or others) in the framework of the model, in view of estimation from real data. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

36 Coefficient domain models Hybrid random waveform models Hybrid random waveform models The generic form of such models (in N-dimensional space) is the following [Kowalski & BT 2006] x = λ Λ α λ g λ + δ β δ h δ + r where W g = {g λ, λ = 1,... N} and W h = {h δ, δ = 1... N} are two orthonormal bases of waveforms. The coefficients α λ and β δ are iid Gaussian random variables, with frequency dependent variances (λ and δ are time-frequency indices) and r is a small residual signal, modeled as white noise. The sets Λ and are sparse random subsets of the index set. The simplest model is the Bernoulli model: iid sets, with membership probabilities p and p. More complex models (for example Markov models) introduce dependencies between coefficients. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

37 Coefficient domain models Hybrid random waveform models Hybrid random waveform models Are such models able to reproduce experimental observations? Study the behavior of observed coefficients (which differ from the synthesis coefficients α n and β n ) a n = x, g n, b m = x, h m Introduce the membership variables X Λ n = 1 if n Λ and 0 otherwise, and similarly for X n. Then B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

38 Coefficient domain models Hybrid random waveform models Hybrid random waveform models Are such models able to reproduce experimental observations? Study the behavior of observed coefficients (which differ from the synthesis coefficients α n and β n ) a n = x, g n, b m = x, h m Introduce the membership variables X Λ n = 1 if n Λ and 0 otherwise, and similarly for X n. Then a n = x, g n = α n X Λ n + N m=1 β mx m h m, g n b n = x, h n = β n X n + N m=1 α mx Λ m g m, h n B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

39 Coefficient domain models Estimation: observed coefficients Estimation algorithms based on observed coefficients In particular, assuming for simplicity that all coefficients α (resp. β) have the same variance σ 2 (resp. σ 2 ), one has ( var{a k } = σ 2 Xk Λ + σ ) 2 g k, h δ 2 + σ0 2. δ If the significance maps are sparse, and if the two bases are sufficiently different, one recovers the experimental observations. This justifies the fact of approximating the signal by keeping the largest coefficients. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

40 Coefficient domain models Estimation: observed coefficients Estimation algorithms based on observed coefficients Bernoulli model: It may be proved that the observed coefficients follow a mixture distribution: A small variance Gaussian mixture for coefficients whose time-frequency index does not belong to the significance map A large variance Gaussian mixture for coefficients whose time-frequency index does belong to the significance map. Exploiting numerically such a results yields an algorithm for estimating the significance maps, which yields significant dimension reduction, and allows one to estimate the coefficients. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

41 Coefficient domain models Estimation algorithms based on observed coefficients Bernoulli-based estimation algorithm: 3 steps Goal: decompose the signal into two layers (+ residual) Parameter estimation: membership probabilities and synthesis coefficients variances (EM algorithm). Estimation of the significance maps Λ and (maximum likelihood... thresholding for the Bernoulli model) Estimation of the layers: orthogonal projection onto the subspace generated by the selected time-frequency atoms. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

42 Coefficient domain models Estimation algorithms based on observed coefficients Bernoulli-based estimation algorithm: 3 steps Goal: decompose the signal into two layers (+ residual) Parameter estimation: membership probabilities and synthesis coefficients variances (EM algorithm). Estimation of the significance maps Λ and (maximum likelihood... thresholding for the Bernoulli model) Estimation of the layers: orthogonal projection onto the subspace generated by the selected time-frequency atoms. The algorithm is in fact more complex, and involves several iterations of steps 1 and 2. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

43 Coefficient domain models Estimation algorithms based on observed coefficients Bernoulli-based estimation algorithm: example Decomposition of a Xilophone signal (top) into transient (bottom left) and tonal (bottom right) layers B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

44 Coefficient domain models Estimation algorithms based on observed coefficients Structured model-based estimation algorithm Structured model: implements other a priori information, such as the fact that significant coefficients tend to form clusters, or lines (horizontal or vertical). Several models may be developed, among which Markov models for the significance maps [Molla & Torrésani 2005] Two-levels Bernoulli models (M. Kowalski) The estimation procedure has to be modified accordingly. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

45 Coefficient domain models Estimation algorithms based on observed coefficients Structured model-based estimation algorithm Example: Markov model: (see [Molla & Torrésani 2005]) for the tonal layer, the distribution of the indicator random variables Xkn is characterized by transition matrices ( ) πn 1 π P n = n 1 π n π n, with π n = P { X k+1,n = 1 X k,n = 1 }, π n = P { X k+1,n = 0 X k,n = 0 } In particular, P{X k+1,n = 1, X k,n = 1} > P{X k+1,n = 1}P{X k,n = 1} P{X k+1,n = 0, X k,n = 0} > P{X k+1,n = 0}P{X k,n = 0}. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

46 Coefficient domain models Estimation algorithms based on observed coefficients Structured model-based estimation algorithm The estimation algorithm keeps a similar structure as before... but becomes more complex: the estimation of the significance maps is not local (in the coefficient domain) anymore. Parameter estimation may be performed via EM algorithms The estimation of membership probabilities has to be replaced with the estimation of Markov matrices The estimation of the maps is done using Viterbi algorithm. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

47 Coefficient domain models Estimation algorithms based on observed coefficients Structured model-based algorithm: example Blues Brothers recording: original, tonal, transient and residual. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

48 Coefficient domain models Estimation: synthesis coefficients Estimation algorithms based on synthesis coefficients Alternative: in the framework of such random models, work directly on the distribution of the synthesis coefficients. A hierarchical Bayesian model model can be constructed [Févotte et al 2006], implementing Sparse signal decomposition into a dictionary of two MDCT bases, with Gaussian random coefficients Markov significance maps, implementing structured sets of coefficients Suitable priors for the model coefficients MAP and MMSE estimates are obtained by MCMC algorithms, significantly heavier than the previous ones. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

49 Coefficient domain models Estimation: synthesis coefficients Estimation algorithms based on synthesis coefficients Alternative: in the framework of such random models, work directly on the distribution of the synthesis coefficients. A hierarchical Bayesian model model can be constructed [Févotte et al 2006], implementing Sparse signal decomposition into a dictionary of two MDCT bases, with Gaussian random coefficients Markov significance maps, implementing structured sets of coefficients Suitable priors for the model coefficients MAP and MMSE estimates are obtained by MCMC algorithms, significantly heavier than the previous ones. Example: S. Raman; noisy version; denoised version (MMSE estimate); Tonal; Transient. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

50 Conclusions Conclusion Hybrid expansions generally provide sparser signal representations. The introduction of structured significance maps also improves sparsity, generally at the price of increased computational burden. In addition, tonal layers turn out to be more difficult to model accurately (MDCT bases do not offer the same flexibility as harmonic models). These techniques yield transient + tonal + residual signal decompositions: a sort of elementary (single captor) source separation. These may be exploited for various tasks (denoising, source separation, coding...), in situations where the residual signal is not too important. Otherwise, the residual will have to be modelled. To do next: relax the assumption of independence of the tonal and transient layers. B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

51 Bibliography References T. Blumensath, M.E. Davies, Unsupervised learning of sparse and shift-invariant decompositions of polyphonic music, in: Proceedings of ICASSP 04, vol. 5, 2004, pp. V:497-V:500. R. Carmona, W.L. Hwang, and B. Torrésani. Practical Time-Frequency Analysis: continuous wavelet and Gabor transforms, with an implementation in S, volume 9 of Wavelet Analysis and its Applications. Academic Press, San Diego, S.S. Chen, D.L. Donoho and M.A. Saunders, Atomic Decomposition by Basis Pursuit SIAM Journal on Scientific Computing 20:1 (1998), pp L. Daudet. Sparse and structured decompositions of signals with the molecular matching pursuit. IEEE Transactions on Acoustics, Speech, and Signal Processing, 2006, to appear. L. Daudet and B. Torrésani, Sparse adaptive representations for musical signals, Technical report, to appear in Signal processing for music transcription, M. Davy and A. Klapuri Eds. (2005). B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

52 Bibliography References M. Dörfler, Gabor Analysis for a Class of Signals called Music, PhD Dissertation, 2002, Mathematics Department, University of Vienna. C. Fevotte, L. Daudet, S.J. Godsill and B. Torrésani, Sparse Regression with Structured Priors: Application to Audio Denoising. Proceedings of ICASSP 2006, Volume: 3, pp. III-57 - III-60. C. Févotte, B. Torrésani, L. Daudet and S. Godsill, Denoising of musical audio using sparse linear regression and structured priors, submitted. F. Jaillet and B. Torrésani, Remarques sur l adaptativit des reprsentations temps-frquence Proceedings of the GRETSI 03 conference, Vol 1, pp F. Jaillet and B. Torrésani, Time-Frequency Jigsaw Puzzles, To appear in Int. J. on Wavelets and Multiresolution Information Processing (2006). B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

53 Bibliography References M. Kowalski, and B. Torrésani, A Family of Random Waveform Models for Audio Coding, Proceedings of ICASSP 2006, Volume: 3, pp III III-475. S. Mallat and Z. Zhang. Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing, 41: , S. Mallat. A wavelet tour of signal processing. Academic Press, S. Molla and B. Torrésani. Hybrid Audio Scheme using Hidden Markov Models of Waveforms Applied and Computational Harmonic Analysis 18 (2005), pp M. V. Wickerhauser. Adapted Wavelet Analysis from Theory to Software. AK Peters, Boston, MA, USA, B. Torrésani (LATP Marseille) Sparse Time-Frequency Transforms September / 41

AN HYBRID AUDIO SCHEME USING HIDDEN MARKOV MODELS OF WAVEFORMS

AN HYBRID AUDIO SCHEME USING HIDDEN MARKOV MODELS OF WAVEFORMS AN HYBRID AUDIO SCHEME USING HIDDEN MARKOV MODELS OF WAVEFORMS S. MOLLA AND B. TORRESANI Abstract. This paper reports on recent results related to audiophonic signals encoding using time-scale and time-frequency

More information

Sparse linear models

Sparse linear models Sparse linear models Optimization-Based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/obda_spring16 Carlos Fernandez-Granda 2/22/2016 Introduction Linear transforms Frequency representation Short-time

More information

PERCEPTUAL MATCHING PURSUIT WITH GABOR DICTIONARIES AND TIME-FREQUENCY MASKING. Gilles Chardon, Thibaud Necciari, and Peter Balazs

PERCEPTUAL MATCHING PURSUIT WITH GABOR DICTIONARIES AND TIME-FREQUENCY MASKING. Gilles Chardon, Thibaud Necciari, and Peter Balazs 21 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) PERCEPTUAL MATCHING PURSUIT WITH GABOR DICTIONARIES AND TIME-FREQUENCY MASKING Gilles Chardon, Thibaud Necciari, and

More information

HYBRID SPARSE AND LOW-RANK TIME-FREQUENCY SIGNAL DECOMPOSITION

HYBRID SPARSE AND LOW-RANK TIME-FREQUENCY SIGNAL DECOMPOSITION HYBRID SPARSE AND LOW-RANK TIME-FREQUENCY SIGNAL DECOMPOSITION Cédric Févotte, Matthieu Kowalski, Laboratoire Lagrange (CNRS, OCA & Université Nice Sophia Antipolis), Nice, France Laboratoire des Signaux

More information

Sparse linear models and denoising

Sparse linear models and denoising Lecture notes 4 February 22, 2016 Sparse linear models and denoising 1 Introduction 1.1 Definition and motivation Finding representations of signals that allow to process them more effectively is a central

More information

Wavelet Footprints: Theory, Algorithms, and Applications

Wavelet Footprints: Theory, Algorithms, and Applications 1306 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 5, MAY 2003 Wavelet Footprints: Theory, Algorithms, and Applications Pier Luigi Dragotti, Member, IEEE, and Martin Vetterli, Fellow, IEEE Abstract

More information

Estimation Error Bounds for Frame Denoising

Estimation Error Bounds for Frame Denoising Estimation Error Bounds for Frame Denoising Alyson K. Fletcher and Kannan Ramchandran {alyson,kannanr}@eecs.berkeley.edu Berkeley Audio-Visual Signal Processing and Communication Systems group Department

More information

Lecture Notes 5: Multiresolution Analysis

Lecture Notes 5: Multiresolution Analysis Optimization-based data analysis Fall 2017 Lecture Notes 5: Multiresolution Analysis 1 Frames A frame is a generalization of an orthonormal basis. The inner products between the vectors in a frame and

More information

An Introduction to Wavelets and some Applications

An Introduction to Wavelets and some Applications An Introduction to Wavelets and some Applications Milan, May 2003 Anestis Antoniadis Laboratoire IMAG-LMC University Joseph Fourier Grenoble, France An Introduction to Wavelets and some Applications p.1/54

More information

A simple test to check the optimality of sparse signal approximations

A simple test to check the optimality of sparse signal approximations A simple test to check the optimality of sparse signal approximations Rémi Gribonval, Rosa Maria Figueras I Ventura, Pierre Vergheynst To cite this version: Rémi Gribonval, Rosa Maria Figueras I Ventura,

More information

Some aspects of Time-Frequency multipliers

Some aspects of Time-Frequency multipliers Some aspects of Time-Frequency multipliers B. Torrésani LATP, Université de Provence, Marseille, France MulAc kickoff meeting, September 28 The group in Marseille LMA : Olivier Derrien, Richard Kronland-Martinet,

More information

Multiple Change Point Detection by Sparse Parameter Estimation

Multiple Change Point Detection by Sparse Parameter Estimation Multiple Change Point Detection by Sparse Parameter Estimation Department of Econometrics Fac. of Economics and Management University of Defence Brno, Czech Republic Dept. of Appl. Math. and Comp. Sci.

More information

HYBRID REPRESENTATIONS FOR AUDIOPHONIC SIGNAL ENCODING. 1. Introduction

HYBRID REPRESENTATIONS FOR AUDIOPHONIC SIGNAL ENCODING. 1. Introduction HYBRID REPRESENTATIONS FOR AUDIOPHONIC SIGNAL ENCODING L. DAUDET AND B. TORRÉSANI Abstract. We discuss in this paper a new approach for signal models in the context of audio signal encoding. The method

More information

Recent developments on sparse representation

Recent developments on sparse representation Recent developments on sparse representation Zeng Tieyong Department of Mathematics, Hong Kong Baptist University Email: zeng@hkbu.edu.hk Hong Kong Baptist University Dec. 8, 2008 First Previous Next Last

More information

Denoising and Compression Using Wavelets

Denoising and Compression Using Wavelets Denoising and Compression Using Wavelets December 15,2016 Juan Pablo Madrigal Cianci Trevor Giannini Agenda 1 Introduction Mathematical Theory Theory MATLAB s Basic Commands De-Noising: Signals De-Noising:

More information

174 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 1, JANUARY 2008

174 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 1, JANUARY 2008 174 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 1, JANUARY 2008 Sparse Linear Regression With Structured Priors Application to Denoising of Musical Audio Cédric Févotte, Bruno

More information

Bayesian Estimation of Time-Frequency Coefficients for Audio Signal Enhancement

Bayesian Estimation of Time-Frequency Coefficients for Audio Signal Enhancement Bayesian Estimation of Time-Frequency Coefficients for Audio Signal Enhancement Patrick J. Wolfe Department of Engineering University of Cambridge Cambridge CB2 1PZ, UK pjw47@eng.cam.ac.uk Simon J. Godsill

More information

Introduction to Wavelets and Wavelet Transforms

Introduction to Wavelets and Wavelet Transforms Introduction to Wavelets and Wavelet Transforms A Primer C. Sidney Burrus, Ramesh A. Gopinath, and Haitao Guo with additional material and programs by Jan E. Odegard and Ivan W. Selesnick Electrical and

More information

Introduction to Discrete-Time Wavelet Transform

Introduction to Discrete-Time Wavelet Transform Introduction to Discrete-Time Wavelet Transform Selin Aviyente Department of Electrical and Computer Engineering Michigan State University February 9, 2010 Definition of a Wavelet A wave is usually defined

More information

A Generalized Uncertainty Principle and Sparse Representation in Pairs of Bases

A Generalized Uncertainty Principle and Sparse Representation in Pairs of Bases 2558 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL 48, NO 9, SEPTEMBER 2002 A Generalized Uncertainty Principle Sparse Representation in Pairs of Bases Michael Elad Alfred M Bruckstein Abstract An elementary

More information

MATCHING PURSUIT WITH STOCHASTIC SELECTION

MATCHING PURSUIT WITH STOCHASTIC SELECTION 2th European Signal Processing Conference (EUSIPCO 22) Bucharest, Romania, August 27-3, 22 MATCHING PURSUIT WITH STOCHASTIC SELECTION Thomas Peel, Valentin Emiya, Liva Ralaivola Aix-Marseille Université

More information

Morphological Diversity and Source Separation

Morphological Diversity and Source Separation Morphological Diversity and Source Separation J. Bobin, Y. Moudden, J.-L. Starck, and M. Elad Abstract This paper describes a new method for blind source separation, adapted to the case of sources having

More information

EUSIPCO

EUSIPCO EUSIPCO 013 1569746769 SUBSET PURSUIT FOR ANALYSIS DICTIONARY LEARNING Ye Zhang 1,, Haolong Wang 1, Tenglong Yu 1, Wenwu Wang 1 Department of Electronic and Information Engineering, Nanchang University,

More information

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS Emad M. Grais and Hakan Erdogan Faculty of Engineering and Natural Sciences, Sabanci University, Orhanli

More information

c 2011 International Press Vol. 18, No. 1, pp , March DENNIS TREDE

c 2011 International Press Vol. 18, No. 1, pp , March DENNIS TREDE METHODS AND APPLICATIONS OF ANALYSIS. c 2011 International Press Vol. 18, No. 1, pp. 105 110, March 2011 007 EXACT SUPPORT RECOVERY FOR LINEAR INVERSE PROBLEMS WITH SPARSITY CONSTRAINTS DENNIS TREDE Abstract.

More information

Review: Learning Bimodal Structures in Audio-Visual Data

Review: Learning Bimodal Structures in Audio-Visual Data Review: Learning Bimodal Structures in Audio-Visual Data CSE 704 : Readings in Joint Visual, Lingual and Physical Models and Inference Algorithms Suren Kumar Vision and Perceptual Machines Lab 106 Davis

More information

Decompositions of frames and a new frame identity

Decompositions of frames and a new frame identity Decompositions of frames and a new frame identity Radu Balan a, Peter G. Casazza b, Dan Edidin c and Gitta Kutyniok d a Siemens Corporate Research, 755 College Road East, Princeton, NJ 08540, USA; b Department

More information

Image Denoising using Uniform Curvelet Transform and Complex Gaussian Scale Mixture

Image Denoising using Uniform Curvelet Transform and Complex Gaussian Scale Mixture EE 5359 Multimedia Processing Project Report Image Denoising using Uniform Curvelet Transform and Complex Gaussian Scale Mixture By An Vo ISTRUCTOR: Dr. K. R. Rao Summer 008 Image Denoising using Uniform

More information

MIT 9.520/6.860, Fall 2017 Statistical Learning Theory and Applications. Class 19: Data Representation by Design

MIT 9.520/6.860, Fall 2017 Statistical Learning Theory and Applications. Class 19: Data Representation by Design MIT 9.520/6.860, Fall 2017 Statistical Learning Theory and Applications Class 19: Data Representation by Design What is data representation? Let X be a data-space X M (M) F (M) X A data representation

More information

BAYESIAN COMPUTATIONAL METHODS FOR SPARSE AUDIO AND MUSIC PROCESSING

BAYESIAN COMPUTATIONAL METHODS FOR SPARSE AUDIO AND MUSIC PROCESSING 15th European Signal Processing Conference EUSIPCO 2007, Poznan, Poland, September 3-7, 2007, copyright by EURASIP BAYESIAN COMPUTATIONAL METHODS FOR SPARSE AUDIO AND MUSIC PROCESSING S.J. Godsill 1, A.T.

More information

Digital Image Processing

Digital Image Processing Digital Image Processing, 2nd ed. Digital Image Processing Chapter 7 Wavelets and Multiresolution Processing Dr. Kai Shuang Department of Electronic Engineering China University of Petroleum shuangkai@cup.edu.cn

More information

Statistical approach for dictionary learning

Statistical approach for dictionary learning Statistical approach for dictionary learning Tieyong ZENG Joint work with Alain Trouvé Page 1 Introduction Redundant dictionary Coding, denoising, compression. Existing algorithms to generate dictionary

More information

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Tuomas Virtanen and Anssi Klapuri Tampere University of Technology, Institute of Signal Processing Korkeakoulunkatu

More information

Sparse signal representation and the tunable Q-factor wavelet transform

Sparse signal representation and the tunable Q-factor wavelet transform Sparse signal representation and the tunable Q-factor wavelet transform Ivan Selesnick Polytechnic Institute of New York University Brooklyn, New York Introduction Problem: Decomposition of a signal into

More information

Design of Image Adaptive Wavelets for Denoising Applications

Design of Image Adaptive Wavelets for Denoising Applications Design of Image Adaptive Wavelets for Denoising Applications Sanjeev Pragada and Jayanthi Sivaswamy Center for Visual Information Technology International Institute of Information Technology - Hyderabad,

More information

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION Vol. VI - System Identification Using Wavelets - Daniel Coca and Stephen A. Billings

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION Vol. VI - System Identification Using Wavelets - Daniel Coca and Stephen A. Billings SYSTEM IDENTIFICATION USING WAVELETS Daniel Coca Department of Electrical Engineering and Electronics, University of Liverpool, UK Department of Automatic Control and Systems Engineering, University of

More information

The Generalized Haar-Walsh Transform (GHWT) for Data Analysis on Graphs and Networks

The Generalized Haar-Walsh Transform (GHWT) for Data Analysis on Graphs and Networks The Generalized Haar-Walsh Transform (GHWT) for Data Analysis on Graphs and Networks Jeff Irion & Naoki Saito Department of Mathematics University of California, Davis SIAM Annual Meeting 2014 Chicago,

More information

Compressed Sensing and Sparse Recovery

Compressed Sensing and Sparse Recovery ELE 538B: Sparsity, Structure and Inference Compressed Sensing and Sparse Recovery Yuxin Chen Princeton University, Spring 217 Outline Restricted isometry property (RIP) A RIPless theory Compressed sensing

More information

STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS

STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS 19th European Signal Processing Conference (EUSIPCO 2011) Barcelona, Spain, August 29 - September 2, 2011 STRUCTURE-AWARE DICTIONARY LEARNING WITH HARMONIC ATOMS Ken O Hanlon and Mark D.Plumbley Queen

More information

Low-Rank Time-Frequency Synthesis

Low-Rank Time-Frequency Synthesis Low-Rank - Synthesis Cédric Févotte Laboratoire Lagrange CNRS, OCA & Université de Nice Nice, France cfevotte@unice.fr Matthieu Kowalski Laboratoire des Signaux et Systèmes CNRS, Supélec & Université Paris-Sud

More information

Computational Harmonic Analysis (Wavelet Tutorial) Part II

Computational Harmonic Analysis (Wavelet Tutorial) Part II Computational Harmonic Analysis (Wavelet Tutorial) Part II Understanding Many Particle Systems with Machine Learning Tutorials Matthew Hirn Michigan State University Department of Computational Mathematics,

More information

Sparse Solutions of Systems of Equations and Sparse Modelling of Signals and Images

Sparse Solutions of Systems of Equations and Sparse Modelling of Signals and Images Sparse Solutions of Systems of Equations and Sparse Modelling of Signals and Images Alfredo Nava-Tudela ant@umd.edu John J. Benedetto Department of Mathematics jjb@umd.edu Abstract In this project we are

More information

Atomic decompositions of square-integrable functions

Atomic decompositions of square-integrable functions Atomic decompositions of square-integrable functions Jordy van Velthoven Abstract This report serves as a survey for the discrete expansion of square-integrable functions of one real variable on an interval

More information

Approximately dual frames in Hilbert spaces and applications to Gabor frames

Approximately dual frames in Hilbert spaces and applications to Gabor frames Approximately dual frames in Hilbert spaces and applications to Gabor frames Ole Christensen and Richard S. Laugesen October 22, 200 Abstract Approximately dual frames are studied in the Hilbert space

More information

Oracle Analysis of Sparse Automatic Music Transcription

Oracle Analysis of Sparse Automatic Music Transcription Oracle Analysis of Sparse Automatic Music Transcription Ken O Hanlon, Hidehisa Nagano, and Mark D. Plumbley Queen Mary University of London NTT Communication Science Laboratories, NTT Corporation {keno,nagano,mark.plumbley}@eecs.qmul.ac.uk

More information

Contents. 0.1 Notation... 3

Contents. 0.1 Notation... 3 Contents 0.1 Notation........................................ 3 1 A Short Course on Frame Theory 4 1.1 Examples of Signal Expansions............................ 4 1.2 Signal Expansions in Finite-Dimensional

More information

A REDUCED MULTIPLE GABOR FRAME FOR LOCAL TIME ADAPTATION OF THE SPECTROGRAM

A REDUCED MULTIPLE GABOR FRAME FOR LOCAL TIME ADAPTATION OF THE SPECTROGRAM A REDUCED MULTIPLE GABOR FRAME FOR LOCAL TIME ADAPTATION OF THE SPECTROGRAM Marco Liuni, Università di Firenze, Dip. di Matematica U. Dini, Viale Morgagni, 67/a - 5034 Florence - ITALY IRCAM - CNRS STMS,

More information

Scale Mixture Modeling of Priors for Sparse Signal Recovery

Scale Mixture Modeling of Priors for Sparse Signal Recovery Scale Mixture Modeling of Priors for Sparse Signal Recovery Bhaskar D Rao 1 University of California, San Diego 1 Thanks to David Wipf, Jason Palmer, Zhilin Zhang and Ritwik Giri Outline Outline Sparse

More information

EMPLOYING PHASE INFORMATION FOR AUDIO DENOISING. İlker Bayram. Istanbul Technical University, Istanbul, Turkey

EMPLOYING PHASE INFORMATION FOR AUDIO DENOISING. İlker Bayram. Istanbul Technical University, Istanbul, Turkey EMPLOYING PHASE INFORMATION FOR AUDIO DENOISING İlker Bayram Istanbul Technical University, Istanbul, Turkey ABSTRACT Spectral audio denoising methods usually make use of the magnitudes of a time-frequency

More information

Sparsity in Underdetermined Systems

Sparsity in Underdetermined Systems Sparsity in Underdetermined Systems Department of Statistics Stanford University August 19, 2005 Classical Linear Regression Problem X n y p n 1 > Given predictors and response, y Xβ ε = + ε N( 0, σ 2

More information

Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p.

Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p. Preface p. xvii Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p. 6 Summary p. 10 Projects and Problems

More information

5742 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 55, NO. 12, DECEMBER /$ IEEE

5742 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 55, NO. 12, DECEMBER /$ IEEE 5742 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 55, NO. 12, DECEMBER 2009 Uncertainty Relations for Shift-Invariant Analog Signals Yonina C. Eldar, Senior Member, IEEE Abstract The past several years

More information

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES

REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES REVIEW OF SINGLE CHANNEL SOURCE SEPARATION TECHNIQUES Kedar Patki University of Rochester Dept. of Electrical and Computer Engineering kedar.patki@rochester.edu ABSTRACT The paper reviews the problem of

More information

Compressed sensing. Or: the equation Ax = b, revisited. Terence Tao. Mahler Lecture Series. University of California, Los Angeles

Compressed sensing. Or: the equation Ax = b, revisited. Terence Tao. Mahler Lecture Series. University of California, Los Angeles Or: the equation Ax = b, revisited University of California, Los Angeles Mahler Lecture Series Acquiring signals Many types of real-world signals (e.g. sound, images, video) can be viewed as an n-dimensional

More information

Wavelets and Signal Processing

Wavelets and Signal Processing Wavelets and Signal Processing John E. Gilbert Mathematics in Science Lecture April 30, 2002. Publicity Mathematics In Science* A LECTURE SERIES FOR UNDERGRADUATES Wavelets Professor John Gilbert Mathematics

More information

Bayesian Paradigm. Maximum A Posteriori Estimation

Bayesian Paradigm. Maximum A Posteriori Estimation Bayesian Paradigm Maximum A Posteriori Estimation Simple acquisition model noise + degradation Constraint minimization or Equivalent formulation Constraint minimization Lagrangian (unconstraint minimization)

More information

arxiv: v1 [math.ca] 6 Feb 2015

arxiv: v1 [math.ca] 6 Feb 2015 The Fourier-Like and Hartley-Like Wavelet Analysis Based on Hilbert Transforms L. R. Soares H. M. de Oliveira R. J. Cintra Abstract arxiv:150.0049v1 [math.ca] 6 Feb 015 In continuous-time wavelet analysis,

More information

Transients Detection in the Time-Scale Domain

Transients Detection in the Time-Scale Domain Transients Detection in the Time-Scale Domain V. Bruni and D. Vitulano Istituto per le Applicazioni del Calcolo - C.N.R., Viale del Policlinico 37, 6 Rome Italy {bruni,vitulano}@iac.rm.cnr.it Abstract.

More information

Sparse signal representation and the tunable Q-factor wavelet transform

Sparse signal representation and the tunable Q-factor wavelet transform Sparse signal representation and the tunable Q-factor wavelet transform Ivan Selesnick Polytechnic Institute of New York University Brooklyn, New York Introduction Problem: Decomposition of a signal into

More information

Independent Component Analysis and Unsupervised Learning. Jen-Tzung Chien

Independent Component Analysis and Unsupervised Learning. Jen-Tzung Chien Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent voices Nonparametric likelihood

More information

Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation

Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation CENTER FOR COMPUTER RESEARCH IN MUSIC AND ACOUSTICS DEPARTMENT OF MUSIC, STANFORD UNIVERSITY REPORT NO. STAN-M-4 Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation

More information

Digital Image Processing Lectures 15 & 16

Digital Image Processing Lectures 15 & 16 Lectures 15 & 16, Professor Department of Electrical and Computer Engineering Colorado State University CWT and Multi-Resolution Signal Analysis Wavelet transform offers multi-resolution by allowing for

More information

Recovery of Compressible Signals in Unions of Subspaces

Recovery of Compressible Signals in Unions of Subspaces 1 Recovery of Compressible Signals in Unions of Subspaces Marco F. Duarte, Chinmay Hegde, Volkan Cevher, and Richard G. Baraniuk Department of Electrical and Computer Engineering Rice University Abstract

More information

Low-Complexity Image Denoising via Analytical Form of Generalized Gaussian Random Vectors in AWGN

Low-Complexity Image Denoising via Analytical Form of Generalized Gaussian Random Vectors in AWGN Low-Complexity Image Denoising via Analytical Form of Generalized Gaussian Random Vectors in AWGN PICHID KITTISUWAN Rajamangala University of Technology (Ratanakosin), Department of Telecommunication Engineering,

More information

Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation

Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation Simon Arberet 1, Alexey Ozerov 2, Rémi Gribonval 1, and Frédéric Bimbot 1 1 METISS Group, IRISA-INRIA Campus de Beaulieu,

More information

SPARSE ATOMIC MODELING OF AUDIO: A REVIEW

SPARSE ATOMIC MODELING OF AUDIO: A REVIEW SPARSE ATOMIC MODELING OF AUDIO: A REVIEW Corey Kereliuk, SPCL & CIRMMT McGill University, Montréal, Canada corey.kereliuk@mail.mcgill.ca Philippe Depalle, SPCL & CIRMMT McGill University, Montréal, Canada

More information

IN many image processing applications involving wavelets

IN many image processing applications involving wavelets IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. X, NO. Y, MM 2007 1 Phase-Shifting for Non-separable 2D Haar Wavelets (This correspondence extends our earlier paper also under review with IEEE TIP to 2D non-separable

More information

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION Hiroazu Kameoa The University of Toyo / Nippon Telegraph and Telephone Corporation ABSTRACT This paper proposes a novel

More information

A primer on the theory of frames

A primer on the theory of frames A primer on the theory of frames Jordy van Velthoven Abstract This report aims to give an overview of frame theory in order to gain insight in the use of the frame framework as a unifying layer in the

More information

Inverse problems and sparse models (1/2) Rémi Gribonval INRIA Rennes - Bretagne Atlantique, France

Inverse problems and sparse models (1/2) Rémi Gribonval INRIA Rennes - Bretagne Atlantique, France Inverse problems and sparse models (1/2) Rémi Gribonval INRIA Rennes - Bretagne Atlantique, France remi.gribonval@inria.fr Structure of the tutorial Session 1: Introduction to inverse problems & sparse

More information

Edge preserved denoising and singularity extraction from angles gathers

Edge preserved denoising and singularity extraction from angles gathers Edge preserved denoising and singularity extraction from angles gathers Felix Herrmann, EOS-UBC Martijn de Hoop, CSM Joint work Geophysical inversion theory using fractional spline wavelets: ffl Jonathan

More information

446 SCIENCE IN CHINA (Series F) Vol. 46 introduced in refs. [6, ]. Based on this inequality, we add normalization condition, symmetric conditions and

446 SCIENCE IN CHINA (Series F) Vol. 46 introduced in refs. [6, ]. Based on this inequality, we add normalization condition, symmetric conditions and Vol. 46 No. 6 SCIENCE IN CHINA (Series F) December 003 Construction for a class of smooth wavelet tight frames PENG Lizhong (Λ Π) & WANG Haihui (Ξ ) LMAM, School of Mathematical Sciences, Peking University,

More information

Musical Audio Denoising Assuming Symmetric α-stable Noise

Musical Audio Denoising Assuming Symmetric α-stable Noise Proceedings ICMC SMC 4 4- September 4, Athens, Greece Musical Audio Denoising Assuming Symmetric α-stable Noise Nikoletta Bassiou Constantine Kotropoulos Department of Informatics Aristotle University

More information

PCA & ICA. CE-717: Machine Learning Sharif University of Technology Spring Soleymani

PCA & ICA. CE-717: Machine Learning Sharif University of Technology Spring Soleymani PCA & ICA CE-717: Machine Learning Sharif University of Technology Spring 2015 Soleymani Dimensionality Reduction: Feature Selection vs. Feature Extraction Feature selection Select a subset of a given

More information

Handbook of Blind Source Separation, Independent Component Analysis and Applications. P. Comon and C. Jutten Eds

Handbook of Blind Source Separation, Independent Component Analysis and Applications. P. Comon and C. Jutten Eds Handbook of Blind Source Separation, Independent Component Analysis and Applications P. Comon and C. Jutten Eds October 8, 9 Glossary x vector of components x p, p P s, x, y sources, observations, separator

More information

A simple test to check the optimality of sparse signal approximations

A simple test to check the optimality of sparse signal approximations A simple test to check the optimality of sparse signal approximations Rémi Gribonval, Rosa Maria Figueras I Ventura, Pierre Vandergheynst To cite this version: Rémi Gribonval, Rosa Maria Figueras I Ventura,

More information

Wavelets and multiresolution representations. Time meets frequency

Wavelets and multiresolution representations. Time meets frequency Wavelets and multiresolution representations Time meets frequency Time-Frequency resolution Depends on the time-frequency spread of the wavelet atoms Assuming that ψ is centred in t=0 Signal domain + t

More information

Sparse Approximation and Variable Selection

Sparse Approximation and Variable Selection Sparse Approximation and Variable Selection Lorenzo Rosasco 9.520 Class 07 February 26, 2007 About this class Goal To introduce the problem of variable selection, discuss its connection to sparse approximation

More information

2D Wavelets. Hints on advanced Concepts

2D Wavelets. Hints on advanced Concepts 2D Wavelets Hints on advanced Concepts 1 Advanced concepts Wavelet packets Laplacian pyramid Overcomplete bases Discrete wavelet frames (DWF) Algorithme à trous Discrete dyadic wavelet frames (DDWF) Overview

More information

Sparsity Measure and the Detection of Significant Data

Sparsity Measure and the Detection of Significant Data Sparsity Measure and the Detection of Significant Data Abdourrahmane Atto, Dominique Pastor, Grégoire Mercier To cite this version: Abdourrahmane Atto, Dominique Pastor, Grégoire Mercier. Sparsity Measure

More information

An Introduction to HILBERT-HUANG TRANSFORM and EMPIRICAL MODE DECOMPOSITION (HHT-EMD) Advanced Structural Dynamics (CE 20162)

An Introduction to HILBERT-HUANG TRANSFORM and EMPIRICAL MODE DECOMPOSITION (HHT-EMD) Advanced Structural Dynamics (CE 20162) An Introduction to HILBERT-HUANG TRANSFORM and EMPIRICAL MODE DECOMPOSITION (HHT-EMD) Advanced Structural Dynamics (CE 20162) M. Ahmadizadeh, PhD, PE O. Hemmati 1 Contents Scope and Goals Review on transformations

More information

Independent Component Analysis and Unsupervised Learning

Independent Component Analysis and Unsupervised Learning Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien National Cheng Kung University TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent

More information

An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding

An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding Beibei Wang, Yao Wang, Ivan Selesnick and Anthony Vetro TR2004-132 December

More information

A Novel Fast Computing Method for Framelet Coefficients

A Novel Fast Computing Method for Framelet Coefficients American Journal of Applied Sciences 5 (11): 15-157, 008 ISSN 1546-939 008 Science Publications A Novel Fast Computing Method for Framelet Coefficients Hadeel N. Al-Taai Department of Electrical and Electronic

More information

Representation of sound fields for audio recording and reproduction

Representation of sound fields for audio recording and reproduction Representation of sound fields for audio recording and reproduction F. M. Fazi a, M. Noisternig b and O. Warusfel b a University of Southampton, Highfield, SO171BJ Southampton, UK b Institut de Recherche

More information

On the Projection Matrices Influence in the Classification of Compressed Sensed ECG Signals

On the Projection Matrices Influence in the Classification of Compressed Sensed ECG Signals On the Projection Matrices Influence in the Classification of Compressed Sensed ECG Signals Monica Fira, Liviu Goras Institute of Computer Science Romanian Academy Iasi, Romania Liviu Goras, Nicolae Cleju,

More information

Orthogonal Matching Pursuit for Sparse Signal Recovery With Noise

Orthogonal Matching Pursuit for Sparse Signal Recovery With Noise Orthogonal Matching Pursuit for Sparse Signal Recovery With Noise The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published

More information

Fast Hard Thresholding with Nesterov s Gradient Method

Fast Hard Thresholding with Nesterov s Gradient Method Fast Hard Thresholding with Nesterov s Gradient Method Volkan Cevher Idiap Research Institute Ecole Polytechnique Federale de ausanne volkan.cevher@epfl.ch Sina Jafarpour Department of Computer Science

More information

Which wavelet bases are the best for image denoising?

Which wavelet bases are the best for image denoising? Which wavelet bases are the best for image denoising? Florian Luisier a, Thierry Blu a, Brigitte Forster b and Michael Unser a a Biomedical Imaging Group (BIG), Ecole Polytechnique Fédérale de Lausanne

More information

Deep Learning: Approximation of Functions by Composition

Deep Learning: Approximation of Functions by Composition Deep Learning: Approximation of Functions by Composition Zuowei Shen Department of Mathematics National University of Singapore Outline 1 A brief introduction of approximation theory 2 Deep learning: approximation

More information

Invariant Scattering Convolution Networks

Invariant Scattering Convolution Networks Invariant Scattering Convolution Networks Joan Bruna and Stephane Mallat Submitted to PAMI, Feb. 2012 Presented by Bo Chen Other important related papers: [1] S. Mallat, A Theory for Multiresolution Signal

More information

On the Hilbert Transform of Wavelets

On the Hilbert Transform of Wavelets On the Hilbert Transform of Wavelets Kunal Narayan Chaudhury and Michael Unser Abstract A wavelet is a localized function having a prescribed number of vanishing moments. In this correspondence, we provide

More information

MLCC 2018 Variable Selection and Sparsity. Lorenzo Rosasco UNIGE-MIT-IIT

MLCC 2018 Variable Selection and Sparsity. Lorenzo Rosasco UNIGE-MIT-IIT MLCC 2018 Variable Selection and Sparsity Lorenzo Rosasco UNIGE-MIT-IIT Outline Variable Selection Subset Selection Greedy Methods: (Orthogonal) Matching Pursuit Convex Relaxation: LASSO & Elastic Net

More information

Equivalence Probability and Sparsity of Two Sparse Solutions in Sparse Representation

Equivalence Probability and Sparsity of Two Sparse Solutions in Sparse Representation IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 19, NO. 12, DECEMBER 2008 2009 Equivalence Probability and Sparsity of Two Sparse Solutions in Sparse Representation Yuanqing Li, Member, IEEE, Andrzej Cichocki,

More information

An Introduction to Sparse Approximation

An Introduction to Sparse Approximation An Introduction to Sparse Approximation Anna C. Gilbert Department of Mathematics University of Michigan Basic image/signal/data compression: transform coding Approximate signals sparsely Compress images,

More information

Bayesian harmonic models for musical signal analysis. Simon Godsill and Manuel Davy

Bayesian harmonic models for musical signal analysis. Simon Godsill and Manuel Davy Bayesian harmonic models for musical signal analysis Simon Godsill and Manuel Davy June 2, 2002 Cambridge University Engineering Department and IRCCyN UMR CNRS 6597 The work of both authors was partially

More information

MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING

MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING Yannick Morvan, Dirk Farin University of Technology Eindhoven 5600 MB Eindhoven, The Netherlands email: {y.morvan;d.s.farin}@tue.nl Peter

More information

Multiresolution Analysis

Multiresolution Analysis Multiresolution Analysis DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/obda_fall17/index.html Carlos Fernandez-Granda Frames Short-time Fourier transform

More information

Bayesian Methods for Sparse Signal Recovery

Bayesian Methods for Sparse Signal Recovery Bayesian Methods for Sparse Signal Recovery Bhaskar D Rao 1 University of California, San Diego 1 Thanks to David Wipf, Jason Palmer, Zhilin Zhang and Ritwik Giri Motivation Motivation Sparse Signal Recovery

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 11 Project

More information