arxiv: v1 [cs.it] 4 Nov 2017

Size: px
Start display at page:

Download "arxiv: v1 [cs.it] 4 Nov 2017"

Transcription

1 Separation-Free Super-Resolution from Compressed Measurements is Possible: an Orthonormal Atomic Norm Minimization Approach Weiyu Xu Jirong Yi Soura Dasgupta Jian-Feng Cai arxiv: v1 csit] 4 Nov 17 Mathews Jacob November 7, 17 Abstract Myung Cho We consider the problem of recovering the superposition of R distinct complex exponential functions from compressed non-uniform time-domain samples Total Variation (TV) minimization or atomic norm minimization was proposed in the literature to recover the R frequencies or the missing data However, it is known that in order for TV minimization and atomic norm minimization to recover the missing data or the frequencies, the underlying R frequencies are required to be well-separated, even when the measurements are noiseless This paper shows that the Hankel matrix recovery approach can super-resolve the R complex exponentials and their frequencies from compressed non-uniform measurements, regardless of how close their frequencies are to each other We propose a new concept of orthonormal atomic norm minimization (OANM), and demonstrate that the success of Hankel matrix recovery in separation-free superresolution comes from the fact that the nuclear norm of a Hankel matrix is an orthonormal atomic norm More specifically, we show that, in traditional atomic norm minimization, the underlying parameter values must be well separated to achieve successful signal recovery, if the atoms are changing continuously with respect to the continuously-valued parameter In contrast, for OANM, it is possible the OANM is successful even though the original atoms can be arbitrarily close As a byproduct of this research, we provide one matrix-theoretic inequality of nuclear norm, and give its proof from the theory of compressed sensing The first two authors contributed equally to this work Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 54 weiyu-xu@uiowaedu Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 54 Yi is co-first author Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 54 Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 1

2 1 Introduction In super-resolution, we are interested in recovering the high-end spectral information of signals from observations of it low-end spectral components 3] In one setting of super-resolution problems, one aims to recover a superposition of complex exponential functions from time-domain samples In fact, many problems arising in science and engineering involve high-dimensional signals that can be modeled or approximated by a superposition of a few complex exponential functions In particular, if we choose the exponential functions to be complex sinusoids, this superposition of complex exponentials models signals in acceleration of medical imaging 18], analog-to-digital conversion 9], and signals in array signal processing 4] Accelerated NMR (Nuclear magnetic resonance) spectroscopy, which is a prerequisite for studying short-lived molecular systems and monitoring chemical reactions in real time, is another application where signals can be modeled or approximated by a superposition of complex exponential functions How to recover the superposition of complex exponential functions or parameters of these complex exponential functions is of prominent importance in these applications In this paper, we consider how to recover those superposition of complex exponential from linear measurements More specifically, let x C N 1 be a vector satisfying x j = R k=1 c k z j k, j =, 1,, N, (1) where z k C, k = 1,, R, are some unknown complex numbers with R being a positive integer In other words, x is a superposition of R complex exponential functions We assume R N 1 When z k = 1, k = 1,, R, x is a superposition of complex sinusoid When z k = e τ k e πıf k, k = 1,, R, ı = 1, x can model the signal in NMR spectroscopy Since R N 1 and often R N 1, the degree of freedom to determine x is much less than the ambient dimension N 1 Therefore, it is possible to recover x from its under sampling 5,11] In particular, we consider recovering x from its linear measurements b = A(x), () where A is a linear mapping to C M, M < N 1 After x is recovered, we can use the singlesnapshot MUSIC or the Prony s method to recover the parameter z k s The problem on recovering x from its linear measurements () can be solved using Compressed Sensing (CS) 5], by discretizing the dictionary of basis vectors into grid points corresponding to discrete values of z k When the parameters f k s in signals from spectral compressed sensing or (f k, τ k ) s from signals in accelerated NMR spectroscopy indeed fall on the grid, CS is a powerful tool to recover those signals even when the number of samples is far below its ambient dimension (R N 1) 5, 11] Nevertheless, the parameters in our problem setting often take continuous values, leading to a continuous dictionary, and may not exactly fall on a grid The basis mismatch problem between the continuously-valued parameters and the grid-valued parameters degenerates the performance of conventional compressed sensing 8] In two seminal papers 3, 7], the authors proposed to use the total variation minimization or the atomic norm minimization to recover x or to recover the parameter z k, when z k = e ıπf k with f k taking continuous values from, 1) In these two papers, the author showed that the TV minimization or the atomic norm minimization can recover correctly the continuously-valued frequency f k s when there are no observation noises However, as shown in 3, 6, 7], in order for

3 the TV minimization or the atomic norm minimization to recover spectrally sparse data or the associated frequencies correctly, it is necessary to require that adjacent frequencies be separated far enough from each other For example, for complex exponentials with z k s taking values on the complex unit circle, it is required that their adjacent frequencies f k, 1] s be at least N 1 apart This separation condition is necessary, even if we observe the full (N 1) data samples, and even if the observations are noiseless This raises a natural question, Can we super-resolve the superposition of complex exponentials with continuously-valued parameter z k, without requiring frequency separations, from compressed measurements? In this paper, we answer this question in positive More specifically, we show that a Hankel matrix recovery approach using nuclear norm minimization can super-resolve the superposition of complex exponentials with continuously-valued parameter z k, without requiring frequency separations, from compressed measurements This separation-free super-resolution result holds even when we only compressively observe x over a subset M {,, N } In this paper, we give the worst-case and average-case performance guarantees of Hankel matrix recovery in recovering the superposition of complex exponentials In establishing the worst-case performance guarantees, we establish the conditions under which the Hankel matrix recovery can recover the underlying complex exponentials, no matter what values the coefficients c k s of the complex exponentials take For the average-case performance guarantee, we assume that the phases of the coefficients c k s are uniformly distributed over, π) For both the worst-case and average-case performance guarantees, we establish that Hankel matrix recovery can super-resolve complex exponentials with continuously-valued parameters z k s, no matter how close two adjacent frequencies or parameters z k s are to each other We further introduce a new concept of orthonormal atomic norm minimization (OANM), and discover that the success of Hankel matrix recovery in separation-free super-resolution comes from the fact that the nuclear norm of the Hankel matrix is an orthonormal atomic norm In particular, we show that, in traditional atomic norm minimization, for successful signal recovery, the underlying parameters must be well separated, if the atoms are changing continuously with respect to the continuously-valued parameters; however, it is possible the OANM is successful even though the original atoms can be arbitrarily close As a byproduct of this research, we discover one interesting matrix-theoretic inequality of nuclear norm, and give its proof from the theory of compressed sensing 11 Comparisons with related works on Hankel matrix recovery and atomic norm minimization Low-rank Hankel matrix recovery approaches were used for recovering parsimonious models in system identifications, control, and signal processing In 19], Markovsky considered low-rank approximations for Hankel structured matrices with applications in signal processing, system identifications and control In 1, 13], Fazel et al introduced low-rank Hankel matrix recovery via nuclear norm minimization, motivated by applications including realizations and identification of linear timeinvariant systems, inferring shapes (points on the complex plane) from moments estimation (which is related to super-resolution with z k from the complex plane), and moment matrix rank minimization for polynomial optimization In 13], Fazel et al further designed optimization algorithms to solve the nuclear norm minimization problem for low-rank Hankel matrix recovery In 6], Chen and Chi proposed to use multi-fold Hankel matrix completion for spectral compressed sensing, studied the performance guarantees of spectral compressed sensing via structured multi-fold Hankel matrix completion, and derived performance guarantees of structured matrix completion However, the 3

4 results in 6] require that the Dirichlet kernel associated with underlying frequencies satisfies certain incoherence conditions, and these conditions require the underlying frequencies to be well separated from each other In 9, 3], the authors derived performance guarantees for Hankel matrix completion in system identifications However, the performance guarantees in 9, 3] require a very specific sampling patterns of fully sampling the upper-triangular part of the Hankel matrix Moreover, the performance guarantees in 9, 3] require that the parameters z k s be very small (or smaller than 1) in magnitude In our earlier work ], we established performance guarantees of Hankel matrix recovery for spectral compressed sensing under Gaussian measurements of x, and by comparison, this paper considers direct observations of x over a set M {, 1,,, N }, which is a more relevant sampling model in many applications In the single-snapshot MUSIC algorithm 17], the Prony s method 1] or the matrix pencil approach 15], one would need the full (N 1) consecutive samples to perform frequency identifications, while the Hankel matrix recovery approach can work with compressed measurements When prior information of the locations of the frequencies are available, one can use weighted atomic norm minimization to relax the separation conditions in successful signal recovery ] In 5], the authors consider super-resolution without separation using atomic norm minimization, but under the restriction that the coefficients are non-negative and for a particular set of atoms 1 Organizations of this paper The rest of the paper is organized as follows In Section, we present the problem model, and introduce the Hankel matrix recovery approach In Section 3, we investigate the worst-case performance guarantees of recovering spectrally sparse signals regardless of frequency separation, using the Hankel matrix recovery approach In Section 4, we study the Hankel matrix recovery s average-case performance guarantees of recovering spectrally sparse signals regardless of frequency separation In Section 5, we show that in traditional atomic norm minimization, for successful signal recovery, the underlying parameters must be well separated, if the atoms are changing continuously with respect to the continuously-valued parameters In Section 6, we introduce the concept of orthonormal atomic norm minimization, and show that it is possible that the atomic norm minimization is successful even though the original atoms can be arbitrarily close In Section 7, as a byproduct of this research, we provide one matrix-theoretic inequality of nuclear norm, and give its proof from the theory of compressed sensing Numerical results are given in Section 8 to validate our theoretical predictions We conclude our paper in Section 9 13 Notations We denote the set of complex numbers and real number as C and R respectively We use calligraphic uppercase letters to represent index sets, and use to represent a set s cardinality When we use an index set as the subscript of a vector, we refer to the part of the vector over the index set For example, x Ω is the part of vector x over the index set Ω We use C n1 n r to represent the set of matrices from C n1 n with rank r We denote the trace of a matrix X by Tr(X), and denote the real part and imaginary parts of a matrix X by Re(X) and Im(X) respectively The superscripts T and are used to represent transpose, and conjugate transpose of matrices or vectors The Frobenius norm, nuclear norm, and spectral norm of a matrix are denoted by F, and (or ) respectively The notation represents the spectral norm if its argument is a matrix, and represents the Euclidean norm if its argument is vector The probability of an event S is denoted 4

5 by P(S) Problem statement The underlying model for spectrally sparse signal is a mixture of complex exponentials x j = R c k e (ıπf k τ k )j, j {, 1,, N } (3) k=1 where ı = 1, f k, 1), c k C, and τ k are the normalized frequency, coefficients, and damping factor, respectively We observe x over a subset M {, 1,,, N } To estimate the continuous parameter f k s, in 3, 7], the authors proposed to use the total variation minimization or the atomic norm minimization to recover x or to recover the parameter z k, when z k = e ıπf k with f k taking continuous values from, 1) In these two papers, the author showed that the TV minimization or the atomic norm minimization can recover correctly the continuously-valued frequency f k s when there are no observation noises However, as shown in 3,6,7], in order for atomic norm minimization to recover spectrally sparse data or the associated frequencies correctly, it is necessary to require that adjacent frequencies be separated far enough from each other Let us define the minimum separation between frequencies as the following: Definition 1 (minimum separation, see 3]) For a frequency subset F, 1) with a group of points, the minimum separation is defined as smallest distance two arbitrary different elements in F, ie dist(f) = where d(f i, f l ) is the wrap around distance between two frequencies inf d(f i, f l ), (4) f i,f l F,f i f l As shown in 6], for complex exponentials with z k s taking values on the complex unit circle, it is required that their adjacent frequencies f k, 1] s be at least N 1 apart This separation condition is necessary, even if we observe the full (N 1) data samples, and even if the observations are noiseless Following the idea the matrix pencil method in 15] and Enhanced Matrix Completion (EMaC) in 6], we construct a Hankel matrix based on signal x More specifically, define the Hankel matrix H(x) C N N by H jk (x) = x j+k, j, k = 1,,, N (5) The expression (3) leads to a rank-r decomposition: 1 1 z 1 z R 1 H(x) = c z1 N 1 z N 1 R c R 1 z 1 z N z R z N 1 R Instead of reconstructing x directly, we reconstruct the rank-r Hankel matrix H, subject to the observation constraints Low rank matrix recovery has been widely studied in recovering a matrix from incomplete observations 4] It is well known that minimizing the nuclear norm can 5

6 lead to a solution of low-rank matrices We therefore use the nuclear norm minimization to recover the low-rank matrix H More specifically, for any given x C N 1, let H(x) C N N be the corresponding Hankel matrix We solve the following optimization problem: min x H(x), subject to A(x) = b, (6) where is the nuclear norm, and A and b are the linear measurements and measurement results When there is noise η contained in the observation, ie, b = Ax + η, we solve min H(x), x subject to Ax b δ, (7) where δ = η is the noise level It is known that the nuclear norm minimization (6) can be transformed into a semidefinite program 1 min x,q 1,Q (Tr(Q 1) + Tr(Q )) st b = Ax, ] Q 1 H(x) (8) H(x) Q which can be easily solved with existing convex program solvers such as interior point algorithms After successfully recovering all the time samples, we can use the single-snapshot MUSIC algorithm (as discussed in 17]) to identify the underlying frequencies f k For the recovered Hankel matrix H(x), let its SVD be ] Σ1 H(x) = U 1 U ] V 1 V ], U 1, V 1 C N R, (9) and we define the vector φ N (f) and imaging function J(f) as φ N (f) = (e ıπf, e ıπf1,, e ıπf(n 1) ) T, J(f) = φn (f) U φn (f), f, 1) (1) The single-snapshot MUSIC algorithm is given in Algorithm 1 Algorithm 1 The Single-Snapshot MUSIC algorithm 17] 1: require: solution x, parameter R and N : form Hankel matrix Z C N N ] Σ1 3: SVD Z = U 1 U ] V 1 V ] with U 1 C N R and Σ 1 C R R 4: compute imaging function J(f) = φn (f) U φn (f), f, 1) 5: get set ˆF={ f: f corresponds to R largest local maxima of J(f)} In 17], the author showed that the MUSIC algorithm can exactly recover all the frequencies by finding the local maximal of J(f) Namely, for undamped signal (3) with the set of frequencies F, if N R, then f F is equivalent to J(f) = 6

7 3 Worst-case performance guarantees of separation-free superresolution In this section, we provide the worst-case performance guarantees of Hankel matrix recovery for recovering the superposition of complex exponentials Namely, we provide the conditions under which the Hankel matrix recovery can uniformly recover the superposition of every possible R complex exponentials Our results show that the Hankel matrix recovery can achieve separationfree super-resolution, even if we consider the criterion of worst-case performance guarantees Later in Section 4, we further show that our derived worst-performance guarantees are tight, namely we can find examples where R is bigger than the predicted recoverable sparsity by our theory, and the nuclear norm minimization fails to recover the superposition of complex exponentials We let w i be the number of elements in the i-th anti-diagonal of matrix H, namely, { i, i = 1,,, N, w i = (11) N i, i = N + 1,, N 1 Here we call the (N 1) anti-diagonals of H(x) from the left top to the right bottom as the 1-st anti-diagonal,, and the (N 1)-th anti-diagonal We also define w min as w min = min w i+1 (1) i {,1,,,N }\M With the setup above, we give the following Theorem 1 concerning the worst-case performance guarantee of Hankel matrix recovery Theorem 1 Let us consider the signal model of the superposition of R complex exponentials (3), the observation set M {, 1,,, N } We further define w i of an N N Hankel matrix H(x) as in (11), and define w min as in (1) Then the nuclear norm minimization (6) will uniquely recover H(x), regardless of the (frequency) separation between the R continuously-valued (frequencies) parameters if R < Proof We can change (6) to the following optimization problem: w min (N 1 M ) (13) min B, subject to B = H(x), A(x) = b (14) B,x We can think of (14) as a nuclear norm minimization problem, where the null space of the linear operator (applied to (B,x)) in the constraints of (14) is given by (H(z), z) such that A(z) = From 1, ], we have the following lemma about the null space condition for successful signal recovery via nuclear norm minimization Lemma 1 1] Let X be any N N matrix of rank R, and we observe it through a linear mapping A(X ) = b Then the nuclear norm minimization (15) min X X, subject to A(X) = A(X ), (15) 7

8 can uniquely and correctly recover every matrix X with rank no more than R if and only if, for all nonzero Z N (A), Z R < Z, (16) where Z R is the sum of the largest R singular values of Z, and N (A) is the null space of A Using this lemma, we can see that (14) or (6) can correctly recover x as a superposition of R complex exponentials if H(z) R < H(z) holds true for every nonzero z from the null space of A Considering the sampling set M {, 1,,, N }, the null space of the sampling operator A is composed of (N 1) 1 vectors z s such that z i+1 = if i M For such a vector z, let us denote the element across the i-th anti-diagonal of H(z) as a i, and thus Q = H(z) is a Hankel matrix with its (i + 1) anti-diagonal element equal to if i M Let σ 1,, σ N be the N singular values of H(z) arranged in a descending order To verify the null space condition for nuclear norm minimization, we would like to find the largest R such that σ σ R < σ R σ N, (17) for every nonzero z in the null space of A Towards this goal, we first obtain a bound for its largest singular value (for a matrix Q, we use Q i,: to denote its i-th row vector): σ 1 = max u C N, u =1 Qu (18) = max N Q i,: u (19) u C N, u =1 i=1 N = max u C N, u =1 a j u j i+1 () i=1 j {j: i Ind(j),a j } N max a j u j i+1 (1) u C N, u =1 = max u C N, u =1 max u C N, u =1 i=1 j {j:i Ind(j),a j } j {j:i Ind(j),a j } N 1 a j1 j 1=1 max N 1 u C N, u =1 j 1=1 = j {,1,,N }\M i Ind(j 1) N a j i=1 j {j:i Ind(j),a j } j {j:i Ind(j),a j } j {j:i Ind(j),a j } u j i+1 u j i+1 () (3) a j1 (N 1 M)] (4) a j+1 (N 1 M) (5) 8

9 where () is obtained by looking at the rows (with Ind(j) being the set of indices of rows which intersect with the j-th anti-diagonal and {j : i Ind(j), a j } being the set of all non-zero anti-diagonals intersecting with the i-th row), (1) is due to the Cauchy-Schwarz inequality, and ((4) is because u = 1 and, for each i, u i appears for no more than (N 1 M) times in ) i Ind(j1) j u {j:i Ind(j),a j } j i+1 Furthermore, summing up the energy of the matrix Q, we have N σi = i=1 Thus for any integer k N, we have i {,1,,N }\M a i+1 w i+1 (6) N i=1 σ i k i=1 σ i N i=1 σ i kσ 1 N i=1 σ i kσ 1 i {,1,,N }\M a i+1 w i+1 k j {,1,,N }\M a j+1 (N 1 M) min i {,1,,N }\M w i+1 k(n 1 M) w min = k(n 1 M) (7) So if min i {,1,,N }\M w i+1 R(N 1 M) >, (8) then for ever nonzero vector z in the null space of A, and the corresponding Hankel matrix Q = H(z), N R σ i > σ i (9) i=1 w It follows that, for any superposition of R < min (N 1 M) complex exponentials, we can correctly recover x over the whole set {, 1,, N } using the incomplete sampling set M, regardless of the separations between different frequencies (or between the continuously-valued parameters z k s for damped complex exponentials) On the one hand, the performance guarantees given in Theorem 1 can be conservative: for average-case performance guarantees,even when the number of complex exponentials R is bigger than predicted by Theorem 1, the Hankel matrix recovery can still recover the missing data, even though the sinusoids can be very close to each other On the other hand, the bounds on recoverable sparsity level R given in Theorem 1 is tight for worst-case performance guarantees, as shown in the next section i=1 9

10 31 Tightness of recoverable sparsity guaranteed by Theorem 1 We will show the tightness of recoverable sparsity guaranteed by Theorem 1 in Section 4, which is built on the developments in Section 41 4 Average-case performance guarantees In this section, we study the performance guarantees for Hankel matrix recovery, when the phases of the coefficients of the R sinusoids are iid and uniformly distributed over, π) For averagecase performance guarantees, we show that we can recover the superposition of a larger number of complex exponentials than Theorem 1 offers 41 Average-case performance guarantees for orthogonal frequency atoms and the tightness of worst-case performance guarantees Theorem Let us consider the signal model of the superposition of R complex exponentials (3) with τ k = We assume that the R frequencies f 1, f,, and f R are such that the atoms (e ıπfi, e ıπfi1,, e ıπfi(n 1) ) T, 1 i R, are orthogonal to each other We let the observation set be M = {, 1,,, N } \ {N 1} We assume the phases of coefficients c 1,, c R in signal model (3) are independent and uniformly distributed over, π) Then the nuclear norm minimization (6) will successfully and uniquely recover H(x) and x, with probability approaching 1 as N if where c > is a constant R = N c log(n)n, (3) Proof We use the following Lemma about the condition for successful signal recovery through nuclear norm minimization This lemma is an extension of Lemma 13 in 1] The key difference is Lemma deals with complex-numbered matrices Moreover, Lemma gets rid of the iff claim for the null space condition in Lemma 13 of 1], because we find that the condition in Lemma 13 of 1] is a sufficient condition for the success of nuclear norm minimization, but not a necessary condition for the success of nuclear norm minimization We give the proof of Lemma, and prove the null space condition is only a sufficient condition for nuclear norm minimization in Appendix 11 and Appendix 1 respectively Lemma Let X be any M N matrix of rank R in C M N, and we observe it through a linear mapping A(X ) = b We also assume that X has a singular value decomposition (SVD) X = UΣV, where U C M R, V C N R, and Σ C R R is a diagonal matrix Then the nuclear norm minimization (31) min X X, subject to A(X) = A(X ), (31) correctly and uniquely recovers X if, for every nonzero element Q N (A), Tr(U QV ) + Ū Q V >, (3) where Ū and V are such that U Ū] and V V ] are unitary 1

11 We can change (6) to the following optimization problem min B, subject to B = H(x), A(x) = b (33) B,x We can think of (33) as a nuclear norm minimization, where he null space of the linear operator (applied to B and x) in the constraints of (14) is given by (H(z), z) such that A(z) = Then we can see that (33) or (6) can correctly recover x as a superposition of R complex exponentials if Tr(V U H(z)) < Ū H(z) V, (34) hold true for any z which is a nonzero vector from the null space of A, where H(x) = UΣV is the singular value decomposition (SVD) of H(x) with U C N R and V C N R, and Ū and V are such that U, Ū] and V, V ] are unitary Without loss of generality, let f k = s k N for 1 k R, where s k s are distinct integers between and N 1 Then e ıπ s 1 N e ıπ s N e ıπ s R N e ıθ1 U = 1 e ıπ s 1 N 1 e ıπ s N 1 e ıπ s R N 1 e ıθ N, e ıπ s 1 N (N 1) e ıπ s N (N 1) e ıπ s R N (N 1) e ıθ R (35) and V = 1 N e ıπ s 1 N e ıπ s N e ıπ s R N e ıπ s 1 N 1 e ıπ s N 1 e ıπ s R N 1 e ı π s 1 N (N 1) e ıπ s N (N 1) e ıπ s R N (N 1), (36) and Σ = N c 1 c c R, (37) where θ k s (1 k R) are iid random variables uniformly distributed over, π) When the observation set M = {, 1,,, N } \ {N 1}, any Q = H(z) with z from the null space of A takes the following form: Q = a 1 1, a C (38) 11

12 Thus 1 Tr(V U Q) = atr U 1 ( ) 1 R N 1 = a N = a N = a R N 1 i=1 t= R i=1 i=1 t= V e ıθi e ıπ s i (N 1 t) N e ıπ s i (t) N (39) e ıθi e ıπ s i (N 1) N (4) e ıθi e ıπ s i (N 1) N (41) Notice that random variable e ıθi e ıπ s i (N 1) N are mutually independent random variables uniformly distributed over the complex unit circle We will use the following lemma (see for example, 8]) to provide a concentration of measure result for the summation of e ıθi e ıπ s i (N 1) N : Lemma 3 (see for example, 8]) For a sequence of iid random matrix matrices M 1,, M K with dimension d 1 d and their sum M = K i=1 M i, if M i satisfies EM i ] =, M i L, i = 1,, K, (4) and M satisfies then { } K ν(m) = max EM i Mi ], K EMi M i ], (43) i=1 ( P( M t) (d 1 + d ) exp i=1 t / v(m) + Lt/3 ), t (44) Applying Lemma 3 to the 1 matrix composed of the real and imaginary parts of e ıθi e ıπ s i (N 1) N, with ν = max(r, R/) = R, L = 1, d 1 + d = 3, we have t / P ( Tr(V U Q) a t) 3e ν+lt/3 = 3e t / R+t/3, t > (45) We further notice that, Ū s ( V s) columns are normalized orthogonal frequency atoms (or their complex conjugates) with frequencies li N, with integer l i s different from the integers s k s of those R complex exponentials Thus (N R) / ( R+(N R)/3 U QV = a (N R) (46) Pick t = N R, and let = c log(n) with c being a positive constant Solving for ) R, we obtain R = N 3 log(n) c log (N) + c log(n)n, which implies (34) holds with probability approaching 1 if N This proves our claims 1

13 4 Tightness of recoverable sparsity guaranteed by Theorem We show that the recoverable sparsity R provided by Theorem is tight for M = {, 1,,, N }\{N 1} For such a sampling set, w min = N, and Theorem provides a bound R < wmin = N In fact, we can show that if R N, we can construct signal examples where the Hankel matrix recovery approach cannot recover the original signal x Consider the signal in Theorem We choose the coefficients c i s such that e ıθi e ıπ s i (N 1) N = 1, then and we also have Tr(V U Q) = a R i=1 e ıθi e ıπ s i (N 1) N = a R, (47) U QV = a (N R) (48) Thus Tr(V U Q) U QV for every Q = H(z) with z in the null space of the sampling operator Thus the Hankel matrix recovery cannot recover the ground truth x This shows that the prediction by Theorem is tight 43 Average-Case performance analysis with arbitrarily close frequency atoms In this section, we further show that the Hankel matrix recovery can successfully recover the superposition of complex exponentials with arbitrarily close frequency atoms In particular, we give average performance guarantees on recoverable sparsity R when frequency atoms are arbitrarily close, and show that the Hankel matrix recovery can deal with much larger recoverable sparsity R for average-case signals than predicted by Theorem 1 We consider a signal x composed of R complex exponentials Among them, R 1 orthogonal complex exponentials (without loss of generality, we assume that these R 1 frequencies take values l i N, where l i s are integers) The other complex exponential c cl e ıπf clj has a frequency arbitrarily close to one of the R 1 frequencies, ie, x j = R 1 k=1 c k e ıπf kj + c cl e ıπf clj, j {, 1,, N }, (49) where f cl is arbitrarily close to one of the first (R 1) frequencies For this setup with arbitrarily close atoms, we have the following average-case performance guarantee Theorem 3 Consider the signal model (49) with R complex exponentials, where the coefficients of these complex exponentials are iid uniformly distributed random over, π), and the first (R 1) of the complex exponentials are such that the corresponding atom vectors in (1) are mutually orthogonal, while the R-th exponential has frequency arbitrarily close to one of the first (R 1) frequencies (in wrap-around distance) Let c min = min{ c 1, c,, c R 1 }, and define d rel = c cl c min + 1 (5) 13

14 if If we have the observation set M = {, 1,,, N } \ {N 1}, and, for any constant c >, R = (N 8 Nd rel + c log(n)) + 1cN log(n) 48c Nd rel log(n) + 4c log (N), (51) then we can recover the true signal x via Hankel matrix recovery with high probability as N, regardless of frequency separations Remarks: 1 We can extend this result to cases where neither of the two arbitrarily close frequencies has on-the-grid frequencies, and the Hankel matrix recovery method can recover a similar sparsity; Under a similar number of complex exponentials, with high probability, the Hankel matrix recovery can correctly recover the signal x, uniformly over every possible phases of the two complex exponentials with arbitrarily close frequencies; 3 We can extend our results to other sampling sets, but for clarity of presentations, we choose M = {, 1,,, N } \ {N 1} Proof To prove this theorem, we consider a perturbed signal x The original signal x and the perturbed x satisfy x j = x j + c cl e ıπf clj c rm e ıπfrmj, j {, 1,, N }, (5) where f rm is a frequency such that its corresponding frequency atom is mutually orthogonal with the atoms corresponding to f 1,, and f R 1 We further define d min = min{ c 1,, c R 1, c rm } (53) Let us define X = H( x), X = H(x), and the error matrix E such that: X = X + E, (54) where E = c cl e ıπfcl c rm e ıπfrm c cl e ıπfcl (N 1) c rm e ıπfrm (N 1) (55) c cl e ıπfcl (N 1) c rm e ıπfrm (N 1) c cl e ıπfcl (N ) c rm e ıπfrm (N ) Both X and X are rank-r matrices For the error matrix E, we have E F = i,k {,,N 1} i,k {,,N 1} = (N ( c cl + c rm ) ) 1/ c cl e ıπfcl (i+k) c rm e ıπfrm (i+k) ( c cl + c rm ) 1/ = N ( c cl + c rm ) (56) 1/ 14

15 Following the derivations in Theorem, X has the following SVD ] Σ X = Ũ ΣṼ H, Ũ = Ũ 1 Ũ ] C N N, Ṽ = Ṽ 1 Ṽ ] C N N, Σ = 1 C N N (57) where Ũ 1 C N R, Σ 1 C R R and Ṽ 1 C N R are defined as in (35), (36) and (37) The polar decomposition of X using its SVD is given by X = P H, P = Ũ 1 Ṽ 1, H = Ṽ 1 Σ 1 Ṽ 1, (58) and the matrix P is called the unitary polar factor Similarly, for X, we have ] X = UΣV, U = U 1 U ] C N N, V = V 1 V ] C N N Σ1, Σ =, (59) and its polar decomposition for X, X = P H, P = U 1 V 1, H = V 1 Σ 1 V 1 (6) Suppose that we try to recover x through the following nuclear norm minimization: min H(x) x st A(x) = b (61) As in the proof of Theorem, we analyze the null space condition for successful recovery using Hankel matrix recovery Towards this, we first bound the unitary polar factor of X through the perturbation theory for polar decomposition Lemma 4 ( 16], Theorem 34) For matrices X C m n r and X C m n r with SVD defined as (57) and (59), let σ r and σ r be the smallest nonzero singular values of X and X, respectively, then ( P P σ r + σ r + where is any unitary invariant norm max{σ r, σ r } ) X X, (6) For our problem, we have m = n = N, r = R Let σ R be the R-th singular value of X From (36) and (53), an explicit form for σ R is Then Lemma 4 implies that σ R = Nd min (63) P P F 4 σ R X X F (64) From the null space condition for nuclear norm minimization (61), we can correctly recover x if Tr(V 1 U 1 Q) < Ū 1 Q V 1, (65) 15

16 for every nonzero Q = H(z) with z N (A), where Ū1 and V 1 are such that U 1 Ū 1 ] and V 1 V1 ] are unitary, ie, Ū1 = U and V 1 = V Since the observation set M = {, 1,,, N }\{N 1}, Q takes the form in (38) with a C, and Ū 1 Q V 1 = a (N R) (66) Let us define P = P P = Ũ1Ṽ 1 U 1 V 1 Then it follows from Lemma 4 and (56) that Tr(V 1 U 1 Q) = < Q, U 1 V 1 > < Q, P > + < Q, P > = Tr(Ṽ 1 Ũ 1 Q) + < Q, P > Tr(Ṽ 1 Ũ 1 Q) + ( Q F P F ) 1/ Thus if namely, Tr(Ṽ 1 Ũ 1 Q) + a N P F Tr(Ṽ 1 Ũ1 Q) + a 4 N E F σ R Tr(Ṽ 1 Ũ1 Q) + a 4 N N ( c cl + c rm ) Nd min Tr(Ṽ 1 Ũ 1 Q) + 4 a N c cl + c rm d min (67) Tr(Ṽ 1 Ũ 1 Q) + 4 a N c cl + c rm d min < a (N R), (68) Tr(Ṽ 1 Ũ1 Q) < a (N R 4 ) N d rel, (69) where d rel is defined as c cl + c rm d min, then solving (61) will correctly recover x The proof of Theorem leads to the concentration inequality (45): ( ) P Tr(Ṽ Ũ Q) a t 3e t / R+t/3, t > Taking t = N R 4 Nd rel, we have t / R + t/3 = (N R 4 Ndrel ) / R + (N R 4 Nd rel )/3 = 3 N + R + 16Nd rel NR 8N Nd rel + 8R Nd rel 3R + N R 4 Nd rel (7) To have successful signal recovery with high probability, we let 3 N + R + 16Nd rel NR 8N Ndrel + 8R Nd rel 3R + N R 4 = c log(n), (71) Nd rel 16

17 where c > is a constant So we have where R + sr + r =, (7) s = 8 Nd rel N c log(n), r = 16Nd rel 8 NNd rel cn log(n) + 4c Nd rel log(n) (73) Solving this leads to R = (N 8 Nd rel + c log(n)) + 1cN log(n) 48c Nd rel log(n) + 4c log (N) (74) Since we can freely choose the coefficient c rm, we choose c rm such that c rm = c min = min{ c 1, c,, c R 1 } Under such a choice for c rm, d min = c rm = c min, leading to d rel = = c cl +c min c min This concludes the proof of Theorem 3 c cl + c rm d min 5 Separation is always necessary for the success of atomic norm minimization In the previous sections, we have shown that the Hankel matrix recovery can recover the superposition of complex exponentials, even though the frequencies of the complex exponentials can be arbitrarily close In this section, we show that, broadly, the atomic norm minimization must obey a non-trivial resolution limit This is very different from the behavior of Hankel matrix recovery Our results in this section also greatly generalize the the necessity of resolution limit results in 6], to general continuously-parametered dictionary, beyond the dictionary of frequency atoms Moreover, our analysis is very different from the derivations in 6], which used the Markov-Bernstein type inequalities for finite-degree polynomials Theorem 4 Let us consider a dictionary with its atoms parameterized by a continuous-valued parameter τ C We also assume that each atom a(τ) belongs to C N, where N is a positive integer We assume that the set of all the atoms span a Q-dimensional subspace in C N Suppose the signal is the superposition of several atoms: x = R c k a(τ k ), (75) k=1 where the nonzero c k C for each k, and R is a positive integer representing the number of active atoms Consider any active atoms a(τ k1 ) and a(τ k ) With the other (R ) active atoms and their coefficients fixed (this includes the case R =, namely there are only two active atoms in total), if the atomic norm minimization can always identify the two active atoms, and correctly recover their coefficients for a(τ k1 ) and a(τ k ), then the two atoms a(τ k1 ) and a(τ k ) must be well separated such that a(τ k1 ) a(τ k ) max max σ min (A), (76) S Q A M S S where S is a positive integer, M S is the set of matrices with S columns and with each of these S columns corresponding to an atom, and σ min ( ) is the smallest singular value of a matrix 17

18 Proof Define the sign of a coefficient c k as sign(c k ) = c k c k (77) Then according to 3] and 6], a necessary condition for the atomic norm to identify the two active atoms, and correctly recover their coefficients is that, there exists a dual vector q C N such that { a(τ k1 ) q = sign(c k1 ), a(τ k ) q = sign(c k ) a(τ ) (78) q 1, τ / {τ k1, τ k } We pick c k1 and c k such that sign(c k ) sign(c j ) = Then sign(c k ) sign(c j ) = a(τ k1 ) q a(τ k ) q q a(τ k1 ) a(τ k ), (79) Thus q a(τ k1 ) a(τ k ) (8) Now we take a group of S atoms, denoted by a select,1,, and a select,s, and use them to form the S columns of a matrix A Then σ min (A) q A q = S a select,j q j=1 S, (81) where the last inequality comes form (78) It follows that S q σ min (A) (8) Define β as the maximal value of σmin(a) S among all the choices for A and S, namely, β = max max σ min (A) (83) S Q A M S S Combining (8) and (8), we have a(τ k1 ) a(τ k ) β, (84) proving this theorem 18

19 6 Orthonormal Atomic Norm Minimization: Hankel matrix recovery can be immune from atom separation requirements Our results in the previous sections naturally raise the following question: why can Hankel matrix recovery work without requiring separations between the underlying atoms while it is necessary for the atomic norm minimization to require separations between the underlying atoms? In this section, we introduce the concept of orthonormal atomic norm and its minimization, which explains the success of Hankel matrix recovery in recovering the superposition of complex exponentials regardless of the separations between frequency atoms Let us consider a vector w C N, where N is a positive integer We denote the set of atoms by AT OMSET, and assume that each atom a(τ) (parameterized by τ) belongs to C N Then the atomic norm of w AT OMIC is given by 3, 7] { s } s w AT OMIC = inf c k : w = c k a(τ k ) (85) s,τ k,c k k=1 We say the atomic norm x AT OMIC is an orthonormal atomic norm if, for every x, s w AT OMIC = c k, (86) where w = s k=1 c ka(τ k ), a(τ k ) = 1 for every k, and a(τ k ) s are mutually orthogonal to each other In the Hankel matrix recovery, the atom set AT OMSET is composed of all the rank-1 matrices in the form uv, where u and v are unit-norm vectors in C N Let us assume x C N 1 We can see the nuclear norm of a Hankel matrix is an orthonormal atomic norm of H(x): H(x) = k=1 k=1 R c k, (87) where H(x) = s k=1 c ku k vk, H(x) = UΣV is the singular value decomposition of H(x), u k is the k-th column of U, and v k is the k-th column of V This is because the matrices u k vk s are orthogonal to each other and each of these rank-1 matrices has unit energy Let us now further assume that x C N 1 is the superposition of R complex exponentials with R N, as defined in (3) Then H(x) is a rank-r matrix, and can be written as H(x) = R k=1 c ku k vk, where H(x) = UΣV is the singular value decomposition of H(x), u k is the k-th column of U, and v k is the k-th column of V Even though the original R frequency atoms a(τ k ) s for x can be arbitrarily close, we can always write H(x) as a superposition of R orthonormal atoms u k vk s from the singular value decomposition of H(x) Because the original R frequency atoms a(τ k ) s for x can be arbitrarily close, they can violate the necessary separation condition set forth in (76) However, for H(x), its composing atoms can be R orthonormal atoms u k vk s from the singular value decomposition of H(x) These atoms u k vk s are of unit energy, and are orthogonal to each other Thus these atoms are well separated and have the opportunity of not violating the necessary separation condition set forth in (76) This explains why the Hankel matrix recovery approach can break free from the separation condition which is required for traditional atomic norm minimizations k=1 19

20 7 A matrix-theoretic inequality of nuclear norms and its proof from the theory of compressed sensing In this section, we present a new matrix-theoretic inequality of nuclear norms, and give a proof of it from the theory of compressed sensing (using nuclear norm minimization) To the best of our knowledge, we have not seen the statement of this inequality of nuclear norms, or its proof elsewhere in the literature Theorem 5 Let A C m n, and t = min(m, n) Let σ 1, σ,, and σ t be the singular values of A arranged in descending order, namely Let k be any integer such that σ 1 σ σ t σ σ k < σ k σ t Then for any orthogonal projector P onto k-dimensional subspaces in C m, and any orthogonal projector Q onto k-dimensional subspaces in C n, we have In particular, P AQ (I P )A(I Q) (88) A 1:k,1:k A (k+1):m,(k+1):n, (89) where A 1:k,1:k is the submatrix of A with row indices between 1 and k and column indices between 1 and k, and A (k+1):m,(k+1):n is the submatrix of A with row indices between k + 1 and m and column indices between k + 1 and n Proof We first consider the case where all the elements of A are real numbers Without loss of generality, we consider ] I P = k k k (m k), (9) (m k) k (m k) (m k) and Q = ] I k k k (n k) (n k) k (n k) (n k) (91) Then P AQ = A 1,1 A 1,k A k,1 A k,k, (9)

21 and (I P )A(I Q) = A P A AQ + P AQ = A k+1,k+1 A k+1,n A m,k+1 A m,n (93) Thus P AQ = A 1:k,1:k, (I P )A(I Q) = A k+1:m,k+1:n (94) To prove this theorem, we first show that if σ σ k < σ k σ t, for any matrix X C m n with rank no more than k, for any positive number l >, we have The proof of (95) follows similar arguments as in 1] X + la > X (95) X + la (96) t σ i (X) σ i (la) (97) > i=1 k (σ i (X) σ i (la)) + i=1 k σ i (X) + ( i=1 t i=k+1 t i=k+1 σ i (la) σ i (X) σ i (la) (98) k σ i (la)) (99) i=1 k σ i (X) = X, (1) i=1 where, for the first inequality, we used the following lemma, which instead follows from Lemma 6 Lemma 5 Let G and H be two matrices of the same dimension Then t i=1 σ i(g) σ i (H) G H Lemma 6 ( 1, 14]) For arbitrary matrices X, Y, and Z = X Y C m n Let Let σ 1, σ,, and σ t (t = min{m, n}) be the singular values of A arranged in descending order, namely σ 1 σ σ t Let s i (X, Y ) be the distance between the i-th singular value of X and Y, namely, s i (X, Y ) = σ i (X) σ i (Y ), i = 1,,, k (11) Let s i] (X, Y ) be the i-th largest value of sequence s 1 (X, Y ), s (X, Y ),, s t (X, Y ), then k s i] (X, Y ) Z k, k = 1,,, t, (1) i=1 where Z k is defined as k i=1 σ i(z) 1

22 We next show that if A 1:k,1:k > A (k+1):m,(k+1):n, one can construct a matrix X with rank at most k such that X + la X for a certain l > We divide this construction into two cases: when A 1:k,1:k has rank equal to k, and when A 1:k,1:k has rank smaller than k When A 1:k,1:k has rank k, we denote its SVD as Then the SVD of and We now construct Let us denote A 1:k,1:k = U 1 ΣV 1 ] A1:k,1:k is given by ] ] A1:k,1:k U1 = Σ X = U1 U = V = then the subdifferential of at X is given by For any Z X, where ] V1 U1 V1 ] ], ] V1 ] (13) X = {Z : Z = U V + M, where M 1, M X =, XM = } (14) Let us partition the matrix A into four blocks: A11 A 1 A 1 A Z, A = I 1 + I, (15) I 1 = Tr(V U A), I = Tr(M A) (16) ], (17) where A 11 R k k, A 1 R k (n k), A 1 R (m k) (n k), and A 1 R (m k) (n k) Then we have ( I 1 = Tr(V U V1 A) = Tr ( V1 U1 = Tr ] A11 A 1 ] U 1 ] A11 A 1 ] ) A 1 A = Tr(V 1 U 1 A 11 ) = Tr(V 1 U 1 U 1 ΣV 1 ) = A 1 A ] ) k σ i (A 11 ) = A 11 (18) i=1

23 Since M X =, XM =, M 1 and X is a rank-k left top corner matrix, we must have ] M =, M where M is of dimension (m k) (n k), and M 1 Then I = Tr(M A) (19) ( ] ] ) A11 A = Tr 1 M (11) A 1 A = Tr (MA ) A, (111) where the last inequality is from the fact that the nuclear norm is the dual norm of the spectral norm, and M 1 Thus we have Z, A = I 1 + I A 11 + A <, (11) because we assume that A 1:k,1:k > A (k+1):m,(k+1):n Since Z, A < for every Z X, A is in the normal cone of the convex cone generated by at the point X By Theorem 37 in 3], we know that the normal cone of the convex cone generated by at the point X is the cone of descent directions for at the point of X Thus A is in the descent cone of at the point X This means that, when A 11 has rank equal to k, there exists a positive number l >, such that X + la X Let us suppose instead that A 11 has rank b < k We can write the SVD of A 11 as A 11 = ] ] Σ ] U 1 U 3 V1 V 3 (113) Then we construct X = U1 U 3 ] V1 V 3 By going through similar arguments as above (except for taking care of extra terms involving U 3 and V 3 ), one can obtain that Z, A < for every Z X In summary, no matter whether A 11 has rank equal to k or smaller to k, there always exists a positive number l >, such that X + la X However, this contradicts (95), and we conclude A 1:k,1:k A (k+1):m,(k+1):n, when A has real-numbered elements We further consider the case when A is a complex-numbered matrix We first derive the subdifferential of X for any complex-numbered matrix m n X For any α R m n and any β R m n, we define F : R m n R as ( ]) α F = (α + ıβ) β (114) ( ]) Re(X) To find the subdifferential of X, we need to derive F, for which we have the Im(X) following lemma, the proof of which is given in Appendix 13 (Note that in our earlier work ] ] 3

24 and the ( corresponding ]) preprint on arxivorg, we have already shown one direction of 116, namely Re(X) H F Now we show the two sets are indeed equal ) Im(X) Lemma 7 Suppose a rank-r matrix X C M N admits a singular value decomposition X = UΣV, where Σ R R R is a diagonal matrix, and U C M R and V C N R satisfy U U = V V = I Define S C M N as: S = { UV + W U W =, W V =, W 1, W C M N }, (115) and define Then we have F ( ]) Re(X) = X Im(X) { } α H α + ıβ S = F β] ( ]) Re(X) (116) Im(X) For a complex-numbered matrix A, we will similarly show that, if A 1:k,1:k > A (k+1):m,(k+1):n, one can construct a matrix X with rank at most k such that X + la X for a certain l > We divide this construction into two cases: when A 1:k,1:k has rank equal to k, and when A 1:k,1:k has rank smaller than k When A 1:k,1:k has rank k, we denote its SVD as Then the SVD of A1:k,1:k ] A 1:k,1:k = U 1 ΣV 1 is given by A1:k,1:k ] = U1 ] Σ V1 ] (117) We now construct X = U1 ] V1 ] We denote U = U1 ], and V = V1 ], then by Lemma 7, the subdifferential of at X is given by For any Z X, X = {Z : Z = U V + M], where M 1, M X =, XM = } (118) Z, A = I 1 + I, (119) 4

25 where I 1 = Re (Tr(V U A)), I = Re (Tr(M A)) (1) Similar to the real-numbered case, let us partition the matrix A into four blocks: ] A11 A 1, (11) A 1 A where A 11 C k k, A 1 C k (n k), A 1 C (m k) (n k), and A 1 C (m k) (n k) We still have ( ] Tr(V U V1 A) = Tr U 1 ] ] ) A11 A 1 A 1 A ( ] ] ) V1 U1 = Tr A11 A 1 A 1 A k = Tr(V 1 U1 A 11 ) = Tr(V 1 U1 U 1 ΣV1 ) = σ i (A 11 ) = A 11 (1) So I 1 = Re (Tr(V U A)) = A 11 Since M X =, XM =, M 1 and X is a rank-k left top corner matrix, we must have ] M =, M where M is of dimension (m k) (n k), and M 1 Then we have I = Re (Tr(M A)) (13) ( ( ] ] )) A11 A = Re Tr 1 M (14) A 1 A = Re (Tr (MA )) A, (15) where the last inequality is because the nuclear norm is the dual norm of the spectral norm, and M 1 Thus we have Z, A = I 1 + I A 11 + A <, (16) because we assume that A 1:k,1:k > A (k+1):m,(k+1):n Since Z, A < for every Z X, A is in the descent cone of at the point X This means that, when A 11 has rank equal to k, there exists a positive number l >, such that X + la X Let us suppose instead that A 11 has rank b < k We can write the SVD of A 11 as i=1 A 11 = U 1 U 3 ] Σ ] V1 V 3 ] (17) 5

26 Then we construct X = U1 U 3 ] V1 V 3 ] By going through similar arguments as above (except for taking care of extra terms involving U 3 and V 3 ), one can obtain that Z, A < for every Z X In summary, no matter whether complex-numbered A 11 has rank equal to k or smaller to k, there always exists a positive number l >, such that X +la X However, this contradicts (95), and we conclude A 1:k,1:k A (k+1):m,(k+1):n 8 Numerical results In this section, we perform numerical experiments to demonstrate the empirical performance of Hankel matrix recovery, and show its robustness to the separations between atoms We use superpositions of complex sinusoids as test signals But we remark that Hankel matrix recovery can also work for superpositions of complex exponentials We consider the non-uniform sampling of entries studied in 6, 7], where we uniformly randomly observe M entries (without replacement) of x from {, 1,, N } We also consider two signal (frequency) reconstruction algorithms: the Hankel nuclear norm minimization and the atomic norm minimization We fix N = 64, ie, the dimension of the ground truth signal x is 17 We conduct experiments under different M and R for different approaches For each approach with a fixed M and R, we test 1 trials, where each trial is performed as follows We first generate the true signal x = x, x 1,, x 16 ] T with x t = R k=1 c ke ıπfkt for t =, 1,, 16, where f k are frequencies drawn from the interval, 1] uniformly at random, and c k are complex coefficients satisfying the model c k = ( m k )e iπθ k with m k and θ k uniformly randomly drawn from the interval, 1] Let the reconstructed signal be represented by ˆx If ˆx x x 1 3, then we regard it as a successful reconstruction We also provide the simulation results under the Gaussian measurements of x as in ] We plot in Figure 1 the rate of successful reconstruction with respect to different M and R for different approaches The black and white region indicate a % and 1% of successful reconstruction respectively, and a grey between % and 1% From the figure, we see that the atomic norm minimization still suffers from non-negligible failure probability even if the number of measurements approach the full 17 samples The reason is that, since the underlying frequencies are randomly chosen, there is a sizable probability that some frequencies are close to each other When the frequencies are close to each other violating the atom separation condition, the atomic norm minimization can still fail even if we observe the full 17 samples By comparison, the Hankel matrix recovery approach experiences a sharper phase transition, and is robust to the frequency separations We also see that under both the Gaussian projection and the non-uniform sampling models, both the atomic norm minimization and the Hankel matrix recovery approach have similar performance We further demonstrate the robustness of the Hankel matrix recovery approach to the separations between frequency atoms, as we vary the separations between frequencies In our first set of experiments, we take N = 64, M = M = 65 ( 51% sampling rate) and R = 8, and consider noiseless measurements Again we generate the magnitude of the coefficients as p where p is uniformly randomly generated from, 1), and the realized magnitudes are 318, 5894, 6

ECE 8201: Low-dimensional Signal Models for High-dimensional Data Analysis

ECE 8201: Low-dimensional Signal Models for High-dimensional Data Analysis ECE 8201: Low-dimensional Signal Models for High-dimensional Data Analysis Lecture 7: Matrix completion Yuejie Chi The Ohio State University Page 1 Reference Guaranteed Minimum-Rank Solutions of Linear

More information

Combining Sparsity with Physically-Meaningful Constraints in Sparse Parameter Estimation

Combining Sparsity with Physically-Meaningful Constraints in Sparse Parameter Estimation UIUC CSL Mar. 24 Combining Sparsity with Physically-Meaningful Constraints in Sparse Parameter Estimation Yuejie Chi Department of ECE and BMI Ohio State University Joint work with Yuxin Chen (Stanford).

More information

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra. DS-GA 1002 Lecture notes 0 Fall 2016 Linear Algebra These notes provide a review of basic concepts in linear algebra. 1 Vector spaces You are no doubt familiar with vectors in R 2 or R 3, i.e. [ ] 1.1

More information

arxiv: v1 [cs.it] 26 Oct 2018

arxiv: v1 [cs.it] 26 Oct 2018 Outlier Detection using Generative Models with Theoretical Performance Guarantees arxiv:1810.11335v1 [cs.it] 6 Oct 018 Jirong Yi Anh Duc Le Tianming Wang Xiaodong Wu Weiyu Xu October 9, 018 Abstract This

More information

Conditions for Robust Principal Component Analysis

Conditions for Robust Principal Component Analysis Rose-Hulman Undergraduate Mathematics Journal Volume 12 Issue 2 Article 9 Conditions for Robust Principal Component Analysis Michael Hornstein Stanford University, mdhornstein@gmail.com Follow this and

More information

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 57, NO. 11, NOVEMBER On the Performance of Sparse Recovery

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 57, NO. 11, NOVEMBER On the Performance of Sparse Recovery IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 57, NO. 11, NOVEMBER 2011 7255 On the Performance of Sparse Recovery Via `p-minimization (0 p 1) Meng Wang, Student Member, IEEE, Weiyu Xu, and Ao Tang, Senior

More information

Random projections. 1 Introduction. 2 Dimensionality reduction. Lecture notes 5 February 29, 2016

Random projections. 1 Introduction. 2 Dimensionality reduction. Lecture notes 5 February 29, 2016 Lecture notes 5 February 9, 016 1 Introduction Random projections Random projections are a useful tool in the analysis and processing of high-dimensional data. We will analyze two applications that use

More information

The University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013.

The University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013. The University of Texas at Austin Department of Electrical and Computer Engineering EE381V: Large Scale Learning Spring 2013 Assignment Two Caramanis/Sanghavi Due: Tuesday, Feb. 19, 2013. Computational

More information

Going off the grid. Benjamin Recht Department of Computer Sciences University of Wisconsin-Madison

Going off the grid. Benjamin Recht Department of Computer Sciences University of Wisconsin-Madison Going off the grid Benjamin Recht Department of Computer Sciences University of Wisconsin-Madison Joint work with Badri Bhaskar Parikshit Shah Gonnguo Tang We live in a continuous world... But we work

More information

Solving Corrupted Quadratic Equations, Provably

Solving Corrupted Quadratic Equations, Provably Solving Corrupted Quadratic Equations, Provably Yuejie Chi London Workshop on Sparse Signal Processing September 206 Acknowledgement Joint work with Yuanxin Li (OSU), Huishuai Zhuang (Syracuse) and Yingbin

More information

SPARSE signal representations have gained popularity in recent

SPARSE signal representations have gained popularity in recent 6958 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 57, NO. 10, OCTOBER 2011 Blind Compressed Sensing Sivan Gleichman and Yonina C. Eldar, Senior Member, IEEE Abstract The fundamental principle underlying

More information

Robust Principal Component Analysis

Robust Principal Component Analysis ELE 538B: Mathematics of High-Dimensional Data Robust Principal Component Analysis Yuxin Chen Princeton University, Fall 2018 Disentangling sparse and low-rank matrices Suppose we are given a matrix M

More information

Chapter 3 Transformations

Chapter 3 Transformations Chapter 3 Transformations An Introduction to Optimization Spring, 2014 Wei-Ta Chu 1 Linear Transformations A function is called a linear transformation if 1. for every and 2. for every If we fix the bases

More information

ECE G: Special Topics in Signal Processing: Sparsity, Structure, and Inference

ECE G: Special Topics in Signal Processing: Sparsity, Structure, and Inference ECE 18-898G: Special Topics in Signal Processing: Sparsity, Structure, and Inference Low-rank matrix recovery via convex relaxations Yuejie Chi Department of Electrical and Computer Engineering Spring

More information

Distance Geometry-Matrix Completion

Distance Geometry-Matrix Completion Christos Konaxis Algs in Struct BioInfo 2010 Outline Outline Tertiary structure Incomplete data Tertiary structure Measure diffrence of matched sets Def. Root Mean Square Deviation RMSD = 1 n x i y i 2,

More information

Recovering overcomplete sparse representations from structured sensing

Recovering overcomplete sparse representations from structured sensing Recovering overcomplete sparse representations from structured sensing Deanna Needell Claremont McKenna College Feb. 2015 Support: Alfred P. Sloan Foundation and NSF CAREER #1348721. Joint work with Felix

More information

Elementary linear algebra

Elementary linear algebra Chapter 1 Elementary linear algebra 1.1 Vector spaces Vector spaces owe their importance to the fact that so many models arising in the solutions of specific problems turn out to be vector spaces. The

More information

The Singular Value Decomposition

The Singular Value Decomposition The Singular Value Decomposition We are interested in more than just sym+def matrices. But the eigenvalue decompositions discussed in the last section of notes will play a major role in solving general

More information

An algebraic perspective on integer sparse recovery

An algebraic perspective on integer sparse recovery An algebraic perspective on integer sparse recovery Lenny Fukshansky Claremont McKenna College (joint work with Deanna Needell and Benny Sudakov) Combinatorics Seminar USC October 31, 2018 From Wikipedia:

More information

MATH36001 Generalized Inverses and the SVD 2015

MATH36001 Generalized Inverses and the SVD 2015 MATH36001 Generalized Inverses and the SVD 201 1 Generalized Inverses of Matrices A matrix has an inverse only if it is square and nonsingular. However there are theoretical and practical applications

More information

Lecture Notes 5: Multiresolution Analysis

Lecture Notes 5: Multiresolution Analysis Optimization-based data analysis Fall 2017 Lecture Notes 5: Multiresolution Analysis 1 Frames A frame is a generalization of an orthonormal basis. The inner products between the vectors in a frame and

More information

THE SINGULAR VALUE DECOMPOSITION MARKUS GRASMAIR

THE SINGULAR VALUE DECOMPOSITION MARKUS GRASMAIR THE SINGULAR VALUE DECOMPOSITION MARKUS GRASMAIR 1. Definition Existence Theorem 1. Assume that A R m n. Then there exist orthogonal matrices U R m m V R n n, values σ 1 σ 2... σ p 0 with p = min{m, n},

More information

Lecture Notes 9: Constrained Optimization

Lecture Notes 9: Constrained Optimization Optimization-based data analysis Fall 017 Lecture Notes 9: Constrained Optimization 1 Compressed sensing 1.1 Underdetermined linear inverse problems Linear inverse problems model measurements of the form

More information

Strengthened Sobolev inequalities for a random subspace of functions

Strengthened Sobolev inequalities for a random subspace of functions Strengthened Sobolev inequalities for a random subspace of functions Rachel Ward University of Texas at Austin April 2013 2 Discrete Sobolev inequalities Proposition (Sobolev inequality for discrete images)

More information

ACCORDING to Shannon s sampling theorem, an analog

ACCORDING to Shannon s sampling theorem, an analog 554 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL 59, NO 2, FEBRUARY 2011 Segmented Compressed Sampling for Analog-to-Information Conversion: Method and Performance Analysis Omid Taheri, Student Member,

More information

Introduction to Compressed Sensing

Introduction to Compressed Sensing Introduction to Compressed Sensing Alejandro Parada, Gonzalo Arce University of Delaware August 25, 2016 Motivation: Classical Sampling 1 Motivation: Classical Sampling Issues Some applications Radar Spectral

More information

ROBUST BLIND SPIKES DECONVOLUTION. Yuejie Chi. Department of ECE and Department of BMI The Ohio State University, Columbus, Ohio 43210

ROBUST BLIND SPIKES DECONVOLUTION. Yuejie Chi. Department of ECE and Department of BMI The Ohio State University, Columbus, Ohio 43210 ROBUST BLIND SPIKES DECONVOLUTION Yuejie Chi Department of ECE and Department of BMI The Ohio State University, Columbus, Ohio 4 ABSTRACT Blind spikes deconvolution, or blind super-resolution, deals with

More information

Sparse Parameter Estimation: Compressed Sensing meets Matrix Pencil

Sparse Parameter Estimation: Compressed Sensing meets Matrix Pencil Sparse Parameter Estimation: Compressed Sensing meets Matrix Pencil Yuejie Chi Departments of ECE and BMI The Ohio State University Colorado School of Mines December 9, 24 Page Acknowledgement Joint work

More information

New Coherence and RIP Analysis for Weak. Orthogonal Matching Pursuit

New Coherence and RIP Analysis for Weak. Orthogonal Matching Pursuit New Coherence and RIP Analysis for Wea 1 Orthogonal Matching Pursuit Mingrui Yang, Member, IEEE, and Fran de Hoog arxiv:1405.3354v1 [cs.it] 14 May 2014 Abstract In this paper we define a new coherence

More information

Linear Algebra Massoud Malek

Linear Algebra Massoud Malek CSUEB Linear Algebra Massoud Malek Inner Product and Normed Space In all that follows, the n n identity matrix is denoted by I n, the n n zero matrix by Z n, and the zero vector by θ n An inner product

More information

Lecture notes: Applied linear algebra Part 1. Version 2

Lecture notes: Applied linear algebra Part 1. Version 2 Lecture notes: Applied linear algebra Part 1. Version 2 Michael Karow Berlin University of Technology karow@math.tu-berlin.de October 2, 2008 1 Notation, basic notions and facts 1.1 Subspaces, range and

More information

Lecture 2: Linear Algebra Review

Lecture 2: Linear Algebra Review EE 227A: Convex Optimization and Applications January 19 Lecture 2: Linear Algebra Review Lecturer: Mert Pilanci Reading assignment: Appendix C of BV. Sections 2-6 of the web textbook 1 2.1 Vectors 2.1.1

More information

Lecture Notes 1: Vector spaces

Lecture Notes 1: Vector spaces Optimization-based data analysis Fall 2017 Lecture Notes 1: Vector spaces In this chapter we review certain basic concepts of linear algebra, highlighting their application to signal processing. 1 Vector

More information

arxiv: v5 [math.na] 16 Nov 2017

arxiv: v5 [math.na] 16 Nov 2017 RANDOM PERTURBATION OF LOW RANK MATRICES: IMPROVING CLASSICAL BOUNDS arxiv:3.657v5 [math.na] 6 Nov 07 SEAN O ROURKE, VAN VU, AND KE WANG Abstract. Matrix perturbation inequalities, such as Weyl s theorem

More information

Rank minimization via the γ 2 norm

Rank minimization via the γ 2 norm Rank minimization via the γ 2 norm Troy Lee Columbia University Adi Shraibman Weizmann Institute Rank Minimization Problem Consider the following problem min X rank(x) A i, X b i for i = 1,..., k Arises

More information

An Introduction to Compressed Sensing

An Introduction to Compressed Sensing An Introduction to Compressed Sensing Mathukumalli Vidyasagar March 8, 216 2 Contents 1 Compressed Sensing: Problem Formulation 5 1.1 Mathematical Preliminaries..................................... 5 1.2

More information

Linear Algebra Review. Vectors

Linear Algebra Review. Vectors Linear Algebra Review 9/4/7 Linear Algebra Review By Tim K. Marks UCSD Borrows heavily from: Jana Kosecka http://cs.gmu.edu/~kosecka/cs682.html Virginia de Sa (UCSD) Cogsci 8F Linear Algebra review Vectors

More information

Recovery of Low Rank and Jointly Sparse. Matrices with Two Sampling Matrices

Recovery of Low Rank and Jointly Sparse. Matrices with Two Sampling Matrices Recovery of Low Rank and Jointly Sparse 1 Matrices with Two Sampling Matrices Sampurna Biswas, Hema K. Achanta, Mathews Jacob, Soura Dasgupta, and Raghuraman Mudumbai Abstract We provide a two-step approach

More information

5742 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 55, NO. 12, DECEMBER /$ IEEE

5742 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 55, NO. 12, DECEMBER /$ IEEE 5742 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 55, NO. 12, DECEMBER 2009 Uncertainty Relations for Shift-Invariant Analog Signals Yonina C. Eldar, Senior Member, IEEE Abstract The past several years

More information

Exact Low-rank Matrix Recovery via Nonconvex M p -Minimization

Exact Low-rank Matrix Recovery via Nonconvex M p -Minimization Exact Low-rank Matrix Recovery via Nonconvex M p -Minimization Lingchen Kong and Naihua Xiu Department of Applied Mathematics, Beijing Jiaotong University, Beijing, 100044, People s Republic of China E-mail:

More information

Compressed Sensing and Robust Recovery of Low Rank Matrices

Compressed Sensing and Robust Recovery of Low Rank Matrices Compressed Sensing and Robust Recovery of Low Rank Matrices M. Fazel, E. Candès, B. Recht, P. Parrilo Electrical Engineering, University of Washington Applied and Computational Mathematics Dept., Caltech

More information

Information-Theoretic Limits of Matrix Completion

Information-Theoretic Limits of Matrix Completion Information-Theoretic Limits of Matrix Completion Erwin Riegler, David Stotz, and Helmut Bölcskei Dept. IT & EE, ETH Zurich, Switzerland Email: {eriegler, dstotz, boelcskei}@nari.ee.ethz.ch Abstract We

More information

Solution-recovery in l 1 -norm for non-square linear systems: deterministic conditions and open questions

Solution-recovery in l 1 -norm for non-square linear systems: deterministic conditions and open questions Solution-recovery in l 1 -norm for non-square linear systems: deterministic conditions and open questions Yin Zhang Technical Report TR05-06 Department of Computational and Applied Mathematics Rice University,

More information

Characterization of half-radial matrices

Characterization of half-radial matrices Characterization of half-radial matrices Iveta Hnětynková, Petr Tichý Faculty of Mathematics and Physics, Charles University, Sokolovská 83, Prague 8, Czech Republic Abstract Numerical radius r(a) is the

More information

Singular Value Decomposition

Singular Value Decomposition Chapter 6 Singular Value Decomposition In Chapter 5, we derived a number of algorithms for computing the eigenvalues and eigenvectors of matrices A R n n. Having developed this machinery, we complete our

More information

Notes on singular value decomposition for Math 54. Recall that if A is a symmetric n n matrix, then A has real eigenvalues A = P DP 1 A = P DP T.

Notes on singular value decomposition for Math 54. Recall that if A is a symmetric n n matrix, then A has real eigenvalues A = P DP 1 A = P DP T. Notes on singular value decomposition for Math 54 Recall that if A is a symmetric n n matrix, then A has real eigenvalues λ 1,, λ n (possibly repeated), and R n has an orthonormal basis v 1,, v n, where

More information

ECE G: Special Topics in Signal Processing: Sparsity, Structure, and Inference

ECE G: Special Topics in Signal Processing: Sparsity, Structure, and Inference ECE 18-898G: Special Topics in Signal Processing: Sparsity, Structure, and Inference Low-rank matrix recovery via nonconvex optimization Yuejie Chi Department of Electrical and Computer Engineering Spring

More information

DS-GA 1002 Lecture notes 10 November 23, Linear models

DS-GA 1002 Lecture notes 10 November 23, Linear models DS-GA 2 Lecture notes November 23, 2 Linear functions Linear models A linear model encodes the assumption that two quantities are linearly related. Mathematically, this is characterized using linear functions.

More information

LINEAR ALGEBRA BOOT CAMP WEEK 1: THE BASICS

LINEAR ALGEBRA BOOT CAMP WEEK 1: THE BASICS LINEAR ALGEBRA BOOT CAMP WEEK 1: THE BASICS Unless otherwise stated, all vector spaces in this worksheet are finite dimensional and the scalar field F has characteristic zero. The following are facts (in

More information

Math Camp Lecture 4: Linear Algebra. Xiao Yu Wang. Aug 2010 MIT. Xiao Yu Wang (MIT) Math Camp /10 1 / 88

Math Camp Lecture 4: Linear Algebra. Xiao Yu Wang. Aug 2010 MIT. Xiao Yu Wang (MIT) Math Camp /10 1 / 88 Math Camp 2010 Lecture 4: Linear Algebra Xiao Yu Wang MIT Aug 2010 Xiao Yu Wang (MIT) Math Camp 2010 08/10 1 / 88 Linear Algebra Game Plan Vector Spaces Linear Transformations and Matrices Determinant

More information

arxiv: v1 [math.oc] 11 Jun 2009

arxiv: v1 [math.oc] 11 Jun 2009 RANK-SPARSITY INCOHERENCE FOR MATRIX DECOMPOSITION VENKAT CHANDRASEKARAN, SUJAY SANGHAVI, PABLO A. PARRILO, S. WILLSKY AND ALAN arxiv:0906.2220v1 [math.oc] 11 Jun 2009 Abstract. Suppose we are given a

More information

Maths for Signals and Systems Linear Algebra in Engineering

Maths for Signals and Systems Linear Algebra in Engineering Maths for Signals and Systems Linear Algebra in Engineering Lectures 13 15, Tuesday 8 th and Friday 11 th November 016 DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR) IN SIGNAL PROCESSING IMPERIAL COLLEGE

More information

Lecture 22: More On Compressed Sensing

Lecture 22: More On Compressed Sensing Lecture 22: More On Compressed Sensing Scribed by Eric Lee, Chengrun Yang, and Sebastian Ament Nov. 2, 207 Recap and Introduction Basis pursuit was the method of recovering the sparsest solution to an

More information

Throughout these notes we assume V, W are finite dimensional inner product spaces over C.

Throughout these notes we assume V, W are finite dimensional inner product spaces over C. Math 342 - Linear Algebra II Notes Throughout these notes we assume V, W are finite dimensional inner product spaces over C 1 Upper Triangular Representation Proposition: Let T L(V ) There exists an orthonormal

More information

Selected Examples of CONIC DUALITY AT WORK Robust Linear Optimization Synthesis of Linear Controllers Matrix Cube Theorem A.

Selected Examples of CONIC DUALITY AT WORK Robust Linear Optimization Synthesis of Linear Controllers Matrix Cube Theorem A. . Selected Examples of CONIC DUALITY AT WORK Robust Linear Optimization Synthesis of Linear Controllers Matrix Cube Theorem A. Nemirovski Arkadi.Nemirovski@isye.gatech.edu Linear Optimization Problem,

More information

INDUSTRIAL MATHEMATICS INSTITUTE. B.S. Kashin and V.N. Temlyakov. IMI Preprint Series. Department of Mathematics University of South Carolina

INDUSTRIAL MATHEMATICS INSTITUTE. B.S. Kashin and V.N. Temlyakov. IMI Preprint Series. Department of Mathematics University of South Carolina INDUSTRIAL MATHEMATICS INSTITUTE 2007:08 A remark on compressed sensing B.S. Kashin and V.N. Temlyakov IMI Preprint Series Department of Mathematics University of South Carolina A remark on compressed

More information

UNIT 6: The singular value decomposition.

UNIT 6: The singular value decomposition. UNIT 6: The singular value decomposition. María Barbero Liñán Universidad Carlos III de Madrid Bachelor in Statistics and Business Mathematical methods II 2011-2012 A square matrix is symmetric if A T

More information

MIT Final Exam Solutions, Spring 2017

MIT Final Exam Solutions, Spring 2017 MIT 8.6 Final Exam Solutions, Spring 7 Problem : For some real matrix A, the following vectors form a basis for its column space and null space: C(A) = span,, N(A) = span,,. (a) What is the size m n of

More information

Sparse Error Correction from Nonlinear Measurements with Applications in Bad Data Detection for Power Networks

Sparse Error Correction from Nonlinear Measurements with Applications in Bad Data Detection for Power Networks Sparse Error Correction from Nonlinear Measurements with Applications in Bad Data Detection for Power Networks 1 Weiyu Xu, Meng Wang, Jianfeng Cai and Ao Tang arxiv:1112.6234v2 [cs.it] 5 Jan 2013 Abstract

More information

Recovering any low-rank matrix, provably

Recovering any low-rank matrix, provably Recovering any low-rank matrix, provably Rachel Ward University of Texas at Austin October, 2014 Joint work with Yudong Chen (U.C. Berkeley), Srinadh Bhojanapalli and Sujay Sanghavi (U.T. Austin) Matrix

More information

15-780: LinearProgramming

15-780: LinearProgramming 15-780: LinearProgramming J. Zico Kolter February 1-3, 2016 1 Outline Introduction Some linear algebra review Linear programming Simplex algorithm Duality and dual simplex 2 Outline Introduction Some linear

More information

ELE 538B: Sparsity, Structure and Inference. Super-Resolution. Yuxin Chen Princeton University, Spring 2017

ELE 538B: Sparsity, Structure and Inference. Super-Resolution. Yuxin Chen Princeton University, Spring 2017 ELE 538B: Sparsity, Structure and Inference Super-Resolution Yuxin Chen Princeton University, Spring 2017 Outline Classical methods for parameter estimation Polynomial method: Prony s method Subspace method:

More information

Low-Rank Matrix Recovery

Low-Rank Matrix Recovery ELE 538B: Mathematics of High-Dimensional Data Low-Rank Matrix Recovery Yuxin Chen Princeton University, Fall 2018 Outline Motivation Problem setup Nuclear norm minimization RIP and low-rank matrix recovery

More information

14 Singular Value Decomposition

14 Singular Value Decomposition 14 Singular Value Decomposition For any high-dimensional data analysis, one s first thought should often be: can I use an SVD? The singular value decomposition is an invaluable analysis tool for dealing

More information

Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012

Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012 Instructions Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012 The exam consists of four problems, each having multiple parts. You should attempt to solve all four problems. 1.

More information

1 Regression with High Dimensional Data

1 Regression with High Dimensional Data 6.883 Learning with Combinatorial Structure ote for Lecture 11 Instructor: Prof. Stefanie Jegelka Scribe: Xuhong Zhang 1 Regression with High Dimensional Data Consider the following regression problem:

More information

A Fast Algorithm for Reconstruction of Spectrally Sparse Signals in Super-Resolution

A Fast Algorithm for Reconstruction of Spectrally Sparse Signals in Super-Resolution A Fast Algorithm for Reconstruction of Spectrally Sparse Signals in Super-Resolution Jian-Feng ai a, Suhui Liu a, and Weiyu Xu b a Department of Mathematics, University of Iowa, Iowa ity, IA 52242; b Department

More information

Supremum of simple stochastic processes

Supremum of simple stochastic processes Subspace embeddings Daniel Hsu COMS 4772 1 Supremum of simple stochastic processes 2 Recap: JL lemma JL lemma. For any ε (0, 1/2), point set S R d of cardinality 16 ln n S = n, and k N such that k, there

More information

Lecture: Introduction to Compressed Sensing Sparse Recovery Guarantees

Lecture: Introduction to Compressed Sensing Sparse Recovery Guarantees Lecture: Introduction to Compressed Sensing Sparse Recovery Guarantees http://bicmr.pku.edu.cn/~wenzw/bigdata2018.html Acknowledgement: this slides is based on Prof. Emmanuel Candes and Prof. Wotao Yin

More information

Fast and Robust Phase Retrieval

Fast and Robust Phase Retrieval Fast and Robust Phase Retrieval Aditya Viswanathan aditya@math.msu.edu CCAM Lunch Seminar Purdue University April 18 2014 0 / 27 Joint work with Yang Wang Mark Iwen Research supported in part by National

More information

Demixing Sines and Spikes: Robust Spectral Super-resolution in the Presence of Outliers

Demixing Sines and Spikes: Robust Spectral Super-resolution in the Presence of Outliers Demixing Sines and Spikes: Robust Spectral Super-resolution in the Presence of Outliers Carlos Fernandez-Granda, Gongguo Tang, Xiaodong Wang and Le Zheng September 6 Abstract We consider the problem of

More information

Designing Information Devices and Systems II

Designing Information Devices and Systems II EECS 16B Fall 2016 Designing Information Devices and Systems II Linear Algebra Notes Introduction In this set of notes, we will derive the linear least squares equation, study the properties symmetric

More information

Finding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October

Finding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October Finding normalized and modularity cuts by spectral clustering Marianna Bolla Institute of Mathematics Budapest University of Technology and Economics marib@math.bme.hu Ljubjana 2010, October Outline Find

More information

A fast randomized algorithm for overdetermined linear least-squares regression

A fast randomized algorithm for overdetermined linear least-squares regression A fast randomized algorithm for overdetermined linear least-squares regression Vladimir Rokhlin and Mark Tygert Technical Report YALEU/DCS/TR-1403 April 28, 2008 Abstract We introduce a randomized algorithm

More information

Analysis of Robust PCA via Local Incoherence

Analysis of Robust PCA via Local Incoherence Analysis of Robust PCA via Local Incoherence Huishuai Zhang Department of EECS Syracuse University Syracuse, NY 3244 hzhan23@syr.edu Yi Zhou Department of EECS Syracuse University Syracuse, NY 3244 yzhou35@syr.edu

More information

A strongly polynomial algorithm for linear systems having a binary solution

A strongly polynomial algorithm for linear systems having a binary solution A strongly polynomial algorithm for linear systems having a binary solution Sergei Chubanov Institute of Information Systems at the University of Siegen, Germany e-mail: sergei.chubanov@uni-siegen.de 7th

More information

Vandermonde Decomposition of Multilevel Toeplitz Matrices With Application to Multidimensional Super-Resolution

Vandermonde Decomposition of Multilevel Toeplitz Matrices With Application to Multidimensional Super-Resolution IEEE TRANSACTIONS ON INFORMATION TEORY, 016 1 Vandermonde Decomposition of Multilevel Toeplitz Matrices With Application to Multidimensional Super-Resolution Zai Yang, Member, IEEE, Lihua Xie, Fellow,

More information

8.1 Concentration inequality for Gaussian random matrix (cont d)

8.1 Concentration inequality for Gaussian random matrix (cont d) MGMT 69: Topics in High-dimensional Data Analysis Falll 26 Lecture 8: Spectral clustering and Laplacian matrices Lecturer: Jiaming Xu Scribe: Hyun-Ju Oh and Taotao He, October 4, 26 Outline Concentration

More information

Applied Numerical Linear Algebra. Lecture 8

Applied Numerical Linear Algebra. Lecture 8 Applied Numerical Linear Algebra. Lecture 8 1/ 45 Perturbation Theory for the Least Squares Problem When A is not square, we define its condition number with respect to the 2-norm to be k 2 (A) σ max (A)/σ

More information

The Singular Value Decomposition

The Singular Value Decomposition The Singular Value Decomposition An Important topic in NLA Radu Tiberiu Trîmbiţaş Babeş-Bolyai University February 23, 2009 Radu Tiberiu Trîmbiţaş ( Babeş-Bolyai University)The Singular Value Decomposition

More information

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for 1 Logistics Notes for 2016-08-29 General announcement: we are switching from weekly to bi-weekly homeworks (mostly because the course is much bigger than planned). If you want to do HW but are not formally

More information

Bindel, Fall 2009 Matrix Computations (CS 6210) Week 8: Friday, Oct 17

Bindel, Fall 2009 Matrix Computations (CS 6210) Week 8: Friday, Oct 17 Logistics Week 8: Friday, Oct 17 1. HW 3 errata: in Problem 1, I meant to say p i < i, not that p i is strictly ascending my apologies. You would want p i > i if you were simply forming the matrices and

More information

Constrained optimization

Constrained optimization Constrained optimization DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/obda_fall17/index.html Carlos Fernandez-Granda Compressed sensing Convex constrained

More information

Least squares regularized or constrained by L0: relationship between their global minimizers. Mila Nikolova

Least squares regularized or constrained by L0: relationship between their global minimizers. Mila Nikolova Least squares regularized or constrained by L0: relationship between their global minimizers Mila Nikolova CMLA, CNRS, ENS Cachan, Université Paris-Saclay, France nikolova@cmla.ens-cachan.fr SIAM Minisymposium

More information

Deep Learning Book Notes Chapter 2: Linear Algebra

Deep Learning Book Notes Chapter 2: Linear Algebra Deep Learning Book Notes Chapter 2: Linear Algebra Compiled By: Abhinaba Bala, Dakshit Agrawal, Mohit Jain Section 2.1: Scalars, Vectors, Matrices and Tensors Scalar Single Number Lowercase names in italic

More information

Balanced Truncation 1

Balanced Truncation 1 Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science 6.242, Fall 2004: MODEL REDUCTION Balanced Truncation This lecture introduces balanced truncation for LTI

More information

Sparse Sensing in Colocated MIMO Radar: A Matrix Completion Approach

Sparse Sensing in Colocated MIMO Radar: A Matrix Completion Approach Sparse Sensing in Colocated MIMO Radar: A Matrix Completion Approach Athina P. Petropulu Department of Electrical and Computer Engineering Rutgers, the State University of New Jersey Acknowledgments Shunqiao

More information

arxiv:submit/ [cs.it] 25 Jul 2012

arxiv:submit/ [cs.it] 25 Jul 2012 Compressed Sensing off the Grid Gongguo Tang, Badri Narayan Bhaskar, Parikshit Shah, and Benjamin Recht University of Wisconsin-Madison July 25, 202 arxiv:submit/05227 [cs.it] 25 Jul 202 Abstract We consider

More information

Sparse and Low Rank Recovery via Null Space Properties

Sparse and Low Rank Recovery via Null Space Properties Sparse and Low Rank Recovery via Null Space Properties Holger Rauhut Lehrstuhl C für Mathematik (Analysis), RWTH Aachen Convexity, probability and discrete structures, a geometric viewpoint Marne-la-Vallée,

More information

Computational Methods. Eigenvalues and Singular Values

Computational Methods. Eigenvalues and Singular Values Computational Methods Eigenvalues and Singular Values Manfred Huber 2010 1 Eigenvalues and Singular Values Eigenvalues and singular values describe important aspects of transformations and of data relations

More information

Math 102, Winter Final Exam Review. Chapter 1. Matrices and Gaussian Elimination

Math 102, Winter Final Exam Review. Chapter 1. Matrices and Gaussian Elimination Math 0, Winter 07 Final Exam Review Chapter. Matrices and Gaussian Elimination { x + x =,. Different forms of a system of linear equations. Example: The x + 4x = 4. [ ] [ ] [ ] vector form (or the column

More information

Duke University, Department of Electrical and Computer Engineering Optimization for Scientists and Engineers c Alex Bronstein, 2014

Duke University, Department of Electrical and Computer Engineering Optimization for Scientists and Engineers c Alex Bronstein, 2014 Duke University, Department of Electrical and Computer Engineering Optimization for Scientists and Engineers c Alex Bronstein, 2014 Linear Algebra A Brief Reminder Purpose. The purpose of this document

More information

COMPRESSED Sensing (CS) is a method to recover a

COMPRESSED Sensing (CS) is a method to recover a 1 Sample Complexity of Total Variation Minimization Sajad Daei, Farzan Haddadi, Arash Amini Abstract This work considers the use of Total Variation (TV) minimization in the recovery of a given gradient

More information

Self-Calibration and Biconvex Compressive Sensing

Self-Calibration and Biconvex Compressive Sensing Self-Calibration and Biconvex Compressive Sensing Shuyang Ling Department of Mathematics, UC Davis July 12, 2017 Shuyang Ling (UC Davis) SIAM Annual Meeting, 2017, Pittsburgh July 12, 2017 1 / 22 Acknowledgements

More information

Sparsity in system identification and data-driven control

Sparsity in system identification and data-driven control 1 / 40 Sparsity in system identification and data-driven control Ivan Markovsky This signal is not sparse in the "time domain" 2 / 40 But it is sparse in the "frequency domain" (it is weighted sum of six

More information

Problem Set 1. Homeworks will graded based on content and clarity. Please show your work clearly for full credit.

Problem Set 1. Homeworks will graded based on content and clarity. Please show your work clearly for full credit. CSE 151: Introduction to Machine Learning Winter 2017 Problem Set 1 Instructor: Kamalika Chaudhuri Due on: Jan 28 Instructions This is a 40 point homework Homeworks will graded based on content and clarity

More information

18.06 Problem Set 10 - Solutions Due Thursday, 29 November 2007 at 4 pm in

18.06 Problem Set 10 - Solutions Due Thursday, 29 November 2007 at 4 pm in 86 Problem Set - Solutions Due Thursday, 29 November 27 at 4 pm in 2-6 Problem : (5=5+5+5) Take any matrix A of the form A = B H CB, where B has full column rank and C is Hermitian and positive-definite

More information

Reconstruction from Anisotropic Random Measurements

Reconstruction from Anisotropic Random Measurements Reconstruction from Anisotropic Random Measurements Mark Rudelson and Shuheng Zhou The University of Michigan, Ann Arbor Coding, Complexity, and Sparsity Workshop, 013 Ann Arbor, Michigan August 7, 013

More information

List of Errata for the Book A Mathematical Introduction to Compressive Sensing

List of Errata for the Book A Mathematical Introduction to Compressive Sensing List of Errata for the Book A Mathematical Introduction to Compressive Sensing Simon Foucart and Holger Rauhut This list was last updated on September 22, 2018. If you see further errors, please send us

More information

Sparse Solutions of an Undetermined Linear System

Sparse Solutions of an Undetermined Linear System 1 Sparse Solutions of an Undetermined Linear System Maddullah Almerdasy New York University Tandon School of Engineering arxiv:1702.07096v1 [math.oc] 23 Feb 2017 Abstract This work proposes a research

More information