arxiv: v1 [cs.it] 4 Nov 2017

Size: px

Start display at page:

Download "arxiv: v1 [cs.it] 4 Nov 2017"

Ashlee Banks
5 years ago
Views:

1 Separation-Free Super-Resolution from Compressed Measurements is Possible: an Orthonormal Atomic Norm Minimization Approach Weiyu Xu Jirong Yi Soura Dasgupta Jian-Feng Cai arxiv: v1 csit] 4 Nov 17 Mathews Jacob November 7, 17 Abstract Myung Cho We consider the problem of recovering the superposition of R distinct complex exponential functions from compressed non-uniform time-domain samples Total Variation (TV) minimization or atomic norm minimization was proposed in the literature to recover the R frequencies or the missing data However, it is known that in order for TV minimization and atomic norm minimization to recover the missing data or the frequencies, the underlying R frequencies are required to be well-separated, even when the measurements are noiseless This paper shows that the Hankel matrix recovery approach can super-resolve the R complex exponentials and their frequencies from compressed non-uniform measurements, regardless of how close their frequencies are to each other We propose a new concept of orthonormal atomic norm minimization (OANM), and demonstrate that the success of Hankel matrix recovery in separation-free superresolution comes from the fact that the nuclear norm of a Hankel matrix is an orthonormal atomic norm More specifically, we show that, in traditional atomic norm minimization, the underlying parameter values must be well separated to achieve successful signal recovery, if the atoms are changing continuously with respect to the continuously-valued parameter In contrast, for OANM, it is possible the OANM is successful even though the original atoms can be arbitrarily close As a byproduct of this research, we provide one matrix-theoretic inequality of nuclear norm, and give its proof from the theory of compressed sensing The first two authors contributed equally to this work Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 54 weiyu-xu@uiowaedu Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 54 Yi is co-first author Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 54 Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 1

2 1 Introduction In super-resolution, we are interested in recovering the high-end spectral information of signals from observations of it low-end spectral components 3] In one setting of super-resolution problems, one aims to recover a superposition of complex exponential functions from time-domain samples In fact, many problems arising in science and engineering involve high-dimensional signals that can be modeled or approximated by a superposition of a few complex exponential functions In particular, if we choose the exponential functions to be complex sinusoids, this superposition of complex exponentials models signals in acceleration of medical imaging 18], analog-to-digital conversion 9], and signals in array signal processing 4] Accelerated NMR (Nuclear magnetic resonance) spectroscopy, which is a prerequisite for studying short-lived molecular systems and monitoring chemical reactions in real time, is another application where signals can be modeled or approximated by a superposition of complex exponential functions How to recover the superposition of complex exponential functions or parameters of these complex exponential functions is of prominent importance in these applications In this paper, we consider how to recover those superposition of complex exponential from linear measurements More specifically, let x C N 1 be a vector satisfying x j = R k=1 c k z j k, j =, 1,, N, (1) where z k C, k = 1,, R, are some unknown complex numbers with R being a positive integer In other words, x is a superposition of R complex exponential functions We assume R N 1 When z k = 1, k = 1,, R, x is a superposition of complex sinusoid When z k = e τ k e πıf k, k = 1,, R, ı = 1, x can model the signal in NMR spectroscopy Since R N 1 and often R N 1, the degree of freedom to determine x is much less than the ambient dimension N 1 Therefore, it is possible to recover x from its under sampling 5,11] In particular, we consider recovering x from its linear measurements b = A(x), () where A is a linear mapping to C M, M < N 1 After x is recovered, we can use the singlesnapshot MUSIC or the Prony s method to recover the parameter z k s The problem on recovering x from its linear measurements () can be solved using Compressed Sensing (CS) 5], by discretizing the dictionary of basis vectors into grid points corresponding to discrete values of z k When the parameters f k s in signals from spectral compressed sensing or (f k, τ k ) s from signals in accelerated NMR spectroscopy indeed fall on the grid, CS is a powerful tool to recover those signals even when the number of samples is far below its ambient dimension (R N 1) 5, 11] Nevertheless, the parameters in our problem setting often take continuous values, leading to a continuous dictionary, and may not exactly fall on a grid The basis mismatch problem between the continuously-valued parameters and the grid-valued parameters degenerates the performance of conventional compressed sensing 8] In two seminal papers 3, 7], the authors proposed to use the total variation minimization or the atomic norm minimization to recover x or to recover the parameter z k, when z k = e ıπf k with f k taking continuous values from, 1) In these two papers, the author showed that the TV minimization or the atomic norm minimization can recover correctly the continuously-valued frequency f k s when there are no observation noises However, as shown in 3, 6, 7], in order for

3 the TV minimization or the atomic norm minimization to recover spectrally sparse data or the associated frequencies correctly, it is necessary to require that adjacent frequencies be separated far enough from each other For example, for complex exponentials with z k s taking values on the complex unit circle, it is required that their adjacent frequencies f k, 1] s be at least N 1 apart This separation condition is necessary, even if we observe the full (N 1) data samples, and even if the observations are noiseless This raises a natural question, Can we super-resolve the superposition of complex exponentials with continuously-valued parameter z k, without requiring frequency separations, from compressed measurements? In this paper, we answer this question in positive More specifically, we show that a Hankel matrix recovery approach using nuclear norm minimization can super-resolve the superposition of complex exponentials with continuously-valued parameter z k, without requiring frequency separations, from compressed measurements This separation-free super-resolution result holds even when we only compressively observe x over a subset M {,, N } In this paper, we give the worst-case and average-case performance guarantees of Hankel matrix recovery in recovering the superposition of complex exponentials In establishing the worst-case performance guarantees, we establish the conditions under which the Hankel matrix recovery can recover the underlying complex exponentials, no matter what values the coefficients c k s of the complex exponentials take For the average-case performance guarantee, we assume that the phases of the coefficients c k s are uniformly distributed over, π) For both the worst-case and average-case performance guarantees, we establish that Hankel matrix recovery can super-resolve complex exponentials with continuously-valued parameters z k s, no matter how close two adjacent frequencies or parameters z k s are to each other We further introduce a new concept of orthonormal atomic norm minimization (OANM), and discover that the success of Hankel matrix recovery in separation-free super-resolution comes from the fact that the nuclear norm of the Hankel matrix is an orthonormal atomic norm In particular, we show that, in traditional atomic norm minimization, for successful signal recovery, the underlying parameters must be well separated, if the atoms are changing continuously with respect to the continuously-valued parameters; however, it is possible the OANM is successful even though the original atoms can be arbitrarily close As a byproduct of this research, we discover one interesting matrix-theoretic inequality of nuclear norm, and give its proof from the theory of compressed sensing 11 Comparisons with related works on Hankel matrix recovery and atomic norm minimization Low-rank Hankel matrix recovery approaches were used for recovering parsimonious models in system identifications, control, and signal processing In 19], Markovsky considered low-rank approximations for Hankel structured matrices with applications in signal processing, system identifications and control In 1, 13], Fazel et al introduced low-rank Hankel matrix recovery via nuclear norm minimization, motivated by applications including realizations and identification of linear timeinvariant systems, inferring shapes (points on the complex plane) from moments estimation (which is related to super-resolution with z k from the complex plane), and moment matrix rank minimization for polynomial optimization In 13], Fazel et al further designed optimization algorithms to solve the nuclear norm minimization problem for low-rank Hankel matrix recovery In 6], Chen and Chi proposed to use multi-fold Hankel matrix completion for spectral compressed sensing, studied the performance guarantees of spectral compressed sensing via structured multi-fold Hankel matrix completion, and derived performance guarantees of structured matrix completion However, the 3

4 results in 6] require that the Dirichlet kernel associated with underlying frequencies satisfies certain incoherence conditions, and these conditions require the underlying frequencies to be well separated from each other In 9, 3], the authors derived performance guarantees for Hankel matrix completion in system identifications However, the performance guarantees in 9, 3] require a very specific sampling patterns of fully sampling the upper-triangular part of the Hankel matrix Moreover, the performance guarantees in 9, 3] require that the parameters z k s be very small (or smaller than 1) in magnitude In our earlier work ], we established performance guarantees of Hankel matrix recovery for spectral compressed sensing under Gaussian measurements of x, and by comparison, this paper considers direct observations of x over a set M {, 1,,, N }, which is a more relevant sampling model in many applications In the single-snapshot MUSIC algorithm 17], the Prony s method 1] or the matrix pencil approach 15], one would need the full (N 1) consecutive samples to perform frequency identifications, while the Hankel matrix recovery approach can work with compressed measurements When prior information of the locations of the frequencies are available, one can use weighted atomic norm minimization to relax the separation conditions in successful signal recovery ] In 5], the authors consider super-resolution without separation using atomic norm minimization, but under the restriction that the coefficients are non-negative and for a particular set of atoms 1 Organizations of this paper The rest of the paper is organized as follows In Section, we present the problem model, and introduce the Hankel matrix recovery approach In Section 3, we investigate the worst-case performance guarantees of recovering spectrally sparse signals regardless of frequency separation, using the Hankel matrix recovery approach In Section 4, we study the Hankel matrix recovery s average-case performance guarantees of recovering spectrally sparse signals regardless of frequency separation In Section 5, we show that in traditional atomic norm minimization, for successful signal recovery, the underlying parameters must be well separated, if the atoms are changing continuously with respect to the continuously-valued parameters In Section 6, we introduce the concept of orthonormal atomic norm minimization, and show that it is possible that the atomic norm minimization is successful even though the original atoms can be arbitrarily close In Section 7, as a byproduct of this research, we provide one matrix-theoretic inequality of nuclear norm, and give its proof from the theory of compressed sensing Numerical results are given in Section 8 to validate our theoretical predictions We conclude our paper in Section 9 13 Notations We denote the set of complex numbers and real number as C and R respectively We use calligraphic uppercase letters to represent index sets, and use to represent a set s cardinality When we use an index set as the subscript of a vector, we refer to the part of the vector over the index set For example, x Ω is the part of vector x over the index set Ω We use C n1 n r to represent the set of matrices from C n1 n with rank r We denote the trace of a matrix X by Tr(X), and denote the real part and imaginary parts of a matrix X by Re(X) and Im(X) respectively The superscripts T and are used to represent transpose, and conjugate transpose of matrices or vectors The Frobenius norm, nuclear norm, and spectral norm of a matrix are denoted by F, and (or ) respectively The notation represents the spectral norm if its argument is a matrix, and represents the Euclidean norm if its argument is vector The probability of an event S is denoted 4

5 by P(S) Problem statement The underlying model for spectrally sparse signal is a mixture of complex exponentials x j = R c k e (ıπf k τ k )j, j {, 1,, N } (3) k=1 where ı = 1, f k, 1), c k C, and τ k are the normalized frequency, coefficients, and damping factor, respectively We observe x over a subset M {, 1,,, N } To estimate the continuous parameter f k s, in 3, 7], the authors proposed to use the total variation minimization or the atomic norm minimization to recover x or to recover the parameter z k, when z k = e ıπf k with f k taking continuous values from, 1) In these two papers, the author showed that the TV minimization or the atomic norm minimization can recover correctly the continuously-valued frequency f k s when there are no observation noises However, as shown in 3,6,7], in order for atomic norm minimization to recover spectrally sparse data or the associated frequencies correctly, it is necessary to require that adjacent frequencies be separated far enough from each other Let us define the minimum separation between frequencies as the following: Definition 1 (minimum separation, see 3]) For a frequency subset F, 1) with a group of points, the minimum separation is defined as smallest distance two arbitrary different elements in F, ie dist(f) = where d(f i, f l ) is the wrap around distance between two frequencies inf d(f i, f l ), (4) f i,f l F,f i f l As shown in 6], for complex exponentials with z k s taking values on the complex unit circle, it is required that their adjacent frequencies f k, 1] s be at least N 1 apart This separation condition is necessary, even if we observe the full (N 1) data samples, and even if the observations are noiseless Following the idea the matrix pencil method in 15] and Enhanced Matrix Completion (EMaC) in 6], we construct a Hankel matrix based on signal x More specifically, define the Hankel matrix H(x) C N N by H jk (x) = x j+k, j, k = 1,,, N (5) The expression (3) leads to a rank-r decomposition: 1 1 z 1 z R 1 H(x) = c z1 N 1 z N 1 R c R 1 z 1 z N z R z N 1 R Instead of reconstructing x directly, we reconstruct the rank-r Hankel matrix H, subject to the observation constraints Low rank matrix recovery has been widely studied in recovering a matrix from incomplete observations 4] It is well known that minimizing the nuclear norm can 5

6 lead to a solution of low-rank matrices We therefore use the nuclear norm minimization to recover the low-rank matrix H More specifically, for any given x C N 1, let H(x) C N N be the corresponding Hankel matrix We solve the following optimization problem: min x H(x), subject to A(x) = b, (6) where is the nuclear norm, and A and b are the linear measurements and measurement results When there is noise η contained in the observation, ie, b = Ax + η, we solve min H(x), x subject to Ax b δ, (7) where δ = η is the noise level It is known that the nuclear norm minimization (6) can be transformed into a semidefinite program 1 min x,q 1,Q (Tr(Q 1) + Tr(Q )) st b = Ax, ] Q 1 H(x) (8) H(x) Q which can be easily solved with existing convex program solvers such as interior point algorithms After successfully recovering all the time samples, we can use the single-snapshot MUSIC algorithm (as discussed in 17]) to identify the underlying frequencies f k For the recovered Hankel matrix H(x), let its SVD be ] Σ1 H(x) = U 1 U ] V 1 V ], U 1, V 1 C N R, (9) and we define the vector φ N (f) and imaging function J(f) as φ N (f) = (e ıπf, e ıπf1,, e ıπf(n 1) ) T, J(f) = φn (f) U φn (f), f, 1) (1) The single-snapshot MUSIC algorithm is given in Algorithm 1 Algorithm 1 The Single-Snapshot MUSIC algorithm 17] 1: require: solution x, parameter R and N : form Hankel matrix Z C N N ] Σ1 3: SVD Z = U 1 U ] V 1 V ] with U 1 C N R and Σ 1 C R R 4: compute imaging function J(f) = φn (f) U φn (f), f, 1) 5: get set ˆF={ f: f corresponds to R largest local maxima of J(f)} In 17], the author showed that the MUSIC algorithm can exactly recover all the frequencies by finding the local maximal of J(f) Namely, for undamped signal (3) with the set of frequencies F, if N R, then f F is equivalent to J(f) = 6

7 3 Worst-case performance guarantees of separation-free superresolution In this section, we provide the worst-case performance guarantees of Hankel matrix recovery for recovering the superposition of complex exponentials Namely, we provide the conditions under which the Hankel matrix recovery can uniformly recover the superposition of every possible R complex exponentials Our results show that the Hankel matrix recovery can achieve separationfree super-resolution, even if we consider the criterion of worst-case performance guarantees Later in Section 4, we further show that our derived worst-performance guarantees are tight, namely we can find examples where R is bigger than the predicted recoverable sparsity by our theory, and the nuclear norm minimization fails to recover the superposition of complex exponentials We let w i be the number of elements in the i-th anti-diagonal of matrix H, namely, { i, i = 1,,, N, w i = (11) N i, i = N + 1,, N 1 Here we call the (N 1) anti-diagonals of H(x) from the left top to the right bottom as the 1-st anti-diagonal,, and the (N 1)-th anti-diagonal We also define w min as w min = min w i+1 (1) i {,1,,,N }\M With the setup above, we give the following Theorem 1 concerning the worst-case performance guarantee of Hankel matrix recovery Theorem 1 Let us consider the signal model of the superposition of R complex exponentials (3), the observation set M {, 1,,, N } We further define w i of an N N Hankel matrix H(x) as in (11), and define w min as in (1) Then the nuclear norm minimization (6) will uniquely recover H(x), regardless of the (frequency) separation between the R continuously-valued (frequencies) parameters if R < Proof We can change (6) to the following optimization problem: w min (N 1 M ) (13) min B, subject to B = H(x), A(x) = b (14) B,x We can think of (14) as a nuclear norm minimization problem, where the null space of the linear operator (applied to (B,x)) in the constraints of (14) is given by (H(z), z) such that A(z) = From 1, ], we have the following lemma about the null space condition for successful signal recovery via nuclear norm minimization Lemma 1 1] Let X be any N N matrix of rank R, and we observe it through a linear mapping A(X ) = b Then the nuclear norm minimization (15) min X X, subject to A(X) = A(X ), (15) 7

8 can uniquely and correctly recover every matrix X with rank no more than R if and only if, for all nonzero Z N (A), Z R < Z, (16) where Z R is the sum of the largest R singular values of Z, and N (A) is the null space of A Using this lemma, we can see that (14) or (6) can correctly recover x as a superposition of R complex exponentials if H(z) R < H(z) holds true for every nonzero z from the null space of A Considering the sampling set M {, 1,,, N }, the null space of the sampling operator A is composed of (N 1) 1 vectors z s such that z i+1 = if i M For such a vector z, let us denote the element across the i-th anti-diagonal of H(z) as a i, and thus Q = H(z) is a Hankel matrix with its (i + 1) anti-diagonal element equal to if i M Let σ 1,, σ N be the N singular values of H(z) arranged in a descending order To verify the null space condition for nuclear norm minimization, we would like to find the largest R such that σ σ R < σ R σ N, (17) for every nonzero z in the null space of A Towards this goal, we first obtain a bound for its largest singular value (for a matrix Q, we use Q i,: to denote its i-th row vector): σ 1 = max u C N, u =1 Qu (18) = max N Q i,: u (19) u C N, u =1 i=1 N = max u C N, u =1 a j u j i+1 () i=1 j {j: i Ind(j),a j } N max a j u j i+1 (1) u C N, u =1 = max u C N, u =1 max u C N, u =1 i=1 j {j:i Ind(j),a j } j {j:i Ind(j),a j } N 1 a j1 j 1=1 max N 1 u C N, u =1 j 1=1 = j {,1,,N }\M i Ind(j 1) N a j i=1 j {j:i Ind(j),a j } j {j:i Ind(j),a j } j {j:i Ind(j),a j } u j i+1 u j i+1 () (3) a j1 (N 1 M)] (4) a j+1 (N 1 M) (5) 8

9 where () is obtained by looking at the rows (with Ind(j) being the set of indices of rows which intersect with the j-th anti-diagonal and {j : i Ind(j), a j } being the set of all non-zero anti-diagonals intersecting with the i-th row), (1) is due to the Cauchy-Schwarz inequality, and ((4) is because u = 1 and, for each i, u i appears for no more than (N 1 M) times in ) i Ind(j1) j u {j:i Ind(j),a j } j i+1 Furthermore, summing up the energy of the matrix Q, we have N σi = i=1 Thus for any integer k N, we have i {,1,,N }\M a i+1 w i+1 (6) N i=1 σ i k i=1 σ i N i=1 σ i kσ 1 N i=1 σ i kσ 1 i {,1,,N }\M a i+1 w i+1 k j {,1,,N }\M a j+1 (N 1 M) min i {,1,,N }\M w i+1 k(n 1 M) w min = k(n 1 M) (7) So if min i {,1,,N }\M w i+1 R(N 1 M) >, (8) then for ever nonzero vector z in the null space of A, and the corresponding Hankel matrix Q = H(z), N R σ i > σ i (9) i=1 w It follows that, for any superposition of R < min (N 1 M) complex exponentials, we can correctly recover x over the whole set {, 1,, N } using the incomplete sampling set M, regardless of the separations between different frequencies (or between the continuously-valued parameters z k s for damped complex exponentials) On the one hand, the performance guarantees given in Theorem 1 can be conservative: for average-case performance guarantees,even when the number of complex exponentials R is bigger than predicted by Theorem 1, the Hankel matrix recovery can still recover the missing data, even though the sinusoids can be very close to each other On the other hand, the bounds on recoverable sparsity level R given in Theorem 1 is tight for worst-case performance guarantees, as shown in the next section i=1 9

10 31 Tightness of recoverable sparsity guaranteed by Theorem 1 We will show the tightness of recoverable sparsity guaranteed by Theorem 1 in Section 4, which is built on the developments in Section 41 4 Average-case performance guarantees In this section, we study the performance guarantees for Hankel matrix recovery, when the phases of the coefficients of the R sinusoids are iid and uniformly distributed over, π) For averagecase performance guarantees, we show that we can recover the superposition of a larger number of complex exponentials than Theorem 1 offers 41 Average-case performance guarantees for orthogonal frequency atoms and the tightness of worst-case performance guarantees Theorem Let us consider the signal model of the superposition of R complex exponentials (3) with τ k = We assume that the R frequencies f 1, f,, and f R are such that the atoms (e ıπfi, e ıπfi1,, e ıπfi(n 1) ) T, 1 i R, are orthogonal to each other We let the observation set be M = {, 1,,, N } \ {N 1} We assume the phases of coefficients c 1,, c R in signal model (3) are independent and uniformly distributed over, π) Then the nuclear norm minimization (6) will successfully and uniquely recover H(x) and x, with probability approaching 1 as N if where c > is a constant R = N c log(n)n, (3) Proof We use the following Lemma about the condition for successful signal recovery through nuclear norm minimization This lemma is an extension of Lemma 13 in 1] The key difference is Lemma deals with complex-numbered matrices Moreover, Lemma gets rid of the iff claim for the null space condition in Lemma 13 of 1], because we find that the condition in Lemma 13 of 1] is a sufficient condition for the success of nuclear norm minimization, but not a necessary condition for the success of nuclear norm minimization We give the proof of Lemma, and prove the null space condition is only a sufficient condition for nuclear norm minimization in Appendix 11 and Appendix 1 respectively Lemma Let X be any M N matrix of rank R in C M N, and we observe it through a linear mapping A(X ) = b We also assume that X has a singular value decomposition (SVD) X = UΣV, where U C M R, V C N R, and Σ C R R is a diagonal matrix Then the nuclear norm minimization (31) min X X, subject to A(X) = A(X ), (31) correctly and uniquely recovers X if, for every nonzero element Q N (A), Tr(U QV ) + Ū Q V >, (3) where Ū and V are such that U Ū] and V V ] are unitary 1

11 We can change (6) to the following optimization problem min B, subject to B = H(x), A(x) = b (33) B,x We can think of (33) as a nuclear norm minimization, where he null space of the linear operator (applied to B and x) in the constraints of (14) is given by (H(z), z) such that A(z) = Then we can see that (33) or (6) can correctly recover x as a superposition of R complex exponentials if Tr(V U H(z)) < Ū H(z) V, (34) hold true for any z which is a nonzero vector from the null space of A, where H(x) = UΣV is the singular value decomposition (SVD) of H(x) with U C N R and V C N R, and Ū and V are such that U, Ū] and V, V ] are unitary Without loss of generality, let f k = s k N for 1 k R, where s k s are distinct integers between and N 1 Then e ıπ s 1 N e ıπ s N e ıπ s R N e ıθ1 U = 1 e ıπ s 1 N 1 e ıπ s N 1 e ıπ s R N 1 e ıθ N, e ıπ s 1 N (N 1) e ıπ s N (N 1) e ıπ s R N (N 1) e ıθ R (35) and V = 1 N e ıπ s 1 N e ıπ s N e ıπ s R N e ıπ s 1 N 1 e ıπ s N 1 e ıπ s R N 1 e ı π s 1 N (N 1) e ıπ s N (N 1) e ıπ s R N (N 1), (36) and Σ = N c 1 c c R, (37) where θ k s (1 k R) are iid random variables uniformly distributed over, π) When the observation set M = {, 1,,, N } \ {N 1}, any Q = H(z) with z from the null space of A takes the following form: Q = a 1 1, a C (38) 11

12 Thus 1 Tr(V U Q) = atr U 1 ( ) 1 R N 1 = a N = a N = a R N 1 i=1 t= R i=1 i=1 t= V e ıθi e ıπ s i (N 1 t) N e ıπ s i (t) N (39) e ıθi e ıπ s i (N 1) N (4) e ıθi e ıπ s i (N 1) N (41) Notice that random variable e ıθi e ıπ s i (N 1) N are mutually independent random variables uniformly distributed over the complex unit circle We will use the following lemma (see for example, 8]) to provide a concentration of measure result for the summation of e ıθi e ıπ s i (N 1) N : Lemma 3 (see for example, 8]) For a sequence of iid random matrix matrices M 1,, M K with dimension d 1 d and their sum M = K i=1 M i, if M i satisfies EM i ] =, M i L, i = 1,, K, (4) and M satisfies then { } K ν(m) = max EM i Mi ], K EMi M i ], (43) i=1 ( P( M t) (d 1 + d ) exp i=1 t / v(m) + Lt/3 ), t (44) Applying Lemma 3 to the 1 matrix composed of the real and imaginary parts of e ıθi e ıπ s i (N 1) N, with ν = max(r, R/) = R, L = 1, d 1 + d = 3, we have t / P ( Tr(V U Q) a t) 3e ν+lt/3 = 3e t / R+t/3, t > (45) We further notice that, Ū s ( V s) columns are normalized orthogonal frequency atoms (or their complex conjugates) with frequencies li N, with integer l i s different from the integers s k s of those R complex exponentials Thus (N R) / ( R+(N R)/3 U QV = a (N R) (46) Pick t = N R, and let = c log(n) with c being a positive constant Solving for ) R, we obtain R = N 3 log(n) c log (N) + c log(n)n, which implies (34) holds with probability approaching 1 if N This proves our claims 1

13 4 Tightness of recoverable sparsity guaranteed by Theorem We show that the recoverable sparsity R provided by Theorem is tight for M = {, 1,,, N }\{N 1} For such a sampling set, w min = N, and Theorem provides a bound R < wmin = N In fact, we can show that if R N, we can construct signal examples where the Hankel matrix recovery approach cannot recover the original signal x Consider the signal in Theorem We choose the coefficients c i s such that e ıθi e ıπ s i (N 1) N = 1, then and we also have Tr(V U Q) = a R i=1 e ıθi e ıπ s i (N 1) N = a R, (47) U QV = a (N R) (48) Thus Tr(V U Q) U QV for every Q = H(z) with z in the null space of the sampling operator Thus the Hankel matrix recovery cannot recover the ground truth x This shows that the prediction by Theorem is tight 43 Average-Case performance analysis with arbitrarily close frequency atoms In this section, we further show that the Hankel matrix recovery can successfully recover the superposition of complex exponentials with arbitrarily close frequency atoms In particular, we give average performance guarantees on recoverable sparsity R when frequency atoms are arbitrarily close, and show that the Hankel matrix recovery can deal with much larger recoverable sparsity R for average-case signals than predicted by Theorem 1 We consider a signal x composed of R complex exponentials Among them, R 1 orthogonal complex exponentials (without loss of generality, we assume that these R 1 frequencies take values l i N, where l i s are integers) The other complex exponential c cl e ıπf clj has a frequency arbitrarily close to one of the R 1 frequencies, ie, x j = R 1 k=1 c k e ıπf kj + c cl e ıπf clj, j {, 1,, N }, (49) where f cl is arbitrarily close to one of the first (R 1) frequencies For this setup with arbitrarily close atoms, we have the following average-case performance guarantee Theorem 3 Consider the signal model (49) with R complex exponentials, where the coefficients of these complex exponentials are iid uniformly distributed random over, π), and the first (R 1) of the complex exponentials are such that the corresponding atom vectors in (1) are mutually orthogonal, while the R-th exponential has frequency arbitrarily close to one of the first (R 1) frequencies (in wrap-around distance) Let c min = min{ c 1, c,, c R 1 }, and define d rel = c cl c min + 1 (5) 13

14 if If we have the observation set M = {, 1,,, N } \ {N 1}, and, for any constant c >, R = (N 8 Nd rel + c log(n)) + 1cN log(n) 48c Nd rel log(n) + 4c log (N), (51) then we can recover the true signal x via Hankel matrix recovery with high probability as N, regardless of frequency separations Remarks: 1 We can extend this result to cases where neither of the two arbitrarily close frequencies has on-the-grid frequencies, and the Hankel matrix recovery method can recover a similar sparsity; Under a similar number of complex exponentials, with high probability, the Hankel matrix recovery can correctly recover the signal x, uniformly over every possible phases of the two complex exponentials with arbitrarily close frequencies; 3 We can extend our results to other sampling sets, but for clarity of presentations, we choose M = {, 1,,, N } \ {N 1} Proof To prove this theorem, we consider a perturbed signal x The original signal x and the perturbed x satisfy x j = x j + c cl e ıπf clj c rm e ıπfrmj, j {, 1,, N }, (5) where f rm is a frequency such that its corresponding frequency atom is mutually orthogonal with the atoms corresponding to f 1,, and f R 1 We further define d min = min{ c 1,, c R 1, c rm } (53) Let us define X = H( x), X = H(x), and the error matrix E such that: X = X + E, (54) where E = c cl e ıπfcl c rm e ıπfrm c cl e ıπfcl (N 1) c rm e ıπfrm (N 1) (55) c cl e ıπfcl (N 1) c rm e ıπfrm (N 1) c cl e ıπfcl (N ) c rm e ıπfrm (N ) Both X and X are rank-r matrices For the error matrix E, we have E F = i,k {,,N 1} i,k {,,N 1} = (N ( c cl + c rm ) ) 1/ c cl e ıπfcl (i+k) c rm e ıπfrm (i+k) ( c cl + c rm ) 1/ = N ( c cl + c rm ) (56) 1/ 14

15 Following the derivations in Theorem, X has the following SVD ] Σ X = Ũ ΣṼ H, Ũ = Ũ 1 Ũ ] C N N, Ṽ = Ṽ 1 Ṽ ] C N N, Σ = 1 C N N (57) where Ũ 1 C N R, Σ 1 C R R and Ṽ 1 C N R are defined as in (35), (36) and (37) The polar decomposition of X using its SVD is given by X = P H, P = Ũ 1 Ṽ 1, H = Ṽ 1 Σ 1 Ṽ 1, (58) and the matrix P is called the unitary polar factor Similarly, for X, we have ] X = UΣV, U = U 1 U ] C N N, V = V 1 V ] C N N Σ1, Σ =, (59) and its polar decomposition for X, X = P H, P = U 1 V 1, H = V 1 Σ 1 V 1 (6) Suppose that we try to recover x through the following nuclear norm minimization: min H(x) x st A(x) = b (61) As in the proof of Theorem, we analyze the null space condition for successful recovery using Hankel matrix recovery Towards this, we first bound the unitary polar factor of X through the perturbation theory for polar decomposition Lemma 4 ( 16], Theorem 34) For matrices X C m n r and X C m n r with SVD defined as (57) and (59), let σ r and σ r be the smallest nonzero singular values of X and X, respectively, then ( P P σ r + σ r + where is any unitary invariant norm max{σ r, σ r } ) X X, (6) For our problem, we have m = n = N, r = R Let σ R be the R-th singular value of X From (36) and (53), an explicit form for σ R is Then Lemma 4 implies that σ R = Nd min (63) P P F 4 σ R X X F (64) From the null space condition for nuclear norm minimization (61), we can correctly recover x if Tr(V 1 U 1 Q) < Ū 1 Q V 1, (65) 15

16 for every nonzero Q = H(z) with z N (A), where Ū1 and V 1 are such that U 1 Ū 1 ] and V 1 V1 ] are unitary, ie, Ū1 = U and V 1 = V Since the observation set M = {, 1,,, N }\{N 1}, Q takes the form in (38) with a C, and Ū 1 Q V 1 = a (N R) (66) Let us define P = P P = Ũ1Ṽ 1 U 1 V 1 Then it follows from Lemma 4 and (56) that Tr(V 1 U 1 Q) = < Q, U 1 V 1 > < Q, P > + < Q, P > = Tr(Ṽ 1 Ũ 1 Q) + < Q, P > Tr(Ṽ 1 Ũ 1 Q) + ( Q F P F ) 1/ Thus if namely, Tr(Ṽ 1 Ũ 1 Q) + a N P F Tr(Ṽ 1 Ũ1 Q) + a 4 N E F σ R Tr(Ṽ 1 Ũ1 Q) + a 4 N N ( c cl + c rm ) Nd min Tr(Ṽ 1 Ũ 1 Q) + 4 a N c cl + c rm d min (67) Tr(Ṽ 1 Ũ 1 Q) + 4 a N c cl + c rm d min < a (N R), (68) Tr(Ṽ 1 Ũ1 Q) < a (N R 4 ) N d rel, (69) where d rel is defined as c cl + c rm d min, then solving (61) will correctly recover x The proof of Theorem leads to the concentration inequality (45): ( ) P Tr(Ṽ Ũ Q) a t 3e t / R+t/3, t > Taking t = N R 4 Nd rel, we have t / R + t/3 = (N R 4 Ndrel ) / R + (N R 4 Nd rel )/3 = 3 N + R + 16Nd rel NR 8N Nd rel + 8R Nd rel 3R + N R 4 Nd rel (7) To have successful signal recovery with high probability, we let 3 N + R + 16Nd rel NR 8N Ndrel + 8R Nd rel 3R + N R 4 = c log(n), (71) Nd rel 16

17 where c > is a constant So we have where R + sr + r =, (7) s = 8 Nd rel N c log(n), r = 16Nd rel 8 NNd rel cn log(n) + 4c Nd rel log(n) (73) Solving this leads to R = (N 8 Nd rel + c log(n)) + 1cN log(n) 48c Nd rel log(n) + 4c log (N) (74) Since we can freely choose the coefficient c rm, we choose c rm such that c rm = c min = min{ c 1, c,, c R 1 } Under such a choice for c rm, d min = c rm = c min, leading to d rel = = c cl +c min c min This concludes the proof of Theorem 3 c cl + c rm d min 5 Separation is always necessary for the success of atomic norm minimization In the previous sections, we have shown that the Hankel matrix recovery can recover the superposition of complex exponentials, even though the frequencies of the complex exponentials can be arbitrarily close In this section, we show that, broadly, the atomic norm minimization must obey a non-trivial resolution limit This is very different from the behavior of Hankel matrix recovery Our results in this section also greatly generalize the the necessity of resolution limit results in 6], to general continuously-parametered dictionary, beyond the dictionary of frequency atoms Moreover, our analysis is very different from the derivations in 6], which used the Markov-Bernstein type inequalities for finite-degree polynomials Theorem 4 Let us consider a dictionary with its atoms parameterized by a continuous-valued parameter τ C We also assume that each atom a(τ) belongs to C N, where N is a positive integer We assume that the set of all the atoms span a Q-dimensional subspace in C N Suppose the signal is the superposition of several atoms: x = R c k a(τ k ), (75) k=1 where the nonzero c k C for each k, and R is a positive integer representing the number of active atoms Consider any active atoms a(τ k1 ) and a(τ k ) With the other (R ) active atoms and their coefficients fixed (this includes the case R =, namely there are only two active atoms in total), if the atomic norm minimization can always identify the two active atoms, and correctly recover their coefficients for a(τ k1 ) and a(τ k ), then the two atoms a(τ k1 ) and a(τ k ) must be well separated such that a(τ k1 ) a(τ k ) max max σ min (A), (76) S Q A M S S where S is a positive integer, M S is the set of matrices with S columns and with each of these S columns corresponding to an atom, and σ min ( ) is the smallest singular value of a matrix 17

18 Proof Define the sign of a coefficient c k as sign(c k ) = c k c k (77) Then according to 3] and 6], a necessary condition for the atomic norm to identify the two active atoms, and correctly recover their coefficients is that, there exists a dual vector q C N such that { a(τ k1 ) q = sign(c k1 ), a(τ k ) q = sign(c k ) a(τ ) (78) q 1, τ / {τ k1, τ k } We pick c k1 and c k such that sign(c k ) sign(c j ) = Then sign(c k ) sign(c j ) = a(τ k1 ) q a(τ k ) q q a(τ k1 ) a(τ k ), (79) Thus q a(τ k1 ) a(τ k ) (8) Now we take a group of S atoms, denoted by a select,1,, and a select,s, and use them to form the S columns of a matrix A Then σ min (A) q A q = S a select,j q j=1 S, (81) where the last inequality comes form (78) It follows that S q σ min (A) (8) Define β as the maximal value of σmin(a) S among all the choices for A and S, namely, β = max max σ min (A) (83) S Q A M S S Combining (8) and (8), we have a(τ k1 ) a(τ k ) β, (84) proving this theorem 18

19 6 Orthonormal Atomic Norm Minimization: Hankel matrix recovery can be immune from atom separation requirements Our results in the previous sections naturally raise the following question: why can Hankel matrix recovery work without requiring separations between the underlying atoms while it is necessary for the atomic norm minimization to require separations between the underlying atoms? In this section, we introduce the concept of orthonormal atomic norm and its minimization, which explains the success of Hankel matrix recovery in recovering the superposition of complex exponentials regardless of the separations between frequency atoms Let us consider a vector w C N, where N is a positive integer We denote the set of atoms by AT OMSET, and assume that each atom a(τ) (parameterized by τ) belongs to C N Then the atomic norm of w AT OMIC is given by 3, 7] { s } s w AT OMIC = inf c k : w = c k a(τ k ) (85) s,τ k,c k k=1 We say the atomic norm x AT OMIC is an orthonormal atomic norm if, for every x, s w AT OMIC = c k, (86) where w = s k=1 c ka(τ k ), a(τ k ) = 1 for every k, and a(τ k ) s are mutually orthogonal to each other In the Hankel matrix recovery, the atom set AT OMSET is composed of all the rank-1 matrices in the form uv, where u and v are unit-norm vectors in C N Let us assume x C N 1 We can see the nuclear norm of a Hankel matrix is an orthonormal atomic norm of H(x): H(x) = k=1 k=1 R c k, (87) where H(x) = s k=1 c ku k vk, H(x) = UΣV is the singular value decomposition of H(x), u k is the k-th column of U, and v k is the k-th column of V This is because the matrices u k vk s are orthogonal to each other and each of these rank-1 matrices has unit energy Let us now further assume that x C N 1 is the superposition of R complex exponentials with R N, as defined in (3) Then H(x) is a rank-r matrix, and can be written as H(x) = R k=1 c ku k vk, where H(x) = UΣV is the singular value decomposition of H(x), u k is the k-th column of U, and v k is the k-th column of V Even though the original R frequency atoms a(τ k ) s for x can be arbitrarily close, we can always write H(x) as a superposition of R orthonormal atoms u k vk s from the singular value decomposition of H(x) Because the original R frequency atoms a(τ k ) s for x can be arbitrarily close, they can violate the necessary separation condition set forth in (76) However, for H(x), its composing atoms can be R orthonormal atoms u k vk s from the singular value decomposition of H(x) These atoms u k vk s are of unit energy, and are orthogonal to each other Thus these atoms are well separated and have the opportunity of not violating the necessary separation condition set forth in (76) This explains why the Hankel matrix recovery approach can break free from the separation condition which is required for traditional atomic norm minimizations k=1 19

20 7 A matrix-theoretic inequality of nuclear norms and its proof from the theory of compressed sensing In this section, we present a new matrix-theoretic inequality of nuclear norms, and give a proof of it from the theory of compressed sensing (using nuclear norm minimization) To the best of our knowledge, we have not seen the statement of this inequality of nuclear norms, or its proof elsewhere in the literature Theorem 5 Let A C m n, and t = min(m, n) Let σ 1, σ,, and σ t be the singular values of A arranged in descending order, namely Let k be any integer such that σ 1 σ σ t σ σ k < σ k σ t Then for any orthogonal projector P onto k-dimensional subspaces in C m, and any orthogonal projector Q onto k-dimensional subspaces in C n, we have In particular, P AQ (I P )A(I Q) (88) A 1:k,1:k A (k+1):m,(k+1):n, (89) where A 1:k,1:k is the submatrix of A with row indices between 1 and k and column indices between 1 and k, and A (k+1):m,(k+1):n is the submatrix of A with row indices between k + 1 and m and column indices between k + 1 and n Proof We first consider the case where all the elements of A are real numbers Without loss of generality, we consider ] I P = k k k (m k), (9) (m k) k (m k) (m k) and Q = ] I k k k (n k) (n k) k (n k) (n k) (91) Then P AQ = A 1,1 A 1,k A k,1 A k,k, (9)

21 and (I P )A(I Q) = A P A AQ + P AQ = A k+1,k+1 A k+1,n A m,k+1 A m,n (93) Thus P AQ = A 1:k,1:k, (I P )A(I Q) = A k+1:m,k+1:n (94) To prove this theorem, we first show that if σ σ k < σ k σ t, for any matrix X C m n with rank no more than k, for any positive number l >, we have The proof of (95) follows similar arguments as in 1] X + la > X (95) X + la (96) t σ i (X) σ i (la) (97) > i=1 k (σ i (X) σ i (la)) + i=1 k σ i (X) + ( i=1 t i=k+1 t i=k+1 σ i (la) σ i (X) σ i (la) (98) k σ i (la)) (99) i=1 k σ i (X) = X, (1) i=1 where, for the first inequality, we used the following lemma, which instead follows from Lemma 6 Lemma 5 Let G and H be two matrices of the same dimension Then t i=1 σ i(g) σ i (H) G H Lemma 6 ( 1, 14]) For arbitrary matrices X, Y, and Z = X Y C m n Let Let σ 1, σ,, and σ t (t = min{m, n}) be the singular values of A arranged in descending order, namely σ 1 σ σ t Let s i (X, Y ) be the distance between the i-th singular value of X and Y, namely, s i (X, Y ) = σ i (X) σ i (Y ), i = 1,,, k (11) Let s i] (X, Y ) be the i-th largest value of sequence s 1 (X, Y ), s (X, Y ),, s t (X, Y ), then k s i] (X, Y ) Z k, k = 1,,, t, (1) i=1 where Z k is defined as k i=1 σ i(z) 1

22 We next show that if A 1:k,1:k > A (k+1):m,(k+1):n, one can construct a matrix X with rank at most k such that X + la X for a certain l > We divide this construction into two cases: when A 1:k,1:k has rank equal to k, and when A 1:k,1:k has rank smaller than k When A 1:k,1:k has rank k, we denote its SVD as Then the SVD of and We now construct Let us denote A 1:k,1:k = U 1 ΣV 1 ] A1:k,1:k is given by ] ] A1:k,1:k U1 = Σ X = U1 U = V = then the subdifferential of at X is given by For any Z X, where ] V1 U1 V1 ] ], ] V1 ] (13) X = {Z : Z = U V + M, where M 1, M X =, XM = } (14) Let us partition the matrix A into four blocks: A11 A 1 A 1 A Z, A = I 1 + I, (15) I 1 = Tr(V U A), I = Tr(M A) (16) ], (17) where A 11 R k k, A 1 R k (n k), A 1 R (m k) (n k), and A 1 R (m k) (n k) Then we have ( I 1 = Tr(V U V1 A) = Tr ( V1 U1 = Tr ] A11 A 1 ] U 1 ] A11 A 1 ] ) A 1 A = Tr(V 1 U 1 A 11 ) = Tr(V 1 U 1 U 1 ΣV 1 ) = A 1 A ] ) k σ i (A 11 ) = A 11 (18) i=1

23 Since M X =, XM =, M 1 and X is a rank-k left top corner matrix, we must have ] M =, M where M is of dimension (m k) (n k), and M 1 Then I = Tr(M A) (19) ( ] ] ) A11 A = Tr 1 M (11) A 1 A = Tr (MA ) A, (111) where the last inequality is from the fact that the nuclear norm is the dual norm of the spectral norm, and M 1 Thus we have Z, A = I 1 + I A 11 + A <, (11) because we assume that A 1:k,1:k > A (k+1):m,(k+1):n Since Z, A < for every Z X, A is in the normal cone of the convex cone generated by at the point X By Theorem 37 in 3], we know that the normal cone of the convex cone generated by at the point X is the cone of descent directions for at the point of X Thus A is in the descent cone of at the point X This means that, when A 11 has rank equal to k, there exists a positive number l >, such that X + la X Let us suppose instead that A 11 has rank b < k We can write the SVD of A 11 as A 11 = ] ] Σ ] U 1 U 3 V1 V 3 (113) Then we construct X = U1 U 3 ] V1 V 3 By going through similar arguments as above (except for taking care of extra terms involving U 3 and V 3 ), one can obtain that Z, A < for every Z X In summary, no matter whether A 11 has rank equal to k or smaller to k, there always exists a positive number l >, such that X + la X However, this contradicts (95), and we conclude A 1:k,1:k A (k+1):m,(k+1):n, when A has real-numbered elements We further consider the case when A is a complex-numbered matrix We first derive the subdifferential of X for any complex-numbered matrix m n X For any α R m n and any β R m n, we define F : R m n R as ( ]) α F = (α + ıβ) β (114) ( ]) Re(X) To find the subdifferential of X, we need to derive F, for which we have the Im(X) following lemma, the proof of which is given in Appendix 13 (Note that in our earlier work ] ] 3

24 and the ( corresponding ]) preprint on arxivorg, we have already shown one direction of 116, namely Re(X) H F Now we show the two sets are indeed equal ) Im(X) Lemma 7 Suppose a rank-r matrix X C M N admits a singular value decomposition X = UΣV, where Σ R R R is a diagonal matrix, and U C M R and V C N R satisfy U U = V V = I Define S C M N as: S = { UV + W U W =, W V =, W 1, W C M N }, (115) and define Then we have F ( ]) Re(X) = X Im(X) { } α H α + ıβ S = F β] ( ]) Re(X) (116) Im(X) For a complex-numbered matrix A, we will similarly show that, if A 1:k,1:k > A (k+1):m,(k+1):n, one can construct a matrix X with rank at most k such that X + la X for a certain l > We divide this construction into two cases: when A 1:k,1:k has rank equal to k, and when A 1:k,1:k has rank smaller than k When A 1:k,1:k has rank k, we denote its SVD as Then the SVD of A1:k,1:k ] A 1:k,1:k = U 1 ΣV 1 is given by A1:k,1:k ] = U1 ] Σ V1 ] (117) We now construct X = U1 ] V1 ] We denote U = U1 ], and V = V1 ], then by Lemma 7, the subdifferential of at X is given by For any Z X, X = {Z : Z = U V + M], where M 1, M X =, XM = } (118) Z, A = I 1 + I, (119) 4

25 where I 1 = Re (Tr(V U A)), I = Re (Tr(M A)) (1) Similar to the real-numbered case, let us partition the matrix A into four blocks: ] A11 A 1, (11) A 1 A where A 11 C k k, A 1 C k (n k), A 1 C (m k) (n k), and A 1 C (m k) (n k) We still have ( ] Tr(V U V1 A) = Tr U 1 ] ] ) A11 A 1 A 1 A ( ] ] ) V1 U1 = Tr A11 A 1 A 1 A k = Tr(V 1 U1 A 11 ) = Tr(V 1 U1 U 1 ΣV1 ) = σ i (A 11 ) = A 11 (1) So I 1 = Re (Tr(V U A)) = A 11 Since M X =, XM =, M 1 and X is a rank-k left top corner matrix, we must have ] M =, M where M is of dimension (m k) (n k), and M 1 Then we have I = Re (Tr(M A)) (13) ( ( ] ] )) A11 A = Re Tr 1 M (14) A 1 A = Re (Tr (MA )) A, (15) where the last inequality is because the nuclear norm is the dual norm of the spectral norm, and M 1 Thus we have Z, A = I 1 + I A 11 + A <, (16) because we assume that A 1:k,1:k > A (k+1):m,(k+1):n Since Z, A < for every Z X, A is in the descent cone of at the point X This means that, when A 11 has rank equal to k, there exists a positive number l >, such that X + la X Let us suppose instead that A 11 has rank b < k We can write the SVD of A 11 as i=1 A 11 = U 1 U 3 ] Σ ] V1 V 3 ] (17) 5

26 Then we construct X = U1 U 3 ] V1 V 3 ] By going through similar arguments as above (except for taking care of extra terms involving U 3 and V 3 ), one can obtain that Z, A < for every Z X In summary, no matter whether complex-numbered A 11 has rank equal to k or smaller to k, there always exists a positive number l >, such that X +la X However, this contradicts (95), and we conclude A 1:k,1:k A (k+1):m,(k+1):n 8 Numerical results In this section, we perform numerical experiments to demonstrate the empirical performance of Hankel matrix recovery, and show its robustness to the separations between atoms We use superpositions of complex sinusoids as test signals But we remark that Hankel matrix recovery can also work for superpositions of complex exponentials We consider the non-uniform sampling of entries studied in 6, 7], where we uniformly randomly observe M entries (without replacement) of x from {, 1,, N } We also consider two signal (frequency) reconstruction algorithms: the Hankel nuclear norm minimization and the atomic norm minimization We fix N = 64, ie, the dimension of the ground truth signal x is 17 We conduct experiments under different M and R for different approaches For each approach with a fixed M and R, we test 1 trials, where each trial is performed as follows We first generate the true signal x = x, x 1,, x 16 ] T with x t = R k=1 c ke ıπfkt for t =, 1,, 16, where f k are frequencies drawn from the interval, 1] uniformly at random, and c k are complex coefficients satisfying the model c k = ( m k )e iπθ k with m k and θ k uniformly randomly drawn from the interval, 1] Let the reconstructed signal be represented by ˆx If ˆx x x 1 3, then we regard it as a successful reconstruction We also provide the simulation results under the Gaussian measurements of x as in ] We plot in Figure 1 the rate of successful reconstruction with respect to different M and R for different approaches The black and white region indicate a % and 1% of successful reconstruction respectively, and a grey between % and 1% From the figure, we see that the atomic norm minimization still suffers from non-negligible failure probability even if the number of measurements approach the full 17 samples The reason is that, since the underlying frequencies are randomly chosen, there is a sizable probability that some frequencies are close to each other When the frequencies are close to each other violating the atom separation condition, the atomic norm minimization can still fail even if we observe the full 17 samples By comparison, the Hankel matrix recovery approach experiences a sharper phase transition, and is robust to the frequency separations We also see that under both the Gaussian projection and the non-uniform sampling models, both the atomic norm minimization and the Hankel matrix recovery approach have similar performance We further demonstrate the robustness of the Hankel matrix recovery approach to the separations between frequency atoms, as we vary the separations between frequencies In our first set of experiments, we take N = 64, M = M = 65 ( 51% sampling rate) and R = 8, and consider noiseless measurements Again we generate the magnitude of the coefficients as p where p is uniformly randomly generated from, 1), and the realized magnitudes are 318, 5894, 6

ECE 8201: Low-dimensional Signal Models for High-dimensional Data Analysis

ECE 8201: Low-dimensional Signal Models for High-dimensional Data Analysis Lecture 7: Matrix completion Yuejie Chi The Ohio State University Page 1 Reference Guaranteed Minimum-Rank Solutions of Linear