Majorizing measures and proportional subsets of bounded orthonormal systems

Similar documents
Subspaces and orthogonal decompositions generated by bounded orthogonal systems

Uniform uncertainty principle for Bernoulli and subgaussian ensembles

Entropy extension. A. E. Litvak V. D. Milman A. Pajor N. Tomczak-Jaegermann

An example of a convex body without symmetric projections.

Diameters of Sections and Coverings of Convex Bodies

Sections of Convex Bodies via the Combinatorial Dimension

Rademacher Averages and Phase Transitions in Glivenko Cantelli Classes

The covering numbers and low M -estimate for quasi-convex bodies.

On the constant in the reverse Brunn-Minkowski inequality for p-convex balls.

On the singular values of random matrices

c Birkhäuser Verlag, Basel 2004 GAFA Geometric And Functional Analysis

A Banach space with a symmetric basis which is of weak cotype 2 but not of cotype 2

Lecture 3. Random Fourier measurements

ALEXANDER KOLDOBSKY AND ALAIN PAJOR. Abstract. We prove that there exists an absolute constant C so that µ(k) C p max. ξ S n 1 µ(k ξ ) K 1/n

The Rademacher Cotype of Operators from l N

A Bernstein-Chernoff deviation inequality, and geometric properties of random families of operators

Random sets of isomorphism of linear operators on Hilbert space

Sparse Recovery with Pre-Gaussian Random Matrices

QUOTIENTS OF ESSENTIALLY EUCLIDEAN SPACES

MAJORIZING MEASURES WITHOUT MEASURES. By Michel Talagrand URA 754 AU CNRS

Empirical Processes and random projections

Reconstruction from Anisotropic Random Measurements

Rapid Steiner symmetrization of most of a convex body and the slicing problem

REPRESENTABLE BANACH SPACES AND UNIFORMLY GÂTEAUX-SMOOTH NORMS. Julien Frontisi

The Knaster problem and the geometry of high-dimensional cubes

Another Low-Technology Estimate in Convex Geometry

Semi-strongly asymptotically non-expansive mappings and their applications on xed point theory

Weak and strong moments of l r -norms of log-concave vectors

Inverse Brascamp-Lieb inequalities along the Heat equation

Houston Journal of Mathematics. c 2002 University of Houston Volume 28, No. 3, 2002

On the minimum of several random variables

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

GAUSSIAN MEASURE OF SECTIONS OF DILATES AND TRANSLATIONS OF CONVEX BODIES. 2π) n

Subsequences of frames

Boundedly complete weak-cauchy basic sequences in Banach spaces with the PCP

THERE IS NO FINITELY ISOMETRIC KRIVINE S THEOREM JAMES KILBANE AND MIKHAIL I. OSTROVSKII

Banach spaces without local unconditional structure

Extensions of Lipschitz functions and Grothendieck s bounded approximation property

Universal low-rank matrix recovery from Pauli measurements

ON UNCONDITIONALLY SATURATED BANACH SPACES. 1. Introduction

Isoperimetry of waists and local versus global asymptotic convex geometries

Ginés López 1, Miguel Martín 1 2, and Javier Merí 1

Spectral theory for linear operators on L 1 or C(K) spaces

Smallest singular value of sparse random matrices

On the Equivalence Between Geometric and Arithmetic Means for Log-Concave Measures

A REMARK ON THE GEOMETRY OF SPACES OF FUNCTIONS WITH PRIME FREQUENCIES.

On John type ellipsoids

arxiv: v1 [math.pr] 22 May 2008

WEAKLY NULL SEQUENCES IN L 1

ON THE CONVERGENCE OF GREEDY ALGORITHMS FOR INITIAL SEGMENTS OF THE HAAR BASIS

Geometry of log-concave Ensembles of random matrices

Djalil Chafaï Olivier Guédon Guillaume Lecué Alain Pajor INTERACTIONS BETWEEN COMPRESSED SENSING RANDOM MATRICES AND HIGH DIMENSIONAL GEOMETRY

BERNARD HOST AND BRYNA KRA

The small ball property in Banach spaces (quantitative results)

THE TRACE FORMULA IN BANACH SPACES. Dedicated to the memory of Joram Lindenstrauss

On isotropicity with respect to a measure

Invertibility of random matrices

Super-Gaussian directions of random vectors

Invertibility of symmetric random matrices

NOWHERE LOCALLY UNIFORMLY CONTINUOUS FUNCTIONS

ON SUBSPACES OF NON-REFLEXIVE ORLICZ SPACES

Dimensional behaviour of entropy and information

REAL RENORMINGS ON COMPLEX BANACH SPACES

THE SMALLEST SINGULAR VALUE OF A RANDOM RECTANGULAR MATRIX

On metric characterizations of some classes of Banach spaces

On the Irreducibility of the Commuting Variety of the Symmetric Pair so p+2, so p so 2

A DECOMPOSITION THEOREM FOR FRAMES AND THE FEICHTINGER CONJECTURE

A Lower Estimate for Entropy Numbers

APPROXIMATE ISOMETRIES ON FINITE-DIMENSIONAL NORMED SPACES

Optimisation Combinatoire et Convexe.

1 Cheeger differentiation

Czechoslovak Mathematical Journal

Spectral Gap and Concentration for Some Spherically Symmetric Probability Measures

Non-linear factorization of linear operators

Estimates for the affine and dual affine quermassintegrals of convex bodies

ON THE CONVEX INFIMUM CONVOLUTION INEQUALITY WITH OPTIMAL COST FUNCTION

Lipschitz p-convex and q-concave maps

Small Ball Probability, Arithmetic Structure and Random Matrices

INSTITUTE of MATHEMATICS. ACADEMY of SCIENCES of the CZECH REPUBLIC. A universal operator on the Gurariĭ space

Chapter 2 Smooth Spaces

Curriculum Vitae. Education: Employment history:

Strong subdifferentiability of norms and geometry of Banach spaces. G. Godefroy, V. Montesinos and V. Zizler

ANSWER TO A QUESTION BY BURR AND ERDŐS ON RESTRICTED ADDITION, AND RELATED RESULTS Mathematics Subject Classification: 11B05, 11B13, 11P99

Least singular value of random matrices. Lewis Memorial Lecture / DIMACS minicourse March 18, Terence Tao (UCLA)

On Riesz-Fischer sequences and lower frame bounds

THE SMALLEST SINGULAR VALUE OF A RANDOM RECTANGULAR MATRIX

Auerbach bases and minimal volume sufficient enlargements

Convex inequalities, isoperimetry and spectral gap III

LARGE DEVIATIONS OF TYPICAL LINEAR FUNCTIONALS ON A CONVEX BODY WITH UNCONDITIONAL BASIS. S. G. Bobkov and F. L. Nazarov. September 25, 2011

arxiv:math/ v1 [math.fa] 29 Apr 1999

ON THE ISOTROPY CONSTANT OF PROJECTIONS OF POLYTOPES

arxiv:math/ v1 [math.mg] 20 May 1996

Curriculum Vitae. Education: Employment history:

Gaussian Measure of Sections of convex bodies

ESTIMATES FOR MAXIMAL SINGULAR INTEGRALS

arxiv: v1 [math.oc] 21 Mar 2015

On Z p -norms of random vectors

Geometry of Banach spaces and sharp versions of Jackson and Marchaud inequalities

Potpourri, 3. Stephen William Semmes Rice University Houston, Texas. 1 Gaussian random variables 2. 2 Rademacher functions 3. 3 Lacunary series 5

FREMLIN TENSOR PRODUCTS OF CONCAVIFICATIONS OF BANACH LATTICES

Transcription:

Majorizing measures and proportional subsets of bounded orthonormal systems Olivier GUÉDON Shahar MENDELSON1 Alain PAJOR Nicole TOMCZAK-JAEGERMANN Abstract In this article we prove that for any orthonormal system (ϕ j ) n L that is bounded in L, and any 1 < k < n, there exists a subset I of cardinality greater than n k such that on span{ϕ i } i I, the L 1 norm and the L norm are equivalent up to a factor µ(log µ) 5/, where µ = n/k log k. The proof is based on a new estimate of the supremum of an empirical process on the unit ball of a Banach space with a good modulus of convexity, via the use of majorizing measures. 0 Introduction. We study some natural empirical processes determined by uniformly convex Banach spaces with modulus of convexity of power type. Results of this kind were extensively studied in a Hilbertian setting, and became an important tool for investigations, for example, of the behaviour of various random sets of vectors (as in [14, 15, 7, 16,, 6]). We then apply these results to address a problem of selecting a subset of a bounded orthonormal system (for example, a set of characters), in the spirit of a result of Bourgain (see [0]) and of Talagrand [0], that also has applications to a sparse reconstruction ([, 16, 6]). A particular case of this result, formulated in Theorem 3, is a so called Kashin s splitting of a set of k orthonormal vectors in L that is bounded in L. 0 000 MSC-classification: 46B07, 46B09, 4A05, 4A61 1 Partially supported by an Australian Research Council Discovery grant and an Israel Sciences foundation grant 666/06. This author holds the Canada Research Chair in Geometric Analysis. 1

Theorem. There exist two positive constants c and C such that for any even integer n and any orthonormal system (ϕ j ) n in L with ϕ j L L for 1 j n, we can find a subset I {1,..., n} with n/ c n I n/ + c n such that for every a = (a i ) C n, a i ϕ i C L log n(log log n) 5/ a i ϕ i i I L i I L1 and a i ϕ i i/ I L C L log n(log log n) 5/ a i ϕ i. i/ I L1 This result strengthens Theorem.4 of [6] and is almost optimal. We have been told by J. Bourgain that the term log n is necessary. We would like to thank L. Rodriguez-Piazza for showing us the details of the proof of this optimality in the case of the Walsh system and for allowing us to present the argument at the end of this paper. We thank also the referee who remarks that this construction allows to show that this bound is almost optimal for the trigonometric system as well. The technical proof of Theorem 1 about empirical processes will be presented in the first part. It is based on a construction of majorizing measures developped in [14] and [7] and the main new ideas we use are some observations on packing and covering numbers. 1 Maximal deviation of the empirical moment. We begin this section with some definitions and notation. If E is a normed space we denote by E the dual space to E and the dual norm is denoted by. The modulus of convexity of E is defined for any ε (0, ) by δ E (ε) = inf { 1 x + y }, x = 1, y = 1, x y > ε. We say that E has modulus of convexity of power type if there is a constant c such that δ E (ε) cε for every ε (0, ). It is well-known (see e.g., [11], Proposition.4) that this property is equivalent to the fact that the inequality x + y + λ x y 1 ( x + y ) (1)

holds for all x, y E (where λ > 0 is a constant depending only on c). If (1) is satisfied then we say that E has modulus of convexity of power type with constant λ (in such a case it is clear that δ E (ε) ε /λ ). The notions of type (and cotype) of a Banach space were studied extensively during the 70 s (see, for example, [10] and the survey [9]). A Banach space E has type p if there is a constant C such that for every N N and every x 1,..., x N ( N N ) 1/p E g i x i C x i p () i=1 where g 1,..., g N are standard independent gaussian variables (that is g i N (0, 1)). The smallest constant C for which () holds is called the type p constant of E and is denoted by T p (E). Moreover, it is well known that if E has modulus of convexity of power type then the dual space E has also modulus of smoothness of power type, and therefore, E has type (see, for example, Theorem 1.e.16 of [8]). i=1 1.1 Results on empirical processes. Our first theorem generalizes a result of Rudelson [15], proved in a Hilbertian setting, to the case of a Banach space with modulus of convexity of power type. In fact, in Theorem 1 we solve a question left open in [7] by removing the condition on the distance of E to an Euclidean space of the same dimension. Theorem 1. There exists an absolute constant C for which the following holds. Let E be a Banach space with modulus of convexity of power type with constant λ. Then, for every vectors X 1,..., X m in E, m E sup ε y B E j X j, y (3) C λ 4 T (E ) ( m ) 1/ log m max X j sup X j, y, 1 j m y B E where the expectation is taken over the i.i.d. Bernoulli random variables (ε j ) 1 j m and T (E ) is the type constant of E. The proof of Theorem 1 uses the same construction of a majorizing measure used in [14] and in [7], and this construction was inspired by the work of 3

Talagrand in [18]. The improvement in Theorem 1 compared with [7] comes from entropy estimates that will be presented in Section 1.. We will only sketch the construction of the majorizing measure in Section 1.3 and explain how the argument from [7] may be adapted using the new estimates. As it has been proved in [15], one can apply a symmetrization argument due to Giné and Zinn [5] combined with Theorem 1 and deduce the following result. Theorem. There exists an absolute constant C for which the following holds. Let E be a Banach space with modulus of convexity of power type with constant λ. Let X be a random vector in E and set X 1,..., X m to be independent copies of X. If A = Cλ 4 T (E ) log m m ( ) 1/ E max X j and σ = sup E X, y 1 j m y B E then E sup y B E 1 m m X j, y E X, y A + σ A. We omit further details and refer the reader to one of the articles [15, 7, 16, 6]. 1. Covering and packing numbers. The new ingredient of our proof, in comparison with [7], are estimates on packing and covering numbers which we shall now discuss. Definition 1. Let T and B be symmetric convex bodies in a Banach space E. Define N(T, B) to be the minimal number of translates of B needed to cover T i.e. N(T, B) = inf{n, {x 1,..., x N } T such that T N x j + B}. We denote by M(T, B) the maximal number of disjoint translates of B by elements of T i.e. M(T, B) = sup{n, {x 1,..., x N } T such that x j x k / B for j k}. 4

It is well known that these two quantities are related by N(T, B) M(T, B) N(T, B/). (4) Note that if E is a normed space, B E is its unit ball and B = εb E, then M(T, εb E ) is the cardinality of a maximal ε-separated subset of T with respect to the norm in E. Also, if S : X Y is an operator, we sometimes denote M(SB X, εb Y ) by M(S : X Y, ε), and use a similar notation for the covering numbers N. The following is a combination of results that appeared in [1] and in [3] (see Lemma 3.3 in [18]). We repeat its proof since we need precise estimates on the dependence on the modulus of convexity of the space E. Lemma 1. Let E be a Banach space of modulus of convexity of power type with constant λ and denote by T (E ) the type constant of E. Let X 1,..., X m be vectors in E such that for every 1 j m, X j L, and define for every y E, y,m = max 1 l m X l, y. If ε > 0 and x 1,..., x N B E are ε-separated with respect to,m then where C is an absolute constant. ε log N Cλ T (E ) L log m Proof. Let (e i ) m i=1 be the standard basis in l m 1 and define S : l m 1 E by Se l = X l for every l = 1,..., m. Hence, for every l l, S (x l x l ) ε where denotes the norm in l m. Thus, N M(S : E l m, ε). Since S = max 1 l N X l L and E has type, Proposition 1 in [3] yields that ε log N(S : l m 1 E, ε) C L T (E ) log m where C is an absolute constant. By (4), a similar estimate holds for M(S : l m 1 E, ε), where C is replaced by a new constant, also denoted by C. Define the function f(ε) = ε log M(S : l m 1 E, ε) 5

and observe that f is bounded by C L T (E ) log m. Since E has modulus of convexity of power type with constant λ, Proposition in [1] states that for every θ ε, M(S : E l m, ε) M(S : E l m, θ)m(s : l m 1 E, cθδ E (ε/θ)) and recall that δ E (ε/θ) ε /8λ θ. Therefore, if h(ε) = ε log M(S : E l m, ε), then the previous inequality implies that h(ε) ε θ h(θ) + c ( λ θ ε ) f( cε θλ ) where c, c are absolute constants. Choosing θ = ε, it follows that h is bounded by C λ L T (E ) log m proving the announced result. The next lemma is a simple application of Sudakov s inequality [17]. Lemma. There exists an absolute constant C for which the following holds. Let E be a Banach space such that E has type. Assume that X 1,..., X m are vectors in E with sup 1 j m X j L. Let α 1,..., α m such that m l=1 α l 1 and for every y E set m y E = X l, y αl. l=1 If ε > 0 and x 1,..., x N B E are ε-separated with respect to E, then ε log N CT (E )L. Proof. Let E 1 be the ellipsoid in R m consisting of all y R m such that y E1 := ( m l=1 α l y, e l ) 1/ 1. Set H = (R m, E1 ) and define the operator S : H E by Se i = X i, where (e i ) m i=1 is the standard basis in R m. Note that for every x i, x j, S (x i x j ) E 1 = m αl S (x i x j ), e l = x i x j E, l=1 6

and thus the points {S (x 1 ),..., S (x N )} are ε-separated in E1 and belong to S (B E ). By (4), N N(S (B E ), (ε/)e 1 ). On the other hand, E 1 = T 1 (B m ) where T : R m R m is the diagonal operator T e l = α l e l. Applying Sudakov s inequality [17], ε log N ε log N(S (B E ), (ε/)t 1 (B m )) CE sup z T (S (B E )) G, z (5) where G is a canonical Gaussian vector in l m Moreover and C is an absolute constant. E sup G, z = E sup T G, S y z T (S (B E )) y B E ( m m 1/ = E α l g l X l T (E ) αl X l ) T (E )L l=1 where we have used the type inequality for E and the fact that for every 1 l m, X l L. The result now follows from (5). 1.3 Construction of a majorizing measure. The construction that we present here is the same one that was presented in [14] and in [7]. Let X 1,..., X m be m fixed vectors in E and define the random process {V y : y B E } by V y = l=1 m ε j X j, y, where ε j are independent symmetric Bernoulli random variables. Our aim is to show that when E has modulus of convexity of power type, ( ) 1/ E sup V y C λ 4 T (E ) max X m j log m X j, y (6) y B E 1 j m sup y B E for a suitable absolute constant C. It is known that the process {V y : y B E } satisfies a sub-gaussian tail estimate, namely, that for every y, y E and any t > 0, ) P ( V y V y t) exp ( ct d (y, y) 7 i=1

where d (y, y) = m ( Xj, y X j, y ) and c is an absolute constant. It will be preferable to consider the following quasi-metric m d (y, y) = X j, y y ( X j, y + X j, y ) and the quasi-norm,m endowed on E by x,m = max 1 j m X j, x. The proof of inequality (6) is based on the majorizing measure theory of Talagrand [19]. The following theorem is a combination of Proposition.3, Theorem 4.1 and Proposition 4.4 of [19]. Theorem [19] Assume that the process {V y : y B E } is subgaussian with respect to a metric d. Let r and k 0 Z be the largest integer such that r k 0 is greater than the radius of B E with respect to the metric d. For every k k 0 let φ k : B E R + be a family of maps satisfying the following assumption: there exists A > 0 such that for any point x B E, any k k 0 and any N N for any points x 1,..., x N B r k(x) with d(x i, x j ) r k 1, i j (H) we have max φ k+(x i ) φ k (x) + 1 i=1,...,n A r k log N. Then E sup V y V 0 c A sup φ k (x). y B E k k 0,x K The construction requires certain properties of the quasi metric d and the quasi norm,m. We refer to Propositions 1 and in [7] for precise properties of these metrics and list the ones we require in the following lemma. Lemma 3. For every y, y R n and every u B E, d(y, y) d(y, y), (7) d(y, y) y y,m M, (8) y y,m max 1 j m X j y y, (9) d (z, z) 8 ( z z E u + M z z,m( z u + z u ) ), (10) 8

where x E u = m i=1 X i, x X i, u and M = sup y B E m X j, y. Moreover, for every x E and ρ > 0, the ball (with respect to the quasimetric d) centered in x and with radius ρ, denoted by B x (ρ), is convex. Note that by combining (8) and (9) it follows that for every ρ > 0 and every x B E, inf y Bρ(x) y is attained. Proof of Theorem 1. Since there is only a finite number of points X 1,..., X m then by passing to a quotient of E we can assume that E is a finite dimensional space. We will denote its dimension by n and obviously one can assume that m n. Also, recall that if E is a Banach space with modulus of convexity of power type with constant λ then every quotient of E satisfies that property with a constant smaller than λ (see, e.g., [8]). By the homogeneity of the statement we can assume that sup y B E m X j, y = 1 (11) and by inequality (7), V y is a sub-gaussian process with the quasi-metric d. Therefore, if we denote our aim is to show that L = max 1 j m X j E sup y B E V y C λ 4 T (E ) L log m for an absolute constant C. By inequality (8), the diameter of B E with respect to the metric d is bounded by L. Let r be a fixed number chosen large enough, set k 0 to be the largest integer such that r k 0 L and put k 1 to be the smallest integer such that r k 1 L/ n, where n is the dimension of E. We shall use the same definition of the functionals φ k : B E R + as in [14] and [7], 9

namely: k k 1 + 1, φ k (x) = 1 + 1 log r + n L log m k l=k 1 r l k 0 k k 1, φ k (x) = min{ y, y B 8r k(x)} + k k 0 log m. It is easy to verify using definitions of k 0 and k 1 that sup φ k (x) c, x B E,k k 0 log(1 + 4Lrl ) where c is an absolute constant. It remains to prove that our functionals satisfy condition (H) for A = C λ 4 L log m, and that will conclude the proof of Theorem 1. Proof of condition (H). Fix integers N and k, let x B E and x 1,..., x N B r k(x) for which d(x i, x j ) r k 1. For k k 1 1, we always have n log(1 + 4Lr k+ ) φ k+ (x i ) φ k (x) L r k. log m Since the points x 1,..., x N are well separated with respect to the metric d, then by (8) and (9), x i x j r k 1 /L. By a classical volumetric estimate (see, for example [1]), N(B E, tb E ) (1 + /t) n where n is the dimension of E. Therefore, log N n log(1 + Lr k+1 ), which proves the desired inequality. The case k 0 k k 1 is more delicate and the main ingredients in this part are the entropy estimates proved in part 1.. For j = 1,..., N denote by z j B E points at which min{ y, y B 8r k (x j )} is attained and set u B E to be a point at which min{ y, y B 8r k(x)} is attained. Put θ = max z j u, j 10

and then max j φ k+ (x j ) φ k (x) = θ +. We shall prove that log m θ + log m r k log N/A. (1) Following [7], it is evident from the properties of B ρ (see proposition 1 and in [7] and Lemma 3) that for any i j, d(z i, z j ) cr k 1 and that d(x, z j ) 8r k. It implies that (z j + u)/ B 8r k(x) and by the definition of u, u z j + u /. Since B E has modulus of convexity of power type, then for all j = 1,..., N z j u λ θ. Thus, the set U = u + λ θb E contains all the z j s. Fix an absolute constant c to be named later, set δ = cλ 1 r k θ 1/ and let S be the maximal number of points in U that are δ separated in,m. By Lemma 1, δ log S C λ L log m λ θ where C is an absolute constant. We consider now two cases. First, assume that S N. Then by the previous estimate and the definition of δ, log N C L log m λ 3 θ/ δ C θ r k λ 4 L log m which proves (1). The second case is when S N. Since S is the maximal number of points in U that are δ separated with respect to,m, U is covered by S balls of diameter smaller than δ in,m. Thus, there exists a subset J of {1,..., N} with cardinality J N such that Recall that for any y E, i, j J, z i z j,m δ. y E u = m X l, y X l, u = l=1 11 m X l, y αl. l=1

It is evident from (11) that αl 1 and from (10) that for every z, z, u E, d (z, z) 8 ( z z E u + z z,m( z u + z u ) ). Since d(z i, z j ) cr k 1, z i u λ θ and z i z j,m δ for any i, j J, we can define c small enough such that for all i j J, z i z j Eu cr k 1. Thus, there are J points in u + λ θb E that are cr k 1 -separated for Eu. Using Lemma, log J CT (E )L r k λ θ, and a simple application of the arithmetic-geometric means inequality proves that log N CT (E )L r k λ ( log m θ + ), log m completing the proof of (1) (because λ 1). Selecting an arbitrary proportion of a bounded orthonormal system. In this section, we will prove the Theorem 3. There exist two positive constants c, C such that for any orthonormal system (ϕ j ) n in L with ϕ j L L for 1 j n, the following holds. 1) For any 1 < k < n there exists a subset I {1,..., n} with I n k such that for every a = (a i ) C n, a i ϕ i C µ (log µ) 5/ a i ϕ i (13) i I L i I L1 where µ = L n/k log k. ) Moreover, if n is an even integer, there exists a subset I {1,..., n} with n/ c n I n/ + c n such that for every a = (a i ) C n, a i ϕ i C L log n(log log n) 5/ a i ϕ i i I L i I L1 1

and a i ϕ i i/ I L C L log n(log log n) 5/ a i ϕ i. i/ I L1 Remark. Theorem 3 is satisfactory when k is an arbitrary proportion of n. It strengthens Theorem A from [6] and extends the result of Talagrand [0] which is applicable only when I = n k is a sufficiently small proportion of n. Here we get an arbitrary proportion, with the distance log n(log log n) 5/, which is the right power of log n as we will see at the end of the paper. Define ρ = ρ k,n, the restricted Kolmogorov k-width of the system as the smallest number ρ such that there exists a subset I {1,..., n} with I n k satisfying a i ϕ i ρ a i ϕ i (14) i I L i I L1 for every a = (a i ) C n (see [6] section 3). It was proved in [] that ρ = O( n/k log 3 n). This result was improved to ρ = O( n/k log n log 3/ k) in [16]. The proof of Theorem 3 is based on a random method and follows the argument given in [6] for proving Theorem.1. However, instead of working with the space L 1, which is not uniformly convex, we will approximate it by an L p space for p close to 1 and use the full strength of the estimate given in Theorem. Proposition 1. There exist two positive constants c, C such that for any orthonormal system (ϕ j ) n in L with ϕ j L L for 1 j n, the following holds. 1) For any p (1, ) and any 1 < k < n there exists a subset I {1,..., n} with I n k such that for every a = (a i ) C n, C a i ϕ i (p 1) L n/k log k a 5/ i ϕ i. i I L i I Lp (15) ) Moreover, if n is an even integer, there exists a subset I {1,..., n} 13

with n/ c n I n/ + c n such that for every a = (a i ) C n, C a i ϕ i (p 1) L n/k log k a 5/ i ϕ i i I L i I Lp and a i ϕ i L i/ I C (p 1) L n/k log k a 5/ i ϕ i. i/ I Lp Proof of Proposition 1. Let X be the random vector taking the value ϕ i with probability 1/n and denote by E the complex vectorial space spanned by {ϕ 1,..., ϕ n }. Let 1 < k < n and let X 1,..., X k be independent copies of X. We define an operator Γ : E l k by y E, Γy = k X i, y e i i=1 where (e 1,..., e k ) denotes the canonical basis of l k. Since (ϕ j ) n is an orthonormal system of L, the basic properties of Γ are: { (i) E Γy l = k k n y L, (ii) ker Γ = span{(ϕ j ) n \ (X i ) k i=1}. We shall first prove that for any δ (0, 1), with probability greater than 1 δ, the set (ϕ i ) i I = {(ϕ j ) n \ (X i ) k i=1} satisfies (a i ) i I C I C, a i ϕ i δ (p 1) L n/k log k a 5/ i ϕ i i I L i I Lp for a universal constant C. Let S L = {y : y L = 1} be the unit sphere in L and observe that for any star-shaped subset T L the following holds: if ρ > 0 satisfies sup y T ρs L k X j, y k n ρ kρ 3n, (16) then diam(ker Γ T ) ρ. (17) 14

Indeed, condition (16) implies that for all y T ρs L, kρ 3n Γy l k 4kρ 3n. (18) The homogeneity of (18) and the fact that T is star-shaped imply that if the lower bound in (18) holds for all y T ρs L, then the same lower bound also holds for all y T with y L ρ. This in turn shows that if y ker Γ T then y L ρ, as required in (17). Since ker Γ = span{(ϕ j ) n \ (X i ) k i=1}, we get for T = B Lp E that whenever ρ satisfies (16), then a i ϕ i ρ a i ϕ i i I L i I Lp for all scalars a i, proving that (15) is satisfied with the constant ρ. In order to find ρ that satisfies (16) with positive probability we use Theorem. Denote by E p the complex Banach space spanned by ϕ 1,..., ϕ n endowed with the norm defined by y = ( y Lp + ρ y L ) 1/. It is clear that (B Lp E) ρb L B Ep (B Lp E) ρb L and that the following properties are satisfied: E p is a Banach space with modulus of convexity of power type with constant λ = p(p 1)/8, Ep is a Banach space of type and T (Ep) C q = C p/(p 1). Indeed, the first property follows from Clarkson inequality [4], that for any f, g L p, f + g L p + p(p 1) 8 f g 1 L p ( f L p + g L p ). The second property is evident because for any q, L q has type with constant C q. 15

By property (i) of Γ, we get, taking T = B Lp E and applying Theorem to E p, k E sup X j, y k k y T ρs L n ρ = E sup ( X j, y E X j, y ) y T ρs L k E sup ( X j, y E X j, y ) y T ρb L k E sup ( X y B Ep j, y E X j, y ) k(a + A σ) where σ = sup y BEp y L / n ρ/ n and log k ( A = C λ 4 T (Ep) E max k ϕ j E j J p log k C (p 1) 5/ L k since for every 1 j n, ϕ j E p ϕ j L L. By Chebychev inequality, for any δ (0, 1), we conclude that with probability greater than 1 δ, there exists X 1,..., X k such that for any positive ρ, sup y T ρs L Γy l k ) 1/ k n ρ k δ (A + A σ). To conclude, we choose a constant c large enough such that, for n L ρ = c log k k δ (p 1), 5/ the inequality (16) is satisfied with probability greater than 1 δ. The cardinality of the set (ϕ i ) i I = {(ϕ j ) n \ (X i ) k i=1} is greater than n k and the first part of the proposition is proven choosing δ = 1/. To prove the second part of the proposition, we make two more observations. First, if ρ > 0 satisfies (16) then we have proved that (18) holds true. The upper estimate of this inequality implies that for all y T ρs L, n k ( ϕ i, y ϕ i, y X j, y ρ 1 4k ). 3n i I i=1 16

Since T is star shaped, we conclude as before that this inequality holds for all y T for which y L ρ. If k < 3n/4 then we have proved that if y T and ϕ i, y = 0 for all i I (i.e. y T (ker Γ) ), then y L ρ. But (ker Γ) = span{(ϕ i ) i/ I } and we conclude that if ρ > 0 satisfies (16) with k < 3n/4 and T = B Lp E then for any (a i ) n i=1 C n, a i ϕ i ρ a i ϕ i i I L i I Lp and a i ϕ i ρ a i ϕ i. i/ I L i/ I Lp Secondly, it is not difficult to prove with a combinatorial argument (see Lemma 1.3 in [6]) that if k = [λn] with λ = log < 3/4 then with probability greater than 3/4, n/ c n I = n {X 1,..., X k } n/ + c n, (19) for some absolute constant c > 0. Choosing δ = 1/, we get that with a positive probability, both inequalities (16) and (19) are satisfied. This concludes the proof of the second point of the Proposition 1. Proof of Theorem 3. For any p (1, ), Hölder inequality states that for θ = ( p)/p, f Lp f θ L 1 f 1 θ L. Let µ = L n/k log k and choose p = 1+1/ log µ. Using Proposition 1, there is a subset I of cardinality greater than n k for which a i ϕ i C p µ a i ϕ i i I L i I Lp where C p = C/(p 1) 5/. By the choice of p and Hölder inequality, a i ϕ i µ Cp p/( p) µ (p 1)/( p) a i ϕ i C µ h(µ) a i ϕ i i I L i I L1 i I L1 where h(µ) = (log µ) 5/ and C is an absolute constant. The same argument works for the second part of the Theorem 3. 17

Appendix. As we mentioned in the Introduction, from a result of J. Bourgain Theorem 3 is almost optimal. We would like to thank L. Rodriguez-Piazza for showing us the details of the proof of this optimality in the case of the Walsh system. We consider the Walsh system on L [0, 1] and take the n first functions with n = N. In that case, it is well known that (ϕ 1,..., ϕ n ) form a commutative multiplicative group that we denote by (G, ). The main result is based on constructing in any subset of a group a translate of a subgroup of big cardinality. Lemma 4. Let (G, ) be the multiplicative group generated by the N first Walsh functions. For any c (0, 1) and for any subset Λ G with Λ c N, we can find b G and a subgroup Γ of G such that, whenever log(1/c) 1/ N/, { 1. Γ = p N log / (3 log(1/c)). b Γ Λ. Assuming this result, we are able to prove the almost optimality of Theorem 3 in the case of the Walsh system (ϕ 1,..., ϕ n ) with n = N. Let I be a subset of cardinality n k with k n. Taking c = 1 k/n, we have log(1/c) k/n 1/ n and Lemma 4 states that there exists b (φ i ) n i=1 and a subgroup Γ of G such that b Γ (φ i ) i I and Γ (n log n)/(0k). However, on any subgroup Γ of G, the L 1 norm and the L norm can not be compared with a better estimate than Γ since γ Γ γ γ Γ L = Therefore, inequality (13) can not be satisfied with a better constant than n log n/0k. In particular when k is proportional to n, the factor log n is necessary in (13). Proof of Lemma 4. We will prove the result in the case of the abelian additive group G = ({0, 1} N, +). Note that this lands to a minor change of notation. Let n = N be the cardinality of this group. Let γ 1 G \ {0} be such that (γ 1 + Λ) Λ is maximal and define Λ 1 = (γ 1 + Λ) Γ. Applying convolution, it is evident that (g + Λ) Λ = Λ g G 18 γ Γ L 1.

and thus Λ 1 1 n 1 g G\{0} (g + Λ) Λ = Λ Λ n 1 c n 1 1/cn 1 1/n ( c n 1 1 ). cn For notational convenience, we define γ 0 = 0 and Λ 0 = Λ. We iterate this procedure to construct a family of points γ j and sets Λ j such that (i) γ j / gr{γ 1,..., γ j 1 }, (ii) Λ j = (γ j + Λ j 1 ) Λ j 1 has maximal cardinality, ( (iii) Λ j c j n (1 j 1 1/c +... + 1/c (j 1))), n where gr{γ 1,..., γ j 1 } is the group generated by {γ 1,..., γ j 1 }. Once γ j 1 and Λ j 1 are constructed, we again use the fact that (g + Λ j 1 ) Λ j 1 = Λ j 1 and deduce that g G g G\gr{γ 1,...,γ j 1 } (g + Λ j 1 ) Λ j 1 Λ j 1 j 1 Λ j 1. Therefore, there exists γ j / gr{γ 1,..., γ j 1 } and Λ j = (γ j + Λ j 1 ) Λ j 1 of maximal cardinality such that Λ j Λ j 1 Λ j 1 j 1 n j 1 and (iii) follows from a straightforward computation. This construction can be continued as long as 1 p 1 n ( 1/c +... + 1/c (p 1)) > 0. This means that if p is an integer such that n > p (p 1) /c (p 1) (0) 19

then Λ p. Let b Λ p, we shall prove that b + gr{γ 1,..., γ p } Λ. Indeed let x b + gr{γ 1,..., γ p } and define (ε 1,..., ε p ) {0, 1} p such that x = b + ε i γ i. Since b Λ p = Λ p 1 (γ p + Λ p 1 ), there exists λ p 1 Λ p 1 such that b = ε p γ p + λ p 1. If one repeats the same argument for λ p 1 instead of b, then at the last step, we get λ 1 Λ 1 = Λ (γ 1 + Λ) and again, there exists λ Λ such that λ 1 = ε 1 γ 1 + λ. Summarizing, we have found λ Λ such that p b = λ + ε i γ i. Therefore x = b + i=1 p ε i γ i = λ Λ. i=1 To conclude, we have to notice that if p is chosen to be the integer such that p N log > 3 log(1/c) p 1 then p (p 1) ( 1 c ) (p 1) < N log 3 log(1/c) ( ) N log /3 log(1/c) 1 N N/+N/3 N c using the fact that log(1/c) 1/ N/. The inequality (0) is satisfied and Lemma 4 holds true. Remark. We would like to thank the referee for pointing out to us that the bound of Theorem 3 is almost optimal for the trigonometric system as well. Indeed, let n = N and let φ j (t) = exp(iπjt) for j = 0,..., n 1. Define a one-to-one mapping F : {0, 1} N {0,..., n 1} by N F (ε 1,..., ε N ) = ε j j 1. Let I {0,..., n 1} be a subset of cardinality n k. By Lemma 4, the set Λ = F 1 (I) contains b Γ where Γ = gr(γ 1,..., γ p ). This means that for any set J {1,..., p}, F (b) + j J F (γ j) I. Therefore, the function p ( ψ(t) = φ F (b) 1 + φf (γj )) is contained in span(φ j, j J). Obviously, ψ L = Γ ψ L1. 0

References [1] J. Bourgain, A. Pajor, S.J. Szarek, N. Tomczak-Jaegermann. On the duality problem for entropy numbers of operators. Geometric Aspects of Functional Analysis (1987 88), Springer Lecture Notes in Math. 1376 (1989), 50 63. [] E. Candes, T. Tao. Near optimal recovery from random projections: universal encoding strategies, IEEE Inf. Theory 5 (006), 5406-545. [3] B. Carl. Inequalities of Bernstein-Jackson-type and the degree of compactness of operators in Banach spaces. Ann. Inst. Fourier (Grenoble) 35 (1985), no. 3, 79 118. [4] J. A. Clarkson. Uniformly convex spaces, Trans. Amer. Math. Soc. 40 (1936), no. 3, 396 414 [5] E. Giné, J. Zinn. Some limit theorems for empirical processes, Ann. Probab. 1 (1984), no. 4, 99 998. [6] O. Guédon, S. Mendelson, A. Pajor, N. Tomczak-Jaegermann. Subspaces and orthogonal decompositions generated by bounded orthogonal systems. Positivity, 11 (007), no., 69 83. [7] O. Guédon, M. Rudelson. L p moments of random vectors via majorizing measures. Adv. Math, 08 (007), no., 798 83. [8] J. Lindenstrauss, L. Tzafriri. Classical Banach spaces. II. Function spaces. Ergebnisse der Mathematik und ihrer Grenzgebiete [Results in Mathematics and Related Areas], 97. Springer-Verlag, Berlin-New York, 1979. [9] B. Maurey. Type, cotype and K-convexity. Handbook of the geometry of Banach spaces, Vol., 199 133, North-Holland, Amsterdam, 003. [10] B. Maurey, G. Pisier. Séries de variables aléatoires vectorielles indépendantes et propriétés géométriques des espaces de Banach. (French) Studia Math. 58 (1976), no. 1, 45 90. [11] G. Pisier. Martingales with values in uniformly convex spaces, Israel J. Math. 0 (1975), no. 3-4, 36 350. [1] G. Pisier. The Volume of Convex Bodies and Banach Space Geometry, Cambridge Tracts in Mathematics, 94, Cambridge University Press, Cambridge, 1989. [13] L. Rodriguez-Piazza, Personal communication. [14] M. Rudelson. Random vectors in isotropic position, MSRI preprint (1996). [15] M. Rudelson. Random vectors in isotropic position, J. Funct. Anal. 164 (1999), no. 1, 60 7. [16] M. Rudelson, R.Vershynin. Sparse reconstruction by convex relaxation: Fourier and Gaussian measurements. CISS 006 (40th Annual Conference on Information Sciences and Systems). 1

[17] V.N. Sudakov, Gaussian random processes and measures of solide angles in Hilbert spaces, Soviet. Math. Dokl. 1 (1971), 41 415. [18] M. Talagrand. Sections of smooth convex bodies via majorizing measures. Acta Math. 175 (1995), no., 73 300. [19] M. Talagrand. Majorizing measures: the generic chaining. Ann. Probab. 4 (1996), no. 3, 1049 1103. [0] M. Talagrand. Selecting a proportion of characters, Israel J. Math. 108 (1998), 173 191. O. Guédon, Université Pierre et Marie Curie, Paris 6, Institut de Mathématiques de Jussieu, 4 place Jussieu, 75005 Paris, France e-mail: guedon@math.jussieu.fr S. Mendelson, CMA, The Australian National University, Canberra, ACT 000, Australia, and Department of Mathematics, Technion, Haifa, 3000, Israel e-mail: shahar.mendelson@anu.edu.au A. Pajor, Université Paris-Est Équipe d Analyse et Mathématiques Appliquées, 5, boulevard Descartes, Champs sur Marne, 77454 Marne-la-Vallée, Cedex, France e-mail: Alain.Pajor@univ-mlv.fr N. Tomczak-Jaegermann, Dept. of Math. and Stat. Sciences, University of Alberta, Edmonton, Alberta, Canada, T6G G1. e-mail: nicole.tomczak@ualberta.ca