Universal discretization

Similar documents
2014:05 Incremental Greedy Algorithm and its Applications in Numerical Integration. V. Temlyakov

Sampling discretization error for function classes

Approximation of High-Dimensional Rank One Tensors

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

INDUSTRIAL MATHEMATICS INSTITUTE. B.S. Kashin and V.N. Temlyakov. IMI Preprint Series. Department of Mathematics University of South Carolina

Convergence of greedy approximation I. General systems

Nonlinear tensor product approximation

Optimal global rates of convergence for interpolation problems with random design

arxiv: v2 [math.fa] 17 May 2016

Math 321 Final Examination April 1995 Notation used in this exam: N. (1) S N (f,x) = f(t)e int dt e inx.

recent developments of approximation theory and greedy algorithms

arxiv: v1 [math.nt] 4 Oct 2016

arxiv: v2 [math.na] 15 Oct 2015

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

Constructions of digital nets using global function fields

REAL AND COMPLEX ANALYSIS

arxiv:math/ v1 [math.fa] 4 Jun 2004

Normed and Banach spaces

Greedy bases are best for m-term approximation

arxiv: v1 [math.co] 7 Jul 2014

Incomplete exponential sums over finite fields and their applications to new inversive pseudorandom number generators

Part III. Quasi Monte Carlo methods 146/349

Polynomial approximation via de la Vallée Poussin means

be the set of complex valued 2π-periodic functions f on R such that

On Multivariate Newton Interpolation at Discrete Leja Points

A SHORT PROOF OF THE COIFMAN-MEYER MULTILINEAR THEOREM

Deterministic constructions of compressed sensing matrices

JUHA KINNUNEN. Harmonic Analysis

MATH 304 Linear Algebra Lecture 8: Vector spaces. Subspaces.

ESTIMATES FOR MAXIMAL SINGULAR INTEGRALS

1 Directional Derivatives and Differentiability

On an Approximation Result for Piecewise Polynomial Functions. O. Karakashian

Topics in Fourier analysis - Lecture 2.

Riemann integral and volume are generalized to unbounded functions and sets. is an admissible set, and its volume is a Riemann integral, 1l E,

Math 118B Solutions. Charles Martin. March 6, d i (x i, y i ) + d i (y i, z i ) = d(x, y) + d(y, z). i=1

Geometry of Banach spaces and sharp versions of Jackson and Marchaud inequalities

Bivariate Lagrange interpolation at the Padua points: The generating curve approach

Bernstein-Szegö Inequalities in Reproducing Kernel Hilbert Spaces ABSTRACT 1. INTRODUCTION

Remarks on the Rademacher-Menshov Theorem

A DECOMPOSITION THEOREM FOR FRAMES AND THE FEICHTINGER CONJECTURE

On a Mazur problem from Scottish Book concerning second partial derivatives

Nonlinear Approximation by Trigonometric Sums

Fourier Series. 1. Review of Linear Algebra

VALUATION THEORY, GENERALIZED IFS ATTRACTORS AND FRACTALS

Analysis Qualifying Exam

Problem Set 2: Solutions Math 201A: Fall 2016

Random Bernstein-Markov factors

Rate of convergence for certain families of summation integral type operators

Solutions Final Exam May. 14, 2014

APPLIED MATHEMATICS REPORT AMR04/16 FINITE-ORDER WEIGHTS IMPLY TRACTABILITY OF MULTIVARIATE INTEGRATION. I.H. Sloan, X. Wang and H.

1. Let A R be a nonempty set that is bounded from above, and let a be the least upper bound of A. Show that there exists a sequence {a n } n N

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer 11(2) (1989),

Energy method for wave equations

Existence and Uniqueness

On metric characterizations of some classes of Banach spaces

INTRODUCTION TO FINITE ELEMENT METHODS

How smooth is the smoothest function in a given refinable space? Albert Cohen, Ingrid Daubechies, Amos Ron

arxiv: v1 [math.nt] 8 Apr 2016

BALANCING GAUSSIAN VECTORS. 1. Introduction

Functional Analysis Exercise Class

1 Take-home exam and final exam study guide

On John type ellipsoids

Simple Examples on Rectangular Domains

290 J.M. Carnicer, J.M. Pe~na basis (u 1 ; : : : ; u n ) consisting of minimally supported elements, yet also has a basis (v 1 ; : : : ; v n ) which f

P-adic Functions - Part 1

TADAHIRO OH 0, 3 8 (T R), (1.5) The result in [2] is in fact stated for time-periodic functions: 0, 1 3 (T 2 ). (1.4)

LOWER BOUNDS FOR HAAR PROJECTIONS: DETERMINISTIC EXAMPLES. 1. Introduction

Wiener Measure and Brownian Motion

SYMMETRIC INTEGRALS DO NOT HAVE THE MARCINKIEWICZ PROPERTY

Solutions to Tutorial 8 (Week 9)

SHARP BOUNDARY TRACE INEQUALITIES. 1. Introduction

Rectangular Young tableaux and the Jacobi ensemble

Topological properties

Lecture 6: September 19

Construction algorithms for plane nets in base b

SCALE INVARIANT FOURIER RESTRICTION TO A HYPERBOLIC SURFACE

MATH 51H Section 4. October 16, Recall what it means for a function between metric spaces to be continuous:

Problem Set 5: Solutions Math 201A: Fall 2016

A NEW PROOF OF THE ATOMIC DECOMPOSITION OF HARDY SPACES

UNIVERSAL DERIVED EQUIVALENCES OF POSETS

MATH 426, TOPOLOGY. p 1.

Math 489AB A Very Brief Intro to Fourier Series Fall 2008

Banach Spaces II: Elementary Banach Space Theory

Subsequences of frames

Summation of Walsh-Fourier Series, Convergence and Divergence

(0, 0), (1, ), (2, ), (3, ), (4, ), (5, ), (6, ).

Prime Numbers and Irrational Numbers

Math 117: Topology of the Real Numbers

Logarithmic functional and reciprocity laws

Uniform convergence of N-dimensional Walsh Fourier series

Basicity of System of Exponents with Complex Coefficients in Generalized Lebesgue Spaces

INTRODUCTION TO REAL ANALYSIS II MATH 4332 BLECHER NOTES

Geometric intuition: from Hölder spaces to the Calderón-Zygmund estimate

An arithmetic extension of van der Waerden s theorem on arithmetic progressions

Functional Analysis Exercise Class

Polarization constant K(n, X) = 1 for entire functions of exponential type

N-Widths and ε-dimensions for high-dimensional approximations

Singular Integrals. 1 Calderon-Zygmund decomposition

HARMONIC ANALYSIS TERENCE TAO

Transcription:

arxiv:708.08544v [math.na] 28 Aug 207 Universal discretization V.N. Temlyakov Abstract The paper is devoted to discretization of integral norms of functions from a given collection of finite dimensional subspaces. For natural collections of subspaces of the multivariate trigonometric polynomials we construct sets of points, which are optimally (in the sense of order) good for each subspace of a collection from the point of view of the integral norm discretization. We call such sets universal. Our construction of the universal sets is based on deep results on existence of special nets, known as (t,r,d)-nets. Introduction This paper is a follow up to the two recent papers [9] and [20]. As it is clear from the title the two main concepts of the paper are discretization and universality. Discretization is a very important step in making a continuous problem computationally feasible. The problem of construction of good sets of points in a multidimensional domain is a fundamental problem of mathematics and computational mathematics. A prominent example of classical discretization problem is a problem of metric entropy (covering numbers, entropy numbers). Bounds for the entropy numbers of function classes are important by themselves and also have important connections to other fundamental problems (see, for instance, [8], Ch.3 and [3], Ch.6). Another prominent example of a discretization problem is the problem of numerical integration. Numerical integration in the mixed smoothness classes requires deep number theoretical results for constructing optimal (in the sense of order) cubature formulas (see, for instance, [3], Ch.8). A typical approach to University of South Carolina and Steklov Institute of Mathematics.

solving a continuous problem numerically the Galerkin method suggests to look for an approximate solution from a given finite dimensional subspace. A standard way to measure an error of approximation is an appropriate L q norm, q. Thus, the problem of discretization of the L q norms of functions from a given finite dimensional subspace arises in a very natural way. We now discuss the universality principle formulated in [5]. The concept of smoothness becomes more complicated in the multivariate case than it is in the univariate case. In the multivariate case a function may have different smoothness properties in different coordinate directions. In other words, functions may belong to different anisotropic smoothness classes (see anisotropic Sobolev and Nikol skii classes Wq,α r and Hr q in [6], Chapter 2). It is known that approximation characteristics of anisotropic smoothness classes depend on the average smoothness g(r) and optimal approximation methods depend on anisotropy of classes, on the vector r. This motivated a study in [5] of existence of an approximation method that is good for all anisotropic smoothness classes. This is a problem of existence of a universal method of approximation. There is a similar setting for existence of universal cubature formulas (see, for instance, [6], Chapter 4). We note that the universality concept in learning theory is very important and it is close to the concepts of adaptation and distribution-free estimation in non-parametric statistics ([5], [2], [7]). We now proceed to the detailed presentation. Marcinkiewicz problem. Let Ω be a compact subset of R d with the probability measure µ. We say that a linear subspace X N of the L q (Ω), q <, admits the Marcinkiewicz-type discretization theorem with parameters m and q if there exist a set {ξ ν Ω,ν =,...,m} and two positive constants C j (d,q), j =,2, such that for any f X N we have C (d,q) f q q m f(ξ ν ) q C 2 (d,q) f q q. (.) In the case q = we define L as the space of continuous on Ω functions and ask for C (d) f max ν m f(ξν ) f. (.2) We will also use a brief way to express the above property: the M(m,q) theorem holds for a subspace X N or X N M(m,q). 2

Universal discretization problem. This problem is about finding (proving existence) of a set of points, which is good in the sense of the above Marcinkiewicz-type discretization for a collection of linear subspaces. We be a collection of linear formulate it in an explicit form. Let X N := {X j N }k j= subspaces X j N of the L q(ω), q. We say that a set {ξ ν Ω,ν =,...,m} provides universal discretization for the collection X N if, in the case q <, there are two positive constants C i (d,q), i =,2, such that for each j [,k] and any f X j N we have C (d,q) f q q m f(ξ ν ) q C 2 (d,q) f q q. (.3) In the case q = for each j [,k] and any f X j N we have C (d) f max ν m f(ξν ) f. (.4) We study the Universal discretization for subspaces of the trigonometric polynomials. Let Q be a finite subset of Z d. We denote T(Q) := {f : f = k Qc k e i(k,x) }. We briefly present well known results related to the Marcinkiewicz-type discretization theorems for the trigonometric polynomials. We begin with the case Q = Π(N) := [ N,N ] [ N d,n d ], N j N or N j = 0, j =,...,d, N = (N,...,N d ). We denote and set P(N) := { n = (n,...,n d ), x n := Then for any t T(Π(N)) n j are nonnegative integers, 0 n j 2N j, j =,...,d }, ( ) 2πn 2N +,..., 2πn d, n P(N). 2N d + t 2 2 = ϑ(n) n P(N) t(x n ) 2 (.5) 3

where ϑ(n) := d j= (2N j + ) = dimt(π(n)). The following version of (.5) for < q < is the well known Marcinkiewicz discretization theorem (for d = ) (see [2], Ch.0, 7 and [6], Ch., Section 2) C (d,q) t q q ϑ(n) t(x n ) q C 2 (d,q) t q q, < q <, which implies n P(N) T (Π(N)) M(ϑ(N),q), < q <. (.6) Some modifications are needed in the case q = or q =. Denote and set P (N) := { n = (n,...,n d ), n j are natural numbers, 0 n j 4N j, j =,...,d } x(n) := ( πn,..., πn ) d, n P (N). 2N 2N d In the case N j = 0 we assume x j (n) = 0. Denote N := max(n,) and v(n) := d j= N j. Then the following Marcinkiewicz-type discretization theorem is known for all q C (d,q) t q q v(4n) which implies the following relation n P (N) t(x(n)) q C 2 (d,q) t q q, (.7) T(Π(N)) M(v(4N),q), q. (.8) Note that v(4n) C(d)dimT (Π(N)). It is clear from the above construction that the set {x(n) : n P (N) depends substantially on N. The main goal of this paper is to construct for a given q and M a set, which satisfies an analog of (.7) for all N with v(n) M. In this paper we are primarily interested in the Universal discretization for the collection of subspaces of trigonometric polynomials with frequencies from parallelepipeds (rectangles). For s Z d + define R(s) := {k Z d : k j < 2 s j, j =,...,d}. Clearly, R(s) = Π(N) with N j = 2 s j. Consider the collection C(n,d) := {T(R(s)), s = n}. We prove here the following result. 4

Theorem.. For every q there exists a large enough constant C(d,q), which depends only on d and q, such that for any n N there is a set Ξ m := {ξ ν } m T d, with m C(d,q)2 n that provides universal discretization in L q for the collection C(n,d). Theorem., basically, solves the Universal discretization problem for the collection C(n,d). It provides the upper bound m C(d,q)2 n with 2 n being of the order of the dimension of each T (R(s)) from the collection C(n, d). Obviously, the lower bound for the cardinality of a set, providing the Marcinkiewicz discretization theorem for T (R(s)) with s = n, is C(d)2 n. We treat separately the case q = (see Section 2) and the case q < (see Section 3). Our construction of the universal set is based on deep results on existence of special nets, known as (t, r, d)-nets (see Section 2 for the definition). 2 Universal discretization in L We begin with a simple observation. Denote for brevity T(N) := T (Π(N)), T d := [0,2π) d. Proposition 2.. Let a set (ξ,...,ξ m ) T d be such that for any x T d there exists ξ ν(x) such that x j ξ ν(x) j (2dN j ), j =,...,d. (2.) Then for each f T (N), N N d, we have f 2 max ν m f(ξν ). Proof. The Bernstein inequality for f T(N) f x j N j f shows, by the Mean Value Theorem (the Lagrange theorem), that for any y T d we have for all coordinate directions e j and all h f(y+he j ) f(y) N j h f. This easily implies that for any x,y T d f(x) f(y) d N j x j y j f. (2.2) j= 5

Let x 0 be a point of maximum of f(x). Then (2.) and (2.2) give f(x 0 ) f(ξ ν(x0) ) 2 f and, therefore, max ν m f(ξν ) f(ξ ν(x0) ) 2 f. We now show how known results on the dispersion of a finite set of points can be used in the Universal discretization problem in L. We remind the definition of dispersion. Let d 2 and [0,) d be the d-dimensional unit cube. For x,y [0,) d with x = (x,...,x d ) and y = (y,...,y d ) we write x < y if this inequality holds coordinate-wise. For x < y we write [x,y) for the axis-parallel box [x,y ) [x d,y d ) and define B := {[x,y) : x,y [0,) d,x < y}. For n let T be a set of points in [0,) d of cardinality T = n. The volume of the largest empty (from points of T) axis-parallel box, which can be inscribed in [0,) d, is called the dispersion of T: disp(t) := sup vol(b). B B:B T = An interesting extremal problem is to find (estimate) the minimal dispersion of point sets of fixed cardinality: It is known that disp*(n,d) := inf disp(t). T [0,) d, T =n disp*(n,d) C (d)/n. (2.3) Inequality (2.3) with C (d) = 2 d d i= p i, where p i denotes the ith prime number, was proved in [4] (see also [4]). The authors of [4] used the Halton- Hammersly set of n points (see []). Inequality (2.3) with C (d) = 2 7d+ was proved in []. The authors of [], following G. Larcher, used the (t,r,d)-nets. Definition 2.. A (t,r,d)-net (in base 2) is a set T of 2 r points in [0,) d such that each dyadic box [(a )2 s,a 2 s ) [(a d )2 s d,ad 2 s d ), a j 2 s j, j =,...,d, of volume 2 t r contains exactly 2 t points of T. 6

Theorem 2.. There exists a large enough constant C(d), which depends only on d, such that for any n N there is a set Ξ m := {ξ ν } m Td, with m C(d)2 n that provides universal discretization in L for the collection C(n,d). Proof. We will use a (t, r, d)-net, which satisfies the conditions of Proposition 2. for all N = 2 s, s = n. More precisely, it will be a net adjusted to the T d, namely, a net T for [0,) d multiplied by 2π. A construction of such nets for all d and t Cd, r t is given in [2]. For a given n N define r := n+t+(+[log(4dπ)])d and consider the (t,r,d)-net T(t,r,d). Denote T π (t,r,d) := 2πT(t,r,d) := {2πx : x T(t,r,d)}. Take an s Z d + such that s = n and consider s with s j := s j + + [log(4dπ)]. Any x T d belongs to a dyadic box of the form [2π(a )2 s,2πa 2 s ) [2π(ad )2 s d,2πad 2 s d ) with proper a,...,a d. The volume of this box is equal to (2π) d 2 t r. Therefore, by the definition of the T π (t,r,d) := {ξ ν } 2r there is ξ ν(x) from that box. Note that here we use a weaker property of T(t,r,d) than required by Definition 2. we only need one point from the set to be in the box. Then for each j =,...,d x j ξ ν(x) j 2π2 s j 2 s j 2π(4dπ) = (2d2 s j ). This means that {ξ ν } 2r satisfies the condition of Proposition 2. with m = 2 r C(d)2 n for all s Z d + with s = n. By Proposition 2. the set {ξ ν } 2r provides for each f T (2 s ) the inequality f 2 max ν m f(ξν ). This completes the proof of Theorem 2.. In the above theorem we used a set of points with special structure the (t, r, d)-net as the one, which provided the universal discretization for the collection C(n, d). We now prove a conditional theorem based on the concept of dispersion. Theorem 2.2. Let a set T with cardinality T = 2 r =: m have dispersion satisfying the bound disp(t) < C(d)2 r with some constant C(d). Then there exists a constant c(d) N such that the set 2πT := {2πx : x T} provides the universal discretization in L for the collection C(n,d) with n = r c(d). 7

Proof. The proof is based on Proposition 2.. The proof goes along the lines of the above proof of Theorem 2.. We use notations from the proof of Theorem 2.. Set c(d) := +[logc(d)]+d(+[log4dπ]). Take an s Z d + such that s = n = r c(d) (assuming r > c(d)) and consider s as in the proof of Theorem 2.. Any x T d belongs to a dyadic box of the form [2π(a )2 s,2πa 2 s ) [2π(ad )2 s d,2πad 2 s d ) with some a,...,a d. The volume of this box is equal to (2π) d 2 s (2π) d C(d)2 r. Therefore, by the definition of dispersion this box has a point from 2πT. Denote 2πT := {ξ ν } 2r and denote a point from the the above box by ξ ν(x). Then for each j =,...,d x j ξ ν(x) j 2π2 s j 2 s j 2π(4dπ) = (2d2 s j ). We complete the proof in the same way as in the proof of Theorem 2.. It turns out that two properties of a set T: (I) dispersion of T is of order 2 n and(ii)theset2πt providesuniversal discretizationinl forthecollection C(n, d) are closely connected. Theorem 2.2 shows that, loosely speaking, property (I) implies property (II). We now prove the inverse implication. Theorem 2.3. Assume that T [0,) d is such that the set 2πT provides universal discretization in L for the collection C(n,d) with a constant C (d) (see (.4)). Then there exists a positive constant C(d) with the following property disp(t) C(d)2 n. Proof. We prove this theorem by contradiction. Suppose that disp(t) > C(d)2 n. We will show that for large enough C(d) this contradicts (.4) for the collection C(n,d). Inequality disp(t) > C(d)2 n with large enough C(d) implies that there exists s Z d +, s = n, and an empty box [u,v) [,0) d such that v j u j 2 a(d) s j, j =,...,d. Note that the bigger the C(d) the bigger a(d) can be chosen. We will specify it later. Denote w := (u+v)/2 the center of the box [u,v) and consider (see Section 3 for the definition and properties of K N ) f(x) := K 2 s(x w) T(R(s)). Then by (3.3) f(w) = 2 s = 2 n and by (3.2) outside the box 2π[u,v) we have f(x) 2 2a(d)d f(w). Thus, making a(d) large enough to satisfy 2 2a(d)d < C (d) we obtain a contradiction. 8

We now discuss a straight forward way of constructing point sets for universal discretization. The following statement is obvious. Proposition 2.2. Suppose that for each subspace X i, i =,...,k, there exists a set {ξ ν (i)} m i such that for any f X i we have f C max ν m i f(ξ ν (i)) with a constant C independent of i. Then for any f k i= Xi we have f C max i k max ν m i f(ξ ν (i)). Proposition 2.2 shows that the set k i={ξ ν (i)} m i is a universal set for discretization of the collection X,...,X k in the uniform norm. Note that the cardinality of this set is m + +m k. Let us discuss an application of this idea to the collection C(n,d). Take s Z d + such that s = n and choose the set {x(n),n P (2 s )} (see Introduction), which is a discretization set for T (2 s ) in L (see (.7)). The union SG(n,d) := s: s =n{x(n),n P (2 s )} is known as the sparse grids. Proposition 2.2 shows that the sparse grids SG(n, d) provide universal discretization for the collection C(n,d). It is well known and easy to check that the cardinality of SG(n,d) is of order 2 n n d. Comparing this bound to the bound in Theorem 2. we see that the sparse grids do not provide an optimal (in the sense of order) set for universal discretization. Let us explore one more idea of constructing point sets for universal discretization of the collection C(n,d). It is clear from the definition of the R(s) = Π(N), with N j = 2 s j, that s, s =nt (R(s)) T(Q n ), Q n := s, s =nr(s). Certainly, a set {ξ ν } m which provides discretization for the subspace of the hyperbolic cross polynomials T (Q n ) also provides universal discretization for the collection C(n, d). This way of construction of universal methods of approximation and universal cubature formulas works very well (see [6]). However, surprisingly, it does not work for building a set, which provides universal discretization for the collection C(n,d) in L. This follows from a 9

verynontriviallower boundforthenumber ofpointsinasetthatprovidesthe Marcinkiewicz discretization theorem for T(Q n ) in L (see [7], [8], and [9]). The authors proved that the necessary condition for T (Q n ) M(m, ) is m Q n +c with absolute constant c > 0. This lower bound and the upper bound from Theorem 2. show that on this way we loose not only an extra (logm) d factor (as in the case of sparse grids) but we loose even an extra factor of the power type m c. 3 Universal discretization in L q, q < ThemaingoalofthissectionistoextendTheorem 2.tothecase q <, i.e. to prove the following theorem. Theorem 3.. For every q < there exists a large enough constant C(d,q), which depends only on d and q, such that for any n N there is a set Ξ m := {ξ ν } m Td, with m C(d,q)2 n that provides universal discretization in L q for the collection C(n,d). Proof. We prove this theorem in two steps. First, under some assumptions onthe set Ξ m we prove one-sided discretization inequalities, which bound the discrete norm by the continuous norm. Second, under stronger assumptions on the set Ξ m we prove the inverse inequalities. 3. Upper bounds for the discrete norm We need some classical trigonometric polynomials. We begin with the univariate case. The Dirichlet kernel of order n: D n (x) := e ikx = e inx (e i(2n+)x )(e ix ) k n = ( sin(n+/2)x ) / sin(x/2) (3.) is an even trigonometric polynomial. The Fejér kernel of order n : n K n (x) := n D k (x) = ( ) k /n e ikx k=0 k n = ( sin(nx/2) ) 2 / ( n(sin(x/2) ) 2 ). 0

The Fejér kernel is an even nonnegative trigonometric polynomial in T (n ) with the majorant Kn (x) = Kn (x) min ( n,π 2 /(nx 2 ) ), x π. (3.2) It satisfies the obvious relations The de la Vallée Poussin kernel: K n =, K n = n (3.3) 2n V n (x) := n l=n D l (x) = 2K 2n K n, is an even trigonometric polynomials of order 2n with the majorant Vn (x) Cmin ( n, (nx 2 ) ), x π. (3.4) The above relation (3.4) easily implies the following lemma. Lemma 3.. For a set Ξ m := {ξ ν } m T satisfying the condition Ξ m [x(l ),x(l)) b, x(l) := πl/2n, l =,...,4n, we have Vn (x ξ ν ) Cbn. We use the above Lemma 3. to prove a one-sided inequality. Lemma 3.2. For a set Ξ m := {ξ ν } m T satisfying the condition Ξ m [x(l ),x(l)) b, x(l) := πl/2n, l =,...,4n, we have for q m ( ) /q a ν V n (x ξ ν ) C(bn/m) /q a ν q. m q Proof. Denote by l q (m) the C m equipped with the norm ( ) /q a q := a ν q, a = (a,...,a d ). m

Let V n be the operator on l q (m) defined as follows: V(a) := m a ν V n (x ξ ν ). It is known that V n 3. This implies Using Lemma 3. we obtain V n l (m) L 3. (3.5) V n l (m) L Cbn/m. (3.6) From relations (3.5) and (3.6), using the Riesz-Torin interpolation theorem we find V lq(m) L q C(bn/m) /q, which implies the conclusion of the lemma. We now proceed to the multivariate case. Denote the multivariate Fejér and de la Vallée Poussin kernels: K N (x) := d K Nj (x j ), V N (x) := j= d V Nj (x j ), N = (N,...,N d ). Inthesamewayasaboveintheunivariatecaseonecanestablishthefollowing multivariate analog of Lemma 3.2. Lemma 3.3. For a set Ξ m := {ξ ν } m T d satisfying the condition Ξ m [x(n),x(n+)) b, n P (N), is a vector with coordinates for all j, we have for q m j= ( ) /q a ν V N (x ξ ν ) C(d)(bv(N)/m) /q a ν q. m q Theorem 3.2. Let a set Ξ m := {ξ ν } m Td satisfy the condition Ξ m [x(n),x(n+)) b(d), n P (N), is a vector with coordinates for all j. Then for m ϑ(n) we have for each f T (N) and q ( ) /q f(ξ ν ) q C(d) f q. m 2

Proof. For a set Ξ m := {ξ ν } m Td denote M(f) := M(f,Ξ m ) := (f(ξ ),...,f(ξ m )). We have for f T (N) M(f) q = q f(ξ ν ) q = f(ξ ν )ε ν f(ξ ν ) q = m m 2π = (2π) d f(x) ε ν f(ξ ν ) q V N (x ξ ν )dx 0 m f q ε ν f(ξ ν ) q V N (x ξ ν ). m Using Lemma 3.3 we see that the last term is which implies the required bound. The theorem is proved. C(d) f q M(f) q q, 3.2 Lower bounds for the discrete norm We now proceed to the inverse bounds for the discrete norm. Denote q (n) := [x(n),x(n+)), n P (N). Suppose that a sequence Ξ m := {ξ ν } m Td has the following property. Property E(b). There is a number b N such that for any n P (N) we have (n) Ξ m = b. Clearly, in this case m = v(n)b, where v(n) = P (N). Lemma 3.4. Suppose that two sequences Ξ m := {ξ ν } m T d and Γ m := {γ ν } m Td satisfy the following condition. For a given j {,...,d}, γ ν may only differ from ξ ν in the jth coordinate. Moreover, assume that if ξ ν (n) then also γ ν (n). Finally, assume that Ξ m has property E(b) with b C (d). Then for f T(K) with K N we have m f(ξ ν ) q f(γ ν ) q C(d,q)(Kj /N j ) f q q. 3

Proof. Using the simple relation v q u q = qc q (v u) with c [u,v] for numbers 0 u v we write m f(ξ ν ) q f(γ ν ) q qπ 2mN j f x j (α ν ) f(β ν ) q with α ν and β ν belonging to the same (n) as ξ ν and γ ν. Using Hölder inequality with parameters q and q we continue the above inequality ( ) /q( ) qπ (q )/q f x 2N j m j (α ν ) q f(β ν ) q. m Functions f and f are from T(K) T (N) and sequences {α ν } m and {β ν } m satisfy the conditions of Theorem 3.2. Thus, we continue C(d,q) 2N j f x j q f q q. (3.7) Using the Bernstein inequality we obtain f x j q K j f q. Substituting it into (3.7) we complete the proof of Lemma 3.4. We now complete the proof of Theorem 3.. As in the proof of Theorem 2. we use the (t,r,d)-nets with an appropriate r. Then Theorem 3.2 gives us an upper bound for the discrete norm. For the lower bound we use the net {x(n),n P (N)} with an appropriate N, which satisfies the desired lower bound by (.7). Then we use Lemma 3.4 with appropriate K to derive from (.7) the lower bound for the set build with a help of the (t,r,d)-nets. We now proceed to the detailed argument. Let t be such that t t(d) and the (t,r,d)-net exists. A construction of such nets for all d and t Cd, r t is given in [2]. For a given n N define r := n+t+a(d,q)d with a(d,q) N, which will be specified later. Consider the (t,r,d)-net T(t,r,d). Denote T π (t,r,d) := 2πT(t,r,d) := {2πx : x T(t,r,d)}. Take an s Z d + such that s = n and consider s with s j := s j +a(d,q). Consider a dyadic box of the form [2π(a )2 s,2πa 2 s ) [2π(ad )2 s d,2πad 2 s d ). (3.8) The volume of this box is equal to (2π) d 2 t r. Specify N := 2 s 2 and K := 2 s. Then each (n), n P (N), is a dyadic box of the above form (3.8). 4

Therefore, for each n P (N), by the definition of the T π (t,r,d) := {ξ ν } 2r, there are 2 t points ξ ν from the box (n). We plan to apply Lemma 3.4 and define the set Γ m in such a way that each x(n), n P (N), is included in Γ m exactly 2 t times. Then from the definition of s it follows that K j /N j = 2 2 a(d,q), j =,...,d. Certainly, we can make a(d,q) large enough to get from Lemma 3.4 the inequality f(ξ ν ) q f(γ ν ) q C (d,q) f q q m /2 where C (d,q) is from (.7). This completes the proof of the lower bound in Theorem 3.. The upper bound follows directly from Theorem 3.2. 4 Discussion Theorems 2. and 3. give sets of cardinality C(d,q)2 n, which provide universal discretization in L q, q, for the collection C(n,d). This, basically, solves the Universal discretization problem for the collection C(n, d), because, obviously, the lower bound for the cardinality of a set, providing the Marcinkiewicz discretization theorem for T (R(s)) with s = n, is C(d)2 n. In Section 2 we discussed the idea of using the set, which provides the Marcinkiewicz discretization theorem for T(Q n ) as the one providing universal discretization for the collection C(n, d). We pointed out there that the known results on the Marcinkiewicz discretization theorem for T(Q n ) in L show that this idea does not work well in case L we loose an extra factor m c, c > 0. We now discuss an application of this idea in the case of L q, q <. We begin with the case q = 2. The following theorem is obtained in [9]. Theorem 4.. There are three positive absolute constants C, C 2, and C 3 with the following properties: For any d N and any Q Z d there exists a set of m C Q points ξ j T d, j =,...,m such that for any f T (Q) we have C 2 f 2 2 f(ξ j ) 2 C 3 f 2 m 2. j= 5

In [9] we showed how to derive Theorem 4. from the recent paper by S. Nitzan, A. Olevskii, and A. Ulanovskii [3], which in turn is based on the paper of A. Marcus, D.A. Spielman, and N. Srivastava [0]. Theorem 4., basically, solves the problem of the Marcinkiewicz-type discretization theorem for the T (Q n ) in the L 2 case. The reader can find some more discussion ofthe L 2 case in[20]. Wealso refer tothe paper [6]for adiscussion of a recent outstanding progress in the area of submatrices of orthogonal matrices. In particular, in the case of Q = Q n Theorem 4. provides a set of m C Q n C(d)2 n n d points ξ j T d, j =,...,m, such that for any f T (Q n ) we have C 2 f 2 2 m f(ξ j ) 2 C 3 f 2 2. j= Thus, this set with cardinality C(d)2 n n d is universal for discretization in L 2 of the collection C(n,d). It shows that on this way we only loose an extra factor (logm) d in the cardinality of the set. It is clear that we cannot do better on this way because dim(t (Q n )) C(d)2 n n d. Let us discuss the case L. The following Marcinkiewicz-type theorem for discretization of the L norm of polynomials from T (Q) was proved in [20]. Theorem 4.2. There is large enough constant C (d) with the property: For any Q Π(N) with N = (2 n,...,2 n ) there exists a set of m C (d) Q n 7/2 points ξ j T d, j =,...,m such that for any f T (Q) we have 2 f m f(ξ j ) 3 2 f. j= ThistheoremappliedtoT(Q n )givesasetofcardinalitym C (d) Q n n 7/2, which provides universal discretization in L for the collection C(n,d). Thus, on the way of using the above idea we only loose an extra factor (logm) d+5/2 in the cardinality of the set. In the case q 2 and q the known results on the Marcinkiewicz-type theorem for the T(Q n ) are weaker than in the case q = 2 or q =. In this case known results from [9] only give us results with an extra power factor m /q. 6

References [] C. Aistleitner, A. Hinrichs, and D. Rudolf, On the size of the largest empty box amidst a point set, arxiv:507.02067v3 [cs.cg] 8 Jun 207. [2] P. Binev, A. Cohen, W. Dahmen, R. DeVore, and V.N. Temlyakov, Universal algorithms for learning theory. Part I: piecewise constant functions, Journal of Machine Learning Theory (JMLT),6 (2005), 297 32. [3] Ding Dung, V.N. Temlyakov, and T. Ullrich, Hyperbolic Cross Approximation, arxiv:60.03978v2 [math.na] 2 Dec 206. [4] A. Dumitrescu and M. Jiang, On the largest empty axis-parallel box amidst n points, Algorithmica, 66 (203), 225 248. [5] L. Györfy, M. Kohler, A. Krzyzak, and H. Walk, A distribution-free theory of nonparametric regression, Springer, Berlin, 2002. [6] B.S. Kashin, Lunin s method for selecting large submatrices with small norm, Matem. Sb., 206 (205), 95 02. [7] B.S. Kashin and V.N. Temlyakov, On a norm and related applications, Mat. Zametki 64 (998), 637 640. [8] B.S. Kashin and V.N. Temlyakov, On a norm and approximation characteristics of classes of functions of several variables, Metric theory of functions and related problems in analysis, Izd. Nauchno-Issled. Aktuarno-Finans. Tsentra (AFTs), Moscow, 999, 69 99. [9] B.S. Kashin and V.N. Temlyakov, The volume estimates and their applications, East J. Approx., 9 (2003), 469 485. [0] A. Marcus, D.A. Spielman, and N. Srivastava, Interlacing families II: Mixed characteristic polynomials and the Kadison-Singer problem, Annals of Math., 82 (205), 327 350. [] J. Matousek, Geometric Discrepancy, Springer, 999. 7

[2] H. Niederreiter and C. Xing, Low-discrepancy sequences and global function fields with many rational places, Finite Fields Appl., 2 (996), 24 273. [3] S. Nitzan, A. Olevskii, and A. Ulanovskii, Exponential frames on unbounded sets, Proc. Amer. Math. Soc., 44 (206),09 8. [4] G. Rote and F. Tichy, Quasi-Monte Carlo methods and the dispersion of point sequences, Math. Comput. Modelling, 23 (996), 9 23. [5] V.N. Temlyakov, Approximation by elements of a finite-dimensional subspace of functions from various Sobolev or Nikol skii spaces, Matem. Zametki, 43 (988), 770 786; English transl. in Math. Notes, 43 (988). [6] V.N. Temlyakov, Approximation of periodic functions, Nova Science Publishes, Inc., New York., 993. [7] V.N. Temlyakov, On universal estimators in learning theory, Trudy MIAN im. V.A. Steklova, 255 (2006), 256 272; English transl. in Proceedings of the Steklov Inst. Math., 255 (2006), 244 259. [8] V.N. Temlyakov, Greedy approximation, Cambridge University Press, 20. [9] V.N. Temlyakov, The Marcinkewiecz-type discretization theorems for the hyperbolic cross polynomials, arxiv: 702.067v2 [math.na] 26 May 207. [20] V.N. Temlyakov, The Marcinkiewicz-type discretization theorems, arxiv:703.03743v [math.na], 0 Mar 207. [2] A. Zygmund, Trigonometric Series, Cambridge University Press, 959. 8