arxiv: v2 [math.oc] 30 Sep 2015

Similar documents
Convergence rates of moment-sum-of-squares hierarchies for volume approximation of semialgebraic sets

Linear conic optimization for nonlinear optimal control

Kernel Method: Data Analysis with Positive Definite Kernels

Convex Optimization & Parsimony of L p-balls representation

WEAK CONVERGENCES OF PROBABILITY MEASURES: A UNIFORM PRINCIPLE

THEOREMS, ETC., FOR MATH 515

Strong duality in Lasserre s hierarchy for polynomial optimization

On a Class of Multidimensional Optimal Transportation Problems

MATH MEASURE THEORY AND FOURIER ANALYSIS. Contents

CHAPTER V DUAL SPACES

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

A new look at nonnegativity on closed sets

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Clarkson Inequalities With Several Operators

APPENDIX A. Background Mathematics. A.1 Linear Algebra. Vector algebra. Let x denote the n-dimensional column vector with components x 1 x 2.

LMI MODELLING 4. CONVEX LMI MODELLING. Didier HENRION. LAAS-CNRS Toulouse, FR Czech Tech Univ Prague, CZ. Universidad de Valladolid, SP March 2009

INVARIANT SUBSPACES FOR CERTAIN FINITE-RANK PERTURBATIONS OF DIAGONAL OPERATORS. Quanlei Fang and Jingbo Xia

Part II Probability and Measure

The moment-lp and moment-sos approaches

Mean squared error minimization for inverse moment problems

STAT 7032 Probability Spring Wlodek Bryc

Approximate Optimal Designs for Multivariate Polynomial Regression

+ 2x sin x. f(b i ) f(a i ) < ɛ. i=1. i=1

Vectors in Function Spaces

On Riesz-Fischer sequences and lower frame bounds

Real Analysis Problems

ON THE HÖLDER CONTINUITY OF MATRIX FUNCTIONS FOR NORMAL MATRICES

Cone-Constrained Linear Equations in Banach Spaces 1

2 Lebesgue integration

SEPARABILITY AND COMPLETENESS FOR THE WASSERSTEIN DISTANCE

Elements of Positive Definite Kernel and Reproducing Kernel Hilbert Space

A List of Problems in Real Analysis

The optimal partial transport problem

arxiv:math/ v1 [math.oc] 13 Mar 2007

Hilbert Spaces. Contents

Decomposition of Riesz frames and wavelets into a finite union of linearly independent sets

Local semiconvexity of Kantorovich potentials on non-compact manifolds

Weighted Sums of Orthogonal Polynomials Related to Birth-Death Processes with Killing

Product measure and Fubini s theorem

COURSE ON LMI PART I.2 GEOMETRY OF LMI SETS. Didier HENRION henrion

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by

Stochastic Realization of Binary Exchangeable Processes

A CLASS OF SCHUR MULTIPLIERS ON SOME QUASI-BANACH SPACES OF INFINITE MATRICES

Reducing subspaces. Rowan Killip 1 and Christian Remling 2 January 16, (to appear in J. Funct. Anal.)

arxiv: v2 [math.ap] 23 Apr 2014

A JOINT+MARGINAL APPROACH TO PARAMETRIC POLYNOMIAL OPTIMIZATION

FACULTY OF SCIENCE FINAL EXAMINATION MATHEMATICS MATH 355. Analysis 4. Examiner: Professor S. W. Drury Date: Tuesday, April 18, 2006 INSTRUCTIONS

1 Math 241A-B Homework Problem List for F2015 and W2016

MATH & MATH FUNCTIONS OF A REAL VARIABLE EXERCISES FALL 2015 & SPRING Scientia Imperii Decus et Tutamen 1

Estimates for probabilities of independent events and infinite series

here, this space is in fact infinite-dimensional, so t σ ess. Exercise Let T B(H) be a self-adjoint operator on an infinitedimensional

Chapter 1. Preliminaries. The purpose of this chapter is to provide some basic background information. Linear Space. Hilbert Space.

Mean squared error minimization for inverse moment problems

Integer programming, Barvinok s counting algorithm and Gomory relaxations

arxiv: v1 [math.cv] 30 Jun 2008

A VERY BRIEF REVIEW OF MEASURE THEORY

THEOREMS, ETC., FOR MATH 516

MATH41011/MATH61011: FOURIER SERIES AND LEBESGUE INTEGRATION. Extra Reading Material for Level 4 and Level 6

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1

Outline of Fourier Series: Math 201B

Lecture Note 5: Semidefinite Programming for Stability Analysis

Lecture 2: Computing functions of dense matrices

Chapter 4. The dominated convergence theorem and applications

Are There Sixth Order Three Dimensional PNS Hankel Tensors?

Solutions to Tutorial 11 (Week 12)

A note on the convex infimum convolution inequality

c 2008 Society for Industrial and Applied Mathematics

Second Order Elliptic PDE

arxiv: v1 [math.oc] 31 Jan 2017

arxiv: v2 [math.oc] 31 May 2010

SPECTRAL THEOREM FOR COMPACT SELF-ADJOINT OPERATORS

Convex computation of the region of attraction for polynomial control systems

REGULARITY OF MONOTONE TRANSPORT MAPS BETWEEN UNBOUNDED DOMAINS

We denote the space of distributions on Ω by D ( Ω) 2.

Some SDEs with distributional drift Part I : General calculus. Flandoli, Franco; Russo, Francesco; Wolf, Jochen

Zeros of lacunary random polynomials

Remarks on the Rademacher-Menshov Theorem

arxiv: v2 [math.pr] 27 Oct 2015

Convex computation of the region of attraction for polynomial control systems

arxiv: v1 [math.oc] 18 Jul 2011

3. (a) What is a simple function? What is an integrable function? How is f dµ defined? Define it first

PARTIAL REGULARITY OF BRENIER SOLUTIONS OF THE MONGE-AMPÈRE EQUATION

LARGE DEVIATIONS OF TYPICAL LINEAR FUNCTIONALS ON A CONVEX BODY WITH UNCONDITIONAL BASIS. S. G. Bobkov and F. L. Nazarov. September 25, 2011

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

MATHS 730 FC Lecture Notes March 5, Introduction

Means of unitaries, conjugations, and the Friedrichs operator

b i (µ, x, s) ei ϕ(x) µ s (dx) ds (2) i=1

Disintegration into conditional measures: Rokhlin s theorem

Lebesgue-Stieltjes measures and the play operator

Random Process Lecture 1. Fundamentals of Probability

Continuous Functions on Metric Spaces

Functional Analysis I

On the simplest expression of the perturbed Moore Penrose metric generalized inverse

Overview of normed linear spaces

A NEW ITERATIVE METHOD FOR THE SPLIT COMMON FIXED POINT PROBLEM IN HILBERT SPACES. Fenghui Wang

Solving the 3D Laplace Equation by Meshless Collocation via Harmonic Kernels

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Transcription:

Symmetry, Integrability and Geometry: Methods and Applications Moments and Legendre Fourier Series for Measures Supported on Curves Jean B. LASSERRE SIGMA 11 (215), 77, 1 pages arxiv:158.6884v2 [math.oc] 3 Sep 215 LAAS-CNRS and Institute of Mathematics, University of Toulouse, 7 Avenue du Colonel Roche, BP 54 2, 3131 Toulouse Cédex 4, France E-mail: lasserre@laas.fr URL: http://homepages.laas.fr/lasserre/ Received August 28, 215, in final form September 26, 215; Published online September 29, 215 http://dx.doi.org/1.3842/sigma.215.77 Abstract. Some important problems (e.g., in optimal transport and optimal control) have a relaxed (or weak) formulation in a space of appropriate measures which is much easier to solve. However, an optimal solution µ of the latter solves the former if and only if the measure µ is supported on a trajectory {(t, x(t)): t [, T ]} for some measurable function x(t). We provide necessary and sufficient conditions on moments (γ ij ) of a measure dµ(x, t) on [, 1] 2 to ensure that µ is supported on a trajectory {(t, x(t)): t [, 1]}. Those conditions are stated in terms of Legendre Fourier coefficients f j = (f j (i)) associated with some functions f j : [, 1] R, j = 1,..., where each f j is obtained from the moments γ ji, i =, 1,..., of µ. Key words: moment problem; Legendre polynomials; Legendre Fourier series 21 Mathematics Subject Classification: 42C5; 42C1; 42A16; 44A6 1 Introduction This paper is in the line of research concerned with the following issue: which type and how much of information on the support of a measure can be extracted from its moments (a research issue outlined in a Problem session at the 213 Oberwolfach meeting on Structured Function Systems and Applications [2]). In particular, a highly desirable result is to obtain necessary and/or sufficient conditions on moments of a given measure to ensure that its support has certain geometric properties. For instance there is a vast literature on the old and classical L-moment problem, which asks for moment conditions to ensure that the underlying measure µ is absolutely continuous with respect to some reference measure ν, and with a density in L (ν). See, for instance, [3, 1, 11], more recently [7], and the many references therein. Here we are interested in a problem that is somehow orthogonal to the L-moment problem. Namely, we consider the following generic problem: Let dµ(x, t) be a probability measure on [, 1] [, 1]. Provide necessary and/or sufficient conditions on the moments of µ to ensure that µ is singular with respect to the Lebesgue measure d(x, t) on [, 1] 2. In fact, and more precisely, suppose that: one knows all moments γ i (j) = x i t j dµ(x, t), i, j =, 1,..., of the measure µ, and the marginal of µ with respect to the t variable is the Lebesgue measure dt on [, 1]. Then provide necessary and/or sufficient conditions on the moments (γ i (j)) of µ to ensure that µ is supported on a trajectory {(t, x(t)): t [, 1]} [, 1] 2, for some measurable function x: [, 1] [, 1]. This paper is a contribution to the Special Issue on Orthogonal Polynomials, Special Functions and Applications. The full collection is available at http://www.emis.de/journals/sigma/opsfa215.html

2 J.B. Lasserre In contrast to the L-moment problem, and to the best of our knowledge, the above problem stated in this form has not received a lot of attention in the past even though it is crucial in some important applications (two of them having motivated our interest). Motivation. In addition of being of independent interest, this investigation is motivated by at least two important applications: The mass transfer (or optimal transport) problem. In the weak (or relaxed) Monge Kantorovich formulation of the mass transport problem originally stated by Monge, one searches for a measure dµ(x, t) with prescribed marginals ν x and ν t, and which minimizes some cost functional c(x, t) dµ(x, t). However in the original Monge formulation, ultimately one would like to obtain an optimal solution µ of the form dµ (x, t) = δ x(t) dν t (t) for some measurable function t x(t) (the transportation plan) and a crucial issue is to provide conditions for this to happen 1. For more details the interested reader is referred, e.g., to [13, pp. 1 5] and [9]. There exist some characterizations of the support of an optimal measure for the weak formulation. For instance, c-cyclical monotonicity relates optimality with the support of solutions, and more recently [1] have shown in the (more general) context of the generalized moment problem that under some weak conditions the support of optimal solutions is finitely minimal / c-monotone. (As defined in [1] a set Γ is called finitely minimal / c-monotone if each finite measure α concentrated on finitely many atoms of Γ is cost minimizing among its competitors; in the optimal transport context, a competitor of α is any finite measure α with same marginals as α.) For more details the interested reader is referred to [1] and the references therein. But such a characterization does not say when this support is a trajectory. Deterministic optimal control. Using the concept of occupation measures, a weak formulation of deterministic optimal control problems replaces the original control problem with an infinite-dimensional optimization problem P on a space of appropriate (occupation) measures on a Borel space X U [, 1] with X R n, U R m. For more details the interested reader is referred, e.g., to [8, 14], and the many references therein. An important issue is to provide conditions on the problem data under which the optimal value of the relaxed problem P is the same as that of the original problem; see, e.g., [14]. Again this is the case if some optimal solution µ (or every element of a minimizing sequence) of the relaxed problem is such that every marginal µ j of µ with respect to (x j, t), j = 1,..., n, and every marginal µ l of µ with respect to (u l, t), l = 1,..., m, is supported on a trajectory {(t, x j (t)): t [, 1]} and on a trajectory {(t, u l (t)): t [, 1]} for some measurable functions t x j (t) and t u l (t) on [, 1]. Contribution. Of course there is a particular case where one may conclude that µ is singular with respect to the Lebesgue measure on [, 1] 2. If there is a polynomial p R[x, t] of degree say d, such that its vector of coefficients p is in the kernel of the moment matrix M s (where M s [(i, j), (k, l)] = γ i+k,j+l, i + j, k + l s, with d s), then µ is supported on the variety {(x, t) [, 1] 2 : p(x, t) = } and therefore is singular with respect to the Lebesgue measure on [, 1] 2. But it may happen that p(x, t) = p(y, t) = for some t and some x y and so even in this case additional conditions are needed to ensure existence of a trajectory {(t, x(t)): t [, 1]}. We provide a set of explicit necessary and sufficient conditions on the moments γ i = (γ i (j)) which state that for every fixed i, the moments γ i (j), j =, 1,..., are limits of certain i-powers of the moments γ 1. More precisely, an explicit linear transformation γ 1 of the infinite vector γ 1 is the vector of (shifted) Legendre Fourier coefficients associated with the function t x(t). Then the conditions state that for each fixed i = 2, 3,..., the vector γ i should be the vector of (shifted) Legendre Fourier coefficients associated with the function t x(t) i, which in turn are expressible in terms of limits of i-powers of coefficients of γ 1. At last but not least, it should be noted that all results of this paper are easily transposed to the multi-dimensional case of a measure dµ(x, t) on [, 1] n [, 1] and supported on a trajectory 1 Here δ z denote the Dirac measure at the point z.

Moments and Legendre Fourier Series for Measures Supported on Curves 3 {(t, x(t)): t [, 1]} [, 1] n+1 for some measurable mapping x: [, 1] [, 1] n. Indeed by proceeding coordinate-wise for each function t x i (t), i = 1,..., n, one is reduced to the case [, 1] 2 investigated here. 2 Notation, def initions and preliminary results 2.1 Notation and def initions Given a Borel probability measure µ on [, 1] 2, define γ i (j) = t j x i dµ(x, t), i, j =, 1,..., (2.1) [,1] 2 and for every fixed i N, denote by γ i the vector of moments (γ i (j)), j =, 1,.... Let (L j ), j =, 1,..., be the family of orthonormal polynomials with respect to the Lebesgue measure on [, 1]. They can be deduced from the Legendre polynomials 2 via the change of variable t = (2t 1); the (L j ) are called the shifted Legendre polynomials. The polynomials (L j ) can be also computed exactly from the moments γ of the Lebesgue measure dt on [, 1] by computing some determinants of modified Hankel moment matrices. For instance, L = 1, and ( ) 1 1/2 L 1 (t) = a det = a(t 1/2) with a 2 (1/3 1/2 + 1/4) = 1, 1 t i.e., a = 12 and L 1 (t) = 2 3t 3, and 1 1/2 1/3 L 2 (t) = b det 1/2 1/3 1/4 = b [ t 2 /12 t/12 + 1/72 ], 1 t t 2 with b > such that L 2(t) 2 dt = 1. See, e.g., [4] and [6]. The Lebesgue space L 2 ([, 1]) := {f : [, 1] R, f 2 dx < }, equipped with the scalar product f, g = fg dx and the associated norm, is a Hilbert space and the polynomials are dense in L 2 ([, 1]). In particular the family (L j ), j =, 1,..., form an orthonormal basis of L 2 ([, 1]). Let l 2 denotes the space of square-summable sequences with norm also denoted by. Finally, let f := ess sup f(x), and similarly, for p R[x] let p := x [,1] Then L ([, 1]) := {f : f < }. 2.2 Some preliminary results sup p(x). x [,1] We next state some useful auxiliary results, some of them being standard in Real Analysis. Proposition 2.1. Let t f(t) be an element of L 2 ([, 1]) and define f = (f(j)) by f(j) := Then one has f(j)l j := lim j= L j (t)f(t) dt, j =, 1,.... j= f(j)l j = f in L 2 ([, 1]), (2.2) and this decomposition is unique. Moreover f l 2 and f = f. 2 The Legendre polynomials are orthonormal w.r.t. to the Lebesgue measure on [ 1, 1].

4 J.B. Lasserre When the interval is [ 1, 1] (instead of [, 1] here) (2.2) is called the Legendre (or Legendre Fourier) series expansion of the function f and f = (f(j)) is called the vector of Legendre Fourier coefficients. The notation f k stands for the function t f(t) k, k N. If f k L 2 ([, 1]) we denote by f k = (f k (j)) l 2 its (shifted) Legendre Fourier coefficients so that f k = f k (where again the latter norm is that of l 2 ). Notice that we also have: Proposition 2.2. Let f, f k, g L 2 ([, 1]) with k N. Then g = f k if and only if ĝ = ˆ f k. This follows from the uniqueness of the decomposition in the basis (L j ). We also have the following helpful results: Lemma 2.3. Let f L 2 ([, 1]) with Legendre Fourier coefficients f = (f(j)), and let (p n ) R[t] be a sequence of polynomials such that p n f as n. If ˆp n = (ˆp n (j)) denotes the (shifted) Legendre Fourier coefficients of p n, for all n = 1, 2,..., then ˆp n f in l 2 as n. Proof. As p n f 2 = (p n f) 2 dt and with d n = deg(p n ), (p n f) 2 dt = = d n j= 2 d n j= d n j= ˆp n (j)l j f 2 dt (ˆp n (j)) 2 Lj 2 dt +2 ˆp n (j)ˆp n (k) }{{} k<j d n j= ˆp n (j) =1 L j f dt } {{ } f(j) + f 2 }{{} = f = j f(j)2 ˆp n (j) 2 2ˆp n (j)f(j) + f(j) 2 = d n j= L j L k dt } {{ } = (ˆp n (j) f(j)) 2, and the result follows because p n f as n. Lemma 2.4. Let f L ([, 1]) (hence f k L 2 ([, 1]) for every k N). If a sequence (p n ) R[x] is such that sup n p n < and p n f as n, then for every k N, lim p k n f k =. In addition, if ˆp k n denotes the (shifted) Legendre Fourier coefficients of p k n, n = 1,..., then ˆp k n f k in l 2 as n. Proof. Let M > max[ f, sup n p n ] and fix k N, p k n f k 2 = ( p k n f k) ( 1 k 1 ) 2 2 dx = (p n f) 2 pn k 1 l f l dx (p n f) 2 ( k l=1 l= ) 2 p k l n f l 1 dx ( km k 1) 2 = ( km k 1) 2 pn f 2 ( as n.) (p n f) 2 dx Then the last statement follows from Lemma 2.3.

Moments and Legendre Fourier Series for Measures Supported on Curves 5 Definition 2.5. Let f L 2 ([, 1]) with Legendre Fourier coefficients f. For every k, n N, define the polynomial f n (k) R[x] and the vector f n (k) R kn+1 by t f (k) n (t) := f(j)l j (t) j= k nk = f n (k) (j)l j (t). j= Observe that each entry f n (k) (j), j =,..., nk, is a degree-k form of the first n + 1 Legendre Fourier coefficients of ˆf. Completing with zeros, consider f n (k) converges in l 2 as n, call f (k) l 2 its limit. to be an element of l 2 and if f (k) n The limit f (k) can also be denoted f f, the limit of the k times -product in l 2 of the vector f l 2 by itself. Equivalently one may write f (k) = f (k 1) f since f n (k) (t) = f n (k 1) (t)f n (1) (t) for all t [, 1], and f n (k 1) f k 1 as n, as well as f n (1) f. Lemma 2.6. Let f L ([, 1]), hence f k L 2 ([, 1) with (shifted) Legendre Fourier coefficients f k l 2 for every k N, and assume that sup f n (1) = sup f(j)l j <. n n j= Then f k = f (k) = f f (k times) for every k = 1, 2,..., meaning that for every fixed k N, k lim f(j)l j f k k = lim f n (k) (j)l j f k j= j= = lim f k (j)l j f k =. j= Equivalently, f k = f k 1 f for every k = 2, 3,.... Proof. The result is a direct consequence of Lemmas 2.3 and 2.4 with p n = f n (1) definition of the limit -product in Definition 2.5). (and the 3 Main result Assume that we are given all moments of a nonnegative measure dµ(x, t) on a box [a, b] [c, d] R 2. After a re-scaling of its moments we may and will assume that µ is a probability measure supported on [, 1] 2 with associated moments γ i (j) = x i t j dµ(x, t), i, j =, 1,.... [,1] 2 We further assume that the marginal measure µ t with respect to the variable t, is the Lebesgue measure on [, 1], that is, γ (j) = 1/(j + 1), j =, 1,.... A standard disintegration of the measure µ yields ( 1 ) γ i (j) = t j x i ψ(dx t) dt, i, j =, 1,..., (3.1) [,1] [,1] 2 t j x i dµ(x, t) = } {{ } =:f i (t)

6 J.B. Lasserre where the stochastic kernel ψ( t) is the conditional probability on [, 1] given t [, 1]. Observe that the measurable function f i in (3.1) is nonnegative and uniformly bounded by 1 because x i 1 on [, 1] for every i, and so f i L ([, 1]) for every i = 1,.... The vector γ i = (γ i (j)), j =, 1,..., is the vector of moments of the measure dµ i (t) = f i (t)dt on [, 1], for every i = 1, 2,.... The (shifted) Legendre Fourier vector of coefficients ˆq i of f i are obtained easily from the (infinite) vector γ i via a triangular linear system. Indeed write L j (t) = j jk t k, t [, 1], j =, 1,..., k= where jj >, or in compact matrix form L 1 L 1 L n = t t n, (3.2) for some infinite lower triangular matrix with all diagonal elements being strictly positive. Therefore γ i = 1 t t n f i(t) dt = L L 1 L n f i(t) dt = ˆq i. Suppose that the measure µ is supported on a trajectory {(t, x(t)): t [, 1]} [, 1] 2 for some measurable (density) function x: [, 1] [, 1]. The measurable function t x(t) is an element of L ([, 1]) because x 1. Then by Proposition 2.1, x = ˆx(j)L j in L 2 ([, 1]), (3.3) j= where ˆx = (ˆx(j)) l 2 is its vector of (shifted) Legendre Fourier coefficients (with ˆx = x ). Similarly, for every k = 2, 3,..., the function t x(t) k is in L ([, 1]) and x k = ˆx k (j)l j in L 2 ([, 1]), j= with vector of (shifted) Legendre Fourier coefficients ˆx k l 2 such that x k = ˆx k. We also recall the notation ˆx (k) n R kn+1 for the vector of coefficients in the basis (L j ) of ( k the polynomial t ˆx(j)L j (t)), and when considered as an element of l 2 (by completing j= with zeros) denote by ˆx (k) l 2 its limit when it exists. Theorem 3.1. Let µ be a Borel probability measure on [, 1] 2 and let γ i (j), i, j =, 1,..., be the moments of µ in (2.1).

Moments and Legendre Fourier Series for Measures Supported on Curves 7 (a) If µ is supported on a trajectory {(t, x(t)): t [, 1]} for some nonnegative measurable function t x(t) on [, 1] and if sup ˆx(j)L j <, then n j= ˆx i = ˆx (i) = } ˆx {{ ˆx } = ˆx i 1 ˆx, i = 2, 3,..., (3.4) i times Equivalently γ i = ( γ 1 ) (i) = ( γ i 1 γ 1 ) (i), i = 2, 3,..., (3.5) where is the non singular triangular matrix defined in (3.2). (b) Conversely, if (3.5) holds then µ is supported on a trajectory {(t, x(t)): t [, 1]} for some measurable function t x(t) on [, 1], and (3.4) also holds. Proof. The (a) part. As µ is supported on [, 1] 2 one has x 1 and so the function t x(t) i is in L 2 ([, 1]) for every i = 1, 2,.... So let t x(t) be written as in (3.3). Consider the function t x(t) i, for every fixed i N, so that x i = ˆx i (j)l j in L 2 ([, 1]), j= where the (shifted) Legendre Fourier vector of coefficients ˆx i is obtained by ˆx i = 1 γ i. But by Lemma 2.6, we also have i i x i = ˆx(j)L j = lim ˆx(j)L j in L 2 ([, 1]), j= j= with x i = ˆx i and ˆx i = γ i. In other words, ˆx (i) = ˆx i or equivalently, ˆx (i) = γ i = ( γ 1 ) (i), which is (3.4). We next prove the (b) part. By the disintegration (3.1) of the measure µ, γ i (j) = t j f i (t) dt, i, j =, 1,... for some nonnegative measurable functions f i L ([, 1]), i = 1, 2,.... As (3.5) holds one may conclude that ˆq i = ˆq (i) 1 where ˆq i is the (shifted) Legendre Fourier vector of coefficients associated with f i L 2 ([, 1]), i = 1,.... Hence by Proposition 2.2, f i (t) = f 1 (t) i a.e. on [, 1], for every i = 1, 2,.... That is, for every i = 1, 2,..., there exists a Borel set B i [, 1] with Lebesgue measure zero such that f i (t) = f 1 (t) i for all t [, 1]\B i. Therefore the Borel set B = i=1 B i has Lebesgue measure zero and for all i = 1, 2,..., f i (t) = f 1 (t) i, t [, 1]\B. Hence for every t [, 1]\B, x i ψ(dx t) = f 1 (t) i = [,1] [,1] x i dδ f1 (t), i = 1, 2,.... where δ f1 (t) is the Dirac measure at the point f 1 (t) [, 1]. As measures on compact sets are moment determinate, one must have ψ(dx t) = δ f1 (t), for all t [, 1]\B. Therefore dµ(x, t) = δ f1 (t) dt, i.e., the measure µ is supported on the trajectory {(t, x(t)): t [, 1]}, where x(t) = f 1 (t) for almost all t [, 1].

8 J.B. Lasserre Remark 3.2. If the trajectory t x(t) is a polynomial of degree say d, then the vector of Legendre Fourier coefficients ˆx l 2 has at most d + 1 non-zero elements. Therefore for every j = 2,..., ˆx j l 2 also has at most jd + 1 non-zero elements and the condition (3.5) can be checked easily. In Theorem 3.1(a) one assume that sup ˆx(j)L j < which is much weaker than, n j= e.g., assuming the uniform convergence ˆx(j)L j x as n. The latter (which is j= also much stronger than the a.e. pointwise convergence) can be obtained if the function x(t) has some smoothness properties. For instance if x belongs to some Lipschitz class of order larger then or equal to 1/2, then uniform convergence takes place and one may even obtain rates of convergence; see, e.g., [12] and also [15] for a comparison (in terms of convergence) of Legendre and Chebyshev expansions. In fact, quoting the authors of [5],... knowledge of the partial spectral sum of an L 2 function in [ 1, 1] furnishes enough information such that an exponential convergent approximation can be constructed in any subinterval in which f is analytic. Example 3.3. To illustrate Theorem 3.1 consider the following toy example with µ on [, 1] 2 and with marginal w.r.t. t being the uniform distribution on [, 1] and conditional ψ(dx t) = δ exp( t) for all t [, 1]. That is, t x(t) = exp( t). Then the first 11 Legendre Fourier coefficients ˆx(j), j =,..., 1 of x read ˆx = [.6321255.179568.2315.1937.1217.61.2.1.4.15.625]. Similarly the first 11 Legendre Fourier coefficients of t x(t) 2 = exp( 2t) read ˆx 2 = [.4323323.234475.588678.97965.12219.1219.11.7.4.4.4]. With n = 5 the polynomial t x (2) 5 (t) := ( 5 k= ˆx(j)L j (t)) 2 reads x (2) 1 5 (t) = k= ˆx (2) 5 (j) L j(t), t R, with ˆx (2) 5 = [.4323336.2344129.588626.97976.12219.1218.98.6.3.1.], and we can observe that ˆx (2) 5 ˆx 2 O ( 1 5). In fact the curves t x (2) 5 (t) and t exp( 2t) are almost indistinguishable on the interval [, 1]. 3.1 A more general case We have considered a measure µ on [, 1] 2 whose marginal with respect to t [, 1] is the Lebesgue measure. The conditions of Theorem 3.1 are naturally stated in terms of the (shifted) Legendre Fourier coefficients associated with the functions t f i (t) of L 2 ([, 1]) defined in (3.1). However, the same conclusions also hold if the marginal of µ with respect to t [, 1] is some measure dν = h(t)dt for some nonnegative function h L 1 ([, 1]) with all moments finite. The

Moments and Legendre Fourier Series for Measures Supported on Curves 9 only change is that now we have to consider the orthonormal polynomials t H j (t), j =, 1,..., with respect to ν. Recall that all the H j s can be computed from the moments γ (j) = t j dµ(x, t) = t j dν(t) = t j h(t) dt, j =, 1,.... Then proceeding as before, for every i = 1, 2,..., ( 1 ) t j x i dµ(x, t) = t j x i ψ(dx t) h(t) dt, j =, 1,..., [,1] }{{} =f i (t) L 2 ([,1],ν) and we now consider the vector of coefficients ˆf hi = (ˆf hi (j)) defined by ˆf hi (j) = H j (t)f i (t)h(t) dt, j =, 1,... the analogues for the measure dν = h(t)dt and the function f i L 2 ([, 1], ν), of the (shifted) Legendre Fourier coefficients ˆf i (j) of f i in (3.1) for the Lebesgue measure on [, 1]. Then the conditions in Theorem 3.1 would be exactly the same as before, excepted that now, ˆx = (ˆx(j)) with ˆx(j) = 3.2 Discussion H j (t)x(t)h(t) dt, j =, 1,.... Theorem 3.1 may have some practical implications. For instance consider the weak formulation P of an optimal control problem P as an infinite-dimensional optimization problem on an appropriate space of (occupation) measures, as described, e.g., in [14]. In [8] the authors propose to solve a hierarchy of semidefinite relaxations (P k ), k = 1, 2,..., of P. Each optimal solution of P k provides with a finite sequence z k = (z k j,α,β ) such that when k, zk z where z is the infinite sequence of some measure dµ(t, x, u) on [, 1] X U, where X R n, U R m, are compact sets. Under some conditions both problems P and its relaxation P have same optimal value. If µ is supported on feasible trajectories {(t, x(t), u(t)): t [, 1]} then these trajectories are optimal for the initial optimal control problem P. So it is highly desirable to check whether indeed µ is supported on trajectories from the only knowledge of its moments z = (z j,α,β ). By construction of the moment sequences z k one already knows that the marginal of µ with respect to the variable t is the Lebesgue measure on [, 1]. Therefore we are typically in the situation described in the present paper. Indeed to check whether µ is supported on trajectories {(t, (x 1 (t),..., x n (t), u 1 (t),..., u m (t)): t [, 1]}, one considers each coordinate x i (t) or u j (t) separately. For instance, for x i (t) one considers the subset of moments γ k (j) = (z j,α, ) with j =, 1,..., α = (,...,, k,,..., ) N n, k =, 1,..., with k in position i. If (3.4) holds then indeed the marginal µ t,xi of µ on (t, x i ), with moments (γ k (j)) is supported on a trajectory {(t, x i (t)): t [, 1]}. Of course, in (3.4) there are countably many conditions to check whereas in principle only finitely many moments of z are available (and with some inaccuracy due to (a) solving numerically a truncation P k of P, and (b) the convergence z k z has not taken place yet). So an issue of future investigation is to provide necessary (or sufficient?) conditions based only on finitely many (approximate) moments of µ.

1 J.B. Lasserre Acknowledgements Research funded by the European Research Council (ERC) under the European Union s Horizon 22 research and innovation program (grant agreement ERC-ADG 666981 TAMING). References [1] Beiglböck M., Griessler C., A land of monotone plenty, Technical report, Department of Mathematics, University of Vienna, 215. [2] Charina M., Lasserre J.B., Putinar M., Stöckler J., Structured function systems and applications, Oberwolfach Rep. 1 (213), 579 655. [3] Diaconis P., Freedman D., The Markov moment problem and de Finetti s theorem. I, Math. Z. 247 (24), 183 199. [4] Dunkl C.F., Xu Y., Orthogonal polynomials of several variables, Encyclopedia of Mathematics and its Applications, Vol. 81, Cambridge University Press, Cambridge, 21. [5] Gottlieb D., Shu C.-W., On the Gibbs phenomenon. III. Recovering exponential accuracy in a sub-interval from a spectral partial sum of a piecewise analytic function, SIAM J. Numer. Anal. 33 (1996), 28 29. [6] Helton J.W., Lasserre J.B., Putinar M., Measures with zeros in the inverse of their moment matrix, Ann. Probab. 36 (28), 1453 1471, math.pr/72314. [7] Lasserre J.B., Borel measures with a density on a compact semi-algebraic set, Arch. Math. (Basel) 11 (213), 361 371, arxiv:134.1716. [8] Lasserre J.B., Henrion D., Prieur C., Trélat E., Nonlinear optimal control via occupation measures and LMI-relaxations, SIAM J. Control Optim. 47 (28), 1643 1666, math.oc/73377. [9] McCann R.J., Guillen N., Five lectures on optimal transportation: geometry, regularity and applications, arxiv:111.2911. [1] Putinar M., Extremal solutions of the two-dimensional L-problem of moments, J. Funct. Anal. 136 (1996), 331 364. [11] Putinar M., Extremal solutions of the two-dimensional L-problem of moments. II, J. Approx. Theory 92 (1998), 38 58, math.ca/9512222. [12] Suetin P.K., On the representation of continuous and differentiable functions by Fourier series in Legendre polynomials, Sov. Math. Dokl. 158 (1964), 148 141. [13] Villani C., Topics in optimal transportation, Graduate Studies in Mathematics, Vol. 58, Amer. Math. Soc., Providence, RI, 23. [14] Vinter R., Convex duality and nonlinear optimal control, SIAM J. Control Optim. 31 (1993), 518 538. [15] Wang H., Xiang S., On the convergence rates of Legendre approximation, Math. Comp. 81 (212), 861 877.