SEPARABILITY AND COMPLETENESS FOR THE WASSERSTEIN DISTANCE

Similar documents
Contractive metrics for scalar conservation laws

Local semiconvexity of Kantorovich potentials on non-compact manifolds

Contents 1. Introduction 1 2. Main results 3 3. Proof of the main inequalities 7 4. Application to random dynamical systems 11 References 16

Metric Spaces and Topology

Problem Set 2: Solutions Math 201A: Fall 2016

Weak KAM pairs and Monge-Kantorovich duality

Introduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

SUMMARY OF RESULTS ON PATH SPACES AND CONVERGENCE IN DISTRIBUTION FOR STOCHASTIC PROCESSES

MATHS 730 FC Lecture Notes March 5, Introduction

On a Class of Multidimensional Optimal Transportation Problems

THEOREMS, ETC., FOR MATH 515

STAT 7032 Probability Spring Wlodek Bryc

Wiener Measure and Brownian Motion

REPRESENTABLE BANACH SPACES AND UNIFORMLY GÂTEAUX-SMOOTH NORMS. Julien Frontisi

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define

CHAPTER V DUAL SPACES

Empirical Processes: General Weak Convergence Theory

A note on some approximation theorems in measure theory

AN ELEMENTARY PROOF OF THE TRIANGLE INEQUALITY FOR THE WASSERSTEIN METRIC

Chapter 2 Metric Spaces

Invariant measures for iterated function systems

arxiv: v3 [stat.co] 22 Apr 2016

E.7 Alaoglu s Theorem

REGULARITY OF MONOTONE TRANSPORT MAPS BETWEEN UNBOUNDED DOMAINS

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Weak convergence and Brownian Motion. (telegram style notes) P.J.C. Spreij

Weak convergence and Compactness.

Introduction and Preliminaries

OPTIMAL TRANSPORTATION PLANS AND CONVERGENCE IN DISTRIBUTION

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

An introduction to some aspects of functional analysis

Tools from Lebesgue integration

On the distributional divergence of vector fields vanishing at infinity

Prohorov s theorem. Bengt Ringnér. October 26, 2008

Maths 212: Homework Solutions

Continuity of convex functions in normed spaces

Some SDEs with distributional drift Part I : General calculus. Flandoli, Franco; Russo, Francesco; Wolf, Jochen

MAT 578 FUNCTIONAL ANALYSIS EXERCISES

INTRODUCTION TO TOPOLOGY, MATH 141, PRACTICE PROBLEMS

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε

The optimal partial transport problem

Parcours OJD, Ecole Polytechnique et Université Pierre et Marie Curie 05 Mai 2015

REAL AND COMPLEX ANALYSIS

Math 426 Homework 4 Due 3 November 2017

Stone-Čech compactification of Tychonoff spaces

A note on the σ-algebra of cylinder sets and all that

From now on, we will represent a metric space with (X, d). Here are some examples: i=1 (x i y i ) p ) 1 p, p 1.

Topological vectorspaces

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications.

NECESSARY CONDITIONS FOR WEIGHTED POINTWISE HARDY INEQUALITIES

P-adic Functions - Part 1

Some Background Material

The Arzelà-Ascoli Theorem

A description of transport cost for signed measures

Compact operators on Banach spaces

TYPICAL BOREL MEASURES ON [0, 1] d SATISFY A MULTIFRACTAL FORMALISM

Measure and Integration: Solutions of CW2

Another Riesz Representation Theorem

4 Countability axioms

Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces

ECARES Université Libre de Bruxelles MATH CAMP Basic Topology

It follows from the above inequalities that for c C 1

Real Analysis Notes. Thomas Goller

LECTURE 15: COMPLETENESS AND CONVEXITY

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.

Stationary states for the aggregation equation with power law attractive-repulsive potentials

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

g 2 (x) (1/3)M 1 = (1/3)(2/3)M.

Weak convergence. Amsterdam, 13 November Leiden University. Limit theorems. Shota Gugushvili. Generalities. Criteria

Math 209B Homework 2

PCA sets and convexity

Homework Assignment #2 for Prob-Stats, Fall 2018 Due date: Monday, October 22, 2018

REVIEW OF ESSENTIAL MATH 346 TOPICS

Optimal transportation on non-compact manifolds

Spaces with Ricci curvature bounded from below

Uniqueness of the solution to the Vlasov-Poisson system with bounded density

Mixed exterior Laplace s problem

CLASS NOTES FOR APRIL 14, 2000

Applied Analysis (APPM 5440): Final exam 1:30pm 4:00pm, Dec. 14, Closed books.

It follows from the above inequalities that for c C 1

ANSWER TO A QUESTION BY BURR AND ERDŐS ON RESTRICTED ADDITION, AND RELATED RESULTS Mathematics Subject Classification: 11B05, 11B13, 11P99

Measurable Choice Functions

b i (µ, x, s) ei ϕ(x) µ s (dx) ds (2) i=1

Dynkin (λ-) and π-systems; monotone classes of sets, and of functions with some examples of application (mainly of a probabilistic flavor)

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University

Analysis III Theorems, Propositions & Lemmas... Oh My!

Math 201 Topology I. Lecture notes of Prof. Hicham Gebran

Final. due May 8, 2012

GLUING LEMMAS AND SKOROHOD REPRESENTATIONS

Your first day at work MATH 806 (Fall 2015)

CHAPTER I THE RIESZ REPRESENTATION THEOREM

Polishness of Weak Topologies Generated by Gap and Excess Functionals

Solutions to Tutorial 8 (Week 9)

WEAK CURVATURE CONDITIONS AND FUNCTIONAL INEQUALITIES

Optimal Transport for Data Analysis

BERNARD HOST AND BRYNA KRA

A generic property of families of Lagrangian systems

Transcription:

SEPARABILITY AND COMPLETENESS FOR THE WASSERSTEIN DISTANCE FRANÇOIS BOLLEY Abstract. In this note we prove in an elementary way that the Wasserstein distances, which play a basic role in optimal transportation issues, turn some spaces of probability measures into separable complete metric spaces. introduction Let (, d) be a separable complete metric space and P be the set of Borel probability measures on. Given p > 0, the R {+ }-valued map W p defined on P P by W p (µ, ν) = inf π W p (µ, ν) = inf π ( d(x, y) p dπ(x, y) ) 1 p if 1 p d(x, y) p dπ(x, y) if 0 < p < 1, where π runs over the set of probability measures on with marginals µ and ν, defines a metric on the subset P p of measures µ in P such that d(x 0, x) p dµ(x) be finite for some (and hence any) x 0 in : it is called the Wasserstein distance of order p (see [1], [4], [5] or [6] for instance). These distances are strongly linked to the theory of optimal transportation and have been widely used in various applications to partial differential equations, functional inequalities and probability theory. Some of them involve probability measures on infinite dimensional spaces such as the Wiener space of R d -valued continuous functions on the interval [0, T ] (as in [3] for instance) or some sets of probability measures on a phase space; this is a motivation to the general framework considered in this note, in which we shall prove: Theorem. If (, d) is a separable complete metric space and p a positive number, then the metric space (P p, W p ) is separable and complete. The completeness property has been proven in [4] by comparing the W p distances with the weaker Prohorov distance for which the property is known, and in [1] by means of a deep result by Kolmogorov. Here we shall give a more direct and elementary argument. Let us actually note that these kind of properties can be studied as in [2] within the following broader scope of weighted spaces of probability measures. Let (, τ) be a topological space and ω be a real-valued continuous function on, bounded by below by a positive constant, and let P ω denote the set of Borel probability measures µ on such that ω(x) dµ(x) be finite. We equip P ω with the natural weak topology defined by the set C bω of real-valued continuous functions f on such that ω 1 f be bounded on : this topology, which will be denoted w-c bω, is defined by the seminorms µ sup f i (x) dµ(x) i=1,...,n 1

2 FRANÇOIS BOLLEY for any finite family f 1,..., f n of functions in C bω. Then one can prove that if the topological space (, τ) is separable (resp. separable and metrizable, resp. separable, metrizable and topologically complete), then so is (P ω, w-c bω ). Conversely if (P ω, w-c bω ) is separable (resp. separable, metrizable and topologically complete), then so is (, τ) if (, τ) is a priori metrizable. As in the case when ω = 1, that is, without weight, where they are known, these properties can be proven either by building some explicit distances on the considered spaces of probability measures, or by abstract functional methods as in [2]. In the case when (, d) is a separable complete metric space and ω = 1 + d(x 0, ) p for p > 0, then the w-c bω topology on the set P ω = P p is metrized by the distance W p : in particular (P p, W p ) is separable and topologically complete. The following two sections are devoted to a direct proof of the above theorem, which in particular ensures that (P p, W p ) is complete. 1. Separability In this section we prove that the metric space (P p, W p ) is separable if (, d) is a separable complete metric space and p is a positive number. If (x n ) n is a sequence dense in (, d) we actually prove that the countable set of measures of the form b n δ xn, where N is an integer number, the b n s are nonnegative rational numbers with unit sum and δ x stands for the point mass at x, is dense in (P p, W p ). Let indeed µ be a given measure in P p and ε be a given positive number. 1. We first approach µ by a measure µ 1 = numbers with a n = 1. a n δ xn where the a n s are nonnegative real For this we note that is covered by the balls B(x n, ε max(1,1/p) ) with centers x n and radius ε max(1,1/p), and is partitioned by the sets B n = B(x n, ε max(1,1/p) )\ B(x k, ε max(1,1/p) ), k n 1 so that the a n = µ[ B n ] have unit sum. Moreover sending each point in B n onto x n for each n defines a transport map between µ and µ 1 = consequently B n x x n p dµ(x) W p (µ, µ 1 ) ε. 2. Then we approach µ 1 by a measure µ 2 = rational numbers with b n = 1. a n δ xn with cost a n ε p max(1,1/p) = ε max(p,1) ; b n δ xn where the b n s are nonnegative

SEPARABILITY AND COMPLETENESS FOR THE WASSERSTEIN DISTANCE 3 First of all µ 1 belongs to P p since it is at finite W p distance from the measure µ in P p. Hence a n x n x 1 p = W p (µ 1, δ x1 ) max(p,1) is finite since δ x1 also belongs to P p. In particular there exists an integer N such that n=n+1 a n x n x 1 p ε max(p,1). For each 2 n N we now let b n be a nonnegative rational number such that and such that ( N ) 1an 0 a n b n ε max(p,1) a j x j x 1 p, b 1 = a 1 + j=1 (a n b n ) + n=2 n=n+1 be rational: in particular the b n s have unit sum. Moreover one can transport µ 1 onto µ 2 = b n δ xn by keeping a b n mass at x n for each n N and sending the remaining a n b n mass from x n onto x 1, and sending the whole a n mass from x n onto x 1 for each n N + 1; the associated cost is (a n b n ) x n x 1 p + a n x n x 1 p 2 ε max(p,1), so that n=n+1 W p (µ 1, µ 2 ) 2 ε. 3. To sum up we have approached µ by a measure µ 2 in P p, of the expected form and which is at most 3 ε distant in W p metric. a n 2. Completeness In this section we prove that the metric space (P p, W p ) is complete if (, d) is a separable complete metric space and p is a positive number. Let indeed (µ n ) n be a Cauchy sequence in (P p, W p ). 1. For p 1 we first prove that (µ n ) n is uniformly tight by adapting a classical proof of Ulam lemma. Let ε be a given positive number. We note that (µ n ) n is Cauchy in (P 1, W 1 ) since W 1 W p. Hence there exists N such that W 1 (µ n, µ N ) ε 2 for any n N so that, for any n, there exists j N such that W 1 (µ n, µ j ) ε 2. (1)

4 FRANÇOIS BOLLEY The finite family (µ j ) j N is uniformly tight by Ulam lemma, so there exist a compact set K such that µ j (K) 1 ε for any j N, whence q points x 1,..., x q in such that for any j N, where U = q B(x k, ε). µ j (U) 1 ε (2) Then let φ be the 1 ( ε -Lipschitz function defined on by φ(x) = d(x, U) ) +. 1 Given ε j and n, if π is any joint measure on with marginals µ j and µ n, then φ(x) dµ j (x) φ(y) dµ n (y) = dπ(x, y) (φ(x) φ(y)) 1 d(x, y) dπ(x, y); ε hence φ(x) dµ j (x) φ(y) dµ n (y) 1 ε W 1(µ j, µ n ). On the other hand 1 U φ 1 U ε where U ε = {x; d(x, U) < ε}, so φ(x) dµ j (x) µ j (U) and φ(y) dµ n (y) µ n (U ε ). Consequently µ n (U ε ) µ j (U) 1 ε W 1(µ j, µ n ). (3) Thus, by (1), (2) and (3), for any ε > 0 we have found q points x 1,..., x q such that for any n since U ε q B(x k, 2 ε). ( q µ n \ B(x k, 2 ε) ) 2 ε Therefore, replacing ε by ε 2 m 1 where m is any integer, there exist q(m) points x m 1,..., x m q(m) in such that for any n. In particular the set is such that for any n. µ n ( \ S) q(m) ( µ n \ m=1 S = + B ( x m k, ε 2 m)) ε 2 m q(m) m=1 q(m) ( µ n \ B ( x m k, ε 2 m) B ( x m k, ε 2 m)) m=1 ε 2 m = ε

SEPARABILITY AND COMPLETENESS FOR THE WASSERSTEIN DISTANCE 5 On the other hand, for any ρ, and choosing m such that ε 2 m ρ, the set S can be covered by the q(m) balls B(x m k, ε 2 m ) with radius ε 2 m ρ: in other words it is totally bounded, so that its closure S is compact since is complete. To sum up, the set S is compact and satisfies µ n ( \ S) ε for any n: this means that the sequence (µ n ) n is indeed uniformly tight. 2. We deduce from step 1 that (µ n ) n converges in (P p, W p ) in the case when p 1. Indeed (µ n ) n is uniformly tight by step 1, so by Prohorov theorem there exists a subsequence (µ n ) n of (µ n ) n converging to a probability measure µ on for the narrow weak topology. The distance W p (µ, µ n ) actually tends to 0 as n goes to infinity. Let indeed π n m be a probability measure on with marginals µ n and µ m, optimal in the sense that d(x, y) p dπ n m (x, y) = W p(µ n, µ m ) p. The sequence (µ n ) n is uniformly tight, hence so is (π n m ) n for given m. Thus by Prohorov theorem again there exists a subsequence (π n m ) n of (π n m ) n converging to a probability measure π m on for the narrow weak topology. Then by semicontinuity d(x, y) p dπ m (x, y) lim inf d(x, y) p dπ n n m (x, y) = lim inf W p(µ n, µ m ) p. + n + (4) But on one hand π n m has marginals µ n and µ m, so at the limit (in n ) π m has marginals µ and µ m ; hence W p (µ, µ m ) p d(x, y) p dπ m (x, y) (5) for any m. On the other hand the sequence (µ n ) n is Cauchy for the distance W p, so for any ε > 0 and n, m large enough W p (µ n, µ m ) ε. (6) It finally follows from (4), (5) and (6) that W p (µ, µ m ) ε for m large enough, which means that µ belongs to P p and that W p (µ, µ n ) indeed tends to 0 as n goes to infinity. Finally W p (µ n, µ) tends to 0 as n goes to infinity since the whole sequence (µ n ) n is Cauchy in (P p, W p ). 3. We deduce from step 2 that (µ n ) n converges in (P p, W p ) in the case when 0 < p < 1. Indeed d p is a distance on which defines the same topology as d and (, d p ) is complete if so is (, d). Moreover P p (, d) = P 1 (, d p ) and W p (, d) = W 1 (, d p ) in obvious notation. Thus, given an exponent p ]0, 1[ and a metric d on, the results associated with the exponent p and the metric d stem from the results proven in step 2 for the exponent 1 and the metric d p.

6 FRANÇOIS BOLLEY References [1] L. Ambrosio, N. Gigli, and G. Savaré. Gradient flows in metric spaces and in the spaces of probability measures. Birkhäuser, Basel, 2005. [2] F. Bolley. Applications du transport optimal à des problèmes de limites de champ moyen. Thèse de doctorat, Ecole Normale Supérieure de Lyon. Available at http://www.lsp.ups-tlse.fr/fp/bolley, 2005. [3] F. Bolley. Quantitative concentration inequalities on sample path space for mean field interaction. Preprint available at http://www.lsp.ups-tlse.fr/fp/bolley, 2005. [4] S. T. Rachev. Probability metrics and the stability of stochastic models. John Wiley and Sons, Chichester, 1991. [5] S. T. Rachev and L. Rüschendorf. Mass transportation problems. Vol I and II. Springer, New York, 1998. [6] C. Villani. Topics in optimal transportation, volume 58 of Grad. Stud. Math. AMS, Providence, 2003. Ecole Normale Supérieure de Lyon, Umpa (UMR 5669), 46 allée d Italie, F-69364 Lyon cedex 7 Current address: Institut de Mathématiques, LSP (UMR C5583), Université Paul Sabatier, Route de Narbonne, F-31062 Toulouse cedex 9 E-mail address: bolley@cict.fr