Introduction to optimal transport

Similar documents
Optimal Transportation and Economic Applications

FULL CHARACTERIZATION OF OPTIMAL TRANSPORT PLANS FOR CONCAVE COSTS

Optimal Transport for Applied Mathematicians

Introduction to Optimal Transport Theory

Optimal Transport And WGAN

Optimal Transport for Data Analysis

Optimal Transport: A Crash Course

arxiv: v1 [math.ap] 3 Sep 2014

Gradient Flow. Chang Liu. April 24, Tsinghua University. Chang Liu (THU) Gradient Flow April 24, / 91

Optimal Transportation. Nonlinear Partial Differential Equations

Convex analysis and profit/cost/support functions

The optimal partial transport problem

Moment Measures. Bo az Klartag. Tel Aviv University. Talk at the asymptotic geometric analysis seminar. Tel Aviv, May 2013

Local semiconvexity of Kantorovich potentials on non-compact manifolds

On a Class of Multidimensional Optimal Transportation Problems

A planning problem combining optimal control and optimal transport

Spaces with Ricci curvature bounded from below

arxiv: v1 [math.ca] 20 Sep 2010

Optimal Transport for Data Analysis

Math 742: Geometric Analysis

Lagrange duality. The Lagrangian. We consider an optimization program of the form

A description of transport cost for signed measures

Canonical Supermartingale Couplings

On the regularity of solutions of optimal transportation problems

Integration on Measure Spaces

Martingale optimal transport with Monge s cost function

A STRATEGY FOR NON-STRICTLY CONVEX TRANSPORT COSTS AND THE EXAMPLE OF x y p IN R 2

FACULTY OF SCIENCE FINAL EXAMINATION MATHEMATICS MATH 355. Analysis 4. Examiner: Professor S. W. Drury Date: Wednesday, April 18, 2007 INSTRUCTIONS

Representation of the polar cone of convex functions and applications

Stability results for Logarithmic Sobolev inequality

Duality (Continued) min f ( x), X R R. Recall, the general primal problem is. The Lagrangian is a function. defined by

II KLUWER ACADEMIC PUBLISHERS. Abstract Convexity and Global Optimization. Alexander Rubinov

OPTIMAL TRANSPORT AND SKOROKHOD EMBEDDING

Dynamic and Stochastic Brenier Transport via Hopf-Lax formulae on Was

Optimal transportation on non-compact manifolds

arxiv: v2 [math.ap] 23 Apr 2014

Analysis of a Mollified Kinetic Equation for Granular Media. William Thompson B.Sc., University of Victoria, 2014

Optimal and Better Transport Plans

1 Fourier Integrals of finite measures.

Optimization for Communications and Networks. Poompat Saengudomlert. Session 4 Duality and Lagrange Multipliers

Convex discretization of functionals involving the Monge-Ampère operator

Optimal Martingale Transport problem in higher dimensions

Lecture Notes Introduction to Ergodic Theory

The semi-geostrophic equations - a model for large-scale atmospheric flows

Geometry and Optimization of Relative Arbitrage

C 1 regularity of solutions of the Monge-Ampère equation for optimal transport in dimension two

Optimal and Better Transport Plans

ICS-E4030 Kernel Methods in Machine Learning

Optimal transportation and optimal control in a finite horizon framework

A new Hellinger-Kantorovich distance between positive measures and optimal Entropy-Transport problems

Applied Analysis (APPM 5440): Final exam 1:30pm 4:00pm, Dec. 14, Closed books.

L. Decreusefond. Optimal transportation on configurations spaces

Duality. Lagrange dual problem weak and strong duality optimality conditions perturbation and sensitivity analysis generalized inequalities

The Monge problem on non-compact manifolds

Shiqian Ma, MAT-258A: Numerical Optimization 1. Chapter 4. Subgradient

PARTIAL REGULARITY OF BRENIER SOLUTIONS OF THE MONGE-AMPÈRE EQUATION

Strong Duality for a Multiple Good Monopolist

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

Generalized Orlicz spaces and Wasserstein distances for convex concave scale functions

The local equicontinuity of a maximal monotone operator

Generative Models and Optimal Transport

Variational approach to mean field games with density constraints

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Density of Lipschitz functions and equivalence of weak gradients in metric measure spaces

DUALITY FOR BOREL MEASURABLE COST FUNCTIONS

Dual representations of risk measures

Optimal controls for gradient systems associated with grain boundary motions

A few words about the MTW tensor

Calculus of Variations. Final Examination

Moment Measures. D. Cordero-Erausquin 1 and B. Klartag 2

Nonlinear Programming

1 An Elementary Introduction to Monotone Transportation

Poincaré Inequalities and Moment Maps

Anisotropic congested transport

Asymptotics of minimax stochastic programs

1 Introduction. A SHARP STABILITY RESULT FOR THE RELATIVE ISOPERIMETRIC INEQUALITY INSIDE CONVEX CONES A. Figalli and E. Indrei. 1.

12. Interior-point methods

On Lagrangian solutions for the semi-geostrophic system with singular initial data

On Lagrangian solutions for the semi-geostrophic system with singular initial data

Weak KAM pairs and Monge-Kantorovich duality

I.3. LMI DUALITY. Didier HENRION EECI Graduate School on Control Supélec - Spring 2010

Nonparametric one-sided testing for the mean and related extremum problems

Lecture 4 Lebesgue spaces and inequalities

On the velocities of flows consisting of cyclically monotone maps

Math 5051 Measure Theory and Functional Analysis I Homework Assignment 3

Optimal transportation and action-minimizing measures

Interior Point Algorithms for Constrained Convex Optimization

Convex discretization of functionals involving the Monge-Ampère operator

Convex Optimization & Lagrange Duality

t y n (s) ds. t y(s) ds, x(t) = x(0) +

Lecture 2. We now introduce some fundamental tools in martingale theory, which are useful in controlling the fluctuation of martingales.

Relative Loss Bounds for Multidimensional Regression Problems

Constrained Optimization and Lagrangian Duality

On fuzzy entropy and topological entropy of fuzzy extensions of dynamical systems

CAPACITIES ON METRIC SPACES

Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control

The Lagrangian L : R d R m R r R is an (easier to optimize) lower bound on the original problem:

REGULARITY OF POTENTIAL FUNCTIONS IN OPTIMAL TRANSPORTATION. Centre for Mathematics and Its Applications The Australian National University

14. Duality. ˆ Upper and lower bounds. ˆ General duality. ˆ Constraint qualifications. ˆ Counterexample. ˆ Complementary slackness.

THEOREMS, ETC., FOR MATH 515

Transcription:

Introduction to optimal transport Nicola Gigli May 20, 2011

Content Formulation of the transport problem The notions of c-convexity and c-cyclical monotonicity The dual problem Optimal maps: Brenier s theorem

Content Formulation of the transport problem The notions of c-convexity and c-cyclical monotonicity The dual problem Optimal maps: Brenier s theorem

A notion: the push forward Let µ P(R d ) and T : R d R k a (Borel) map. The measure T # µ P(R k ) is defined by T # µ(a) := µ(t 1 )(A), for every (Borel) set A R k The measure T # µ is characterized by f dt # µ = f T dµ, for any (Borel) function f.

Monge s formulation of the transport problem Let µ, ν P(R d ) be given, and let c : R d R d be a cost function, say continuous. Problem: Minimize c(x, T (x))dµ(x), among all transport maps from µ to ν, i.e., among all maps T such that T # µ = ν

Why this is a bad formulation There are several issues with this formulation: it may be that no transport map exists at all (eg., if µ is a Delta and ν is not) the constraint T # µ = ν is not closed w.r.t. any reasonable weak topology

Kantorovich s formulation A measure γ P(R d R d ) is a transport plan from µ to ν if π 1 #γ = µ, π 2 #γ = ν. Problem Minimize c(x, y)dγ(x, y), among all transport plans from µ to ν.

Why this is a good formulation There always exists at least one transport plan: µ ν, Transport plans include transport maps: if T # µ = ν, then (Id, T ) # µ is a transport plan The set of transport plans is closed w.r.t. the weak topology of measures. The map γ c(x, y)dγ(x, y) is linear and weakly continuous,

Why this is a good formulation There always exists at least one transport plan: µ ν, Transport plans include transport maps: if T # µ = ν, then (Id, T ) # µ is a transport plan The set of transport plans is closed w.r.t. the weak topology of measures. The map γ c(x, y)dγ(x, y) is linear and weakly continuous, In particular, minima exist.

Now what? What can we say about optimal plans? In particular: Do they have any particular structure? If so, which one? Are they unique? Are they induced by maps?

Content Formulation of the transport problem The notions of c-cyclical monotonicity and c-convexity The dual problem Optimal maps: Brenier s theorem

A key example Let {x i } i, {y i } i, i = 1,..., N be points in R d and µ := 1 δ xi, N i ν := 1 δ yi. N i

A key example Let {x i } i, {y i } i, i = 1,..., N be points in R d and µ := 1 δ xi, N ν := 1 δ yi. N Then a plan γ is optimal iff for any n N, permutation σ of {1,..., n} and any {(x k, y k )} k=1,...,n supp(γ) it holds c(x k, y k ) c(x k, y σ(k) ) k k i i

The general definition We say that a set Γ R d R d is c-cyclically monotone if for any n N, permutation σ of {1,..., n} and any {(x k, y k )} k=1,...,n Γ it holds c(x k, y k ) c(x k, y σ(k) ) k k

First structural theorem Theorem A transport plan γ is optimal if and only if its support supp(γ) is c-cyclically monotone.

First structural theorem Theorem A transport plan γ is optimal if and only if its support supp(γ) is c-cyclically monotone. In particular, being optimal depends only on the support of γ, and not on how the mass is distributed on the support (!).

Content Formulation of the transport problem The notions of c-cyclical monotonicity and c-convexity The dual problem Optimal maps: Brenier s theorem

The dual formulation Given the measures µ, ν P(R d ) and the cost function c : R d R d R, maximize ϕdµ + ψdν, among all couples of functions ϕ, ψ : R d R such that ϕ(x) + ψ(y) c(x, y), x, y R d. We call such a couple of functions admissible potentials

A simple inequality Let γ be a transport plan from µ to ν and (ϕ, ψ) admissible potentials. Then c(x, y)dγ(x, y) ϕ(x) + ψ(y)dγ(x, y) = ϕ(x)dµ(x) + ψ(y)dν(y). Thus inf{transport problem} sup{dual problem}

A property of admissible potentials Say that (ϕ, ψ) are admissible potentials and define ϕ c (y) := inf c(x, y) ϕ(x). x Then ϕ c ψ and (ϕ, ϕ c ) are admissible as well.

A property of admissible potentials Say that (ϕ, ψ) are admissible potentials and define ϕ c (y) := inf c(x, y) ϕ(x). x Then ϕ c ψ and (ϕ, ϕ c ) are admissible as well. Similarly, we can define ψ c (x) := inf c(x, y) ψ(y), y so that ψ c ϕ and (ψ c, ψ) are admissible

The process stabilizes Starting from (ϕ, ψ), we can consider the admissible potentials (ϕ, ϕ c ), (ϕ cc, ϕ c ), (ϕ cc, ϕ ccc )...

The process stabilizes Starting from (ϕ, ψ), we can consider the admissible potentials (ϕ, ϕ c ), (ϕ cc, ϕ c ), (ϕ cc, ϕ ccc )... This process stops, because ϕ ccc = ϕ c.

The process stabilizes Starting from (ϕ, ψ), we can consider the admissible potentials (ϕ, ϕ c ), (ϕ cc, ϕ c ), (ϕ cc, ϕ ccc )... This process stops, because ϕ ccc = ϕ c. Indeed ϕ ccc (y) = inf x sup ỹ inf c(x, y) c(x, ỹ) + c( x, ỹ) ϕ( x), x and picking x = x we get ϕ ccc ϕ c, and picking ỹ = y we get ϕ ccc ϕ c.

c-concavity and c-superdifferential A function ϕ is c-concave if ϕ = ψ c for some function ψ. The c-superdifferential c ϕ R d R d is the set of (x, y) such that ϕ(x) + ϕ c (y) = c(x, y).

Second structural theorem For any c-concave function ϕ, the set c ϕ is c-cyclically monotone, indeed if {(x k, y k )} k c ϕ it holds c(x k, y k ) = ϕ(x k ) + ϕ c (y k ) k k = k ϕ(x k ) + ϕ c (y σ(k) ) k c(x k, y σ(k) )

Second structural theorem For any c-concave function ϕ, the set c ϕ is c-cyclically monotone, indeed if {(x k, y k )} k c ϕ it holds c(x k, y k ) = ϕ(x k ) + ϕ c (y k ) k k = k ϕ(x k ) + ϕ c (y σ(k) ) k c(x k, y σ(k) ) Actually much more holds: Theorem A set Γ is c-cyclically monotone iff Γ c ϕ for some ϕ c-concave.

To summarize Given µ, ν P(R d ) and a cost function c, for an admissible plan γ the following three are equivalent: γ is optimal supp(γ) is c-cyclically monotone supp(γ) c ϕ for some c-concave function ϕ

No duality gap It holds inf{transport problem} = sup{dual problem} Indeed, if γ is optimal, then supp(γ) c ϕ for some c-concave ϕ. Thus c(x, y)dγ(x, y) = ϕ(x) + ϕ c (y)dγ(x, y) = ϕdµ + ψdν

Content Formulation of the transport problem The notions of c-cyclical monotonicity and c-convexity The dual problem Optimal maps: Brenier s theorem

The case c(x, y) = x y 2 /2: c-concavity and convexity ϕ is c-concave iff ϕ(x) := x 2 /2 ϕ(x) is convex. Indeed: x y 2 ϕ(x) = inf ψ(y) y 2 x 2 y 2 ϕ(x) = inf + x, y + y 2 2 ψ(y) ( ) ϕ(x) x 2 y 2 2 = inf x, y + y 2 ψ(y) ( ) y 2 ϕ(x) = sup x, y y 2 ψ(y),

The case c(x, y) = x y 2 /2: c-superdifferential and subdifferential (x, y) c ϕ iff y ϕ(x).

The case c(x, y) = x y 2 /2: c-superdifferential and subdifferential (x, y) c ϕ iff y ϕ(x). Indeed: (x, y) c ϕ { ϕ(x) = x y 2 /2 ϕ c (y), ϕ(z) z y 2 /2 ϕ c (y), z R d { ϕ(x) x 2 /2 = x, y + y 2 /2 ϕ c (y), ϕ(z) z 2 /2 z, y + y 2 /2 ϕ c (y), ϕ(z) z 2 /2 ϕ(x) x 2 /2 + z x, y y + (ϕ 2 /2)(x) y ϕ(x) z R d z R d

Reminder: differentiability of convex functions Let ϕ : R d R be convex. Then for a.e. x, ϕ is differentiable at x. This is the same as to say that for a.e. x the set ϕ(x) has only one element.

Brenier s theorem: statement Let µ, ν P(R d ). Assume that µ L d. Then: there exists a unique transport plan this transport plan is induced by a map the map is the gradient of a convex function

Brenier s theorem: proof Pick an optimal plan γ.

Brenier s theorem: proof Pick an optimal plan γ. Then supp(γ) c ϕ for some c-concave function ϕ.

Brenier s theorem: proof Pick an optimal plan γ. Then supp(γ) c ϕ for some c-concave function ϕ. Then supp(γ) ϕ for some convex function ϕ.

Brenier s theorem: proof Pick an optimal plan γ. Then supp(γ) c ϕ for some c-concave function ϕ. Then supp(γ) ϕ for some convex function ϕ. Thus for γ-a.e. (x, y) it holds y ϕ(x).

Brenier s theorem: proof Pick an optimal plan γ. Then supp(γ) c ϕ for some c-concave function ϕ. Then supp(γ) ϕ for some convex function ϕ. Thus for γ-a.e. (x, y) it holds y ϕ(x). Therefore for µ-a.e. x there is only one y such that (x, y) supp(γ), and this y is given by y := ϕ(x).

Brenier s theorem: proof Pick an optimal plan γ. Then supp(γ) c ϕ for some c-concave function ϕ. Then supp(γ) ϕ for some convex function ϕ. Thus for γ-a.e. (x, y) it holds y ϕ(x). Therefore for µ-a.e. x there is only one y such that (x, y) supp(γ), and this y is given by y := ϕ(x). This is the same as to say that γ = (Id, ϕ) # µ.

Brenier s theorem: proof Pick an optimal plan γ. Then supp(γ) c ϕ for some c-concave function ϕ. Then supp(γ) ϕ for some convex function ϕ. Thus for γ-a.e. (x, y) it holds y ϕ(x). Therefore for µ-a.e. x there is only one y such that (x, y) supp(γ), and this y is given by y := ϕ(x). This is the same as to say that γ = (Id, ϕ) # µ. Now assume that γ is another optimal plan. Then the plan (γ + γ)/2 would be optimal and not induced by a map.