Distance-Divergence Inequalities
|
|
- Simon Walter Parrish
- 5 years ago
- Views:
Transcription
1 Distance-Divergence Inequalities Katalin Marton Alfréd Rényi Institute of Mathematics of the Hungarian Academy of Sciences
2 Motivation To find a simple proof of the Blowing-up Lemma, proved by Ahlswede, Gács and Körner, with the aim to prove Strong Converses in Shannon theory; Ornstein s Copying Lemma in the proof of his Isomorphism Theorem; My experience with Single-Letter Characterization techniques in Shannon theory.
3 What does measure concentration mean? Enlargements and distances of sets Notation (X, d, q): a metric probability space. More precisely, Assume: X is a Polish space, with the Borel σ-algebra; d is a Borel measurable distance on X ; q is a Borel probability measure on X. Subsets of X are assumed to be measurable.
4 Notation For A X and κ > 0: The κ-enlargement (or κ-neighborhood) of the set A is { } [A] κ = x X : y A, d(x, y) κ. Informal definition of Measure Concentration: The a metric probability space (X, d, q) satisfies the measure concentration property if: For every ε > 0 there exists a κ = κ(ε) such that q(a) ε = q([a] κ ) 1 ε, A X, and κ(ε) can be overbounded in a non-trivial way.
5
6 By the equality d ( A, [A] c κ) = κ : Equivalent definition The a metric probability space (X, d, q) satisfies the measure concentration property if: There is a function κ : R + R + such that: For every A, B X d(a, B) κ ( q(a) ) + κ ( q(b) ), A, B X, and κ can be overbounded in a non-trivial way.
7 Sub-Gaussian measure concentration κ(ε) = c(q) log 1 ( ( ) κ 2 ) ε(κ) = exp ε c(q) I.e.: d(a, B) c(q) [ log ] 1 q(a) + 1 log, A, B X. q(b)
8 Example: Hamming distance, C. McDiarmid 1989, Marton 1996 X n : n-th power of a Borel space X ; δ n (x n, y n ) = n i=1 δ(x i, y i ): Hamming distance on X n ; x n = (x 1, x 2,..., x n ); q n = n i=1 q i: product measure on X n. Then, taking neighborhood with respect to Hamming distance: q n ([A] κ ) 1 ( ( ) ) κ exp 2n n 1 2n log 1 2 q n, κ, A X n. (A)
9 Cont d q n ([A] κ ) 1 ( ( ) ) κ exp 2n n 1 2n log 1 2 q n, κ, A X n. (A) Equivalently: 1 δ n (A, B) 2n log 1 q n (A) + 1 2n log 1 q n (B), A, B X n Thus, for Hamming distance δ n 1 κ(ε) = 2n log 1 ε.
10 Cont d q n ([A] κ ) 1 ( ( ) ) κ exp 2n n 1 2n log 1 2 q n, κ, A X n. (A) Equivalently: 1 δ n (A, B) 2n log 1 q n (A) + 1 2n log 1 q n (B), A, B X n Thus, for Hamming distance δ n 1 κ(ε) = 2n log 1 ε.
11 Example: Euclidean distance, Talagrand 1996 R n : n-dimensional Euclidean space; e n : Euclidean distance; q n = n i=1 q i = n i=1 exp( V i): product density on R n. Assume: V i : R R is strictly uniformly convex: V i (x) ρ > 0, x. Then: e n (A, B) 2 ρ log 1 q n (A) + 2 ρ log 1 q n (B), A, B X n.
12 Example: Euclidean distance, Talagrand 1996 R n : n-dimensional Euclidean space; e n : Euclidean distance; q n = n i=1 q i = n i=1 exp( V i): product density on R n. Assume: V i : R R is strictly uniformly convex: V i (x) ρ > 0, x. Then: e n (A, B) 2 ρ log 1 q n (A) + 2 ρ log 1 q n (B), A, B X n.
13 Cont d Assume: V i : R R is strictly uniformly convex: V i (x) ρ > 0, x. Then: e n (A, B) 2 ρ log 1 q n (A) + 2 ρ log 1 q n (B), A, B X n. Thus, for Euclidean distance, and for product density q n = n i=1 exp( V i) satisfying V i ρ > 0, 2 κ(ε) = ρ log 1, independently of n. ε
14 Cont d Assume: V i : R R is strictly uniformly convex: V i (x) ρ > 0, x. Then: e n (A, B) 2 ρ log 1 q n (A) + 2 ρ log 1 q n (B), A, B X n. Thus, for Euclidean distance, and for product density q n = n i=1 exp( V i) satisfying V i ρ > 0, 2 κ(ε) = ρ log 1, independently of n. ε
15 Distance-Divergence Inequalities: A way to prove measure concentration Notation P(X ): the set of probability measures on X. q is a distinguished element of P(X ). Definition: set ofcouplings Π(p, r) For p, r P(X ): Π(p, r) set of all couplings of p and r set of all joint probability distributions π = L(Y, Z) on X X, with marginals: p = L(Y ) and r = L(Z).
16 Definition: Wasserstein distance Fix a β, 1 β 2. For p, r P(X ) : W β (p, r) = W (d) β (p, r) { ( E π d(y, Z) β) } 1/β : L(Y ) = p, L(Z) = r,. inf π Π(p,r) W β (p, r) is the Wasserstein distance, of order β, of p and r. Claim W β is a distance on P(X ). (May take the value.)
17 Definition: Wasserstein distance Fix a β, 1 β 2. For p, r P(X ) : W β (p, r) = W (d) β (p, r) { ( E π d(y, Z) β) } 1/β : L(Y ) = p, L(Z) = q,. inf π Π(p,r) W β (p, r) is the Wasserstein distance, of order β, of p and r. Example: Variational distance δ: Kronecker s distance: δ(x, y) = 1 for x y, δ(x, x) = 0. W (δ) 1 (p, r) p r T V = min E π δ(y, Z) ( ) = min P r π {Y Z} = max p(a) r(a), A X where π = L(Y, Z) Π(p, r).
18 Definition: Wasserstein distance Fix a β, 1 β 2. For p, r P(X ) : W β (p, r) = W (d) β (p, r) { ( E π d(y, Z) β) } 1/β : L(Y ) = p, L(Z) = q,. inf π Π(p,r) W β (p, r) is the Wasserstein distance, of order β, of p and r. Example: Variational distance δ: Kronecker s distance: δ(x, y) = 1 for x y, δ(x, x) = 0. W (δ) 1 (p, r) p r T V = min E π δ(y, Z) ( ) = min P r π {Y Z} = max p(a) r(a), A X where π = L(Y, Z) Π(p, r).
19 Definition: (Informational) Divergence For p, q P(X ) : ( dp ) D(p q) log dp dq if p q; + otherwise. X Definition: Distance-Divergence Inequality q P(X ) satisfies a distance-divergence inequality of order β, with constant ρ, if: 2 W β (p, q) D(p q) for all p P(X ). ρ Most important cases: β = 1 and β = 2.
20 Definition: Distance-Divergence Inequality q P(X ) satisfies a distance-divergence inequality of order β, with constant ρ, if: 2 W β (p, q) D(p q) for all p P(X ). ρ Example: Pinsker-Csiszár-Kullback inequality p q T V 1 D(p q) p, q P(X ). 2 This distance-divergence inequality holds with the same constant ρ = 4 for every q.
21 Definition: Distance-Divergence Inequality q P(X ) satisfies a distance-divergence inequality of order β, with constant ρ, if: 2 W β (p, q) D(p q) for all p P(X ). ρ Example: Pinsker-Csiszár-Kullback inequality p q T V 1 D(p q) p, q P(X ). 2 This distance-divergence inequality holds with the same constant ρ = 4 for every q.
22 Notation W 2 : Wasserstein distance on P(R) from the Euclidean distance, i.e.: For densities p and r on R W 2 (p, r) = W (e) 2 (p, r) { ( = inf E π Y Z 2) } 1/2 : L(Y ) = p, L(Z) = q. π Π(p,r)
23 M. Talagrand s Theorem on R, 1996 Assume: q(x) = exp( V (x)) strictly uniformly log-concave density on R, i.e., V (x) ρ > 0, x. E.g., for q normal, ρ = 1/Variance. Then: W 2 (p, r) 2 ρ D(p q), p P(R).
24 The proof of Talagrand s Theorem is based on the unique monotone map T that takes the measure p on R to the measure q (provided both measures are absolutely continuous): x q(t)dt = T (x) p(t)dt. A monotone map taking a given density p to the density q exists also on R n. (Proved, e.g., by Y. Brenier 1991 or R. McCann 1994.)
25 Talagrand s Theorem on R n Assume: q(x) = exp( V (x)): strictly uniformly log-concave density on R n : HessV (x) ρ Id x, ρ > 0. Then: W 2 (p, q) 2 ρ D(p q) p P(R). W 2 denotes the Wasserstein distance on P(R n ) derived from the Euclidean distance. Proved e.g. by S. Bobkov - M. Ledoux 2000, F. Otto - C. Villani 2000, D. Cordero-Erausquin
26 Lemma: Distance-Divergence Inequality implies Measure Concentration, Marton 1986, 1996 (X, d, q): metric probability space, W (d) β Wasserstein distance of order β associated with d. Then: W (d) 2 β (p, q) D(p q), p P(X ) ρ = d(a, B) 2 ρ log 1 q(a) + 2 ρ log 1 q(b), A, B X.
27 Proof Define Triangle inequality q(a C) p = q A : p(c) =, q(a) q(b C) r = q B : r(c) =. q(b) + Distance-Divergence inequality: d(a, B) W β (p, r) W β (p, q) + W β (q, r) 2 2 ρ D(p q) + D(r q) ρ β [1, 2].
28 Cont d d(a, B) 2 2 ρ D(p q) + ρ D(r q). Further: and similarly p(x) q(x) = 1 q(a) = D(p q) = log p almost everywhere 1 q(a), D(r q) = log 1 q(b).
29 Product spaces Definition: Distances in Product Spaces Given the metric d on X, define distances on X n : ( n ) 1/β d β,n (x n, y n ) = d(x i, y i ) β, 1 β 2. i=1 Hamming distance: from Kronecker s distance, with β = 1, Euclidean distance: from one-dimensional Euclidean distance, with β = 2.
30 Definition: Wasserstein Distances in Product Spaces The distance d β,n on X n gives rise to the Wasserstein distance W β,n = W (d) β,n on P(X n ): W β,n (p n, r n ) = W (d) β,n (pn, r n ) { ( n = inf E π d(y i, Z i ) β) 1/β : π Π(p n,r n ) i=1 L(Y n ) = p n, L(Z n ) = r n }.
31 Lemma: Tensorisation of Distance-Divergence inequalities, Marton 1996, M. Talagrand 1996 q n = n i=1 q i: product measure on X n. Then: 2 W β (p, q i ) ρ D(p q i), i, p P(X ) = Thus: W β,n (p n, q n ) 2n 2/β 1 ρ D(p n q n ), p n P(X n ). For Hamming distance: n-fold product measures satisfy a distance-divergence inequality with the same constant 4/n. For Euclidean distance: n-fold product measures satisfy distance-divergence inequalities with the smallest of the constants valid for the factors, independently of n.
32 Lemma: Tensorisation of Distance-Divergence inequalities, Marton 1996, M. Talagrand 1996 q n = n i=1 q i: product measure on X n. Then: 2 W β (p, q i ) ρ D(p q i), i, p P(X ) = Thus: W β,n (p n, q n ) 2n 2/β 1 ρ D(p n q n ), p n P(X n ). For Hamming distance: n-fold product measures satisfy a distance-divergence inequality with the same constant 4/n. For Euclidean distance: n-fold product measures satisfy distance-divergence inequalities with the smallest of the constants valid for the factors, independently of n.
33 Proof From couplings between q i and the conditional distributions p i ( y i 1 ) L(Y i Y 1 = y 1,..., Y i 1 = y i 1 ) construct good couplings between measures p n and q n on X n : Successively define L ( Y i, X i Y i 1 = y i 1, X i 1 = x i 1) best coupling of p i ( y i 1 ) and q i.
34 Cont d We have defined a joint distribution π = L(Y n, X n ) with marginals p n = L(Y n ), q n = L(X n ). E π δ n (Y n, X n ) = = n P r π {Y i X i } i=1 n E { Y i X i Y i 1} i=1 n i=1 n 1 = 2 D(Y i Y i 1 qi ) i=1 n n 2 D(Y i Y i 1 qi ) = i=1 n 2 D(pn q n ). 1 2 D( Y i Y i 1 qi ( Y i 1 ) )
35 Distance-Divergence Inequalities and Large Deviation type Inequalities S. Bobkov and Götze s theorem, 1999 (X, d, q): metric probability space. The following two statements are equivalent: (i) W (d) 1 (p, q) (ii) X 2 D(p q) ρ for all p P(X ), ) ( t e tf(x) 2 dq(x) exp 2ρ + t E qf for all Lipschitz functions f : X R with Lipschitz coefficient 1, and all t > 0.
36 Remark Bobkov and Götze actually proved that: For any fixed function f : X R ( ) t e tf(x) 2 dq(x) exp 2ρ + t E qf X = Ep f E q f 2 D(p q) ρ for all p P(X ).
37 Distance-Divergence Inequalities and Large Deviation type Inequalities N. Gozlan s theorem, 2009 (X, d, q): metric probability space, q n : n-th power of q. The following two statements are equivalent: (i) W (d) 2 (p, q) (ii) q n {x n : f(x n ) Ef + r 2 D(p q) ρ for all p P(X ), } ) ( ρr2 exp for all n, all Lipschitz functions f : X n R with Lipschitz coefficient 1 (with respect to distance d 2,n ), and all r > 0. 2
38 Measure Concentration for Non-product Measures: Contracting Markov Chains, Marton 1996 Assume: q 1 : probability measure on the Borel space X ; q i ( x): Markov kernels, X X, Contracting: q n on X n : Then: q i ( x) q i ( y) T V (1 β), β > 0, x, y; q n (x n ) q 1 (x 1 ) n q i (x i x i 1 ), x n X n. i=2 W (δ) 1,n (pn, q n ) 1 β 1 2n D(pn q n ), p n P(X ).
39 Measure Concentration for Non-product Measures X n : n-th power of a Borel space X ; q n : Borel probability measure on X n ; X n : random sequence with L(X n ) = q n ; p n : another probability measure on X n ; Y n : random sequence in X n, L(Y n ) = p n. Notation For y n X n and 1 i n : y i (y 1, y 2,..., y i ) and y n i (y i+1, y i+2,..., y n ); p i ( y i 1 ) L(Y i Y i 1 = y i 1 ); p n i ( y i ) L(Y n i Y i = y i );
40 Definition: Measures admitting coupling with bounded distance q n admits coupling with distance bounded by constant C if: For every i n 1 and every pair of sequences y i, z i X i such that y i 1 = z i 1, y i z i, there exists a coupling of q n i ( yi ) and q n i ( zi ): π n i ( y i, z i ) = L(Y n i, Z n i Y i = y i, Z i = z i ) satisfying E π n i { n j=i+1 δ(y j, Z j ) } y i, z i ) C.
41 Theorem: Measure Concentration for random processes, Marton 1998 Assume: q n admits coupling with distance bounded by the constant C. Then W (δ) n 1,n (pn, q n ) (C + 1) 2 D(pn q n ). Thus: For measures admitting couplings with bounded distance, the distance-divergence inequality is worse by a constant factor (only) than the inequality for product measures. The Theorem applies for segments of sufficiently mixing stationary random processes.
42 Measure Concentration for Gibbs Measures Notation q n : Borel probability measure on X n ; X n : random sequence, L(X n ) = q n ; For y n X n : ȳ i (y 1, y 2,..., y i 1, y i+1,..., y n ); Q i ( ȳ i ) L(X i X i = ȳ i ). The conditional distributions Q i ( ȳ i ) are called the local specifications of q n.
43 (Non-standard) Definition: Gibbs Measures q n, when defined by the local specifications Q i ( ȳ i ), is called a Gibbs measure.
44 Definition: Dobrushin s Uniqueness Condition q n satisfies Dobrushin s uniqueness condition if: there exist numbers a i,k 0, (i, k [1, n], i k), such that: (i) For any i, k, i k, and any ȳ k, z k X n 1, differing only in the i-th coordinate: Q k ( ȳ k ) Q k ( z k ) T V a i,k ; (ii) For the matrix A ( a i,k ) n i,k=1, where a i,i = 0, we have A = max k n a i,k < 1. The numbers a i,k are called Dobrushin s interdependence coefficients. i=1
45 Definition: Gibbs Sampler Gibbs sampler associated with the local specifications Q i ( ȳ i ): Markov chain with state space X n and transition law Γ(z n y n ) defined as follows: Given y n X n, (i) select a random index i [1, n] independently of y n and uniformly distributed; (ii) Set Γ(z n y n ) = δȳi, z i Q i (z i ȳ i ). Dobrushin s uniqueness condition implies that the Gibbs sampler Γ is a contraction with respect to the distance W (δ) W (δ) 1,n (pn Γ, r n Γ) ( 1 1 A n ) W (δ) 1,n (pn, r n ). 1,n :
46 Definition: Gibbs Sampler Gibbs sampler associated with the local specifications Q i ( ȳ i ): Markov chain with state space X n and transition law Γ(z n y n ) defined as follows: Given y n X n, (i) select a random index i [1, n] independently of y n and uniformly distributed; (ii) Set Γ(z n y n ) = δȳi, z i Q i (z i ȳ i ). Dobrushin s uniqueness condition implies that the Gibbs sampler Γ is a contraction with respect to the distance W (δ) W (δ) 1,n (pn Γ, r n Γ) ( 1 1 A n ) W (δ) 1,n (pn, r n ). 1,n :
47 Theorem, Li Ming Wu 2006 If q n satisfies Dobrushin s uniqueness condition then W (δ) 1,n (pn, q n 1 n ) 1 A 2 D(pn q n ) for all p n P(X ) n. Remark A similar theorem holds for Gibbs samplers with state space R n and W (e) 2,n. (Wiith the appropriate modification of Dobrushin s uniqueness condition.) Proved by Marton 2004, 2009.
48 Theorem, Li Ming Wu 2006 If q n satisfies Dobrushin s uniqueness condition then W (δ) 1,n (pn, q n 1 n ) 1 A 2 D(pn q n ) for all p n P(X ) n. Remark A similar theorem holds for Gibbs samplers with state space R n and W (e) 2,n. (Wiith the appropriate modification of Dobrushin s uniqueness condition.) Proved by Marton 2004, 2009.
49 Rate of decrease of divergence for Gibbs samplers Theorem: The Gibbs sampler Decreases Divergence by a rate < 1, Marton, in preparation Assume (i) α inf x n,i Q i (x i x i ) > 0; (ii) q n satisfies the L 2 -version of Dobrushin s uniqueness condition: For the matrix of Dobrushin s interdependence coefficients A 2 < 1. Then: ( α (1 A ) 2 ) 2 D(p n Γ q n ) 1 2n D(p n q n ).
50 Cont d The proof uses the following "Converse" of Pinsker s lemma Let q be a probability measure on a finite set X, and assume: α min x q(x) > 0. Then: D(p q) 4 α p q 2 T V, p P(X ).
51 Corollary D(p n Γ t q n ) 0 exponentially, as t.
52 Distances between measures that are not Wasserstein distances Example: X n : product Borel space, p n and r n : Borel probability measures on X n. W (p n, r n ) min n } P rπ{ 2 Yi Z i. π=l(y n,z n ) Π(p n,r n ) i=1
Logarithmic Sobolev inequalities in discrete product spaces: proof by a transportation cost distance
Logarithmic Sobolev inequalities in discrete product spaces: proof by a transportation cost distance Katalin Marton Alfréd Rényi Institute of Mathematics of the Hungarian Academy of Sciences Relative entropy
More informationContents 1. Introduction 1 2. Main results 3 3. Proof of the main inequalities 7 4. Application to random dynamical systems 11 References 16
WEIGHTED CSISZÁR-KULLBACK-PINSKER INEQUALITIES AND APPLICATIONS TO TRANSPORTATION INEQUALITIES FRANÇOIS BOLLEY AND CÉDRIC VILLANI Abstract. We strengthen the usual Csiszár-Kullback-Pinsker inequality by
More informationRefined Bounds on the Empirical Distribution of Good Channel Codes via Concentration Inequalities
Refined Bounds on the Empirical Distribution of Good Channel Codes via Concentration Inequalities Maxim Raginsky and Igal Sason ISIT 2013, Istanbul, Turkey Capacity-Achieving Channel Codes The set-up DMC
More informationConcentration inequalities: basics and some new challenges
Concentration inequalities: basics and some new challenges M. Ledoux University of Toulouse, France & Institut Universitaire de France Measure concentration geometric functional analysis, probability theory,
More informationLogarithmic Sobolev Inequalities
Logarithmic Sobolev Inequalities M. Ledoux Institut de Mathématiques de Toulouse, France logarithmic Sobolev inequalities what they are, some history analytic, geometric, optimal transportation proofs
More informationx log x, which is strictly convex, and use Jensen s Inequality:
2. Information measures: mutual information 2.1 Divergence: main inequality Theorem 2.1 (Information Inequality). D(P Q) 0 ; D(P Q) = 0 iff P = Q Proof. Let ϕ(x) x log x, which is strictly convex, and
More informationLECTURE 15: COMPLETENESS AND CONVEXITY
LECTURE 15: COMPLETENESS AND CONVEXITY 1. The Hopf-Rinow Theorem Recall that a Riemannian manifold (M, g) is called geodesically complete if the maximal defining interval of any geodesic is R. On the other
More informationFokker-Planck Equation on Graph with Finite Vertices
Fokker-Planck Equation on Graph with Finite Vertices January 13, 2011 Jointly with S-N Chow (Georgia Tech) Wen Huang (USTC) Hao-min Zhou(Georgia Tech) Functional Inequalities and Discrete Spaces Outline
More informationcurvature, mixing, and entropic interpolation Simons Feb-2016 and CSE 599s Lecture 13
curvature, mixing, and entropic interpolation Simons Feb-2016 and CSE 599s Lecture 13 James R. Lee University of Washington Joint with Ronen Eldan (Weizmann) and Joseph Lehec (Paris-Dauphine) Markov chain
More informationConcentration inequalities and the entropy method
Concentration inequalities and the entropy method Gábor Lugosi ICREA and Pompeu Fabra University Barcelona what is concentration? We are interested in bounding random fluctuations of functions of many
More informationCOMMENT ON : HYPERCONTRACTIVITY OF HAMILTON-JACOBI EQUATIONS, BY S. BOBKOV, I. GENTIL AND M. LEDOUX
COMMENT ON : HYPERCONTRACTIVITY OF HAMILTON-JACOBI EQUATIONS, BY S. BOBKOV, I. GENTIL AND M. LEDOUX F. OTTO AND C. VILLANI In their remarkable work [], Bobkov, Gentil and Ledoux improve, generalize and
More informationBest approximations in normed vector spaces
Best approximations in normed vector spaces Mike de Vries 5699703 a thesis submitted to the Department of Mathematics at Utrecht University in partial fulfillment of the requirements for the degree of
More informationDiscrete Ricci curvature via convexity of the entropy
Discrete Ricci curvature via convexity of the entropy Jan Maas University of Bonn Joint work with Matthias Erbar Simons Institute for the Theory of Computing UC Berkeley 2 October 2013 Starting point McCann
More informationConcentration Inequalities for Dependent Random Variables via the Martingale Method
Concentration Inequalities for Dependent Random Variables via the Martingale Method Leonid Kontorovich, and Kavita Ramanan Carnegie Mellon University L. Kontorovich School of Computer Science Carnegie
More informationDisplacement convexity of the relative entropy in the discrete h
Displacement convexity of the relative entropy in the discrete hypercube LAMA Université Paris Est Marne-la-Vallée Phenomena in high dimensions in geometric analysis, random matrices, and computational
More informationStatistics & Data Sciences: First Year Prelim Exam May 2018
Statistics & Data Sciences: First Year Prelim Exam May 2018 Instructions: 1. Do not turn this page until instructed to do so. 2. Start each new question on a new sheet of paper. 3. This is a closed book
More informationStein s method, logarithmic Sobolev and transport inequalities
Stein s method, logarithmic Sobolev and transport inequalities M. Ledoux University of Toulouse, France and Institut Universitaire de France Stein s method, logarithmic Sobolev and transport inequalities
More informationLecture 35: December The fundamental statistical distances
36-705: Intermediate Statistics Fall 207 Lecturer: Siva Balakrishnan Lecture 35: December 4 Today we will discuss distances and metrics between distributions that are useful in statistics. I will be lose
More informationNEW FUNCTIONAL INEQUALITIES
1 / 29 NEW FUNCTIONAL INEQUALITIES VIA STEIN S METHOD Giovanni Peccati (Luxembourg University) IMA, Minneapolis: April 28, 2015 2 / 29 INTRODUCTION Based on two joint works: (1) Nourdin, Peccati and Swan
More informationTwo Applications of the Gaussian Poincaré Inequality in the Shannon Theory
Two Applications of the Gaussian Poincaré Inequality in the Shannon Theory Vincent Y. F. Tan (Joint work with Silas L. Fong) National University of Singapore (NUS) 2016 International Zurich Seminar on
More informationA concentration theorem for the equilibrium measure of Markov chains with nonnegative coarse Ricci curvature
A concentration theorem for the equilibrium measure of Markov chains with nonnegative coarse Ricci curvature arxiv:103.897v1 math.pr] 13 Mar 01 Laurent Veysseire Abstract In this article, we prove a concentration
More informationKatalin Marton. Abbas El Gamal. Stanford University. Withits A. El Gamal (Stanford University) Katalin Marton Withits / 9
Katalin Marton Abbas El Gamal Stanford University Withits 2010 A. El Gamal (Stanford University) Katalin Marton Withits 2010 1 / 9 Brief Bio Born in 1941, Budapest Hungary PhD from Eötvös Loránd University
More informationCourse 212: Academic Year Section 1: Metric Spaces
Course 212: Academic Year 1991-2 Section 1: Metric Spaces D. R. Wilkins Contents 1 Metric Spaces 3 1.1 Distance Functions and Metric Spaces............. 3 1.2 Convergence and Continuity in Metric Spaces.........
More informationAnalysis Finite and Infinite Sets The Real Numbers The Cantor Set
Analysis Finite and Infinite Sets Definition. An initial segment is {n N n n 0 }. Definition. A finite set can be put into one-to-one correspondence with an initial segment. The empty set is also considered
More informationCS229T/STATS231: Statistical Learning Theory. Lecturer: Tengyu Ma Lecture 11 Scribe: Jongho Kim, Jamie Kang October 29th, 2018
CS229T/STATS231: Statistical Learning Theory Lecturer: Tengyu Ma Lecture 11 Scribe: Jongho Kim, Jamie Kang October 29th, 2018 1 Overview This lecture mainly covers Recall the statistical theory of GANs
More informationWasserstein GAN. Juho Lee. Jan 23, 2017
Wasserstein GAN Juho Lee Jan 23, 2017 Wasserstein GAN (WGAN) Arxiv submission Martin Arjovsky, Soumith Chintala, and Léon Bottou A new GAN model minimizing the Earth-Mover s distance (Wasserstein-1 distance)
More informationh(x) lim H(x) = lim Since h is nondecreasing then h(x) 0 for all x, and if h is discontinuous at a point x then H(x) > 0. Denote
Real Variables, Fall 4 Problem set 4 Solution suggestions Exercise. Let f be of bounded variation on [a, b]. Show that for each c (a, b), lim x c f(x) and lim x c f(x) exist. Prove that a monotone function
More informationDiscrete Ricci curvature: Open problems
Discrete Ricci curvature: Open problems Yann Ollivier, May 2008 Abstract This document lists some open problems related to the notion of discrete Ricci curvature defined in [Oll09, Oll07]. Do not hesitate
More informationStability results for Logarithmic Sobolev inequality
Stability results for Logarithmic Sobolev inequality Daesung Kim (joint work with Emanuel Indrei) Department of Mathematics Purdue University September 20, 2017 Daesung Kim (Purdue) Stability for LSI Probability
More informationConcentration of Measure with Applications in Information Theory, Communications, and Coding
Concentration of Measure with Applications in Information Theory, Communications, and Coding Maxim Raginsky (UIUC) and Igal Sason (Technion) A Tutorial, Presented at 2015 IEEE International Symposium on
More informationHeat Flows, Geometric and Functional Inequalities
Heat Flows, Geometric and Functional Inequalities M. Ledoux Institut de Mathématiques de Toulouse, France heat flow and semigroup interpolations Duhamel formula (19th century) pde, probability, dynamics
More informationN. GOZLAN, C. ROBERTO, P-M. SAMSON
FROM DIMENSION FREE CONCENTRATION TO THE POINCARÉ INEQUALITY N. GOZLAN, C. ROBERTO, P-M. SAMSON Abstract. We prove that a probability measure on an abstract metric space satisfies a non trivial dimension
More informationWeek 5 Lectures 13-15
Week 5 Lectures 13-15 Lecture 13 Definition 29 Let Y be a subset X. A subset A Y is open in Y if there exists an open set U in X such that A = U Y. It is not difficult to show that the collection of all
More informationStein s Method: Distributional Approximation and Concentration of Measure
Stein s Method: Distributional Approximation and Concentration of Measure Larry Goldstein University of Southern California 36 th Midwest Probability Colloquium, 2014 Concentration of Measure Distributional
More informationGeneralized Orlicz spaces and Wasserstein distances for convex concave scale functions
Bull. Sci. math. 135 (2011 795 802 www.elsevier.com/locate/bulsci Generalized Orlicz spaces and Wasserstein distances for convex concave scale functions Karl-Theodor Sturm Institut für Angewandte Mathematik,
More informationEntropy and Ergodic Theory Lecture 17: Transportation and concentration
Entropy and Ergodic Theory Lecture 17: Transportation and concentration 1 Concentration in terms of metric spaces In this course, a metric probability or m.p. space is a triple px, d, µq in which px, dq
More informationJune 21, Peking University. Dual Connections. Zhengchao Wan. Overview. Duality of connections. Divergence: general contrast functions
Dual Peking University June 21, 2016 Divergences: Riemannian connection Let M be a manifold on which there is given a Riemannian metric g =,. A connection satisfying Z X, Y = Z X, Y + X, Z Y (1) for all
More informationPhenomena in high dimensions in geometric analysis, random matrices, and computational geometry Roscoff, France, June 25-29, 2012
Phenomena in high dimensions in geometric analysis, random matrices, and computational geometry Roscoff, France, June 25-29, 202 BOUNDS AND ASYMPTOTICS FOR FISHER INFORMATION IN THE CENTRAL LIMIT THEOREM
More informationUniversity of Toronto Department of Statistics
Norm Comparisons for Data Augmentation by James P. Hobert Department of Statistics University of Florida and Jeffrey S. Rosenthal Department of Statistics University of Toronto Technical Report No. 0704
More informationEC 521 MATHEMATICAL METHODS FOR ECONOMICS. Lecture 1: Preliminaries
EC 521 MATHEMATICAL METHODS FOR ECONOMICS Lecture 1: Preliminaries Murat YILMAZ Boğaziçi University In this lecture we provide some basic facts from both Linear Algebra and Real Analysis, which are going
More informationImmerse Metric Space Homework
Immerse Metric Space Homework (Exercises -2). In R n, define d(x, y) = x y +... + x n y n. Show that d is a metric that induces the usual topology. Sketch the basis elements when n = 2. Solution: Steps
More informationRandomized Quantization and Optimal Design with a Marginal Constraint
Randomized Quantization and Optimal Design with a Marginal Constraint Naci Saldi, Tamás Linder, Serdar Yüksel Department of Mathematics and Statistics, Queen s University, Kingston, ON, Canada Email: {nsaldi,linder,yuksel}@mast.queensu.ca
More informationECE598: Information-theoretic methods in high-dimensional statistics Spring 2016
ECE598: Information-theoretic methods in high-dimensional statistics Spring 06 Lecture : Mutual Information Method Lecturer: Yihong Wu Scribe: Jaeho Lee, Mar, 06 Ed. Mar 9 Quick review: Assouad s lemma
More informationBOUNDS ON THE DEFICIT IN THE LOGARITHMIC SOBOLEV INEQUALITY
BOUNDS ON THE DEFICIT IN THE LOGARITHMIC SOBOLEV INEQUALITY S. G. BOBKOV, N. GOZLAN, C. ROBERTO AND P.-M. SAMSON Abstract. The deficit in the logarithmic Sobolev inequality for the Gaussian measure is
More informationNishant Gurnani. GAN Reading Group. April 14th, / 107
Nishant Gurnani GAN Reading Group April 14th, 2017 1 / 107 Why are these Papers Important? 2 / 107 Why are these Papers Important? Recently a large number of GAN frameworks have been proposed - BGAN, LSGAN,
More informationLecture 5 Channel Coding over Continuous Channels
Lecture 5 Channel Coding over Continuous Channels I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw November 14, 2014 1 / 34 I-Hsiang Wang NIT Lecture 5 From
More informationWeak and strong moments of l r -norms of log-concave vectors
Weak and strong moments of l r -norms of log-concave vectors Rafał Latała based on the joint work with Marta Strzelecka) University of Warsaw Minneapolis, April 14 2015 Log-concave measures/vectors A measure
More informationHypercontractivity of spherical averages in Hamming space
Hypercontractivity of spherical averages in Hamming space Yury Polyanskiy Department of EECS MIT yp@mit.edu Allerton Conference, Oct 2, 2014 Yury Polyanskiy Hypercontractivity in Hamming space 1 Hypercontractivity:
More informationEECS 750. Hypothesis Testing with Communication Constraints
EECS 750 Hypothesis Testing with Communication Constraints Name: Dinesh Krithivasan Abstract In this report, we study a modification of the classical statistical problem of bivariate hypothesis testing.
More informationHard-Core Model on Random Graphs
Hard-Core Model on Random Graphs Antar Bandyopadhyay Theoretical Statistics and Mathematics Unit Seminar Theoretical Statistics and Mathematics Unit Indian Statistical Institute, New Delhi Centre New Delhi,
More informationConcentration, self-bounding functions
Concentration, self-bounding functions S. Boucheron 1 and G. Lugosi 2 and P. Massart 3 1 Laboratoire de Probabilités et Modèles Aléatoires Université Paris-Diderot 2 Economics University Pompeu Fabra 3
More informationMA651 Topology. Lecture 10. Metric Spaces.
MA65 Topology. Lecture 0. Metric Spaces. This text is based on the following books: Topology by James Dugundgji Fundamental concepts of topology by Peter O Neil Linear Algebra and Analysis by Marc Zamansky
More informationMartingale Problems. Abhay G. Bhatt Theoretical Statistics and Mathematics Unit Indian Statistical Institute, Delhi
s Abhay G. Bhatt Theoretical Statistics and Mathematics Unit Indian Statistical Institute, Delhi Lectures on Probability and Stochastic Processes III Indian Statistical Institute, Kolkata 20 24 November
More informationLecture 1 Measure concentration
CSE 29: Learning Theory Fall 2006 Lecture Measure concentration Lecturer: Sanjoy Dasgupta Scribe: Nakul Verma, Aaron Arvey, and Paul Ruvolo. Concentration of measure: examples We start with some examples
More informationON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS
Bendikov, A. and Saloff-Coste, L. Osaka J. Math. 4 (5), 677 7 ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS ALEXANDER BENDIKOV and LAURENT SALOFF-COSTE (Received March 4, 4)
More informationStochastic Dynamic Programming: The One Sector Growth Model
Stochastic Dynamic Programming: The One Sector Growth Model Esteban Rossi-Hansberg Princeton University March 26, 2012 Esteban Rossi-Hansberg () Stochastic Dynamic Programming March 26, 2012 1 / 31 References
More informationEcon Lecture 3. Outline. 1. Metric Spaces and Normed Spaces 2. Convergence of Sequences in Metric Spaces 3. Sequences in R and R n
Econ 204 2011 Lecture 3 Outline 1. Metric Spaces and Normed Spaces 2. Convergence of Sequences in Metric Spaces 3. Sequences in R and R n 1 Metric Spaces and Metrics Generalize distance and length notions
More informationEconomics 204 Summer/Fall 2011 Lecture 5 Friday July 29, 2011
Economics 204 Summer/Fall 2011 Lecture 5 Friday July 29, 2011 Section 2.6 (cont.) Properties of Real Functions Here we first study properties of functions from R to R, making use of the additional structure
More informationg 2 (x) (1/3)M 1 = (1/3)(2/3)M.
COMPACTNESS If C R n is closed and bounded, then by B-W it is sequentially compact: any sequence of points in C has a subsequence converging to a point in C Conversely, any sequentially compact C R n is
More information< k 2n. 2 1 (n 2). + (1 p) s) N (n < 1
List of Problems jacques@ucsd.edu Those question with a star next to them are considered slightly more challenging. Problems 9, 11, and 19 from the book The probabilistic method, by Alon and Spencer. Question
More informationTheory of Probability Fall 2008
MIT OpenCourseWare http://ocw.mit.edu 8.75 Theory of Probability Fall 008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. Section Prekopa-Leindler inequality,
More informationContinuity. Matt Rosenzweig
Continuity Matt Rosenzweig Contents 1 Continuity 1 1.1 Rudin Chapter 4 Exercises........................................ 1 1.1.1 Exercise 1............................................. 1 1.1.2 Exercise
More informationf-divergence Inequalities
SASON AND VERDÚ: f-divergence INEQUALITIES f-divergence Inequalities Igal Sason, and Sergio Verdú Abstract arxiv:508.00335v7 [cs.it] 4 Dec 06 This paper develops systematic approaches to obtain f-divergence
More informationExercises Measure Theoretic Probability
Exercises Measure Theoretic Probability 2002-2003 Week 1 1. Prove the folloing statements. (a) The intersection of an arbitrary family of d-systems is again a d- system. (b) The intersection of an arbitrary
More informationComplex Systems Methods 2. Conditional mutual information, entropy rate and algorithmic complexity
Complex Systems Methods 2. Conditional mutual information, entropy rate and algorithmic complexity Eckehard Olbrich MPI MiS Leipzig Potsdam WS 2007/08 Olbrich (Leipzig) 26.10.2007 1 / 18 Overview 1 Summary
More informationSpaces with Ricci curvature bounded from below
Spaces with Ricci curvature bounded from below Nicola Gigli March 10, 2014 Lessons Basics of optimal transport Definition of spaces with Ricci curvature bounded from below Analysis on spaces with Ricci
More informationErgodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R.
Ergodic Theorems Samy Tindel Purdue University Probability Theory 2 - MA 539 Taken from Probability: Theory and examples by R. Durrett Samy T. Ergodic theorems Probability Theory 1 / 92 Outline 1 Definitions
More informationArimoto Channel Coding Converse and Rényi Divergence
Arimoto Channel Coding Converse and Rényi Divergence Yury Polyanskiy and Sergio Verdú Abstract Arimoto proved a non-asymptotic upper bound on the probability of successful decoding achievable by any code
More informationConvergence to equilibrium for rough differential equations
Convergence to equilibrium for rough differential equations Samy Tindel Purdue University Barcelona GSE Summer Forum 2017 Joint work with Aurélien Deya (Nancy) and Fabien Panloup (Angers) Samy T. (Purdue)
More informationSpectral Gap and Concentration for Some Spherically Symmetric Probability Measures
Spectral Gap and Concentration for Some Spherically Symmetric Probability Measures S.G. Bobkov School of Mathematics, University of Minnesota, 127 Vincent Hall, 26 Church St. S.E., Minneapolis, MN 55455,
More informationSupplementary Notes for W. Rudin: Principles of Mathematical Analysis
Supplementary Notes for W. Rudin: Principles of Mathematical Analysis SIGURDUR HELGASON In 8.00B it is customary to cover Chapters 7 in Rudin s book. Experience shows that this requires careful planning
More informationApplication of Information Theory, Lecture 7. Relative Entropy. Handout Mode. Iftach Haitner. Tel Aviv University.
Application of Information Theory, Lecture 7 Relative Entropy Handout Mode Iftach Haitner Tel Aviv University. December 1, 2015 Iftach Haitner (TAU) Application of Information Theory, Lecture 7 December
More informationMoment Measures. Bo az Klartag. Tel Aviv University. Talk at the asymptotic geometric analysis seminar. Tel Aviv, May 2013
Tel Aviv University Talk at the asymptotic geometric analysis seminar Tel Aviv, May 2013 Joint work with Dario Cordero-Erausquin. A bijection We present a correspondence between convex functions and Borel
More informationREGULARITY OF MONOTONE TRANSPORT MAPS BETWEEN UNBOUNDED DOMAINS
REGULARITY OF MONOTONE TRANSPORT MAPS BETWEEN UNBOUNDED DOMAINS DARIO CORDERO-ERAUSQUIN AND ALESSIO FIGALLI A Luis A. Caffarelli en su 70 años, con amistad y admiración Abstract. The regularity of monotone
More informationPointwise convergence rates and central limit theorems for kernel density estimators in linear processes
Pointwise convergence rates and central limit theorems for kernel density estimators in linear processes Anton Schick Binghamton University Wolfgang Wefelmeyer Universität zu Köln Abstract Convergence
More informationMetric Spaces and Topology
Chapter 2 Metric Spaces and Topology From an engineering perspective, the most important way to construct a topology on a set is to define the topology in terms of a metric on the set. This approach underlies
More informationLarge Deviations Principles for McKean-Vlasov SDEs, Skeletons, Supports and the law of iterated logarithm
Large Deviations Principles for McKean-Vlasov SDEs, Skeletons, Supports and the law of iterated logarithm Gonçalo dos Reis University of Edinburgh (UK) & CMA/FCT/UNL (PT) jointly with: W. Salkeld, U. of
More informationSTAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song
STAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song Presenter: Jiwei Zhao Department of Statistics University of Wisconsin Madison April
More informationConcentration function and other stuff
Concentration function and other stuff Sabrina Sixta Tuesday, June 16, 2014 Sabrina Sixta () Concentration function and other stuff Tuesday, June 16, 2014 1 / 13 Table of Contents Outline 1 Chernoff Bound
More informationCombinatorial Criteria for Uniqueness of Gibbs Measures
Combinatorial Criteria for Uniqueness of Gibbs Measures Dror Weitz* School of Mathematics, Institute for Advanced Study, Princeton, New Jersey 08540; e-mail: dror@ias.edu Received 17 November 2003; accepted
More informationf-divergence Inequalities
f-divergence Inequalities Igal Sason Sergio Verdú Abstract This paper develops systematic approaches to obtain f-divergence inequalities, dealing with pairs of probability measures defined on arbitrary
More informationPush Forward Measures And Concentration Phenomena
Push Forward Measures And Concentration Phenomena Joint work with C. Hugo Jiménez and Raffael Villa Márton Naszódi Dept. of Geometry, Eötvös University, Budapest Notations All measures regular Borel measures,
More informationConcentration Properties of Restricted Measures with Applications to Non-Lipschitz Functions
Concentration Properties of Restricted Measures with Applications to Non-Lipschitz Functions S G Bobkov, P Nayar, and P Tetali April 4, 6 Mathematics Subject Classification Primary 6Gxx Keywords and phrases
More information1 Introduction. 2 Measure theoretic definitions
1 Introduction These notes aim to recall some basic definitions needed for dealing with random variables. Sections to 5 follow mostly the presentation given in chapter two of [1]. Measure theoretic definitions
More informationSpaces with Ricci curvature bounded from below
Spaces with Ricci curvature bounded from below Nicola Gigli February 23, 2015 Topics 1) On the definition of spaces with Ricci curvature bounded from below 2) Analytic properties of RCD(K, N) spaces 3)
More informationExtreme points of compact convex sets
Extreme points of compact convex sets In this chapter, we are going to show that compact convex sets are determined by a proper subset, the set of its extreme points. Let us start with the main definition.
More informationMathematics for Economists
Mathematics for Economists Victor Filipe Sao Paulo School of Economics FGV Metric Spaces: Basic Definitions Victor Filipe (EESP/FGV) Mathematics for Economists Jan.-Feb. 2017 1 / 34 Definitions and Examples
More informationAngle contraction between geodesics
Angle contraction between geodesics arxiv:0902.0315v1 [math.ds] 2 Feb 2009 Nikolai A. Krylov and Edwin L. Rogers Abstract We consider here a generalization of a well known discrete dynamical system produced
More informationABSOLUTE VALUES AND VALUATIONS
ABSOLUTE VALUES AND VALUATIONS YIFAN WU, wuyifan@umich.edu Abstract. We introduce the basis notions, properties and results of absolute values, valuations, discrete valuation rings and higher unit groups.
More informationEntropy and Large Deviations
Entropy and Large Deviations p. 1/32 Entropy and Large Deviations S.R.S. Varadhan Courant Institute, NYU Michigan State Universiy East Lansing March 31, 2015 Entropy comes up in many different contexts.
More informationConvex Analysis and Economic Theory AY Elementary properties of convex functions
Division of the Humanities and Social Sciences Ec 181 KC Border Convex Analysis and Economic Theory AY 2018 2019 Topic 6: Convex functions I 6.1 Elementary properties of convex functions We may occasionally
More informationSome Applications of Mass Transport to Gaussian-Type Inequalities
Arch. ational Mech. Anal. 161 (2002) 257 269 Digital Object Identifier (DOI) 10.1007/s002050100185 Some Applications of Mass Transport to Gaussian-Type Inequalities Dario Cordero-Erausquin Communicated
More informationAliprantis, Border: Infinite-dimensional Analysis A Hitchhiker s Guide
aliprantis.tex May 10, 2011 Aliprantis, Border: Infinite-dimensional Analysis A Hitchhiker s Guide Notes from [AB2]. 1 Odds and Ends 2 Topology 2.1 Topological spaces Example. (2.2) A semimetric = triangle
More informationStatistical Approaches to Learning and Discovery. Week 4: Decision Theory and Risk Minimization. February 3, 2003
Statistical Approaches to Learning and Discovery Week 4: Decision Theory and Risk Minimization February 3, 2003 Recall From Last Time Bayesian expected loss is ρ(π, a) = E π [L(θ, a)] = L(θ, a) df π (θ)
More informationµ X (A) = P ( X 1 (A) )
1 STOCHASTIC PROCESSES This appendix provides a very basic introduction to the language of probability theory and stochastic processes. We assume the reader is familiar with the general measure and integration
More informationTRANSPORT INEQUALITIES FOR STOCHASTIC
TRANSPORT INEQUALITIES FOR STOCHASTIC PROCESSES Soumik Pal University of Washington, Seattle Jun 6, 2012 INTRODUCTION Three problems about rank-based processes THE ATLAS MODEL Define ranks: x (1) x (2)...
More informationOn Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law
On Isoerimetric Functions of Probability Measures Having Log-Concave Densities with Resect to the Standard Normal Law Sergey G. Bobkov Abstract Isoerimetric inequalities are discussed for one-dimensional
More informationGeneralized Ricci Bounds and Convergence of Metric Measure Spaces
Generalized Ricci Bounds and Convergence of Metric Measure Spaces Bornes Généralisées de la Courbure Ricci et Convergence des Espaces Métriques Mesurés Karl-Theodor Sturm a a Institut für Angewandte Mathematik,
More informationTHE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS
THE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS RALPH HOWARD DEPARTMENT OF MATHEMATICS UNIVERSITY OF SOUTH CAROLINA COLUMBIA, S.C. 29208, USA HOWARD@MATH.SC.EDU Abstract. This is an edited version of a
More information6 Markov Chain Monte Carlo (MCMC)
6 Markov Chain Monte Carlo (MCMC) The underlying idea in MCMC is to replace the iid samples of basic MC methods, with dependent samples from an ergodic Markov chain, whose limiting (stationary) distribution
More information