ULTRAMETRIC SKELETONS. 1. Introduction

Similar documents
SCALE-OBLIVIOUS METRIC FRAGMENTATION AND THE NONLINEAR DVORETZKY THEOREM

MANOR MENDEL AND ASSAF NAOR

Finite Metric Spaces & Their Embeddings: Introduction and Basic Tools

Topological properties

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

Locally convex spaces, the hyperplane separation theorem, and the Krein-Milman theorem

7 Complete metric spaces and function spaces

Fréchet embeddings of negative type metrics

Fragmentability and σ-fragmentability

Metric Spaces and Topology

The Caratheodory Construction of Measures

Basic Properties of Metric and Normed Spaces

ON A MEASURE THEORETIC AREA FORMULA

LECTURE 15: COMPLETENESS AND CONVEXITY

Measurable Choice Functions

ANALYSIS WORKSHEET II: METRIC SPACES

Existence and Uniqueness

Real Analysis Notes. Thomas Goller

The small ball property in Banach spaces (quantitative results)

Chapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries

Final. due May 8, 2012

Chapter 2. Metric Spaces. 2.1 Metric Spaces

CHAPTER 9. Embedding theorems

Chapter 2 Metric Spaces

Aliprantis, Border: Infinite-dimensional Analysis A Hitchhiker s Guide

Geometry and topology of continuous best and near best approximations

A BOREL SOLUTION TO THE HORN-TARSKI PROBLEM. MSC 2000: 03E05, 03E20, 06A10 Keywords: Chain Conditions, Boolean Algebras.

NOTES ON VECTOR-VALUED INTEGRATION MATH 581, SPRING 2017

Some Background Material

Euclidean Quotients of Finite Metric Spaces

Part III. 10 Topological Space Basics. Topological Spaces

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES

MA651 Topology. Lecture 10. Metric Spaces.

MATH 54 - TOPOLOGY SUMMER 2015 FINAL EXAMINATION. Problem 1

Lecture 18: March 15

Subsequences of frames

Séminaire BOURBAKI Janvier ème année, , n o 1047 THE RIBE PROGRAMME. by Keith BALL

Topology. Xiaolong Han. Department of Mathematics, California State University, Northridge, CA 91330, USA address:

01. Review of metric spaces and point-set topology. 1. Euclidean spaces

Maths 212: Homework Solutions

Some results in support of the Kakeya Conjecture

Hausdorff Measure. Jimmy Briggs and Tim Tyree. December 3, 2016

Chapter 4. Measure Theory. 1. Measure Spaces

Mid Term-1 : Practice problems

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES

AN INTRODUCTION TO EXTREME POINTS AND APPLICATIONS IN ISOMETRIC BANACH SPACE THEORY

CHAPTER 6. Differentiation

1 Cheeger differentiation

Random sets of isomorphism of linear operators on Hilbert space

Mathematics for Economists

Construction of a general measure structure

MATH 4200 HW: PROBLEM SET FOUR: METRIC SPACES

GAUSSIAN MEASURE OF SECTIONS OF DILATES AND TRANSLATIONS OF CONVEX BODIES. 2π) n

Hausdorff measure. Jordan Bell Department of Mathematics, University of Toronto. October 29, 2014

UNIFORMLY DISTRIBUTED MEASURES IN EUCLIDEAN SPACES

MAT 544 Problem Set 2 Solutions

An introduction to Mathematical Theory of Control

Introduction to Dynamical Systems

Overview of normed linear spaces

Moreover, µ is monotone, that is for any A B which are both elements of A we have

VALUATION THEORY, GENERALIZED IFS ATTRACTORS AND FRACTALS

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

1.4 The Jacobian of a map

Math 341: Convex Geometry. Xi Chen

Tools from Lebesgue integration

Economics 204 Fall 2011 Problem Set 1 Suggested Solutions

Course 212: Academic Year Section 1: Metric Spaces

AN EXPLORATION OF THE METRIZABILITY OF TOPOLOGICAL SPACES

arxiv: v1 [math.fa] 14 Jul 2018

PACKING-DIMENSION PROFILES AND FRACTIONAL BROWNIAN MOTION

CHAPTER 7. Connectedness

After taking the square and expanding, we get x + y 2 = (x + y) (x + y) = x 2 + 2x y + y 2, inequality in analysis, we obtain.

Topological properties of Z p and Q p and Euclidean models

Math 117: Topology of the Real Numbers

Lecture 5 - Hausdorff and Gromov-Hausdorff Distance

Real Analysis Chapter 4 Solutions Jonathan Conder

arxiv:math/ v1 [math.ho] 2 Feb 2005

UNIVERSAL DERIVED EQUIVALENCES OF POSETS

g 2 (x) (1/3)M 1 = (1/3)(2/3)M.

Ramsey partitions and proximity data structures

1 k x k. d(x, y) =sup k. y k = max

Measure and Integration: Concepts, Examples and Exercises. INDER K. RANA Indian Institute of Technology Bombay India

Non-Asymptotic Theory of Random Matrices Lecture 4: Dimension Reduction Date: January 16, 2007

On metric characterizations of some classes of Banach spaces

Boundedly complete weak-cauchy basic sequences in Banach spaces with the PCP

Entropy and Ergodic Theory Lecture 15: A first look at concentration

5 Set Operations, Functions, and Counting

A PROOF OF A CONVEX-VALUED SELECTION THEOREM WITH THE CODOMAIN OF A FRÉCHET SPACE. Myung-Hyun Cho and Jun-Hui Kim. 1. Introduction

Another Riesz Representation Theorem

Doubling metric spaces and embeddings. Assaf Naor

4 Countability axioms

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

Introduction to Real Analysis Alternative Chapter 1

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

Lebesgue Measure on R n

Packing-Dimension Profiles and Fractional Brownian Motion

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

Lecture Notes Introduction to Ergodic Theory

Introduction and Preliminaries

Transcription:

ULTRAMETRIC SKELETONS MANOR MENDEL AND ASSAF NAOR Abstract. We prove that for every ε, there exists C ε, with the following property. If X, d is a compact metric space and µ is a Borel probability measure on X then there exists a compact subset S X that embeds into an ultrametric space with distortion O/ε, and a probability measure ν supported on S satisfying ν B d x, r µb d x, C εr ε for all x X and r,. The dependence of the distortion on ε is sharp. We discuss an extension of this statement to multiple measures, as well as how it implies Talagrand s majorizing measure theorem. Our main result is the following theorem.. Introduction Theorem.. For every ε, there exists C ε, with the following property. Let X, d be a compact metric space and let µ be a Borel probability measure on X. Then there exists a compact subset S X satisfying S embeds into an utrametric space with distortion O/ε. 2 There exists a Borel probability measure supported on S satisfying for all x X and r [,. ν B d x, r µb d x, C ε r ε Recall that an ultrametric space is a metric space U, ρ satisfying the strengthened triangle inequality ρx, y max{ρx, z, ρy, z} for all x, y, z U. Saying that S, d embeds with distortion D [, into an ultrametric space means that there exists an ultrametric space U, ρ and an injection f : S U satisfying dx, y ρfx, fy Ddx, y for all x, y S. In the statement of Theorem., and in the rest of this paper, given a metric space X, d, a point x X and a radius r [,, the corresponding closed ball is denoted B d x, r = {y X : dy, x r}, and the corresponding open ball is denoted Bd x, r = {y X : dy, x < r}. We explicitly indicate the underlying metric since the ensuing discussion involves multiple metrics on the same set. We call the metric measure space S, d, ν from Theorem. an ultrametric skeleton of the metric measure space X, d, µ. The literature contains several theorems about the existence of large ultrametric subsets of metric spaces; some of these results will be mentioned below. As we shall see, the subset S of Theorem. must indeed be large, but it is also geometrically spread out with respect to the initial probability measure µ. For example, if µ assigns positive mass to two balls B d x, r and B d y, r, where x, y X satisfy dx, y > C ε +r, then the probability measure ν, which is supported on S, cannot assign full mass to any one of these balls. This is one reason why S, d, ν serves as a skeleton of X, d, µ. More significantly, we call S, d, µ an ultrametric skeleton because it can be used to deduce global information about the entire initial metric measure space X, d, µ. Examples of such global M. M. was partially supported by ISF grants 22/7 and 93/, BSF grants 269 and 22, and a gift from Cisco Research Center. A. N. was partially supported by NSF grant CCF-832795, BSF grants 269 and 22, and the Packard Foundation. Part of this work was completed when M. M. was a visiting researcher at Microsoft Research and A. N. was visiting the Quantitative Geometry program at the Mathematical Sciences Research Institute.

applications of statements that are implied by Theorem. are described in [3, 4], and an additional example will be presented below. As a qualitative illustration of this phenomenon, consider a stochastic process {Z t } t T, assuming for simplicity that the index set T is finite and that each random variable Z t has finite second moment. Equip T with the metric ds, t = E [Z s Z t 2 ]. Assume that there exists a unique random point τ T satisfying Z τ = max t T Z t. Let µ be the law of τ, and apply Theorem., say, with ε = /2, to the metric measure space X, d, µ. One obtains a subset S X that embeds into an ultrametric space with distortion O, and a probability measure ν that is supported on S and satisfies with ε = /2. If σ S is a random point of S whose law is ν, then it follows that for every x T, p, and r [,, if σ falls into B d x, r with probability at least p, then the global maximum τ falls into B d x, Or with probability at least p 2. One can therefore always find a subset of T that is more structured due to the fact that it is approximately an ultrametric space e.g., such structure can be harnessed for chaining-type arguments, yet this subset reflects the location of the global maximum of {Z t } t T in the above distributional/geometric sense. A quantitatively sharp variant of the above qualitative interpretation of Theorem. is discussed in Section.. below... Nonlinear Dvoretzky theorems. Nonlinear Dvoretzky theory, as initiated by Bourgain, Figiel and Milman [4], asks for theorems asserting that any large metric space contains a large subset that embeds with specified distortion into Hilbert space. We will see below examples of notions of largeness of a metric space for which a nonlinear Dvoretzky theorem can be proved. For an explanation of the relation of such problems to the classical Dvoretzky theorem [5], see [4,, 4]. Most known nonlinear Dvoretzky theorems actually obtain subsets that admit a low distortion embedding into an ultrametric space. Since ultrametric spaces admit an isometric embedding into Hilbert space [7], such a result falls into the Bourgain-Figiel-Milman framework. Often see [4, ] one can prove an asymptotically matching impossibility result which shows that all subsets of a given metric space that admit a low distortion embedding into Hilbert space must be small. Thus, in essence, it is often the case that the best way to find an almost Hilbertian subset is actually to aim for a subset satisfying the seemingly more stringent requirement of being almost ultrametric. Apply Theorem. to an n-point metric space X, d, with µ{x} = /n for all x X. Since ν is a probability measure on S, there exists x X with ν{x} / S. An application of with r = shows that / S µ{x} ε = /n ε, or S n ε. Since S embeds into an ultrametric space with distortion O/ε, this shows that Theorem. implies the sharp solution of the Bourgain-Figiel-Milman nonlinear Dvoreztky problem that was first obtained in [3]. Sharpness in this context means that, as shown in [], there exists a universal constant c, and for every n N there exists an n-point metric space X n such that every S X n with S n ε incurs distortion at least c/ε in any embedding into Hilbert space. Thus, the distortion bound in Theorem. cannot be improved up to constants, even if we only require that S embeds into Hilbert space. Assume that X, d is a compact metric space of Hausdorff dimension greater than α,. Then there exists [9, ] an α-frostman measure on X, d, i.e., a Borel probability measure µ satisfying µb d x, r Kr α for every x X and r,, where K is a constants that may depend on X and α but not on x and r. An application of Theorem. to X, d, µ yields a compact subset S X that embeds into an ultrametric space with distortion O/ε, and a Borel probability measure ν supported on S satisfying νb d x, r µb d x, C ε r ε K ε C ε εα r εα for all x X and r,. Hence ν is a εα-frostman measure on S, implying [] that S has Hausdorff dimension at least εα. Thus Theorem. implies the sharp solution of the nonlinear Dvoretzky problem for Hausdorff dimension that was first obtained in [4]. More generally, the following result was proved in [4] as the main step towards the solution of a question of Tao concerning the nonlinear Dvoretzky problem for Hausdorff dimension. 2

Theorem.2. For every ε, there exists c ε = e O/ε2, with the following property. Let X, d be a compact metric space and let µ be a Borel probability measure on X. Then there exists a compact subset S X satisfying S embeds into an utrametric space with distortion O/ε. 2 If {x i } i I X and {r i } i I [, satisfy I B dx i, r i S then µ B d x i, c ε r i ε. 2 i I Theorem.2 is a consequence of Theorem.. Indeed, if S X and ν are the subset and probability measure from Theorem., then = νs i I νb dx i, r i i I µb dx i, C ε r i ε whenever i I B dx i, r i S. But, Theorem.2 is the main reason for the validity of the phenomenon described in Theorem.: here we show how to formally deduce Theorem. from Theorem.2, with C ε = Oc ε /ε = e O/ε2. Alternatively, with more work, one can repeat the proof of Theorem.2 in [4] while making changes to several lemmas in order to prove Theorem. directly, and obtain C ε = c ε. Since the proof of Theorem.2 in [4] is quite involved, we believe that it is instructive to establish Theorem. via the argument described here.... Majorizing measures and stochastic processes. Theorem. makes it possible to relate the nonlinear Dvoretzky framework of [4] to Talagrand s nonlinear Dvoretzky theorem [6], and consequently to Talagrand s majorizing measure theorem [6]. Given a metric space X, d let P X be the Borel probability measures on X. The Fernique-Talagrand γ 2 functional is defined as follows. γ 2 X, d = inf sup µ P X x X log µb d x, r dr. 3 A nonlinear Dvoretzky-like theorem due to Talagrand [6] asserts that every finite metric space X, d has a subset S X that embeds into an ultrametric space with distortion O and γ 2 S, d γ 2 X, d. Talagrand proved this nonlinear version of Dvoretzky s theorem in order to prove his celebrated majorizing measure theorem, which asserts that if {G x } x X is a centered Gaussian process and for x, y X we set dx, y = E [G x G y 2 ], then E [sup x X G x ] γ 2 X, d. There is also a simpler earlier matching upper bound on E [sup x X G x ] due to Fernique [6], so E [sup x X G x ] γ 2 X, d. The fact that Talagrand s nonlinear Dvoretzky theorem implies the majorizing measure theorem is simple; see [6, Prop. 3] and also the discussion in [4, Sec..3]. To understand the link between Theorem. and Talagrand s nonlinear Dvoretzky theorem, consider the following quantity, associated to every compact metric space X, d. δ 2 X, d = sup inf log dr. 4 µ P X x X µb d x, r Intuitively, γ 2 X, d should be viewed as a multi-scale version of a covering number, while δ 2 X, d should be viewed as a multi-scale version of a packing number. It is therefore not surprising that γ 2 X, d δ 2 X, d. In fact, in Section 3 we note that γ 2 U, ρ = δ 2 U, ρ for every finite ultrametric space U, ρ, and δ 2 X, d γ 2 X, d for every finite metric space X, d the latter inequality is an improvement of our original bound δ 2 X, d γ 2 X, d, due to an elegant argument of Witold Bednorz [2]. The remaining estimate δ 2 X, d γ 2 X, d will not be needed here, though it follows from our discussion see Remark.3, and it also has a simpler direct proof. Let µ P X satisfy δ 2 X, d = inf x X log /µbd x, rdr. Theorem. applied to X, d, µ yields S X and an ultrametric ρ : S S [, satisfying dx, y ρx, y Kdx, y Here, and in what follows, the relations, indicate the corresponding inequalities up to factors which are universal constants. The relation A B stands for A B A B. 3

for all x, y S. There also exists ν P S satisfying ν B d x, r µb d x, Kr for all x X and r [,. Here K, is a universal constant. Since B ρ x, r B d x, r S B ρ x, Kr for all x S and r [,, we have δ 2 S, d δ 2 S, ρ = γ 2 S, ρ Kγ 2 S, d, where we used the fact that γ 2 and δ 2 coincide for ultrametrics and a simple change of variable argument. Hence, Kγ 2 S, d δ 2 S, d inf x S log νb d x, r log = K 2 inf x S dr inf x S µb d x, r log dr µbd x, Kr dr δ 2X, d K 2 γ 2X, d K 2. 5 This completes the deduction of Talagrand s nonlinear Dvoretzky theorem from Theorem.. Remark.3. It is easy to check see [6, Lem. 6] that γ 2 S, d 2γ 2 X, d for every S X and, even more trivially, δ 2 S, d δ 2 X, d. Thus, it follows from 5 that γ 2 X, d δ 2 X, d for every finite metric space X, d. Since the original 987 publication of Talagrand s majorizing measure theorem, this theorem has been reproved and simplified in several subsequent works, yielding important applications and generalizations mainly due to Talagrand himself. These proofs are variants of the same basic idea: a greedy top-down construction, in which one looks at a given scale for a ball on which a certain functional is maximized, removes a neighborhood of this ball, and iterates this step on the remainder of the metric space. It seems that the framework described here is genuinely different. The proof of Theorem.2 in [4] has two phases. One first constructs a nested family of partitions in a bottom-up fashion: starting with singletons one iteratively groups the points together based on a gluing rule that is tailor-made in anticipation of the ensuing pruning or sparsification step. This second step is a top-down iterative removal of appropriately sparse regions of the partitions that were constructed in the first step; here one combines an analytic argument with the pigeonhole principle to show that there are sufficiently many potential pruning locations so that successive iterations of this step can be made to align appropriately. Our new approach has the advantage that it yields distributional statements such as, the majorizing measure theorem itself being a result of integrating these pointwise estimates..2. Multiple measures. In anticipation of further applications of ultrametric skeletons, we end by addressing what is perhaps the simplest question that one might ask about these geometric objects: to what extent is the union of two ultrametric skeletons also an ultrametric skeleton? We show in Remark 4.3 that for arbitrarily large D, D 2 [,, one can find a finite metric space X, d, and two disjoint subsets U, U 2 X, such that each U i embeds into an ultrametric space with distortion D i, yet any embedding of U U 2 into an ultrametric space incurs distortion at least D + D 2 +. In Section 4 we prove the following geometric result of independent interest which, as explained above, is sharp up to lower order terms. Theorem.4. Fix D, D 2 [,. Let X, d be a metric space and U, U 2 X. Assume that U, d embeds with distortion D into an ultrametric space and that U 2, d embeds with distortion D 2 into an ultrametric space. Then the metric space U U 2, d embeds with distortion at most D + 2D 2 + 2 2 into an ultrametric space. Consequently, one can always find an ultrametric skeleton that is large with respect to any finite list of probability measures. Corollary.5. For every ε, let C ε be as in Theorem.. Let X, d be a compact metric space, and let µ,..., µ k be Borel probability measures on X. Then there exists a compact subset 4

S X, and Borel probability measures ν,..., ν k supported on S, such that S emebds into an ultrametric space with distortion at most O/ε k and for every x X and r [, we have ν i B d x, r µ i B d x, C ε r ε for all i {,..., k}. 2. Proof of Theorem. A submeasure on a set X is a function ξ : 2 X [, satisfying the following conditions. a ξ =, b A A 2 X = ξa ξa 2, c {A i } i I X = ξ i I A i i I ξa i. If in addition ξx = we call ξ a probability submeasure. Lemma 2.. Let U, ρ be a compact ultrametric space, and let ξ : 2 U [, be a probability submeasure. Then there exists a Borel probability measure ν on U satisfying νb ρ x, r ξb ρ x, r for all x X and r [,. Remark 2.2. It is known [8, 5] that there exist probability submeasures that do not dominate any nonzero measure in the literature such measures are called pathological submeasures. Lemma 2. shows that probability submeasures on ultrametric spaces always dominate on all balls some probability measure. Assuming the validity of Lemma 2. for the moment, we prove Theorem.. Proof of Theorem.. Let X, d be a compact metric space and µ a Borel probability measure on X. By Theorem.2 there exists a compact subset S X satisfying the covering estimate 2, and an ultrametric ρ : S S [, satisfying dx, y ρx, y K ε dx, y for all x, y S, where K is a universal constant. For every A S define { ξa = inf µ B d x i, c ε r i ε : {x i, r i } i I X [, } B d x i, r i A. 6 i I i I In 6 the index set I can be countably infinite or finite, with the convention that an empty sum vanishes. One checks that ξ : 2 S [, is a submeasure on S. Moreover, for every x X and r [,, by considering B d x, r as covering itself, we deduce from 6 that ξ S B d x, r µ B d x, c ε r ε. 7 Since µ is a probability measure and X has bounded diameter, it follows from 7 that ξs. The covering estimate 2 implies that ξs, so in fact ξ is a probability submeasure on S. An application of Lemma 2. to S, ρ, ξ yields a Borel probability measure ν supported on S and satisfying νb ρ y, r ξb ρ y, r for all y S and r [,. Fix x X and r [,. The desired estimate holds trivially if B d x, r S =, so we may assume that there exists y S with dx, y r. Thus S B d x, r S B d y, 2r B ρ y, 2K ε r S B d y, 2K ε r S B d x, + 2K ε r. It follows that νb d x, r ν B ρ y, 2Kε r ξ B ρ y, 2Kε r ξ S B d x, + 2K ε r µ B d x, c ε + 2K ε This completes the deduction of Theorem. from Theorem.2 and Lemma 2.. r ε. 5

Prior to proving Lemma 2., we review some basic facts about compact ultrametric spaces; see [] for an extended and more general treatment of this topic. Fix a compact ultrametric space U, ρ. For every r, we have {B ρ x, s : x, s U [r, } <, i.e., there are only finitely many closed balls in U of radius at least r. Indeed, by compactness U contains only finitely many disjoint closed balls of radius at least r. Since B ρ x, s B ρ y, t {, B ρ x, s, B ρ y, t} for every x, y U and s, t [,, assuming for contradiction that {B ρ x, s : x, s U [r, } is infinite, we deduce that there exist {x i, s i } i= U [r, satisfying B ρx i, s i B ρ x i+, s i+ for all i N. Fix y i B ρ x i+, s i+ B ρ x i, s i. If i < j then y j / B ρ x j, s j B ρ x i+, s i+ and y i B ρ x i+, s i+. Hence s i+ < ρy j, x i+ max{ρy j, y i, ρy i, x i+ } max{ρy j, y i, s i+ }. It follows that ρy i, y j > s i+ r for all j > i, contradicting the compactness of U, ρ. A consequence of the above discussion is that for every x U and r, there exists ε, such that B ρ x, r = B ρ x + ε. Therefore B ρ x, r = Bρx, r + ε/2. Similarly, since Bρx, r = δ,r/2] B ρx, r δ, where there are only finitely many distinct balls appearing in this union, there exists δ, r/2] such that Bρx, r = B ρ x, r δ. Thus every open ball in U of positive radius is also a closed ball, and every closed ball in U of positive radius is also an open ball. Consider the equivalence relation on U given by x y ρx, y < diam ρ U. This is indeed an equivalence relation since ρ is an ultrametric. The corresponding equivalence classes are all of the form Bρx, diam ρ U for some x U. Being open sets that cover U, there are only finitely many such equivalence classes, say, {B, B2,..., B k }. By the above discussion, each of the open balls B i is also a closed ball, and hence Bi, ρ is a compact ultrametric space. We can therefore continue the above construction iteratively, obtaining a sequence {P j } j= of partitions of U with the following properties. P = {U}. 2 P j is finite for all j. 3 P j+ is a refinement of P j for all j. 4 Every C P j is of the form B ρx, r for some x U and r [,. 5 For every j, if C P j is not a singleton then there exists x,..., x k U such that {Bρx i, diam ρ C} k i= P j+, the open balls {Bρx i, diam ρ C} k i= are disjoint, and C = k i= B ρx i, diam ρ C. 6 lim j max C Pj diam ρ C =. 7 For every x U and r, there exists j such that B ρx, r P j. The first five items above are valid by construction. The sixth item follows from the fact that for all j N either P j consists of singletons or max C Pj diam ρ C < max C Pj diam ρ C. Since for every r, there are only finitely many balls of radius at least r in U, necessarily lim j max C Pj diam ρ C =. To prove the seventh item above, assume for contradiction that x, r U, is such that Bρx, r / P j for all j. Since the set {Bρx, s} s r is finite, and Bρx, diamu + = U P, by replacing, if necessary, Bρx, r with the largest open ball centered at x that is not contained in j= P j we may assume without loss of generality that Bρx, s j= P j for all s r,. But since B ρ x, r = Bρx, s for some s r,, it follows that B ρ x, r P j for some j. In particular, Bρx, r B ρ x, r, implying that diam ρ B ρ x, r = r. Therefore by construction Bρx, r P j+, a contradiction. Proof of Lemma 2.. Let {P j } j= be the sequence of partitions of U that was constructed above. We will first define ν on S = j= P j { }, which is the set of all open balls in U allowing the radius to vanish, in which case the corresponding open ball is empty. Setting νx = and ν =, assume inductively that ν has been defined on P j. For C P j+ let D P j be the unique set satisfying C D. There exist disjoint sets C,..., C k P j+, with C {C,..., C k }, 6

such that D = C C k. Define νc = ξc k i= ξc νd. 8 i This completes the inductive definition of ν : S [,. We claim that one can apply the Carathéodory extension theorem to extend ν to a Borel measure on U. To this end, note that S is a semi-ring of sets. Indeed, S is closed under intersection since C D {, C, D} for all C, D S. We therefore need to check that for every C, D S, the set D C is a finite disjoint union of elements in S. For this purpose we may assume that D C, implying that C D. Assume that C P j and D P i for i < j. Let C,..., C k P j be the distinct elements of P j that are contained in D, enumerated so that C = C. Then D C = C 2 C k, and this union is disjoint, as required. In order to apply the Carathéodory extension theorem, it remains to check that if {A i } i= S are pairwise disjoint and i= A i S, then ν i= A i = i= νa i. Since all the elements of S are both open and closed, compactness implies that it suffices to show that if A,..., A m S are pairwise disjoint and A A m S, then νa A m = m i= νa i. We proceed by induction on m, the case m = being vacuous. For every i {,..., m} there is a unique k i N such that A i P ki P ki. Define k = max{k,..., k m }. If k = then necessarily m = and A = U. Assume that k > and fix j {,..., m} satisfying k j = k. Let D P k 2 be the unique element of P k 2 containing A j, and let C,..., C l P k be the distinct elements of P k contained in D. Since A A m is a ball containing A j D, we have A A m D = C C l. By maximality of k it follows that A j {C,..., C l } {A,..., A m }. For i {,..., l} let n i {,..., m} be such that C i = A ni. Since i {,...,m} {n,...,nl} A i D = m i= A i, the inductive hypothesis implies that i {,...,m} {n,...,n l } νa i+νd = νa A m. But by our definition 8 we have νd = νa n + + νa nl, so that indeed νa A m = m i= νa i. Having defined the Borel probability measure ν, it remains to check by induction on j that if C P j then νc ξc. If j = then C = U and νu = ξu =. If j then let D P j satisfy C D. There exist disjoint sets C,..., C k P j+, with C {C,..., C k }, such that D = C C k. Since ξ is a submeasure, ξd ξc + + ξc k. By the inductive hypothesis νd ξd. Our definition 8 now implies that νc ξc. The proof of Lemma 2. is complete. and 3. γ 2 X, d and δ 2 X, d Let X, d be a finite metric space. For every measurable φ :, [, define γ φ X, d = inf sup µ P X x X δ φ X, d = sup µ P X inf x X φµb d x, rdr, φµb d x, rdr. Thus γ 2 = γ φ and δ 2 = δ φ for φx = log/x. The following lemma is a variant of an argument of Bednorz [2, Lem. 4]. The elegant proof below was shown to us by Keith Ball; it is a generalization and a major simplification of our original proof of the estimate δ 2 X, d γ 2 X, d. Lemma 3.. Assume that φ :,, is continuous and lim x + φx =. δ φ X, d γ φ X, d. Then Proof. Write X = {x,..., x n }. Thus P X can be identified with the n -dimensional simplex n = {µ,..., µ n [, ] n ; µ + + µ n = } by setting µ{x i } = µ i. 7

Define f,..., f n : n [, by { if µi =, f i µ = + φµbx i, rdr if µ i >. Writing Sµ = n i= f iµ, we define F : n n by F µ = f µ,..., f n µ/sµ. Since φ is continuous and lim x + φx =, all the f i are continuous on n. Since each µ n has at least one positive coordinate, Sµ >. Thus F is continuous. Note that by definition F maps each face of n into itself. By a standard reformulation of the Brouwer fixed point theorem see, e.g., [7, Sec. 4.29 2 + ], it follows that f n = n. In particular, there exists µ n for which F µ = /n,..., /n. In other words, there exists µ P X such that φµb d x, rdr does not depend on x X. Hence, δ φ X, d inf x X φµb d x, rdr = sup x X φµb d x, rdr γ φ X, d. Lemma 3.2. Assume that φ :, [, is non-increasing. Let U, ρ be a finite ultrametric space. Then δ φ U, ρ γ φ U, ρ. Proof. We claim that if µ, ν are nonnegative measures on U satisfying µu νu then there exists a U satisfying µb ρ a, r νb ρ a, r for all r,. This would imply the desired estimate since if µ, ν P X are chosen so that sup x X φµb ρ x, rdr = γ φ U, ρ and inf x X φµb ρ x, rdr = δ φ U, ρ, then γ φ U, ρ φµb ρ a, rdr φνb ρ a, rdr δ φ U, ρ. The proof of the existence of a U is by induction on U. If U = there is nothing to prove. Otherwise, as explained in Section 2, there exist x,..., x k U such that the balls {Bρx i, diam ρ U} k i= are nonempty, pairwise disjoint, and k i= B ρx i, diam ρ U = U. It follows that k i= µb ρx i, diam ρ U = µu νu = k i= νb ρx i, diam ρ U. Consequently there exists i {,..., k} such that µbρx i, diam ρ U νbρx i, diam ρ U. By the inductive hypothesis there exists a Bρx i, diam ρ U satisfying µb ρ a, r νb ρ a, r for all r < diam ρ U. Since for r diam ρ U we have B ρ a, r = U, the proof is complete. A combination of Lemma 3. and Lemma 3.2 yields the following corollary. Corollary 3.3. If φ :, [, is non-increasing, continuous, and lim x + φx =, then δ φ U, ρ = γ φ U, ρ for all finite ultrametric spaces U, ρ. Remark 3.4. Consider the star metric d n on {,,..., n}, i.e., d n, i = for all i {,..., n} and d n p, q = 2 for all distinct p, q {,..., n}. The measure ν on {,,..., n} given by ν{} = and ν{i} = /n for i {,..., n}, shows that δ 2 {,,..., n}, d n 2 log n. At the same time, the measure µ on {,,..., n} given by µ{} = /2 and µ{i} = /2n for i {,..., n}, shows that γ 2 {,,..., n}, d n log2n + log2n/n + /2 + oδ 2 {,,..., n}, d n. Thus, unlike the case of ultrametric spaces, for general metric spaces it is not always true that γ 2 X, d = δ 2 X, d. Of course, due to Lemma 3. and Remark.3 we know that γ 2 X, d δ 2 X, d. It seems plausible that always δ 2 X, d 2γ 2 X, d, but we do not investigate this here. 4. Unions of approximate ultrametrics In this section we prove Theorem.4 and present some related examples. Below, given a partition P of a set X, for x X we denote by P x the element of P to which x belongs. 8

Lemma 4.. Fix D, D 2. Let X, d be a metric space and let U, U 2 X be two bounded subsets of X. Assume that U, d embeds with distortion D into an ultrametric space and that U 2, d embeds with distortion D 2 into an ultrametric space. Then for every ε, there is a partition P of U U 2 with the following properties. For every C P, where δ def = For every distinct C, C 2 P, diam d C δ diam d U U 2, 9 2εD 2 D D 2 + 2D + 2D 2 + 2D D 2 + 2D + 2D 2 + 2 + ε. dc, C 2 diam d U U 2 D D 2 + 2D + 2D 2 + 2 + ε. Proof. By rescaling we may assume that diam d U U 2 =. Let ρ be an ultrametric on U satisfying dx, y ρ x, y D dx, y for all x, y U. Define a def = D D 2 + 2D D D 2 + 2D + 2D 2 + 2, 2 and consider the equivalence relation on U given by x y ρ x, y a this is an equivalence relation since ρ is an ultrametric. Let {E i } i I 2 U be the corresponding equivalence classes. Thus for all i I, and for distinct i, j I we have diam d E i diam ρ E i a 3 de i, E j ρ E i, E j D a D. 4 Let ρ 2 be an ultrametric on U 2 satisfying dx, y ρ 2 x, y D 2 dx, y for all x, y U 2. Define b def = D 2 D D 2 + 2D + 2D 2 + 2 + ε, 5 and consider similarly the equivalence relation on U 2 given by x 2 y ρ 2 x, y b. The corresponding equivalence classes will be denoted {F j } j J 2 U 2. Thus for all i J, and for distinct i, j J we have diam d F i diam ρ F i b 6 For every i I denote where df i, F j ρ 2F i, F j D 2 b D 2. 7 J i def = {j J : de i, F j c}, 8 c def = D D 2 + 2D + 2D 2 + 2. 9 9

Note that for every j J there is at most one i I for which j J i. Indeed, if j J i J l, where i l, then de i, E l de i, F j + diam d F j + df j, E l 6 8 5 9 = < 2c + b 2 D D 2 + 2D + 2D 2 + 2 + D 2 D D 2 + 2D + 2D 2 + 2 + ε 2 + D 2 2 = a, D D 2 + 2D + 2D 2 + 2 D Contradicting 4. Consider the partition P of U U 2 consisting of the sets E i and {F j U } j J i I J i. j J i F j i I It follows from 4, 7 and 8 that for every distinct C, C 2 P, { } a b dc, C 2 min,, c = D D 2 D D 2 + 2D + 2D 2 + 2 + ε. Thus the partition P satisfies. It also follows from 3, 6 and 8 that for every C P, diam d C a + 2b + 2c = δ, where we used the definitions, 2, 5, 9. Thus the partition P satisfies 9, completing the proof of Lemma 4.. Proof of Theorem.4. Assume first that U, U 2 are bounded. Define a sequence {P k } k= of partitions of U U 2 as follows. Start with the trivial partition P = {U U 2 }, and having defined P k, the partition P k+ is obtained by applying Lemma 4. to the sets U C and U 2 C for each C P k. Then the partitions {P k } k= have the following properties. P k+ is a refinement of P k, for every C P k we have diam d C δ k diam d U U 2, 2 for every distinct C, C 2 P k+ such that C, C 2 C for some C P k, we have dc, C 2 diam d C D D 2 + 2D + 2D 2 + 2 + ε. 2 It follows from 2 that for every distinct x, y U U 2 we have P k x P k y for k large enough. Thus for distinct x, y U U 2 let kx, y denote the largest integer k such that P k x = P k y. Define { diamd Pkx,y x x y, ρx, y = x = y. Then ρ is an ultrametric on U U 2. Indeed, for distinct x, y, z U U 2 let k be the largest integer such that P k x = P k y = P k z. Then k = min{kx, z, ky, z} and P k x P kx,y x, implying that ρx, y = diam d Pkx,y x diam d P k x = max{ρx, z, ρy, z}. For distinct

x, y U U 2, since x, y P kx,y x, we have ρx, y = diam d Pkx,y x dx, y, while since P kx,y+ x P kx,y+ y, we deduce from 2 that dx, y d P kx,y+ x, P kx,y+ y diam d Pkx,y x D D 2 + 2D + 2D 2 + 2 + ε = ρx, y D D 2 + 2D + 2D 2 + 2 + ε. The above argument shows that if U, U 2 are bounded, the metric space U U 2, d embeds with distortion D D 2 + 2D + 2D 2 + 2 + ε into an ultrametric space for every ε,. For possibly unbounded U, U 2 X, fix x X, and for every n N let ρ n be an ultrametric on U B d x, n U 2 B d x, n satisfying dx, y ρ n x, y D D 2 + 2D + 2D 2 + 2 + n dx, y for all x, y U B d x, n U 2 B d x, n. Define also ρ n x, y = if {x, y} is not contained in U B d x, n U 2 B d x, n. Let U be a free ultrafilter on N and set ρ x, y def = lim n U ρ nx, y. Then ρ is an ultrametric on U U 2 satisfying d ρ D D 2 + 2D + 2D 2 + 2d. Remark 4.2. There are several interesting variants of the problem studied in Theorem.4. For example, answering our initial question, Konstantin Makarychev and Yury Makarychev proved private communication that if X, d is a metric space and E, E 2 X embed into Hilbert space with distortion D then E E 2, d embeds into Hilbert space with distortion fd. Their argument crucially uses Hilbert space geometry, and therefore the following natural question remains open: if X, d is a metric space and E, E 2 X embed into L with distortion D, does it follow that E E 2, d embeds into L with distortion fd? Regarding unions of more than two subsets, perhaps even the following ambitious question has a positive answer: if E,..., E n X embed into Hilbert space with distortion D, does it follow that E... E n, d embeds into Hilbert space with distortion Olog nfd? If true, this statement in the isometric case D = would yield a very interesting strengthening of Bourgain s embedding theorem [3], which asserts that any n-point metric space embeds into Hilbert space with distortion Olog n. Remark 4.3. The following example shows that Theorem.4 is sharp up to lower order terms. Fix two integers M, N 2, and write MN = KM+N+L, where K N and L {,,..., M+N }. Consider the following two subsets of the real line: U def = U 2 def = K i= K i= {im + N, im + N +,..., im + N + M }, {im + N + M, im + N + M +,..., i + M + N }. For i {, 2}, let D i be the best possible distortion of U i with the metric inherited from R in an ultrametric space. For i, i 2 {,..., K } and j, j 2 {,..., M } define ρ i M + N + j, i 2 M + N + j 2 def = i = i 2 j = j 2, M i = i 2 j j 2, K M + N + M i i 2.

Then ρ is an ultrametric on U satisfying x y ρ x, y M x y for all x, y U. Hence D M. Similarly, for i, i 2 {,..., K } and j, j 2 {M,..., M + N } define ρ 2 i M + N + j, i 2 M + N + j 2 def = i = i 2 j = j 2, N i = i 2 j j 2, K M + N + N i i 2. Then ρ 2 is an ultrametric on U 2 satisfying x y ρ x, y N x y for all x, y U 2. Hence D 2 N. But U U 2 = {,,..., KM + N }, and hence any embedding of U U 2 into an ultrametric space incurs distortion at least KM +N see for example [2, Lem. 2.4]. Observe that this lower bound on the distortion equals MN L M N D D 2. When L = e.g., when M = N = 2S or M = 2N = 6S for some S N, the above distortion lower bound becomes MN L D + D 2 +. Thus one cannot improve the bound in Theorem.4 to D D 2, i.e., additional lower order terms are necessary. Acknowledgements. We are grateful to the following people for helpful suggestions: Keith Ball, Witold Bednorz, Rafa l Lata la, Gilles Pisier, Gideon Schechtman, Michel Talagrand. References [] Y. Bartal, N. Linial, M. Mendel, and A. Naor. On metric Ramsey-type phenomena. Ann. of Math. 2, 622:643 79, 25. [2] W. Bednorz. The measure approach to sample boundedness. Preprint, 2. [3] J. Bourgain. On Lipschitz embedding of finite metric spaces in Hilbert space. Israel J. Math., 52-2:46 52, 985. [4] J. Bourgain, T. Figiel, and V. Milman. On Hilbertian subsets of finite metric spaces. Israel J. Math., 552:47 52, 986. [5] A. Dvoretzky. Some results on convex bodies and Banach spaces. In Proc. Internat. Sympos. Linear Spaces Jerusalem, 96, pages 23 6. Jerusalem Academic Press, Jerusalem, 96. [6] X. Fernique. Évaluations de processus gaussiens composés. In Probability in Banach spaces Proc. First Internat. Conf., Oberwolfach, 975, pages 67 83. Lecture Notes in Math., Vol. 526. Springer, Berlin, 976. [7] M. Gromov. Metric structures for Riemannian and non-riemannian spaces. Modern Birkhäuser Classics. Birkhäuser Boston Inc., Boston, MA, english edition, 27. Based on the 98 French original, With appendices by M. Katz, P. Pansu and S. Semmes, Translated from the French by Sean Michael Bates. [8] W. Herer and J. P. R. Christensen. On the existence of pathological submeasures and the construction of exotic topological groups. Math. Ann., 23:23 2, 975. [9] J. D. Howroyd. On dimension and on the existence of sets of finite positive Hausdorff measure. Proc. London Math. Soc. 3, 73:58 64, 995. [] B. Hughes. Trees and ultrametric spaces: a categorical equivalence. Adv. Math., 89:48 9, 24. [] P. Mattila. Geometry of sets and measures in Euclidean spaces, volume 44 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, 995. Fractals and rectifiability. [2] M. Mendel and A. Naor. Euclidean quotients of finite metric spaces. Adv. Math., 892:45 494, 24. [3] M. Mendel and A. Naor. Ramsey partitions and proximity data structures. J. Eur. Math. Soc., 92:253 275, 27. [4] M. Mendel and A. Naor. Ultrametric subsets with large Hausdorff dimension. To appear in Invent. Math., preprint available at http://arxiv.org/abs/6.879, 2. [5] M. Talagrand. A simple example of pathological submeasure. Math. Ann., 2522:97 2, 979/8. [6] M. Talagrand. Regularity of Gaussian processes. Acta Math., 59-2:99 49, 987. [7] I. A. Vestfrid and A. F. Timan. A universality property of Hilbert spaces. Dokl. Akad. Nauk SSSR, 2463:528 53, 979. Mathematics and Computer Science Department, Open University of Israel, University Road, P.O. Box 88 Raanana 437, Israel E-mail address: mendelma@gmail.com Courant Institute, New York University, 25 Mercer Street, New York NY 2, USA E-mail address: naor@cims.nyu.edu 2