Strong Consistency of Set-Valued Frechet Sample Mean in Metric Spaces

Similar documents
Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation

Problem Set 2: Solutions Math 201A: Fall 2016

Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012

1. Supremum and Infimum Remark: In this sections, all the subsets of R are assumed to be nonempty.

Hypothesis Testing For Multilayer Network Data

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results

Applied Analysis (APPM 5440): Final exam 1:30pm 4:00pm, Dec. 14, Closed books.

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Econ Lecture 3. Outline. 1. Metric Spaces and Normed Spaces 2. Convergence of Sequences in Metric Spaces 3. Sequences in R and R n

Basic Properties of Metric and Normed Spaces

REAL VARIABLES: PROBLEM SET 1. = x limsup E k

On the convergence of sequences of random variables: A primer

02. Measure and integral. 1. Borel-measurable functions and pointwise limits

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define

converges as well if x < 1. 1 x n x n 1 1 = 2 a nx n

SEPARABILITY AND COMPLETENESS FOR THE WASSERSTEIN DISTANCE

Problem set 1, Real Analysis I, Spring, 2015.

MATHS 730 FC Lecture Notes March 5, Introduction

Weighted Network Analysis for Groups:

The Lindeberg central limit theorem

REVIEW OF ESSENTIAL MATH 346 TOPICS

1. Let A R be a nonempty set that is bounded from above, and let a be the least upper bound of A. Show that there exists a sequence {a n } n N

Closest Moment Estimation under General Conditions

The properties of L p -GMM estimators

The Arzelà-Ascoli Theorem

Math Solutions to homework 5

Math 118B Solutions. Charles Martin. March 6, d i (x i, y i ) + d i (y i, z i ) = d(x, y) + d(y, z). i=1

Solution. 1 Solution of Homework 7. Sangchul Lee. March 22, Problem 1.1

Introduction and Preliminaries

Closest Moment Estimation under General Conditions

large number of i.i.d. observations from P. For concreteness, suppose

A note on some approximation theorems in measure theory

3 Integration and Expectation

Probability and Measure

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

of the set A. Note that the cross-section A ω :={t R + : (t,ω) A} is empty when ω/ π A. It would be impossible to have (ψ(ω), ω) A for such an ω.

Spectral theory for compact operators on Banach spaces

Course 212: Academic Year Section 1: Metric Spaces

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N

1 Math 241A-B Homework Problem List for F2015 and W2016

Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications.

Analysis III Theorems, Propositions & Lemmas... Oh My!

Brownian motion. Samy Tindel. Purdue University. Probability Theory 2 - MA 539

PROBLEMS. (b) (Polarization Identity) Show that in any inner product space

Verifying Regularity Conditions for Logit-Normal GLMM

CONSISTENCY OF CHEEGER AND RATIO GRAPH CUTS

Homework for MATH 4603 (Advanced Calculus I) Fall Homework 13: Due on Tuesday 15 December. Homework 12: Due on Tuesday 8 December

Empirical Processes: General Weak Convergence Theory

1 The Glivenko-Cantelli Theorem

FACULTY OF SCIENCE FINAL EXAMINATION MATHEMATICS MATH 355. Analysis 4. Examiner: Professor S. W. Drury Date: Tuesday, April 18, 2006 INSTRUCTIONS

a) Let x,y be Cauchy sequences in some metric space. Define d(x, y) = lim n d (x n, y n ). Show that this function is well-defined.

l(y j ) = 0 for all y j (1)

Serena Doria. Department of Sciences, University G.d Annunzio, Via dei Vestini, 31, Chieti, Italy. Received 7 July 2008; Revised 25 December 2008

Normed Vector Spaces and Double Duals

Locally convex spaces, the hyperplane separation theorem, and the Krein-Milman theorem

General Theory of Large Deviations

Part III. 10 Topological Space Basics. Topological Spaces

MATH 566 LECTURE NOTES 6: NORMAL FAMILIES AND THE THEOREMS OF PICARD

Probability and Measure

Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued

Chapter 5. Weak convergence

Introduction to Topology

Lopsided Convergence: an Extension and its Quantification

ABSTRACT INTEGRATION CHAPTER ONE

Math 4121 Spring 2012 Weaver. Measure Theory. 1. σ-algebras

Lecture 5 - Hausdorff and Gromov-Hausdorff Distance

Theorems of H. Steinhaus, S. Picard and J. Smital

Real Analysis Notes. Thomas Goller

Midterm 1. Every element of the set of functions is continuous

We are now going to go back to the concept of sequences, and look at some properties of sequences in R

MATH31011/MATH41011/MATH61011: FOURIER ANALYSIS AND LEBESGUE INTEGRATION. Chapter 2: Countability and Cantor Sets

1 Continuity Classes C m (Ω)

Bonus Homework. Math 766 Spring ) For E 1,E 2 R n, define E 1 + E 2 = {x + y : x E 1,y E 2 }.

Dynkin (λ-) and π-systems; monotone classes of sets, and of functions with some examples of application (mainly of a probabilistic flavor)

1 Glivenko-Cantelli type theorems

Summary of Real Analysis by Royden

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

Math 209B Homework 2

Lebesgue s Differentiation Theorem via Maximal Functions

Part II Probability and Measure

Optimization and Optimal Control in Banach Spaces

B. Appendix B. Topological vector spaces

7 Convergence in R d and in Metric Spaces

The Kolmogorov continuity theorem, Hölder continuity, and the Kolmogorov-Chentsov theorem

Final. due May 8, 2012

212a1214Daniell s integration theory.

REAL AND COMPLEX ANALYSIS

A new look at nonnegativity on closed sets

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

Notes for Functional Analysis

Theoretical Statistics. Lecture 1.

arxiv: v2 [math.co] 11 Mar 2008

Solutions to Tutorial 8 (Week 9)

Max-min (σ-)additive representation of monotone measures

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Introduction to Real Analysis

Introduction to Hausdorff Measure and Dimension

The Uniform Weak Law of Large Numbers and the Consistency of M-Estimators of Cross-Section and Time Series Models

Transcription:

Strong Consistency of Set-Valued Frechet Sample Mean in Metric Spaces Cedric E. Ginestet Department of Mathematics and Statistics Boston University JSM 2013

The Frechet Mean Barycentre as Average Given a set of points x 1,..., x n R k ; And a set of masses w 1,..., w n R; The barycentre is x := 1 n w i x i. (1) When the distribution of mass is uniform, this is the centroid.

The Frechet Mean Barycentre as Average Given a set of points x 1,..., x n R k ; And a set of masses w 1,..., w n R; The barycentre is x := 1 n w i x i. (1) When the distribution of mass is uniform, this is the centroid. Barycentre as Minimizer Take the L 2 -metric on R k ; The barycentre is a minimizer, 1 x = argmin x R n k w i x i x 2 2. (2)

The Frechet Mean The Most Central Element Given a metric space (, d), Let an abstract-valued r.v. : (Ω, F, P) (, B), Fréchet (1948) defined the mean value, for every r 1, Θ r := arginf d(x, x ) r dµ(x). x

The Frechet Mean The Most Central Element Given a metric space (, d), Let an abstract-valued r.v. : (Ω, F, P) (, B), Fréchet (1948) defined the mean value, for every r 1, Θ r := arginf d(x, x ) r dµ(x). x Frechet Sample Mean Given a family of abstract-valued r.v.s i s, i = 1,..., n, Θ r n := arginf x 1 n d( i, x ) r. In general, Θ r, Θ r n will not be unique: i.e. Θ r, Θ r n.

Motivating Example G 1 G 2 Figure: A Sample of Two Simple Graphs, G with V (G) = 4.

Motivating Example G 1 G 2 Figure: A Sample of Two Simple Graphs, G with V (G) = 4. Hamming Distance d(g, G ) := i<j I{e ij e ij}.

Non-unique Means G 1 G 2 Figure: A Sample of Two Simple Graphs, G with V (G) = 4. Θ 1 Θ 2 Θ 3 Θ 4 Figure: Set of Frechet means (Θ) is a superset of the sampled graphs.

Questions and Assumptions Convergence of Frechet Sample Mean Θ r n := arginf x 1 n d( i, x ) r arginf x d(x, x ) r dµ(x) =: Θ r, Ziezold (1977) in separable finite metric spaces, for r = 2; Sverdrup-Thygeson (1981) in compact metric spaces, for r 1.

Questions and Assumptions Convergence of Frechet Sample Mean Θ r n := arginf x 1 n d( i, x ) r arginf x d(x, x ) r dµ(x) =: Θ r, Ziezold (1977) in separable finite metric spaces, for r = 2; Sverdrup-Thygeson (1981) in compact metric spaces, for r 1. Topological and Measure-theoretic Assumptions Separable bounded (, d). Possibly empty Θ r n, Θ r. Possibly non-unique Θ r n, Θ r.

Set-valued Outer and Inner Limits Figure: Outer and inner limits, defined wrt set inclusion on. limsup S n liminf S n limsup S n := n n=1 m=n S n, and liminf n S n := n=1 m=n S n. Outer Limit: The x s that belong to infinitely many S n. Inner Limit: The x s that belong to all but finitely many S n.

Kuratowski Upper Limit Equivalent Definitions Given a sequence of subsets A n : Set of cluster points of sequences a n A n ; The Kuratowski upper limit is { } Limsup A n := x : liminf d(x, A n) = 0. n n where liminf and Limsup are taken with respect to real numbers and subsets of, respectively. Lemma Given a metric space (, d), for any sequence of sets A n, Limsup A n = n n=1 m=n A m.

Almost Sure Consistency of Frechet Sample Mean Main Result In a separable bounded metric space (, d), For any 1,..., n be an iid sequence, Convergence of the infima: σ n r 1 := inf x n d( i, x ) r inf d(x, x ) r dx =: σ r x a.s., Convergence of the arginf s: Limsup n Θ r n Θ r a.s.. For every r > 0.

Almost Sure Uniform Weak Convergence Glivenko-Cantelli Lemma (Rao, 1962) i. Let F( ) be a class of real-valued functions on separable, ii. Let a sequence of finite measures µ n, and µ, iii. F( ) is dominated by a continuous integrable function on, iv. F( ) is equicontinuous, and v. µ n µ, a.s.; then we obtain uniform a.s. weak convergence, lim fdµ n fdµ = 0, a.s.. sup n f F

Point Functions Definition For some z, the z-point function is d z (x) := d(z, x), for every x. The class of point functions on (, d) is then denoted by D r ( ) := {d r z : z }. for every r 1. Lemma If (, d) is a bounded metric space, then for every r 1, D r ( ) is i. Uniformly bounded; ii. Uniformly equicontinuous.

Almost Sure Consistency of Frechet Sample Mean Strengthening of a.s. Weak Convergence Using the Glivenko-Cantelli lemma, we obtain for every r 1, lim sup n d r z dµ n d r z dµ z D r = 0 a.s.. where µ n := 1 n n δ i.

Almost Sure Consistency of Frechet Sample Mean Strengthening of a.s. Weak Convergence Using the Glivenko-Cantelli lemma, we obtain for every r 1, lim sup n d r z dµ n d r z dµ z D r = 0 a.s.. where µ n := 1 n n δ i. Almost Sure Convergence of σ n r In order to show that σ n r σ r a.s., We also need the following sandwich relationship, Tn r σ n r σ r Tn. r

Almost Sure Consistency of Frechet Sample Mean Sandwich Argument I By the minimality of θ Θ 2, T n (ˆθ n ) := 1 n 1 n d 2ˆθn ( i ) d 2ˆθn (x)dµ(x) d 2ˆθn ( i ) d 2 θ(x)dµ(x) =: Tn(ˆθ n ).

Almost Sure Consistency of Frechet Sample Mean Sandwich Argument I By the minimality of θ Θ 2, T n (ˆθ n ) := 1 n 1 n d 2ˆθn ( i ) d 2ˆθn (x)dµ(x) d 2ˆθn ( i ) d 2 θ(x)dµ(x) =: Tn(ˆθ n ). Sandwich Argument II By the minimality of ˆθ n Θ 2 n, T n(ˆθ n ) := 1 n 1 n d 2ˆθn ( i ) d 2 θ(x)dµ(x) d 2 θ( i ) d 2 θ(x)dµ(x) =: T n (θ).

Almost Sure Consistency of Frechet Sample Mean Main Result In a separable bounded metric space (, d), For any 1,..., n be an iid sequence, Convergence of the infima: σ n r 1 := inf x n d( i, x ) r inf d(x, x ) r dx =: σ r x a.s., Convergence of the arginf s: Limsup n Θ r n Θ r a.s.. For every r > 0.

Conclusion and Extensions Consistency of Frechet Sample Mean Under separable and bounded d. For non-unique means. For every r 1. Possible Extensions Distance functions without the triangle inequality. Non-iid random variables (See Kendall et al., 2012). Rate of convergence. Statistical inference.

Publications & Funding Agencies Papers Ginestet, C.E., Simmons, A. and Kolaczyk, E.D. (2012). Probability and Statistics Letters, 82(10), 1859 1863. Fournel, A.P., Reynaud, E., Brammer, M.J., Simmons, A. and Ginestet, C.E. (2013). Neuroimage, 76, 373 385. Ginestet, C.E. (Submitted). Strong Consistency of Fréchet Sample Mean Sets for Graph-Valued Random Variables. Ariv. Acknowledgements Eric D. Kolaczyk (Boston University) Tom Nichols and Wilfrid S. Kendall (Warwick University) Funding Agency Air Force Office for Scientific Research (AFOSR).