Consistency of Modularity Clustering on Random Geometric Graphs

Size: px
Start display at page:

Download "Consistency of Modularity Clustering on Random Geometric Graphs"

Transcription

1 Consistency of Modularity Clustering on Random Geometric Graphs Erik Davis The University of Arizona May 10, 2016

2 Outline Introduction to Modularity Clustering Pointwise Convergence Convergence of Optimal Partitions

3 Graph Clustering G = (X, W )

4 Graph Clustering G = (X, W ) clustering = partition = coloring

5 Graph Clustering G = (X, W ) clustering = partition = coloring 1 2m W ij δ(c i, c j ) where m = 1 2 W ij. i,j i,j

6 Modularity (Newman & Girvan 04) Q(U) = 1 2m where d i = j W ij, U a clustering. i,j W ij δ(c i, c j ) 1 d i d j 2m 2m δ(c i, c j ) i,j

7 Modularity Clustering Modularity Clustering: U = arg max Q(U), U K

8 Modularity Clustering Modularity Clustering: More generally (α-modularity): Q(U) = 1 2m where S = i d α i. U = arg max Q(U), U K W ij δ(c i, c j ) 1 S 2 i,j i,j d α i d α j δ(c i, c j )

9 Random Geometric Graphs Fix open D R d with Lipschitz boundary, ν = ρ(x) dx probability measure on D.

10 Random Geometric Graphs Fix open D R d with Lipschitz boundary, ν = ρ(x) dx probability measure on D. Take {X i } i N i.i.d., and X n = {X i } n i=1.

11 Random Geometric Graphs Fix open D R d with Lipschitz boundary, ν = ρ(x) dx probability measure on D. Take {X i } i N i.i.d., and X n = {X i } n i=1. To assign weights, we pick a kernel η : R d R, length scale ɛ n.

12 Random Geometric Graphs Fix open D R d with Lipschitz boundary, ν = ρ(x) dx probability measure on D. Take {X i } i N i.i.d., and X n = {X i } n i=1. To assign weights, we pick a kernel η : R d R, length scale ɛ n. W ij = { ( Xi X η j ɛ n ), if i j, 0, otherwise.

13 Random Geometric Graphs Fix open D R d with Lipschitz boundary, ν = ρ(x) dx probability measure on D. Take {X i } i N i.i.d., and X n = {X i } n i=1. To assign weights, we pick a kernel η : R d R, length scale ɛ n. ( Xi X j { 1 η W ij = ɛ d n ɛ n )=: η ɛn (X i X j ), if i j, 0, otherwise. Graph G n = (X n, W ).

14 Questions G n gives (random) modularity functional Q n.

15 Questions G n gives (random) modularity functional Q n. 1. What is the behavior of Q n as n?

16 Questions G n gives (random) modularity functional Q n. 1. What is the behavior of Q n as n? 2. What do optimal modularity clusterings look like? Un arg max Q n (U n ) U n K

17 Questions G n gives (random) modularity functional Q n. 1. What is the behavior of Q n as n? 2. What do optimal modularity clusterings look like? Un arg max Q n (U n ) U n K Consistency: Subject to certain technical assumptions, U n U where U is a partition of D characterized as the solution to a (deterministic) continuum optimization problem.

18 Consistency of Clustering Methods K-Means (Pollard 1981) Spectral Clustering (von Luxburg, Belkin, & Bosquet 2008) Modularity (Bickel & Chen 2009, Zhao, Levina, & Zhu 2012) Cheeger Cut (García-Trillos, Slepčev, von Brecht, Laurent, & Bresson 2014)

19 Outline Introduction to Modularity Clustering Pointwise Convergence Convergence of Optimal Partitions

20 Key Identity Let U n = {U n,k } K k=1 be a partition of X n, and let u n,k = 1 Un,k.

21 Key Identity Let U n = {U n,k } K k=1 be a partition of X n, and let u n,k = 1 Un,k. Then 1 1/K Q n (U n ) = 1 S 2 K ( di α [ un,k (X i ) 1/K ]) 2 k=1 + ɛ n n 2 4m i K GTV n (u n,k ), k=1 where GTV n (u) := 1 1 ɛ n n 2 η ɛn (X i X j ) u(x i ) u(x j ). i,j

22 Continuum Partitioning Domain D R d, fixed K 1, and partition U = {U k } K k=1 of D.

23 Continuum Partitioning Domain D R d, fixed K 1, and partition U = {U k } K k=1 of D. U is balanced with respect to µ if µ(u k ) = 1/K for k = 1,..., K.

24 Continuum Partitioning Domain D R d, fixed K 1, and partition U = {U k } K k=1 of D. U is balanced with respect to µ if µ(u k ) = 1/K for k = 1,..., K. The perimeter of U k in D, with respect to a weight ρ 2, is Per(U k ; ρ 2 ) = ρ 2 (x) dh d 1 (x). U k

25 Continuum Partitioning Domain D R d, fixed K 1, and partition U = {U k } K k=1 of D. U is balanced with respect to µ if µ(u k ) = 1/K for k = 1,..., K. The perimeter of U k in D, with respect to a weight ρ 2, is Per(U k ; ρ 2 ) = ρ 2 (x) dh d 1 (x). U k More generally, Per(U k ; ρ 2 ) = TV (1 Uk ; ρ 2 ) := sup 1 Uk (x)div Φ(x) dx. Φ C 1 c (D;Rd ) D Φ(x) ρ 2 (x) Remark: When f smooth, TV (f ; ρ 2 ) = D f ρ2 (x) dx.

26 Continuum Partitioning U = arg min U =K µ(u k )=1/K K Per(U k ; ρ 2 ). k=1

27 Continuum Partitioning U = arg min U =K µ(u k )=1/K K Per(U k ; ρ 2 ). k=1 (a) K = 4 (b) K = 9. (c) K = 25. Figure: Local minimizers on D = (0, 1) 2, with ρ(x) = 1, dµ = dx, produced using The Surface Evolver.

28 Pointwise Convergence A (finite perimeter) partition U of D induces a partition U n of X n D.

29 Pointwise Convergence A (finite perimeter) partition U of D induces a partition U n of X n D.

30 Pointwise Convergence A (finite perimeter) partition U of D induces a partition U n of X n D.

31 Pointwise Convergence What is the behavior of Q n as n?

32 Pointwise Convergence What is the behavior of Q n as n? Theorem (Asymptotics) Let U be a finite perimeter partition. Suppose {ɛ n} satisfies n=1 exp( nɛ(d+1)/2 n ) < when α = 0, 1 and n=1 exp( nɛd n) < otherwise.

33 Pointwise Convergence What is the behavior of Q n as n? Theorem (Asymptotics) Let U be a finite perimeter partition. Suppose {ɛ n} satisfies n=1 exp( nɛ(d+1)/2 n ) < when α = 0, 1 and n=1 exp( nɛd n) < otherwise. Then, as n, 1 1/K Q n(u n) a.s. C K η,ρ k=1 Per(U k; ρ 2 ) if ( 2 K k=1 µ(u k ) 1/K) = 0, ɛ n otherwise, where dµ(x) = ρ1+α (x) dx D ρ1+α (x) dx and Cη,ρ = R n η(x) x 1 dx 2 D ρ2 (x) dx.

34 Sketch of Proof: Convergence of Graph Total Variation Recall GTV n (u) = 1 1 ɛ n n 2 η ɛn (X i X j ) u(x i ) u(x j ). i,j

35 Sketch of Proof: Convergence of Graph Total Variation Recall GTV n (u) = 1 1 ɛ n n 2 η ɛn (X i X j ) u(x i ) u(x j ). i,j Proposition Fix u = 1 U, and let {ɛ n } n N be a sequence converging to zero such that n=1 exp( nɛ n (d+1)/2 ) < +. Then GTV n (u) a.s. σ η TV (u; ρ 2 ).

36 Sketch of Proof: Convergence of Graph Total Variation Recall GTV n (u) = 1 1 ɛ n n 2 η ɛn (X i X j ) u(x i ) u(x j ). i,j Proposition Fix u = 1 U, and let {ɛ n } n N be a sequence converging to zero such that n=1 exp( nɛ n (d+1)/2 ) < +. Then GTV n (u) a.s. σ η TV (u; ρ 2 ). Ingredients:

37 Sketch of Proof: Convergence of Graph Total Variation Recall GTV n (u) = 1 1 ɛ n n 2 η ɛn (X i X j ) u(x i ) u(x j ). i,j Proposition Fix u = 1 U, and let {ɛ n } n N be a sequence converging to zero such that n=1 exp( nɛ n (d+1)/2 ) < +. Then GTV n (u) a.s. σ η TV (u; ρ 2 ). Ingredients: Nonlocal TV (Ponce 04)

38 Sketch of Proof: Convergence of Graph Total Variation Recall GTV n (u) = 1 1 ɛ n n 2 η ɛn (X i X j ) u(x i ) u(x j ). i,j Proposition Fix u = 1 U, and let {ɛ n } n N be a sequence converging to zero such that n=1 exp( nɛ n (d+1)/2 ) < +. Then GTV n (u) a.s. σ η TV (u; ρ 2 ). Ingredients: Nonlocal TV (Ponce 04) Exponential bounds for U-statistics (Giné, Lata la, & Zinn 00)

39 Outline Introduction to Modularity Clustering Pointwise Convergence Convergence of Optimal Partitions

40 Convergence of Partitions

41 Convergence of Partitions U 3,1 U 3,2

42 Convergence of Partitions γ 3,1 P(D {0, 1})

43 Convergence of Partitions γ 3,1 P(D {0, 1}) U 1 U 2

44 Convergence of Partitions γ 3,1 P(D {0, 1}) γ 1 P(D {0, 1})

45 Convergence of Partitions Given partition U n = {U n,k } K k=1 of X n, we associate the measures γ n,k P(D {0, 1}), for k = 1,..., K, by γ n,k = 1 n n δ (Xi,1 Un,k (X i )) = (Id 1 Un,k ) ν n. i=1

46 Convergence of Partitions Given partition U n = {U n,k } K k=1 of X n, we associate the measures γ n,k P(D {0, 1}), for k = 1,..., K, by γ n,k = 1 n n δ (Xi,1 Un,k (X i )) = (Id 1 Un,k ) ν n. i=1 Given partition U = {U k } K k=1 of D, we similarly define γ k = (Id 1 Uk ) ν.

47 Convergence of Partitions Given partition U n = {U n,k } K k=1 of X n, we associate the measures γ n,k P(D {0, 1}), for k = 1,..., K, by γ n,k = 1 n n δ (Xi,1 Un,k (X i )) = (Id 1 Un,k ) ν n. i=1 Given partition U = {U k } K k=1 of D, we similarly define γ k = (Id 1 Uk ) ν. w We say that U n U if there exists a sequence {πn } n N of permutations of {1,..., K} such that γ w n,πnk γ k, for k = 1,..., K.

48 Convergence of Optimal Partitions Theorem (Convergence of Optimal Partitions) Suppose {ɛ n } n N satisfies suitable conditions. For n 1, let Un arg max U K Q n (U) be an optimal partition. If U is the unique solution (up to relabeling of its constituent sets) to the problem minimize U =K µ(u k )=1/K K Per(U k ; ρ 2 ) k=1 with dµ(x) = ρ 1+α (x) dx/ D ρ1+α (x) dx, then U n a.s. U. (P)

49 Convergence of Optimal Partitions Theorem (Convergence of Optimal Partitions) Suppose {ɛ n } n N satisfies suitable conditions. For n 1, let Un arg max U K Q n (U) be an optimal partition. If U is the unique solution (up to relabeling of its constituent sets) to the problem minimize U =K µ(u k )=1/K K Per(U k ; ρ 2 ) k=1 with dµ(x) = ρ 1+α (x) dx/ D ρ1+α (x) dx, then Un a.s. U. If there is more than one solution to (P), then almost surely i) {Un } n N has at least one cluster point, and ii) every cluster point is a solution to (P). (P)

50 Example

51 Remarks on Theorem Weak convergence of measures via Wasserstein metric.

52 Remarks on Theorem Weak convergence of measures via Wasserstein metric. Useful tool: transport maps relating empirical measures ν n to ν, with bound on -transport cost (García-Trillos, Slepčev).

53 Remarks on Theorem Weak convergence of measures via Wasserstein metric. Useful tool: transport maps relating empirical measures ν n to ν, with bound on -transport cost (García-Trillos, Slepčev). Because modularity clusterings are optimizers of discrete energies, we use Γ-convergence to prove that their limit is the optimizer of a continuum energy.

54 Remarks on Theorem Weak convergence of measures via Wasserstein metric. Useful tool: transport maps relating empirical measures ν n to ν, with bound on -transport cost (García-Trillos, Slepčev). Because modularity clusterings are optimizers of discrete energies, we use Γ-convergence to prove that their limit is the optimizer of a continuum energy. Balance constraint in the continuum problem presents a technical difficulty.

55 Remarks on Theorem Weak convergence of measures via Wasserstein metric. Useful tool: transport maps relating empirical measures ν n to ν, with bound on -transport cost (García-Trillos, Slepčev). Because modularity clusterings are optimizers of discrete energies, we use Γ-convergence to prove that their limit is the optimizer of a continuum energy. Balance constraint in the continuum problem presents a technical difficulty. We modified the notion of Γ-convergence for random functionals to allow the use of our pointwise convergence result.

56 Existence of Transport Maps Let D R d be open, connected with Lipschitz boundary. Assume ν = ρ(x) dx with ρ continuous and bounded above/below by positive constants. Proposition (García-Trillos, Slepčev 14) There is a constant C > 0 such that, with probability one, there exists a sequence of transportation maps {T n} n N, T n ν = ν n with lim sup n lim sup n lim sup n n 1/2 Id T n (2 log log n) 1/2 C (d = 1), n 1/d Id T n (log n) 3/4 C (d = 2), n 1/d Id T n (log n) 1/d C (d 3).

57 Assumption on ɛ n Assume {ɛ n } n N is such that ɛ n > 0 and ɛ n 0. For α = 0, 1, assume that 2 log log n 1 lim = 0, if d = 1 n n ɛ n For α 0, 1, assume that (log n) 3/4 1 lim = 0 n n 1/2 ɛ n if d = 2 (log n) 1/d 1 lim = 0 n n 1/d ɛ n if d 3. n=1 n exp( nɛ d+1 n ) <.

58 The role of α (a) Level sets (b) α = 1 (c) α = 0 (d) α = 1 dµ(x) ρ(x) 1+α, ρ(x) min(2 exp( 4 x x 0 2 ), 1/2).

59 Thanks for coming!

CONSISTENCY OF CHEEGER AND RATIO GRAPH CUTS

CONSISTENCY OF CHEEGER AND RATIO GRAPH CUTS CONSISTENCY OF CHEEGER AN RATIO GRAPH CUTS NICOLÁS GARCÍA TRILLOS 1, EJAN SLEPČEV 1, JAMES VON BRECHT 2, THOMAS LAURENT 3, XAVIER BRESSON 4. ABSTRACT. This paper establishes the consistency of a family

More information

Regularizing objective functionals in semi-supervised learning

Regularizing objective functionals in semi-supervised learning Regularizing objective functionals in semi-supervised learning Dejan Slepčev Carnegie Mellon University February 9, 2018 1 / 47 References S,Thorpe, Analysis of p-laplacian regularization in semi-supervised

More information

Consistency of Cheeger and Ratio Graph Cuts

Consistency of Cheeger and Ratio Graph Cuts Journal of Machine Learning Research 17 (2016) 1-46 Submitted 11/14; Revised 2/16; Published 10/16 Consistency of Cheeger and Ratio Graph Cuts Nicolás García Trillos ejan Slepčev epartment of Mathematical

More information

CONTINUUM LIMIT OF TOTAL VARIATION ON POINT CLOUDS

CONTINUUM LIMIT OF TOTAL VARIATION ON POINT CLOUDS CONTINUUM LIMIT OF TOTAL VARIATION ON POINT CLOUS NICOLÁS GARCÍA TRILLOS AN EJAN SLEPČEV ABSTRACT. We consider point clouds obtained as random samples of a measure on a Euclidean domain. A graph representing

More information

The Lindeberg central limit theorem

The Lindeberg central limit theorem The Lindeberg central limit theorem Jordan Bell jordan.bell@gmail.com Department of Mathematics, University of Toronto May 29, 205 Convergence in distribution We denote by P d the collection of Borel probability

More information

A spectral clustering algorithm based on Gram operators

A spectral clustering algorithm based on Gram operators A spectral clustering algorithm based on Gram operators Ilaria Giulini De partement de Mathe matiques et Applications ENS, Paris Joint work with Olivier Catoni 1 july 2015 Clustering task of grouping

More information

Lecture No 2 Degenerate Diffusion Free boundary problems

Lecture No 2 Degenerate Diffusion Free boundary problems Lecture No 2 Degenerate Diffusion Free boundary problems Columbia University IAS summer program June, 2009 Outline We will discuss non-linear parabolic equations of slow diffusion. Our model is the porous

More information

2 Lebesgue integration

2 Lebesgue integration 2 Lebesgue integration 1. Let (, A, µ) be a measure space. We will always assume that µ is complete, otherwise we first take its completion. The example to have in mind is the Lebesgue measure on R n,

More information

Concentration for Coulomb gases and Coulomb transport inequalities

Concentration for Coulomb gases and Coulomb transport inequalities Concentration for Coulomb gases and Coulomb transport inequalities Mylène Maïda U. Lille, Laboratoire Paul Painlevé Joint work with Djalil Chafaï and Adrien Hardy U. Paris-Dauphine and U. Lille ICERM,

More information

Inference For High Dimensional M-estimates. Fixed Design Results

Inference For High Dimensional M-estimates. Fixed Design Results : Fixed Design Results Lihua Lei Advisors: Peter J. Bickel, Michael I. Jordan joint work with Peter J. Bickel and Noureddine El Karoui Dec. 8, 2016 1/57 Table of Contents 1 Background 2 Main Results and

More information

Priors for the frequentist, consistency beyond Schwartz

Priors for the frequentist, consistency beyond Schwartz Victoria University, Wellington, New Zealand, 11 January 2016 Priors for the frequentist, consistency beyond Schwartz Bas Kleijn, KdV Institute for Mathematics Part I Introduction Bayesian and Frequentist

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued

Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and

More information

CMS winter meeting 2008, Ottawa. The heat kernel on connected sums

CMS winter meeting 2008, Ottawa. The heat kernel on connected sums CMS winter meeting 2008, Ottawa The heat kernel on connected sums Laurent Saloff-Coste (Cornell University) December 8 2008 p(t, x, y) = The heat kernel ) 1 x y 2 exp ( (4πt) n/2 4t The heat kernel is:

More information

Lecture I: Asymptotics for large GUE random matrices

Lecture I: Asymptotics for large GUE random matrices Lecture I: Asymptotics for large GUE random matrices Steen Thorbjørnsen, University of Aarhus andom Matrices Definition. Let (Ω, F, P) be a probability space and let n be a positive integer. Then a random

More information

Geometric Inference for Probability distributions

Geometric Inference for Probability distributions Geometric Inference for Probability distributions F. Chazal 1 D. Cohen-Steiner 2 Q. Mérigot 2 1 Geometrica, INRIA Saclay, 2 Geometrica, INRIA Sophia-Antipolis 2009 July 8 F. Chazal, D. Cohen-Steiner, Q.

More information

Measure and Integration: Solutions of CW2

Measure and Integration: Solutions of CW2 Measure and Integration: s of CW2 Fall 206 [G. Holzegel] December 9, 206 Problem of Sheet 5 a) Left (f n ) and (g n ) be sequences of integrable functions with f n (x) f (x) and g n (x) g (x) for almost

More information

Class 2 & 3 Overfitting & Regularization

Class 2 & 3 Overfitting & Regularization Class 2 & 3 Overfitting & Regularization Carlo Ciliberto Department of Computer Science, UCL October 18, 2017 Last Class The goal of Statistical Learning Theory is to find a good estimator f n : X Y, approximating

More information

Geometric inference for measures based on distance functions The DTM-signature for a geometric comparison of metric-measure spaces from samples

Geometric inference for measures based on distance functions The DTM-signature for a geometric comparison of metric-measure spaces from samples Distance to Geometric inference for measures based on distance functions The - for a geometric comparison of metric-measure spaces from samples the Ohio State University wan.252@osu.edu 1/31 Geometric

More information

GRADIENT FLOWS FOR NON-SMOOTH INTERACTION POTENTIALS

GRADIENT FLOWS FOR NON-SMOOTH INTERACTION POTENTIALS GRADIENT FLOWS FOR NON-SMOOTH INTERACTION POTENTIALS J. A. CARRILLO, S. LISINI, E. MAININI Abstract. We deal with a nonlocal interaction equation describing the evolution of a particle density under the

More information

Weak quenched limiting distributions of a one-dimensional random walk in a random environment

Weak quenched limiting distributions of a one-dimensional random walk in a random environment Weak quenched limiting distributions of a one-dimensional random walk in a random environment Jonathon Peterson Cornell University Department of Mathematics Joint work with Gennady Samorodnitsky September

More information

Invariances in spectral estimates. Paris-Est Marne-la-Vallée, January 2011

Invariances in spectral estimates. Paris-Est Marne-la-Vallée, January 2011 Invariances in spectral estimates Franck Barthe Dario Cordero-Erausquin Paris-Est Marne-la-Vallée, January 2011 Notation Notation Given a probability measure ν on some Euclidean space, the Poincaré constant

More information

Probability and Measure

Probability and Measure Probability and Measure Robert L. Wolpert Institute of Statistics and Decision Sciences Duke University, Durham, NC, USA Convergence of Random Variables 1. Convergence Concepts 1.1. Convergence of Real

More information

Limits of Spectral Clustering

Limits of Spectral Clustering Limits of Spectral Clustering Ulrike von Luxburg and Olivier Bousquet Max Planck Institute for Biological Cybernetics Spemannstr. 38, 72076 Tübingen, Germany {ulrike.luxburg,olivier.bousquet}@tuebingen.mpg.de

More information

Information theoretic perspectives on learning algorithms

Information theoretic perspectives on learning algorithms Information theoretic perspectives on learning algorithms Varun Jog University of Wisconsin - Madison Departments of ECE and Mathematics Shannon Channel Hangout! May 8, 2018 Jointly with Adrian Tovar-Lopez

More information

Statistical Properties of Numerical Derivatives

Statistical Properties of Numerical Derivatives Statistical Properties of Numerical Derivatives Han Hong, Aprajit Mahajan, and Denis Nekipelov Stanford University and UC Berkeley November 2010 1 / 63 Motivation Introduction Many models have objective

More information

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N Problem 1. Let f : A R R have the property that for every x A, there exists ɛ > 0 such that f(t) > ɛ if t (x ɛ, x + ɛ) A. If the set A is compact, prove there exists c > 0 such that f(x) > c for all x

More information

TUM 2016 Class 1 Statistical learning theory

TUM 2016 Class 1 Statistical learning theory TUM 2016 Class 1 Statistical learning theory Lorenzo Rosasco UNIGE-MIT-IIT July 25, 2016 Machine learning applications Texts Images Data: (x 1, y 1 ),..., (x n, y n ) Note: x i s huge dimensional! All

More information

Learning Patterns for Detection with Multiscale Scan Statistics

Learning Patterns for Detection with Multiscale Scan Statistics Learning Patterns for Detection with Multiscale Scan Statistics J. Sharpnack 1 1 Statistics Department UC Davis UC Davis Statistics Seminar 2018 Work supported by NSF DMS-1712996 Hyperspectral Gas Detection

More information

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by L p Functions Given a measure space (, µ) and a real number p [, ), recall that the L p -norm of a measurable function f : R is defined by f p = ( ) /p f p dµ Note that the L p -norm of a function f may

More information

Lecture No 1 Introduction to Diffusion equations The heat equat

Lecture No 1 Introduction to Diffusion equations The heat equat Lecture No 1 Introduction to Diffusion equations The heat equation Columbia University IAS summer program June, 2009 Outline of the lectures We will discuss some basic models of diffusion equations and

More information

MLCC Clustering. Lorenzo Rosasco UNIGE-MIT-IIT

MLCC Clustering. Lorenzo Rosasco UNIGE-MIT-IIT MLCC 2018 - Clustering Lorenzo Rosasco UNIGE-MIT-IIT About this class We will consider an unsupervised setting, and in particular the problem of clustering unlabeled data into coherent groups. MLCC 2018

More information

Convergence of spectral measures and eigenvalue rigidity

Convergence of spectral measures and eigenvalue rigidity Convergence of spectral measures and eigenvalue rigidity Elizabeth Meckes Case Western Reserve University ICERM, March 1, 2018 Macroscopic scale: the empirical spectral measure Macroscopic scale: the empirical

More information

Metric Spaces and Topology

Metric Spaces and Topology Chapter 2 Metric Spaces and Topology From an engineering perspective, the most important way to construct a topology on a set is to define the topology in terms of a metric on the set. This approach underlies

More information

Triangular matrices and biorthogonal ensembles

Triangular matrices and biorthogonal ensembles /26 Triangular matrices and biorthogonal ensembles Dimitris Cheliotis Department of Mathematics University of Athens UK Easter Probability Meeting April 8, 206 2/26 Special densities on R n Example. n

More information

Uniform Convergence Examples

Uniform Convergence Examples Uniform Convergence Examples James K. Peterson Department of Biological Sciences and Department of Mathematical Sciences Clemson University October 13, 2017 Outline More Uniform Convergence Examples Example

More information

On some properties of modular function spaces

On some properties of modular function spaces Diana Caponetti University of Palermo, Italy Joint work with G. Lewicki Integration Vector Measures and Related Topics VI Bȩdlewo, June 15-21, 2014 Aim of this talk is to introduce modular function spaces

More information

Negative Association, Ordering and Convergence of Resampling Methods

Negative Association, Ordering and Convergence of Resampling Methods Negative Association, Ordering and Convergence of Resampling Methods Nicolas Chopin ENSAE, Paristech (Joint work with Mathieu Gerber and Nick Whiteley, University of Bristol) Resampling schemes: Informal

More information

Definition 6.1. A metric space (X, d) is complete if every Cauchy sequence tends to a limit in X.

Definition 6.1. A metric space (X, d) is complete if every Cauchy sequence tends to a limit in X. Chapter 6 Completeness Lecture 18 Recall from Definition 2.22 that a Cauchy sequence in (X, d) is a sequence whose terms get closer and closer together, without any limit being specified. In the Euclidean

More information

7 Complete metric spaces and function spaces

7 Complete metric spaces and function spaces 7 Complete metric spaces and function spaces 7.1 Completeness Let (X, d) be a metric space. Definition 7.1. A sequence (x n ) n N in X is a Cauchy sequence if for any ɛ > 0, there is N N such that n, m

More information

Free Entropy for Free Gibbs Laws Given by Convex Potentials

Free Entropy for Free Gibbs Laws Given by Convex Potentials Free Entropy for Free Gibbs Laws Given by Convex Potentials David A. Jekel University of California, Los Angeles Young Mathematicians in C -algebras, August 2018 David A. Jekel (UCLA) Free Entropy YMC

More information

Fast learning rates for plug-in classifiers under the margin condition

Fast learning rates for plug-in classifiers under the margin condition Fast learning rates for plug-in classifiers under the margin condition Jean-Yves Audibert 1 Alexandre B. Tsybakov 2 1 Certis ParisTech - Ecole des Ponts, France 2 LPMA Université Pierre et Marie Curie,

More information

Relations Between Adjacency And Modularity Graph Partitioning: Principal Component Analysis vs. Modularity Component Analysis

Relations Between Adjacency And Modularity Graph Partitioning: Principal Component Analysis vs. Modularity Component Analysis Relations Between Adjacency And Modularity Graph Partitioning: Principal Component Analysis vs. Modularity Component Analysis Hansi Jiang Carl Meyer North Carolina State University October 27, 2015 1 /

More information

Susceptible-Infective-Removed Epidemics and Erdős-Rényi random

Susceptible-Infective-Removed Epidemics and Erdős-Rényi random Susceptible-Infective-Removed Epidemics and Erdős-Rényi random graphs MSR-Inria Joint Centre October 13, 2015 SIR epidemics: the Reed-Frost model Individuals i [n] when infected, attempt to infect all

More information

Nonparametric estimation under Shape Restrictions

Nonparametric estimation under Shape Restrictions Nonparametric estimation under Shape Restrictions Jon A. Wellner University of Washington, Seattle Statistical Seminar, Frejus, France August 30 - September 3, 2010 Outline: Five Lectures on Shape Restrictions

More information

Asymptotic results for empirical measures of weighted sums of independent random variables

Asymptotic results for empirical measures of weighted sums of independent random variables Asymptotic results for empirical measures of weighted sums of independent random variables B. Bercu and W. Bryc University Bordeaux 1, France Workshop on Limit Theorems, University Paris 1 Paris, January

More information

Convergence of Feller Processes

Convergence of Feller Processes Chapter 15 Convergence of Feller Processes This chapter looks at the convergence of sequences of Feller processes to a iting process. Section 15.1 lays some ground work concerning weak convergence of processes

More information

CHAPTER V DUAL SPACES

CHAPTER V DUAL SPACES CHAPTER V DUAL SPACES DEFINITION Let (X, T ) be a (real) locally convex topological vector space. By the dual space X, or (X, T ), of X we mean the set of all continuous linear functionals on X. By the

More information

L p Spaces and Convexity

L p Spaces and Convexity L p Spaces and Convexity These notes largely follow the treatments in Royden, Real Analysis, and Rudin, Real & Complex Analysis. 1. Convex functions Let I R be an interval. For I open, we say a function

More information

Ergodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R.

Ergodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R. Ergodic Theorems Samy Tindel Purdue University Probability Theory 2 - MA 539 Taken from Probability: Theory and examples by R. Durrett Samy T. Ergodic theorems Probability Theory 1 / 92 Outline 1 Definitions

More information

Problem List MATH 5143 Fall, 2013

Problem List MATH 5143 Fall, 2013 Problem List MATH 5143 Fall, 2013 On any problem you may use the result of any previous problem (even if you were not able to do it) and any information given in class up to the moment the problem was

More information

Introduction to Bandit Algorithms. Introduction to Bandit Algorithms

Introduction to Bandit Algorithms. Introduction to Bandit Algorithms Stochastic K-Arm Bandit Problem Formulation Consider K arms (actions) each correspond to an unknown distribution {ν k } K k=1 with values bounded in [0, 1]. At each time t, the agent pulls an arm I t {1,...,

More information

1 The classification of Monotone sets in the Heisenberg group

1 The classification of Monotone sets in the Heisenberg group 1 The classification of Monotone sets in the Heisenberg group after J. Cheeger and B. Kleiner [1] A summary written by Jeehyeon Seo Abstract We show that a monotone set E in the Heisenberg group H is either

More information

Asymptotic results for empirical measures of weighted sums of independent random variables

Asymptotic results for empirical measures of weighted sums of independent random variables Asymptotic results for empirical measures of weighted sums of independent random variables B. Bercu and W. Bryc University Bordeaux 1, France Seminario di Probabilità e Statistica Matematica Sapienza Università

More information

Integration on Measure Spaces

Integration on Measure Spaces Chapter 3 Integration on Measure Spaces In this chapter we introduce the general notion of a measure on a space X, define the class of measurable functions, and define the integral, first on a class of

More information

Learning Spectral Graph Segmentation

Learning Spectral Graph Segmentation Learning Spectral Graph Segmentation AISTATS 2005 Timothée Cour Jianbo Shi Nicolas Gogin Computer and Information Science Department University of Pennsylvania Computer Science Ecole Polytechnique Graph-based

More information

Stationary states for the aggregation equation with power law attractive-repulsive potentials

Stationary states for the aggregation equation with power law attractive-repulsive potentials Stationary states for the aggregation equation with power law attractive-repulsive potentials Daniel Balagué Joint work with J.A. Carrillo, T. Laurent and G. Raoul Universitat Autònoma de Barcelona BIRS

More information

Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood

Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood Spatially Smoothed Kernel Density Estimation via Generalized Empirical Likelihood Kuangyu Wen & Ximing Wu Texas A&M University Info-Metrics Institute Conference: Recent Innovations in Info-Metrics October

More information

Gradient Estimation for Attractor Networks

Gradient Estimation for Attractor Networks Gradient Estimation for Attractor Networks Thomas Flynn Department of Computer Science Graduate Center of CUNY July 2017 1 Outline Motivations Deterministic attractor networks Stochastic attractor networks

More information

RATE-OPTIMAL GRAPHON ESTIMATION. By Chao Gao, Yu Lu and Harrison H. Zhou Yale University

RATE-OPTIMAL GRAPHON ESTIMATION. By Chao Gao, Yu Lu and Harrison H. Zhou Yale University Submitted to the Annals of Statistics arxiv: arxiv:0000.0000 RATE-OPTIMAL GRAPHON ESTIMATION By Chao Gao, Yu Lu and Harrison H. Zhou Yale University Network analysis is becoming one of the most active

More information

Estimation of large dimensional sparse covariance matrices

Estimation of large dimensional sparse covariance matrices Estimation of large dimensional sparse covariance matrices Department of Statistics UC, Berkeley May 5, 2009 Sample covariance matrix and its eigenvalues Data: n p matrix X n (independent identically distributed)

More information

1 Matrix notation and preliminaries from spectral graph theory

1 Matrix notation and preliminaries from spectral graph theory Graph clustering (or community detection or graph partitioning) is one of the most studied problems in network analysis. One reason for this is that there are a variety of ways to define a cluster or community.

More information

X n D X lim n F n (x) = F (x) for all x C F. lim n F n(u) = F (u) for all u C F. (2)

X n D X lim n F n (x) = F (x) for all x C F. lim n F n(u) = F (u) for all u C F. (2) 14:17 11/16/2 TOPIC. Convergence in distribution and related notions. This section studies the notion of the so-called convergence in distribution of real random variables. This is the kind of convergence

More information

Myopic Models of Population Dynamics on Infinite Networks

Myopic Models of Population Dynamics on Infinite Networks Myopic Models of Population Dynamics on Infinite Networks Robert Carlson Department of Mathematics University of Colorado at Colorado Springs rcarlson@uccs.edu June 30, 2014 Outline Reaction-diffusion

More information

R. Lachieze-Rey Recent Berry-Esseen bounds obtained with Stein s method andgeorgia PoincareTech. inequalities, 1 / with 29 G.

R. Lachieze-Rey Recent Berry-Esseen bounds obtained with Stein s method andgeorgia PoincareTech. inequalities, 1 / with 29 G. Recent Berry-Esseen bounds obtained with Stein s method and Poincare inequalities, with Geometric applications Raphaël Lachièze-Rey, Univ. Paris 5 René Descartes, Georgia Tech. R. Lachieze-Rey Recent Berry-Esseen

More information

Can we do statistical inference in a non-asymptotic way? 1

Can we do statistical inference in a non-asymptotic way? 1 Can we do statistical inference in a non-asymptotic way? 1 Guang Cheng 2 Statistics@Purdue www.science.purdue.edu/bigdata/ ONR Review Meeting@Duke Oct 11, 2017 1 Acknowledge NSF, ONR and Simons Foundation.

More information

Asymptotics for posterior hazards

Asymptotics for posterior hazards Asymptotics for posterior hazards Pierpaolo De Blasi University of Turin 10th August 2007, BNR Workshop, Isaac Newton Intitute, Cambridge, UK Joint work with Giovanni Peccati (Université Paris VI) and

More information

AW -Convergence and Well-Posedness of Non Convex Functions

AW -Convergence and Well-Posedness of Non Convex Functions Journal of Convex Analysis Volume 10 (2003), No. 2, 351 364 AW -Convergence Well-Posedness of Non Convex Functions Silvia Villa DIMA, Università di Genova, Via Dodecaneso 35, 16146 Genova, Italy villa@dima.unige.it

More information

Beyond incoherence and beyond sparsity: compressed sensing in the real world

Beyond incoherence and beyond sparsity: compressed sensing in the real world Beyond incoherence and beyond sparsity: compressed sensing in the real world Clarice Poon 1st November 2013 University of Cambridge, UK Applied Functional and Harmonic Analysis Group Head of Group Anders

More information

Matrix estimation by Universal Singular Value Thresholding

Matrix estimation by Universal Singular Value Thresholding Matrix estimation by Universal Singular Value Thresholding Courant Institute, NYU Let us begin with an example: Suppose that we have an undirected random graph G on n vertices. Model: There is a real symmetric

More information

Convergence of Multivariate Quantile Surfaces

Convergence of Multivariate Quantile Surfaces Convergence of Multivariate Quantile Surfaces Adil Ahidar Institut de Mathématiques de Toulouse - CERFACS August 30, 2013 Adil Ahidar (Institut de Mathématiques de Toulouse Convergence - CERFACS) of Multivariate

More information

Empirical Processes: General Weak Convergence Theory

Empirical Processes: General Weak Convergence Theory Empirical Processes: General Weak Convergence Theory Moulinath Banerjee May 18, 2010 1 Extended Weak Convergence The lack of measurability of the empirical process with respect to the sigma-field generated

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics

More information

Statistical Estimation: Data & Non-data Information

Statistical Estimation: Data & Non-data Information Statistical Estimation: Data & Non-data Information Roger J-B Wets University of California, Davis & M.Casey @ Raytheon G.Pflug @ U. Vienna, X. Dong @ EpiRisk, G-M You @ EpiRisk. a little background Decision

More information

Neighbourhoods of Randomness and Independence

Neighbourhoods of Randomness and Independence Neighbourhoods of Randomness and Independence C.T.J. Dodson School of Mathematics, Manchester University Augment information geometric measures in spaces of distributions, via explicit geometric representations

More information

On the definitions of Sobolev and BV spaces into singular spaces and the trace problem

On the definitions of Sobolev and BV spaces into singular spaces and the trace problem On the definitions of Sobolev and BV spaces into singular spaces and the trace problem David CHIRON Laboratoire J.A. DIEUDONNE, Université de Nice - Sophia Antipolis, Parc Valrose, 68 Nice Cedex 2, France

More information

Statistical inference on Lévy processes

Statistical inference on Lévy processes Alberto Coca Cabrero University of Cambridge - CCA Supervisors: Dr. Richard Nickl and Professor L.C.G.Rogers Funded by Fundación Mutua Madrileña and EPSRC MASDOC/CCA student workshop 2013 26th March Outline

More information

Geometric Inference for Probability distributions

Geometric Inference for Probability distributions Geometric Inference for Probability distributions F. Chazal 1 D. Cohen-Steiner 2 Q. Mérigot 2 1 Geometrica, INRIA Saclay, 2 Geometrica, INRIA Sophia-Antipolis 2009 June 1 Motivation What is the (relevant)

More information

Data dependent operators for the spatial-spectral fusion problem

Data dependent operators for the spatial-spectral fusion problem Data dependent operators for the spatial-spectral fusion problem Wien, December 3, 2012 Joint work with: University of Maryland: J. J. Benedetto, J. A. Dobrosotskaya, T. Doster, K. W. Duke, M. Ehler, A.

More information

Decomposition Methods for Representations of Quivers appearing in Topological Data Analysis

Decomposition Methods for Representations of Quivers appearing in Topological Data Analysis Decomposition Methods for Representations of Quivers appearing in Topological Data Analysis Erik Lindell elindel@kth.se SA114X Degree Project in Engineering Physics, First Level Supervisor: Wojtek Chacholski

More information

VISCOSITY SOLUTIONS. We follow Han and Lin, Elliptic Partial Differential Equations, 5.

VISCOSITY SOLUTIONS. We follow Han and Lin, Elliptic Partial Differential Equations, 5. VISCOSITY SOLUTIONS PETER HINTZ We follow Han and Lin, Elliptic Partial Differential Equations, 5. 1. Motivation Throughout, we will assume that Ω R n is a bounded and connected domain and that a ij C(Ω)

More information

However, in actual topology a distance function is not used to define open sets.

However, in actual topology a distance function is not used to define open sets. Chapter 10 Dimension Theory We are used to the notion of dimension from vector spaces: dimension of a vector space V is the minimum number of independent bases vectors needed to span V. Therefore, a point

More information

Reproducing Kernel Hilbert Spaces

Reproducing Kernel Hilbert Spaces Reproducing Kernel Hilbert Spaces Lorenzo Rosasco 9.520 Class 03 February 9, 2011 About this class Goal To introduce a particularly useful family of hypothesis spaces called Reproducing Kernel Hilbert

More information

Local Density Approximation for the Almost-bosonic Anyon Gas. Michele Correggi

Local Density Approximation for the Almost-bosonic Anyon Gas. Michele Correggi Local Density Approximation for the Almost-bosonic Anyon Gas Michele Correggi Università degli Studi Roma Tre www.cond-math.it QMATH13 Many-body Systems and Statistical Mechanics joint work with D. Lundholm

More information

Inference for High Dimensional Robust Regression

Inference for High Dimensional Robust Regression Department of Statistics UC Berkeley Stanford-Berkeley Joint Colloquium, 2015 Table of Contents 1 Background 2 Main Results 3 OLS: A Motivating Example Table of Contents 1 Background 2 Main Results 3 OLS:

More information

Inverse problems Total Variation Regularization Mark van Kraaij Casa seminar 23 May 2007 Technische Universiteit Eindh ove n University of Technology

Inverse problems Total Variation Regularization Mark van Kraaij Casa seminar 23 May 2007 Technische Universiteit Eindh ove n University of Technology Inverse problems Total Variation Regularization Mark van Kraaij Casa seminar 23 May 27 Introduction Fredholm first kind integral equation of convolution type in one space dimension: g(x) = 1 k(x x )f(x

More information

Inverse Power Method for Non-linear Eigenproblems

Inverse Power Method for Non-linear Eigenproblems Inverse Power Method for Non-linear Eigenproblems Matthias Hein and Thomas Bühler Anubhav Dwivedi Department of Aerospace Engineering & Mechanics 7th March, 2017 1 / 30 OUTLINE Motivation Non-Linear Eigenproblems

More information

Convex Optimization in Computer Vision:

Convex Optimization in Computer Vision: Convex Optimization in Computer Vision: Segmentation and Multiview 3D Reconstruction Yiyong Feng and Daniel P. Palomar The Hong Kong University of Science and Technology (HKUST) ELEC 5470 - Convex Optimization

More information

The Dirichlet boundary problems for second order parabolic operators satisfying a Carleson condition

The Dirichlet boundary problems for second order parabolic operators satisfying a Carleson condition The Dirichlet boundary problems for second order parabolic operators satisfying a Carleson condition Sukjung Hwang CMAC, Yonsei University Collaboration with M. Dindos and M. Mitrea The 1st Meeting of

More information

A Conditional Approach to Modeling Multivariate Extremes

A Conditional Approach to Modeling Multivariate Extremes A Approach to ing Multivariate Extremes By Heffernan & Tawn Department of Statistics Purdue University s April 30, 2014 Outline s s Multivariate Extremes s A central aim of multivariate extremes is trying

More information

COMPLETE METRIC SPACES AND THE CONTRACTION MAPPING THEOREM

COMPLETE METRIC SPACES AND THE CONTRACTION MAPPING THEOREM COMPLETE METRIC SPACES AND THE CONTRACTION MAPPING THEOREM A metric space (M, d) is a set M with a metric d(x, y), x, y M that has the properties d(x, y) = d(y, x), x, y M d(x, y) d(x, z) + d(z, y), x,

More information

An inverse of Sanov s theorem

An inverse of Sanov s theorem An inverse of Sanov s theorem Ayalvadi Ganesh and Neil O Connell BRIMS, Hewlett-Packard Labs, Bristol Abstract Let X k be a sequence of iid random variables taking values in a finite set, and consider

More information

Measures on spaces of Riemannian metrics CMS meeting December 6, 2015

Measures on spaces of Riemannian metrics CMS meeting December 6, 2015 Measures on spaces of Riemannian metrics CMS meeting December 6, 2015 D. Jakobson (McGill), jakobson@math.mcgill.ca [CJW]: Y. Canzani, I. Wigman, DJ: arxiv:1002.0030, Jour. of Geometric Analysis, 2013

More information

Chapter 4: Modelling

Chapter 4: Modelling Chapter 4: Modelling Exchangeability and Invariance Markus Harva 17.10. / Reading Circle on Bayesian Theory Outline 1 Introduction 2 Models via exchangeability 3 Models via invariance 4 Exercise Statistical

More information

NEW FUNCTIONAL INEQUALITIES

NEW FUNCTIONAL INEQUALITIES 1 / 29 NEW FUNCTIONAL INEQUALITIES VIA STEIN S METHOD Giovanni Peccati (Luxembourg University) IMA, Minneapolis: April 28, 2015 2 / 29 INTRODUCTION Based on two joint works: (1) Nourdin, Peccati and Swan

More information

Learning Theory. Ingo Steinwart University of Stuttgart. September 4, 2013

Learning Theory. Ingo Steinwart University of Stuttgart. September 4, 2013 Learning Theory Ingo Steinwart University of Stuttgart September 4, 2013 Ingo Steinwart University of Stuttgart () Learning Theory September 4, 2013 1 / 62 Basics Informal Introduction Informal Description

More information

A Resampling Method on Pivotal Estimating Functions

A Resampling Method on Pivotal Estimating Functions A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation

More information

Spectral theory for compact operators on Banach spaces

Spectral theory for compact operators on Banach spaces 68 Chapter 9 Spectral theory for compact operators on Banach spaces Recall that a subset S of a metric space X is precompact if its closure is compact, or equivalently every sequence contains a Cauchy

More information

III. Quantum ergodicity on graphs, perspectives

III. Quantum ergodicity on graphs, perspectives III. Quantum ergodicity on graphs, perspectives Nalini Anantharaman Université de Strasbourg 24 août 2016 Yesterday we focussed on the case of large regular (discrete) graphs. Let G = (V, E) be a (q +

More information

8.1 Concentration inequality for Gaussian random matrix (cont d)

8.1 Concentration inequality for Gaussian random matrix (cont d) MGMT 69: Topics in High-dimensional Data Analysis Falll 26 Lecture 8: Spectral clustering and Laplacian matrices Lecturer: Jiaming Xu Scribe: Hyun-Ju Oh and Taotao He, October 4, 26 Outline Concentration

More information

l 1 -Regularized Linear Regression: Persistence and Oracle Inequalities

l 1 -Regularized Linear Regression: Persistence and Oracle Inequalities l -Regularized Linear Regression: Persistence and Oracle Inequalities Peter Bartlett EECS and Statistics UC Berkeley slides at http://www.stat.berkeley.edu/ bartlett Joint work with Shahar Mendelson and

More information