Stein s Method and Stochastic Geometry

Similar documents
Second Order Results for Nodal Sets of Gaussian Random Waves

Nodal Sets of High-Energy Arithmetic Random Waves

Kolmogorov Berry-Esseen bounds for binomial functionals

ON MEHLER S FORMULA. Giovanni Peccati (Luxembourg University) Conférence Géométrie Stochastique Nantes April 7, 2016

R. Lachieze-Rey Recent Berry-Esseen bounds obtained with Stein s method andgeorgia PoincareTech. inequalities, 1 / with 29 G.

Stein s method and weak convergence on Wiener space

NEW FUNCTIONAL INEQUALITIES

Stein s Method: Distributional Approximation and Concentration of Measure

Normal approximation of Poisson functionals in Kolmogorov distance

A Gentle Introduction to Stein s Method for Normal Approximation I

Different scaling regimes for geometric Poisson functionals

Normal approximation of geometric Poisson functionals

Stein s method, logarithmic Sobolev and transport inequalities

arxiv: v1 [math.pr] 7 May 2013

Malliavin calculus and central limit theorems

Gaussian Phase Transitions and Conic Intrinsic Volumes: Steining the Steiner formula

KOLMOGOROV DISTANCE FOR MULTIVARIATE NORMAL APPROXIMATION. Yoon Tae Kim and Hyun Suk Park

Malliavin-Stein Method in Stochastic Geometry

Stein s Method and the Zero Bias Transformation with Application to Simple Random Sampling

Stein s method and stochastic analysis of Rademacher functionals

arxiv: v3 [math.mg] 3 Nov 2017

Optimal Berry-Esseen bounds on the Poisson space

Concentration inequalities: basics and some new challenges

STEIN MEETS MALLIAVIN IN NORMAL APPROXIMATION. Louis H. Y. Chen National University of Singapore

arxiv: v2 [math.pr] 12 May 2015

Chapter 6. Stein s method, Malliavin calculus, Dirichlet forms and the fourth moment theorem

Normal approximation on Poisson spaces: Mehler s formula, second order Poincaré inequalities and stabilization arxiv: v1 [math.

Multi-dimensional Gaussian fluctuations on the Poisson space

arxiv: v1 [math.pr] 7 Sep 2018

Cumulants on the Wiener Space

The Stein and Chen-Stein Methods for Functionals of Non-Symmetric Bernoulli Processes

Poisson-Voronoi graph on a Riemannian manifold

arxiv: v3 [math.pr] 17 Apr 2012

On the spatial distribution of critical points of Random Plane Waves

A Short Introduction to Stein s Method

CAMBRIDGE TRACTS IN MATHEMATICS. General Editors B. BOLLOBÁS, W. FULTON, A. KATOK, F. KIRWAN, P. SARNAK, B. SIMON, B.TOTARO

Stein's method meets Malliavin calculus: a short survey with new estimates. Contents

The Central Limit Theorem: More of the Story

Differential Stein operators for multivariate continuous distributions and applications

Stein approximation for functionals of independent random sequences

Stein s method and zero bias transformation: Application to CDO pricing

Stein s Method: Distributional Approximation and Concentration of Measure

Stein s method on Wiener chaos

Wasserstein-2 bounds in normal approximation under local dependence

LAN property for sde s with additive fractional noise and continuous time observation

CLASSICAL AND FREE FOURTH MOMENT THEOREMS: UNIVERSALITY AND THRESHOLDS. I. Nourdin, G. Peccati, G. Poly, R. Simone

arxiv: v2 [math.pr] 22 Aug 2009

Contents. 1 Preliminaries 3. Martingales

Inference For High Dimensional M-estimates. Fixed Design Results

Concentration inequalities for non-lipschitz functions

Lecture 1 Measure concentration

NON-UNIVERSALITY OF THE NAZAROV-SODIN CONSTANT

Mod-φ convergence I: examples and probabilistic estimates

THE LINDEBERG-FELLER CENTRAL LIMIT THEOREM VIA ZERO BIAS TRANSFORMATION

ON THE STRUCTURE OF GAUSSIAN RANDOM VARIABLES

Integration by Parts and Its Applications

Gaussian Fields and Percolation

Anti-concentration Inequalities

Giovanni Peccati. 1. Introduction QUANTITATIVE CLTS ON A GAUSSIAN SPACE: A SURVEY OF RECENT DEVELOPMENTS

The Stein and Chen-Stein methods for functionals of non-symmetric Bernoulli processes

Notes on Poisson Approximation

4: The Pandemic process

Brownian motion. Samy Tindel. Purdue University. Probability Theory 2 - MA 539

Alignment processes on the sphere

Approximately Gaussian marginals and the hyperplane conjecture

Phenomena in high dimensions in geometric analysis, random matrices, and computational geometry Roscoff, France, June 25-29, 2012

Universal Gaussian fluctuations of non-hermitian matrix ensembles: From weak convergence to almost sure CLTs

An Introduction to Probability Theory and Its Applications

Normal Approximation for Hierarchical Structures

Superconcentration inequalities for centered Gaussian stationnary processes

u( x) = g( y) ds y ( 1 ) U solves u = 0 in U; u = 0 on U. ( 3)

Oberwolfach Preprints

arxiv: v2 [math.pr] 15 Nov 2016

Module 3. Function of a Random Variable and its distribution

Probability and Measure

Probability approximation by Clark-Ocone covariance representation

Lectures on Gaussian approximations with Malliavin calculus

Decouplings and applications

An Introduction to Malliavin calculus and its applications

Proving the central limit theorem

Handbook of Stochastic Methods

Quantitative stable limit theorems on the Wiener space

On the Bennett-Hoeffding inequality

Stein s Method for concentration inequalities

9 Brownian Motion: Construction

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

Inference For High Dimensional M-estimates: Fixed Design Results

Some functional (Hölderian) limit theorems and their applications (II)

Pathwise volatility in a long-memory pricing model: estimation and asymptotic behavior

STAT 200C: High-dimensional Statistics

Measures on spaces of Riemannian metrics CMS meeting December 6, 2015

Normal approximation for sums of stabilizing functionals

Harmonic Analysis. 1. Hermite Polynomials in Dimension One. Recall that if L 2 ([0 2π] ), then we can write as

On the Error Bound in the Normal Approximation for Jack Measures (Joint work with Le Van Thanh)

King s Research Portal

S chauder Theory. x 2. = log( x 1 + x 2 ) + 1 ( x 1 + x 2 ) 2. ( 5) x 1 + x 2 x 1 + x 2. 2 = 2 x 1. x 1 x 2. 1 x 1.

BOUNDARY VALUE PROBLEMS ON A HALF SIERPINSKI GASKET

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Gaussian Processes. 1. Basic Notions

Mod-φ convergence II: dependency graphs

Transcription:

1 / 39 Stein s Method and Stochastic Geometry Giovanni Peccati (Luxembourg University) Firenze 16 marzo 2018

2 / 39 INTRODUCTION Stein s method, as devised by Charles Stein at the end of the 60s, is a collection of probabilistic techniques, for measuring the distance between probability distributions, by means of characterising differential operators. Stein s motivation was to develop an effective alternative to Fourier methods, for dealing with functionals of dependent random variables.

3 / 39 INTRODUCTION Applications of Stein s method now span an enormous amount of domains, e.g.: random matrices, statistics, biology, algebra, mathematical physics, finance, geometry,... Main features: quantitative, and local to global. In these lectures: my view of Stein s method, with focus on Gaussian random fields and random geometric graphs.

4 / 39 SOME NAMES L.H.Y. Chen A.D. Barbour E. Bolthausen F. Götze L. Goldstein G. Reinert I. Nourdin S. Chatterjee

5 / 39 CONVENTIONS From now on: everything is defined on a suitable triple (Ω, F, P) We write N N (0, 1) for a standard Gaussian random variable: P(N B) = e y2 /2 dy. 2π B Often: given a random element Y, we write Y 1, Y 2,... to indicate a sequence of independent copies of Y.

6 / 39 THE (QUANTITATIVE) CENTRAL LIMIT THEOREM Theorem (CLT & Berry-Esseen bound) Let X 1, X 2,... be a sequence of independent and identically distributed r.v. s, such that E[X 1 ] = 0, and Var(X 1 ) = 1. Write Then, as n, [ ] 1 n (z) := P n S n z S n := X 1 + + X n. z e y2 /2 dy 0, z R. 2π Moreover, sup n (z) C E X 1 3 ) (0.4 < C optimal < 0.48. z n

7 / 39 FIRST PROOF: FOURIER (LYAPOUNOV, LÉVY) Write the characteristic function f n (z) of n 1/2 S n as a n- product. Prove that f n (z) exp{ z 2 /2}, as n, by a direct analytical argument.

8 / 39 SECOND PROOF: SWAPPING (LINDEBERG, TROTTER) For a smooth ϕ, with red and blue independent, write E[ϕ(N)] E[ϕ(n 1/2 S n )] = E[ϕ(n 1/2 (N 1 + + N n ))] E[ϕ(n 1/2 (X 1 + + X n )]. Deduce that: E[ϕ(N)] E[ϕ(n 1/2 S n )] Eϕ(n 1/2 (N 1 + + N i + X i+1 + + X n )) n i=1 Eϕ(n 1/2 (N 1 + + N i 1 + X i + + X n )), and control each summand by a Taylor expansion.

9 / 39 QUESTION What happens if the summands X 1, X 2,... display some form of dependence and/or the considered random element is not a linear mapping? Typical example: length, number of edges / triangles / connected components /... in a random geometric graph: Even more extreme: random graphs arising in combinatorial optimisation (MST, TSP, MM,...)

10 / 39 SETTING In what follows, I will mainly focus on one-dimensional normal approximations in the 1-Wasserstein distance W 1 (, ). Recall that W 1 (X, Y) := inf E A B A X ; B Y = sup h Lip(1) E[h(X)] E[h(Y)], whenever E X, E Y <.

11 / 39 INGREDIENTS In order to implement Stein s method, one typically needs: 1. A Lemma 2. A heuristic 3. An equation 4. Uniform bounds

12 / 39 THE LEMMA Stein s Lemma Let Z be a real-valued random variable. Then, Z N (0, 1) if and only if E[ f (Z)] = E[Z f (Z)], for every smooth f. [Proof: (= ) integration by parts. ( =) method of moments (or unicity of Fourier transform) ]

13 / 39 THE HEURISTIC Stein s Heuristic Assume Z is a real random variable such that E[ f (Z)] E[Z f (Z)] for a large class of smooth mappings f. Then, the distribution of Z has to be close to Gaussian.

14 / 39 THE EQUATION For h Lip(K) fixed and N N (0, 1), define the Stein s equation equivalent to f (x) x f (x) = h(x) E[h(N)], x R; d dx e x2 /2 f (x) = e x2 /2 (h(x) E[h(N)]). Every solution has the form f (x) = ce x2 /2 + e x2 /2 Set f h (x) := x x (h(y) E[h(N)])e y2 /2 dy, x R. (h(y) E[h(N)])e y2 /2 dy, x R.

15 / 39 THE BOUNDS By direct inspection, one proves Stein s Magic Factors and Bounds For every h Lip(K), f h C 1, and 2 f h π K. As a consequence, for X integrable, W 1 (X, N) = sup E[h(X)] E[h(N)] h Lip(1) = sup h Lip(1) sup f : f 1 E[ f h (X) X f h(x)] E[ f (X) X f (X)].

16 / 39 AND NOW? The name of the game is now to compare as sharply as possible E[ f (X)] and E[X f (X)], for every smooth mapping f. Several techniques: exchangeable pairs, dependency graphs, zero-bias transforms, size-bias transforms,...

A SIMPLE EXAMPLE: BACK TO THE CLT For a fixed n, write Z := n 1/2 (X 1 + + X n ), and Z i = Z n 1/2 X i. One has, by Taylor and independence, E[X i f (Z)] = E[X i ( f (Z) f (Z i ))] E[X i (Z Z i ) f (Z)] It follows that E[Z f (Z)] = n 1/2 i = n 1/2 E[X 2 i f (Z)]. E[X i f (Z)] E[n 1 Xi 2 f (Z)]. i By the law of large numbers, E[Z f (Z)] E[ f (Z)] for n large, and using Stein s bounds one deduces the CLT. 17 / 39

18 / 39 SECOND ORDER POINCARÉ ESTIMATES Assume now g = (g 1,..., g d ) N d (0, Id.), and define F = ψ(g 1,..., g d ), for some smooth ψ : R d R s.t. E[F] = 0 and Var(F) = 1. Remember the Poincaré inequality : Var(F) E[ ψ(g) 2 ].

SECOND ORDER POINCARÉ ESTIMATES It turns out that F verifies an exact integration by parts formula: [ ] E[F f (F)] = E f (F) ψ(g), L 1 ψ(g), where L 1 is the pseudo-inverse of the Ornstein-Uhlenbeck generator L = x, +. Plugging this into Stein s bound and applying once more Poincaré yields that, for N N (0, 1), W 1 (F, N) Var( ψ(g), L 1 ψ(g) ) 2E[ Hess ψ(g) 4 op] 1/4 E[ ψ(g) 4 ] 1/4. This is a second order Poincaré inequality, see Chatterjee (2007), Nourdin, Peccati and Reinert (2010), and Vidotto (2017). Applications in random matrix theory & analysis of fractional fields. 19 / 39

20 / 39 BEYOND GAUSSIAN Stein s approach extends to much more general densities for instance to elements of the Pearson family. See Stein s 1986 monograph. In the discrete setting, the equivalent of Stein s method is the Chen-Stein method. See the monograph by Barbour, Holst and Janson (1990).

21 / 39 TWO EXAMPLES In what follows, I will illustrate two striking applications of Stein s method, that are relevant in a geometric setting: (1) capturing the fluctuations of chaotic random variables, and (2) quantifying second order interactions. Both are connected to (generalized) integration by parts formulae.

BERRY S RANDOM WAVES (1977) Let E > 0. The Berry s random wave model on R 2, with parameter E, written B E = {B E (x) : x R 2 }, is defined as the unique (in law) centred, isotropic Gaussian field on R 2 such that B + E B = 0, where = 2 x1 2 + 2 x2 2. Equivalently, E[B E (x)b E (y)] = J 0 ( E x y ) (J 0 = Bessel function of the 1st kind). Its high-energy local behaviour is conjectured to be a universal model for Laplace eigenfunctions on arbitrary manifolds (Berry, 1977). It is the local scaling limit of monochromatic random waves on arbitrary manifolds (Canzani & Hanin, 2016). 22 / 39

N ODAL S ETS One is interested in the length L E of the nodal set (components are the nodal lines): BE 1 ({0}) Q := { x Q : BE ( x ) = 0}, where Q is e.g. a square of size 1, as E. Image: D. Belyaev (2016) 23 / 39

CHLADNI PLATES (1787) 24 / 39

25 / 39 MEAN AND VARIANCE (BERRY, 2002) Berry (J. Phys. A, 2002) : semi-rigorous computations give E[L E ] E, Var(L E ) log E, although the natural guess for the order of the variance is E. See Wigman (2010) for the spherical case. Such a variance reduction... results from a cancellation whose meaning is still obscure... (Berry (2002), p. 3032). Question: can one explain such a cancellation phenomenon, and characterise second-order fluctuations, involving the normalised length L E := L E E[L E ] Var(LE )?

26 / 39 EXPLAINING THE CANCELLATION Starting from seminal contributions by Marinucci and Wigman (2010, 2011): geometric functionals of random Laplace eigenfunctions on compact manifolds (e.g. tori and spheres) can be studied by means of Wiener-Itô chaotic decompositions and in particular by detecting specific domination effects. Such geometric functionals include: lengths of level sets, excursion areas, Euler-Poincaré characteristics, # critical points, # nodal intersections. See several works by Cammarota, Dalmao, Marinucci, Nourdin, Peccati, Rossi, Wigman,... (2010 2018). As first observed in Marinucci, Peccati, Rossi and Wigman (2016 for arithmetic waves) domination of a single chaotic projection fully explains cancellation phenomena.

27 / 39 VIGNETTE: WIENER CHAOS Consider a generic Gaussian field G = {G(u) : u U }. For every q = 0, 1, 2..., set { P q := v.s. p ( G(u 1 ),..., G(u r ) ) } : d p q. Then: P q P q+1. Define the family of orthogonal spaces {C q C 0 = R and C q := P q Pq 1 ; one has : q 0} as L 2 (σ(g)) = C q = qth Wiener chaos of G. C q. q=0

28 / 39 CHAOS AND INTEGRATION BY PARTS Elements of the Wiener chaos verify an exact integration by parts formula: for every F C q, every q 2 and every smooth f, E[F f (F)] = 1 q E[ DF 2 f (F)], where D is a generalized gradient (Malliavin derivative). This yields the striking inequality (Nourdin and Peccati, 2009): E[ f (F)] E[F f (F)] f Var(q 1 DF 2 ) 1/2 q 1 E[F 3q 4 ] 3E[F 2 ] 2.

A RIGID ASYMPTOTIC STRUCTURE For fixed q 2, let {F k : k 1} C q (with unit variance). Nourdin and Poly (2013): If F k Z, then Z has necessarily a density (and the set of possible laws for Z does not depend on G) Nualart and Peccati (2005): F k Z N (0, 1) if and only if EFk 4 3(= EZ4 ), and W 1 (F k, Z) E[Fk 4 ] 3 (Nourdin and Peccati, 2009). Peccati and Tudor (2005): Componentwise convergence to Gaussian implies joint convergence. Nourdin and Peccati (2009): F k Z 2 1 if and only if EF 4 k 12EF 3 k 36. Nourdin, Nualart and Peccati (2015): given {H k } C p, then F k, H k are asymptotically independent if and only if Cov(H 2 k, F2 k ) 0. 29 / 39

30 / 39 STATEMENT Theorem (Nourdin, Peccati and Rossi, 2017) 1. (Cancellation) For every fixed E > 0, proj(l E C 2q+1 ) = 0, q 0, and proj( L E C 2 ) reduces to a negligible boundary term, as E. 2. (4 th chaos dominates) Let E. Then, L E = proj( L E C 4 ) + o P (1). 3. (CLT) As E, L E Z N(0, 1).

O THER M ANIFOLDS? What about the high-energy behaviour of random waves on T2 and S2? Figures: A. Barnett, G. Poly and Z. Rudnick 31 / 39

32 / 39 SOME RECENT FINDINGS Similarly to planar waves, the projection of the (renormalized) nodal length on the second chaos disappears exactly, and global fluctuations are dominated (in L 2 ) by the projection on the 4th Wiener chaos. The nodal length of random spherical harmonics verifies a Gaussian CLT (Marinucci, Rossi, Wigman (2017)).

33 / 39 SOME RECENT FINDINGS The nodal length of arithmetic random waves verifies a non-central χ 2 limit theorem (Marinucci, Rossi, Peccati, Wigman (2016)), and similar results hold for nodal intersections (Dalmao, Nourdin, Peccati, Rossi, 2016), as well as for local nodal lengths above the Planck scale (Benatar, Marinucci and Wigman, 2017). Monochromatic random waves on general manifolds can be coupled with Berry s wave at small scales, so that the local length fluctuations are Gaussian (Dierinckx, Nourdin, Peccati, Rossi, 2018+).

POISSON SETTING For every t > 0, let R d A η t (A) be a Poisson random measure with intensity t Leb. Malliavin calculus and Wiener Chaos are available for η t : as in the Gaussian framework, they combine admirably well with Stein s bounds. Stochastic analysis on the Poisson space is tightly connected to add-one cost operators, defined for every x R d and every F = F(η) as D x F(η t ) := F(η t + δ x ) F(η t ). 34 / 39

35 / 39 BEYOND CHAOS The operators D x are ersatz of gradients on the Poisson space. In particular, one has the Poincaré inequality: for every F = F(η t ) Var(F) t E (D x F) 2 dx. R d In this framework, several geometric quantities naturally emerge for which there is no dominating chaotic projection : typically, characteristics of random graphs built from some intrinsic geometric rule like the nearest neighbour graph, or graphs emerging in combinatorial optimization.

36 / 39 EXAMPLE: THE NEAREST NEIGHBOUR GRAPH

36 / 39 EXAMPLE: THE NEAREST NEIGHBOUR GRAPH

37 / 39 SECOND ORDER INEQUALITIES AND STABILIZATION Second order Poincaré inequalities are available also in this framework (Last, Peccati & Schulte (2015)): [ ] W 1 (F, N) 2 E (D x F) 4 dx [ ] [ ] +E (D x F) 2 dx E (Dx,yF) 2 2 dxdy, yielding that normality arises from small local contributions, and vanishing second order interactions. Such an estimate has recently been used to recover a generalised notion of stabilising geometric functionals (Kesten and Lee, 1996; Penrose and Yukich, 2001) see Lachièze-Rey, Schulte and Yukich (2017). Applications to: Voronoi tessellations, radial graphs, volume approximations,...

38 / 39 ADVERTISING & THANKS THANK YOU FOR YOUR ATTENTION!