Speeding up Convergence to Equilibrium for Diffusion Processes

Similar documents
Lecture 12: Detailed balance and Eigenfunction methods

Effective dynamics for the (overdamped) Langevin equation

Lecture 12: Detailed balance and Eigenfunction methods

Linear Response Theory, The Green-Kubo formula and Langevin Dynamics

Markovian Approximation and Linear Response Theory for Classical Open Systems

2012 NCTS Workshop on Dynamical Systems

A Concise Course on Stochastic Partial Differential Equations

MARKOV CHAIN MONTE CARLO AND IRREVERSIBILITY

Intertwinings for Markov processes

EXPONENTIAL RETURN TO EQUILIBRIUM FOR HYPOELLIPTIC QUADRATIC SYSTEMS

Strong uniqueness for stochastic evolution equations with possibly unbounded measurable drift term

Exponential approach to equilibrium for a stochastic NLS

Nonparametric Drift Estimation for Stochastic Differential Equations

A Lévy-Fokker-Planck equation: entropies and convergence to equilibrium

dynamical zeta functions: what, why and what are the good for?

Introduction to Infinite Dimensional Stochastic Analysis

NPTEL

Convex Optimization CMU-10725

Computing ergodic limits for SDEs

Convergence to equilibrium for rough differential equations

Convergence to equilibrium of Markov processes (eventually piecewise deterministic)

Discrete Ricci curvature: Open problems

Two recent works on molecular systems out of equilibrium

EIGENVALUES AND EIGENVECTORS 3

Interest Rate Models:

Convex decay of Entropy for interacting systems

EIGENFUNCTIONS OF DIRAC OPERATORS AT THE THRESHOLD ENERGIES

degenerate Fokker-Planck equations with linear drift

Gaussian processes for inference in stochastic differential equations

In these chapter 2A notes write vectors in boldface to reduce the ambiguity of the notation.

1. Diagonalize the matrix A if possible, that is, find an invertible matrix P and a diagonal

Quadratic forms. Here. Thus symmetric matrices are diagonalizable, and the diagonalization can be performed by means of an orthogonal matrix.

Introduction to Machine Learning CMU-10701

Langevin Methods. Burkhard Dünweg Max Planck Institute for Polymer Research Ackermannweg 10 D Mainz Germany

Harnack Inequalities and Applications for Stochastic Equations

Geometric projection of stochastic differential equations

LAN property for sde s with additive fractional noise and continuous time observation

State Space Representation of Gaussian Processes

Myopic Models of Population Dynamics on Infinite Networks

Least Squares Estimators for Stochastic Differential Equations Driven by Small Lévy Noises

Lattice Gaussian Sampling with Markov Chain Monte Carlo (MCMC)

Estimates for the density of functionals of SDE s with irregular drift

Differential Equations for Dyson Processes

Markov operators, classical orthogonal polynomial ensembles, and random matrices

5 Applying the Fokker-Planck equation

Path Integral methods for solving stochastic problems. Carson C. Chow, NIH

Robust control and applications in economic theory

BOOK REVIEW. Review by Denis Bell. University of North Florida

Handbook of Stochastic Methods

COMPARISON THEOREMS FOR THE SPECTRAL GAP OF DIFFUSIONS PROCESSES AND SCHRÖDINGER OPERATORS ON AN INTERVAL. Ross G. Pinsky

Hypoelliptic multiscale Langevin diffusions and Slow fast stochastic reaction diffusion equations.

1 9/5 Matrices, vectors, and their applications

Stochastic Viral Dynamics with Beddington-DeAngelis Functional Response

LECTURE VI: SELF-ADJOINT AND UNITARY OPERATORS MAT FALL 2006 PRINCETON UNIVERSITY

M4A42 APPLIED STOCHASTIC PROCESSES

Math 108b: Notes on the Spectral Theorem

Review problems for MA 54, Fall 2004.

p 1 ( Y p dp) 1/p ( X p dp) 1 1 p

SPDEs, criticality, and renormalisation

WEYL S LEMMA, ONE OF MANY. Daniel W. Stroock

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

Analysis of SPDEs Arising in Path Sampling Part I: The Gaussian Case

5 Compact linear operators

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Irreversible Langevin samplers and variance reduction: a large deviations approach

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

LONG TIME BEHAVIOUR OF PERIODIC STOCHASTIC FLOWS.

Math Ordinary Differential Equations

06. Stochastic Processes: Concepts

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Kolmogorov Equations and Markov Processes

Newton s Method and Localization

Bulk scaling limits, open questions

The concentration of a drug in blood. Exponential decay. Different realizations. Exponential decay with noise. dc(t) dt.

1 Lyapunov theory of stability

Elliptic Operators with Unbounded Coefficients

LAN property for ergodic jump-diffusion processes with discrete observations

Random and Deterministic perturbations of dynamical systems. Leonid Koralov

Numerical methods in molecular dynamics and multiscale problems

CAAM 335 Matrix Analysis

Elementary linear algebra

Parameter Estimation for Stochastic Evolution Equations with Non-commuting Operators

A Minimal Uncertainty Product for One Dimensional Semiclassical Wave Packets

A Detailed Look at a Discrete Randomw Walk with Spatially Dependent Moments and Its Continuum Limit

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

Sampling the posterior: An approach to non-gaussian data assimilation

Potential Theory on Wiener space revisited

MA 265 FINAL EXAM Fall 2012

APPLICATIONS The eigenvalues are λ = 5, 5. An orthonormal basis of eigenvectors consists of

ORNSTEIN-UHLENBECK PROCESSES ON LIE GROUPS

Homogenization with stochastic differential equations

MATH 583A REVIEW SESSION #1

Bridging the Gap between Center and Tail for Multiscale Processes

Thermostatic Controls for Noisy Gradient Systems and Applications to Machine Learning

SYLLABUS. 1 Linear maps and matrices

A mathematical introduction to coarse-grained stochastic dynamics

Combinatorial semigroups and induced/deduced operators

22m:033 Notes: 7.1 Diagonalization of Symmetric Matrices

LAPLACIANS COMPACT METRIC SPACES. Sponsoring. Jean BELLISSARD a. Collaboration:

ECE 602 Exam 2 Solutions, 3/23/2011.

Transcription:

Speeding up Convergence to Equilibrium for Diffusion Processes G.A. Pavliotis Department of Mathematics Imperial College London Joint Work with T. Lelievre(CERMICS), F. Nier (CERMICS), M. Ottobre (Warwick), K. Pravda-Starov (Cergy) Multiscale Inverse Problems Warwick University 17/06/2013 G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 1 / 43

Nonreversible optimization of the convergence to equilibrium for diffusions with linear drift, T. Lelièvre, F. Nier, G.P. J Stat Phys. 2013. Exponential Return to Equilibrium for Quadratic Hypoelliptic Systems M. Ottobre, G.P., K. Pravda-Starov. J. Func. Analysis 262(9) pp. 4000-4039 (2012). Asymptotic Analysis for the Generalized Langevin Equation, (M. Ottobre and G. P.), Nonlinearity, 24 (2011) 1629-1653. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 2 / 43

Goal: sample from a distribution π(x) that is known only up to a constant. Construct ergodic stochastic dynamics whose invariant distribution is π(x). There are many different dynamics whose invariant distribution is given by π(x). Different discretizations of the corresponding SDE can behave very differently, even fail to converge to π(x). computational efficiency: choose the dynamics that converges to equilibrium as quickly as possible. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 3 / 43

Consider the long time asymptotics of finite dimensional Itô diffusions dx t = b(x t ) dt + σ(x t ) dw t, (1) where W t is a standard Brownian motion in R d. This is a Markov process with generator where Σ = σσ T. L = b(x) + 1 2 Tr (Σ(x)D 2), (2) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 4 / 43

In order for π(x) to be the invariant distribution of X t, we require that it is the (unique) solution of the stationary Fokker-Planck equation ( b(x)π(x) + 12 ) (Σ(x)π(x)) = 0, (3) together with appropriate boundary conditions (if we are in a bounded domain). If the detailed balance condition J s := bπ + 1 (Σπ) = 0, (4) 2 is satisfied, then the process X t is reversible wrt π(x). G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 5 / 43

A stationary diffusion process X t is reversible (wrt π(x)) if X t and X T t have the same law for t [0, T]. Time reversibility is equivalent to: The detailed balance condition (4); the generator L being self-adjoint in L 2 (R d ; π(x)) (equivalently, the Fokker-Planck operator L = ( b(x) + Σ(x)) being self-adjoint in L 2 (R d ; π 1 (x))); Xt having zero entropy production rate. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 6 / 43

There are (infinitely) many reversible diffusions that can be used in order to sample from π(x): Either fix b(x) in the detailed balance equation (4) and choose the diffusion matrix Σ(x), or, fix Σ(x) and choose the drift b(x). The rate of convergence to equilibrium depends on the tails of the distribution π(x) and on the choice of diffusion process X t (Stramer and Tweedie 1999, Bakry, Cattiaux, Guillin, 2008). The overdampled Langevin dynamics, Σ = 2I, b(x) = log π(x) is not (always) the best choice. dx t = logπ(x t ) dt + 2dW t. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 7 / 43

We can also use higher order Markovian models, for example the underdamped Langevin dynamics dq t = p t dt, dp t = log(π)(q t ) dt γp t dt + 2γ dw t. (5) The generator is L = p q + q log π(q) p + γ ( p p + p ). (6) We have convergence to the equilibrium distribution ψ (p, q) = 1 (2π) N/2 Z π(q)e p2 /2. The parameter γ > 0 in (5) can be tuned in order to optimize the rate of convergence to equilibrium. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 8 / 43

To speed up convergence to the target distribution π(x): Choose optimally (b(x), Σ(x)) within the class of reversible diffusions. Consider higher order Markovian models (underdamped Langevin, generalized Langevin equations...) and optimize over the set of parameters in these models, e.g. the friction coefficient γ. Consider time dependent temperature (simulated annealing): Holley, Kusuoka, Stroock: Asymptotics of the spectral gap with applications to the theory of simulated annealing (1989). G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 9 / 43

To speed up convergence to the target distribution π(x) (contd.): Choose an appropriate numerical scheme for the solution of the underlying SDE: Stramer, Tweedie, Langevin-type models. I. Diffusions with given stationary distributions and their discretizations. (1999). Use a Metropolis-Hastings alrgorithm (acceptance-rejection step): Stramer, Tweedie, Langevin-type models. II. Self-targeting candidates for MCMC algorithms. (1999). G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 10 / 43

Choose optimally (b(x), Σ(x)) within the class of reversible diffusions: Convergence to equilibrium is determined by the first nonzero eigenvalue of the generator of X t, which is a self-adjoint operator in L 2 (R d ;π(x)): λ 1 = Lφ, φ min φ D(L), φ const φ 2. (7) We want to maximize λ 1 over the set of drift and diffusion coefficients that satisfy the detailed balance condition (4). This leads to a max-min problem for a self-adjoint operator: λ 1 = max min b,σ,j s=0 φ D(L), φ const Lφ, φ φ 2. (8) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 11 / 43

If we stay within the class of reversible diffusions then it is sufficient to consider spectral optimization problems for selfadjoint operators. When sampling from multimodal distributions (metastability...) it is reasonable to expect that a well chosen space-dependent diffusion coefficient can speed up convergence to equilibrium. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 12 / 43

Consider the overdamped Langevin dynamics dx t = V(X t ) dt + 2β 1 dw t. (9) The law (PDF) ψ t of X t satisfies the Fokker-Planck equation t ψ t = ( Vψ t + ψ t ). (10) Under appropriate assumptions on the potential, ψ satisfies a Poincaré inequality: there exists λ > 0 such that for all probability density functions φ, R N ( ) φ 2 1 ψ dx 2 ψ λ R N ( φ ψ ) 2 ψ dx. (11) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 13 / 43

The optimal parameter λ in (11) is the spectral gap of the Fokker-Planck operator ( V + ), which is self-adjoint in L 2 (R N,ψ 1 dx). (11) is equivalent to exponential convergence to equilibrium for (9): for all initial conditions ψ 0 L 2 (R N,ψ 1 dx), for all times t 0, ψ t ψ L 2 (ψ 1 dx) e λt ψ 0 ψ L 2 (ψ 1 dx), (12) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 14 / 43

We will consider the nonreversible dynamics (Hwang et al 1993, 2005). dxt b = ( V(Xt b ) + b(xt b ) ) dt + 2 dw t, (13) where b is taken to be divergence-free with respect to the invariant distribution ψ dx: (be V) = 0. (14) This ensures that ψ (x) dx is still the invariant measure of the dynamics (13). We can construct such vector fields by taking b = J V, J = J T. (15) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 15 / 43

The dynamics (13) is non-reversible: (Xt b) 0 t T has the same law as (X b T t ) 0 t T and thus not the same law as (XT b t ) 0 t T. Equivalently, the system does not satisfy detailed balance the stationary probability flux is not zero. From (15) it is clear that there are many (in fact, infinitely many) different ways for modifying the reversible dynamics without changing the invariant measure. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 16 / 43

We ask whether the addition of a nonreversible term can improve the rate of convergence to equilibrium and, if so, whether there exists an optimal choice for the perturbation. This question leads to a min-max problem: max min Re( σ( L b ) ). b A where L b = ( V + b) +. We are perturbing a self-adjoint operator L with discrete spectrum by adding an antisymmetric operator that preserves the null space of L. We want to find the antisymmetric perturbation that maximizes the spectral gap. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 17 / 43

The reversible case is the worst (Hwang et al 2005): Convergence to equilibrium (measured in L 2 (R d, ψ 1 )) is slowest for the reversible case. In other words: the spectral gap is the smallest for the self-adoint problem. Related work on Poincare and logarithmic Sobolev inequalities for nonreversible diffusions (Arnold, Carlen, Ju 2008). Intuition: the addition of a nonreversible (Hamiltonian) part introduces a drift that can help the system escape from metastable states. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 18 / 43

We consider the nonreversible dynamics dx t = ( I + δj) V(X t ) dt + 2β 1 dw t, (16) with δ R and J the standard 2 2 antisymmetric matrix, i.e. J 12 = 1, J 21 = 1. For this class of nonreversible perturbations the parameter that we wish to choose in an optimal way is δ. However, the numerical experiments will illustrate that even a non-optimal choice of δ can significantly accelerate convergence to equilibrium. We will use the potential V(x, y) = 1 4 (x 2 1) 2 + 1 2 y 2. (17) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 19 / 43

2 1.8 δ = 0 δ =10 1.6 1.4 E(x 2 +y 2 ) 1.2 1 0.8 0.6 0.4 0.2 0 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 t Figure: Second moment as a function of time for (16) with the potential (17). We take 0 initial conditions and β 1 = 0.1. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 20 / 43

The upper bound for the reversible dynamics (9) is still valid: ψ b t ψ L 2 (ψ 1 dx) e λt ψ b 0 ψ L 2 (ψ 1 dx). (18) Adding a non-reversible part to the dynamics cannot be worse than the original dynamics (9) (where b = 0) in terms of exponential rate of convergence. Our main result is that for a linear drift it is possible to choose b in order to obtain convergence at exponential rate of the form: ψ b t ψ L 2 (ψ 1 dx) C(V, b)e λt ψ b 0 ψ L 2 (ψ 1 dx), (19) with λ > λ and C(V, b) > 1. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 21 / 43

Let V(x) = 1 2 x T Sx. The nonreversible perturbations that satisfy (14) are of the form The corresponding SDE is dx J t The invariant distribution is b(x) = Ax A = JS, with J = J T. (20) = (I + J)SX J t dt + 2 dw t. (21) ψ (x) = det(s)1/2 (2π) N/2 ( exp x T ) Sx. (22) 2 G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 22 / 43

Theorem Define B J = (I + J)S. Then max min Re(σ(B J)) = Tr(S) J A N (R) N. (23) Furthermore, one can construct matrices J opt A N (R) such that the maximum in (23) is attained. The matrix J opt can be chosen so that the semigroup associated to B Jopt satisfies the bound e (I+J opt)st C (1) N ( κ(s)1/2 exp Tr(S) N t where κ(s) = S S 1 denotes the condition number. ), (24) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 23 / 43

Theorem For B J = (I + J)S with J A N, the drift-diffusion operator L J = (B J x). + defined in L 2 (R N,ψ dx; C) with domain of definition { } D(L J ) = u L 2 (R N,ψ dx; C), L J u L 2 (R N,ψ dx; C) generates a contraction semigroup (e tl J) t 0 and it has a compact resolvent. Optimizing its spectrum with respect to J gives max min Re(σ( L J)) = Tr(S) J A N (R) N. (25) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 24 / 43

Theorem (contd.) Furthermore, the maximum in (25) is attained for the matrices J opt A N (R) constructed as before. The matrix J opt can be chosen so that ( ) etl J opt u uψ dx R N C (2) N κ(s)7/2 exp ( L 2 (ψ dx) Tr(S) N t ) ( ) u uψ dx R N holds for all u L 2 (R N,ψ dx; C) and all t 0. L 2 (ψ dx) (26) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 25 / 43

Corollary Let us consider the Fokker Planck equation associated to the dynamics (21) on X J t : t ψ J t = ( ) B J x ψt J + ψt J, (27) where B J = (I + J)S. Let us assume that ψ0 J L2 (R N,ψ 1 dx). Then, by considering J = J opt, where J opt A N (R) refers to the matrix considered in Theorem 2 to get (26). Then the inequality ψ J t ψ L 2 (ψ 1 dx) C(2) N ( κ(s)7/2 exp Tr(S) N t holds for all t 0, when ψ is defined by (22). ) ψ J 0 ψ, L 2 (ψ 1 dx) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 26 / 43

The proofs of these results consist of three steps: Solve the min-max problem for the matrix B J = (I + J)S. Calculate the spectrum of L J in terms of the eigenvalues of B J. Control the constant and optimize wrt to N. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 27 / 43

Proposition Assume that J = S 1/2 JS 1/2 A N (R) and that S S N >0 (R). Then the following two conditions are equivalent: (i) The matrix B J = S + J is diagonalizable (in C) and the spectrum of B J satisfies σ( B J ) Tr(S) + ir. (28) N (ii) There exists a real symmetric positive definite matrix Q = Q T such that JQ Q J = QS SQ + 2Tr(S) N Q. (29) G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 28 / 43

Let {λ k } N k=1 denote the (positive real) eigenvalues of Q (counted with multiplicity), and {ψ k } N k=1 the associated eigenvectors, which form an orthonormal basis of R N. Equation (29) is equivalent to the following two conditions: for all k in {1,...,N}, (ψ k, Sψ k ) R = Tr(S) (30) N and, for all j k in {1,...,N}, (λ j λ k )(ψ j, Jψ k ) R = (λ k + λ j )(ψ j, Sψ k ) R. (31) This equivalence enables us to develop an algorithm for calculating the optimal nonreversible perturbation. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 29 / 43

The calculation of the eigenvalues of the matrix in the drift is sufficient in order to calculate the spectrum of the generator. Consider the linear SDE (Ornstein-Uhlenbeck process) The generator is (with Σ = σσ T ) dx t = BX t dt + σ dw t (32) L = 1 2 Tr (ΣD 2) + Bx, (33) Σ can be degenerate, provided that L is hypoelliptic. We assume that (32) is ergodic with unique Gaussian invariant measure π(x) dx. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 30 / 43

The spectrum of L in L p (R N ;π(x)) has been calculated in Metafune, Pallara and Priola, J. Func. Analysis 196 (2002), pp. 40-60: For all p > 1 the spectrum consists of integer linear combinations of the drift matrix. It is independent of the diffusion matrix: r σ(l) = n j λ j, n j N, (34) j=1 where { } r λ j denote the r (distinct) eigenvalues of A. In j=1 particular, the spectral gap of the generator L is determined by the eigenvalues of A. A new proof of this result (for p = 2) can be found in M Ottobre, G.P., K. Pravda-Starov, J. Func. analysis 262(9), 4000-4039 (2012). A similar result can also be proved in infinite dimensions: van Neerven, Infin. Dimens. Anal. Quantum Probab. Relat. Top. 8 (2005), no. 3, 473-495. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 31 / 43

To prove the exponential convergence for the solution of the Fokker-Planck equation and to estimate the constant: Since L J is not self-adjoint, in addition to spectral information we also need estimates on the resolvent. We use the second quantization formalism (Wick calculus): L J = a,t (S J)a. (35) where a, a denote creation and annihilation operators. Use an appropriate expansion in Hermite polynomials. For the constant in front of the exponential we have that C (2) N = O(N3 ). G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 32 / 43

Algorithm for constructing the optimal nonreversible perturbation for n = 1 : N 1 do begin Start from an arbitrary orthonormal basis (ψ 1,...,ψ N ). 1 Make a permutation of (ψ n,..., ψ N ) so that (ψ n, Sψ n ) R = max k=n,...,n (ψ k, Sψ k ) R > Tr(S)/N and (ψ n+1, Sψ n+1 ) R = min k=n,...,n (ψ k, Sψ k ) R < Tr(S)/N. 2 Compute t such that ψ t = cos(t )ψ n + sin(t )ψ n+1 satisfies (ψ t, Sψ t ) R = Tr(S)/N 3 Use a Gram-Schmidt procedure to change the set of vectors (ψ t, ψ n+1,..., ψ N ) to an orthonormal basis (ψ t, ψ n+1,..., ψ N ). 4 Increase n by one and go back to step 1. end G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 33 / 43

We consider the three dimensional problem with the symmetric matrix S = diag(1, 0.1, 0.01). (36) The spectral gap of the optimally perturbed nonreversible matrix (and of the generator of the semigroup) is given by TrS 3 = 0.37, a substantial improvement over that of S, 0.01. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 34 / 43

0 10 20 30 40 50 60 70 80 90 100 10 2 10 0 10 2 10 4 B J S e tb 10 6 10 8 10 10 10 12 10 14 10 16 t Figure: Norms of the matrix exponentials for the 3 3 diagonal matrix (36) and its optimal nonreversible perturbation. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 35 / 43

We consider a 100 100 diagonal matrix with random entries, uniformly distributed on [0, 1]. For our example the minimum diagonal element (spectral gap) is 0.0012. On the contrary, the spectral gap of B J with J = J opt is 0.4762. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 36 / 43

0 10 20 30 40 50 60 70 80 90 100 10 0 B J S 10 5 e tb 10 10 10 15 10 20 t Figure: Norms of the matrix exponentials for a diagonal matrix with random uniformly distributed entries and its optimal nonreversible perturbation for N = 100. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 37 / 43

Consider a finite difference approximation of the SPDE t u = u + ξ. (37) We consider this SPDE in 1 dimension on [0, 1] with Dirichlet boundary conditions. We can use the finite difference approximation to sample approximately from an infinite dimensional Gaussian measure. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 38 / 43

0 1 2 3 4 5 6 7 8 9 10 10 2 B J 10 1 S 10 0 10 1 e tb 10 2 10 3 10 4 10 5 10 6 10 7 t Figure: Norms of the matrix exponentials for the the discrete Laplacian and its optimal nonreversible perturbation for N = 100. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 39 / 43

There are many other stochastic dynamics that can be used in order to sample from a given distribution. Consider for example the second order Langevin dynamics q = V(q) γ q + 2γβ 1Ẇ. For V(q) = 1 2 ω2 0 q2 the optimal spectral gap is (using (34)) σ opt = γ 2, forγ = 2ω 0. We can perturb the Langevin dynamics without changing the invariant measure 1 Z e βh(q,p) : q = p J V(q), ṗ = q V(q) + Jp Γp + 2Γβ 1 Ẇ. We have also considered a general symmetric drift matrix Γ instead of a scalar. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 40 / 43

q = p, ṗ = V (q) + λz, ż = αz λp + 2αẆ. (38c) Consider the three models q = V (q) + 2Ẇ, (38a) q = V (q) γ q + 2γẆ, (38b) All these three models can be used in order to sample from the distribution Z 1 e V(q). Notice that there are no control parameters in (38a), 1 (the friction coefficient) in (38b) and 2 (α and λ) in (38c). We would like to choose (α, λ) in (38c) in order to optimize the rate of convergence to equilibrium. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 41 / 43

Figure: Spectral gap as a function of α and λ. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 42 / 43

1.4 1.3 1.2 1.1 1 σ 0.9 0.8 0.7 0.6 0.5 0.4 1 2 3 4 5 6 7 8 9 10 λ Figure: Spectral gap as a function of λ with γ = λ2 α fixed. G.A. Pavliotis (Imperial College London) Nonreversible Diffusions 43 / 43