arxiv: v1 [math.pr] 5 Dec 2018

Similar documents
arxiv: v2 [math.pr] 9 Mar 2018

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

arxiv: v1 [math.ap] 28 Aug 2018

2012 NCTS Workshop on Dynamical Systems

EXPOSITORY NOTES ON DISTRIBUTION THEORY, FALL 2018

Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games

Some SDEs with distributional drift Part I : General calculus. Flandoli, Franco; Russo, Francesco; Wolf, Jochen

The Dirichlet s P rinciple. In this lecture we discuss an alternative formulation of the Dirichlet problem for the Laplace equation:

Laplace s Equation. Chapter Mean Value Formulas

An introduction to Birkhoff normal form

for all subintervals I J. If the same is true for the dyadic subintervals I D J only, we will write ϕ BMO d (J). In fact, the following is true

LECTURE 1: SOURCES OF ERRORS MATHEMATICAL TOOLS A PRIORI ERROR ESTIMATES. Sergey Korotov,

S chauder Theory. x 2. = log( x 1 + x 2 ) + 1 ( x 1 + x 2 ) 2. ( 5) x 1 + x 2 x 1 + x 2. 2 = 2 x 1. x 1 x 2. 1 x 1.

SHARP BOUNDARY TRACE INEQUALITIES. 1. Introduction

MATH 205C: STATIONARY PHASE LEMMA

Analysis in weighted spaces : preliminary version

Dynamical systems with Gaussian and Levy noise: analytical and stochastic approaches

ξ,i = x nx i x 3 + δ ni + x n x = 0. x Dξ = x i ξ,i = x nx i x i x 3 Du = λ x λ 2 xh + x λ h Dξ,

MATH 425, FINAL EXAM SOLUTIONS

Regularity for Poisson Equation

Smooth Structure. lies on the boundary, then it is determined up to the identifications it 1 2

Metastability for interacting particles in double-well potentials and Allen Cahn SPDEs

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

Equilibria with a nontrivial nodal set and the dynamics of parabolic equations on symmetric domains

On the fast convergence of random perturbations of the gradient flow.

Metastability for the Ginzburg Landau equation with space time white noise

Traces, extensions and co-normal derivatives for elliptic systems on Lipschitz domains

VISCOSITY SOLUTIONS. We follow Han and Lin, Elliptic Partial Differential Equations, 5.

POINTWISE BOUNDS ON QUASIMODES OF SEMICLASSICAL SCHRÖDINGER OPERATORS IN DIMENSION TWO

Math The Laplacian. 1 Green s Identities, Fundamental Solution

Topics in Harmonic Analysis Lecture 1: The Fourier transform

2 A Model, Harmonic Map, Problem

We denote the space of distributions on Ω by D ( Ω) 2.

Propagating terraces and the dynamics of front-like solutions of reaction-diffusion equations on R

On semilinear elliptic equations with measure data

The continuity method

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

1 Lyapunov theory of stability

Harmonic Functions and Brownian motion

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

A LOCALIZATION PROPERTY AT THE BOUNDARY FOR MONGE-AMPERE EQUATION

Random and Deterministic perturbations of dynamical systems. Leonid Koralov

SPECTRAL PROPERTIES OF THE LAPLACIAN ON BOUNDED DOMAINS

Eilenberg-Steenrod properties. (Hatcher, 2.1, 2.3, 3.1; Conlon, 2.6, 8.1, )

Whitney s Extension Problem for C m

INTRODUCTION TO REAL ANALYTIC GEOMETRY

Green s Functions and Distributions

Separation of Variables in Linear PDE: One-Dimensional Problems

(x k ) sequence in F, lim x k = x x F. If F : R n R is a function, level sets and sublevel sets of F are any sets of the form (respectively);

A random perturbation approach to some stochastic approximation algorithms in optimization.

u xx + u yy = 0. (5.1)

Topics in Harmonic Analysis Lecture 6: Pseudodifferential calculus and almost orthogonality

at time t, in dimension d. The index i varies in a countable set I. We call configuration the family, denoted generically by Φ: U (x i (t) x j (t))

Non-degeneracy of perturbed solutions of semilinear partial differential equations

Some lecture notes for Math 6050E: PDEs, Fall 2016

2. Function spaces and approximation

and BV loc R N ; R d)

Lecture 21 Representations of Martingales

Pseudo-Poincaré Inequalities and Applications to Sobolev Inequalities

DYNAMICAL CUBES AND A CRITERIA FOR SYSTEMS HAVING PRODUCT EXTENSIONS

On the Ladyzhenskaya Smagorinsky turbulence model of the Navier Stokes equations in smooth domains. The regularity problem

WEYL S LEMMA, ONE OF MANY. Daniel W. Stroock

Poisson Equation on Closed Manifolds

Implicit Functions, Curves and Surfaces

Applied Math Qualifying Exam 11 October Instructions: Work 2 out of 3 problems in each of the 3 parts for a total of 6 problems.

Problem: A class of dynamical systems characterized by a fast divergence of the orbits. A paradigmatic example: the Arnold cat.

Vector Spaces. Vector space, ν, over the field of complex numbers, C, is a set of elements a, b,..., satisfying the following axioms.

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

Non-degeneracy of perturbed solutions of semilinear partial differential equations

arxiv: v1 [math.oc] 22 Sep 2016

Existence and Regularity of Stable Branched Minimal Hypersurfaces

The optimal partial transport problem

SPECTRAL GAP FOR ZERO-RANGE DYNAMICS. By C. Landim, S. Sethuraman and S. Varadhan 1 IMPA and CNRS, Courant Institute and Courant Institute

Lecture 12. F o s, (1.1) F t := s>t

Exponential Mixing Properties of Stochastic PDEs Through Asymptotic Coupling

i=1 α i. Given an m-times continuously

JUHA KINNUNEN. Harmonic Analysis

DRIFT OF SPECTRALLY STABLE SHIFTED STATES ON STAR GRAPHS

ON THE FOLIATION OF SPACE-TIME BY CONSTANT MEAN CURVATURE HYPERSURFACES

SYMMETRY RESULTS FOR PERTURBED PROBLEMS AND RELATED QUESTIONS. Massimo Grosi Filomena Pacella S. L. Yadava. 1. Introduction

MINIMAL SURFACES AND MINIMIZERS OF THE GINZBURG-LANDAU ENERGY

Stability of Stochastic Differential Equations

n 2 xi = x i. x i 2. r r ; i r 2 + V ( r) V ( r) = 0 r > 0. ( 1 1 ) a r n 1 ( 1 2) V( r) = b ln r + c n = 2 b r n 2 + c n 3 ( 1 3)

TOOLS FROM HARMONIC ANALYSIS

Kramers formula for chemical reactions in the context of Wasserstein gradient flows. Michael Herrmann. Mathematical Institute, University of Oxford

APPLICATIONS OF DIFFERENTIABILITY IN R n.

INTEGRATION ON MANIFOLDS and GAUSS-GREEN THEOREM

Spline Element Method for Partial Differential Equations

Empirical Processes: General Weak Convergence Theory

ATTRACTORS FOR SEMILINEAR PARABOLIC PROBLEMS WITH DIRICHLET BOUNDARY CONDITIONS IN VARYING DOMAINS. Emerson A. M. de Abreu Alexandre N.

Chapter One. The Calderón-Zygmund Theory I: Ellipticity

Lecture No 1 Introduction to Diffusion equations The heat equat

1 Directional Derivatives and Differentiability

New Helmholtz-Weyl decomposition in L r and its applications to the mathematical fluid mechanics

Piecewise Smooth Solutions to the Burgers-Hilbert Equation

Hofer s Proof of the Weinstein Conjecture for Overtwisted Contact Structures Julian Chaidez

Hypoelliptic multiscale Langevin diffusions and Slow fast stochastic reaction diffusion equations.

Residence-time distributions as a measure for stochastic resonance

Riesz potentials and nonlinear parabolic equations

The Dirichlet boundary problems for second order parabolic operators satisfying a Carleson condition

Transcription:

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS FRAYDOUN REZAKHANLOU AND INSUK SEO arxiv:82.02069v math.pr] 5 Dec 208 Abstract. In this article, we prove that a small random perturbation of dynamical system with multiple stable equilibria converges to a Markov chain whose states are neighborhoods of the deepest stable equilibria, under a suitable time-rescaling, provided that the perturbed dynamics is reversible in time. Such a result has been anticipated from 970s, when the foundation of mathematical treatment for this problem has been established by Freidlin and Wentzell. We solve this long-standing problem by reducing the entire analysis to an investigation of the solution of an associated Poisson equation, and furthermore provide a method to carry out this analysis by using well-known test functions in a novel manner.. Introduction Dynamical systems that are perturbed by small random noises are known to exhibit metastable behavior. There have been numerous progresses in the last two decades on the rigorous verification of metastability for a class of models that are collectively known as Small Random Perturbation of Dynamical System (SRPDS). In this introductory section, we briefly review some of the existing results on SRPDS, and describe the main contribution of this article. We refer to a classical monograph 2] and a recent monograph 9] for the comprehensive discussion on SRPDS... Small random perturbation of dynamical systems: historical review. Consider a dynamical system given by the ordinary differential equation in R d dx(t) = b(x(t))dt, (.) where b : R d R d is a smooth vector field. Suppose that this dynamical system owns multiple stable equilibria as illustrated in Figure., and consider the random dynamical system obtained by perturbing (.) with a small Brownian noise. Such a random dynamical system is defined by a stochastic differential equation of the form dx ɛ (t) = b(x ɛ (t))dt + 2ɛ dw t ; t 0, (.2) where (w t : t 0) is the standard d-dimensional Brownian motion, and ɛ > 0 is a small positive parameter representing the magnitude of the noise. Suppose now that the diffusion

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 2 Figure.. The flow chart of the dynamical systems dx(t) = b(x(t))dt with three stable equilibria. There are four unstable equilibria as well. process x ɛ (t) starts from a neighborhood of a stable equilibrium of the unperturbed dynamics (.). Then, because of the small random noise, one can expect that the perturbed dynamics (.2) exhibits a rare transition from this starting neighborhood to another one around different stable equilibrium. This is a typical metastable or tunneling transition and its quantitative analysis was originated from Freidlin and Wentzell 2, 3, 4]. However, beyond the large-deviation type estimate that was obtained by Freidlin and Wentzell (explained below), not much is known about the precise nature of the metastable behavior of the model (.2), unless the drift b is a gradient vector field. For instance, we do not know of any sharp asymptotic for the expectation of the metastable transition time..2. Small random perturbation of dynamical systems: gradient model. Suppose that the vector field b in (.2) can be expressed as b = U, for a smooth potential function U : R d R. In other words, the stochastic differential equation (.2) is of the form dx ɛ (t) = U(x ɛ (t))dt + 2ɛ dw t ; t 0. (.3) In particular, if the function U( ) has several local minima as illustrated in Figure., then the dynamical system associated with the unperturbed equation dx(t) = U(x(t))dt, has multiple stable equilibria, and hence the diffusion process (x ɛ (t) : t 0) is destined to exhibit a metastable behavior. In order to explain some of the classical results obtained in 2, 3] by Freidlin and Wentzell in its simplest form, let us assume that U is a double-well potential. That is, the function U has exactly two local minima m and m 2, and a saddle point σ between them, as illustrated

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 3 Figure.2. Potential U with two global minima m and m2 (left) and multiple global minima (right). in Figure.2-(left). For such a choice of U, the diffusion x, wonders mostly in one of the two potential wells surrounding m and m2, and occasionally makes transitions from one well to the other. To understand the metastable nature of x qualitatively, we analyze the asymptotic behavior of the transition time of x between the two potential wells. Writing τ for the time that it takes for x (t) to reach a small ball around m2, we wish to estimate the mean transition time E m τ ], where E m denotes the expectation with respect to the law of x (t) starting from m. Freidlin and Wentzell in 2, 3] establishes a large-deviation type estimate of the form U (σ) U (m ) log E m τ ] ' as 0. (.4) For the precise metastable behavior of x, we need to go beyond (.4) and evaluate the low limit of ( ) U (σ) U (m ) Em τ ] exp. This was achieved by Bovier et. al. in 8] by verifying a classical conjecture of Eyring ] and Kramers 8]. By developing a robust methodology which is now known as the potential theoretic approach, Bovier et. al. derive an Eyring-Kramers type formula in the form E m τ ] 2π ' λσ v u u det( 2 U )(σ) t ( U (σ) U (m ) exp 2 det( U )(m ) ) as 0, (.5) provided that the Hessians of U at m, m2, and σ are non-degenerate, ( 2 U )(σ) has a unique negative eigenvalue λσ, and some additional technical assumptions on U (corresponding to (2.) and (2.2) of the current paper) are valid. It is also verified in the same work that τ /E m τ ] converges to the mean-one exponential random variable. Similar formulas can be derived when U has multiple local minima as in Figure.2 (right)..3. Main result. We starts with an informal explanation of our main result when U is a double-well potential with U (m ) = U (m2 ). Heuristically speaking, the process starting

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 4 from a neighborhood of m makes a transition to that of m 2 after an exponentially long time, as suggested by (.4). After spending another exponentially long time, the process makes a transition back to the neighborhood of m. These tunneling-type transitions take place repeatedly and may be explained in terms of a Markov chain among two valleys around m and m 2. More generally, if U has several global minima as in Figure.2 (right), then the successive inter-valley dynamics seems to be approximated by a Markov chain whose states are the deepest valleys of U. In spite of the appeal of the above heuristic description, and its consistency with (.4), its rigorous verification for our process (.3) was not known before. In the main result of the current paper (Theorem 2.3), we show that after a rescaling of time, a finite state Markov chain governs the inner-valley dynamics of x ɛ..4. Scaling limit of metastable random processes. The most natural way to describe the inter-valley dynamics of metastable random processes is to demonstrate that their scaling limits are governed by finite state Markov chains whose jump rates are evaluated with the aid of Eyring-Kramers type formulas. Recently, there have been numerous active researches toward this direction, especially when the underlying metastable process lives in a discrete space. Beltran and Landim in 2, 3] provide a general framework, known as the martingale approach to obtain the scaling limit of metastable Markov chains. This method is quite robust and has been applied to a wide scope of metastable processes including the condensing zerorange processes, 4, 20, 32], the condensing simple inclusion processes 5, 7], the random walks in potential fields 24, 25], and the Potts models 26, 29]. The method of Beltran and Landim relies on a careful analysis of the so-called trace process. A trace process is obtained from the original process by turning off the clock when the process is not in a suitable neighborhood of a stable equilibrium. However, as Landim pointed out in 2], it is not clear how to apply this methodology when the underlying metastable process is a diffusion. In this paper, instead of modifying the approach outlined in 2, 3], we appeal to an entirely new method that is a refinement of a scheme that was utilized in 0, 3]. We establish the metastable behavior of our diffusion x ɛ by analyzing the solutions of certain classes of Poisson equations related to its infinitesimal generator. Theorem 4. is the main step of our approach and will play an essential role in the proof of our main result Theorem 2.3. The proof of Theorem 4. is to some extent model-dependent, though the deduction of the main result from this Theorem is robust and applicable to many other examples. Hence, we hope that our work reveals the importance of studying the Poisson equation of type (4.2) below in the study of scaling limit of metastable random processes.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 5.5. Non-gradient model. As we mentioned earlier, except for the exponential estimate similar to (.4), the analog of (.5) is not known for the general case (.2). Even for (.4), the term U(σ) U(m ) on the right-hand side is replaced with the so-called quasi-potential V (σ; m ). For the sake of comparison, let us describe three simplifying features of the diffusion (.3) that play essential roles in our work: The quasi-potential function governing the rare behaviors of the process (.3) is given by U. In general, the quasi-potential V is given by a variational principle in a suitable function space. For the metastability questions, we need to study the regularity of this quasi-potential that in general is a very delicate issue. The diffusion x ɛ of the equation (.3) admits an invariant measure with a density of the form Zɛ exp { U/ɛ}. For the general case, no explicit formula for the invariant measure is expected. The invariant measure density is specified as the unique solution of an elliptic PDE associated with the adjoint of the generator of (.2). The diffusion x ɛ of the equation (.3) is reversible with respect to its invariant measure. This is no longer the case for non-gradient models. The main tool for proving the Eyring-Kramers formula for the gradient model (.2) in 8] is the potential theory associated with reversible processes. Of course the special form of the invariant measure is also critically used, and hence its extension to general case requires non-trivial additional work. Recently, in 23] a potential theory for non-reversible processes is obtained, and accordingly the Eyring-Kramers formula is extended to a class of non-reversible diffusions with Gibbsian invariant measures. This result offers a meaningful advance to the general case. The current work can be regarded as an entirely new alternative approach to the general case. Comparing to previous approaches, the main difference of ours is the fact that we do not rely on potential theory, especially the estimation of the capacity. Hence our approach does not rely on the reversibility of the process x ɛ. Keeping in mind that one of main challenge of the non-reversible case is the estimation of the capacity between valleys, the methodology adopted in the current paper appears to be well-suited for treating nonreversible models. This possibility is partially verified in 27] by Landim and an author of the current paper. In this work, the scaling limit for the diffusion x ɛ of the equation (.2) on a circle is obtained. It is worth mentioning that in the case of a circle, many simplifications and explicit computations are available. Nonetheless, the results of 27] demonstrates that the Eyring-Kramers formula as well as the limiting Markov chain are very different from the reversible case, and many peculiar features are observed.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 6 Figure 2.. Shadow area represents Ω. For this case S = {, 2, 3, 4, 5}, S = {σ, σ 2, σ 3, σ 4, σ 5 }, and M = {m, m 2 }. 2. Model and Main result Our main interest in this paper is the metastable behavior of the diffusion process (.3) when the potential function U has multiple global minima. In Section 2., we explain basic assumptions on U and the geometric structure of its graph related to the metastable valleys and saddle points between them. In Section 2.2 some elementary results about the invariant measure of the process (.3) is recalled. Finally, in Section 2.3 we describe the main result of the paper, which is a convergence theorem for the metastable process (.3). We remark that the presentation and the result in the current section are similar to a discrete counterpart model considered in 24], though our proof of the main result is entirely different from the one that is presented therein. 2.. Potential function and its landscape. We shall consider the potential function U : R d R that belongs to C 2 (R d ), satisfying the growth condition U(x) lim x x =, (2.) and the tightness condition e U(x)/ɛ dx C a e a/ɛ for all a R and ɛ (0, ], (2.2) {x:u(x) a} where C a, a R, is a constant that depends on a, but not on ɛ. These two conditions are required to confine the process x ɛ (t) in a compact region with high probability. The metastable behavior of our model critically depends on the graphical structures of the level sets of the potential function U. To guarantee the occurrence of a metastable behavior of the type we have described in Section, we need to make some standard assumptions on U. We refer to Figure 2. for the visualization of some the notations that appear in the rest of the current section.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 7 2... Structure of the metastable wells. Fix H R and let S = {σ, σ 2,, σ L } be the set of saddle point of U with height H, i.e., U(σ ) = U(σ 2 ) = = U(σ L ) = H. Denote by W,, W K the connected components of the set Ω = {x : U(x) < H}. (2.3) Let us write S = {, 2,, K}. By the growth condition (2.), all the sets W i, i S, are bounded. We assume that Ω = i S W i is a connected set, where A represents the topological closure of the set A R d. Let h i, i S, be the minimum of the function U in the well W i. We regard H h i as the depth of the well W i. Define h = min h i (2.4) i S and let S = {i S : h i = h} S. (2.5) Note that the collection {W i : i S } represents the set of deepest wells. The purpose of the current article is to describe the metastable behavior of the diffusion process x ɛ (t) among these deepest wells. For a non-trivial result, we assume that S 2. Remark 2.. When the set Ω is not connected, we can still apply our result to each connected component to get the metastability among the neighborhood of this component. In order to deduce the global result instead, one must find a larger H to unify the connected components. Because of this, our assumptions are quite general. For the details for such a multi-scale analysis, we refer to 24]. 2..2. Assumptions on the critical points of U. For i S, define M i = {m W i : U(m) = h i } which represents the set of minima of U in the set W i. We assume that M i is a finite set for all i S. Define M = M i and M = M i, (2.6) i S i S so that the set M denotes the set of global minima of U. We assume that those critical points of U that belong to M S are non-degenerate, i.e., the Hessian of U is invertible at each point of M S. Furthermore, we assume that the Hessian ( 2 U)(σ) has one negative eigenvalue and (d ) positive eigenvalues for all σ S. These assumptions are standard in the study of metastability (cf. 8, 23, 24, 25]). In particular, they are satisfied if the function U is a Morse function.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 8 2..3. Metastable valleys. Fix a small constant a > 0 such that there is no critical point c of U satisfying U(c) H a, H). For i S, denote by Wi o the unique connected component of the level set {x : U(x) < H a} which is a subset of W i. We write B(x, r) for the ball of radius r > 0 centered at x R d, i.e., B(x, r) = {y R d : x y < r}. (2.7) Assume that r 0 > 0 is small enough so that the ball B(m, r 0 ) does not contain any critical points of U other than m, and B(m, r 0 ) i S Wi o for all m M. For i S, the metastable valley corresponding to the well W i is defined by V i = B(m, r 0 ). (2.8) m M i Finally, we write V = i S V i, and = R d \ V. (2.9) 2.2. Invariant measure. The generator corresponding to the diffusion process x ɛ (t) that satisfies the stochastic differential equation (.3), can be written as L ɛ = ɛ U = ɛ e U(x)/ɛ e U(x)/ɛ ]. From this, it is not hard to show that the invariant measure for the process x ɛ ( ) is given by µ ɛ (dx) = Z ɛ e U(x)/ɛ dx := ˆµ ɛ (x) dx (2.0) where Z ɛ is the partition function defined by Z ɛ = e U(x)/ɛ dx <. R d Notice that Z ɛ is finite because of (2.2). Define ν i = m M i det( 2 U)(m) for i S and ν = j S ν j. (2.) We state some asymptotic results for the partition function Z ɛ and the invariant measure µ ɛ ( ). We write o ɛ () for a term that vanishes as ɛ 0. Proposition 2.2. It holds that Z ɛ = ( + o ɛ ()) (2πɛ) d/2 e h/ɛ ν, (2.2) µ ɛ (V i ) = ( + o ɛ ()) ν i ν for i S, and µ ɛ ( ) = o ɛ (). (2.3) Proof. By Laplace s method, we can deduce that, for i S, µ ɛ (V i ) = Z ɛ ( + o ɛ ()) (2πɛ) d/2 e h/ɛ ν i. (2.4)

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 9 On the other hand, by (2.2), we have Now, (2.2) follows from (2.4) and (2.5) since = µ ɛ ( ) + µ ɛ ( ) = Z ɛ o ɛ () ɛ d/2 e h/ɛ. (2.5) i S µ ɛ (V i ). Finally, (2.3) is obtained by inserting (2.2) into (2.4) and (2.5), respectively. 2.3. Main result. The metastable behavior of the process x ɛ (t) is a consequence of its convergence to a Markov chain y(t) on S in a proper sense, as is explained in Section 2.3.3 below. The Markov chain y(t) is defined in Section 2.3.2, based on an auxiliary Markov chain x(t) on S that is introduced below. 2.3.. Markov chain x(t) on S. For a saddle point σ S, we write λ σ for the unique negative eigenvalue of the Hessian ( 2 U)(σ), and define ω σ = λ σ 2π det( 2 U)(σ). For distinct i, j S, let S i, j be the set of saddle points between wells W i and W j in the sense that S i, j = W i W j S. Define ω i, j = ω σ. σ S i, j For convenience, we set ω i, i = 0 for all i S. For i S, we define ω i = ω i, j and µ(i) = ω i /( ω j ). j S j S We have ω i > 0 since the set Ω is connected by our assumption. Denote by {x(t) : t 0} the continuous time Markov chain on S whose jump rate from i S to j S is given by ω i, j /µ(i). For i S, denote by P i the law of the Markov chain x(t) starting from i. Notice that this Markov chain is reversible with respect to the probability measure µ( ). The generator L x corresponding to the chain x(t) can be written as, (L x f)(i) = ω i, j f(j) f(i)] ; i S, j S µ(i) for f R S. Define, for f, g R S, D x (f, g) = µ(i) f(i)( L x g)(i) = i S 2 ω i, j f(j) f(i)] g(j) g(i)]. (2.6) i, j S

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 0 Then, D x (f, f) represents the Dirichlet form associated with the chain x(t). Now we define the equilibrium potential and the capacity corresponding to the chain x(t). For A S, denote by H A the hitting time of the set A, i.e., H A = inf{t 0 : x(t) A}. For two non-empty disjoint subsets A and B of S, define a function h A, B : S 0, ] by h A, B (i) = P i (H A < H B ). (2.7) The function h A, B is called the equilibrium potential between two sets A and B with respect to the Markov chain x(t). One of the notable fact about the equilibrium potential is that, h A, B can be characterized as the unique solution of the following equation: (L x h A, B )(i) = 0 for all i (A B) c, h A, B (a) = for all a A, (2.8) h A, B (b) = 0 for all b B. The capacity between these two sets A and B is now defined as cap x (A, B) = D x (h A, B, h A, B ). 2.3.2. Markov chain y(t) on S. For distinct i, j S, define β i, j = 2 cap x({i}, S \ {i}) + cap x ({j}, S \ {j}) cap x ({i, j}, S \ {i, j})] (2.9) and set β i, i = 0 for all i S. Note that β i, j = β j, i for all i, j S. Recall ν i from (2.) and let {y(t) : t 0} be a continuous time Markov chain on S whose jump rate from i S to j S is given by β i, j /ν i. Denote by Q i, i S, the law of Markov chain y(t) starting from i. Notice that the probability measure µ on S, defined by µ (i) = ν i ν for i S (2.20) is the invariant measure for the Markov chain y(t). For f R S, the generator L y corresponding to the Markov chain y(t) is given by (L y f)(i) = j S β i, j Similar to (2.6), we define, for f, g R S, D y (f, g) = i S ν i f(j) f(i)] ; i S. ν i f(i)( L y g)(i) = β i, j f(j) f(i)] g(j) g(i)]. ν 2ν i, j S

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS We acknowledge here that a similar construction has been carried out in 30] at which a sharp asymptotics of the low-lying spectra of the metastable diffusions on σ-compact Riemannian manifold has been carried out for special form of the potential function U. 2.3.3. Main result. It is anticipated from (.5) that the time scale corresponding to the metastable transition is given by θ ɛ = e (H h)/ɛ. (2.2) Define the rescaled process { x ɛ (t) : t 0} as of x ɛ (t) x ɛ (t) = x ɛ (θ ɛ t). We now define the trace process y ɛ (t) of x ɛ (t) inside V. To this end, define the total time spent by ( x ɛ (s) : s 0, t]) in the valley V as T ɛ (t) = t 0 χ V ( x ɛ (s))ds ; t 0, where the function χ A : R d {0, } represents the characteristic function of A R d. Then, define S ɛ (t) = sup{s 0 : T ɛ (s) t} ; t 0, (2.22) which is the generalized inverse of the increasing function T ɛ ( ). Finally, the trace process of x ɛ (t) in the set V is defined by y ɛ (t) = x ɛ (S ɛ (t)) ; t 0. (2.23) One can readily verify that y ɛ (t) V for all t 0. Define a projection function Ψ : V S by Ψ(x) = i S i χ Vi (x). (2.24) Since y ɛ (t) is always in the set V, the following process is well-defined: y ɛ (t) = Ψ(y ɛ (t)) ; t 0. (2.25) The process y ɛ (t) represents the index of the valley in which the process y ɛ (t) is residing. Denote by P ɛ x and P ɛ x the law of processes x ɛ ( ) and x ɛ ( ) starting from x R d, respectively, and denote by E ɛ x and Êɛ x the corresponding expectations. For x V, denote by Q ɛ x the law of process y ɛ ( ) when the underlying diffusion process x ɛ (t) follows P ɛ x, i.e., Q ɛ x = P ɛ x Ψ. For any Borel probability measure π on V, we denote by P ɛ π the law of process x ɛ ( ) with initial distribution π. Then, define P ɛ π, E ɛ π, Êɛ π, and Q ɛ π similarly as above. We are now ready to state the main result of this article:

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 2 Theorem 2.3. For all i S and for any sequence of Borel probability measures (π ɛ ) ɛ>0 concentrated on V i, the sequence of probability laws (Q ɛ π ɛ ) ɛ>0 converges to Q i, the law of the Markov process (y(t)) t 0 starting from i, as ɛ tends to 0. We finish this section by explaining the organization of the rest of the paper. In Section 3, we construct a class of test functions which are useful in some of the computations we carry out in Section 4. In Section 4, we analyze a Poisson equation that will play a crucial role in the proof of both the tightness in Section 5, and the uniqueness of the limit point in Section 6. These two ingredients complete the proof of the convergence result stated in Theorem 2.3, as we will demonstrate in Section 6. 3. Test functions The purpose of the current section is to construct some test functions. We acknowledge that these functions are not new; similar functions have already been used in 7] and 23] in order to obtain sharp estimates on the capacity associated with pairs of valleys. Hence we refer to those papers for some proofs. We also remark here that the way we utilize these test functions will be entirely different from how they are used in 7] and 23]. We use these functions to estimate the value of a solution of our Poisson Problem in each valley (see Theorem 4.). 3.. Neighborhoods of saddle points. We now introduce some subsets of R d related to the inter-valley structure of U. For each saddle point σ S, denote by λ σ the unique negative eigenvalue of ( 2 U)(σ), and by λ σ 2, λ σ d the positive eigenvalues of ( 2 U)(σ). We choose unit eigenvectors v σ,... v σ d of ( 2 U)(σ) corresponding to the eigenvalues λ σ, λ σ 2,..., λ σ d. Remark 3.. Some care is needed as we select the direction of v σ. If σ S i, j for some i < j, we choose v σ to be directed toward the valley W j. Formally stating, we assume that σ + αv σ W j for all sufficiently small α > 0. We define δ = δ(ɛ) = A closed box Cσ ɛ around the saddle point σ is defined by { d Cσ ɛ = σ + α i v σ i : α Jδ ] Jδ, and α λ σ λ σ i i= ɛ log(/ɛ). (3.) 2Jδ λ σ i, ] 2Jδ λ σ i for 2 i d where J is a constant which is larger than 2d. We refer to Figure 3. for the illustration of the sets defined in this subsection. },

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 3 Figure 3.. Visualization of a neighborhood of σ S, 2. Notation 3.2. We summarize the notations used in the remaining of the paper. We regard J as a constant so that the terms like o ɛ (), O(δ 2 ) may depend on J as well. All the constants without subscript or superscript ɛ are independent of ɛ (and hence of δ) but may depend on J or the function U. Constants are usually denoted by c or C and different appearances may take different values. Decompose the boundary Cσ ɛ into { d + Cσ ɛ = σ + α i v σ i Cσ ɛ : α = Jδ }, λ σ C ɛ σ = { σ + i= d i= α i v σ i Cσ ɛ : α = Jδ } λ σ, and 0 C ɛ σ = C σ ɛ \ ( + C ɛ σ C ɛ σ). The following is a direct consequence of a Taylor expansion of U around σ, since U(σ) = H. Lemma 3.3. For all x 0 Cσ, ɛ we have that U(x) H + ( + o ɛ ()) 3J 2 δ 2. 2 Proof. This follows from the Taylor expansion of U at σ (see 23, Lemma 6.]). Now we define H ɛ = { x R d : U(x) H + J 2 δ 2}, and let Bσ ɛ = Cσ ɛ H ɛ for σ S. Decompose the boundary Bσ ɛ as + Bσ ɛ = Bσ ɛ + Cσ ɛ, Bσ ɛ = Bσ ɛ Cσ ɛ, and 0 Bσ ɛ = Bσ ɛ \ ( + Bσ ɛ Bσ) ɛ. (3.2)

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 4 Then, by Lemma 3.3, for small enough ɛ, we have U(x) = H + J 2 δ 2 for all x 0 B σ ɛ. (3.3) Thus, the set H ɛ \ σ S Bσ ɛ consists of K connected components W, ɛ, WK ɛ such that V i Wi ɛ for all i S. Furthermore, if σ S i, j with i < j, then by Remark 3. we have that B ɛ σ W ɛ i and + B ɛ σ W ɛ j. (3.4) We shall assume from now on that ɛ > 0 is small enough so that the construction above is in force. 3.2. Test function and basic estimates. For σ S, define a normalizing constant c σ ɛ by { } Jδ/ λ σ c σ λ σ ɛ = 2πɛ exp λσ 2ɛ t2 dt = + o ɛ (), (3.5) Jδ/ λ σ and define a function f ɛ σ( ) on B ɛ σ by, By (3.5) we have f σ ɛ (x) = (c σ ɛ ) (x σ) v σ Jδ/ λ σ { } λ σ 2πɛ exp λσ 2ɛ t2 dt ; x Bσ ɛ. (3.6) 0 if x fɛ σ Bσ ɛ (x) = if x + Bσ ɛ. (3.7) We next investigate two basic properties of fɛ σ in Lemmas 3.4 and 3.5 below. The statement and the proof of the first lemma is similar to those of 23, Lemma 8.7] (in terms of the notations of 23], our model corresponding to the special case M = I, where I denotes the identity matrix). Since the proof is much simpler for our specific case, and some of the computations carried out below will be useful later, we give the full proof of this lemma. Lemma 3.4. For all σ S, we have that θ ɛ (L ɛ fɛ σ )(x) ˆµ ɛ (x) dx = o ɛ (). (3.8) C ɛ σ Proof. To ease the notation, we may assume that σ = 0. For x Cσ, ɛ write α i := α i (x) = x v σ i so that x = d i= α i v σ i. By elementary computations, we can write (L ɛ fɛ σ )(x) = λ σ λ σ c σ ɛ 2πɛ e 2ɛ α 2 ( U(x) + λ σ x) v σ ]. (3.9) By the Taylor expansion of U around σ, we have d U(x) + λ σ x = ( 2 U)(σ) x + O(δ 2 ) + λ σ x = (α i λ i + α i λ ) σ v σ i + O(δ 2 ). (3.0) i=2

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 5 Since v σ v σ i = 0 for 2 i d, we conclude from (3.9) and (3.0) that { } (L ɛ fɛ σ )(x) = O(δ 2 ) ɛ 2 exp λσ 2ɛ α2. (3.) Therefore, the left-hand side of (3.8) is bounded above by { O(δ 2 ) θ ɛ ɛ 2 Z ɛ exp U(x) + (/2)λσ α 2 } dx Cσ ɛ ɛ { = O(δ 2 ) θ ɛ ɛ 2 Z ɛ e H ɛ exp } d λ σ i αi 2 dx, (3.2) 2ɛ where the identity follows from the second-order Taylor expansion of U around σ and the fact that O(δ 3 /ɛ) = o ɛ (). By the change of variables, the last integral can be bounded as 2Jδ 2Jδ/ λ σ { 2 2Jδ/ λ σ λ σ d exp } d λ σ 2Jδ/ λ σ 2 2Jδ/ λ σ i αi 2 dα 2 dα d 2ɛ d i=2 ɛ d 2Jδ { 2 exp } d λ σ λ σ i yi 2 dy 2 dy d = Cɛ d 2 δ. 2 Inserting this into (3.2) finishes the proof. Lemma 3.5. For all σ S, we have θ ɛ ɛ fɛ σ (x) 2ˆµ ɛ (x)dx = ( + o ɛ ()) ν ω σ. Proof. See 23, Lemma 8.4]. B ɛ σ For q = (q(i) : i S) R S, we now define a test function Fɛ q : R d R. This test function is used in Sections 4 and 5. In particular, in Section 4, the vector q may depend on ɛ. For this reason, we will keep track of the dependence of the constants on q in the inequalities that appear in this section. We start by defining a real-valued function F ɛ q on H ɛ. This function is defined by q(i) if x W F ɛ q i ɛ, i S, (x) = (3.3) q(i) + (q(j) q(i))fɛ σ (x) if x Bσ ɛ, σ S i, j with i < j. C ɛ σ i=2 i=2 By (3.7), the function F q ɛ is continuous on H ɛ. Evidently, F q ɛ L (H ɛ ) q := max{ q(i) : i S}. (3.4) Furthermore, since f σ ɛ C ɛ /2, we deduce that the function F q ɛ satisfies F q ɛ L (H ɛ ) C ɛ /2 max{ q(i) q(j) : i, j S} C ɛ /2 D x (q, q)] /2. (3.5) Here we stress that the constant C is independent of q.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 6 Let K be a compact set containing H ɛ for all ɛ (0, 2]. For instance, one can select K = H a for any a > 2. Then, for ɛ (0, ], by (3.4) and (3.5), there exists a continuous extension Fɛ q : R d R of F ɛ q satisfying supp F q ɛ K, F q ɛ L (R d ) q, and F q ɛ L (R d ) C ɛ /2 D x (q, q)] /2. (3.6) Suppose from now on that ɛ is not larger than so that we can define Fɛ q satisfying (3.6). Note that the Dirichlet form D ɛ ( ) corresponding to the process x ɛ (t) is given by D ɛ (f) = ɛ f(x) 2ˆµ ɛ (x)dx ; f Hloc(R d ). (3.7) R d Lemma 3.6. For all q = (q(i) : i S) R K, we have that θ ɛ D ɛ (F q ɛ ) = ( + o ɛ ()) ν D x (q, q). Proof. It is immediate from Lemma 3.5 that θ ɛ ɛ F q H ɛ ɛ (x) 2ˆµ ɛ (x)dx = ( + o ɛ ()) ν D x (q, q). Thus, it suffices to show that θ ɛ ɛ F q (H ɛ ) c ɛ (x) 2ˆµ ɛ (x)dx = o ɛ () D x (q, q). (3.8) Since Fɛ q 0 on K c we can replace the domain of integration in (3.8) with K \ H ɛ. Then, by (3.6), (2.2), and by the fact that U(x) H + J 2 δ 2 for x / H ɛ, θ ɛ ɛ F q K\H ɛ ɛ (x) 2ˆµ ɛ (x)dx m d (K) D x (q, q) θ ɛ Zɛ e H/ɛ ɛ J 2 C D x (q, q) ɛ J 2 (d/2), where m d ( ) is the Lebesgue measure on R d. This completes the proof since J > 2d. 4. A Poisson equation Define a ɛ = (a ɛ (i) : i S ) R S by By (2.4), we have a ɛ (i) = Z ɛ (2πɛ) d/2 e h/ɛ ν i µ ɛ (V i ) ; i S. a ɛ (i) = + o ɛ () for all i S. (4.) The main result of the current section can be stated as following theorem. Theorem 4.. For all f : S R, there exists a bounded function φ ɛ = φ f ɛ : R d R satisfying all the following properties: () φ ɛ W 2, p loc (R d ) for all p.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 7 (2) φ ɛ satisfies the equation (3) For all i S, it holds that θ ɛ L ɛ φ ɛ = i S a ɛ (i) (L y f)(i) χ Vi. (4.2) lim sup φ ɛ (x) f(i) = 0. (4.3) ɛ x V i Remark 4.2. In 3, Theorem 5.3], a similar analysis has been carried out for a slightly different situation. In 3], we treat a Poisson equation of the form (4.2) for a different right-hand side. The form of the right-hand we have chosen in (4.2) enables us to use the Poincare s inequality (see subsection 4.3 below). Furthermore, the proof therein relies on the capacity estimates between metastable valleys. Our proof though does not use any capacity estimates and has a chance to be applicable to the non-reversible variant of our model. This fact deserves to be highlighted here once more. Note that the function L y f : S R satisfies i S (L y f)(i) µ (i) = 0 (4.4) since µ ( ) defined in (2.20) is the invariant measure for the Markov chain y(t). Let e i R S, i S, be the ith unit vector defined by For i, j S, let S i,j be the collection of f R S e i (j) = {i = j} ; j S. (4.5) satisfying L y f = µ (i) e i µ (j) e j = ν ν i e i ν ν j e j. Remark that the selection S i,j is consistent with the condition (4.4) for L y f. It is immediate from the irreducibility of the Markov chain y(t) that i, j S S i,j spans whole space R S. Note that for f 0, it suffices to select φ ɛ 0 and thus it suffices to consider non-zero f. Therefore, by the linearity of the statement of Theorem 4. with respect to f, it suffices to prove the theorem for f S i,j only. To simplify notations, let us assume that, 2 S, and assume that f S,2, i.e, L y f = ν e ν e 2. (4.6) ν ν 2 Now we fix such f throughout the remaining part of the current section. We note that (L y f)(i) = 0 for all i, 2. Our plan is to select the test function φ ɛ that appeared in Theorem 4. as a minimizer of a functional I ɛ ( ) that will be defined in Section 4.. More precisely, we first take a minimizer ψ ɛ of that functional satisfies a certain symmetry condition (see (4.7) below) and analyze its

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 8 property thoroughly in Sections 4.2-4.5. Then, we shall prove that a translation of ψ ɛ, which is also a minimizer of I ɛ ( ), satisfies all the requirements of Theorem 4. in Section 4.6. 4.. A Variational principle. Recall from (3.7) the functional D ɛ ( ) and define a functional I ɛ ( ) on Hloc(R d ) as I ɛ (φ) = 2 θ ɛ D ɛ (φ) + a ɛ (i) (L y f)(i) φ(x) ˆµ ɛ (x)dx. (4.7) V i i=, 2 Denote by ψ ɛ a minimizer of I ɛ ( ). Then, it is well-known that ψ ɛ W 2, p loc (R d ) for all p and furthermore ψ ɛ satisfies (4.2), i.e., θ ɛ L ɛ ψ ɛ = i S a ɛ (i) (L y f)(i) χ Vi. (4.8) Our purpose in the remaining part is to find a constant c ɛ such that φ ɛ = ψ ɛ + c ɛ satisfies (4.3). Note that this φ ɛ also satisfies (4.2) and hence, this finishes the proof. Write p ɛ (i) = a ɛ (i) (L y f)(i) ψ ɛ (x) ˆµ ɛ (x)dx ; i S, (4.9) V i so that p ɛ (i) = 0 for all i, 2 because of (4.6). Note that if we add a constant a to ψ ɛ, then the value of p ɛ (i) for i =, 2 changes to p ɛ(i), with p ɛ() = p ɛ () + ab, p ɛ(2) = p ɛ (2) ab, p ɛ (i) = 0, for i, 2, where b = Zɛ (2πɛ) d/2 e h/ɛ ν. Hence, by adding a constant a to ψ ɛ if necessary, we can assume without loss of generality that p ɛ () = p ɛ (2). Set λ ɛ := p ɛ () = p ɛ (2). (4.0) We now multiply both sides of the equation (4.8) by ψ ɛ and integrate with respect to the invariant measure µ ɛ to deduce θ ɛ D ɛ (ψ ɛ ) = 2λ ɛ. (4.) Consequently, λ ɛ > 0 and furthermore, by (4.7), (4.0), and (4.) we obtain I ɛ (ψ ɛ ) = λ ɛ. (4.2) 4.2. Lower bound on λ ɛ. In this subsection, we prove a rough lower bound for λ ɛ in Proposition 4.4. We start by providing some relations between Dirichlet forms D x (, ) and D y (, ). For u : S R and u : S R, we say that u is an extension of u if u (i) = u(i) for all i S. For u : S R, we define the harmonic extension ũ : S R of u as the extension of u satisfying (L x ũ)(i) = 0 for all i S \ S. (4.3)

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 9 The following lemma will be used in several instances in the remaining part of the article. Lemma 4.3. For all u, v : S R, the following properties hold. () For harmonic extension ũ and ṽ of u and v, respectively, we have D x (ũ, ṽ) = ν D y (u, v). (4.4) (2) For any extensions v, v 2 : S R of v, we have D x (ũ, v ) = D x (ũ, v 2 ). (4.5) Proof. For part (), recall the function e i, i S, that was defined in (4.5). Since both D x (, ) and D y (, ) are bi-linear forms, it suffices to check (4.4) for (u, v) = (e i, e j ) for i S and j S. By (2.8), the harmonic extension of e i, namely ẽ i : S R, is the equilibrium potential between {i} and S \ {i}, with respect to the process x( ), and hence we have D x (ẽ i, ẽ i ) = cap x ({i}, S \ {i}). (4.6) Similarly, for i j S, the function ẽ i + ẽ j : S R is the equilibrium potential between {i, j} and S \ {i, j}, with respect to the process x( ), and therefore it holds D x (ẽ i + ẽ j, ẽ i + ẽ j ) = cap x ({i, j}, S \ ({i} {j})). (4.7) By (4.6), (4.7) and the bi-linearity of D x, we have D x (ẽ i, ẽ j ) = β i, j ; i j S. (4.8) It also follows from the definition D y (e i, e j ) = 2ν β i, j (0 )( 0) + 2ν β j, i ( 0)(0 ) = β i, j ν. (4.9) From (4.8) and (4.9), we deduce (4.4) for (u, v) = (e i, e j ) with i j. Now, we turn to the case (u, v) = (e i, e i ) for some i S. For this case, since j S e j = on S, it is immediate that j S ẽ j = on S. Therefore, ( D x ẽ i, ) = 0. j S ẽ j By this equation, (4.8), and the bi-linearity of D x, we obtain D x (ẽ i, ẽ i ) = D x (ẽ i, ẽ j ) = j S :j i j S :j i β i, j. This finishes the proof for part () since by the direct computation we can verify that D y (e i, e i ) = ν j S :j i β i, j.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 20 For part (2), by the definition (2.6) of D x, we can write D x (ũ, v v 2 ) = i S µ(i) ( L x ũ)(i) (v (i) v 2 (i)). The last summation is 0 since (L x ũ)(i) = 0 for i S \ S and v (i) v 2 (i) = 0 for i S. This completes the proof. Now we are ready to establish an a priori lower bound on λ ɛ. We remark that a sharp asymptotic of λ ɛ will be given in Section 4.6. Recall that we have fixed f as in (4.6). Proposition 4.4. We have λ ɛ (/2)D y (f, f) + o ɛ (). Proof. Recall from Section 3.2 the test function F f ɛ : R d R where f is the harmonic extension of f defined above. By (2.3), (2.20) and Lemma 3.6, I ɛ (F f ɛ ) = 2 ν D x ( f, f) + µ (i) (L y f)(i) f(i) + o ɛ (). i S By Lemma 4.3, we can conclude that the right-hand side of the previous display is equal to 2 D y(f, f) i S µ (i) ( L y f)(i) f(i) + o ɛ () = 2 D y(f, f) + o ɛ (). The proof is completed by recalling that I ɛ (F f ɛ ) I ɛ (ψ ɛ ) = λ ɛ. 4.3. L 2 -estimates based on Poincaré s inequality. For small enough ɛ > 0 the set {x : U(x) H 4 J 2 δ 2 } (4.20) consists of K connected components. For such ɛ and i S, we write V () i := V () i, ɛ a connected component of (4.20) containing V i. Similarly, define V (2) i := V (2) i, ɛ as a connected component of the set {x : U(x) H 8 } J 2 δ 2 (4.2) containing V i for sufficiently small ɛ > 0. We shall assume that ɛ > 0 is small enough so that the descriptions above hold. Then, for i S, we have V i V () i V (2) i W i. It is not hard to show dist( V () i, V (2) i ) = cjδ + o(δ) (4.22) for some constant c > 0 where this distance is achieved around a saddle point σ S that belongs to W i. The constant c depends only on the Hessian of U at that saddle point.

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 2 For i S, define q ɛ (i) = m d (V () i ) V () i ψ ɛ (x)dx and q ɛ (i) = where m d denotes the Lebesgue measure of R d. m d (V (2) i ) V (2) i ψ ɛ (x)dx, Proposition 4.5. There exists a constant C > 0 such that the following estimate holds for all i S: ψ ɛ q ɛ (i) L 2 (V (2) i ) C ɛj 2 /6 λ ɛ. Remark 4.6. Here and elsewhere in this paper, L p norms are computed with respect to the Lebesgue measure of R d. Proof. By Poincaré s inequality, the definition of V (2) i, (4.), and (2.2), ψ ɛ (x) q ɛ (i) 2 dx C ψ ɛ (x) 2 dx Ce (H (/8)J 2 δ 2 )/ɛ ψ ɛ (x) 2 e U(x)/ɛ dx V (2) i V (2) i Ce (H (/8)J 2 δ 2 )/ɛ Z ɛ ɛ D ɛ (ψ ɛ ) Cɛ J 2 /8+d/2 λ ɛ Cɛ J 2 /8 λ ɛ, From this and Cauchy-Schwarz s inequality we deduce, q ɛ (i) q ɛ (i) ψ m d (V () ɛ (x) q ɛ (i) dx C i ) V () i Combining the above two bounds yields V (2) i V (2) i ψ ɛ (x) q ɛ (i) 2 dx Cɛ J 2 /8 λ ɛ. V (2) i ψ ɛ (x) q ɛ (i) dx C ɛ J 2 /6 λ /2 ɛ. Thus, the proposition follows immediately from this estimate and Proposition 4.4. 4.4. L -estimates on valleys. In this subsection, we use the interior elliptic regularity techniques and a suitable bootstrapping argument to reinforce the L 2 -estimate in V (2) i that was obtained in Proposition 4.5 to L -estimate in the smaller set V () i. This type of argument has been introduced originally in 0], and is suitably modified to yield a desired L -estimate. We start by a lemma. Let us write, for ɛ > 0, Then, by (2.2), we have η ɛ = o ɛ (). Lemma 4.7. We have Z ɛ = ( + η ɛ ) (2πɛ) d/2 e h/ɛ ν. (4.23) ( + η ɛ )p ɛ () q ɛ () ψ ɛ q ɛ () L (V ) and ( + η ɛ )p ɛ (2) + q ɛ (2) ψ ɛ q ɛ (2) L (V 2 ).

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 22 Proof. By (4.6), (4.9), and (4.23) we can write Therefore, we have p ɛ () = + η ɛ µ ɛ (V ) ( + η ɛ )p ɛ () q ɛ () V ψ ɛ (x) µ ɛ (dx). ψ ɛ (x) q ɛ () µ ɛ (dx). µ ɛ (V ) V Thus, the estimate for p ɛ () follows. The proof for p ɛ (2) is identical. Proposition 4.8. For all i S, we have Proof. On V (2) i ψ ɛ q ɛ (i) L (V () i ) = o ɛ()λ ɛ., i S, the function ψ ɛ satisfies the equation θɛ a ɛ (i) g(i) χ Vi if i S L ɛ ψ ɛ = 0 otherwise, where g = L y f. For both cases, we can rewrite the equation as ɛ (ψ ɛ q ɛ (i)) = (ψ ɛ q ɛ (i)) U] (ψ ɛ q ɛ (i)) U + C θ ɛ χ Vi for some constant C 0. Then, by the local interior elliptic estimate 5, Theorem 8.7] with R = cδ for small enough constant c > 0 (we are allowed to do this because of (4.22)), we obtain that, for any p > d and for some constant C p > 0, ψ ɛ q ɛ (i) C p L (V () i ) ɛδ ψ d/2 ɛ q ɛ (i) + C p L 2 (V (2) i ) ɛ δ (d/p) ψ ɛ q ɛ (i) + o L p (V (2) i ) ɛ(). Let us select p = 2d for the sake of definiteness and let us write ψ ɛ := ψ ɛ L (R d ) for the simplicity of notation. Then, by Propositions 4.4, 4.5, Hölder s inequality, and the trivial fact that q ɛ (i) ψ ɛ, we obtain ψ ɛ q ɛ (i) o L (V () i ) ɛ()λ ɛ + C ɛ δ/2 ψ ɛ q ɛ (i) /d ψ L 2 (V (2) ɛ q ɛ (i) (/d) i ) L (V (2) i ) ] = o ɛ () λ ɛ + λ /d ɛ ψ ɛ q ɛ (i) (/d), L (V (2) i ) o ɛ () λ ɛ + λ /d ɛ ψ ɛ (/d) ]. o ɛ () λ ɛ + ψ ɛ ]. Now we present a bootstrapping argument. Write (4.24) m ɛ (i) = ψ ɛ L (V () i ) for i S and ξ = ξ ɛ = max{m ɛ (), m ɛ (2)}

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 23 Then, it holds that ψ ɛ = ξ, since otherwise I (u ɛ ψ ɛ ) < I (ψ ɛ ) where ψ ɛ if t ξ, u ɛ (t) = t if t < ψ ɛ, ψ ɛ if t ξ. Thus we can write ψ ɛ = m ɛ (k) where k is either or 2. Then, ψ ɛ = m ɛ (k) = ψ ɛ L (V () k ) ψ ɛ q ɛ (k) L (V () k ) + q ɛ(k). (4.25) By Lemma 4.7 and (4.0), we have that By combining (4.25) and (4.26), we obtain q ɛ (k) ( + o ɛ ())λ ɛ + ψ ɛ q ɛ (k) L (V () k ). (4.26) ψ ɛ ( + o ɛ ())λ ɛ + 2 ψ ɛ q ɛ (k) L (V () k ). (4.27) Inserting (4.27) into (4.24) with i = k yields By (4.27) and (4.28), we have ψ ɛ q ɛ (k) L (V () k ) o ɛ()λ ɛ. (4.28) ψ ɛ ( + o ɛ ())λ ɛ. (4.29) Finally, inserting this into (4.24) finishes the proof. 4.5. Characterization of q ɛ on deepest valleys. In the previous subsection, we proved that if the constant λ ɛ is bounded above, then for every i S, the function ψ ɛ (x) q ɛ (i) is almost 0 in each valley V () i. This boundedness of λ ɛ will be established later in (4.55). In this sub section, we shall prove that, for each i S, the value q ɛ (i) is close to f(i) up to a constant c ɛ that does not depend on i. The following is a formulation of this result. Proposition 4.9. For all small enough ɛ > 0, there exists a constant c ɛ such that, for all i S, q ɛ (i) f(i) c ɛ = o ɛ () λ ɛ. Indeed, this characterization of q ɛ is the main innovation of the current work. We shall use the test function constructed in Section 3.2 in a novel manner to establish Proposition 4.9. For each ɛ > 0, we consider a function h ɛ : S R and write h ɛ : S R for its harmonic extension as was introduced in Section 4.2. Our selection for h ɛ will be revealed at the last stage of the proof (cf. (4.5)). To simplify the notation, we write F ɛ := F h ɛ ɛ, (4.30)

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 24 where the notation F h ɛ ɛ was introduced in Section 3.2. We denote by h ɛ and h ɛ the maximum of h ɛ and h ɛ on S and S, respectively. Using a discrete Maximum Principle, one can readily verify that h ɛ = h ɛ. Since ψ ɛ satisfies the equation (4.2) and since F ɛ h ɛ (i) = h ɛ (i) on V i, i S, we have the identity θ ɛ F ɛ (x) (L ɛ ψ ɛ )(x) µ ɛ (dx) = h ɛ (i) (L y f)(i) a ɛ (i) µ ɛ (V i ). (4.3) R d i S In order to prove Proposition 4.9, we compute two sides of (4.3) separately. From the comparison of these computations, we obtain the characterization described in Proposition 4.9. The right-hand side of (4.3) is relatively easy to compute. By Proposition 2.2 and (4.), we have a ɛ (i) µ ɛ (V i ) = ( + o ɛ ())(ν i /ν ) and thus we can rewrite the right-hand side of (4.3) as i S h ɛ (i) (L y f)(i) a ɛ (i) µ ɛ (V i ) = D y (h ɛ, f) + o ɛ () h ɛ. (4.32) The main difficulty of the proof lies on the computation of the left-hand side of (4.3). We carry out this computation in several lemmas below. Lemma 4.0. With the notations above, it holds that θ ɛ F ɛ (x) (L ɛ ψ ɛ )(x) µ ɛ (dx) R d = θ ɛ ɛ ( F ɛ ψ ɛ )(x) µ ɛ (dx) + o ɛ () λ /2 σ S Bσ ɛ ɛ h ɛ. (4.33) Proof. By the divergence theorem, the left-hand side of (4.33) is equal to θ ɛ ɛ ( F ɛ ψ ɛ )(x) µ ɛ (dx). (4.34) R d By the definition of F ɛ = F h ɛ ɛ, we have that Since it suffices to show that θ ɛ ɛ F ɛ 0 in W ɛ i for all i S. (4.35) ( ) H ɛ \ Wi ɛ = Bσ ɛ, i S σ S (H ɛ ) c ( F ɛ ψ ɛ )(x) µ ɛ (dx) = o ɛ () λ /2 ɛ h ɛ. (4.36)

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 25 By the Cauchy-Schwarz inequality, the square of the left-hand side of (4.36) is bounded above by ( ) ( θ ɛ ɛ F ɛ (x) 2 2 ) µ ɛ (dx) ψ ɛ (x) 2 2 µ ɛ (dx). (H ɛ ) c (H ɛ ) c By (3.8) and (4.), the last expression is o ɛ () λ /2 ɛ h ɛ. Thus, (4.36) follows. Recall the function fɛ σ from (3.6). The estimate below corresponds to that of each summand on the right-hand side of (4.33). Lemma 4.. For i, j S with i < j and for σ W i, j, it holds that θ ɛ ɛ ( fɛ σ ψ ɛ )(x) µ ɛ (dx) = ω σ q ɛ (j) q ɛ (i)] + o ɛ () λ ɛ. (4.37) ν B ɛ σ Proof. Recall the decomposition of boundary of Bσ ɛ from (3.2). By applying the divergence theorem to the left-hand side of (4.37), we can write θ ɛ ɛ ( fɛ σ ψ ɛ )(x) µ ɛ (dx) = A + A 2 + A 3 + A 4, (4.38) where B ɛ σ A = θ ɛ A 2 = θ ɛ ɛ A 3 = θ ɛ ɛ A 4 = θ ɛ ɛ B ɛ σ 0 Bσ ɛ + Bσ ɛ B ɛ σ (L ɛ f σ ɛ )(x) ψ ɛ (x) µ ɛ (dx), ( f σ ɛ )(x) n B ɛ σ] ψɛ (x) ˆµ ɛ (x) σ(dx), ( f σ ɛ )(x) n B ɛ σ] ψɛ (x) ˆµ ɛ (x) σ(dx), ( f σ ɛ )(x) n B ɛ σ] ψɛ (x) ˆµ ɛ (x) σ(dx), where the vector n B σ ɛ denotes the outward unit normal vector to the domain Bσ, ɛ and σ(dx) represents the surface integral. We now compute these four expressions. Without loss of generality, we may assume that σ = 0. First, we claim that A and A 2 are negligible in the sense that A = o ɛ () λ ɛ and A 2 = o ɛ () λ ɛ. (4.39) The estimate for A is immediate from Lemma 3.4 and (4.29). For A 2, notice first that by the definition (3.6) of fɛ σ, we can write ( fɛ σ )(x) = λ σ λ σ cɛ σ 2πɛ e 2ɛ (x v σ )2 v σ. (4.40) By inserting this into A 2, and applying (3.3), (3.5), and (4.29), we are able to deduce A 2 C θ ɛ ɛ /2 λ ɛ Z ɛ e (H+J 2 δ 2 )/ɛ δ d = o ɛ () λ ɛ. (4.4)

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 26 Here we have used trivial facts such as v σ n B ɛ σ, e λ σ 2ɛ (x vσ )2, and that the σ-measure of 0 Bσ ɛ is of order δ d. Next, we shall prove that A 3 = ω σ ν q ɛ (j) + o ɛ () λ ɛ and A 4 = ω σ ν q ɛ (i) + o ɛ () λ ɛ. (4.42) Since the proofs for these two estimates are identical, we only focus on the former. Note that the surface + Bσ ɛ is flat, and hence the outward normal vector n B ɛ σ is merely equal to v σ. Hence, by (2.2), (3.5) and (4.40) we can rewrite A 3 as λ σ A 3 = ( + o ɛ ())θ ɛ ɛ e λ σ 2πɛ (2πɛ) d/2 e h/ɛ 2ɛ (x vσ )2 U(x) ɛ ψ ɛ (x) σ(dx). ν + B ɛ σ By the Taylor expansion, we have U(x) = H + ( ) d λ σ (x v σ ) 2 + λ σ i (x v σ i ) 2 + o(δ 2 ). 2 i=2 Inserting this into the penultimate display, we can reorganize the right-hand side so that λ σ A 3 = ( + o ɛ ()) d e 2ɛ i=2 λσ i (x vσ i )2 ψ 2πν (2πɛ)(d )/2 ɛ (x) σ(dx). (4.43) + B ɛ σ Now we introduce a change of variable to estimate the last integral. Define a map gɛ σ : R d R d as, for y = (y 2,, y d ) R d, gɛ σ (y) = Jδ d ɛ v σ + y k v σ k, (4.44) λ λ k k=2 (recall σ = 0). Notice here that + B ɛ σ g σ ɛ (R d ). Write D ɛ σ = (g σ ɛ ) ( + B ɛ σ) R d. Then, by a change of variable x = gɛ σ (y), we can rewrite (4.43) as A 3 = ( + o ɛ ()) λ σ ν dk=2 2π λ σ e 2 y 2 ψ Dσ ɛ (2π)(d )/2 ɛ (g σ (y)) dy. (4.45) k Now we analyze Dσ. ɛ For y Dσ, ɛ we note that g σ (y) σ = O(δ) and thus by the Taylor expansion, U(g σ (y)) = H 2 J 2 δ 2 + ɛ d yk 2 + o(δ 2 ). 2 k=2

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 27 Denote by Q d (r) the (d )-dimensional ball of radius r > 0, centered at origin. Then, for y Q d ( J 2 log ), by the previous display we have that ɛ U(g σ (y)) H 2 J 2 δ 2 + 8 J 2 δ 2 + o(δ 2 ) < H 4 J 2 δ 2 for all sufficiently small ɛ > 0. For such ɛ, we can conclude that y + Bσ ɛ V () j by (4.20), and therefore by Proposition 4.8, we have that ψ ɛ (g σ (y)) = q ɛ (j) + o ɛ ()λ ɛ. Consequently, we have 2 y 2 ψ ɛ (g σ (y)) dy = ( + o ɛ ())q ɛ (j) + o ɛ ()λ ɛ, Q d ( J 2 (2π) log ɛ ) e (d )/2 because the integral of the probability density function of the (d )-dimensional standard normal distribution on Q d ( J 2 log ) is + o ɛ ɛ(). On the other hand, by (4.29), e 2 y 2 ψ Dσ\Q ɛ d ( J log 2 ɛ ) (2π)(d )/2 ɛ (g σ (y)) dy ψ ɛ e 2 y 2 dy = o log (2π) ɛ )c (d )/2 ɛ ()λ ɛ. Q d ( J 2 By the two last centered displays and by the definition of ω σ, we can rewrite (4.45) as A 3 = ω σ ν ( + o ɛ ())q ɛ (j) + o ɛ ()λ ɛ ]. The proof of (4.42) is completed by recalling that the fact that by (4.29) q ɛ (i) ψ ɛ ( + o ɛ ())λ ɛ. By combining (4.38), (4.39), and (4.42), we complete the proof. Lemma 4.2. Assume that f 0. It then holds, θ ɛ R d F ɛ (x) (L ɛ ψ ɛ )(x) ˆµ ɛ (x)dx = ν D x ( h ɛ, q ɛ ) + o ɛ () λ ɛ h ɛ. (4.46) Proof. By Lemma 4.0 and the definition (4.30) (cf. (3.3)) of F ɛ we can rewrite the left-hand side as θ ɛ ɛ ( h ɛ (j) h ɛ (i)) ( fɛ σ ψ ɛ )(x) ˆµ ɛ (x)dx + o ɛ () λ /2 ɛ h ɛ. i<j<k σ W i, j From this and Lemma 4., we deduce that the left-hand side of (4.46) equals to B ɛ σ ν D x ( h ɛ, q ɛ ) + o ɛ () λ /2 ɛ h ɛ + o ɛ () λ ɛ h ɛ. (4.47)

SCALING LIMIT OF SMALL RANDOM PERTURBATION OF DYNAMICAL SYSTEMS 28 Therefore, the proof is completed because by Maximum Principle h ɛ = h ɛ, and λ ɛ is uniformly positive whenever f 0 by Proposition 4.4. Now we are ready to prove Proposition 4.9. Proof of Proposition 4.9. The proof of the Proposition is trivial when f = 0, because we may choose ψ ɛ = c ɛ = 0. From now on, we assume that f 0. By (4.32), Proposition 4.4, and Lemma 4.2, we have D y (h ɛ, f) = ν D x ( h ɛ, q ɛ ) + o ɛ () λ ɛ h ɛ. (4.48) Denote by qɛ R S the restriction of q ɛ on S, i.e., qɛ (i) = q ɛ (i) for all i S, and denote by q ɛ R S the harmonic extension of qɛ to S. Note that q ɛ and q ɛ are two different extensions of qɛ R S to S. Thus, by Lemma 4.3 we have Hence, by (4.48) and (4.49), we obtain D x ( h ɛ, q ɛ ) = D x ( h ɛ, q ɛ ) = ν D y (h ɛ, q ɛ ). (4.49) D y (h ɛ, q ɛ f) = o ɛ () λ ɛ h ɛ. (4.50) Finally, let us define the test function h ɛ R S as h ɛ (i) := q ɛ (i) f(i) c ɛ for all i S, (4.5) where c ɛ = q ɛ (i) f(i)]. (4.52) S i S By inserting this test function h ɛ in (4.50), we obtain D y (h ɛ, h ɛ ) = o ɛ () λ ɛ h ɛ. (4.53) Write β = min 2ν β i, j > 0. i S, j S, i j Then, we have D y (h ɛ, h ɛ ) β (h ɛ (i) h ɛ (j)) 2 = 2β S hɛ 2 2β S 2 h ɛ 2, (4.54) i, j S i S where the identity follows from the fact that i S h ɛ = 0 thanks to our selection (4.5) and (4.53) of h ɛ. By (4.53) and (4.54), we obtain h ɛ o ɛ () λ ɛ. This completes the proof since h ɛ (i) = q ɛ (i) f(i) c ɛ for i S.