Heat kernels and function theory on metric measure spaces

Similar documents
Heat kernels on metric spaces with doubling measure

Heat kernels on metric measure spaces

HEAT KERNELS ON METRIC MEASURE SPACES AND AN APPLICATION TO SEMILINEAR ELLIPTIC EQUATIONS

Heat kernels on metric spaces with doubling measure

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

On positive solutions of semi-linear elliptic inequalities on Riemannian manifolds

Pseudo-Poincaré Inequalities and Applications to Sobolev Inequalities

Stochastic completeness of Markov processes

Heat kernels of some Schrödinger operators

COMPARISON INEQUALITIES FOR HEAT SEMIGROUPS AND HEAT KERNELS ON METRIC MEASURE SPACES

OBTAINING UPPER BOUNDS OF HEAT KERNELS FROM LOWER BOUNDS

Heat Kernel and Analysis on Manifolds Excerpt with Exercises. Alexander Grigor yan

Laplace s Equation. Chapter Mean Value Formulas

DOMAINS OF DIRICHLET FORMS AND EFFECTIVE RESISTANCE ESTIMATES ON P.C.F. FRACTALS. 1. Introduction

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Local and Non-Local Dirichlet Forms on the Sierpiński Carpet

BOUNDARY VALUE PROBLEMS ON A HALF SIERPINSKI GASKET

ON PARABOLIC HARNACK INEQUALITY

The heat kernel meets Approximation theory. theory in Dirichlet spaces

i=1 α i. Given an m-times continuously

Nonlinear Energy Forms and Lipschitz Spaces on the Koch Curve

Heat Kernels on Manifolds, Graphs and Fractals

Random walks on ultra-metric spaces

A local time scaling exponent for compact metric spaces

LECTURE 15: COMPLETENESS AND CONVEXITY

The Dirichlet s P rinciple. In this lecture we discuss an alternative formulation of the Dirichlet problem for the Laplace equation:

A class of domains with fractal boundaries: Functions spaces and numerical methods

Lecture 5 - Hausdorff and Gromov-Hausdorff Distance

Harnack inequalities and Gaussian estimates for random walks on metric measure spaces. Mathav Murugan Laurent Saloff-Coste

On Hardy Littlewood inequality for Brownian motion on Riemannian manifolds

Some Background Material

SPECTRAL PROPERTIES OF THE LAPLACIAN ON BOUNDED DOMAINS

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

Sobolev Spaces. Chapter Hölder spaces

Stochastic Processes on Fractals. Takashi Kumagai (RIMS, Kyoto University, Japan)

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

HAUSDORFF DIMENSION AND ITS APPLICATIONS

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

Lectures on. Sobolev Spaces. S. Kesavan The Institute of Mathematical Sciences, Chennai.

Gaussian Random Fields: Geometric Properties and Extremes

RIEMANNIAN GEOMETRY COMPACT METRIC SPACES. Jean BELLISSARD 1. Collaboration:

l(y j ) = 0 for all y j (1)

Chapter 2 Metric Spaces

Analysis in weighted spaces : preliminary version

Spectral Gap and Concentration for Some Spherically Symmetric Probability Measures

Function spaces and stochastic processes on fractals I. Takashi Kumagai. (RIMS, Kyoto University, Japan)

CMS winter meeting 2008, Ottawa. The heat kernel on connected sums

Wave equation on manifolds and finite speed of propagation

CAPACITIES ON METRIC SPACES

Math The Laplacian. 1 Green s Identities, Fundamental Solution

Part III. 10 Topological Space Basics. Topological Spaces

R is a metric, called a resistance metric. By (5.1), the following key inequality holds.

Nontangential limits and Fatou-type theorems on post-critically finite self-similar sets

Wiener Measure and Brownian Motion

LAPLACIANS COMPACT METRIC SPACES. Sponsoring. Jean BELLISSARD a. Collaboration:

Metric Spaces and Topology

The small ball property in Banach spaces (quantitative results)

A new proof of Gromov s theorem on groups of polynomial growth

Basic Properties of Metric and Normed Spaces

Overview of normed linear spaces

Estimating the number of negative eigenvalues of Schrödinger operators

Some remarks on the elliptic Harnack inequality

Mid Term-1 : Practice problems

Introduction to Real Analysis Alternative Chapter 1

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Exercise Solutions to Functional Analysis

u xx + u yy = 0. (5.1)

Existence and Uniqueness

08a. Operators on Hilbert spaces. 1. Boundedness, continuity, operator norms

SELF-ADJOINTNESS OF SCHRÖDINGER-TYPE OPERATORS WITH SINGULAR POTENTIALS ON MANIFOLDS OF BOUNDED GEOMETRY

JUHA KINNUNEN. Harmonic Analysis

NECESSARY CONDITIONS FOR WEIGHTED POINTWISE HARDY INEQUALITIES

Probability and Measure

Lecture No 2 Degenerate Diffusion Free boundary problems

Hausdorff Measure. Jimmy Briggs and Tim Tyree. December 3, 2016

CHAPTER VIII HILBERT SPACES

Riemann integral and volume are generalized to unbounded functions and sets. is an admissible set, and its volume is a Riemann integral, 1l E,

Empirical Processes: General Weak Convergence Theory

A LOWER BOUND ON BLOWUP RATES FOR THE 3D INCOMPRESSIBLE EULER EQUATION AND A SINGLE EXPONENTIAL BEALE-KATO-MAJDA ESTIMATE. 1.

Traces, extensions and co-normal derivatives for elliptic systems on Lipschitz domains

The oblique derivative problem for general elliptic systems in Lipschitz domains

NONCOMMUTATIVE. GEOMETRY of FRACTALS

DISTRIBUTION THEORY ON P.C.F. FRACTALS

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

Logarithmic Sobolev Inequalities

Topological vectorspaces

1 Cheeger differentiation

Lecture Notes Introduction to Ergodic Theory

Tools from Lebesgue integration

Nonlinear aspects of Calderón-Zygmund theory

Trace and extension results for a class of domains with self-similar boundary

Spectral Continuity Properties of Graph Laplacians

The Kato square root problem on vector bundles with generalised bounded geometry

MATHS 730 FC Lecture Notes March 5, Introduction

Numerical Solutions to Partial Differential Equations

Continuity. Matt Rosenzweig

The Codimension of the Zeros of a Stable Process in Random Scenery

Examples of Dual Spaces from Measure Theory

Topological properties of Z p and Q p and Euclidean models

Transcription:

Heat kernels and function theory on metric measure spaces Alexander Grigor yan Imperial College London SW7 2AZ United Kingdom and Institute of Control Sciences of RAS oscow, Russia email: a.grigoryan@ic.ac.uk Contents Introduction 2 2 Definition of a heat kernel and examples 5 3 Volume of balls 8 4 Energy form 5 Besov spaces and energy domain 3 5. Besov spaces in R n............................ 3 5.2 Besov spaces in a metric measure space................. 4 5.3 Identification of energy domains and Besov spaces........... 5 5.4 Subordinated heat kernel......................... 9 6 Intrinsic characterization of walk dimension 22 7 Inequalities for walk dimension 23 8 Embedding of Besov spaces into Hölder spaces 29 9 Bessel potential spaces 3

Introduction The classical heat kernel in R n is the fundamental solution to the heat equation, which is given by the following formula ( x y 2 p t (x, y = exp. (. n/2 (4πt 4t It is worth observing that the Gaussian term exp ( x y 2 does not depend in n, 4t whereas the other term (4πt n/2 reflects the dependence of the heat kernel on the underlying space via its dimension. The notion of heat kernel extends to any Riemannian manifold. In this case, the heat kernel p t (x, y is the minimal positive fundamental solution to the heat equation u = u where is the Laplace-Beltrami operator on, and it always t exists (see [], [3], [6]. Under certain assumptions about, the heat kernel can be estimated similarly to (.. For example, if is geodesically complete and has non-negative Ricci curvature then by a theorem of Li and Yau [25] ( C p t (x, y V ( x, t exp d2 (x, y. Ct Here V (x, r is the volume of the geodesic ball B (x, r of radius r centered at x, and the sign means that both inequality signs and can be used instead but the positive constant C may be different in upper and lower bounds. Again, most information about the geometry of sits in the volume term, whereas the Gaussian ( term exp d2 (x,y Ct seems to be more robust. The recent development of analysis on fractals, notably by.barlow and R.Bass (see [5] and references therein, has shed new light on the nature of the Gaussian term. Without going into the definition of fractals sets, let us give an example. The most well studied fractal set is the Sierpinski gasket SG which is obtain from an equilateral triangle by removing its middle triangle, then removing the middle triangles from the remaining triangles, etc. (see Fig. where three iterations are shown and the removed triangles are blank. Figure : We regard SG as a metric measure energy space. A metric d on SG is defined as the restriction of the Euclidean metric. A measure µ on SG is the Hausdorff measure For a detailed account of fractals see the article of.barlow [3] in the same volume. 2

H α where α := dim H SG is called the fractal dimension of SG (it is possible to show that α = log 2 3. Defining an energy form is highly non-trivial. By an energy form we mean an analogue of the Dirichlet form E [f] = f 2 dvol which is defined on any Riemannian manifold. One first defines a discrete analogue of this form on graphs approximating SG, and then takes a certain scaling limit. Barlow and Perkins [8] have proved that the resulting functional E [f] is a local regular Dirichlet form in L 2 (SG, µ. Furthermore, they have shown that the associate diffusion process X t has a transition density p t (x, y which is a continuous function on x, y SG and t > and satisfies the following estimate p t (x, y C exp tα/β ( ( d β (x, y Ct β (.2 provided < t < (the restriction t < can be removed if one considers an unbounded version of SG. Here α is the same as above - the Hausdorff dimension of SG, whereas β is a new parameter called the walk dimension. It is determined in terms of the process X t as follows: the mean exit time from a ball of radius r is of the order r β. For SG it is known that β = log 2 5. Barlow and Bass [5] extended the above construction and the estimate (.2 to the large family of fractals including Sierpinski carpet and its higher dimensional analogues (see also [2]. oreover, it turns out that estimates similar to (.2 are valid on fractal-like graphs (see [6], [9], [2], [22], [3]. An example of such a graph the graphical Sierpinski gasket is shown on Fig. 2. Furthermore, one easily makes a smooth Riemannian manifold out Figure 2: of this graph by replacing the edges by tubes, and the heat kernel on this manifold also satisfies (.2 provided t max (d (x, y, where d is the geodesic distance. In all the cases the nature of the parameters α and β in (.2 is of great interest. Assume that a metric measure space (, d, µ admits a heat kernel satisfying (.2 with some α > and β > (a precise definition of a heat kernel will be given in 3

the next section to some extent, specifying a heat kernel is equivalent to choosing an appropriate energy form on. Although a priori the parameters α and β are defined through the heat kernel, a posteriori they happen to be invariants of the underlying metric measure structure alone. The parameter α turns out to be the volume growth exponent of (see Theorem 3. which also implies that α is the Hausdorff dimension of. The parameter β is characterized intrinsically as the critical exponent of a family of function spaces on. This approach for characterizing of the walk dimension originated by A.Jonsson [23] in the setting of SG, and was later used by K.Pietruska-Pa luba [26] and A.Stós [27] for so called d- sets in R n supporting a fractional diffusion. In full generality this result was proved in [8]. The structure of the paper and the main results are as follows. In Section 2 we define a heat kernel on a metric measure space and related notions. The main result of Section 3 is Theorem 3. characterizing the parameter α from (.2 as the volume growth exponent. In Section 4 we define the energy form associated with the heat kernel and prove that the domain of the energy form embeds compactly into L 2 (Theorem 4.. In Section 5 we introduce a family of function spaces on generalizing Besov spaces, and prove Theorems 5. and 5.2 characterizing the domain of the energy form in terms of the family of Besov spaces. In Section 6 we prove Theorem 6.2 giving an intrinsic characterization of the parameter β from (.2 in terms of Besov spaces. In Section 7 we prove Theorem 7.2 saying that under mild assumptions about the underlying metric space, the parameters α and β are related by the inequalities 2 β α + (see also Corollary 7.3(iii. In Section 8 we prove an embedding of Besov spaces into Hölder spaces C λ (Theorem 8.. In Section 9 we introduce Bessel potential spaces generalizing fractional Sobolev spaces in R n, and prove the embedding of these spaces into L q and C λ (Theorem 9.2(i and (ii, respectively. ost results surveyed here are taken from [8] (these are Theorems 3., 4., 5., 6.2, 7.2, 8.. Theorem 5.2 is new although it is largely motivated by a result of A.Stós [27]. Theorem 9.2(ii is also new giving a partial answer to a conjecture of R.Strichartz [28]. In this survey we do not touch the methods of obtaining estimates like (.2, and for that we refer the reader to [7], [2], [9], [2], [2], [24]. Notation. For non-negative functions f and g, we write f g if there is a constant C > such that C g f Cg in the domain of the functions f, g or in a specified range of the arguments. Letters c and C are normally used to denote unimportant positive constants, whose values may change at each occurrence. Acknowledgements. This survey is based on the lectures given during the trimester program Heat kernels, random walks, and analysis on manifolds and graphs held in April June 22 at the Centre Emile Borel, Institute Henri Poincaré, Paris. I am grateful to the CNRS (France for the financial support during this program. It is my pleasure to thank A.Stós and R.Strichartz for the discussions eventually leading to Theorem 9.2(ii, as well as T.Coulhon and W.Hansen for careful reading of the manuscript. 4

2 Definition of a heat kernel and examples Let (, µ be a measure space; that is, µ is a measure on a σ-algebra of subsets of a non-empty set. For any q [, + ], set L q = L q (, µ and u q := u L q (,µ. All functions in L q are considered to be real-valued. Definition 2. A family {p t } t> of µ µ-measurable functions p t (x, y on is called a heat kernel if the following conditions are satisfied, for µ-almost all x, y and all s, t > : (i Positivity: p t (x, y. (ii Stochastic completeness: p t (x, ydµ(y. (2. (iii Symmetry: p t (x, y = p t (y, x. (iv Semigroup property: p s+t (x, y = p s (x, zp t (z, ydµ(z. (2.2 (v Approximation of identity: for any u L 2 p t (x, yu(ydµ(y L2 u(x as t +. (2.3 For example, the Gauss-Weierstrass kernel (. in R n satisfies this definition. Alternatively, the function (. can be viewed as the density of the normal distribution in R n with the mean y and the variance 2t. Another elementary example is the Poisson kernel in R n p t (x, y = C n t n n+, (2.4 ( + x y 2 2 t 2 where C n = Γ ( n+ 2 /π (n+/2, which can be viewed as the Cauchy distribution in R n with parameters y and t. On any Riemannian manifold, the heat kernel associated with the Laplace- Beltrami operator satisfies all properties of Definition 2. (with respect to the Riemannian volume µ except for the stochastic completeness; however, the latter too can be obtained under certain mild hypotheses about (see [5] and [7]. Under additional assumptions about the space (, µ a heat kernel p t gives rise to a arkov process {X t } for which p t is the transition density. This means that for any measurable set A and for all x, t >, P x {X t A} = p t (x, ydµ(y. 5

For example, the process associated with the Gauss-Weierstrass kernel is the standard Brownian motion in R n, and process associated with the Poisson kernel is a certain jump process in R n. Any heat kernel gives rise to the heat semigroup {P t } t> where P t is the operator in L 2 defined by P t u(x = p t (x, yu(ydµ(y. (2.5 Indeed, we obtain by the Cauchy-Schwarz inequality and (2. P t u 2 2 = = ( ( = u 2 2, 2 p t (x, yu(ydµ(y dµ(x p t (x, ydµ(y p t (x, yu(y 2 dµ(y dµ(x p t (x, yu(y 2 dµ(xdµ(y which implies that P t is a bounded operator in L 2 and P t 2 2. The symmetry of the heat kernel implies that P t is a self-adjoint operator. The semigroup identity (2.2 implies that P t P s = P t+s, that is, the family {P t } t> is a semigroup. Finally, it follows from (2.3 that s- lim t P t = I, where I is the identity operator in L 2 and s-lim stands for strong limit. {P t } t> is a strongly continuous, self-adjoint, contraction semigroup in L 2. Define the infinitesimal generator L of a semigroup {P t } t> by Hence, u P t u Lu := lim, (2.6 t t where the limit is understood in the L 2 -norm. The domain dom(l of the generator L is the space of functions u L 2 for which the limit in (2.6 exists. By the Hille Yosida theorem, dom(l is dense in L 2. Furthermore, L is a self-adjoint, positive definite 2 operator, which immediately follows from the fact that the semigroup {P t } is self-adjoint and contractive. oreover, we have P t = exp ( tl, (2.7 where the right hand side is understood in the sense of spectral theory. For example, the generator of the Gauss-Weierstrass kernel in R n is where = n i= is the classical Laplace operator with a properly defined domain in 2 x 2 i 2 Normally the generator of a semigroup {P t } is defined as the limit lim t P t u u. t Our choice of the sign in (2.6 makes the generator positive definite. 6

L 2 (R n. The generator of the Poisson kernel is ( /2. It is well known that for any < m < 2 the operator ( m/2 is the generator of a heat kernel associated with the symmetric stable process of index m, which belongs to the family of Levy processes. Any positive definite self-adjoint operator L in L 2 determines by (2.7 a semigroup P t satisfying the above properties. It is not always the case that the semigroup {P t } defined by (2.7 possesses an integral kernel. However, when it does, the integral kernel satisfies all the properties of Definition 2. except for positivity and stochastic completeness; to ensure the latter properties, some additional assumptions about L are needed (see [4]. Definition 2.2 We say that a triple (, d, µ is a metric measure space if (, d is a non-empty metric space and µ is a Borel measure on. In the sequel, we will always work in a metric measure space (, d, µ. Let p t be a heat kernel on such a space, and consider the following estimates for p t, which in general may be true or not: ( ( d(x, y d(x, y t α/β Φ t /β p t (x, y t α/β Φ 2 t /β, (2.8 for µ-almost all x, y and all t (,. Here α, β are positive constants, and Φ and Φ 2 are a priori non-negative monotone decreasing functions on [, +. For example, the Gauss-Weierstrass kernel (. satisfies (2.8 with α = n, β = 2, and Φ (s = Φ 2 (s = (4π n/2 exp ( s2 4 The Poisson heat kernel (2.4 satisfies (2.8 with α = n, β =, and. Φ (s = Φ 2 (s = C n ( + s 2 n+ 2. For any < m < 2 the heat kernel of the operator ( m/2 in R n satisfies the following estimate p t (x, y t n/m ( n+m (2.9 + x y t /m (see [9] or Lemma 5.4 below. Hence, the estimate (2.8 is satisfied with α = n, β = m, and with functions Φ and Φ 2 of the form Φ (s = C, (2. α+β ( + s where the constant C may be different for Φ and Φ 2. The estimate (.2 for heat kernels on fractals mentioned in Introduction is a particular case of (2.8 with the same α and β and with functions Φ, Φ 2 of the form Φ (s = C exp ( Cs β β. 7

Definition 2.3 We say that a heat kernel p t on a metric measure space (, d, µ satisfies hypothesis H (θ (where θ > if p t satisfies the estimate (2.8 with some positive parameters α, β and non-negative decreasing functions Φ and Φ 2 such that Φ ( > and s θ Φ 2 (s ds <. (2. s Note that hypothesis H(θ gets stronger with increasing of θ. For example, every function of the form Φ 2 (s = exp ( Cs γ (2.2 satisfies (2. for any θ provided C > and γ >, and the function Φ 2 (s = C ( + s γ satisfies (2. for θ < γ. In particular, the Gauss-Weierstrass kernel satisfies H (θ for all θ whereas the heat kernel of the operator ( m/2 in R n satisfies H (θ for θ < n + m. 3 Volume of balls Let B(x, r := {y : d(x, y < r} be the ball in of radius r centered at the point x. Theorem 3. If a heat kernel p t on a metric measure space (, d, µ satisfies hypothesis H(α then for all x and r > C r α µ(b(x, r Cr α. (3. Remark 3.2 Note that in all examples considered in Section 2, hypothesis H(α is satisfied. Let us mention also that the estimate (3. can be true only for a single value of α which is called the exponent of the volume growth and is determined by intrinsic properties of the space (, d, µ. Hence, under hypothesis H(α the value of the parameter α in (2.8 is an invariant of the underlying space. Proof. Fix x and prove first the upper bound for all r >. Indeed, for any t >, we have p t (x, ydµ(y B(x,r µ(b(x, r Cr α, (3.2 p t (x, ydµ(y = (3.3 whence ( µ(b(x, r inf p t (x, y. y B(x,r 8

Applying the lower bound in (2.8 and taking t = r β we obtain inf p t (x, y ( r y B(x,r t Φ α/β = c r α, t /β where c = Φ ( >, whence (3.2 follows. Let us prove the opposite inequality µ(b(x, r c r α. (3.4 We first show that the upper bound in (2.8 and (3.2 imply the following inequality p t (x, ydµ(y for all t εr β, (3.5 \B(x,r 2 provided ε > is sufficiently small. Setting r k = 2 k r and using the monotonicity of Φ 2 and (3.2 we obtain ( d(x, y p t (x, ydµ(y t α/β Φ 2 dµ(y \B(x,r \B(x,r t /β ( t α/β rk Φ 2 dµ(y t /β k= C = C C B(x,r k+ \B(x,r k ( rk α t α/β rk Φ 2 t /β ( 2 k α ( r 2 k r Φ 2 k= k= t /β t /β s α Φ 2 (s ds s. (3.6 2 r/t/β Since by (2. the integral in (3.6 is convergent, its value can be made arbitrarily small provided r β /t is large enough, whence (3.5 follows. From (2. and (3.5, we conclude that for such r and t p t (x, ydµ(y 2, (3.7 whence B(x,r µ(b(x, r 2 ( sup p t (x, y. y B(x,r Finally, choosing t := εr β and using the upper bound we obtain (3.4. p t (x, y t α/β Φ 2 ( = Cr α, Corollary 3.3 If a metric measure space (, d, µ admits a heat kernel p t satisfying (2.8 then µ ( =. If in addition Φ ( > then diam =. 9

Proof. Let us show that the upper bound in (2.8 implies µ( =. Indeed, fix a point x and observe that the family of functions u t (x = p t (x, x satisfies the following two conditions: u t = and Hence, we obtain u t Ct α/β as t. µ( u t u t as t, that is µ( =. On the other hand, the first part of the proof of Theorem 3., based on the hypothesis Φ ( >, says that the measure of any ball is finite. Hence, is not contained in any ball, that is diam =. 4 Energy form Given a heat kernel {p t } on a measure space (, µ, define for any t > a quadratic form E t on L 2 by ( u Pt u E t [u] :=, u, (4. t where (, is the inner product in L 2. An easy computation shows that E t can be equivalently defined by E t [u] = u(x u(y 2 p t (x, ydµ(ydµ(x. (4.2 2t Indeed, by (2. and (2.5 we have u(x P t u(x = whence by (4. E t [u] = t (u(x u(y p t (x, ydµ(y (u(x u(y u(xp t (x, ydµ(ydµ(x. (4.3 Interchanging the variables x and y and using the symmetry of the heat kernel, we obtain also E t [u] = (u(y u(x u(yp t (x, ydµ(ydµ(x, (4.4 t and (4.2 follows by adding up (4.3 and (4.4. In terms of the spectral resolution {E λ } of the generator L, E t can be expressed as follows e tλ E t [u] = d E λ u 2 t 2, which implies that E t [u] is decreasing in t (indeed, this is an elementary exercise to show that the function t e tλ is decreasing. t

Let us define a quadratic form E by E[u] := lim t + E t [u] = λ d E λ u 2 2 (4.5 (where the limit may be + since E [u] E t [u] and its domain D (E by D(E : = {u L 2 : E [u] < }. It is clear from (4.2 and (4.5 that E t and E are positive definite. It is easy to see from (4.5 that D(E = dom(l /2. It will be convenient for us to use the following notation: dom E (L := D (E = dom(l /2. (4.6 The domain D(E is dense in L 2 because D (E contains dom(l. Indeed, if u dom(l then using (2.6 and (4., we obtain E [u] = lim t E t [u] = (Lu, u <. (4.7 The quadratic form E [u] extends to a bilinear form E (u, v by the polarization identity E (u, v = (E [u + v] E [u v]. 2 It follows from (4.7 that E(u, v = (Lu, v for all u, v dom(l. The space D (E is naturally endowed with the inner product [u, v] := (u, v + E (u, v. (4.8 It is possible to show that the form E is closed, that is the space D(E is Hilbert. It is easy to see from (2.5 and the definition of a heat kernel that the semigroup {P t } is arkovian, that is u implies P t u. This implies that the form E satisfies the arkov property, that is u D (E implies v := min(u +, D (E and E [v] E [u]. Hence, E is a Dirichlet form. In the next statement, we demonstrate how the heat kernel estimate (2.8 allows to prove a compact embedding theorem. Theorem 4. Let (, d, µ be a metric measure space, and p t be a heat kernel in satisfying (2.8. Then for any bounded sequence {u k } D(E there exists a subsequence {u ki } that converges to a function u L 2 (, µ in the following sense: for any set B of finite measure. u ki u L 2 (B,µ, Remark 4.2 The estimate (2.8 without further assumptions on Φ and Φ 2 is equivalent to the upper bound µ-ess sup x,y p t (x, y Ct α/β, for all t >. (4.9

Proof. Let {u k } be a bounded sequence in D (E. Since {u k } is also bounded in L 2, there exists a subsequence, still denoted by {u k }, such that {u k } weakly converges to some function u L 2. Let us show that in fact {u k } converges to u in L 2 (B = L 2 (B, µ for any set B of finite measure. For any t >, we have by the triangle inequality u k u L 2 (B u k P t u k L 2 ( + P t u k P t u L 2 (B + P t u u L 2 (. (4. For any function v L 2 we have v P t v 2 2 = ( { = 2t E t [v] 2t E [v]. 2 (v(x v(yp t (x, ydµ(y dµ(x } p t (x, ydµ(y v(x v(y 2 p t (x, ydµ(y dµ(x Since E [u k ] is uniformly bounded in k by the hypothesis, we obtain that for all k and t > u k P t u k 2 C t. (4. Since {u k } converges to u weakly in L 2 and p t (x, L 2, we see that for µ-almost all x P t u k (x = p t (x, yu k (ydµ(y p t (x, yu(ydµ(y = P t u(x. Using the definition (2.5 of the semigroup P t, the Cauchy-Schwarz inequality, and (2. we obtain that, for any v L 2, P t v(x p t (x, y v(y dµ(y Hence, we have by (4.2 { } /2 { } /2 p t (x, yv(y 2 dµ(y p t (x, ydµ(y C t α 2β v 2. (4.2 P t u k Ct α 2β uk 2 so that the sequence {P t u k } is uniformly bounded in k for any t >. Since {P t u k } converges to P t u almost everywhere, the dominated convergence theorem yields P t u k P t u in L 2 (B, (4.3 because µ(b <. Hence, we obtain from (4., (4., and (4.3 that for any t > lim sup u k u L 2 (B C t + P t u u L 2 (. k Since P t u u in L 2 ( as t, we finish the proof by letting t. 2

Corollary 4.3 Let (, d, µ be a metric measure space, and p t be a heat kernel in satisfying (2.8 with Φ ( >. Then for any bounded sequence {u k } D(E there exists a subsequence {u ki } that converges to a function u L 2 (, µ almost everywhere. Proof. By the first part of the proof of Theorem 3., the hypothesis Φ ( > implies finiteness of the measure of any ball. Fix a point x and consider the sequence of balls B N = B(x, N, where N =, 2,... By Theorem 4. we can assume that the sequence {u k } converges to u L 2 ( in the norm of L 2 (B N for any N. Therefore, there exists a subsequence that converges almost everywhere in B. From this sequence, let us select a subsequence that converges to u almost everywhere in B 2, and so on. Using the diagonal process, we obtain a subsequence that converges to u almost everywhere in. 5 Besov spaces and energy domain The purpose of this section is to give an alternative characterization of dom E (L in terms of Besov spaces, which are defined independently of the heat kernel. 5. Besov spaces in R n Recall that the Sobolev space W2 (R n consists of functions u L 2 (R n such that u x i L 2 (R n for all i =, 2,..., n. It is known that a function u L 2 (R n belongs to W2 (R n if and only if u(x + z u(x 2 sup z R n \{} z <. Fix p <, < σ <, and consider a more general Besov-Nikol skii space B σ p, (R n that consists of functions u L p (R n such that u (x + z u (x p sup z R n,< z z σ <, (5. and the norm in Bp, σ is the sum of u p and the left hand side of (5.. A more general family Bp,q σ of Besov spaces is defined for any q but alongside the case q = considered above, we will need only the case q = p. By definition, u Bp,p σ if u L p and u(y u(x p n+pσ dx dy <, (5.2 y x R n R n with the obvious definition of a norm in B σ p,p. Here are some well known facts about Besov and Sobolev spaces (see for example [].. u Bp, σ (R n if and only if u L p (R n and there is a constant C such that for all < r D p (u, r := u(y u(x p dx dy Cr n+pσ. (5.3 {x,y R n : x y <r} 3

2. u B σ p,p (R n if and only if u L p (R n and D p (u, r dr r n+pσ r <. 3. For any < σ < the following relations take place W2 (R n B2,2 σ (R n B2, σ (R n (5.4 dom E ( dom E ( σ 5.2 Besov spaces in a metric measure space Fix α >, and for any σ > introduce the following functionals on L 2 (, µ: D (u, r := u(x u(y 2 dµ(xdµ(y, (5.5 and {x,y :d(x,y<r} D (u, r N σ, (u := sup (5.6 <r r α+2σ N σ,2 (u := ore generally, for any q [, + one can set For any q [, + ] define the space and the norm in this space by D (u, r dr r α+2σ r. (5.7 ( ( q/2 2/q D (u, r dr N σ,q (u :=. (5.8 r α+2σ r Λ σ 2,q := { u L 2 : N σ,q (u < } u 2 Λ σ 2,q := u 2 2 + N σ,q (u. An obvious modification of the above formulas allows to introduce the space Λ σ p,q for any p [, + (the space Λ σ p,q was denoted by Lip (σ, p, q in [23] and by Λ p,q σ in [28]; the space Λ σ 2, was denoted by W σ,2 in [8]. For example, we have Λ σ 2,q (R n = B σ 2,q (R n if < σ <, Λ σ 2,q (R n = {}, if σ >, Λ 2, (R n = W 2 (R n, Λ 2,2 (R n = {}. The definitions of Λ σ 2,q and B σ 2,q match only for σ <. For σ the definition of B σ 2,q becomes more involved whereas the above definition of Λ σ 2,q is valid for all 4

σ > even if the space Λ σ 2,q degenerates to {} for sufficiently large σ. With some abuse of terminology, we refer to Λ σ p,q also as Besov spaces. The fact that D (u, r is increasing in r implies that for any r > D (u, r r α+2σ 2 α+2σ 2r r D (u, ρ dρ ρ α+2σ ρ whence N σ, (u CN σ,2 (u and Λ σ 2,2 Λ σ 2,. (5.9 The above definition of the spaces Λ σ 2,q depends on the choice of α. A priori α is any number but we will use the above definition in the presence of the following condition µ (B(x, r r α for all x and r >, (5. and α in (5.6-(5.8 will always be the same as in (5.. In particular, (5. implies that for any u L 2 ( D (u, r 2 u (x 2 + u (y 2 dµ (x dµ (y = 4 = 4 {d(x,y<r} {d(x,y<r} Cr α u 2. u (y 2 dµ (x dµ (y u (y 2 µ (B (y, r dµ (y Therefore, the integrals in (5.7 and (5.8 converge at for all u L 2 and σ >, and the point of the condition N σ,q (u < is the convergence of the integral at. Consequently, we see that the space Λ σ 2,q decreases when σ increases. 5.3 Identification of energy domains and Besov spaces Recall that a heat kernel p t on a metric measure space (, d, µ has the associated energy form E and the generator L. The following theorem identifies the domain D(E of the energy form in terms of Besov spaces. Theorem 5. Let p t be a heat kernel on (, d, µ satisfying hypothesis H(α + β. Then dom E (L := D(E = Λ β/2 2, and for any u D (E E [u] N β/2, (u. (5. The result of Theorem 5. was first obtained by Jonsson [23, Theorem ] for SG and then was extended to more general fractal diffusions by Pietruska-Pa luba [26, Theorem ]. In the present form it was proved in [8, Theorem 4.2]. 5

Recall that hypothesis H(α + β means that p t satisfies (2.8 with functions Φ and Φ 2 such that Φ ( > and s α+β Φ 2 (s ds <. (5.2 s Let us show the sharpness of the condition (5.2. As it was mentioned above if < σ < then the heat kernel of the operator ( σ in R n satisfies (2.8 with the function Φ 2 (s = C, where α = n and β = 2σ (see also Lemma 5.4 below. For +s α+β this function, the condition (5.2 breaks just on the borderline, and the conclusion of Theorem 5. is not valid either. Indeed, in R n by (5.4 dom E ( σ = B2,2 σ that is strictly smaller than B2, σ = Λ β/2 2,. This case will be covered by Theorem 5.2 below. As we will see in the proof below (cf. (5.5 and (5.7, under hypothesis H (α + β we have in fact E [u] lim sup r α β r u(x u(y 2 dµ(ydµ(x. (5.3 {d(x,y<r} In particular, this implies that the energy form is strongly local, that is for all functions u, v D (E with compact supports, if u const in an open neighborhood of the support of v then E (u, v =. The operator ( σ is not local for < σ <, and this explains why Theorem 5. does not apply to this operator. Proof of Theorem 5.. Since the expressions E [u] and N β/2, (u are defined (with possibility of infinite values for all u L 2, it suffices to show that (5. holds for all u L 2. Fix a function u L 2 and recall that by (5.6 and (5.5 where D(u, r = D(u, r N β/2, (u = sup <r r α+β B(x,r We first prove that for some c > and for all r > which would imply E [u] c sup r> u(x u(y 2 dµ(ydµ(x. (5.4 D (u, r E [u] c r α+β D (u, r r α+β cn β/2, (u. (5.5 Using the lower bound in (2.8 and the monotonicity of Φ, we obtain from (4.2 that for any r > and t = r β, E[u] (u(x u(y 2 p t (x, ydµ(ydµ(x 2t B(x,r ( α/β+ ( r Φ (u(x u(y 2 t t 2 dµ(ydµ(x /β B(x,r = 2 r Φ α+β ( (u(x u(y 2 dµ(ydµ(x D (u, r c, r α+β B(x,r 6

which was to be proved. Let us now prove that for any r > which would imply D (u, ρ E [u] C sup, (5.6 <ρ r ρ α+β D (u, r E [u] C lim sup C N r + r α+β β/2, (u. (5.7 For any t > and r > we have E t [u] = (u(x u(y 2 p t (x, ydµ(ydµ(x = A(t + B(t (5.8 2t where A(t = 2t B(t = 2t \B(x,r B(x,r (u(x u(y 2 p t (x, ydµ(ydµ(x, (5.9 (u(x u(y 2 p t (x, ydµ(ydµ(x. (5.2 To estimate A(t let us observe that by (3.6 p t (x, ydµ(y C s α Φ 2 (s ds \B(x,r s C t s α+β Φ r β 2 (s ds 2 rt /β s, 2 rt /β (5.2 whence by (5.2 p t (x, ydµ(y = o (t as t + uniformly in x. (5.22 t \B(x,r Therefore, applying the elementary inequality (a b 2 2(a 2 + b 2 in (5.9 and using (5.22, we obtain that for small enough t > A(t (u(x 2 + u(y 2 p t (x, ydµ(ydµ(x t = 2 t {x,y:d(x,y<r} = o( u 2 2, u(x 2 ( \B(x,r p t (x, ydµ(y dµ(x whence lim A(t =. (5.23 t + 7

The quantity B(t is estimated as follows using (2.8, (5.4, and (5.2, setting r k = 2 k r: B(t = (u(x u(y 2 p t (x, ydµ(ydµ(x 2t 2 C k= k= k= C sup <ρ r C sup <ρ r t +α/β Φ 2 B(x,r k \B(x,r k+ ( rk+ t /β B(x,r k ( rk+ α+β ( rk+ D(u, rk Φ2 t /β t /β D(u, ρ ρ α+β r α+β k s α+β Φ 2 (s ds s (u(x u(y 2 dµ(ydµ(x (5.24 D(u, ρ ρ α+β. (5.25 Finally, (5.6 follows from (5.8, (5.23 and (5.25 by letting t. Theorem 5.2 Let p t be a heat kernel on (, d, µ satisfying estimate (2.8 with functions Φ and Φ 2 such that Then Φ (s s (α+β for s > and Φ 2 (s Cs (α+β for s >. (5.26 and for any u D (E dom E (L = Λ β/2 2,2, E [u] N β/2,2 (u. (5.27 Remark 5.3 Condition (5.26 implies that H (α + β is not satisfied while H (α + β ε is satisfied for any ε >. In this case, neither the hypotheses nor the conclusions of Theorem 5. are satisfied. Proof. The proof is similar to that of Theorem 5.. It suffices to show that (5.27 holds for any u L 2. Fix a decreasing geometric sequence {r k } k Z and observe that by (5.7 N β/2,2 (u D (u, r k. r α+β k Z k Using (4.2 and the upper bounds in (2.8 and (5.26 we obtain 2E t [u] = (u(x u(y 2 p t (x, ydµ(ydµ(x t k Z k Z k Z t +α/β Φ 2 t +α/β Φ 2 C D (u, r k r α+β k Z k CN β/2,2 (u, B(x,r k \B(x,r k+ ( rk+ t /β D (u, r t /β k ( rk+ B(x,r k \B(x,r k+ 8 (u(x u(y 2 dµ(ydµ(x

whence E [u] = lim t E t [u] CN β/2,2 (u. (5.28 Similarly, using the lower bound in (2.8, we obtain 2E t [u] ( t Φ rk+ +α/β (u(x u(y t /β k Z 2 dµ(ydµ(x B(x,r k \B(x,r k+ = ( t Φ rk+ +α/β (D (u, r t /β k D (u, r k+ k Z = ( ( rk+ ( rk Φ t +α/β Φ t /β D (u, r t /β k. (5.29 k Z The first part of the hypothesis (5.26 implies that there exists a large enough number a such that ( s Φ 2Φ (s s > a. a Setting r k = a k, we obtain from (5.29 and (5.26 E t [u] 2 {k:r k >at /β } Letting t, we conclude ( t Φ rk +α/β D (u, r t /β k c {k:r k >at /β } D (u, r k. r α+β k E [u] = lim t E t [u] c k Z D (u, r k r α+β k N β/2,2 (u, which together with (5.28 finishes the proof. 5.4 Subordinated heat kernel Let ϕ be a non-negative continuous function on [, + such that ϕ ( =, and let {η t } t> be a family of non-negative continuous functions on (, + such that for all t > and λ exp ( tϕ (λ = η t (s e sλ ds. (5.3 Then, for any heat kernel p t on a metric measure space (, d, µ, the following expression q t (x, y := η t (s p s (x, y ds (5.3 defines a new heat kernel {q t } t> on, which is called a subordinated heat kernel to p t (and η t is called a subordinator. Indeed, applying (5.3 to the generator L of p t we obtain exp ( tϕ (L = η t (s P s ds. 9

Comparing to (5.3 we see that q t is the integral kernel of the semigroup { e tϕ(l} t> generated by the operator ϕ (L. Since { e tϕ(l} is a self-adjoint strongly continuous contraction semigroup in L 2, the family {q t } t> t> satisfies the properties 3, 4, 5 of Definition 2.. The positivity of q t follows from η t, and the stochastic completeness of q t follows from that of p t and ( q t (x, y dµ (y = η t (s p s (x, y dµ (y ds = η t (s ds =, where the last equality is obtained from (5.3 by taking λ =. Hence, {q t } t> is a heat kernel. For example, it follows from the definition of the gamma-function that for all t > and λ exp ( t log ( + λ = ( + λ t = Γ (t s t e s(+λ ds, which takes the form (5.3 for ϕ (λ = log ( + λ and η t (s = st e s. Therefore, Γ(t the operator log ( + L that generates the semigroup { (I + L t} has the heat t> kernel q t (x, y = Γ (t s t e s p s (x, y ds. It is well known that for any δ (, there exists a subordinator η t = η (δ t such that (5.3 takes place with ϕ (λ = λ δ. In this case, (5.3 defines the heat kernel q t of the operator L δ. For example, if δ = then 2 η (/2 t (s = ( t exp t2. 4πs 3 4s For any < δ <, the function η (δ t (s possesses the scaling property η (δ t (s = ( s η(δ t/δ, t /δ and satisfies the estimates η (δ t (s C t η (δ t (s s +δ s, t >, (5.32 t s +δ s t /δ >. (5.33 As s +, η (δ (s goes to exponentially fast so that for any γ > (see [3] and []. s γ η (δ (s ds < (5.34 2

Lemma 5.4 If a heat kernel p t satisfies hypothesis H (α + β where β = δβ, < δ <, then the heat kernel q t (x, y of operator L δ satisfies the estimate ( q t (x, y t ( t α/β + d(x,y α+β min t α/β,, (5.35 d (x, y α+β t /β for all x, y and t >. Proof. Setting r = d (x, y and using (5.3, (2.8, (5.32 we obtain q t (x, y C = C t r α+βδ ( r s Φ α/β 2 s /β s Φ α/β 2 By H (α + β the above integral converges, whence η (δ t (s ds ( r t s /β s Φ 2 (ξ ξ α+βδ dξ ξ. +δ ds q t (x, y C t. (5.36 r α+β On the other hand, using the upper bound p s (x, y Cs α/β τ = s/t /δ we obtain and the change q t (x, y C s η(δ α/β t (s ds = t α/(βδ η(δ τ α/β (τ dτ Ct α/β (5.37 where the last inequality follows from (5.34. Combining (5.36 and (5.37 we obtain the upper bound in (5.35. Finally, (5.3, (2.8, (5.33 imply the lower bound in (5.35 as follows: q t (x, y max(t /δ,r β cφ ( ( r s Φ α/β s /β max(t /δ,r β s α/β = ct max ( t /δ, r β α/β δ ( = c min t α/β, tr α β. t s η (δ t (s ds +δ ds Corollary 5.5 If a heat kernel p t satisfies hypothesis H (α + β where β = δβ, < δ <, then dom E (L δ = Λ β /2 2,2. (5.38 Remark 5.6 This result was essentially proved by A.Stós [27]. 2

Proof. By Lemma 5.4, the heat kernel q t of the operator L δ satisfies the estimate (2.8 q t (x, y ( d (x, y Φ t α/β t /β where Φ (s = ( + s α+β. Applying Theorem 5.2 to the heat kernel q t and its generator L δ we obtain (5.38. 6 Intrinsic characterization of walk dimension Definition 6. Let us set β := 2 sup { σ : dim Λ σ 2, = } (6. and refer to β as the critical exponent of the family { Λ σ 2, } σ> of Besov spaces. Note that the value of β is an intrinsic property of the space (, d, µ, which is defined independently of any heat kernel. For example, for R n we have β = 2. Theorem 6.2 Let p t be a heat kernel on a metric measure space (, d, µ. (i If p t satisfies hypothesis H(α then dim Λ β/2 2, =. Consequently, β β. (ii If p t satisfies hypothesis H(α + β + ε for some ε > then Λ σ 2, = {} for any σ > β/2. Consequently, β = β. If p t is the heat kernel of the operator ( β/2 in R n, < β < 2, then by (2.9 it satisfies H (α (with α = n but not H (α + β + ε. In this case the conclusion of Theorem 6.2(ii is not true, because β can actually be smaller than β = 2. Proof of Theorem 6.2(i. As one can see from the proof of Theorem 5., the inclusion D (E Λ β/2 2, requires only the lower estimates in (2.8 and (3. (and the opposite inclusion requires only the upper estimates in (2.8 and (3.. By Theorem 3. hypothesis H(α implies (3., and by the above remark we obtain D (E Λ β/2 2,. On the other hand, D (E is always dense in L 2, whereas it follows from Corollary 3.3 that dim L 2 =. Therefore, dim Λ β/2 2, =, whence β β. Proof of Theorem 6.2(ii. Let us first explain why β = β follows from the first claim. Indeed, Λ σ 2, = {} implies that σ β /2, and since this is true for any σ > β/2, we obtain β β. Since the opposite inequality holds by part (i, we conclude β = β. Let us prove that Λ σ 2, = {} for any σ > β/2. It suffices to assume that σ β/2 is positive but sufficiently small. Namely, we can assume that ε := 2σ β is so small that hypothesis H (α + β + ε holds. Recall that this hypothesis means that the estimate (2.8 holds with functions Φ and Φ 2 such that Φ ( > and s α+β+ε Φ 2 (s ds s <. (6.2 22

Let us show that for any function u Λ σ 2, we have E [u] =. We use again the decomposition E t [u] = A(t + B(t, where A(t and B(t are defined in (5.9 and (5.2 with r =. Estimating B(t similarly to (5.24 but using N σ, instead of N β/2, and setting r k = 2 k we obtain t ε/β B(t C k= ( rk+ α+β+ε ( rk+ D(u, rk Φ2 t /β t /β D(u, ρ C sup <ρ ρ α+2σ CN σ, (u. Together with (5.8 and (5.23 this yields whence r α+β+ε k s α+β+ε Φ 2 (s ds s E t [u] A(t + Ct ε/β N σ, (u as t, E [u] = lim t E t [u] =. Since E t [u] E [u], this implies back E t [u] for all t >. On the other hand, it follows from (4.2 and the lower bound in (2.8 that E t [u] 2t Φ ( (u(x u(y 2 dµ(ydµ(x, α/β+ {d(x,y t /β } which yields u(x = u(y for µ-almost all x, y such that d(x, y t /β. Since t is arbitrary, we conclude that u is a constant function. Since µ( = (see Corollary 3.3, we obtain u. Corollary 6.3 If a heat kernel p t satisfies H(α + β + ε then the values of the parameters α and β in (2.8 are invariants of the metric measure space (, d, µ. Proof. By Theorem 3., µ (B(x, r satisfies (3., which uniquely determines α as the exponent of the volume growth of (, d, µ. By Theorem 6.2(ii, β is uniquely determined as the critical exponent of the family of Besov spaces of (, d, µ. 7 Inequalities for walk dimension Definition 7. We say that a metric space (, d satisfies the chain condition if there exists a (large constant C such that for any two points x, y and for any positive integer n there exists a sequence {x i } n i= of points in such that x = x, x n = y, and The sequence {x i } n i= d(x i, x i+ C d(x, y, for all i =,,..., n. (7. n is referred to as a chain connecting x and y. 23

For example, the chain condition is satisfied if (, d is a length space, that is if the distance d(x, y is defined as the infimum of the length of all continuous curves connecting x and y, with a proper definition of length. On the other hand, the chain condition is not satisfied if is a locally finite graph, and d is the graph distance. Recall that the critical exponent β = β (, d, µ of the family of Besov spaces Λ σ 2, was defined by (6.. Theorem 7.2 Let (, d, µ be a metric measure space. (i If < µ (B (x, r < for all x and r >, and for all x and < r then µ(b(x, r Cr α (7.2 (ii If the space (, d satisfies the chain condition and for all x and < r then β 2. (7.3 µ(b(x, r r α (7.4 β α +. (7.5 Observe that the chain condition is essential for the inequality β α + to be true. Indeed, assume for a moment that the claim of Theorem 7.2(ii holds without the chain condition, and consider a new metric d on given by d = d /γ where γ >. Let us mark by a dash all notions related to the space (, d, µ as opposed to those of (, d, µ. It is easy to see that α = αγ and N σγ = N σ ; in particular, the latter implies β = β γ. Hence, if Theorem 7.2 could be applied to the space (, d, µ it would yield β γ αγ + which implies β α because γ may be taken arbitrarily large. However, there are spaces with β > α, for example SG. Clearly, the metric d does not satisfy the chain condition; indeed the inequality (7. implies d (x i, x i+ C d (x, y n /γ, (7.6 which is not good enough. Note that if in the inequality (7. we replace n by n /γ then the proof below will give β α + γ instead of β α +. Corollary 7.3 Let p t be a heat kernel on a metric measure space (, d, µ. (i If p t satisfies hypothesis H(α + β + ε for some ε > then β 2. (ii If p t satisfies hypothesis H(α and (, d satisfies the chain condition then β α +. (iii If p t satisfies hypothesis H(2α + + ε for some ε > and (, d satisfies the chain condition then 2 β α +. (7.7 24

.Barlow [4] has proved that if α and β satisfy (7.7 then there exists a graph such that the transition probability for the random walk on this graph satisfies (.2. There is no doubt that similar examples can be constructed in the setting of metric measure spaces satisfying the chain condition. Proof of Corollary 7.3. (i By Theorems 3. and 7.2(i we have β 2, and by Theorem 6.2(ii we have β = β, whence β 2. (ii By Theorems 3. and 7.2(ii we have β α +, and by Theorem 6.2(i we have β β, whence β α +. (iii Clearly, H(2α + + ε implies H(α, and by part (ii we obtain β α +. Therefore, H(2α + + ε implies H(α + β + ε, and by part (i we obtain β 2. Proof of Theorem 7.2(i. Let us show that dim Λ 2, =, which would imply by definition (6. of β that β 2. Fix a ball B(x, R in of a positive radius R, and let u(x be the tent function of this ball, that is u(x = (R d(x, x +. (7.8 Let us show that u Λ 2,. Indeed, by (5.6, it suffices to prove that for some constant C and for all < r < D(u, r = u(x u(y 2 dµ(ydµ(x Cr 2+α. (7.9 B(x,r Since the function u vanishes outside the ball B(x, R and r, the exterior integration in (7.9 can be reduced to B(x, R +. Using the obvious inequality and (7.2 we obtain D(u, r C u(x u(y d(x, y r, B(x,R+ r 2 µ (B (x, r dµ(x = Cr 2+α, whence (7.9 follows. Note also that u 2 which follows from µ(b(x, R >. Observe that contains infinitely many points. Indeed, by (7.2 the measure of any single point is. Since any ball has positive measure, it has uncountable many points. Let {x i } i= be a sequence of distinct points in. Fix a positive integer n and choose R > small enough so that all balls B(x i, R, i =, 2,..., n, are disjoint. The tent functions of these balls are linearly independent, which implies dim Λ 2, n. Since this is true for any n, we conclude dim Λ 2, =. We precede the proof of the second part of Theorem 7.2 by a lemma. Lemma 7.4 Let {x i } n i= be a sequence of points in a metric space (, d such that for some ρ > we have d(x, x n > 2ρ and d(x i, x i+ < ρ for all i =,,..., n. (7. Then there exists a subsequence {x ik } l k= such that (a = i < i <... < i l = n; 25

(b d(x ik, x ik+ < 5ρ for all k =,,..., l ; (c d(x ik, x ij 2ρ for all distinct k, j =,,..., l. The significance of conditions (a, (b, (c is that a sequence {x ik } l k= satisfying them gives rise to a chain of balls B(x ik, 5ρ connecting the points x and x n in a way that each ball contains the center of the next one whereas the balls B(x ik, ρ are disjoint. This is similar to the classical ball covering argument, but additional difficulties arise from the necessity to maintain a proper order in the set of balls. Proof. Let us say that a sequence of indices {i k } l k= is good if the following conditions are satisfied: (a = i < i <... < i l ; (b d(x ik, x ik+ < 3ρ for all k =,,..., l ; (c d(x ik, x ij 2ρ for all distinct k, j =,,..., l. Note that a good sequence does not necessarily have i l = n as required in condition (a. We start with a good sequence that consists of a single index i =, and will successively redefine it to increase at each step the value of i l. A terminal good sequence will be used to construct a sequence satisfying (a, (b, (c. Assuming that {i k } l k= is a good sequence, define the following set of indices S := {s : i l < s n and d(x s, x ik 2ρ for all k l}, and consider two cases. Set S is non-empty. In this case we will redefine {i k } to increase i l. Let m be the minimal index in S. Therefore, m is not in S, whence we have either m i l or d(x m, x ik < 2ρ for some k l (7. (see Fig. 3. In the first case, we have in fact m = i l so that (7. also holds (with k = l. x=x x ik <3ρ <ρ <2ρ x il y=x n x m x m- Figure 3: By (7. and (b we obtain, for the same k as in (7., d(x m, x ik d(x m, x m + d(x m, x ik < 3ρ. Now we modify the sequence {i j } as follows: keep i, i,..., i k as before, forget the previously selected indices i k+,..., i l, and set i k+ := m and l := k +. 26

Clearly, the new sequence {i k } l k= is also good, and the value of i l has increased (although l may have decreased. Therefore, this construction can be repeated only a finite number of times. Set S is empty. In this case, we will use the existing good sequence to construct a sequence satisfying conditions (a, (b, (c. The set S can be empty for two reasons: either i l = n or i l < n and for any index s such that i l < s n there exists k l such that d(x s, x ik < 2ρ. In the first case the sequence {i k } l k= already satisfies (a, (b, (c, and the proof is finished. In the second case, choose the minimal k l such that d(x n, x ik < 2ρ (see Fig. 4. x=x x ik- <5ρ <3ρ <2ρ x ik x il y=x n Figure 4: The hypothesis d(x n, x 2ρ implies k, and we obtain from (b d(x n, x ik d(x n, x ik + d(x ik, x ik < 5ρ. By the minimality of k, we have also d(x n, x ij 2ρ for all j < k. Hence, we define the final sequence {i j } as follows: keep i, i,..., i k as before, forget i k,..., i l, and set i k := n and l := k. Then this sequence satisfies (a, (b, (c. Let A be a subset of of finite measure, that is µ(a <. Then any function u L 2 is integrable on A, and let us set u A := µ(a A u dµ. For any two measurable sets A, B of finite measure, the following identity takes place u(x u(y 2 dµ(xdµ(y (7.2 A B = µ(a u u B 2 dµ + µ(b u u A 2 dµ + µ(aµ(b u A u B 2, B A which is proved by a straightforward computation. Proof of Theorem 7.2(ii. The inequality β α + will follow from (6. if we show that for any σ > α+ the space Λ σ 2 2, is trivial, that is N σ, (u < implies u. By definition (5.6 of N σ, and (7.4 we have, for any < r, N σ, (u cr 2σ α u(x u(y 2 dµ(ydµ(x. (7.3 { d(x,y<r} 27

Fix some < r and assume that we have a sequence of disjoint balls {B k } l k= of the same radius < ρ <, such that for all k =,,..., l Then (7.3, (7.2, and (7.4 imply x B k and y B k+ = d(x, y < r. (7.4 l N σ, (u cr 2σ α k= B k B k+ u(x u(y 2 dµ(ydµ(x l cr 2σ α ρ 2α ubk u Bk+ 2. (7.5 k= By the chain condition, for any two distinct points x, y and for any positive integer n there exists a sequence of points {x i } n i= such that x = x, x n = y, and d(x, y d(x i, x i+ < C n := ρ, for all i < n. Assuming that n is large enough so that d(x, y > 2ρ and ρ < /7, we obtain by Lemma 7.4 that there exists a subsequence {x ik } l k= (where of course l n such that x i = x, x il = y, the balls {B (x ik, ρ} are disjoint, and d(x ik, x ik+ < 5ρ, (7.6 for all k =,,..., l. Applying (7.5 to the balls B k := B (x ik, ρ and setting r = 7ρ < (which together with (7.6 ensures (7.4 we obtain l N σ, (u cρ 2σ+α u Bk u Bk+ 2 cρ 2σ+α l k= ( l 2 u Bk u Bk+ k= cρ 2σ+α n u B u Bl 2 cρ 2σ+α+ u B(x,ρ u B(y,ρ 2. (7.7 d(x, y The property (7.4 of measure of balls implies that for µ-almost all x one has lim u B(x,ρ = u(x, (7.8 ρ and any point x satisfying (7.8 is called a Lebesgue point of u. Assuming that x and y are Lebesgue points of u, using N σ, (u < we obtain from (7.7 as n (that is, as ρ u(x u(y 2 d(x, y C N σ, (u lim ρ ρ 2σ α. (7.9 Since 2σ > α +, the above limit is zero whence u const. Finally, µ( = implies u. 28

8 Embedding of Besov spaces into Hölder spaces In addition to the spaces L p and Λ σ 2,q defined above, let us define a Hölder space C λ = C λ (, d, µ as follows: u C λ if u C λ := u + µ-ess sup x, y < d(x, y /3 u(x u(y d(x, y λ <. The restriction d(x, y /3 here is related to the restriction r in definition (5.6. If (, d satisfies the chain condition then the /3 can be replaced by any other positive constant. Theorem 8. Let (, d, µ satisfy (5.. Then for any σ > α/2 That is, for any u L 2 we have Λ σ 2, C λ where λ = σ α/2. u C λ C u Λ σ 2,. (8. Remark 8.2 From (5.9 it follows that also Λ σ 2,2 C λ, which will be used in the proof of Theorem 9.2(ii. Proof. For any x and r >, set u r (x := µ(b(x, r B(x,r u(ξdµ(ξ. (8.2 We claim that for any u L 2, any < r /3, and all x, y such that d(x, y r, the following inequality holds: u r (x u r (y C r λ N σ, (u /2. (8.3 Indeed, setting B = B(x, r, B 2 = B(y, r, we have u r (x = u(ξdµ(ξ = µ(b B µ(b µ(b 2 B B 2 u(ξdµ(ηdµ(ξ, and similarly u r (y = u(ηdµ(ηdµ(ξ. µ(b µ(b 2 B B 2 Applying the Cauchy-Schwarz inequality, (3. and (5.6, we obtain { u r (x u r (y 2 = (u(ξ u(η dµ(ηdµ(ξ µ(b µ(b 2 B B 2 u(ξ u(η 2 dµ(ηdµ(ξ µ(b µ(b 2 B B 2 C r 2α u(ξ u(η 2 dµ(ηdµ(ξ B(η,3r C r α+2σ N σ, (u, 29 } 2

thus proving (8.3. Similarly, one proves that for any < r /3 and x u 2r (x u r (x C r λ N σ, (u /2. (8.4 Let x be a Lebesgue point of u. Setting r k = 2 k r for any k =,, 2,... we obtain from (8.4 u(x u r (x u rk (x u rk+ (x k= C ( rk λ k= N σ, (u /2 C r λ N σ, (u /2. (8.5 Applying the Cauchy-Schwarz inequality u r (x C r α/2 u 2 and using (8.5 to some fixed value of r, say r = /4, we obtain whence u(x u(x u r (x + u r (x C ( u 2 + N σ, (u /2, u C u Λ σ 2,. (8.6 If y is another Lebesgue point of u such that r := d(x, y < /3 then we obtain from (8.3, (8.5, and a similar inequality for y u(x u(y u(x u r (x + u r (x u r (y + u r (y u(y C r λ N σ, (u /2. Hence, which together with (8.6 yields (8.. u(x u(y d(x, y λ C N σ, (u /2, 9 Bessel potential spaces Let p t be a heat kernel on a metric measure space (, d, µ and let L be its generator. Since L is positive definite, the operator (I + L s is a bounded operator in L 2 for any s > (cf. Section 5.4. This operator is called a Bessel potential. Fix β >, and for any σ > define the Bessel potential space H σ as the image of (I + L σ/β, that is H σ := (I + L σ/β (L 2, with the norm u H σ := (I + L σ/β u 2. This definition of H σ depends on the parameter β. A priori the value of β is arbitrary but in fact we take for β the corresponding exponent in (2.8 assuming that (2.8 holds. For example, for the Gauss-Weierstrass kernel in R n we take β = 2. In this 3

case L = and it is easy to see that H σ (R n consists of functions u L 2 (R n such that R n ũ (ξ 2 ( + ξ 2 σ/2 dξ <, where ũ is the Fourier transform of u. Of course, this is the classical definition of the fractional Sobolev space H σ (R n. The purpose of this section is to prove embedding theorems for the space H σ in the abstract setting, which generalize the classical Sobolev embedding theorems in R n. We start with a lemma. Lemma 9. For any < σ β/2 we have H σ = dom E (L 2σ/β. Proof. Indeed, let {E λ } λ R be the spectral resolution of the operator L. Setting s = σ/β, we have { } H σ = dom(i + L s = u L 2 : ( + λ 2s d E λ u 2 < { } = u L 2 : λ 2s d E λ u 2 < = dom E (L 2s, which was to be proved. Theorem 9.2 (Embedding of H σ (i If a heat kernel p t satisfies (2.8 and < σ < α/2 then H σ L q where q = 2α α 2σ. (9. (ii If a heat kernel p t satisfies H (α + β and α/2 < σ β/2 then H σ C λ where λ = σ α/2. (9.2 Remark 9.3 It is curious that under the chain condition the Hölder exponent λ in (9.2 does not exceed /2, which follows from Corollary 7.3(ii. Proof. (i This is essentially a result of Varopoulos. Indeed, [29, Theorem II.2.7] says that if for all t > and some ν > P t Ct ν (9.3 then for any < s < ν/2 the operator (I + L s is bounded from L 2 to L q where q = 2ν. In our case (9.3 with ν = α/β follows from (4.2. Applying the above ν 2s result with s = σ/β we obtain (9.. An alternative proof is given in [28, Theorem 3.] under a stronger assumption that the heat kernel satisfies the upper bound in (.2. (ii Using Lemma 9., Theorem 5., Corollary 5.5, Theorem 8., and Remark 8.2 we obtain { Λ H σ = dom E (L 2σ/β σ = 2,, σ = β/2, C λ, σ < β/2, Λ σ 2,2, 3