A fast method for solving the Heat equation by Layer Potentials

Similar documents
Fast Nyström Methods for Parabolic Boundary Integral Equations

Numerical Solution of Integral Equations in Solidification and Melting with Spherical Symmetry

Rational Chebyshev pseudospectral method for long-short wave equations

A butterfly algorithm. for the geometric nonuniform FFT. John Strain Mathematics Department UC Berkeley February 2013

A Fast N-Body Solver for the Poisson(-Boltzmann) Equation

256 Summary. D n f(x j ) = f j+n f j n 2n x. j n=1. α m n = 2( 1) n (m!) 2 (m n)!(m + n)!. PPW = 2π k x 2 N + 1. i=0?d i,j. N/2} N + 1-dim.

Non-Conforming Finite Element Methods for Nonmatching Grids in Three Dimensions

A Remark on the Fast Gauss Transform

CS 450 Numerical Analysis. Chapter 8: Numerical Integration and Differentiation

PIECEWISE LINEAR FINITE ELEMENT METHODS ARE NOT LOCALIZED

An Accurate Fourier-Spectral Solver for Variable Coefficient Elliptic Equations

A COLLOCATION METHOD FOR SOLVING THE EXTERIOR NEUMANN PROBLEM

A Comparison between Solving Two Dimensional Integral Equations by the Traditional Collocation Method and Radial Basis Functions

Fast matrix algebra for dense matrices with rank-deficient off-diagonal blocks

An Overview of Fast Multipole Methods

Fast and accurate methods for the discretization of singular integral operators given on surfaces

Abstract. 1. Introduction

On an Approximation Result for Piecewise Polynomial Functions. O. Karakashian

The Chebyshev Fast Gauss and Nonuniform Fast Fourier Transforms and their application to the evaluation of distributed heat potentials

Numerical Solution of Two-Dimensional Volterra Integral Equations by Spectral Galerkin Method

PART IV Spectral Methods

A FAST SOLVER FOR ELLIPTIC EQUATIONS WITH HARMONIC COEFFICIENT APPROXIMATIONS

Fast multipole boundary element method for the analysis of plates with many holes

INTRODUCTION TO MULTIGRID METHODS

Scientific Computing: An Introductory Survey

Appendix C: Recapitulation of Numerical schemes

Algebraic Multigrid as Solvers and as Preconditioner

Hierarchical Matrices. Jon Cockayne April 18, 2017

Overlapping Schwarz preconditioners for Fekete spectral elements

A direct solver for elliptic PDEs in three dimensions based on hierarchical merging of Poincaré-Steklov operators

A fast randomized algorithm for computing a Hierarchically Semi-Separable representation of a matrix

Computational Linear Algebra

1462. Jacobi pseudo-spectral Galerkin method for second kind Volterra integro-differential equations with a weakly singular kernel

Multigrid finite element methods on semi-structured triangular grids

Kasetsart University Workshop. Multigrid methods: An introduction

APPENDIX A. Background Mathematics. A.1 Linear Algebra. Vector algebra. Let x denote the n-dimensional column vector with components x 1 x 2.

Lecture 2: Computing functions of dense matrices

Fast Multipole BEM for Structural Acoustics Simulation

CS 542G: The Poisson Problem, Finite Differences

Chapter 6. Finite Element Method. Literature: (tiny selection from an enormous number of publications)

Domain decomposition schemes with high-order accuracy and unconditional stability

Discrete Projection Methods for Integral Equations

(f(x) P 3 (x)) dx. (a) The Lagrange formula for the error is given by

arxiv: v1 [cs.lg] 26 Jul 2017

The generalized Euler-Maclaurin formula for the numerical solution of Abel-type integral equations.

Convergence Order Studies for Elliptic Test Problems with COMSOL Multiphysics

Simple Examples on Rectangular Domains

Transactions on Modelling and Simulation vol 12, 1996 WIT Press, ISSN X

Elliptic Problems / Multigrid. PHY 604: Computational Methods for Physics and Astrophysics II

A note on accurate and efficient higher order Galerkin time stepping schemes for the nonstationary Stokes equations

Nonlinear Filtering Revisited: a Spectral Approach, II

Lecture 2: The Fast Multipole Method

Lectures 9-10: Polynomial and piecewise polynomial interpolation

Approximation by Conditionally Positive Definite Functions with Finitely Many Centers

INTRODUCTION TO FAST MULTIPOLE METHODS LONG CHEN

arxiv: v1 [math.cv] 18 Aug 2015

Numerical Integration for Multivariable. October Abstract. We consider the numerical integration of functions with point singularities over

Math 671: Tensor Train decomposition methods

Integration, differentiation, and root finding. Phys 420/580 Lecture 7

A Recursive Trust-Region Method for Non-Convex Constrained Minimization

The Closed Form Reproducing Polynomial Particle Shape Functions for Meshfree Particle Methods

Direct method to solve Volterra integral equation of the first kind using operational matrix with block-pulse functions

Multilevel Preconditioning of Graph-Laplacians: Polynomial Approximation of the Pivot Blocks Inverses

Chapter Two: Numerical Methods for Elliptic PDEs. 1 Finite Difference Methods for Elliptic PDEs

18.336J/6.335J: Fast methods for partial differential and integral equations. Lecture 13. Instructor: Carlos Pérez Arancibia

Numerical Methods I Orthogonal Polynomials

An Iterative Substructuring Method for Mortar Nonconforming Discretization of a Fourth-Order Elliptic Problem in two dimensions

arxiv: v1 [math.co] 3 Nov 2014

An implementation of the Fast Multipole Method without multipoles

On the use of multipole methods for domain integration in the BEM

Cubic spline collocation for a class of weakly singular Fredholm integral equations and corresponding eigenvalue problem

The Fast Multipole Method and other Fast Summation Techniques

Orthogonality of hat functions in Sobolev spaces

A fast adaptive numerical solver for nonseparable elliptic partial differential equations

H 2 -matrices with adaptive bases

A LOWER BOUND ON BLOWUP RATES FOR THE 3D INCOMPRESSIBLE EULER EQUATION AND A SINGLE EXPONENTIAL BEALE-KATO-MAJDA ESTIMATE. 1.

Compensation. June 29, Arizona State University. Edge Detection from Nonuniform Data via. Convolutional Gridding with Density.

(1) u i = g(x i, x j )q j, i = 1, 2,..., N, where g(x, y) is the interaction potential of electrostatics in the plane. 0 x = y.

3. Numerical integration

Least-Squares Spectral Collocation with the Overlapping Schwarz Method for the Incompressible Navier Stokes Equations

Introduction to the Numerical Solution of IVP for ODE

SPLINE COLLOCATION METHOD FOR SINGULAR PERTURBATION PROBLEM. Mirjana Stojanović University of Novi Sad, Yugoslavia

Weighted Regularization of Maxwell Equations Computations in Curvilinear Polygons

Improved Fast Gauss Transform. Fast Gauss Transform (FGT)

ON SPECTRAL METHODS FOR VOLTERRA INTEGRAL EQUATIONS AND THE CONVERGENCE ANALYSIS * 1. Introduction

Scientific Computing

ELLIPTIC RECONSTRUCTION AND A POSTERIORI ERROR ESTIMATES FOR PARABOLIC PROBLEMS

Numerical Methods I: Numerical Integration/Quadrature

Chapter 4: Interpolation and Approximation. October 28, 2005

GENERALIZED GAUSS RADAU AND GAUSS LOBATTO FORMULAE

A direct solver for variable coefficient elliptic PDEs discretized via a composite spectral collocation method Abstract: Problem formulation.

A space-time Trefftz method for the second order wave equation

INTRODUCTION TO FINITE ELEMENT METHODS

University of Houston, Department of Mathematics Numerical Analysis, Fall 2005

Introduction to Partial Differential Equations

Measuring Ellipsoids 1

you expect to encounter difficulties when trying to solve A x = b? 4. A composite quadrature rule has error associated with it in the following form

MATH 205C: STATIONARY PHASE LEMMA

Recent Results for Moving Least Squares Approximation

n 1 f n 1 c 1 n+1 = c 1 n $ c 1 n 1. After taking logs, this becomes

Transcription:

A fast method for solving the Heat equation by Layer Potentials Johannes Tausch Abstract Boundary integral formulations of the heat equation involve time convolutions in addition to surface potentials. If M is the number of time steps and N is the number of degrees of freedom of the spatial discretization then the direct computation of a heat potential involves order N 2 M 2 operations. This article describes a fast method to compute three dimensional heat potentials which is based on Chebyshev interpolation of the heat kernel in both space and time. The computational complexity is order p 4 q 2 NM operations, where p and q are the orders of the polynomial approximation in space and time. Introduction Boundary integral formulations of parabolic problems involve time convolutions in addition to the usual boundary integral operators. While the finite element or finite difference method are highly popular for this type of problems, the method of layer potentials has distinct advantages, in particular, stability of explicit time stepping and reduction of unknowns to the boundary surface. Boundary element methods that evaluate the time convolution have been discussed in the past decades in several places, such as [4, 9, ]. If the time convolution is computed directly the cost quickly becomes prohibitively expensive with increasing problem size which makes the integral equation approach less attractive than competing approaches. However, the situation changes if fast methods for the computation of heat potentials are considered. One such an algorithm is the fast Gauss transform [7]. In this approach the heat kernel is approximated by a Hermite expansion, which leads to an efficient scheme to compute the spatial convolution efficiently. This approach has been combined with re-starts to avoid time convolutions [2]. The original version of the fast Gauss Transform is based on Hermite expansions of the heat kernel. Since the spatial variables of the kernel separate, the translation operator appear in tensor product form, which can be exploited to reduce the computational cost associated with translation operators [3]. Besides Hermite expansions, exponential expansions have been considered as well, in this case translation operators are diagonal [8].

Another technique that exploits the convolution form of the time integral is to evaluate heat potentials in Fourier domain. This was described for bounded domains in [6], where the Green s function is expanded in a Fourier series. The method must be combined with nonuniform FFTs since the heat sources are located on a boundary surface. In unbounded domains the kernel appears as a continuous Fourier transform which results in some nontrivial complications, c.f., [5]. The approach described in this article is more close to the fast Gauss Transform in that the computation is performed in physical and not in Fourier domain. Hence the method is somewhat simpler when the solution must be evaluated in physical space in every time step, which is the case, for instance, when the heat equation is solved via the Green s representation theorem. The main features of the discussed approach are The heat kernel is expanded in both space and time. The time convolution is computed using a hierarchical scheme, similar to the FMM for one dimensional source distributions. However, certain modifications are necessary to account for the Volterra form of heat potentials. The cost per time step in this algorithm is almost indepent of the length of the time interval. The heat kernel is interpolated at the Chebyshev nodes in space and time. The translation operators of the multipole method can be easily derived and preserve the tensor product form of the kernel. After discussing heat potentials and their discretization in Section 2 we introduce the Chebyshev interpolation and the resulting translation operators in Sections 3 and 4. This leads to the fast algorithm described in Section 5. Section 6 concludes with numerical results. 2 Heat Potentials It is well known that the solution of the heat equation satisfies Green s Representation formula. If u t = u in the exterior of a domain with boundary S and if u has vanishing initial conditions, then Green s formula assumes the form u u(x, t = Ku(x, t V (x, t, x S, t >. ( 2 n In three spatial variables, the single- and double layer potential are given by Vg s (x, t = Kg d (x, t = t S t S G(x y, t τg s (y, τ ds y dτ, n y G(x y, t τg d (y, τ ds y dτ, 2

where S is the boundary surface, and G is the heat kernel which is given by ( G(r, t = exp r 2. (4πt 3 2 4t For the following discussion it is convenient to write the single layer potential in the form Vg s (x, t = t V (t τg s (τ dτ (2 4π t τ where ( V(δg s (x, τ = 4πδ exp x y 2 g s (y, τ ds y (3 4δ and the double layer potential in an analogous manner. The kernel of (3 is the Green s function of the two-dimensional heat equation. Using local parameterizations, it can be shown that this operator is q-times differentiable in δ for every point x S if the surface is of class C q+. The first term of the Taylor series is given by V (δg s (x = g s (x + O(δ, (4 K(δg d (x = H(xg d (x + O(δ (5 where H(x is the mean curvature of the surface. The direct boundary element method consists of discretizing ( and solving for the missing boundary data. The usual discretization schemes for integral operators, namely the Galerkin, collocation and the Nyström method are available. In this work we consider the latter, because no influence coefficients must be computed or stored, which constitutes a significant savings over the Galerkin or collocation method. Temporal discretization. The Nyström method replaces integrals by quadrature rules and enforces the equation on the quadrature nodes. For the temporal discretization we consider the composite midpoint rule of meshwidth t and quadrature nodes t j = (j + /2 t. Because of the O((t τ 2 -singularity of the integrand in (2 this rule is applied to the interval [, i t], where the integrand is smooth. For the singular piece [i t, t i expansion is used. The resulting rule is ti V (t i τ g s (τ dτ t i V (t i t j t g s (t j + 4π t τ 4π ti t j 2π g s(t i A straightforward analysis shows that the error of the singular part is O( t 3 2 whereas the error of the midpoint rule is O( t 2. To improve this convergence rate, we introduce the following singularity subtraction t 4π V (t τ t τ g s (τ dτ = 3 j=

t t (V (t τg s (τ V (g s (t dτ + 4π t τ π g s(t where the last term comes from expansion (4. Now the integrand is O((t τ 2 which implies that the error of the composite midpoint rule is improved to O( t 3 2. The desingularized midpoint rule simplifies to where 4π ti V (t i τ g s (τ dτ t i t τ 4π m i = j= ti π t i. 4π ti t j j= The analogous rule for the double layer potential is 4π ti K(t i τ g d (τ dτ t i t τ 4π j= V (t i t j ti t j g s (t j + m i g s (t i (6 K(t i t j ti t j g d (t j + m i Hg d (t i (7 where H denotes multiplication with the mean curvature. The desingularized rules (6 and (7 are O( t 3 2 and differ from the regular rule only by the weight of the node t i. Spatial Discretization The surface integral operators in the above quadrature rules have smooth kernels, thus the Nyström method can also be used for the spatial discretization. Quadrature rules for surface integrals are usually based on a triangulation of the surface and piecewise polynomial interpolation. Their form is f(x ds(x f(x P w P. (8 S P We use the quadratic rule where the quadrature points x P are located on the midpoints of the edges of the triangles; the details of the derivation of this rule can be found in Atkinson [] and Chien [3]. Since the kernels in (6 become increasingly peaked when t is decreased, the meshwidth x of the surface triangulation must be decreased as well. The form of the heat kernel suggests that the relationship between the two mesh parameters should be at least x ( t 2. Nyström Method We now turn to the discretization of (. The surface integral operators in the desingularized rules (6 and (7 are replaced by the spatial rule (8 for time steps j =,..., i 2. The term j = i is replaced by the truncated local expansions (4 and (5, which introduces an additional error of O( t 3 2. The resulting recurrence is 2 ui P = m i H p u i P m i vp i + t ( H p u i P 4 v i P Φ i P, (9

where u i P is the approximation for u(t i, x P and vp i is the approximation for n u(t i, x P and i 2 Φ i P = t j= Q w Q ( G(t i t j, x P x Q v j Q G(t i t j, x P x Q u j Q n y ( The smooth potential Φ i P contains only quantities of the previous time steps, hence (9 can be solved for the unknown boundary data. By far the dominant cost in every time step the computation of Φ i P. In the following we describe a fast method for this task. 3 Chebyshev Expansion of the Heat Kernel Chebyshev polynomials are commonly used to approximate functions in the interval [, ]. In this article, we use the definition T n (x = c n cos(n arccos(x, c n = { 2, n =,, n >. Note that the inclusion of the factor c n is not standard, but will simplify the formulas below. The roots of T n+ are given by ( π ωk n 2k + = cos, k n., 2 n + and the Lagrange polynomials with respect to these nodes are L i (x = n k= k i x ωk n ωi n. ωn k The interpolation T n f of a function f at the Chebyshev nodes can be expressed in terms of Lagrange polynomials T n f(x = n f(ωi n L i(x ( i= as well as in terms of Chebyshev polynomials T n f(x = n ˆf α T α (x (2 α= where ˆf α = 2 n + n T α (ωk n f(ωk n. (3 k= 5

The last formula follows from the discrete orthogonality of the Chebyshev polynomials, see, e.g., [2]. The essence of most fast methods for integral operators is to replace the kernel by a truncated series expansion. Since ( and (2 represent the same polynomial they can be used alternatively. In order to preserve the tensor product form of the translation operators discussed below, we will use the Lagrange form for the temporal interpolation and the Chebyshev expansion for the spatial interpolation. Let ν be a cube in R 3 with center x and side length 2h x and ν be another cube with same side length and center x, furthermore let I and Ĩ be non overlapping intervals of R with length 2h t and centers t and t, respectively. For x ν, y ν, t I and τ Ĩ we introduce local variables with the transformation x = x + x h x, y = x + y h x, x, y [, ] 3 t = t + t h t, τ = t + τ h t, t, τ [, ]. When the heat kernel is considered as a function of the eight local variables, the Chebyshev interpolation using order p polynomials in space and order q polynomials in time is ( x y 2 exp (4π 3 2 (t τ 3 2 4(t τ = (4πh t 3/2 (d + t τ 3 2 (4πh t 3/2 i q j q α+β p exp ( ρ r + x y 2 d + t τ E iα,jβ L i (t L j (τ T α (x T β (y. (4 Here α and β are multiindices, α = (α, α 2, α 3, α = α + α 2 + α 3, and T α (x = T α (x T α2 (x 2T α3 (x 3. The constants d >, r R 3 and ρ > are given by d = ( t t/h t, (5 r = ( x x/h x, (6 ρ = h2 x 4h t. (7 It follows from ( and (3 that the coefficients in (4 are given by where E (σ iγ,jδ = ( 2 p + 2 E iα,jβ = E ( iα,jβ E (2 iα 2,jβ 2 E (3 iα 3,jβ 3 (8 ( d + ω q i ωq 2 j k p l p T γ (ω p k T δ(ω p l exp ( ρ (r σ + ω p k ωp l 2 d + ω q i. ωq j 6

e 5 error e e 5 2 p 3 8 2 q 6 Figure : Maximum errors of the Chebyshev interpolation as a function of p and q; ρ = and d = 4. Approximation Errors. Since the heat kernel is an analytic function in all local variables the approximation error of the Chebyshev exhibits exponential decay up to logarithmic factors []. To illustrate this behavior, we compute the Chebyshev interpolation error of the function G(r, s = exp ( ρ r2, s, r. (d + s 2 d + s Figure shows the maximum interpolation error as a function of p and q when d = 4 and ρ =. Since the figure is on a logarithmic scale, the exponential decay in the s-variable appears as a straight line. In the r-variable G is an entire function, therefore the decay of the error is initially faster than exponential in p until the error of the s-variable is dominant. 4 Translation Operators The FMM combines sources and evaluation points in a tree of cubes. To accelerate the computation of cube interactions the kernel is replaced by a truncated series expansion. The efficient evaluation of such a series involves the moments of the source cube and the expansion coefficients of the destination cube. In the course of the computation, the moments and coefficients must be translated between the different levels of the cube hierarchy. In addition to the space variable, heat potentials involve an integration over the time variable. Therefore the time interval [, T ], in which the heat equation is to be solved, is also clustered into a binary tree of intervals. The finest level, denoted by level zero, consists of M intervals of length t. The half-length of the finest level intervals is also denoted by h t = t/2. The half length of a 7

level-l interval is h t = 2 l h t. Likewise, the half-length of a cube in the finest spatial level is h x and the half-length in level l is h x = 2 l h x. We briefly describe the translation operators related to Chebyshev interpolation in space and time. 4. Moments-to-Local (MtL Translations The Moments-to-Local (MtL translation is the evaluation of a heat potential in I ν due to sources in Ĩ ν using the expansion (4. The potential can be any combination of single- and double- layer sources. Replacing the kernel by the Chebyshev interpolation results in the series ( G(x x P, t t Q g s (x P, t Q + G(x x P, t t Q g d (x P, t Q w P t n y t Q Ĩ, x P S ν (4πh t 3/2 i q α p λ iα L i (t T α (x. The summation in the left hand side is over quadrature points in Ĩ S ν and primes denote local variables. The expansion coefficients λ iα are given by L j (t t Q Ĩ, x P S ν λ iα = q j= β p α The µ jβ s are the moments of the sources in Ĩ ν, given by µ jβ = ( Q T β (x P g s (x P, t Q + E iα,jβ µ jβ. (9 n y T β (x P g d (x P, t Q w P t Because the expansion coefficients in (8 appear as products, the MtL transform (9 is a tensor product in the space variables, which can be exploited to reduce the cost of computing all λ iα, i =,..., q, α p to O(q 2 p 4 operations. 4.2 Moments-to-Moments (MtM Translations In the FMM sources and evaluation points are combined in a tree of cubes in space and a tree of time intervals. The moments are computed in a recursive fashion, beginning with the finest level. This involves recentering moments from a child ν Ĩ to the parent ν I, this operation is also known as the MtM translation and is described next. The MtM translation operators follows from the following addition formula for the Chebyshev polynomials ( T γ 2 x ± = 2 8 γ a γδ T δ (x (2 δ=

where a γδ = 2 p + p T δ (ω p k T γ k= ( 2 ωp k ±. 2 This formula directly follows from (3 and holds when p γ. A similar result holds for the Lagrange polynomials ( L i 2 t ± q = b ij L j (t (2 2 j= where b ij = L i ( 2 ωq j ± 2. Let ( x, t be the center and (4h x, 4h t be the lengths of the parent ν I. The lengths of the child ν Ĩ are (2h x, 2h t and the centers are x = x + σ x h x and t = t +σ t h t where σ t, σ x,k {±}. The contribution µ iα of the child s moments µ jβ to the parent s moment is µ iα = ( tq t 2h t L i t Q Ĩ, x P S ν ( T α ( xp x 2h x g s (x P, t Q + n x T α ( xp x 2h x g d (x P, t Q w P t Changing the local variables with respect to ν I to variables local to ν Ĩ gives t t = t t + 2h t 2 h t 2 σ t x x = x x + 2h x 2 h x 2 σ x Combining this with (2 and (2 results in µ iα = q b ij a αβ a α2β 2 a α3β 3 µ jβ. j= β α This translation is again of tensor-product form in the spatial variable. Thus the complexity of one MtM transform is O(q 2 p 4. In the algorithm described in the following there are also MtM translations from one temporal interval to a subinterval within the same cube. In this case the translation formula involves only in the spatial variable µ iα = q b ij µ jα. j= This MtM translation has O(q 2 p 3 complexity. 9

4.3 Local-to-Local (LtL Translations In the LtL translation the Chebyshev expansion for ν I is recentered to the child ν Ĩ. Let ( x, t be the center and (4h x, 4h t be the lengths of the parent ν I. The lengths of the child ν Ĩ are (2h x, 2h t and the centers are x = x + σ x h x and t = t + σ t h t where σ t, σ x,k {±}. The translation from the parent s expansion to the child s expansion follows from the addition formulas (2 and (2 where Φ(x, t = α <p, i q = α <p, i q = α <p, i q λ αi = λ αi L i ( t t 2h t T α ( x x 2h x ( t t λ αi L i σ ( t x x T α σ x 2 h t 2 2 h x 2 ( ( t t x x λ αi L i T α h t h x q b ji a αβ a α2β 2 a α3β 3 λ βj (22 j= β α β p are the expansion coefficients for ν Ĩ. The tensor product form in the above formula can be used to compute an LtL translation in O(q 2 p 4 operations. If the translation is only in time (i.e., ν = ν, then (22 simplifies to λ αi = q b ji λ αj j= which can be computed in O(q 2 p 3 operations. 5 A Fast Algorithm for the Time Convolution This section describes in detail the fast method for the smooth potential (. 5. Temporal Tree Recall that the half-lengths of the intervals in the finest level of the temporal tree are h t, = t/2, hence the i-th interval Ii is [2ih t,, 2(i + h t, ] and its center is the quadrature node t i. The time quadrature of the smooth potential ( is over intervals I... I i 2. That is, there is one interval between the integration interval and the interval where the smooth potential is evaluated. The finest-level intervals are the leafs of the binary tree, c.f. Figure 2. Let the binary representation of i be i = σ p 2 p +... + σ 2 + σ, (23

then the position of Ii in the tree can be found as follows. Start in the leftmost node of the p + -st level, then for l = p,... follow the left branch when σ l = and the right branch when σ l =. The parent of Ii in the l-th level is denoted by Ii l l where i l is given by i l = σ p 2 p l +... + σ l+ 2 + σ l. (24 The interval I l i l has length h t,l = 2 l h t, and is given by I l i l = 2 l (i l + j=2 l i l I j. The neighbors of Ii l l are Ni l = { Ii l l, } Il i l for il > and N l = { I} l for il =. Because of the Volterra form of the integral operators this definition only involves intervals corresponding to earlier times. The interaction list Ii l l of Ii l l consists of nodes in level l, which are children of Ii l l s parent s neighbors, but not neighbors of Ii l l. Because of the simple structure of the binary tree, there are only two nontrivial cases of interaction lists. If the binary representation of i > is given by (23, then { Ii l {Ii l = l l 2 }, if σ l =, {Ii l l 3, Il i l 2 }, if σ l =. For i = or i = the interaction lists Ii l l are empty. Thus for i l > there are either one or two intervals in an interaction list, deping on whether the node is the right or the left branch of its parent. If t Ii then the interval of integration in ( is the union of the interaction lists of all parents of Ii, that is, p [, 2(i h t ] = Ii l l. (25 5.2 Spatial Tree. Similar to the FMM in potential theory, we introduce a hierarchical decomposition of R 3 into cubes. In the finest level the cubes have side length 2h x,, where h x, is a parameter at our disposition, whose choice will be discussed later. The next coarser level consists of cubes of side length 4h x, which contain eight finest level cubes. The coarsest level cube has side length 2 Ls+ h x, and contains the surface S. The set of nonzero cubes in level l S is denoted by C ls. The heat kernel is smooth for t > τ and exponentially decaying in space, therefore it suffices to compute MtL translation between neighboring cubes only. When the separation of t and τ is small, the heat kernel is peaked and must be resolved by fine-level cubes and fine-level intervals. For larger separations of t and τ the kernel can be resolved in coarse-level cubes and intervals. The number of neighbors in MtL translations is indepent of the level. l=

To control the errors introduced by the Chebyshev expansion and by the far field truncation the relationship between the temporal level l and the spatial level l S is such that the parameter ρ in the Chebyshev expansion of the heat kernel (4 remains bounded. Because of (7 the depence of the temporal level l to the spatial level l S is ([ ] l l S = min, L S (26 2 where [ ] denotes rounding off to the next smaller integer. Thus for time-levels l 2L S the values of ρ are either ρ or ρ /2, where 5.3 The Fast Algorithm. ρ = h2 x 4h t (27 To compute the right hand side in ( efficiently the heat potentials are expanded in local expansions that are valid in a given interval of the time-tree and in a given cube of the corresponding level of the spatial tree. Specifically, in the i-th time step the vector λ l ν is the vector of expansion coefficients that is valid in I il S ν representing heat sources in Ii l l S. Here, i and i l are given by (23 and (24, respectively. Because of (25 the smooth potential can be computed by translating and adding expansions from the coarser to the finer levels and then evaluating the resulting expansion in the finest level. The λ l ν s are computed using the MtL translations, the moments are computed using the MtM translations. The smooth potential in the i-th time step deps only on the λ l ν s in Ii s parents I i l l, l =,..., p. Hence only one set of expansion coefficients per temporal level must be stored and updated. Since the expansion coefficients in the coarser levels are valid in larger time intervals it is not necessary to update all coefficients in all levels in every time step. Indeed, if σ =... σ L = in (23 then only the λ l ν s in levels l L must be updated. The expansion coefficients of Ii l l dep on the moments of Ii l l 2 and Il i l 3, denoted by µ l 2,ν and µ l 3,ν. Thus it is possible to store and update only two sets of expansion coefficients in each level. One time step of this algorithm is illustrated in Figure 2. A more detailed description follows. MtL translations for levels l L l = L for ν C ls for ν N (ν form λ l ν by translating µ l 2ν and µl 3ν 2

µ λ µ µ λ λ µ µ µ λ λ 2 µ µ µ λ λ 9 2 t Figure 2: Time step for i = 2, µ s represent moments, λ s represent expansion coefficients. Quantities on dark ground have been computed in the previous time steps, quantities on white ground are computed in this time step. Curved arrows indicate MtL translations. for l = L,... for ν C ls for ν N (ν form λ l ν by translating µl 2ν MtM translations for levels l L l = L for ν C ls overwrite µ l 3ν with µl 2ν for l = L,..., for ν C ls if l is even and [l/2] L S C(ν := Children of ν else C(ν := {ν} if 3

for ν C(ν form µ l 2ν by translating µl 2ν LtL translations for levels l L for l = L +,..., for ν C ls if l is even and [l/2] L S C(ν := Children of ν else C(ν := {ν} if for ν C(ν translate λ l ν and add to λ l ν and µl 3ν Algorithm for computing the smooth potential for i = 2,..., M Determine p and L from the binary expansion of i. Perform MtM translations for ν C Compute the moments µ 2,ν Perform MtL translations Perform LtL translations for ν C Evaluate the expansions using the coefficients λ ν Complexity. The binary tree has O(M nodes. Each node is made up of cubes in the corresponding spatial level. Using geometric series arguments it follows that the number of all nonempty cubes in all nodes is O(NM. For every cube one set of moments and one set of expansion coefficients is computed using a bounded number of translation operators. The estimates in Section 4 imply that the total complexity is O(p 4 q 2 NM. 4

6 Numerical Example To illustrate the convergence behavior we solve the heat equation in the exterior of the unit sphere. The solution approach is based on the Green s Representation formula (, desingularized Nyström discretization, and the fast method to evaluate the heat potentials. To obtain a problem with known solution we place the heat source ( u(x, t = exp x x 2 (28 (4πt 3 2 4t at the point x = [.5,, ], supply the Neumann data on the sphere and compare the numerical solution on the sphere with the exact solution (28. The L 2 norm of the solution as a function of time is shown in Figure 3.5.4 L2 norm.3.2..2.4.6.8 time Figure 3: L 2 -Norm of the solution of the sample problem as a function of time Choice of Parameters. In the discretization scheme considered, the time step size t has the greatest influence on the error. This parameter determines the quadrature error of the temporal midpoint rule as well as the truncation error in the expansion of the local part. The parameter of greatest influence to the error of the fast method is ρ, because it controls the error due to neglecting well-separated interactions. Increasing ρ reduces this error but demands for larger values of p and q. Once t and ρ are determined, the cube width h x follows from (27. Table 2 and Figures 5 and 6 show the L 2 -error for t =.5 and cpu timings for refining the 5

2 L2 error 3 4 5.2.4.6.8 time Figure 4: L 2 -Norm of the error for p = 2 (top, p = 6, p = 2, p = 24 (bottom. The last two lines overlap. Mesh # time steps # triangles p q L S ρ 2 92 2 4 2.5 2 4 768 6 4 5 3 8 372 2 4 4 6 2288 24 4 2 5 5 32 2288 28 6 2 Table : Discretization parameters for results in Table 2 and Figures 5 and 6 meshwidth. The table also displays the ratio of the cpu time to the complexity estimate p 4 q 2 NM and the convergence rate of the error. Figure 4 displays the L 2 -errors of the solution of the test problem as a function of time for increasing values of p. In this case there are 8 time steps and 372 triangles. 7 Conclusion We have discussed a fast method for solving heat conduction problems with layer potentials. The numerical results are in good agreement with the O(p 4 q 2 NM complexity and the O( 3 2 quadrature error. It is possible to construct higherorder quadrature schemes that evaluate more terms of the expansion of the local part. The difficulty is that the higher terms involve derivatives of the surface curvature and get very complicated. 6

Mesh cpu time ratio error rate 4 3.3e-9 3.85e-3 2 2 6.4e- 7.42e-4 2.37 3 3 2.e-.2e-4 2.86 4 728 7.e- 3.48e-5.55 5 45844 5.3e-.24e-5.49 Table 2: Results for the meshes in Table 2 L2 error 3 4 5 6.2.4.6.8 time Figure 5: L 2 -errors corresponding to discretization meshes in Table ; coarsest (top to finest (bottom mesh. For the simplicity of exposition, we have restricted the discussion to the homogeneous heat equation and vanishing initial conditions. Source terms lead to volume potentials which can be treated with the same interpolation scheme of the kernel. Non-trivial initial conditions can be included with well-documented algorithms, such as the fast Gauss transform. References [] Kall E. Atkinson. The Numerical Solution of Integral Equations of the Second Kind. Cambridge University Press, 997. [2] C. Canuto, M.Y. Hussaini, A. Quarteroni, and T.A. Zhang. Spectral Methods in Fluid Dynamics. Springer, 988. 7

5 2 4 cpu time (sec 3.2.4.6.8 time Figure 6: Cpu times (in seconds per time step as a function of time. The numbers refer to the meshes of Table. The cpu times for Meshes and 2 resulted are too short to be measured accurately and are not shown. [3] D. Chien. Numerical evaluation of surface integrals in three dimensions. Math. Comp., 64:727 743, 995. [4] G.F. Dargush and P.K. Banerjee. Application of the boundary element method to transient heat conduction. Internat. J. Numer. Methods Engrg., 3:23 247, 99. [5] L. Greengard and P. Lin. Spectral approximation of the free-space heat kernel. Appl. Comput. Harmonic Anal., 9:83 97, 999. [6] L. Greengard and J. Strain. A fast algorithm for the evaluation of heat potentials. Comm. Pure Appl. Math., XLIII:949 963, 99. [7] L. Greengard and J. Strain. The fast Gauss transform. SIAM J. Sci. Statist. Comput., 2:79 94, 99. [8] L. Greengard and X. Sun. A new version of the fast Gauss transform. Doc. Math. J. DMV, Extra Volume ICM 998, III:575 584, 998. [9] M.M. Grigoriev and G.F. Dargush. Higher-order boundary element methods for transient diffusion problems. part I: Bounded flux formulation. Internat. J. Numer. Methods Engrg., 55: 4, 22. [] J. Mason and D. Handscomb. Chebyshev Polynomials. CRC Press, 22. 8

[] E.A. McIntyre. Boundary integral solutions of the heat equation. Math. Comp., 46(73:7 79, 986. [2] J. Strain. Fast adaptive methods for the free-space heat equation. SIAM J. Sci. Statist. Comput., 5(:85 26, 994. [3] X. Sun and Y. Bao. A Kronecker product representation of the fast Gauss transform. SIAM J. Matrix Anal. Appl., 24(3:768 786, 23. 9