GALERKIN TIME STEPPING METHODS FOR NONLINEAR PARABOLIC EQUATIONS

Similar documents
Optimal order a posteriori error estimates for a class of Runge Kutta and Galerkin methods

GALERKIN AND RUNGE KUTTA METHODS: UNIFIED FORMULATION, A POSTERIORI ERROR ESTIMATES AND NODAL SUPERCONVERGENCE

A POSTERIORI ERROR ESTIMATES FOR THE BDF2 METHOD FOR PARABOLIC EQUATIONS

10 The Finite Element Method for a Parabolic Problem

ELLIPTIC RECONSTRUCTION AND A POSTERIORI ERROR ESTIMATES FOR PARABOLIC PROBLEMS

COMBINING MAXIMAL REGULARITY AND ENERGY ESTIMATES FOR TIME DISCRETIZATIONS OF QUASILINEAR PARABOLIC EQUATIONS

A POSTERIORI ERROR ESTIMATES BY RECOVERED GRADIENTS IN PARABOLIC FINITE ELEMENT EQUATIONS

LECTURE 1: SOURCES OF ERRORS MATHEMATICAL TOOLS A PRIORI ERROR ESTIMATES. Sergey Korotov,

SECOND ORDER TIME DISCONTINUOUS GALERKIN METHOD FOR NONLINEAR CONVECTION-DIFFUSION PROBLEMS

Numerical Solutions to Partial Differential Equations

Discontinuous Galerkin methods for fractional diffusion problems

A POSTERIORI ERROR ESTIMATES FOR THE CRANK NICOLSON METHOD FOR PARABOLIC EQUATIONS

A space-time Trefftz method for the second order wave equation

WEAK GALERKIN FINITE ELEMENT METHOD FOR SECOND ORDER PARABOLIC EQUATIONS

COMBINING MAXIMAL REGULARITY AND ENERGY ESTIMATES FOR TIME DISCRETIZATIONS OF QUASILINEAR PARABOLIC EQUATIONS

WELL-POSEDNESS FOR HYPERBOLIC PROBLEMS (0.2)

Basic Concepts of Adaptive Finite Element Methods for Elliptic Boundary Value Problems

OPTIMAL ORDER A POSTERIORI ERROR ESTIMATES FOR A CLASS OF RUNGE KUTTA AND GALERKIN METHODS

A space-time Trefftz method for the second order wave equation

Nonlinear Analysis 71 (2009) Contents lists available at ScienceDirect. Nonlinear Analysis. journal homepage:

Iterative methods for positive definite linear systems with a complex shift

LECTURE 3: DISCRETE GRADIENT FLOWS

Scientific Computing WS 2018/2019. Lecture 15. Jürgen Fuhrmann Lecture 15 Slide 1

CONVERGENCE OF FINITE DIFFERENCE METHOD FOR THE GENERALIZED SOLUTIONS OF SOBOLEV EQUATIONS

Simple Examples on Rectangular Domains

Chapter 5 A priori error estimates for nonconforming finite element approximations 5.1 Strang s first lemma

Piecewise Smooth Solutions to the Burgers-Hilbert Equation

From Completing the Squares and Orthogonal Projection to Finite Element Methods

A Concise Course on Stochastic Partial Differential Equations

Discontinuous Galerkin Time Discretization Methods for Parabolic Problems with Linear Constraints

Konstantinos Chrysafinos 1 and L. Steven Hou Introduction

Research Article A Two-Grid Method for Finite Element Solutions of Nonlinear Parabolic Equations

A POSTERIORI ERROR ANALYSIS FOR HIGHER ORDER DISSIPATIVE METHODS FOR EVOLUTION PROBLEMS

A posteriori error estimation for elliptic problems

Math Tune-Up Louisiana State University August, Lectures on Partial Differential Equations and Hilbert Space

Chapter Two: Numerical Methods for Elliptic PDEs. 1 Finite Difference Methods for Elliptic PDEs

Scientific Computing WS 2017/2018. Lecture 18. Jürgen Fuhrmann Lecture 18 Slide 1

Nonlinear stabilization via a linear observability

On an Approximation Result for Piecewise Polynomial Functions. O. Karakashian

CHAPTER 3 Further properties of splines and B-splines

arxiv: v1 [math.na] 29 Feb 2016

Convergence and optimality of an adaptive FEM for controlling L 2 errors

On the relationship of local projection stabilization to other stabilized methods for one-dimensional advection-diffusion equations

Preconditioned space-time boundary element methods for the heat equation

NONLINEAR DIFFERENTIAL INEQUALITY. 1. Introduction. In this paper the following nonlinear differential inequality

Weak Formulation of Elliptic BVP s

Putzer s Algorithm. Norman Lebovitz. September 8, 2016

On Pressure Stabilization Method and Projection Method for Unsteady Navier-Stokes Equations 1

Goal. Robust A Posteriori Error Estimates for Stabilized Finite Element Discretizations of Non-Stationary Convection-Diffusion Problems.

Basic Principles of Weak Galerkin Finite Element Methods for PDEs

Least Squares Based Self-Tuning Control Systems: Supplementary Notes

2. Dual space is essential for the concept of gradient which, in turn, leads to the variational analysis of Lagrange multipliers.

c 2018 Society for Industrial and Applied Mathematics

Adaptive methods for control problems with finite-dimensional control space

Some asymptotic properties of solutions for Burgers equation in L p (R)

The Discontinuous Galerkin Method for Hyperbolic Problems

STOKES PROBLEM WITH SEVERAL TYPES OF BOUNDARY CONDITIONS IN AN EXTERIOR DOMAIN

The method of lines (MOL) for the diffusion equation

Where is matrix multiplication locally open?

Traces, extensions and co-normal derivatives for elliptic systems on Lipschitz domains

Chapter 1 Foundations of Elliptic Boundary Value Problems 1.1 Euler equations of variational problems

The collocation method for ODEs: an introduction

L p -boundedness of the Hilbert transform

Partial Differential Equations

Remarks on the analysis of finite element methods on a Shishkin mesh: are Scott-Zhang interpolants applicable?

Variational Formulations

P(E t, Ω)dt, (2) 4t has an advantage with respect. to the compactly supported mollifiers, i.e., the function W (t)f satisfies a semigroup law:

Lecture 4: Numerical solution of ordinary differential equations

THE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS

Exponential stability of abstract evolution equations with time delay feedback

Introduction. J.M. Burgers Center Graduate Course CFD I January Least-Squares Spectral Element Methods

Yongdeok Kim and Seki Kim

MULTIGRID PRECONDITIONING IN H(div) ON NON-CONVEX POLYGONS* Dedicated to Professor Jim Douglas, Jr. on the occasion of his seventieth birthday.

A Posteriori Error Estimation for hp-version Time-Stepping Methods for Parabolic Partial Differential Equations

Stability of an abstract wave equation with delay and a Kelvin Voigt damping

Exam 2, Solution 4. Jordan Paschke

AMS subject classifications. Primary, 65N15, 65N30, 76D07; Secondary, 35B45, 35J50

ENERGY NORM A POSTERIORI ERROR ESTIMATES FOR MIXED FINITE ELEMENT METHODS

Computational Methods in Optimal Control Lecture 8. hp-collocation

Empirical Processes: General Weak Convergence Theory

Key words. optimal control, heat equation, control constraints, state constraints, finite elements, a priori error estimates

2 Statement of the problem and assumptions

Nonlinear Control Systems

Numerische Mathematik

Sobolev spaces. May 18

A Locking-Free MHM Method for Elasticity

Numerical Analysis of Nonlinear Multiharmonic Eddy Current Problems

LECTURE # 0 BASIC NOTATIONS AND CONCEPTS IN THE THEORY OF PARTIAL DIFFERENTIAL EQUATIONS (PDES)

Graded Mesh Refinement and Error Estimates of Higher Order for DGFE-solutions of Elliptic Boundary Value Problems in Polygons

Adaptive Finite Element Methods Lecture Notes Winter Term 2017/18. R. Verfürth. Fakultät für Mathematik, Ruhr-Universität Bochum

ON WEAKLY NONLINEAR BACKWARD PARABOLIC PROBLEM

2. Function spaces and approximation

Conservation law equations : problem set

Second Order Elliptic PDE

Université de Metz. Master 2 Recherche de Mathématiques 2ème semestre. par Ralph Chill Laboratoire de Mathématiques et Applications de Metz

Subdiffusion in a nonconvex polygon

NUMERICAL ANALYSIS OF THE CAHN-HILLIARD EQUATION AND APPROXIMATION FOR THE HELE-SHAW PROBLEM, PART II: ERROR ANALYSIS AND CONVERGENCE OF THE INTERFACE

Error estimates for a finite volume element method for parabolic equations in convex polygonal domains

Equations paraboliques: comportement qualitatif

A WEAK GALERKIN MIXED FINITE ELEMENT METHOD FOR BIHARMONIC EQUATIONS

Transcription:

GALERKIN TIME STEPPING METHODS FOR NONLINEAR PARABOLIC EQUATIONS GEORGIOS AKRIVIS AND CHARALAMBOS MAKRIDAKIS Abstract. We consider discontinuous as well as continuous Galerkin methods for the time discretization of a class of nonlinear parabolic equations. We show existence and local uniqueness and derive optimal order optimal regularity a priori error estimates. We establish the results in an abstract Hilbert space setting and apply them to a quasilinear parabolic equation. 1. Introduction The interest for time Galerkin and corresponding space time finite element methods has been linked during the last decade to the development of adaptivity of mesh selection for evolution PDE s. Certain issues as, e.g., a posteriori estimates, estimates of optimal order and regularity, fully discrete schemes with mesh modification, etc., have been extensively considered in the framework of Continuous and Discontinuous Galerkin methods, cf., e.g., [14, 9, 1, 13, 19, 16, 17, 18, 4, 2]. This is probably partly due to the fact that in principle space time Galerkin methods provide freedom for (almost) arbitrary selection of the space time mesh, [14], and partly due merely to the fact that, as in the elliptic case, the properties of variational methods can be studied in an easier, more systematic and clearer way than properties of their pointwise counterparts, i.e., finite difference methods. Still many issues related to the above problems are to be investigated, mainly for nonlinear evolution PDE s. The purpose of this paper is to provide a rather comprehensive a priori analysis of variational in time methods, and in particular of the Discontinuous and Continuous Galerkin methods, for a wide class of nonlinear parabolic equations. It turns out that the limitations of the approach of [9, 1] that was further developed (although from a different perspective) for nonlinear problems in [4] can be overcomed by adopting a direct approach based on energy type variational arguments. We will discretize initial value problems of the form: find u : [, T ] D(A) satisfying { u (t) + F (t, u(t)) =, < t < T, (1.1) u() = u, with F (t, v) = Av B(t, v), A a positive definite, selfadjoint, linear operator on a Hilbert space (H, (, )) with domain D(A) dense in H, B(t, ) : D(A) H, t [, T ], a (possibly) nonlinear operator, and u H. Subsequently, we will precisely describe the properties of Date: December 4, 22. 1

2 G. AKRIVIS AND CH. MAKRIDAKIS B and therefore also of F. Essentially the assumption that F admits the above splitting is made for purely technical reasons, in fact our framework, see below, covers a wide class of nonlinear parabolic equations. The time Galerkin methods. Let = t < t 1 < < t N = T be a partition of [, T ], := (t n, t n+1 ], k n := t n+1 t n, and q N. We shall analyze the discretization of (1.1) both by the discontinuous and the continuous Galerkin methods. To formulate the discontinuous Galerkin method, we let Vq d be the space of functions that are piecewise polynomials of degree at most q 1 in time in each, with coefficients in D(A 1/2 ), i.e., V d q := {ϕ : [, T ] D(A 1/2 )/ ϕ In (t) = q 1 j= w j t j }. The elements of Vq d are allowed to be discontinuous at the nodes t n, but are taken to be continuous to the left there. The time discrete discontinuous Galerkin approximation U to u is defined as follows: We seek U Vq d such that (1.2) [(U, v) + (F (t, U), v)] dt + (U n+ U n, v n+ ) = v V q ( ), for n =,..., N 1. Here U() = u(), v n := v(t n ), v n+ := lim t t n + v(t) and V q ( ) := {ϕ In : ϕ V d q }. To formulate the continuous Galerkin method, we let the space V c q consist of continuous functions that are piecewise polynomials of degree at most q 1 in time in each, with coefficients in D(A 1/2 ), i.e., V c q := {ϕ C([, T ]; D(A 1/2 )) : ϕ In (t) = q 1 j= w j t j }. The time discrete continuous Galerkin approximation U to u is defined as follows: We seek U Vq c such that (1.3) [(U, v) + (F (t, U), v)] dt = v V q 1 ( ), for n =,..., N 1. We emphasize that both the discontinuous and continuous Galerkin methods are independent of the particular splitting of F in the form F (t, v) = Av B(t, v); in applications F is given and the splitting is only used for the analysis of the methods. The problem framework. We let V := D(A 1/2 ) and denote the norms in H and V by and, v := A 1/2 v, respectively. We assume that dominates in V. We identify H with its dual, and let V be the dual of V, V H V. We still denote by (, ) the duality pairing between V and V, and by the dual norm on V, v := A 1/2 v.

A PRIORI ANALYSIS FOR GALERKIN METHODS 3 We assume that B(t, ) can be extended to an operator from V into V. A natural condition for (1.1) to be locally of parabolic type is the one-sided local Lipschitz condition on B(t, ), (1.4) (B(t, v) B(t, w), v w) λ v w 2 + µ v w 2 v, w T u in a tube T u, T u := {v V : min t u(t) v 1}, around the solution u, uniformly in t, with a constant λ less than one. It is easily seen that (1.4) can be written in the form of a Gårding type inequality, (1.4 ) (F (t, v) F (t, w), v w) (1 λ) v w 2 µ v w 2 v, w T u. For some of our results this mild condition will be sufficient. Our main results, however, are derived under the following, stronger, local Lipschitz condition (1.5) B(t, v) B(t, w) λ v w + µ v w v, w T u with a constant λ less than one and a constant µ. The tube T u is defined here in terms of the norm of V for concreteness. The analysis may be modified to yield error estimates under conditions analogous to (1.5) for v and w belonging to tubes defined in terms of other norms, not necessarily the same for both arguments. It is actually more natural to have two tubes, because in the error analysis one of the arguments in (1.5) will be a time interpolant of the exact solution (or a time interpolant of an elliptic projection of the exact solution in the fully discrete case) for which estimates in stronger norms might be available, and the other argument will be the approximate solution; it is advantageous to define the second tube in terms of weak norms since this allows the derivation of error estimates under mild meshconditions, cf. [2], [3], [1], and Section 6. In particular the above framework covers the following class of quasilinear equations: Let Ω R d, d = 1, 2, 3, be a bounded interval or a bounded domain with smooth boundary Ω. For T > we seek a real valued function u, defined on Ω [, T ], satisfying u t = div ( c(x, t, u) u + g(x, t, u) ) + f(x, t, u) in Ω [, T ], u = on Ω [, T ], u(, ) = u in Ω, with c : Ω (, ), f : Ω [, T ] R R, g : Ω [, T ] R R d, and u : Ω R given smooth functions. Let U := [ 1 + min x,t u, 1 + max x,t u], c > and c be such that c c(x, t, y) c x Ω, t [, T ], y U. We set a := c + c, b(x, t, y) := c(x, t, y) a, 2 A := a, B(t, v) := div ( b(, t, v) v ) + div g(, t, y) + f(, t, y).

4 G. AKRIVIS AND CH. MAKRIDAKIS Then, V = H 1 H 1 norm. Let = H1 (Ω) and the norm in V, v = a v, is equivalent to the λ := sup{ b(x, t, y) /a : x Ω, t [, T ], y U}; it is shown in section 5 that λ = 1 c a < 1 and that (1.5) is satisfied, in appropriately defined tubes. Condition (1.5), with appropriately small λ, is used in [3], see also [2] for a similar but more stringent condition, for the analysis of implicit explicit multistep schemes for (1.1), and in [1] for the analysis of more general linearly implicit methods. In these papers, A and B are discretized in different ways and their knowledge is crucial. In contrast, in this paper both A and B are discretized in the same way; thus for the methods only F matters while for the analysis solely the existence of A and B suffices. Description of the results. In this paper we present a comprehensive a priori analysis of variational in time methods, and in particular of the Discontinuous and Continuous Galerkin methods, for the above class of nonlinear parabolic equations. Our approach is related to the one of [16, 17] in the sense that we still use the properties of Radau and Gauss-Legendre quadrature rules that are naturally associated to Discontinuous and Continuous Galerkin methods. On the other hand the proofs in this paper are based on entirely variational arguments and in particular on novel stability lemmata, cf. Lemma 2.1, Corollary 2.1 and Lemma 5.1. These lemmata allow us to gain the necessary control in L 2 ( ; H) that is missing in order for the energy method to be successfully applied. The lack of control in L 2 ( ; H) is also the reason that the proof of [24] for the linear case is not easily extendable to nonlinear equations. (Note the interesting relation between the test functions we choose in Lemma 2.1, Corollary 2.1 and Lemma 5.1.) First, we show existence and local uniqueness of the Galerkin approximations. We then derive optimal order a priori error estimates. Note that the regularity required by the exact solution is minimal and corresponds to similar estimates in [24]. The analysis is extended also in the case of fully discrete schemes, i.e., the combination of Galerkin time stepping methods with discretization in space. For simplicity we do not consider here schemes combined with mesh modification in space, but our results can be extended to this case by appropriately adopting ideas from [18] to our case. We derive our results in an abstract Hilbert space setting and apply them to a concrete example, namely to a quasilinear parabolic equation. The paper is organized as follows: In Section 2 we show existence and uniqueness of the discontinuous Galerkin approximations for a modified equation with globally Lipschitz continuous nonlinearity. Our proofs are variational and simplify those in [16]. These results combined with the error estimates yield existence and local uniqueness of the discontinuous Galerkin approximations for problem (1.1). Section 3 is devoted to the a priori error analysis for the discontinuous Galerkin method. In Section 4 we consider fully discrete schemes, i.e., we combine the discontinuous Galerkin time stepping with space discrete schemes. The continuous Galerkin method is analyzed in Section 5: we prove

A PRIORI ANALYSIS FOR GALERKIN METHODS 5 existence and local uniqueness of continuous Galerkin approximations and derive optimal order error estimates. The results are presented in a more condensed way, avoiding details for arguments already used in the analysis of the discontinuous Galerkin method. In addition we do not include the analysis for fully discrete continuous Galerkin approximations since our results can be extended to fully discrete schemes in a similar fashion as in the discontinuous Galerkin case presented in Section 4. In Section 6 we briefly discuss the application of the abstract results to a quasilinear parabolic equation. 2. The DG Case: Existence and Uniqueness In this section we show existence and uniqueness of the discontinuous Galerkin approximations for a modified equation. This serves as an intermediate step and will be used in the sequel to establish existence and local uniqueness of the discontinuous Galerkin approximations for our original equation. The existence result is derived under the assumption that V d q be finite dimensional; although this might appear restrictive, let us note that in the really interesting case when the time discretization is combined with discretization in space, i.e., in the fully discrete case, our assumption on the dimension of V d q is fulfilled. We assume that B(t, ) can be modified to yield an operator B(t, ) : V V coinciding with B(t, ) in the tube T u, B(t, v) = B(t, v) for all t [, T ] and all v T u, and satisfying the global Lipschitz condition, cf. (1.5), (2.1) B(t, v) B(t, w) λ v w + µ v w v, w V. The discontinuous Galerkin method for the modified equation (2.2) { u (t) + Au(t) = B(t, u(t)), < t < T, u() = u, is to seek U Vq d satisfying (2.3) [(U, v) + (AU, v)] dt + (U n+ U n, v n+ ) = ( B(t, U), v) dt, for all v V q ( ), for n =,..., N 1. Here U() = u(). It is easily seen that the solution u of (1.1) is also a solution of (2.2); further, (2.1) yields easily uniqueness of (smooth) solutions of (2.2). In this section we show existence and uniqueness of the solution of the scheme (2.3). Later on we will see that U T u and will easily conclude existence and local uniqueness of the solution of scheme (1.2). Existence and uniqueness of discontinuous Galerkin approximations for the nonlinear Schrödinger equation were established in [16]. Our approach is motivated by the one in [16], simplifies it and relies on the following auxiliary result. Lemma 2.1. Let < τ 1 < < τ q = 1 and w 1,..., w q be the abscissae and the weights of the Radau quadrature rule in [, 1], p P q 1 and p P q 1 be the interpolant of ϕ,

6 G. AKRIVIS AND CH. MAKRIDAKIS ϕ(t) := p(t)/t, at the Radau points. Then (2.4) in particular, (2.5) 1 p pdt + p() p() = 1 2 1 p pdt + p() p() 1 2 [ p(1) 2 + [ p(1) 2 + w i τ 1 i p(τ i ) 2] ; 1 ] p 2 dt. Proof. Let v P q 2 be given by v(t) := p(t) p() t, i.e., be such that p(t) = p() + tv(t); then, obviously, ϕ(t) = v(t) + p() 1 t. Therefore, polynomial p can be written in the form p = v + p()λ with Λ P q 1 the interpolant of 1 t at the Radau points, i.e., Λ(τ i) = 1 τ i, i = 1,..., q. To start with, we first note that it is easily seen that 1 (2.6) p pdt = 1 1 v 2 dt + 1 [ 1 1 ] 2 2 v(1) 2 + p() tv (t)λ(t) dt + v(t)λ(t) dt. Now, for s P q 1, using the exactness of the Radau quadrature formula, we have in particular (2.7) and (2.8) 1 ts (t)λ(t) dt = 1 1 1 s (t) dt = s(1) s() ; tv (t)λ(t) dt = v(1) v() tλ (t)λ(t) dt = 1 Λ(). Since also vλ is integrated exactly by the Radau quadrature formula, using (2.7) in (2.6), we get 1 (2.9) p pdt = 1 1 v 2 dt + 1 [ ] 2 2 v(1) 2 + p() v(1) v() + w i v(τ i )τi 1. Since, p() p() = p()v() + Λ() p() 2, v(1) = p(1) p(), and v 2 is integrated exactly by the Radau quadrature formula, relation (2.9) can be written as 1 p pdt + p p() = 1 2 p(1) 2 + ( Λ() 1 ) p() 2 2 (2.1) + 1 2 w i [v(τ i ) 2 + 2v(τ i )τ 1 i p()].

A PRIORI ANALYSIS FOR GALERKIN METHODS 7 Now, in view of (2.8), i.e., Λ() = 1 1 2 1 (2.11) Λ() 1 2 = 1 2 t(λ 2 ) (t)dt = 1 2 + 1 2 w i τ 2 i. Using (2.11) in (2.1) and then the fact that ( ) 2τ p() 2 i + ( v(τ i ) ) 2 + 2p()v(τi )τi 1 we obtain (2.4). Further, 1 p pdt + p() p() 1 2 [ p(1) 2 + 1 ( Λ(t) ) 2dt, = τ 1 i p(τ i ) 2, w i p(τ i ) 2] = 1 [ p(1) 2 + 2 1 ] p(t) 2 dt. A second proof to Lemma 2.1 is given in the Appendix. A change of variables shows that the results of Lemma 2.1 transform to the interval [t n, t n+1 ] as follows: Corollary 2.1. Let p P q 1 be a polynomial in [t n, t n+1 ], ϕ(t) := k n p(t)/(t t n ), and p P q 1 be the interpolant of ϕ at the Radau points t n,i, i = 1,..., q, shifted to [t n, t n+1 ], t n,i := t n + k n τ i. Then (2.12) and (2.13) and t n+1 t n p pdt + p(t n ) p(t n ) = 1 2 t n+1 t n p pdt + p(t n ) p(t n ) 1 2 [ p(t n+1 ) 2 + We introduce in V q ( ) the norms n and n by v n := [ k n v n := [ k n w i τ 1 i p(t n,i ) 2] [ p(t n+1 ) 2 + 1 t n+1 ] p 2 dt. k n t n w i τ 1 i v(t n,i ) 2] 1/2 w i τ 1 i v(t n,i ) 2] 1/2. Let U, V V q ( ), W := U V, and W V q ( ) be the interpolant of k n W (t)/(t t n ) at the Radau points t n,i, i = 1,..., q. Then, we have, cf. (2.12), (2.14i) (W, W ) dt + (W n+, W n+ ) 1 [ W n+1 2 + 1 W 2 ] n. 2 k n

8 G. AKRIVIS AND CH. MAKRIDAKIS Further, since the Radau quadrature rule integrates polynomials of degree at most 2q 2 exactly, (AW, W ) dt = k n w i (AW (t n,i ), W (t n,i )) = k n w i τi 1 (AW (t n,i ), W (t n,i )), i.e., (2.14ii) Moreover, using first (2.1) we get (AW, W ) dt = W 2 n. ( B(t, U) B(t, V ), W ) ( λ W + µ W ) W and using then the fact that ( λ W + µ W ) W is integrated exactly by the Radau quadrature rule we obtain ( B(t, U) B(t, V ), W ] ) dt k n w i [λ W (t n,i ) + µ W (t n,i ) τi 1 W (t n,i ) therefore, (2.14iii) for any positive ε. = λ W 2 n + µk n w i τ 1 i W (t n,i ) W (t n,i ) ; In ( B(t, U) B(t, V ), W ) dt (λ + ε) W 2n + µ2 4ε W 2 n 2.1. Uniqueness. Let U, V V d q be discontinuous Galerkin approximations to (2.2), corresponding to the same partition of the time interval [, T ], with the same initial value u(). We will inductively show that U and V coincide, assuming that the partition is sufficiently fine. Assume that U and V coincide in 1. Then, obviously, (2.15) U n = V n. Subtracting [(V, v) + (AV, v)] dt + (V n+ V n, v n+ ) = ( B(t, V ), v) dt for all v V q ( ), from (2.3) and using (2.15), we obtain, with W := U V, (2.16) (W, v) dt + (W n+, v n+ ) + (AW, v) dt = ( B(t, U) B(t, V ), v) dt for all v V q ( ). We now let v = W V q ( ) in (2.16) be the interpolant of k n W (t)/(t t n ) at the Radau points t n,i, i = 1,..., q, and use (2.14) to obtain (2.17) k n W n+1 2 + W 2 n + 2k n (1 λ ε) W 2 n k n µ 2 2ε W 2 n,

A PRIORI ANALYSIS FOR GALERKIN METHODS 9 for any positive ε. Now let ε < 1 λ and k n be sufficiently small such that k n µ 2 2ε < 1 to conclude from (2.17) that W vanishes in ; this shows uniqueness of the discontinuous Galerkin approximation for the modified equation (2.2) (and local uniqueness of the discontinuous Galerkin approximation for the original equation (1.1)) under the assumption that k, k := max n k n, be sufficiently small. 2.2. Existence. The discontinuous Galerkin approximate solution U Vq d is defined in by its value U n at t n, which has been determined from the conditions in the preceding time interval 1, and by (2.3). We introduce in V q ( ) the inner product, by v, w := (v, w) dt with w V q ( ) denoting the interpolant of k n w(t)/(t t n ) at the Radau points t n,i, i = 1,..., q. It is readily seen that, can also be written in the form v, w = k n w i τ 1 i (v(t n,i ), w(t n,i )). We now define a map G : V q ( ) V q ( ) by (2.18) G(v), w = [(v, w) + (Av, w) ( B(t, v), w)] dt + (v n+ U n, w n+ ) for all w V q ( ). It is easily seen that G is well defined and continuous. Further, the existence of a discontinuous Galerkin approximation in is equivalent to the existence of a U V q ( ) such that G(U) =. We will use a variant of Brouwer s fixed point theorem to show existence of a root of G, under the assumption that Vq d be finite dimensional. For v V q ( ) we have G(v), v = (v, ṽ) dt + (v n+, ṽ n+ ) + (Av, ṽ) dt I n ( B(t, v) B(t, ), ṽ) dt ( B(t, ), ṽ) dt (U n, ṽ n+ ), and thus, in view of (2.14), (2.19) G(v), v 1 ( v n+1 2 + 1 v 2 ) n + (1 λ ε) v 2 2 k n n µ2 4ε v 2 n ( B(t, ), ṽ) dt (U n, ṽ n+ ). Since ṽ 2 is integrated exactly by the Radau quadrature rule, it is easily seen that ṽ 2 dt = k n w i τi 2 v(t n,i ) 2 τ1 1 v 2 n,

1 G. AKRIVIS AND CH. MAKRIDAKIS and thus (2.2i) Further, and hence therefore ( B(t, ), ṽ) dt 1 2 ṽ n+ = w i τ 1 i v(t n,i ) B(t, ) 2 dt + 1 2τ 1 v 2 n. q j=1, j i ṽ n+ 2 c 1 k n v 2 n ; τ j τ i τ j, (2.2ii) (U n, ṽ n+ ) c 1 U n 2 + 1 4k n v 2 n. Letting now ε < 1 λ in (2.19) and using (2.2), we obtain (2.21) G(v), v 1 ( 1 µ2 4 k n ε 2 ) v 2 τ n α n 1 with α n := 1 2 B(t, ) 2 dt + c 1 U n 2. For k n sufficiently small such that 1 k n µ2 ε 2 τ 1 be positive and for v n sufficiently large, the right-hand side of (2.21) is positive, and the existence of a discontinuous Galerkin approximation in, for sufficiently small k n, follows from Brouwer s fixed point theorem if Vq d is finite dimensional, for instance in the fully discrete case. 3. The DG Case: Error Estimates In this Section we establish optimal order estimates for u U, U being the solution of (2.3), under the assumption that B(t, ) : V V coincides with B(t, ) in the tube T u, B(t, v) = B(t, v) for all t [, T ] and all v T u, and satisfies the global Lipschitz condition (2.1). After having established the error estimate we can show that for sufficiently small time steps the solution of (2.3) satisfies also (1.2), and, thus, we will have error estimates for the original equation. Let W V d q be defined by W = u and (3.1) W n+1 = u n+1, (u W, v)dt = v V q 1 ( ), n =,..., N 1, cf. [24], p. 185. The function W will play an important role in the error analysis in the sequel. It is well known that W is well defined by (3.1) and satisfies the error estimates (3.2) (u W )(t) 2 Ckn 2q 1 u (q) (s) 2 ds, t,

A PRIORI ANALYSIS FOR GALERKIN METHODS 11 with standing for either one of the norms, and, cf. [24]. From (3.1) we easily obtain, for v V q ( ), (u W, v)dt = (u W, v )dt (u n W n+, v n+ ), i.e., (3.3) (W, v)dt + (W n+ W n, v n+ ) = (u, v)dt v V q ( ). With ρ := u W and Θ := W U, we split the error u U in the form u U = ρ + Θ. Since ρ has been estimated in (3.2), our main goal in this section will be the estimation of Θ. We will achieve this in two stages: First we will show consistency of the numerical scheme for W and subsequently we will show stability. 3.1. Consistency. Let R V q ( ) denote the consistency error of the discontinuous Galerkin method for W, (3.4) (R(t), v)dt = (W + AW B(t, W ), v) dt + (W n+ W n, v n+ ) for all v V q ( ). Then, in view of (3.3) and (1.1), (3.5) (R(t), v)dt = (Aρ, v) dt + ( B(t, u) B(t, W ), v) dt. Letting v := A 1 R in (3.5) and using (2.1), we have R(t) 2 dt = (A 1/2 ρ, A 1/2 [ ] R) dt + λ ρ + µ ρ R dt, and thus easily (3.6) R(t) 2 dt 2 ρ(t) 2 [ ] 2dt dt + 2 λ ρ(t) + µ ρ(t). From (3.2) and (3.6) we obtain the consistency estimate (3.7) R(t) 2 dt Ckn 2q u (q) (t) 2 dt. 3.2. Stability. From (2.3) and (3.4) we obtain an error equation for Θ, (Θ, v) dt + (Θ n+, v n+ ) + (AΘ, v)dt = (Θ n, v n+ ) I (3.8) n + ( B(t, W ) B(t, U), v) dt + (R(t), v) dt

12 G. AKRIVIS AND CH. MAKRIDAKIS for all v V q ( ). First, letting v := 2Θ in (3.8) we obtain Θ n+1 2 + Θ n+ 2 2(Θ n, Θ n+ ) + 2 Θ 2 dt I (3.9) n = 2 ( B(t, W ) B(t, U), Θ) dt + 2 (R(t), Θ) dt. Now, (3.1i) Θ n+ 2 2(Θ n, Θ n+ ) Θ n 2 and (3.1ii) 2 (R(t), Θ) dt 1 R(t) 2 dt + ε Θ 2 dt ; ε further, in view of (2.1), (3.1iii) 2 ( B(t, W ) B(t, U), Θ) dt (2λ + ε) Θ In 2 dt + µ2 ε Using (3.1) in (3.9), we get Θ n+1 2 + α In Θ(t) 2 dt Θ n 2 + µ2 ε Θ(t) 2 dt + 1 ε Θ 2 dt. R(t) 2 dt, with α := 2(1 λ ε), for any positive ε. Thus, choosing ε < 1 λ, we have (3.11) Θ n+1 2 + c Θ(t) 2 dt Θ n 2 + c Θ(t) 2 dt + c R(t) 2 dt. The standard energy proof can not be completed since we do not have control of the term Θ(t) 2 dt. To overcome this barrier we will make use of the stability Lemma 2.1 and the corresponding Corollary 2.1. To this end, letting in (3.8) v := Θ, the interpolant of k n Θ(t)/(t t n ) at the Radau points t n,i, i = 1,..., q, we obtain, cf. (2.14) and (2.17), (3.12) 1 2 for any positive ε. Now, and thus (3.13i) [ Θ n+1 2 + 1 k n Θ 2 n ] + (1 λ ε) Θ 2 n µ 2 4ε Θ 2 n + (Θ n, Θ n+ ) + (R(t), Θ) dt, (R(t), Θ) 1 4τ 1 ε R(t) 2 + ετ 1 Θ 2, (R(t), Θ) dt 1 4τ 1 ε cf. the derivation of (2.2i); moreover R(t) 2 dt + ε Θ 2 n, (3.13ii) (Θ n, Θ n+ ) c 1 Θ n 2 + 1 4k n Θ 2 n,

cf. (2.2ii). Using (3.13) in (3.12) we get Θ n+1 2 + 1 2 A PRIORI ANALYSIS FOR GALERKIN METHODS 13 ( 1 µ2 ) Θ 2 k n ε n + α Θ 2 n 2c 1 Θ n 2 + 1 2τ 1 ε R(t) 2 dt, with α = 2(1 λ 2ε), for any positive ε. Hence, in view of the equivalence of the norms n and ( 2 dt ) 1/2 in Vq ( ) with constants independent of, with standing for either one of the norms and, we have, for ε < (1 λ)/2 and k n sufficiently small, (3.14) Θ n+1 2 + c k n In particular, (3.15) Θ(t) 2 dt + c Θ(t) 2 dt c 1 Θ n 2 + c 1 R(t) 2 dt. Θ(t) 2 dt Ck n Θ n 2 + Ck n R(t) 2 dt. Using (3.15) in (3.11), we obtain Θ n+1 2 + c Θ(t) 2 dt (1 + Ck n ) Θ n 2 + C R(t) 2 dt and this yields easily (3.16) Θ n 2 + 3.3. Convergence. t n Θ(t) 2 dt ce Ctn[ t n ] Θ 2 + R(t) 2 dt. 3.3.1. Error estimation at the nodes. In view of (3.7) and the fact that Θ() =, from (3.16) we obtain n 1 (3.17) Θ(t n ) 2 ce Ctn j= k 2q j I j u (q) (t) 2 dt. Now, since ρ vanishes at the nodes of the partition, see (3.1), (3.17) yields immediately the desired estimate at the nodes N 1 (3.18) max (u n N U)(tn ) 2 ce CT k 2q j u (q) (t) 2 dt. I j 3.3.2. Error analysis in L (H). Combining the inverse inequality j= (3.19) Θ 2 L ( ;H) ck 1 n Θ 2 L 2 (;H) with (3.15) and using (3.7) and (3.17), we easily conclude n (3.2) Θ 2 L ( ;H) cectn+1 j= k 2q j I j u (q) (t) 2 dt.

14 G. AKRIVIS AND CH. MAKRIDAKIS Finally, (3.2) and (3.2) yield the desired uniform in time error estimate ( ) 2 max (u t T U)(t) 2 c max kn q max u (q) (t) n N 1 t (3.21) N 1 + ce CT k 2q j u (q) (t) 2 dt. I j 3.3.3. Error analysis in L 2 (V ). In view of (3.7) and the fact that Θ() =, from (3.16) we obtain T N 1 (3.22) Θ(t) 2 dt C k 2q j u (q) (t) 2 dt. I j j= j= Finally, (3.2) and (3.22) yield the desired error estimate (3.23) T (u U)(t) 2 dt C N 1 j= k 2q j I j u (q) (t) 2 dt. 3.3.4. Error analysis for the original equation. Our analysis up to this point concerns the discontinuous Galerkin approximation U, see (2.3), to the modified equation (2.2). Here, we give our main result, namely error estimates for the discontinuous Galerkin approximation to the original equation (1.1). Theorem 3.1. Let the solution u of (1.1) be sufficiently smooth and U Vq d be a discontinuous Galerkin approximation to (1.1), i.e., a solution of (1.2). Let k := min n N 1 k n. Then, under the meshcondition k 2q k 1 sufficiently small we have the error estimate (3.24) max (u t T U)(t) 2 C + C max n N 1 N 1 j= k 2q j ( k q n max t u (q) (t) I j u (q) (t) 2 dt. Proof. Let for the time being U be the discontinuous Galerkin approximation to the modified equation (2.2). First, from (3.2) we conclude, for k sufficiently small, (3.25) max t T (u W )(t) 1 2. Further, from (3.22) and the analogous to (3.19) inverse inequality, we get (3.26) max t T (W U)(t) 2 Ck 2q k 1, and thus, under our meshcondition, (3.27) max t T (W U)(t) 1 2. ) 2

A PRIORI ANALYSIS FOR GALERKIN METHODS 15 It immediately follows from (3.25) and (3.27) that U T u. Therefore, the discontinuous Galerkin approximation U to the modified equation (2.2) is also a (locally unique) discontinuous Galerkin approximation to the original equation (1.1), and (3.24) follows from (3.21). Remark 3.1. The constants in this and previous sections as well as conditions like k sufficiently small do not directly depend on the particular choice of the operators A and B; they only depend on λ, µ, the discretization scheme and on various norms of the solution u. This fact will play a crucial role in the analysis of fully discrete schemes in the next section. 4. The DG case: Fully discrete schemes In this section we consider fully discrete schemes; we combine the discontinuous Galerkin time stepping with space discrete schemes. We establish optimal order error estimates. For the space discretization we use a family V h, < h < 1, of finite dimensional subspaces of V. For simplicity, we will use the same finite dimensional space V h throughout the interval [, T ]; the analysis can be modified to take into account possible changes of this space. In this section the following discrete operators will play an essential role: Define P o : V V h, A h : V V h and B h (t, ) : V V h by (P o v, χ) = (v, χ) χ V h (A h ϕ, χ) = (Aϕ, χ) χ V h (B h (t, ϕ), χ) = (B(t, ϕ), χ) χ V h. The space discrete problem corresponding to (1.1) is to seek a function u h : [, T ] V h satisfying { u h (t) + A h u h (t) = B h (t, u h (t)), < t < T, (4.1) u h () = u h, with u h V h a given approximation to u. To construct a fully discrete scheme, we discretize (4.1) in time by the discontinuous Galerkin method. With the notation of the previous sections and V d qh := {ϕ : [, T ] V h/ ϕ In (t) = q 1 j= w j t j }, the fully discrete approximation U h Vqh d to u is defined by (4.2) [(U h, v) + (F h(t, U h ), v)] dt + (U n+ h Uh n, vn+ ) = v V qh ( ), for n =,..., N 1, with F h (t, v) := A h v B h (t, v) and U h () = u h.

16 G. AKRIVIS AND CH. MAKRIDAKIS Let B(t, ) : V V be differentiable, and assume that the linear operator M(t), M(t) := A B (t, u(t)) + σi, is uniformly positive definite, for an appropriate constant σ. Following [3], we introduce the elliptic projection R h (t) : V V h, t [, T ], by (4.3) P o M(t)R h (t)v = P o M(t)v. We will show consistency of the space discrete scheme for W h, W h (t) := R h (t)u(t); to this end we shall use approximation properties of the elliptic projection operator R h (t). We assume that R h (t) satisfies the estimates (4.4) u(t) W h (t) + h d/2 u(t) W h (t) Ch r, and (4.5) d dt [u(t) W h(t)] Ch r, with two integers r and d, 2 d r. Note here that d corresponds to the order of the operator A, e.g., if A is a second order elliptic operator, as in Section 5, then d = 2. We further assume that (4.6) T dq dt q W h(t)] 2 dt C. For consistency purposes, we assume for the nonlinear part the estimate (4.7) B(t, u(t)) B(t, W h (t)) B (t, u(t))(u(t) W h (t)) Ch r. Let E h (t) V h denote the consistency error of the space discrete equation (4.1) for W h, (4.8) E h (t) := W h (t) + A hw h (t) B h (t, W h (t)), t T. From the definition of W h we easily conclude (A h W h (t), χ) = (Au(t) [ B (t, u(t)) κi ] (u(t) W h (t)), χ) χ V h. Therefore, using (1.1), E h (t) =W h (t) P ou (t) + σ [ P o u(t) W h (t) ] + P o [ B(t, u(t)) B(t, Wh (t)) B (t, u(t))(u(t) W h (t)) ], and, in view of (4.4), (4.5) and (4.7), we easily obtain the following optimal order estimate for the consistency error E h, (4.9) max t T E h(t) Ch r. The main result in this paper concerning the discontinuous Galerkin method is given in the following theorem: Theorem 4.1. Let the solution u of (1.1) be sufficiently smooth. Assume we are given an initial approximation U h = u h V h to u such that (4.1) u u h Chr.

A PRIORI ANALYSIS FOR GALERKIN METHODS 17 Then, for k and h sufficiently small, there exists a locally unique solution U h V d qh to (4.2); further, there exists a constant C, independent of k and h, such that, for k 2q k 1 and h 2r k 1 sufficiently small, we have the error estimate [ N 1 (4.11) max (u U ] h)(t) 2 C h 2r + k 2q+1 j. t T Proof. Existence and local uniqueness follow easily from our analysis in Section 2. To prove the error estimate, let first ρ := u W h ; according to (4.4) we have (4.12) max t T ρ(t) Chr. Further, according to (4.4), (4.13) max t T ρ(t) Chr d 2 and thus, for h sufficiently small, W h (t) T u, t [, T ]. Now, let B h (t, ) : V V h be defined by ( B h (t, ϕ), χ) = ( B(t, ϕ), χ) χ V h, and Ŵh Vqh d be such that (4.14) [(Ŵ h, v) + ( ˆF h (t, Ŵh), v)] dt + (Ŵ n+ h Ŵ h n, vn+ ) = v V qh ( ), for n =,..., N 1, with ˆB h (t, v) := B h (t, v) + E h (t), ˆFh (t, v) := A h v ˆB h (t, v) and Ŵ h () = W h, i.e., Ŵ h is a modified fully discrete discontinuous Galerkin approximation corresponding to equation (4.8). Then, according to (3.21), and in view of (3.7) and (4.6), N 1 (4.15) max (W h Ŵh)(t) 2 C k 2q+1 j. t T Further, (4.16) max t T (W h Ŵh)(t) 2 Ck 2q k 1, cf. (3.26). In view of (4.12) and (4.15), it remains to estimate Θ := Ŵh U h. We now temporarily change the meaning of U h and let it denote the modified discontinuous Galerkin approximation, (4.17) [(U h, v) + (A hu h, v)] dt + (U n+ h Uh n, vn+ ) = ( B h (t, U h ), v) dt for all v V qh ( ), for n =,..., N 1, with U h () = u h ; subsequently, after having established the desired error estimate we will show that U h is also a solution to (4.2). From (4.14) and (4.17), we obtain t n (4.18) Θ n 2 + Θ(t) 2 dt ce Ctn[ t n ] Θ 2 + E h (t) 2 dt, j= j=

18 G. AKRIVIS AND CH. MAKRIDAKIS see (3.16). Now, Θ = (Wh u )+(u u h ); thus, in view of (4.12) and (4.1), Θ Ch r, and, using also (4.9), (4.18) yields (4.19) Θ n 2 + Further, t n Θ(t) 2 dt Ch 2r. Θ(t) 2 dt Ck n Θ n 2 + Ck n E h (t) 2 dt, see (3.15), and thus, in view of (3.19), (4.19) and (4.9), (4.2) max t T Θ(t) Chr. Moreover, from (4.19) we obtain (4.21) max t T Θ(t) 2 Ch 2r k 1. Now, it easily follows from (4.13), (4.16) and (4.21) that, for k 2q k 1 and h 2r k 1 sufficiently small, U h (t) T u, t [, T ], and thus that U h is a discontinuous Galerkin approximation to the original equation, i.e., it satisfies (4.2). Finally, (4.12), (4.15) and (4.2) yield the error estimate (4.11) and the proof is complete. 5. The continuous Galerkin method In this section we analyze the continuous Galerkin method for problem (1.1). We show existence and local uniqueness of the continuous Galerkin approximations and establish optimal order error estimates. Let us emphasize that in this section no error estimates in L 2 (V ) are derived, and thus the tube T u is defined in terms of the norm of H, T u, T u := {v V : min t u(t) v 1}, see Remark 5.1. 5.1. Existence and Uniqueness. We begin by showing existence and uniqueness of the continuous Galerkin approximations for a modified equation. As before, this serves as an intermediate step and will be used in the sequel to establish existence and local uniqueness of the continuous Galerkin approximations for our original equation. The existence result is derived under the assumption that Vq c be finite dimensional, as was the case for the discontinuous Galerkin method. Existence and uniqueness of continuous Galerkin approximations for the nonlinear Schrödinger equation were established in [17]. The approach in [17] is based on properties of Gauss-Legendre quadrature formulae and interpolation. Our approach is direct and simplifies the proofs of [17]. We assume that B(t, ) can be modified to yield an operator B(t, ) : V V coinciding with B(t, ) in the tube T u, B(t, v) = B(t, v) for all t [, T ] and all v T u, and satisfying globally a one-sided Lipschitz condition, cf. (1.4), (5.1) ( B(t, v) B(t, w), v w) λ v w 2 + µ v w 2 v, w V.

A PRIORI ANALYSIS FOR GALERKIN METHODS 19 The continuous Galerkin method for the modified equation { u (t) + Au(t) = B(t, u(t)), < t < T, (5.2) u() = u, is to seek U V q satisfying (5.3) [(U, v) + (AU, v)] dt = ( B(t, U), v) dt v V q 1 ( ) for n =,..., N 1. Here U() = u(). It is easily seen that the solution u of (1.1) is also a solution of (5.2); further, (5.1) yields easily uniqueness of (smooth) solutions of (5.2). Next, we shall show existence, under the assumption that V c q be finite dimensional, and uniqueness of the solution of the scheme (5.3). Later on, we will see that U T u and will easily conclude existence and local uniqueness of the solution of scheme (1.3). 5.1.1. Uniqueness. Let U, V V c q be continuous Galerkin approximations, corresponding to the same partition of the time interval [, T ], with the same initial value u(). We will inductively show that U and V coincide. Assume that U and V coincide in 1. Then, obviously, (5.4) U(t n ) = V (t n ) and, consequently, (5.5) t [t n, t n+1 ] (U V )(t) = (t t n )w(t) with w V q 1 ( ). Subtracting [(V, v) + (AV, v)] dt = ( B(t, V ), v) dt v V q 1 ( ) from (5.3), we obtain [((U V ), v) + (A(U V ), v)] dt = ( B(t, U) B(t, V ), v) dt for all v V q 1 ( ); thus, in view of (5.5), we have (5.6) [(w, v) + (t t n )(w, v) + (t t n )(Aw, v)] dt = ( B(t, U) B(t, V ), v) dt, for all v V q 1 ( ). Now, i.e., (5.7) (t t n )(w, w) dt = 1 (t t n ) d 2 dt w 2 dt = 1 [ (t t n ) w(t) 2] t=t n+1 1 w 2 dt, 2 t=t n 2 (t t n )(w, w) dt = 1 2 k n w(t n+1 ) 2 1 2 w 2 dt.

2 G. AKRIVIS AND CH. MAKRIDAKIS In view of (5.7), relation (5.6), for v = w, can be written in the form 1 w 2 dt + 1 2 2 k n w(t n+1 ) 2 + (t t n ) w 2 dt I (5.8) n = ( B(t, U) B(t, V ), w) dt. Now, in view of (5.5) and (5.1), (5.9) ( B(t, U) B(t, V ), w) (t t n ) [ λ w 2 + µ w 2]. From (5.8) and (5.9) we obtain 1 w 2 dt + 1 2 2 k n w(t n+1 ) 2 + (t t n ) w 2 dt I n λ (t t n ) w 2 dt + µ (t t n ) w 2 dt ; hence, (5.1) 1 2 w 2 dt + 1 2 k n w(t n+1 ) 2 µ (t t n ) w 2 dt. For k n sufficiently small, such that 2 µ k n < 1, (5.1) yields w =, and we conclude uniqueness of the continuous Galerkin approximation. 5.1.2. Existence. The continuous Galerkin approximate solution U V c q is defined in by its value U(t n ) at t n (which has been determined from the conditions in the preceding time interval 1 ) and by (5.3). Now, since U(t n ) is considered given, U can be written in in the form (5.11) U(t) = U(t n ) + (t t n )W (t), t, with W V q 1 ( ). We now consider W V q 1 ( ) our unknown and use (5.11) to rewrite (5.3) as [(W, v) + (t t n )(W, v) + (AU(t n ), v) + (t t n )(AW, v)] dt I (5.12) n = ( B(t, U(t n ) + (t t n )W ), v) dt v V q 1 ( ). Let P q 2 denote the L 2 orthogonal projection operator onto V q 1 ( ). It is then easily seen that (5.12) can be written in the form (5.13) G(W ) = with G : V q 1 ( ) V q 1 ( ), (5.14) G(v) := v + (t t n )v + AU(t n ) + P q 2 ( (t t n )Av ) P q 2 B(t, U(t n ) + (t t n )v).

A PRIORI ANALYSIS FOR GALERKIN METHODS 21 We will use a variant of Brouwer s fixed point theorem to show existence of a W at which G vanishes, i.e., existence of a solution of (5.13). For v V q 1 ( ) we have [ (G(v), v)dt = v 2 + (t t n )(v, v) + (AU(t n ), v) i.e., (5.15) Now, + (t t n ) v 2 ( B(t, U(t n ) + (t t n )v), v) ] dt, [ (G(v),v)dt = v 2 + (t t n )(v, v) + (AU(t n ), v) + (t t n ) v 2 ( B(t, U(t n ) + (t t n )v) B(t, U(t n )), v) ( B(t, U(t n )), v) ] dt. (AU(t n ), v) U(t n ) v 1 U(t n ) 2 εk n v 2, 4εk n and, in view of the inverse inequality k n v I 2 dt c (t t n ) v 2 dt, n cf. [24], we have (5.16i) (AU(t n ), v) dt 1 4ε U(tn ) 2 cε (t t n ) v 2 dt. Similarly, (5.16ii) Therefore, with ( B(t, U(t n )), v) dt 1 4εk n B(t, U(t n )) 2 dt cε (t t n ) v 2 dt. ϕ n (t) := 1 4εk n [ U(t n ) 2 + B(t, U(t n )) 2], in view of (5.1) and (5.7), we easily obtain from (5.15), (G(v), v)dt 1 v 2 dt + 1 2 2 k n v(t n+1 ) 2 + (1 λ 2cε) (t t n ) v 2 dt I n µ (t t n ) v 2 dt ϕ n (t)dt, and thus, for ε sufficiently small, [1 (G(v), v)dt 2 µ(t tn ) ] v 2 dt ϕ n (t)dt. Hence, for k n sufficiently small such that 4 µ k n 1, we have [1 (5.17) (G(v), v) dt 4 v 2 ϕ n (t) ] dt.

22 G. AKRIVIS AND CH. MAKRIDAKIS Consequently, for v V q 1 such that v(t) 2 ϕ n (t), for all t, we have (G(v), v) dt. Since, := (, ) dt is an inner product in V q 1 ( ), existence of a continuous Galerkin approximation in, for sufficiently small k n, follows from a variant of Brouwer s fixed point theorem if V q 1 ( ) is finite dimensional, for instance in the fully discrete case. 5.2. Error Estimates. We shall establish optimal order estimates for u U, U being the solution of (5.3), under the assumption that B(t, ) : D(A) H coincides with B(t, ) in the tube T u, B(t, v) = B(t, v) for all t [, T ] and all v Tu, and satisfies the global Lipschitz condition (2.1). After having established the error estimate we can show that for sufficiently small time steps the solution of (5.3) satisfies also (1.3), and, thus, we will have error estimates for the original equation. Let W V c q be defined by (5.18) T W () = u(), (u W, v )dt =, v V c q, cf. [5]. The function W will play an important role in the error analysis. With (u W )(t n )t, t t n, (5.19) v(t) := (u W )(t n )t n, t n < t T, relation (5.18) yields W (t n ) = u(t n ), i.e., W has the following interpolation property (5.2) W (t n ) = u(t n ), n =,..., N. It is then easily seen that the restriction of W in could have been alternatively defined by (5.21) W (t n ) = u(t n ), W (t n+1 ) = u(t n+1 ), (u W, v)dt = v V q 1 ( ). Existence, uniqueness and the error estimate (5.22) (u W )(t) 2 Ckn 2q 1 u (q) (s) 2 ds, t, with standing for either one of the norms, and, can be established by standard arguments, cf. (3.1) and (3.2). With ρ := u W and Θ := W U, we split the error u U in the form u U = ρ + Θ.

A PRIORI ANALYSIS FOR GALERKIN METHODS 23 Since ρ has been estimated in (5.21), our main goal in this Section will be the estimation of Θ. We will achieve this in two stages: First we will show consistency of the numerical scheme for the interpolant W and subsequently we will show stability. 5.3. Consistency. Let R V q 1 ( ) denote the consistency error of the continuous Galerkin method (5.3) for W, (5.23) R = W + P q 2 ( AW B(, W ) ). Let v V q 1 ( ). Then, (R, v) dt = (W + AW B(t, W ), v) dt, and thus, in view of (1.1) and (5.21), (5.24) (R, v) dt = (Aρ, v) dt + ( B(t, u) B(t, W ), v) dt. Letting v := A 1 R in (5.23) and using (2.1) we have R(t) 2 dt = (A 1/2 ρ, A 1/2 [ ] R)dt + λ ρ + µ ρ R(t) dt, and thus easily (5.25) R(t) 2 dt 2 ρ(t) 2 [ ] 2dt dt + 2 λ ρ(t) + µ ρ(t). From (5.21) and (5.25) we obtain the consistency estimate (5.26) R(t) 2 dt Ckn 2q u (q) (s) 2 ds. 5.4. Stability. From (5.3) and (5.23) we obtain an error equation for Θ, (Θ, v)dt + (AΘ, v)dt = ( B(t, W ) B(t, U), v)dt I (5.27) n + (R(t), v)dt for all v V q 1 ( ). The stability analysis relies on the following auxiliary result. Lemma 5.1. Let < τ 1 < < τ q 1 < 1 be the abscissae of the Gauss Legendre quadrature formula in [, 1], and t n,i, t n,i := t n + k n τ i, i = 1,..., q 1, denote the corresponding nodes shifted to. Let Θ P q 1 ( ), and ˆΘ, Θ P q 2 ( ) be the interpolants of Θ and ϕ, ϕ(t) := k n Θ(t)/(t t n ), at t n,i, i = 1,..., q 1, respectively. Then (5.28) Θ ˆΘ dt = Θ Θ dt and (5.29) k n Θ Θ dt 1 5 Θ 2 dt ck n ( Θ(t n+1 ) 2 + Θ(t n ) 2),

24 G. AKRIVIS AND CH. MAKRIDAKIS with a positive constant c. Proof. Let w 1,..., w q 1 denote the weights of the Gauss Legendre quadrature formula in [, 1] with q 1 nodes. It is well known that the corresponding Gauss Legendre quadrature formula Q, Q(v) = k n [ w1 v(t n,1 ) + + w q 1 v(t n,q 1 ) ], integrates in polynomials of degree at most 2q 3 exactly. Since Θ(τ i ) = ˆΘ(τ i ), obviously Q(Θ Θ) = Q(Θ ˆΘ), and (5.28) follows. To show (5.29), we first write Θ in the form with Z P q 2 ( ), Then Θ(t) = Θ(t n ) + (t t n )Z(t), Z(t) = 1 t t n [Θ(t) Θ(tn )]. ϕ(t) = k n Z(t) + Θ(t n ) t t n. Therefore, with Λ P q 2 ( ) denoting the inteprolant of k n /(t t n ) at t n,i, i = 1,..., q 1, i.e., k n Λ(t n,i ) = 1 τ i, i = 1,..., q 1, Θ may be written in the form Θ = k n Z + Θ(t n )Λ. Now, cf. (5.7), Θ [ (t)z(t) dt = Z(t) + (t t n )Z (t) ] Z(t) dt = 1 2 k n Z(t n+1 ) 2 + 1 Z 2 dt, 2 i.e., (5.3) k n Θ I (t)z(t) dt = 1 n 2 Θ(tn+1 ) Θ(t n ) 2 + 1 2 k n Z 2 dt. Further, from the definition of Z we easily get (5.31) Θ 2 dt 2k n Θ(t n ) 2 + 2kn 2 Z 2 dt. Relations (5.3) and (5.31) yield immediately (5.32) kn 2 Θ (t)z(t) dt 1 2 k n[ Θ(t n+1 ) Θ(t n ) 2 Θ(t n ) 2] + 1 Θ 2 dt. 4 Next, we shall estimate Θ Λdt. We rewrite this term in the form Θ Λ dt = ΘΛ dt + Θ(t n+1 )Λ(t n+1 ) Θ(t n )Λ(t n ), and use the fact that Λ and k n Λ are uniformly bounded to obtain (5.33) k n Θ Θ(t n )Λ dt ε Θ 2 [ dt + k n c Θ(t n+1 ) 2 + c ε Θ(t n ) 2]

A PRIORI ANALYSIS FOR GALERKIN METHODS 25 for any positive ε. Choosing ε 1/2 in (5.33), from (5.32) and (5.33) we easily conclude that (5.29) is valid and the proof is complete. Letting v = ˆΘ V q 1 ( ) in (5.27) be the interpolant of Θ at t n,i, i = 1,..., q 1, and using (5.28), we have (Θ, Θ)dt + (AΘ, ˆΘ)dt = ( B(t, W ) B(t, U), ˆΘ)dt I (5.34) n + (R(t), ˆΘ)dt. In view of the exactness of the Gaussian quadrature rule, the left-hand side can be written in the form (5.35) (Θ, Θ)dt + (AΘ, ˆΘ)dt = 1 [ Θ n+1 2 Θ n 2] q 1 + k n w i Θ(t n,i ) 2. 2 To estimate the first term on the right-hand side of (5.34), we first note that using (2.1) we have ( B(t, W ) B(t, U), ˆΘ) B(t, W ) B(t, U) ˆΘ λ Θ ˆΘ + µ Θ ˆΘ [ λ Θ ˆΘ + 1 2 µε ˆΘ 2] + 1 2ε µ Θ 2 ; since the first term on the right-hand side is integrated exactly by the Gaussian quadrature formula, we obtain (5.36) ( B(t, W ) B(t, U), ˆΘ)dt ( λ + 1 2 µε) q 1 k n w i Θ(t n,i ) 2 + 1 2ε I µ Θ 2 dt. n Further, ˆΘ 2 is integrated exactly by the Gaussian quadrature formula, and the second term on the right-hand side of (5.34) can be estimated in the form (5.37) (R(t), ˆΘ)dt 1 2ε R(t) 2 dt + 1 2 εk n Using (5.35), (5.36) and (5.37) in (5.34), we obtain q 1 (5.38) Θ n+1 2 + αk n w i Θ(t n,i ) 2 Θ n 2 + µ ε w i Θ(t n,i ) 2. Θ 2 dt + 1 ε R(t) 2 dt with α := 2(1 λ) (µ + 1)ε and µ ε := 1 ε µ; we assume in the sequel that ε is sufficiently small such that α be positive.

26 G. AKRIVIS AND CH. MAKRIDAKIS Next, we would like to estimate Θ L 2 ( ;H) = ( Θ 2 dt ) 1/2. Letting v := ˆΘ Vq 1 ( ) 1 in (5.27) be the interpolant of k n t t Θ(t) at t n,i, i = 1,..., q, and using (5.29), we have n 1 Θ 2 dt + k n (AΘ, 5 Θ)dt ( ck n Θ n+1 2 + Θ n 2) I (5.39) n + k n ( I B(t, W ) B(t, U), Θ)dt + k n (R(t), Θ)dt. n Now, in view of the exactness of the Gaussian quadrature formula, (5.4) (AΘ, Θ)dt q 1 w i = k n Θ(t n,i ) 2. τ i Further, (5.41) ( B(t, W ) B(t, U), Θ)dt λk n q 1 From (5.39), (5.4) and (5.41), we obtain 1 q 1 Θ 2 dt + k 2 w i 5 n(1 λ) Θ(t n,i ) 2 τ i (5.42) ck n ( Θ n+1 2 + Θ n 2) + µk n Θ Θ dt + k n w i Θ(t n,i ) 2 + µ Θ τ Θ dt. i R(t) Θ dt. Now, Θ 2 is integrated exactly by the Gaussian quadrature formula, and we have Θ q 1 2 w i dt = k n Θ(t n,i ) 2 ; hence (5.43) Further, Θ 2 dt 1 τ 1 k n τ 2 i q 1 w i τ i Θ(t n,i ) 2. µ Θ Θ µ2 Θ 2 + τ 1ε 2ετ 1 2 Θ 2, and, in view of (5.43), we obtain (5.44) µ Θ In Θ dt µ2 Θ 2 dt + ε 2ετ 1 2 k q 1 w i n Θ(t n,i ) 2. τ i Similarly, (5.45) R(t) Θ dt 1 2ετ 1 R(t) 2 dt + ε 2 k n q 1 w i τ i Θ(t n,i ) 2.

From (5.42), (5.44) and (5.45) we get (5.46) ( 5µ 2 ) 1 k n 2ετ 1 A PRIORI ANALYSIS FOR GALERKIN METHODS 27 Θ 2 dt + 5k 2 n(1 λ ε) q 1 w i ck n ( Θ n+1 2 + Θ n 2) + 5 2ετ 1 k n Θ(t n,i ) 2 τ i R(t) 2 dt ; therefore, for ε < 1 λ and k n sufficiently small, (5.47) Θ 2 ( dt ck n Θ n+1 2 + Θ n 2) + ck n R(t) 2 dt. From (5.38) and (5.47), we easily obtain, for k n sufficiently small, q 1 (5.48) Θ n+1 2 + k n w i Θ(t n,i ) 2 (1 + Ck n ) Θ n 2 + C R(t) 2 dt and this yields easily n 1 (5.49) Θ n 2 + cf. (3.16). 5.5. Convergence. l= q 1 k l w i Θ(t l,i ) 2 [ t n ce Ctn Θ 2 + R(t) 2 dt ], 5.5.1. Error estimation at the nodes. In view of (5.26) and the fact that Θ() =, from (5.49) we obtain n 1 (5.5) Θ(t n ) 2 ce Ctn j= k 2q j I j u (q) (t) 2 dt. Now, since ρ vanishes at the nodes of the partition, see (5.2), (5.5) yields immediately the desired estimate at the nodes N 1 (5.51) max (u n N U)(tn ) 2 ce CT k 2q j u (q) (t) 2 dt. I j 5.5.2. Error analysis in L (H). Combining the inverse inequality j= Θ 2 L ( ;H) ck 1 n Θ 2 L 2 (;H), cf. (3.19), with (5.47) and using (5.26) and (5.5), we easily conclude n (5.52) Θ 2 L ( ;H) cectn+1 k 2q j u (q) (t) 2 dt. I j j=

28 G. AKRIVIS AND CH. MAKRIDAKIS Finally, (5.22) and (5.52) yield the desired uniform in time error estimate ( ) 2 kn q max (5.53) max (u t T U)(t) 2 C max n N 1 N 1 + Ce ct j= k 2q j u (q) (t) t I j u (q) (t) 2 dt. Up to this point in this section U is considered a continuous Galerkin approximation for the modified equation (5.2). However, it immediately follows from (5.53) that, for sufficiently small k, we have U T u ; therefore U is also a continuous Galerkin approximation for the original equation (1.1). Remark 5.1. In contrast to the discontinous Galerkin method, error estimates in L 2 (V ) for the continuous Galerkin method are not obtained directly. The essential reason for this difference between the two methods might be the fact that the continuous Galerkin method has less advantageous smoothing properties than the discontinuous Galerkin method, cf. [24]. Concerning our approach, estimates in L 2 (V ) do not follow at once for the continuous Galerkin approximations, since the expression ( ) q 1 1/2 (5.54) k n w i v(t n,i ) 2 cf., e.g., (5.38), it is a seminorm in V q ( ) rather than a norm. An expression of the form (5.54) is a norm in V q ( ) equivalent to ( ) 1/2 v(t) 2 dt if the sum contains at least q terms. In concrete applications one might derive estimates in at an additional point in the interval and then combine this with our results to obtain estimates in L 2 (V ). The lack of estimates in L 2 (V ) is the reason for the definition of the tube T u in terms of the norm of H in this section. As noted in the introduction, in applications the choice of the appropriate tube depends on the concrete problem. Thus in the analysis of fully discrete schemes the verification of the inclusion of the approximate solution to the appropriate tube is shown using the estimates obtained, certain inverse inequalities, and appropriate meshconditions, cf. Section 6 for example. For the continuous Galerkin method in the fully discrete case the meshconditions needed can be relaxed by obtaining extra control of the error in the norm of V at an additional point in the interval (different from t n,i, i = 1,..., q 1). The implementation of this task depends on the particular application. 6. Application to a quasilinear equation In this section we shall briefly discuss the application of our abstract results to a class of quasilinear equations: Let Ω R ν, ν = 1, 2, 3, be a bounded domain with smooth

A PRIORI ANALYSIS FOR GALERKIN METHODS 29 boundary Ω. For T > we seek a real valued function u, defined on Ω [, T ], satisfying u t = div ( c(x, t, u) u + g(x, t, u) ) + f(x, t, u) in Ω [, T ], (6.1) u = on Ω [, T ], u(, ) = u in Ω, with c : Ω (, ), f : Ω [, T ] R R, g : Ω [, T ] R R ν, and u : Ω R given smooth functions. We are interested in approximating smooth solutions of this problem, and assume therefore that the data are smooth and compatible such that (6.1) gives rise to a sufficiently regular solution. For the discretization of (6.1) by implicit explicit finite element multistep methods we refer to [3]; other applications are included in [2]. Let H s = H s (Ω) be the usual Sobolev spaces of order s, and H s be the norm of H s. The inner product in H := L 2 (Ω) is denoted by (, ), and the induced norm by ; the norm of L s (Ω), 1 s, is denoted by L s. Let U := [ 1 + min x,t u, 1 + max x,t u], and T u := {v V L : min t u(t) v L 1}, ˆT u := {v V W 1 : min t u(t) v W 1 1}. Let c > and c be such that We set c c(x, t, y) c x Ω, t [, T ], y U. a := c + c, b(x, t, y) := c(x, t, y) a, 2 A := a, B(t, v) := div ( b(, t, v) v ) + div g(, t, y) + f(, t, y). Then, obviously, V = H 1 = H1 (Ω) and the norm in V, v = a v, is equivalent to the H 1 norm. Let now λ := sup{ b(x, t, y) /a : x Ω, t [, T ], y U}; it is then easily seen that λ = 1 c a < 1. For v, w, ϕ V, and we easily see that (B(t,v) B(t, W ), ϕ) = (b(, t, w) (v w), ϕ) ([b(, t, v) b(, t, w)] v, ϕ) (g(, t, v) g(, t, w), ϕ) + (f(, t, v) f(, t, w), ϕ), (6.2) B(t, v) B(t, w) λ v w + µ v w for all v ˆT u, w T u ; thus, a stability condition of the form (1.5) is satisfied for v ˆT u and w T u.

3 G. AKRIVIS AND CH. MAKRIDAKIS Further, B (t, v)w = div(b(, t, v) w) + div( 3 b(, t, v)w v) + div( 3 g(, t, v)w) + 3 f(, t, v)w, and, therefore, A B (t, u(t)) + σi is, for an appropriate constant σ, uniformly positive definite in H 1. Let V h be the subspace of V defined on a regular finite element partition T h of Ω, and consisting of piecewise polynomial functions of degree at most r 1, r 2. Let h K denote the diameter of an element K T h, and h := max K Th h K. We define the elliptic projection operator R h (t), R h (t) : V V h, t [, T ], by ([a( ) + b(, t, u(, t))] (v R h (t)v), χ) + ([ 3 b(, t, u(, t))] u(, t) + 3 g(, t, u(, t))](v R h (t)v), χ) ([ 3 f(, t, u(, t)) σ](v R h (t)v), χ) = χ V h. It is well known from the error analysis for elliptic equations that (6.3) v R h (t)v + h v R h (t)v Ch r v H r, v H r H 1, i.e., the estimate (4.4) is satisfied with d = 2. Further, (6.4) d dt [u(, t) R h(t)u(, t)] Ch r, and (6.5) dq dt q R h(t)v + h dq dt q R h(t)v Ch r v H r, v H r H 1, cf., e.g., [6]; thus (4.5) and (4.6) are valid. We further assume, cf. [23], that (6.6) sup u(, t) R h (t)u(, t) W 1 1 t 2. Next, we will verify (4.7). We have (6.7i) and (6.7ii) B(t, u(t)) B(t, R h (t)u(t)) B (t, u(t))(r h (t)u(t) u(t)) = = 1 τb ( t, R h (t)u(t) τ[r h (t)u(t) u(t)] ) dτ[r h (t)u(t) u(t)] 2 B (t, v)w 2 = div( 2 3b(, t, v)w 2 v) + 2 div( 3 b(, t, v)w w) + div( 2 3g(, t, v)w 2 ) + 2 3f(, t, v)w 2. It easily follows from (6.7) and (6.3), in view of (6.6), that (6.8) B(t, u(t)) B(t, R h (t)u(t)) B (t, u(t))(u(t) R h (t)u(t)) H 1 Ch r, i.e., (4.7) is satisfied. We further assume we are given an initial approximation u h V h to u such that (6.9) u u h chr.