R T (u H )v + (2.1) J S (u H )v v V, T (2.2) (2.3) H S J S (u H ) 2 L 2 (S). S T

2 R.H. NOCHETTO 2. Lecture 2. Adaptivity I: Design and Convergence of AFEM tarting with a conforming mesh T H, the adaptive procedure AFEM consists of loops of the form OLVE ETIMATE MARK REFINE to produce the next conforming nested triangulation T h. The procedure OLVE solves (.5) for the discrete solution u H. The procedure ETIMATE determines the element indicators η H (T ) and oscillation osc H (T ) for all elements T T H. Depending on their relative size, these quantities are later used by the procedure MARK to mark elements T, and thereby create a subset T H of T H of elements to be refined. Finally, procedure REFINE partitions those elements in T H and a few more to maintain mesh conformity. These procedures are discussed in detail below. The theory started with Dörfler [2], but we follow the more recent developments [32, 34, 35]. 2.. Procedure ETIMATE: A Posteriori Error Bounds. In addition to T H, let H denote the set of interior faces (edges or sides) of the mesh T H. According to (.7), we consider the residual R = R(u H ) V defined by R(u H ) := f + div (A u H ) b u H c u H, and its relation to the error L(u u H ) = R(u H ). In view of Lemma.5 it suffices to estimate R(u H ) V. To this end, we set e H := u u H and integrate by parts B[e H, v] elementwise to obtain the error representation formula B[e H, v] = R(u h ), v = R T (u H )v + (2.) J (u H )v v V, T T H T H where the element residual R T (u H ) and the jump residual J (u H ) are defined as (2.2) (2.3) R T (u H ) := f + div (A u H ) b u H c u H in T T H, J (u H ) := A u + H ν+ A u H ν := [A u H ] ν on H, where is the common side of elements T + and T with unit outward normals ν + and ν, respectively, and ν = ν. Whenever convenient, we will use the abbreviations R T = R T (u H ) and J = J (u H ). 2... Upper Bound. For T T H we define the local error indicator η H (T ) by (2.4) η H (T ) 2 := HT 2 R T (u H ) 2 L 2 (T ) + H J (u H ) 2 L 2 (). T Given a subset ω Ω, we define the error estimator η H (ω) by η H (ω) 2 := T T H, T ω η H (T ) 2. Hence, η H (Ω) is the residual-type error estimator of Ω with respect to the mesh T H. This estimator is the simplest in the literature but not the most precise. We now find a first relation between η H (Ω) and the energy error. Lemma 2. (A Posteriori Upper Bound). There exists a constant C > 0 depending only on the shape regularity constant γ, and the coercivity and continuity constants c B and C B of B, such that (2.5) u u H 2 C η H (Ω) 2.

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 3 Proof. With φ := e H I H e H we can write e H 2 = B[e H, e H ] = R(u H ), e H (definition (.7) of R(u H )) = R(u H ), φ (Galerkin orthogonality (.3)) = R T φ + J φ (expression (2.) of R) T T H T H T T H R T L 2 (T ) φ L 2 (T ) + H J L 2 () φ L 2 () (Cauchy-chwarz) In view of the interpolation estimate (.22), we have φ L 2 (T ) CH T e H L 2 (N(T )) as well as R T L2 (T ) φ L2 (T ) C HR T L2 (T ) e H L2 (N(T )) T T H T T H ( ) /2 C HR T 2 L 2 (T ) eh, T T H where we have used the finite overlapping property of the neighborhoods N(T ) of elements T T H. On the other hand, we recall the scaled trace inequality for H and element T T H with side φ L 2 () CH /2 φ L 2 (T ) + CH /2 φ L 2 (T ). This, in conjunction with (.22), yields J L 2 () φ L 2 () C H /2 J L 2 () e H L 2 (N()) H H ( ) /2 C H /2 J L 2 () eh. H ince we can always associate the side contributions by elements, we arrive at the asserted estimate. 2..2. Lower Bound. Let R T P k (T ) be the L 2 -projection of R T, where k is the polynomial degree. We define the oscillation on the elements T T H by (2.6) osc H (T ) 2 := HT 2 RT R T 2 L 2 (T ), and for a subset ω Ω, we define osc H (ω) 2 := T T H, T ω osc H (T ) 2. We point out that osc H is a convenient means to quantify information missed by the averaging process associated with the FEM. Using the explicit construction of Verfürth [, 46] via bubble functions and positivity, and continuity of A, we can get a local lower bound of the error in terms of local indicators and oscillation. We point out that our construction deals with discrete bubble functions in a space V h containing V H and defined over a refinement of T h of T H ; this idea is due to Dörfler [2] and will be crucial later. Lemma 2.2 (A Posteriori Lower Bound). There exists a constant C 2 > 0, depending only on the shape regularity constant γ, C B, and c B, such that (2.7) C 2 η H (T ) 2 u u H 2 H (ω T ) + osc H(ω T ) 2, where the domain ω T consists of all elements sharing at least a side with T. Proof. We assume that ω T for T T H is refined in such a way that there is an interior node in each element in ω T and each side of T. We also assume that the test function v in (2.) is piecewise

4 R.H. NOCHETTO Pfrag replacements T 2 T x x 2 x Figure 2.. Example of a refined two-element patch T T 2 in two dimensions. polynomial of degree k over the refinement of ω T, namely v V h, and supp (v) ω T. Note that (2.) reads B[e H, v] = R T v + (R T R T )v + (2.8) J v. T T T T H H Figure 2.2. Discrete bubble functions ψ, ψ x, x of Fig. 2. for polynomial degree k =. associated with the interior nodes We proceed in three steps.. Interior Residual. Let T T H, and let x T T h be an interior node of T. Let ψ T V h be a bubble function which satisfies ψ T (x T ) =, vanishes on T, and 0 ψ T ; hence supp (ψ T ) T. ince R T P k (T ) and ψ T > 0 in a polyhedron of measure comparable with that of T, we have C RT 2 L 2 (T ) 2 ψ T R T = R T (ψ T R T ). T ince v = ψ T R T is a piecewise polynomial of degree k over T h, it is thus an admissible test function in (2.8) which vanishes outside T (and in particular on all H ). Therefore C RT 2 L 2 (T ) B[e H, ψ T R T ] + (R T R T )ψ T R T T ( C H T e H H (T ) + RT RT R L2 L T (T )) 2 (T ), because of the inverse inequality v L 2 (T ) CH T v L 2 (T ). This, together with the triangle inequality, yields the desired estimate for HT 2 R T 2 L 2 (T ) : HT 2 R T 2 L 2 (T ) ( e C H 2 H (T ) + ) H2 T RT R T 2 (2.9). L 2 (T ) 2. Jump Residual. Let H be an interior side of T = T T H, and let T 2 T H be the other element sharing. Let x T h be an interior node of. Let ψ V h be a bubble function in ω := T T 2 such that ψ (x ) =, ψ vanishes on ω, and 0 ψ ; hence supp (ψ ) ω. T

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 5 ince u H is continuous, then [[ u H ]] is parallel to ν, i.e. [ u H ] = j ν. Moreover, the coefficient matrix A(x) being continuous implies (2.) J = A(x) [[ u H ] ν = j A(x)ν ν = a(x) j, where a(x) := A(x)ν ν satisfies 0 < a a(x) a with a, a the smallest and largest eigenvalues of A(x) on. Consequently, J 2 L 2 () a2 j 2 Ca2 j 2 ψ C a2 (2.0) (j ψ )J, a where the second inequality follows from j being a polynomial and ψ > 0 in a polygon of measure comparable with that of. We now extend j to ω by first mapping to the reference element, next extending constantly along the normal to Ŝ and finally mapping back to ω. The resulting extension E h (j ) is a piecewise polynomial of degree n in ω so that ψ E h (j ) V h, and satisfies ψ E h (j ) L 2 (ω ) CH/2 j L 2 (). ince v = ψ E h (j ) V h is an admissible test function in (2.8) which vanishes on all sides of H but, we arrive at J (j ψ ) = B[e H, v] R T v R T2 v T T ( 2 ) Therefore (2.2) C H /2 e H H (ω + ) H/2 2 R Ti L 2 (T i) i= ( ) 2 H J 2 L 2 () C e H 2 H (ω + ) HT 2 i R Ti 2 L 2 (T i). i= j L 2 (). 3. Final Estimate. To remove the interior residual from the right hand side of (2.2) we observe that both T and T 2 contain an interior node of T h. Hence, (2.9) implies ( ) 2 H J 2 L 2 () C ε H 2 H (ω + ) HT 2 RTi i R Ti 2 (2.3). i= L 2 (T i) The asserted estimate for η H (T ) 2 is thus obtained by adding this bound to (2.9). The constant C depends on the shape regularity constant γ and the ratio a 2 /a of largest and smallest eigenvalues of A(x) for x. Remark 2.3 (Equidistribution). We see from (2.7) that if the oscillation osc H (ω T ) is small compared to the indicator η H (T ), then the size of the indicator η H (T ) will give reliable information about the size of the error u u H H (ω T ). This explains why refining elements with large indicators usually tends to equi-distribute the errors which, according to Example.9, is an ultimate goal of adaptivity. This idea is employed by the procedure MARK of 2.2. Remark 2.4 (Positivity). The use of A(x) being positive definite in (2.0) avoids having oscillation terms on. This comes at the expense of a constant depending on a 2 /a. If we were to proceed in the usual manner, as in [, 37, 46], we would end up with an oscillation of the form H /2 (A(x) A(x )) [ u H ] ν L2 () = H/2 (a(x) a(x ))j L 2 () CH 3/2 CH H /2 A W () j L2 (), L 2 () where C > 0 also depends on the ratio a /a dictated by the variation of a(x) on. This oscillation can be absorbed into the term H /2 J L 2 () provided that the meshsize H is sufficiently small; see [37]. We do not need this assumption in our present discussion. J

6 R.H. NOCHETTO Remark 2.5 (Continuity of A). The continuity of A is instrumental in avoiding jump oscillations which in turn makes computations simpler. However, jump oscillations cannot be avoided when A exhibits discontinuities across inter-element boundaries of the initial mesh. We get instead of (2.3) (2.4) 2 CH J 2 L 2 () ε H 2 H (ω + ) HT 2 i R Ti R Ti 2 L 2 (T +H i) J J 2 i= L 2 (), where J is the L 2 -projection of J onto P k (). To obtain estimate (2.4) we proceed as follows. tarting from a polynomial J, we get an estimate similar to that of (2.0) C J 2 L 2 () 2 (2.5) ψ J = J (ψ J ) + (J J )(ψ J ). In contrast to (2.0), we see that the oscillation term (J J ) cannot be avoided when A has a discontinuity across. We estimate the first term on the right hand side of (2.5) exactly as we have argued with (2.) and thereby arrive at ( J (J ψ ) C H /2 ε H H (ω + ) H/2 2 J R Ti L L 2 (T i)) 2 (). This and a further estimate of the second term on the right hand side of (2.5), yield ( ) H J 2 2 C ε L 2 () H 2 H (ω + ) HT 2 i R Ti 2 L 2 (T + H i) J J 2, L 2 () i= whence the assertion (2.4) follows using triangle inequality for J L 2 (). Combining with (2.9), we deduce an estimate for η H (T ) similar to (2.7), namely, ( η H (T ) 2 C ε H 2 H (ω T ) + osc H(ω T ) 2), with the new oscillation term involving jumps on interior sides (2.6) osc H (T ) 2 := HT 2 RT R T 2 L 2 (T ) + H J J 2 L 2 (). For a given mesh T H and discrete solution u H, along with input data A, b, c and f, the procedure ETIMATE computes element indicators η H (T ) and oscillations osc H (T ) for all T T H according to (2.4) and (2.6): {η H (T ), osc H (T )} T TH = ETIMATE(T H, u H, A, b, c, f) T i= 2.2. Procedure MARK. Our goal is to devise a marking procedure, namely to identify a subset T H of the mesh T H such that after refining, both error and oscillation will be reduced. This is achieved with two marking strategies as follows. The Marking trategy E, also called bulk-chasing, was introduced by Dörfler [2]: Marking trategy E : Given a parameter 0 < θ E <, construct a minimal subset T H of T H such that (2.7) η H (T ) 2 θe 2 η H(Ω) 2, T T b H and mark all elements in T H for refinement. We will see later that Marking trategy E guarantees error reduction in the absence of oscillation terms. ince the latter account for information missed by the averaging process associated with the finite element method, we need a separate procedure to guarantee oscillation reduction. This procedure was introduced by Morin et al. [34, 35] as a separate means for reducing oscillation:

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 7 Marking trategy O : Given a parameter 0 < θ 0 < and the subset T H T H produced by Marking trategy E, enlarge T H to a minimal set such that (2.8) osc H (T ) 2 θ0 2 osc H(Ω) 2, T T b H and mark all elements in T H for refinement. Given a mesh T H and all information about the local error indicators η H (T ), and oscillation osc H (T ), together with user parameters θ and ˆθ, MARK generates a subset T H of T H T H = MARK(θ E, θ 0 ; T H, {η H (T ), osc H (T )} T TH ) 2.3. Procedure REFINE and AFEM. The following Interior Node Property, due to Morin et al [34, 35], is known to be necessary for error and oscillation reduction: Interior Node Property : Refine each marked element T T H to obtain a new mesh T h compatible with T H such that T and the d + adjacent elements T T H of T, as well as their common sides, contain a node of the finer mesh T h in their interior. In addition to the Interior Node Property, we assume that the refinement is done in such a way that the new mesh T h is conforming, which guarantees that both T H and T h are nested. This can be achieved by repeated bisection (see Figure 2.3). Pfrag replacements 0 0 0 Figure 2.3. Refinement of triangles in two dimensions by newest-vertex bisection. Dashed lines indicate the refinement edges, which are sides opposite to the most recently created nodes. Creating interior nodes is rather easy in two dimensions. First, elements are marked for two bisections and then refined. This already produces a node at the midpoint of each edge. econd, the two grandchildren with index are bisected once more. The whole refinement process is shown in Figure 2.3. The first refinement step may, as usual, involve surrounding elements which are not marked. This is an inevitable effect in order to preserve mesh conformity. The second refinement step is local in that it involves only the two grandchildren with index and does not spread outside them. In three dimensions it is impossible to perform the second step by dealing only with children of the original tetrahedron. The first step consists of three bisections. In order to obtain the interior nodes, the second step consists of marking some sub-tetrahedra for two or three additional bisections. This has the spreading effect of creating additional nodes in the edges of the original tetrahedron. For the implementation, we do not split the refinement into two steps, but rather mark a tetrahedron for six bisections which are performed in one step. This creates an interior node in the tetrahedron and interior nodes in all the element faces. With this property, we have a reduction factor γ 0 < of element size, i.e. if T T h is obtained by refining T T H, then h T γ 0 H T. In d = 2 with triangular elements, 3 newest bisections yield γ 0 /2.

8 R.H. NOCHETTO Given a mesh T H and a marked set T H, REFINE constructs a conforming refinement T h satisfying the Interior Node Property: T h = REFINE(T H, T H ) Given parameters θ E and θ 0 according to Marking trategies E and O, the adaptive algorithm AFEM consists of the loops of procedures OLVE, ETIMATE, MARK, and REFINE as follows: AFEM Choose parameters 0 < θ E, θ 0 <. () Pick an initial mesh T 0, initial guest u = 0, and set k = 0. (2) u k = OLVE(T k, u k, A, b, c, f). (3) {η k (T ), osc k (T )} T Tk = ETIMATE(T k, u k, A, b, c, f)). (4) T k = MARK(θ, ˆθ ; T k, {η k (T ), osc k (T )} T Tk ). (5) T k+ = REFINE(T k, T k ). (6) et k = k + and go to tep 2. 2.4. Error and Oscillation Reduction. To shed light on the ingredients for convergence of AFEM, we discuss three examples. They show the significance of the Interior Node Property in reducing both error and oscillation. We consider the simplest scenario, thereby following [34, 35], and assume that A is piecewise constant (but possibly discontinuous) and that both b and c vanish. The resulting operator L reads (2.9) Lu = div(a u) = f, and the corresponding oscillation merely depends on data (2.20) osc 2 H = T T H H 2 T f f T 2 L 2 (T ). Example 2.6 (Interior Node ). This example shows the necessity of creating an interior node inside each refined triangle. Consider problem (.3)-(.4) with A = I, f, and Ω = (0, ) (0, ). Let { (0, 0), (, 0), (, ), (0, ), ( 2, 2 )} be the set of vertices of T 0 (see Figure 2.4-left). Figure 2.4. Example 2.6: Finite element solutions u 0, u, u 2 for 3 consecutive meshes T 0, T, T 2 obtained with 2 bisection steps. The triangles of T do not have interior nodes but those of T 2 do, thereby yielding u = u 2 = 2 φ u 3. Let φ be the nodal basis function of V 0 that corresponds to the node ( 2, 2 ). Then, it is easily seen that the finite element solution u 0 is u 0 = 2 φ. Let T be the grid obtained from T 0 by performing two bisections on each triangle of T 0 using the newest-vertex bisection and assuming that ( 2, 2 ) is the newest vertex on the initial grid (see Figure 2.4-middle). This is the standard refinement 2-step

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 9 bisection [2], and does not lead to an interior node in the refined elements. Then we get a set of 5 nodes, and the finite element solution u on T solves a simple 5 5 system which satisfies u = u 0, as can be seen in Figure 2.4. ince u u = u u 0, we conclude that without one interior node in at least one triangle, no error reduction is obtained even when osc 0 = 0 and u u 0 Ω > 0. The presence of interior nodes (with respect to T 0 ) in the refinement T 2 of T yields a change of solution values (see Figure 2.4-right). Example 2.7 (Interior Node 2). At first sight, it may seem that the situation of Example 2.6 may occur only at the first refinement step. This example shows that such a situation can also happen at any refinement step n. Fix n N 0 and consider (.3) with A = I, Ω = (0, ) 2, and f given by f(x) = { if x (i 2 n, (i + ) 2 n ) (j 2 n, (j + ) 2 n ) and i + j odd otherwise; see Figure 2.5. Then, if we start with T 0 equal to the grid T 0 of Example 2.6, and φ also as in Example 2.6, we have that φ is orthogonal to f and consequently u 0 0. If we now define recursively T k+, k = 0,,... as the grid that results from T k by performing two newest-vertex bisections on every triangle (see Figure 2.5), we will have u k 0 for k = 0,,..., n, due to the fact that f is orthogonal to the basis functions of T k for k = 0,,..., n. - - - - - - - - Pfrag replacements - - - - - - - - - - - - - - - - - - - - - - - - T 0 T T 2 Figure 2.5. Example 2.7: Checkerboard function f for n = 3 (left), and grids T k for k = 0,, 2 (right). Figure 2.6. Example 2.7: Finite element solutions u 2, u 3, u 4 and meshes T 2, T 3, T 4 with n = 2. ince u 2 = u 3 u 4, error reduction may fail to hold at any adaptive loop not just at the first one. For k = n the solution u n will not be zero anymore, but it will be zero along the lines where f changes sign due to the symmetry of the problem, and the same will happen with u n+ (see Figure 2.6). Then, if we observe u n and u n+ in a fixed square where f is constant, they behave exactly as u 0 and u do in Example 2.6, and consequently u n = u n+, which means that the error does not decrease, even when the oscillation osc n is zero. Figure 2.6 depicts this situation.

20 R.H. NOCHETTO Example 2.8 (Data Oscillation). This example shows that if the data oscillation osc H is not small, then, even introducing an interior node on all elements, the error may not decrease. To see this, consider Example 2.7 for some fixed large n N 0. Observe now that if we obtain T k+ by performing three bisections on all the elements of T k, then three new nodes are created on the edge opposite to the newest vertex in addition to an interior node per element (see Figure 2.7). Even though this refinement is stronger than required by the Interior Node Property in each step, the solutions, u k will all be zero for k < 2n/3. Pfrag replacements T 0 T T T 2 Figure 2.7. Resulting grid T (left) and T 2 (right) after performing three bisections on each element of T 0 and T, respectively. We conclude from Examples 2.6 and 2.7 that the interior node is necessary to obtain an error decrease, and from Example 2.8 that this may not be sufficient if the mesh does not resolve oscillation of data. Therefore, in order to obtain an asymptotically convergent sequence of discrete solutions, we must readjust the mesh to resolve osc H according to a decreasing tolerance. We next quantify to effect of the Interior Node Property for both error and oscillation reduction. We start with a simple result relating two consecutive discrete solutions on an element that has been refined. Lemma 2.9 (Local Lower Bound for u h u H ). Let C 2 > 0 be the constant of Lemma 2.2. Then (2.2) C 2 η H (T ) 2 u h u H 2 H (ω T ) + osc H(ω T ) 2 T T H. Proof. ince B[u u h, v] = 0 for v V h, we have B[u u H, v] = B[u h u H, v] v V h. We now observe that the bubble functions in the proof of Lemma 2.2 are in V h. Therefore, the assertion follows as in Lemma 2.2. We realize that the local energy error between consecutive discrete solutions is bounded below by the local indicators for elements in the marked set T H, provided the oscillation term is sufficiently small relative to the energy error. On the other hand, we now prove that the oscillation reduces with a factor ρ <. The first lemma considers the worst scenario situation of f just in L 2 (Ω), whereas the second lemma addresses the case of f piecewise smooth. Lemma 2.0 (Data Oscillation Reduction ). Let 0 < γ 0 < be the reduction factor of element size associated with one refinement step of REFINE. Given 0 < θ 0 <, let ρ := ( ( γ 2 0)θ 2 0) /2. Let T H be a subset of T H satisfying Marking trategy O. If T h is generated by REFINE from T H, then the following data oscillation reduction occurs: (2.22) osc h ρ osc H.

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 2 Proof. Let T T h be an element contained in ˆT T H. ince f T is the L 2 -projection of f onto P k (T ), for instance f T = T T f for k =, we have ince h T γ 0 h ˆT, we discover that osc 2 h = f f T L 2 (T ) f f ˆT L 2 (T ). h 2 T f f T 2 L 2 (T ) T T h γ0 2 h 2ˆT f f ˆT 2 L 2 ( ˆT + ) ˆT T b H = (γ 2 0 )osc 2 H + osc 2 H ρ 2 osc 2 H. T T H\ b T H h 2 T f f T 2 L 2 (T ) 2.5. Convergence of AFEM. We prove convergence of AFEM under the simplifying assumptions on the coefficients of section 2.4, following [34, 35]. We will consider the general case in Lecture 3. It is crucial for convergence to be able to link two consecutive discrete solutions u H V H and u h V h. This is easy to do for the energy norm due to Lemma 2., but not for any other norm. Lemma 2. (Orthogonality). If V H V h, then the following relation holds (2.23) u u H 2 = u u h 2 + u h u H 2. Proof. The bilinear form B, being symmetric, induces a scalar product in V. By Galerkin orthogonality, B[e h, v] = 0 for all v V h, whence u h u H V h is perpendicular to u u h. Therefore, since u u H = (u u h ) + (u h u H ), the assertion (2.23) follows from the Pythagoras theorem This resul reveals the monotonicity property u u h u u H regardless of the strengh of the refinement leading from T H to T h. To enforce a strick error reduction we need to make sure that u h u H is at least a fixed proportion of u u H. We do this next upon quantifying the effect of MARK and REFINE on error reduction. In particular, we show the compound effect of the Marking trategy E with the Interior Node Property. Theorem 2.2 (Error Reduction). Given a triangulation T H and discrete solution u H, let T H be generated by MARK and let T h be a conforming refinement of T H generated by REFINE. Then there exists a constant 0 < α <, depending only on the minimum angle, θ E, C B, and c B, such that the solution u h on the mesh T h satisfies u u h α u u H + osc 2 H. Proof. We first derive a lower bound for u h u H. By Lemma 2.9 and Marking trategy E we have C 2 θe 2 η2 H C 2 η H (T ) 2 T T b H u h u H 2 H (ω T ) + osc H(ω T ) 2 T T b H D u h u H 2 + Dosc 2 H, where D := d + 2 accounts for the overlap of sets ω T. Hence, since u u H 2 C ηh 2 Lemma 2., in light of u h u H 2 θ2 E C 2 D η2 H osc2 H θ2 E C 2 DC u u H 2 osc 2 H

22 R.H. NOCHETTO Cobining this with Lemma (2.), we obtain This proves the assertion with α 2 = θ2 E C2 DC. u u h 2 = u u H 2 u h u H 2 ( u u H 2 θ2 E C ) 2 + osc 2 H DC Theorem 2.2 shows that the energy error decreases provided that oscillation decreases separately. The latter is provided by Lemma 2.0. We point out though that we do not prove a geometric decrease of the error but that rather that it is dominated by a geometric sequence. Theorem 2.3 (Convergence). Let 0 < θ E, θ 0 < be the parameters of MARK. Let 0 < α < be given by Theorem 2.2 and 0 < ρ < by Lemma 2.0. Let α 0 := max(α, ρ). Then, for every β such that α 0 < β <, the sequence {u k } k N0 of finite element solutions produced by AFEM satisfies (2.24) u u k Ω C 0 β k, with C 0 := e 0 + β α 0 osc 0. Proof. We first apply Lemma 2.0 recursively to get osc k ρ osc k ρ k osc 0. We then set e k := u u k, and utilize Theorem 2.2 to deduce which by recursion implies e k+ αe k + osc k αe k + ρ k osc 0, k (2.25) e k+ α k+ e 0 + osc 0 α j ρ k j. ince α α 0 and ρ < β, we obtain the estimate k k ( α j ρ k j β k α0 ) j β k β α0 β j=0 j=0 from which the assertion follows immediately. j=0 = β k+ β α 0, 2.6. Numerical Experiments. In this section we present two examples taken from [34, 35]. The purpose of the simulations is to verify experimentally the optimal performance of AFEM. We monitor the error decay in terms of degrees of freedom N for polynomial degree k =, and observe the optimal rate N /2. The experiments were implemented within the finite element toolbox ALBERTA [43]. 2.6.. Example: Discontinuous coefficients. We consider the Example.3 due to Kellogg [29]. In 0 0 true error estimate true error 0-0 5 0 5 20 25 30 35 iteration Figure 2.8. Example 2.6.: Error reduction: estimate and true error.

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 23 0 0 true error estimate opt. true error 0-0 0 2 0 3 DOFs Figure 2.9. Example 2.6.: Quasioptimality of AFEM: estimate and true error. The optimal decay is indicated by the green line with slope /2. Table 2.. Example 2.6.2: Total number and number of marked elements per iteration in two dimensions (left) and three dimensions (right): est.: marked elements due to error estimator, osc.: additionally marked elements to data oscillation. iter. elements est. osc. 0 4 8 0 64 6 6 2 704 56 8 3 2256 80 0 4 4208 96 8 5 6624 2 24 6 8752 344 0 7 752 432 0 8 28368 608 0 9 42896 768 6 0 6026 292 0 3040 2296 24 2 60592 386 24 iter. elements est. osc. 0 6 6 0 384 48 0 2 7776 48 48 3 5936 576 0 4 2320 5040 0 5 860592 536 720 6 693536 3044 0 Figures 2.8-2.9 we see the same behavior of the true error e k Ω and the estimator η k scaled by the factor 0.05. Figure 2.9 demonstrates that the grids and associated numerical complexity are quasioptimal: e k Ω = C DOFs(k) /2 is valid asymptotically (the performance of an optimal method is again indicated by the additional green straight line). For this problem the grid is highly graded at the origin. It is worth realizing the strength of the singularity at hand in Figure 2.0. We see a mesh with less than 2000 nodes and three zooms at the origin, each obtained with a magnifying factor 0 3, and yet exhibiting a rather strong grading. This is also reflected in Figure 2., which depicts the graph of the discrete solution over the underlying mesh: the solution is flatter in the quadrants with a 6 although the grid is finer, which accounts for the presence of a in the energy norm. This picture was created using the graphics package GRAPE [27]. 2.6.2. Example: Variable ource. In Example 2.6. the source term is constant. It is our purpose now to examine the effect of data oscillation (2.20). To this end, we consider the domain Ω = (, ) d with d = 2, 3, and the exact solution 0 x 2 u(x) = e of (2.9) with A = I and non-constant f = u. uch an f exhibits a relatively large variation in Ω, and within elements, which forces AFEM to refine additional elements due to data oscillation (Marking trategy O), not yet marked for refinement by Marking trategy E. This is reported in Table 2. for two dimensions (left) and three dimensions (right). We see that the number of additional elements due to large data oscillation is rather small relative to those due to large error indicators, but it is not zero. On the one hand, this confirms that control of data oscillation cannot be omitted in a

24 R.H. NOCHETTO Figure 2.0. Example 2.6.: Final grid (full grid with < 2000 nodes) (top left), zooms to ( 0 3, 0 3 ) 2 (top right), ( 0 6, 0 6 ) 2 (bottom left), and ( 0 9, 0 9 ) 2 (bottom right). Figure 2.. Example 2.6.: Graph of the discrete solution, which is r 0., and underlying grid.

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 25 0 0 true error estimate opt. true error 0-0 2 0 3 0 4 Figure 2.2. Example 2.6.2: Quasioptimality of AFEM: estimate and true error in two dimensions. The optimal decay is indicated by the green line with slope /2. DOFs 0 0 true error estimate opt. true error 0-0 3 0 4 0 5 DOFs Figure 2.3. Example 2.6.2: Quasioptimality of AFEM: estimate and true error in three dimensions. The optimal decay is indicated by the green line with slope /3. convergent algorithm. On the other hand, this explains why data oscillation seems to play a minor role for (piecewise) smooth data f, and hints at the underlying reasons why most adaptive strategies, although neglecting data oscillation, converge in practice. Figure 2.4. Example 2.6.2: Adaptive grids of the 3d simulation on ((, ) 3 \(0, ) 3 ): full grid of the 2nd iteration (left), zoom into the grid of the 4th iteration (right).

26 R.H. NOCHETTO As mentioned in 2.3, we produce in three dimensions the interior node by bisecting a marked tetrahedron six times. This corresponds in two dimensions to four bisections of a marked triangle, which is used here instead of the procedure of Figure 2.3. Although this produces more DOFs than needed, Figures 2.2 and 2.3 demonstrate that the resulting meshes are still quasi-optimal for both two dimensions and three dimensions. Here, the estimators η k were scaled by the factor 0.25. For comparison with an optimal mesh, green lines with slope /d are plotted in Figure 2.2 (d = 2) and Figure 2.3 (d = 3); note that these lines have nearly the same slope due to different scaling of the y axis. Finally, in Figure 2.4 we cut (0, ) 3 out of the domain (, ) 3 and show the adaptive grid of the three-dimensional simulation on the boundary of the resulting domain. In the left picture we show the full grid of the 2nd iteration and in the right one a zoom into the grid of the 4th iteration. For this picture we also used the graphics package GRAPE. 2.7. Exercises. 2.7.. Exercise: uperconvergence. Let T h be a shape-regular triangulation of a polygonal domain Ω R d. Let V h be a space of (possibly discontinuous) finite elements containing at least P k (T ) as interpolating polynomials for all T T h. Given u L 2 (Ω) let u h V h be the L 2 -projection of u onto V h, i.e. u h V h : (u u h )v = 0 v V h. Prove the following error estimates for 0 m k + (H m (Ω) = dual of H m 0 (Ω)): (2.26) (2.27) (2.28) Ω u u h L 2 (Ω) inf v V h u v L 2 (Ω) Ch k+ u H k+ (Ω); u u h H m (Ω) Ch k++m u H k+ (Ω); u u h H m (Ω) Ch k+ m u H k+ (Ω), the latter provided T h is quasi-uniform. The inequality (2.27) is referred to as a superconvergence estimate. Hint: to prove (2.28) add and subtract I h u and use an inverse inequality. 2.7.2. Exercise: Equivalent Estimator. Consider the a posteriori error estimator based only on jumps ( ) /2. ζ h = h J 2 L 2 () h The purpose of this problem is to demonstrate that ζ h is equivalent to the true error e h 2 L (Ω) under the same conditions that η h is, where η h is given by ηh 2 = h 2 T f 2 L 2 (T ) + h J 2 L 2 (). T T h h (a) Let f i be the L 2 -projection of f onto the set of constants P 0 (ω i ) over the star ω i =supp(φ i ). Use the error equation to deduce f i 2 L 2 (ω i) C f f i 2 L 2 (ω i) + C x i h J 2 L 2 (). (b) Use that each triangle T T h belongs to at most N sets ω i with N independent of h to conclude that h 2 T f 2 L 2 (T ) Ch2 max f f i 2 L 2 (ω + C i) h J 2 L 2 (). T T h i I h (c) how that f f i 2 L 2 (ω = i) o(h2 max ), h 2 max i I and then prove that ζ h is equivalent to e h 2 L (Ω).

ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC PDE 27 2.7.3. Exercise: Data Oscillation. Construct a counterexample with oscillatory function f showing that a global lower bound such as (2.7) cannot be valid without osc H. 2.7.4. Exercise: Data Oscillation Reduction 2. Let f be piecewise H s for 0 < s over the initial mesh, where H s stands for the space of functions with fractional derivative of order s in L 2. Let data oscillation be redefined by ( /2 osc h := h 2+2s T D s f 2 L 2 (T )). T T h Let γ 0, θ 0, and T h be defined as in Lemma 2.0. If ρ := ( ( γ0 2+2s )θ0) 2 /2, then show (2.29) osc h ρ osc H. ince γ 0 /2 in two dimensions, the reduction rate squared of is ˆα 2 5ˆθ 2 /6 ˆθ 2 provided f is piecewise H.