BDDC ALGORITHMS WITH DELUXE SCALING AND ADAPTIVE SELECTION OF PRIMAL CONSTRAINTS FOR RAVIART-THOMAS VECTOR FIELDS

Similar documents
Parallel Sums and Adaptive BDDC Deluxe

AN ADAPTIVE CHOICE OF PRIMAL CONSTRAINTS FOR BDDC DOMAIN DECOMPOSITION ALGORITHMS

A BDDC ALGORITHM FOR RAVIART-THOMAS VECTOR FIELDS TR

OVERLAPPING SCHWARZ ALGORITHMS FOR ALMOST INCOMPRESSIBLE LINEAR ELASTICITY TR

Multispace and Multilevel BDDC. Jan Mandel University of Colorado at Denver and Health Sciences Center

INSTITUTE OF MATHEMATICS THE CZECH ACADEMY OF SCIENCES. A virtual overlapping Schwarz method for scalar elliptic problems in two dimensions

CONVERGENCE ANALYSIS OF A BALANCING DOMAIN DECOMPOSITION METHOD FOR SOLVING A CLASS OF INDEFINITE LINEAR SYSTEMS

MULTIGRID PRECONDITIONING IN H(div) ON NON-CONVEX POLYGONS* Dedicated to Professor Jim Douglas, Jr. on the occasion of his seventieth birthday.

Multilevel and Adaptive Iterative Substructuring Methods. Jan Mandel University of Colorado Denver

Domain Decomposition Preconditioners for Spectral Nédélec Elements in Two and Three Dimensions

ASM-BDDC Preconditioners with variable polynomial degree for CG- and DG-SEM

Overlapping Schwarz Preconditioners for Spectral. Problem in H(curl)

20. A Dual-Primal FETI Method for solving Stokes/Navier-Stokes Equations

A Balancing Algorithm for Mortar Methods

Extending the theory for domain decomposition algorithms to less regular subdomains

ETNA Kent State University

Convergence analysis of a balancing domain decomposition method for solving a class of indefinite linear systems

Selecting Constraints in Dual-Primal FETI Methods for Elasticity in Three Dimensions

A Balancing Algorithm for Mortar Methods

Robust Domain Decomposition Preconditioners for Abstract Symmetric Positive Definite Bilinear Forms

A Multigrid Method for Two Dimensional Maxwell Interface Problems

ENERGY NORM A POSTERIORI ERROR ESTIMATES FOR MIXED FINITE ELEMENT METHODS

Multispace and Multilevel BDDC

INSTITUTE OF MATHEMATICS THE CZECH ACADEMY OF SCIENCES

Adaptive Coarse Space Selection in BDDC and FETI-DP Iterative Substructuring Methods: Towards Fast and Robust Solvers

Short title: Total FETI. Corresponding author: Zdenek Dostal, VŠB-Technical University of Ostrava, 17 listopadu 15, CZ Ostrava, Czech Republic

Optimal multilevel preconditioning of strongly anisotropic problems.part II: non-conforming FEM. p. 1/36

1. Fast Solvers and Schwarz Preconditioners for Spectral Nédélec Elements for a Model Problem in H(curl)

ISOGEOMETRIC BDDC PRECONDITIONERS WITH DELUXE SCALING TR

Overlapping Schwarz preconditioners for Fekete spectral elements

FEniCS Course. Lecture 0: Introduction to FEM. Contributors Anders Logg, Kent-Andre Mardal

From the Boundary Element Domain Decomposition Methods to Local Trefftz Finite Element Methods on Polyhedral Meshes

AN ADAPTIVE CHOICE OF PRIMAL CONSTRAINTS FOR BDDC DOMAIN DECOMPOSITION ALGORITHMS TR

Finite Element Methods for Maxwell Equations

From the Boundary Element DDM to local Trefftz Finite Element Methods on Polyhedral Meshes

TR THREE-LEVEL BDDC IN THREE DIMENSIONS

b i (x) u + c(x)u = f in Ω,

18. Balancing Neumann-Neumann for (In)Compressible Linear Elasticity and (Generalized) Stokes Parallel Implementation

arxiv: v3 [math.na] 8 Sep 2015

On the Robustness and Prospects of Adaptive BDDC Methods for Finite Element Discretizations of Elliptic PDEs with High-Contrast Coefficients

The All-floating BETI Method: Numerical Results

Construction of a New Domain Decomposition Method for the Stokes Equations

Discontinuous Galerkin Methods

An Iterative Substructuring Method for Mortar Nonconforming Discretization of a Fourth-Order Elliptic Problem in two dimensions

Scientific Computing WS 2018/2019. Lecture 15. Jürgen Fuhrmann Lecture 15 Slide 1

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner

PANM 17. Marta Čertíková; Jakub Šístek; Pavel Burda Different approaches to interface weights in the BDDC method in 3D

ENERGY NORM A POSTERIORI ERROR ESTIMATES FOR MIXED FINITE ELEMENT METHODS

cfl Jing Li All rights reserved, 22

Parallel Implementation of BDDC for Mixed-Hybrid Formulation of Flow in Porous Media

Schwarz Preconditioner for the Stochastic Finite Element Method

SOME PRACTICAL ASPECTS OF PARALLEL ADAPTIVE BDDC METHOD

Additive Average Schwarz Method for a Crouzeix-Raviart Finite Volume Element Discretization of Elliptic Problems

Algebraic Coarse Spaces for Overlapping Schwarz Preconditioners

MORTAR MULTISCALE FINITE ELEMENT METHODS FOR STOKES-DARCY FLOWS

IsogEometric Tearing and Interconnecting

A Mixed Nonconforming Finite Element for Linear Elasticity

PARTITION OF UNITY FOR THE STOKES PROBLEM ON NONMATCHING GRIDS

Lecture Note III: Least-Squares Method

Preconditioning of Saddle Point Systems by Substructuring and a Penalty Approach

Simple Examples on Rectangular Domains

A Robust Preconditioner for the Hessian System in Elliptic Optimal Control Problems

Adaptive Finite Element Methods Lecture Notes Winter Term 2017/18. R. Verfürth. Fakultät für Mathematik, Ruhr-Universität Bochum

A Subspace Correction Method for Discontinuous Galerkin Discretizations of Linear Elasticity Equations

Key words. Parallel iterative solvers, saddle-point linear systems, preconditioners, timeharmonic

Divergence-free or curl-free finite elements for solving the curl-div system

Hybrid (DG) Methods for the Helmholtz Equation

A local-structure-preserving local discontinuous Galerkin method for the Laplace equation

A DELTA-REGULARIZATION FINITE ELEMENT METHOD FOR A DOUBLE CURL PROBLEM WITH DIVERGENCE-FREE CONSTRAINT

Solving the curl-div system using divergence-free or curl-free finite elements

ANDREA TOSELLI. Abstract. Two-level overlapping Schwarz methods are considered for nite element problems

LECTURE # 0 BASIC NOTATIONS AND CONCEPTS IN THE THEORY OF PARTIAL DIFFERENTIAL EQUATIONS (PDES)

Parallel scalability of a FETI DP mortar method for problems with discontinuous coefficients

On the Use of Inexact Subdomain Solvers for BDDC Algorithms

Coupled FETI/BETI for Nonlinear Potential Problems

Domain Decomposition Methods for Mortar Finite Elements

A Finite Element Method for the Surface Stokes Problem

BETI for acoustic and electromagnetic scattering

Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains

ICES REPORT A Generalized Mimetic Finite Difference Method and Two-Point Flux Schemes over Voronoi Diagrams

for three dimensional problems are often more complicated than the quite simple constructions that work well for problems in the plane; see [23] for a

Nonoverlapping Domain Decomposition Methods with Simplified Coarse Spaces for Solving Three-dimensional Elliptic Problems

Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems

A WEAK GALERKIN MIXED FINITE ELEMENT METHOD FOR BIHARMONIC EQUATIONS

HEXAHEDRAL H(DIV) AND H(CURL) FINITE ELEMENTS

Scientific Computing I

Key words. preconditioned conjugate gradient method, saddle point problems, optimal control of PDEs, control and state constraints, multigrid method

Numerische Mathematik

On an Approximation Result for Piecewise Polynomial Functions. O. Karakashian

QUADRILATERAL H(DIV) FINITE ELEMENTS

Acceleration of a Domain Decomposition Method for Advection-Diffusion Problems

A posteriori error estimates for Maxwell Equations

SOME NONOVERLAPPING DOMAIN DECOMPOSITION METHODS

A FETI-DP Method for Mortar Finite Element Discretization of a Fourth Order Problem

Divergence-conforming multigrid methods for incompressible flow problems

arxiv: v1 [math.na] 28 Feb 2008

Traces and Duality Lemma

A MULTIGRID METHOD FOR THE PSEUDOSTRESS FORMULATION OF STOKES PROBLEMS

On Nonlinear Dirichlet Neumann Algorithms for Jumping Nonlinearities

Indefinite and physics-based preconditioning

Transcription:

MATHEMATICS OF COMPUTATION Volume 00, Number 0, Pages 000 000 S 0025-5718(XX)0000-0 BDDC ALGORITHMS WITH DELUXE SCALING AND ADAPTIVE SELECTION OF PRIMAL CONSTRAINTS FOR RAVIART-THOMAS VECTOR FIELDS DUK-SOON OH, OLOF B WIDLUND, STEFANO ZAMPINI, AND CLARK R DOHRMANN Abstract A BDDC domain decomposition preconditioner is defined by a coarse component, expressed in terms of primal constraints, a weighted average across the interface between the subdomains, and local components given in terms of solvers of local subdomain problems BDDC methods for vector field problems discretized with Raviart-Thomas finite elements are introduced The methods are based on a new type of weighted average and an adaptive selection of primal constraints developed to deal with coefficients with high contrast even inside individual subdomains For problems with very many subdomains, a third level of the preconditioner is introduced Under the assumption that the subdomains are all built from elements of a coarse triangulation of the given domain, and that the material parameters are constant in each subdomain, a bound is obtained for the condition number of the preconditioned linear system which is independent of the values and the jumps of these parameters across the interface between the subdomains Numerical experiments, using the PETSc library, are also presented which support the theory and show the effectiveness of the algorithms even for problems not covered by the theory Included are also experiments with Brezzi-Douglas-Marini finite element approximations TR2015-978 1 Introduction Let Ω be a bounded Lipschitz domain in R 3 We will work with the Hilbert space H(div;Ω), which is the subspace of vector valued functions u (L 2 (Ω)) 3 with divu L 2 (Ω) The space H 0 (div;ω) is the subspace of H(div;Ω) with a vanishing normal component on the boundary Ω We will consider the following problem: Find u H 0 (div;ω) such that (11) a(u, v) := (αdivudivv +βu v)dx = f vdx, v H 0 (div;ω) Ω We will assume that the coefficient α is a bounded, nonnegative function, that β is a strictly positive, bounded function, and that the right-hand side f (L 2 (Ω)) 3 We note that the norm of u H(div;Ω) for a domain with a unit diameter is given by (a(u,u)) 1/2 with α = 1 and β = 1 Ω 2010 Mathematics Subject Classification 65F08, 65F10, 65N30, 65N55 Key words and phrases domain decomposition, BDDC preconditioner, Raviart-Thomas finite elements, three-level preconditioners, adaptive selection of coarse spaces 1 c XXXX American Mathematical Society

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann The bilinear form (11) arises from the boundary value problem: (12) Lu := grad(αdivu)+βu = f in Ω, u n = 0 on Ω Here, n isthe outwardunit normalvectorof Ω The boundaryvalueproblem (12) is equivalent to a mixed formulation of a first order system least-squares problem as in [15] There are also other applications related to the H(div) space, eg, in iterative solvers for the Reissner-Mindlin plate, the sequential regularization method for the Navier-Stokes equations, and, possibly most importantly, mixed formulations of flow in porous media or Brinkman equations For more details, see [5,6,46,77] Domain decomposition methods of iterative substructuring type for solving large linear algebraic systems originating from elliptic partial differential equations have been studied extensively; see [70] Among these methods, the balancing Neumann- Neumann (BNN) and the finite element tearing and interconnecting (FETI) algorithms have proven quite successful; see, eg, [25, 27, 28, 41, 49] The balancing domain decomposition by constraints (BDDC) methods, introduced in [19], are modified BNN methods with a global component of the preconditioner determined by a set of primal continuity constraints between the subdomains For a pioneering analysis for scalar elliptic problems, see [50, 51] The BDDC methods are closely related to the dual-primal FETI (FETI-DP) methods [26,54] as are the earlier BNN methods to the one-level FETI methods; see [29] Thus, the spectra relevant to the performance of a BDDC and a FETI- DP algorithm will be the same, except possibly for eigenvalues of 0 and 1, for the same set of primal constraints; see [13,45,51] Hence, we can use results for BDDC methods to obtain results for FETI-DP methods and vice versa The main purpose of this paper is to construct and analyze a BDDC preconditioner for vector field problems discretized with Raviart-Thomas finite elements Iterative substructuring methods for Raviart-Thomas problems were first considered in [80] and we will use several auxiliary results from that study in the analysis of our method BNN, FETI, and FETI-DP methods for these problems were developed in [65, 68, 69] Overlapping Schwarz methods have also been introduced for vector field problems; see [5,30,57,58,66] Other methods such as multigrid methods have been applied successfully in [7, 33, 40] BDDC methods havealsobeen widelyextended to otherproblemssuchasflowin porous media in [71, 72], incompressible Stokes equations in [44], Reissner-Mindlin plate models in [9, 42], advection-diffusion problems in [76], and Helmholtz equations in [43] Multilevel BDDC methods were introduced in [19,64,73 75] and other discretization methods, eg, spectral element methods and discontinuous Galerkin methods, have been considered in [24, 60, 61] Recently, there has also been pioneering work on isogeometric element problems, see [10,11] We will explore the use of three-level BDDC, which was first successfully studied in [73 75] for other elliptic problems In the construction of a BDDC preconditioner, a set of primal constraints and a weighted averaging technique have to be chosen and these choices will very directly affect the rate of convergence Effective primal constraints are very simple to find for the Raviart-Thomas elements; we primarily need a no-net-flux condition across each subdomain boundary However, for problems with coefficients with large contrast, 2

BDDC FOR RAVIART-THOMAS VECTOR FIELDS additional primal constraints are sometimes very useful We will adaptively select the primal constraints to deal with such coefficients as pioneered in [62] For recent work in this field, see also [18,35,36,52,53] Thechoiceofaveragingprovestobeintricate Wewilluseanewtypeofweighted averaging technique, called deluxe scaling, introduced in [22] for three dimensional H(curl) problems The deluxe scaling technique allows us to reduce the analysis to individual subdomains Hence, a finite element extension theorem, which is needed in the analysis of other averaging techniques as in [38], is no longer needed In several previous studies on domain decomposition methods for vector field problems, [65, 68, 69, 80], the bound on the condition number of the preconditioned linear system depends on the ratio of the coefficients α and βh 2 where H is the diameter of the subdomain Such results have been developed for a BDDC algorithm for three-dimensional problems in H(curl); see [22] as well as [67] This limitation has been removed in several recent studies Among them is a paper on an iterative substructuring method for two-dimensional problems posed in H(curl), see[21] In morerecentworkresultshavebeen obtained forbddc deluxe and overlapping Schwarz algorithms again for two-dimensional problems posed in H(curl), see [16, 17] An overlapping Schwarz method for three-dimensional H(div) problems has also been developed, see [58] In that work, two sets of inequalities were developed to handle the mass-dominated and the divergence-dominated cases, respectively We know of no previous full analysis of BDDC or FETI-DP type methods for three-dimensional H(div) problems; see [59] for an announcement of some of our results We areableto providebddc methods with an upperbound on the condition number which is independent of the values and the jumps of the coefficients across the interface and to obtain condition number bounds which are polylogarithmic in the number of degrees of freedom of the individual subdomains or only depend on a tolerance parameter used for an adaptive selection of primal constraints While we are developing and testing our algorithms for quite general subdomains and material parameters, our proofs are restricted to subdomains which each is a union of a finite number of coarse elements and with material parameters constant in each subdomain The rest of this paper is organized as follows In section 2, we introduce some standard Sobolev spaces, a finite element approximation based on Raviart-Thomas elements, and decompositions of the interface spaces We introduce our BDDC algorithms for an interface problem and define various operators used to describe the algorithms in section 3; in its last subsection, we introduce adaptive 3-level BDDC We next provide some auxiliary results and a proof of our main result in section 4 Finally, section 5 contains results of numerical experiments, which extend our findings to irregular subdomains obtained by mesh partitioners Our algorithm is also shown to perform well for higher order Raviart-Thomas elements and Brezzi-Douglas-Marini (BDM) elements; cf [14] 2 Function and finite element spaces 21 Continuous spaces We will use the Sobolev spaces H 1 (Ω) and its trace space H 1/2 ( Ω), equipped with their norms and semi-norms for bounded domains The domains and the subdomains into which they are partitioned are assumed to 3

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann be Lipschitz Let H be the diameter of Ω Then, u 2 1;Ω := u 2 1;Ω + 1 H 2 u 2 0;Ω, u 2 1/2; Ω := u 2 1/2; Ω + 1 H u 2 0; Ω, where the L 2 -norm 0;Ω and the semi-norms 1;Ω and 1/2; Ω are defined by u 2 0;Ω := Ω u 2 dx, u 2 1;Ω := Ω u 2 dx, and u 2 1/2; Ω := Ω Ω u(x) u(y) 2 x y 3 dxdy, respectively The weights for the L 2 terms result from the standard definitions of the norms for a domain of diameter 1 and a dilation We can also easily extend these definitions to vector-valued cases The space H(div;Ω) is defined by with the scaled graph norm: H(div;Ω) := {u (L 2 (Ω)) 3 divu L 2 (Ω)} u 2 div;ω := divu 2 0;Ω + 1 H 2 u 2 0;Ω The normal component on the boundary Ω of any u H(div;Ω) belongs to H 1/2 ( Ω); see [14,55] The norm for the space H 1/2 ( Ω) is given by u n 1/2; Ω := u n,φ sup, φ H 1/2 ( Ω),φ 0 φ 1/2; Ω wheretheanglebracketsstandforthedualityproductofh 1/2 ( Ω)andH 1/2 ( Ω) We have the following trace theorem Lemma 21 There exists a constant C, which is independent of the diameter of Ω, such that, for all u H(div;Ω), u n 2 1/2; Ω C(H2 divu 2 0;Ω + u 2 0;Ω ) Proof This follows directly from Green s formula on a domain of unit diameter and by applying a dilation; see [80, Lemma 21] In developingourtheory, wealsoneedto workwith the H(curl;Ω) spacedefined by with the scaled graph norm: H(curl;Ω) := {u (L 2 (Ω)) 3 curlu (L 2 (Ω)) 3 } u 2 curl;ω := curlu 2 0;Ω + 1 H 2 u 2 0;Ω We finally introduce H 1 0(Ω), H 0 (div;ω), and H 0 (curl;ω) as the subspaces of H 1 (Ω), H(div;Ω), and H(curl;Ω) with a vanishing boundary value, a vanishing normal component, and a vanishing tangential component on Ω, respectively Remark 22 The curl operator for two dimensions is just a simple rotation of the divergence operator In two dimensions, we therefore can use results for H(div; Ω) to obtain results for H(curl;Ω) and vice versa 4

BDDC FOR RAVIART-THOMAS VECTOR FIELDS 22 Finite element spaces In this paper, we will develop our theory for tetrahedral elements but we note that our results are equally valid for hexahedral elements We remark that there are many useful tools, developed for such elements in [80], that can easily be modified for tetrahedral elements We first introduce a triangulation T h of Ω into tetrahedral elements We then decompose the domain Ω into N nonoverlapping subdomains Ω i of diameter H i We assume that each subdomain Ω i is a union of elements of the triangulation T h and that each Ω i is simply connected and has a connected boundary We also assume that the triangulation T h is shape regular with nodes matching across the interface between the subdomains The smallest diameter of the elements of Ω i is denoted by h i In our estimates, we will use the fraction H/h to denote H/h := max 1 i N {H i/h i } We also define the interface by ( N := Ω i )\ Ω and the local interfaces i by i=0 i := Ω i For our analysis, we will consider the lowest order Raviart-Thomas elements on the mesh T h ; see [14, Chapter 3] and [56] The Raviart-Thomas elements are conforming in H(div;Ω) and those of lowest order are defined by W := {u u K RT (K),K T h and u H(div;Ω)}, where the shape function RT (K) is given by four scalar parameters RT (K) := a 1 a 2 +b x y a 3 z for a tetrahedral element The degrees of freedom for an element K in T h are given by λ f (u) := 1 u n ds, f K, f f ie, the average values of the normal components over the faces of the element These four values determine a 1,a 2,a 3, and b A basis function of the lowest order Raviart-Thomas element space is supported in two elements of T h, with the value of the normal component on a face equal to 1 for one of the elements and 1 for the other, while vanishing on all other faces While our analysis will be limited to the lowest order Raviart-Thomas elements, we will consider higher order Raviart-Thomas elements and BDM elements in our experiments; such elements have additional degrees of freedom corresponding to moments over elements faces of the normal component and as well as degrees of freedom of the element interiors For details, see [14] The l 2 norm of the vector of the coefficients λ f (u) can be used to estimate the L 2 norm of u and of its divergence; the proof of the following lemma is elementary and a simple modification of [63, Proposition 631] See also [80, Lemma 31] 5

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann Lemma 23 Let K T h Then, there exist strictly positive constants, c and C, depending only on the aspect ratio of K, such that for all u W, c h 3 f λ f (u) 2 u 2 0;K C h 3 f λ f (u) 2 and f K where h f is the diameter of f divu 2 0;K C f K f K h f λ f (u) 2, The following lemma follows directly from Lemma 23 Corollary 24 (inverse inequality) Let K T h Then, there exists a constant C, depending only on the aspect ratio of K, such that for all u W, (21) h K divu 0;K C u 0;K, where h K is the diameter of K We also need Ŵ0, a finite element subspace of H 0 (div;ω): Ŵ 0 (Ω) := W(Ω) H 0 (div;ω) We will now consider the variational problem (11) We obtain the stiffness matrix A by restricting this problem to Ŵ0; A is symmetric and positive definite When developing our theory, we will need several additional spaces Let S be the space of continuous, piecewise linear functions on the tetrahedral elements, and let S 0 be the subspace of elements of S which vanish on Ω Let Q be the space of piecewise constant functions on the same elements Finally, let X be the space of the lowest order Nédélec elements We recall that the Lagrange P 1, Raviart- Thomas, and Nédélec spaces are conforming finite element spaces in H 1, H(div), and H(curl), respectively Let V h and F h be the set of vertices and faces of T h, respectively The interpolationoperatorsi h, andπ RT h forsufficientlysmoothfunctionsu H 1 andv H(div) onto S and W, respectively, are defined as follows: I h u := p V h u(p)φ P p and ΠRT h v := λ f (v)φ RT f, f F h where φ P p and φ RT f are the basis functions of the P 1 and Raviart-Thomas spaces associated with the node p and the element face f, respectively We also denote by Π h the L 2 projection operator onto Q We finally recall the following error estimate for the Raviart-Thomas interpolation operator and a commuting property Lemma 25 For any u (H 1 (Ω)) 3, we have u Π RT h u 0;Ω Ch u 1;Ω Proof See [12, Lemma 55] Lemma 26 Let u be sufficiently regular Then, the following commuting property holds: (22) div ( Π RT h u ) = Π h (divu) Proof See [12, Property 53] 6

BDDC FOR RAVIART-THOMAS VECTOR FIELDS We note that property (22) is a part of the discrete de Rham diagram described, eg, in [55, section 57] 23 The discrete problem The description of the BDDC algorithm and its analysis require the introduction of several spaces Let W (i) be the space of the lowest order Raviart-Thomas finite elements on Ω i with a zero normal component on Ω Ω i We decompose W (i) into two subspaces, an interior space W (i) I and an interface space W (i) The interface space W(i) is then decomposed into a primal space W (i) Π and and a dual space W(i) Hence, we have the following decompositions: W (i) := W (i) I W (i) := W(i) I W (i) W(i) Π We will also use the following product spaces: N N N W 0 := i=1 W (i), W I := W := N i=1 i=1 W (i), W Π := W (i) I, W := N i=1 W (i) Π i=1 W (i) We then have W 0 := W I W := W I W W Π Ingeneral, thefunctions inw havediscontinuousnormalcomponentsacrossthe interface while those of the finite element solutions are continuous; the subspace with continuous normal components will be denoted by Ŵ We also consider a space W, consisting of functions which are continuous at the primal degrees of freedom while being discontinuous elsewhere on the interface We can then decompose Ŵ and W into Ŵ ŴΠ and W ŴΠ, respectively, where Ŵ is the continuous dual variable subspace and ŴΠ is the continuous primal variable subspace We can now obtain the local stiffness matrix by restricting the bilinear form to Ω i, ie (23) a i (u,v) := (αdivudivv +βu v)dx, Ω i and replacing H(div;Ω i ) by the finite element space W (i) It is convenient to make a change of variables by introducing a basis for the primal degrees of freedom and a complementarybasisforthe dualsubspacew (i) The presentationofthe algorithms and the theoretical results are considerably simplified using the new basis Here we can follow the recipes of [45, subsection 33] closely We note that there is also some evidence that such a change of variables enhances the numerical stability of BDDC and FETI-DP algorithms; see [37] However, the change of variables can come with a loss of sparsity and there are alternative ways of implementing these algorithms, cf [19] After this change of variables, we find that the contributions from the subdomain Ω i to the stiffness matrix and to the load vector can be written as II I IΠ f (i) T I I A(i) Π and f (i) f (i) Π T IΠ T Π ΠΠ 7

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann We then obtain the global linear system of algebraic equations by sub-assembling these local contributions: (24) A u I u = A II A I A IΠ A T I A A Π u I u = f I f, u Π A T IΠ AT Π A ΠΠ u Π f Π where u I W I, u Ŵ, and u Π ŴΠ 3 The BDDC methods 31 Some useful operators We will now define several operators which perform restrictions, extensions, scalings, and averaging between different spaces We first consider the restriction operators R (i) maps the space Ŵ to the subdomain subspace W (i) Similarly, we can define R(i) : W W (i) Moreover, R (i) : W W (i) and R(i) Π : ŴΠ W (i) Π map global interface vectors defined on to their components on i R and R Π are the restriction operators from the intermediate space W to W and ŴΠ, respectively Similarly, we can define the restriction operator R (i) and R(i) all the R (i) of R Π and all the from W(i) to W(i) R and R are the direct sums of, respectively Furthermore, R : Ŵ W is the direct sum (i) R, where R Π represents the restriction from Ŵ to ŴΠ and R (i) maps the space Ŵ into W (i) We next introduce scaling matrices, D (i), acting on the degrees of freedom associated with the i They are combined into a block diagonal matrix and should provide a discrete partition of unity, ie, D (1) (31) R T D (2) D (N) R = I We can now define scaled operators R (i) D, := D(i) R (i) the scaling matrices D (i) Other locally scaled operators R (i) D, := R(i) R(i) the direct sum of R Π and all the by pre-multiplying R(i) by are defined by R (i) D, D, We next consider a globally scaled operator R D, defined by We note that R (i) D, R T R D, = R T D, R = I This identity shows that the averaging operator E D : W Ŵ given by (32) E D := R RT D, is a projection E D provides a weighted average of the subdomain interface values across the interface We will provide details on our choice of scaling matrices in subsection 33 32 A block factorization The following block factorization of the inverse of the stiffness matrix is associated with the elimination of the interior degrees of freedom of all subdomains: (33) A 1 = [ I A 1 II A ][ I A 1 0 I 8 II 0 0 Ŝ 1 ][ ] I 0 I A T IA 1 II

BDDC FOR RAVIART-THOMAS VECTOR FIELDS Here A I := (A I A IΠ ) The Schur complement Ŝ, with respect to the interior unknowns, is positive definite and it is obtained by sub-assembly N (34) Ŝ := R T S R = R (i)t S (i) R(i), where S is the direct sum of the local Schur complements defined by and where S (i) i=1 := A(i) A(i)T I A(i) 1 II := [ T Π A(i) Π ΠΠ I, For the model problem (12), the local Schur complements S (i) are always positive definite A preconditioner for (24) is then defined by using (33), after replacing the inverse of the Schur complement Ŝ by the action of a suitable interface preconditioner 33 The BDDC algorithm In order to describe the BDDC algorithm, we need to define a partially assembled Schur complement, defined on W, by S := R T S R After eliminating the components of the right-hand-side corresponding to the interior unknowns, we obtain u (1) 0 I u (1) R à (1) R S u u (N) =, 0 I u (N) R (N) R S u u Π R Π S u where à is the partially assembled stiffness matrix on W I W We note that solving this linear system is much less expensive than working with the fully assembled linear system since the number of primal variables is much smaller than the total number of interface variables We also note that the fully assembled Schur complement Ŝ can be obtained by an additional sub-assembly step, ie Ŝ = R T S R The BDDC preconditioner has the following form: S 1 i=1 ] M 1 := R T D, S 1 R D, Here, can be obtained by using a block Cholesky factorization of à as in [45, section 4]: (35) S 1 = R N [ ] [ ] 1[ T 0 R (i)t II ] 0 I R (i) R +ΦS 1 ΠΠ ΦT with Φ := R T Π R T N [ i=1 T I 0 R (i)t ] [ II 9 T I I ] 1 [ IΠ Π ] R (i) Π

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann and where (36) N S ΠΠ := i=1 R (i)t Π ΠΠ [ T IΠ T Π ] [ II T I I ] 1 [ IΠ Π ] R (i) Π The first term of (35) consists of uncoupled subdomain corrections realized by means of local Neumann solves, with solutions constrained to vanish at the primal degrees of freedom; the second term is the coarse-level part of the preconditioner associated with the primal space and provides the global exchange of information which is needed to obtain a scalable preconditioner for the conjugate gradient method In subsection 37, we will also explore the option of approximating the inverse of S ΠΠ by invoking the BDDC algorithm once more, thus introducing a three-level BDDC algorithm 34 Interface equivalence classes Equivalence classes of degrees of freedom related to the interface between the subdomains play an important role in the design, analysis, and parallel implementation of domain decomposition algorithms such as BDDC In the case of div-conforming Raviart-Thomas and BDM elements, the situation is very simple since each interface degree of freedom is associated with an element face common to only two elements Thus, any equivalence class will be given by a subset of the degrees of freedom on the intersection of the boundaries of two neighboring subdomains We will refer to such an interface class as a subdomain face In order to avoid disconnected subdomain faces in our experiments, we will consider two degrees of freedom connected and belonging to the same subdomain face only if there exists a path between their element faces which passes from element to element of the subdomain face by crossing an edge between element faces 35 Deluxe scaling In order to complete the description of the algorithm, we need to define the weighted averaging operators D (i) Conventional weighted averaging techniques, known as stiffness and ρ scalings, are described in [19, 51] We will show, in section 5, that there are cases for which these conventional techniques perform poorly for (12), since these methods are designed for constant coefficients or for one variable coefficient For more than one variable coefficient, as for the problem considered here, we need a different approach and we will use the deluxe scaling, introduced in [22] for H(curl) problems and further considered in [23] A survey of other studies using deluxe scaling is given in [78] We will work with the principal minors, associated with subdomain faces F, of the subdomain stiffness matrices Two local stiffness matrices associated with F are given by principal minors of the subdomain stiffness matrices [ A (k) II A (k)t IF A (k) IF A (k) ], k = i,j, and the two Schur complements associated with F by S (k) := A(k) A(k)T IF A(k) II 10 1 A (k) IF, k = i,j

BDDC FOR RAVIART-THOMAS VECTOR FIELDS The scaling oper- ( ) 1S We will use the scaling matrices D (i) j := S (i) +S(j) (i) ator D (i) is then given by (37) D (i) := D (i) F 1 D (i) F 2 D (i) F k, where F 1,,F k are the subdomain faces of Ω i We remark that there are two scaling matrices for each subdomain face, and that it is easy to show that the partition of unity condition (31) is satisfied, ie, that N i=1 R (i) T D (i) R (i) = I Thus, a face component of the averaging operator E D is defined by w F := (E D w) F := (S (i) +S(j) ) 1 (S (i) w(i) F +S(j) w(j) F ) (resp w(j) F ) is the the restriction of w(i) W (i) (w (j) W (j) ) to the Here w (i) F face F The action of (S (i) +S(j) ) 1 can be implemented by solving a Dirichlet problem on Ω i F Ω j, where F is the face between the two subdomains This can add significantly to the cost In the economic variant of deluxe scaling (e-deluxe), we replacethis largedomainbyathin domainbuilt fromoneorafew layersofelements next to the face and this often results in very similar performance; see, eg, [23] and section 5 Instead of solving such a Dirichlet problem, in our experiments, we exploit the Schur complement feature of the numerical factorization package MUMPS [2], a package which explicitly provides the local Schur complement matrix S (i) when factoring the subdomain problem Our approach has the further advantage that the Dirichlet solver can be reused in the static condensation step in (33), and that the Schur complement solver can be reused when computing the subdomain correction in (35) Therefore, a single factorization step suffices to set up the preconditioner Differently from [23], in our experiments with the e-deluxe variant, we consider the principal minors of the Schur complement obtained by eliminating all the interior degreesfreedom in a layerof elements next to all of the subdomain interface i However, when using e-deluxe, the Dirichlet and Schur complement solvers cannot be reused, and additional factorizations are needed to set up the preconditioner We note that our implementation of e-deluxe is quite similar to the one analyzed in [36] 36 Basic BDDC deluxe estimates The core of any estimate for a BDDC algorithm is an estimate of the norm of the averagingoperator E D By an algebraic argument, known for FETI DP since 2002, we have κ(m 1 Ŝ ) E D S, see [39] or [45] The analysis of any BDDC deluxe algorithm can be reduced to bounds for individual subdomains Arbitrary jumps in the coefficients across can then be accommodated 11

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann Instead of developingan estimate for E D, we will workwith P D := I E D Thus, we estimate the S (i) normof RT F (w(i) F w F), instead of(rf T w F) T S (i) RT F w F Here R F denotes the restriction to the face F By elementary algebra, we find that w (i) F w F = (S (i) +S(j) ) 1 S (j) (w(i) F w(j) F ) More algebra gives, by using that S (i) := R FS (i) RT F, (R T F(w (i) F w F)) T S (i) (RT F(w (i) F w F)) = (w (i) F w(j) F )T S (j) (S(i) +S(j) ) 1 S (i) (S(i) +S(j) ) 1 S (j) (w(i) F w(j) F ) AddingasimilarcontributionfromΩ j,weobtain,followingpechsteinanddohrmann [62], that the relevant expression of the energy is (w (i) F w(j) F )T (S (i) 1 +S (j) 1 ) 1 (w (i) F w(j) F ) The matrix of this quadratic form is a parallel sum, and we will use the notation cf [4] We easily find that A : B := (A 1 +B 1 ) 1 ; (38) (w (i) F w(j) F )T (S (i) : S(j) )(w(i) F w(j) F ) 2(w (i) F w Π) T (S (i) : S(j) )(w(i) F w Π)+2(w (j) F w Π) T (S (i) : S(j) )(w(j) F w Π) where w Π is an arbitrary element of the primal space Since S (i) : S(j) S(i) and S (i), each of these terms can be estimated by an expression which is : S(j) S(j) local to only one subdomain Let w (i) F := w(i) F w Π There now remains to estimate w (i)t F (S(i) : S(j) )w(i) F by the energy of w (i) For this, we will need the energy-minimizing extension of any finite element function defined on F The relevant matrix is (39) S(i) := S(i) S(i)T F F S(i) 1 F F S(i) F F S (i) defined by an off- Here S (i) F F is the principal minor of S(i) with respect to i \ F and S (i) F F diagonal block of S (i) We need to establish a bound for w (i)t F (S(i) : S(j) )w(i) F and to show that by w(i)t F (i) (j) ( S : S )w(i) F w (i)t (i) (j) F ( S : S )w(i) F w(i)t S (i) w(i), where w (i) is an arbitrary extension of the values of w (i) F on the face F to the rest of i In standard BDDC theory, as in section 4, the required estimates is obtained by using a face lemma, cf [70, subsection 463], where such a result is established for constant coefficients in each subdomain, polyhedral subdomains, and scalar elliptic problems For an adaptive algorithm, this result is replaced by the use of a generalized eigenvalue problem For BDDC deluxe, we first generate elements for the primal space for a face by solving a generalized eigenvalue problem (310) S (i) : S(j) (i) (j) ψ = ν S : S ψ The primal space are then generated by the eigenvectors for a few of the largest eigenvalues of (310) and making ( S (i) : S(j) )(w(i) F w(j) F ) orthogonal to these 12

BDDC FOR RAVIART-THOMAS VECTOR FIELDS eigenvectors The complementary dual space is then spanned by the remaining eigenvectors which is then orthogonal to the primal space with respect to the matrix (j) : S This orthogonality condition allows us to conclude that S (i) w (i)t (i) (j) F ( S : S )w(i) S (i) F w(i)t F ( S (i) (j) : S )w(i) F (j) (i) (i) (j) (j) We also have to use that : S S and S : S S We can now bound the right-hand side of (38) by νtol F times the energy of w contributed by the two subdomains Ω i and Ω j where νtol F is the largest eigenvalue associated with the eigenvectors not used in deriving the primal constraints Thus, we have the following result: Lemma 31 If νtol F is the largest eigenvalue of (310) ignored when we generate our primal constraints from eigenvectors of that eigenvalue problem, we have where a i (, ) is given by (23) (P D w) F 2 S 2 ν F tol(a i (w,w)+a j (w,w)), By elementary estimates, we can then obtain the following result: Theorem 32 κ(m 1 Ŝ ) P D S 4 ν tol N 2 F, where ν tol := max F ν F tol and N F equals the maximum number of faces of any subdomain We note that there is evidence that adaptive primal spaces in deluxe BDDC methods are smaller compared to those generated using conventional point-wise scalings [35, 36] 37 Three-level BDDC Coarse problem solvers in BDDC methods, as in all others two-level Domain Decomposition methods, can create a bottleneck for the algorithm when there is a large number of subdomains and/or many coarse degrees of freedom per subdomain, as could be the case when an adaptive BDDC algorithm is used A multilevel extension of the BDDC algorithm is readily available given the unassembled nature of the coarse problem(36); for a pioneering analysis see[73,74] In a three-level BDDC algorithm, the solution of the coarse problem is replaced by the application of a BDDC preconditioner at a coarser level; this leads to highly scalable BDDC preconditioners, provided a suitable coarse space for the second level is found In fact, as proved in [20], the use of approximate coarse solvers could degrade the condition number of the BDDC algorithm and increase the number of iterations of the Krylov solver But an upper bound for the condition number of a three-level BDDC method holds κ(m 1 3levelŜ) κ(m 1 ΠΠ S ΠΠ)κ(M 1 Ŝ ), where M 1 ΠΠ is the BDDC preconditioner for S ΠΠ Differently from earlier work, we will construct an adaptive coarse problem for S ΠΠ and we then will be able to control κ(m 1 ΠΠ S ΠΠ) and construct highly scalable and robust three-level BDDC preconditioners We note that the design of effective primal spaces for coarser levels of H(div) problems could be a subject of future development of theory 13

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann 4 Technical tools and the main theoretical result When we turn to the development of our theory, we need to impose additional conditions on the geometry of the subdomains and the values of the two material parameters α and β Thus, each subdomain Ω i will be assumed to be convex and a union of a finite number of shape-regular tetrahedral elements of a coarse triangulation T H We also assume that α and β are constant in each subdomain while we allow arbitrarily large coefficients jumps across the interface We will denote by α i and β i the values of these coefficients taken on subdomain Ω i We note that these assumptions have been used in the development of theory for many domain decomposition algorithms of the iterative substructuring family, see, eg, [70, Chapters 4 6] Our primal variables will be defined by the common average of the normal component of the solution over the subdomain faces This means that a no-net-flux condition F w (i) n ds = 0 is imposed for each subdomain face In our proofs, we also need some standard tools for the space S(Ω i ), which we can borrow from [70, subsection 46] These results are related to the subdomain faces and their boundaries of Ω i We note that the proof of [70, Lemma 416] is not satisfactory but that a correct proof is now available in [23, Section 3] Lemma 41 There are functions ϑ F S(Ω i ) and ϑ F S(Ω i ) such that for all nodes on the closure of F ϑ F +ϑ F = 1, ϑ F = 0 on Ω i \F, and ϑ F = 0 at all nodes of Ω i except those on F Moreover, for any u S(Ω i ), there exists a constant independent of h i and H i, such that and I h (ϑ F u) 2 1;Ω i C(1+logH i /h i ) u 2 1;Ω i I h (ϑ F u) 2 1;Ω i C(1+logH i /h i ) 2 u 2 1;Ω i In addition, the following estimate holds: u 2 0; F C(1+logH i/h i ) u 2 1;Ω i We next introduce a stable operator, which provides a divergence-free extension Lemma 42 (divergence free extension) There exists an extension operator H i from the normal trace space of W (i) to W(i), such that, for all u W (i) (, Hi µ) n = µ, div H i µ = 0, where µ := u n Moreover, H i µ C µ 1/2; Ωi 0;Ωi Proof A proof of this result is provided for the lowest order Raviart-Thomas element defined on hexahedral meshes in [80, Lemma 43] and [79, Lemma 26] A proof for the case of tetrahedral elements can be obtained using exactly the same arguments 14

BDDC FOR RAVIART-THOMAS VECTOR FIELDS We then have the following estimate for the discrete harmonic extension, ie, the extension of a given normal trace with the minimal energy For more details, see [70, section 102] and [58, section 31] Corollary 43 (discrete harmonic extension) Let H i be the energy minimizing discrete harmonic extension For all u W (i), we have (H i µ) n = µ := u n Furthermore, α i divh i µ 2 0;Ω i +β i H i µ 2 0;Ω i Cβ i µ 2 1/2; Ω i Proof H i istheminimal-energyextensionoperatorforagivensubdomaininterface Therefore, we have α i divh i µ 2 0;Ω i +β i H i µ 2 div 0;Ω i α i Hi µ 2 +β i Hi µ 2 0;Ω i 0;Ω i Since div H i µ = 0, we have, by using Lemma 42, α i divh i µ 2 0;Ω i +β i H i µ 2 0;Ω i Cβ i µ 2 1/2; Ω i We next consider a coarse interpolation operator Π RT H space on the coarse mesh T H onto the Raviart-Thomas Lemma 44 (stability of the coarse interpolant) Let K T H For all u W, we have the following estimates: div ( Π RT H u) 2 0;K divu 2 0;K and ) Π RT H u ((1+logH/h) u 2 C 2 0;K 0;K +H2 K divu i 2 0;K, where H K is the diameter of K Here, the constant C depends only on the aspect ratio of K and the elements of T h Proof Full proofs of this result are given in [79, Lemma 24] and [80, Lemma 41] for hexahedral meshes These proofs can be translated line by line to also hold for tetrahedral meshes We then have the following: Lemma 45 Let u F W (i) with λ f (u F ) = 0, f Ω i \F Furthermore, let u H F := H i( Π RT H u F n ) and assume that α i β i Hi 2 We then have for the bilinear form (23) ( (41) a i u H F,u H ) F Cai (u F,u F ), where C is independent of H i, h i, α i, and β i Proof Wewill modifythe proofof[58,lemma56] WefirstassumethatF consists of only one face of a coarse element K T H Ω i Let Ω di i,f Ω i be the set of all points which are within a distance d i of F Let χ F be a piecewise linear scalar cut-off function, which has the value 1 on F and vanishes in Ω i \Ω di i,f for some h i d i H i Moreover, χ F 1 and χ F C/d i We next consider a coarse basis function related to a discrete harmonic extension This basis function φ F is obtained from the standard basis function φ RT F RT 15

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann ( RT and is given by φ F := H i χf φ RT F n) We note that φ RT 2 F 0;Ω i CHi 3 and divφ RT 2 F 0;Ω i CH i We then define the function ũ H F as follows: (42) ũ H F := λ F (u F ) φ RT F In order to estimate the energy of ũ H F, we will estimate the coefficient λ F (u F ) and RT the energy of φ F, separately We first estimate the coefficient By the divergence theorem, and the fact that (χ F u F ) n vanishes on Ω di i,f \F, F λ F (u F ) = F λ F (χ F u F ) = (χ F u F ) nds F = div(χ F u F ) dx (χ F u F ) nds Ω d i i,f Ω d i i,f \F = div(χ F u F ) dx Ω d i i,f By using the Cauchy-Schwarz inequality and the shape-regularity of the elements of T H, we obtain λ F (u F ) 2 C d i H 2 i C d i H 2 i C d i H 2 i div(χ F u F ) 2 0;Ω d i i,f ) ( χ F 2 divu F 20;Ωi + χ F 2 u F 20;Ωi divu F 2 0;Ω i +C 1 Hi 2d u F 2 0;Ω i i We can now obtain our estimate of the basis function From the minimal energy property, we find div α φrt i F 2 +β φrt i F 2 0;Ω i 0;Ω i ( ( )) α i div Π RT h χf φ RT F 2 ( ( )) 0;Ω i +β i Π RT h χf φ RT F 2 0;Ω ( i Cα i di +Hi 2 /d i) +Cβi Hi 2 d i Cα i Hi 2 /d i +Cβ i Hi 2 d i Hence, we have (43) (ũh a i F,ũ H ) λf F = αi (u F )(div ) RT 2 λf φ F +β i (u F ) 0;Ω i ( φrt ) 2 F 0;Ω i C ( α i +β i d 2 i) divuf 2 0;Ω i +C ( α i /d 2 i +β i) uf 2 0;Ω i Let d i = max{ α i /β i,h i } and recall that h i d i H i By using (43) and Lemma 24, we obtain ( a i u H F,u H ) (ũh F ai F,ũ H ) F Cai (u F,u F ) If F is the union of faces of several elements of T H, we should replace the basis function in formula (42) by a sum of such functions associated with the relevant faces of the coarse elements 16

BDDC FOR RAVIART-THOMAS VECTOR FIELDS We next introduce a partition of unity associated with the faces of an individual subdomain Ω i as in [70, Chapter 1021], F Ω i ζ F = 1,ae on Ω i \ Ω, where (44) ζ F (x) := { 1, x F 0, x Ω i \F We then have the following estimates for the subdomain face components; we recall that for u i W (i), F u i nds = 0 for each subdomain face F Ω i Lemma 46 For any u i W (i) and for any uh i W (i) Π, let µ i := u i n, µ F := ζ F µ i, and µ H i := u H i n Then, the following estimate holds: µ F 2 1/2; Ω i C(1+logH i /h i ) ( (1+logH i /h i ) µi +µ H i holds, where C is independent of µ H i, H i, and h i ) 2 1/2; Ω i + µ i 2 1/2; Ω i Proof This result is closely related to [80, Lemma 44] and [79, Lemma 27], which are results for hexahedral meshes Carefully checking all the arguments, we find that the same results are also valid for tetrahedral meshes Unlike for the gradient operator, it is quite complicated to classify the kernel and the range of the curl and divergence operators The discrete regular decompositions given in [32] provide useful tools to analyze problems posed in H(curl) and H(div) We can then apply techniques developed for H 1 -functions by using the following result Lemma 47 (Hiptmair-Xu decomposition) Let D Ω be a convex polyhedron Then, for all v h W(D), there exist Ψ h S(D), q h X(D), and ṽ h W(D) such that v h = ṽ h +Π RT h (Ψ h )+curlq h, and h 1 i ṽ h 2 0;D h 2 1;D C divv h 2 0;D, (46) curlqh 2 0;D + Ψ h 2 0;D C v h 2 0;D Proof See [32, Lemmas 51 and 52] We note that this important paper, [32], was preceded by [31], which concerns another application of the same decomposition Lemma 48 For any u i W (i), there exist v i,f W (i) and vi,f H W(i) Π such that { λf (v (47) i,f ) = λ f (u i ) if f F; λ f (v i,f ) = 0 if f Ω i \F and (48) Furthermore, { ( ) ( λf v H i,f = λf Π RT H u ) ( ) i λ f v H i,f = 0 if f F; if f Ωi \F a i ( vi,f v H i,f,v i,f v H i,f) C(1+logHi /h i ) 2 a i (u i,u i ), 17

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann where C is independent of α i, β i, H i, and h i Proof We will only consider the case where α i β i H 2 i since for β i H 2 i α i, the proof is straightforward by using Corollary 43 and Lemmas 46, 21, and 44; for more details, see [80, Section 5] By using Lemma 47, we can find ũ i, Ψ i, and q i such that (49) u i = ũ i +Π RT h (Ψ i )+curlq i We note that div(curlq i ) = 0 and that (45) and (46) provide bounds for the different terms We first consider ũ i We define ũ i,f := f F λ f (ũ i )φ RT f, where φ RT f is the Raviart-Thomas basis function associated with the face f By using Lemmas 23, 24, and 47, we have (410) ũ i,f 2 0;Ω i C f F h 3 iλ f (ũ i ) 2 C ũ i 2 0;Ω i and (411) divũ i,f 2 0;Ω i C f F Hence, from (410) and (411), Ch 2 i divu i 2 0;Ω i C u i 2 0;Ω i h i λ f (ũ i ) 2 C h 1 i ũ i 2 0;Ω i C divu i 2 0;Ω i (412) a i (ũ i,f,ũ i,f ) Ca i (u i,u i ) We also define ũ H i,f := H i( Π RT H ũi,f n ) By using Lemma 45 and (412), we obtain (413) a i (ũh i,f,ũ H i,f) Cai (ũ i,f,ũ i,f ) Ca i (u i,u i ) We note that by construction, ũ i,f ũ H i,f satisfies the no-net-flux condition We will similarly construct pairs of functions originating from the other terms of the right-hand side of (49) such that the differences satisfy the no-net-flux condition We next consider the second term Π RT h (Ψ i) of (49) Let Ψ i,f := I h (ϑ F Ψ i ) By using Lemmas 41 and 47, we obtain and Ψ i,f 2 0;Ω i C Ψ i 2 0;Ω i C u i 2 0;Ω i divψ i,f 2 0;Ω i C Ψ i,f 2 1;Ω i C(1+logH i /h i ) 2 Ψ i 2 1;Ω i C(1+logH i /h i ) 2 divu i 2 0;Ω i Moreover, by using Lemma 25, an inverse estimate, and Lemma 26, we obtain Π RT h (Ψ i,f ) 2 2 Ψ 0;Ω i,f 2 i 0;Ω i +2 Ψ i,f Π RT h (Ψ i,f) 2 0;Ω i and Therefore, (414) 2 Ψ i,f 2 0;Ω i +Ch 2 i Ψ i,f 2 1;Ω i C Ψ i,f 2 0;Ω i, divπ RT h (Ψ i,f ) 2 0;Ω i = Π h (divψ i,f ) 2 0;Ω i divψ i,f 2 0;Ω i Π RT h (Ψ i,f ) 2 0;Ω i C u i 2 0;Ω i 18

BDDC FOR RAVIART-THOMAS VECTOR FIELDS and (415) divπ RT h (Ψ i,f ) 2 0;Ω i C(1+logH i /h i ) 2 divu i 2 0;Ω i Hence, from (414) and (415), we obtain ( (416) a i Π RT h (Ψ i,f ),Π RT h (Ψ i,f) ) C(1+logH i /h i ) 2 a i (u i,u i ) ( Π RT Let Ψ H i,f := H i( Π RT H (417) a i ( Ψ H i,f,ψ H i,f h Ψ ) i,f) n By using Lemma 45 and (416), we have ) ( Cai Π RT h (Ψ i,f ),Π RT h (Ψ i,f ) ) C(1+logH i /h i ) 2 a i (u i,u i ) Let g i, F := Π RT h (I h (ϑ F Ψ i )) and Ψ i, F := f F λ f (g i, F )φ RT f By using Lemmas 23, 25, 41, and 47, an inverse estimate, and an estimate for the P 1 basis functions of S(Ω i ), we obtain (418) Ψ i, F 2 0;Ω i C f F h 3 i λ f (g i, F ) 2 C g i, F 2 0;Ω i C I h (ϑ F Ψ i ) 2 0;Ω i C Ψ i 2 0;Ω i C u i 2 0;Ω i and divψ i, F 2 0;Ω i C f F h i λ f (g i, F ) 2 C 1 h 2 i g i, F 2 0;Ω i (419) C 1 h 2 i I h (ϑ F Ψ i ) 2 0;Ω i C Ψ i 2 0; F C(1+logH i /h i ) Ψ i 2 1;Ω i C(1+logH i /h i ) divu i 2 0;Ω i Hence, combining (418) and (419), we obtain (420) a i (Ψ i, F,Ψ i, F ) C(1+logH i /h i )a i (u i,u i ) Let Ψ H i, F := H i( Π RT H Ψ i, F n ) From Lemma 45 and (420), we obtain a i ( Ψ H i, F,Ψ H i, F) Cai (Ψ i, F,Ψ i, F ) C(1+logH i /h i )a i (u i,u i ) We finally consider the curlq i term of (49) Let ri H := Π RT H (curlq i), r i := (curlq i ) n, and ri H := ri H n Moreover, let ri,f H := H ) i( ζf ri H and ri,f := ( ( )) H i ζf ri ri H From Corollary 43 and Lemma 46, we obtain (421) a i (r i,f,r i,f ) ( ) Cβ ζf i ri ri H 2 1/2; Ω i ( Cβ i (1+logH i /h i ) (1+logH i /h i ) r i 2 1/2; Ω i + ) ri ri H 2 1/2; Ω i ( Cβ i (1+logH i /h i ) (1+logH i /h i ) r i 2 1/2; Ω i + ) r H 2 i 1/2; Ω i Cβ i (1+logH i /h i ) 2 r i 2 1/2; Ω i +Cβ i (1+logH i /h i ) r H i 2 1/2; Ω i 19

Duk-Soon Oh, Olof B Widlund, Stefano Zampini, and Clark R Dohrmann We note that divri H 0;Ωi div(curlq i ) 0;Ωi = 0 from Lemma 44 Hence, by using Lemmas 21 and 44, we obtain ( r H i 2 1/2; Ω i C Hi 2 divr H i 2 0;Ω i + ) r H i 2 0;Ω i = C r H i 2 0;Ω i ) (422) C ((1+logH i /h i ) curlq i 20;Ωi +Hi 2 div(curlq i ) 20;Ωi = C(1+logH i /h i ) curlq i 2 0;Ω i and (423) ) r i 2 1/2; Ω i C (Hi 2 div(curlq i) 20;Ωi + curlq i 20;Ωi = C curlq i 2 0;Ω i Therefore, by combining (421), (422), (423), and Lemma 47, we obtain (424) a i (r i,f,r i,f ) C(1+logH i /h i ) 2 β i curlq i 2 0;Ω i We can now define v i,f as follows: and v H i,f by C(1+logH i /h i ) 2 β i u i 2 0;Ω i C(1+logH i /h i ) 2 a i (u i,u i ) v i,f := ũ i,f +Π RT h (Ψ i,f )+Ψ i, F +r H i,f +r i,f v H i,f := ũh i,f +ΨH i,f +ΨH i, F +rh i,f Then, v i,f and vi,f H satisfy the conditions (47) and (48), respectively Furthermore, we obtain the following estimate by using (412), (413), (416), (420), (417), (421), and (424): a i ( vi,f v H i,f,v i,f v H i,f) C(1+logHi /h i ) 2 a i (u i,u i ) We are now ready to prove our main result Theorem 49 (condition number estimate) The condition number of the preconditioned operator M 1 Ŝ satisfies ) (425) κ (M 1 Ŝ C(1+logH/h) 2 Proof We first recall that the left hand side of (38) provides an upper bound of the square of the S (i) norm of a face component of P D w We again note that S (i) : S(j) S(i) and that Lemma 48 provides us with a bound for the right-hand side of (38) 5 Numerical results All results in this section, except when otherwise stated, are for problems on the domain Ω = [0,1] 3 The triangulation of Ω and the assembly of the subdomain matrices are performed using the C++ library DOLFIN, [48], which is part of the FENICS project, [47] ParMETIS, [34], is used to decompose the meshes and always results in irregular subdomains The linear system (24) is solved using the Preconditioned Conjugate Gradient (PCG) method as implemented in the Portable and Extensible Toolbox for Scientific Computing (PETSc), [8], with the 20

BDDC FOR RAVIART-THOMAS VECTOR FIELDS BDDC preconditioner implemented in PETSc by the third author, see [81], for which each subdomain is assigned to a different MPI process The right-hand sides are always chosen randomly and a relative residual reduction of 10 8 is used as a stopping criterion The MUMPS, [2], Cholesky factorizations are used for the subdomain solvers and to compute the local Schur complements The current PETSc implementation, which uses dense linear algebra kernels, see [3], is employed for solving (310) for each subdomain face and the small dense blocks S (i) are inverted explicitly Formula (39) is not used in practice; instead, each S (i) is first explicitly inverted, and the principal minors S (i) 1 F F are then extracted Unless otherwise stated, we use the averages of the normal component over each subdomain face as our primal constraints as provided by the no-net-flux condition The quadrature weights for these constraints can easily be obtained by using the divergence theorem, ie, u n = div u i Ω i For further details on the BDDC implementation and for additional numerical results for our H(div) model problem, see [81] Our large scale numerical results havebeenobtainedonthecrayxc40shaheenofkaust,ranked9thinthetop500 list as of November 2015, and which features 6192 dual 16-core Haswell processors clocked at 23 Ghz and equipped with 128GB of DRAM per node, for a total of 198,144 cores 51 Example 1 (Common average constraints) We decompose the unit cube Ω into 64 irregular subdomains using ParMETIS and assume that the coefficients α and β have jumps only across the interface between the subdomains We have conducted two different sets of experiments for the lowest order Raviart-Thomas and BDM elements For the first set of experiments, we report on the condition numbers and the number of iterations for different values of H/h, with a constant value of β while varying α between the subdomains Here and in what follows, H/h is defined as the maximum of the ratio of the diameter of a subdomain and the smallest diameter of any of its elements The subdomains are subdivided into even and odd subdomains according to their MPI rank In the second set of experiments, the value of α is constant while β varies As predicted by the theory, the experimental results in Tables 1 and 2 confirm that the condition number of deluxe BDDC is insensitive to the jumps of the coefficients for the lowest order Raviart- Thomas elements As indicated by the results in Tables 3 and 4, the no-net-flux condition is also sufficient to obtain condition number independence for the lowest order BDM elements 52 Example 2 (The effect of using conventional averaging techniques) In this subsection, we report on some numerical experiments comparing deluxe BDDC and conventional scaling techniques We have performed four different types of experiments with the same set of coefficient distributions using the lowest order Raviart-Thomas and BDM elements The first set of experiments, named deluxe, is based on the weighted averaging techniques as described in (37) In the second, the economic variant of the deluxe scaling ( e-deluxe ) is used with one layer of elementsnextto TheresultsinthethirdandfourthcolumnsofTables5and6are obtained by using conventional methods as described in [19, 51] In the stiffness case, the scaling is based on the diagonal entries of the subdomain matrices, whereas 21