Cholesky factorisations of linear systems coming from a finite difference method applied to singularly perturbed problems

Similar documents
Cholesky factorisation of linear systems coming from finite difference approximations of singularly perturbed problems

Parameter robust methods for second-order complex-valued reaction-diffusion equations

ROBUST SOLUTION OF SINGULARLY PERTURBED PROBLEMS USING MULTIGRID METHODS; ANALYSIS AND NUMERICAL RESULTS IN ONE AND TWO DIMENSIONS

On solving linear systems arising from Shishkin mesh discretizations

Scientific Computing with Case Studies SIAM Press, Lecture Notes for Unit VII Sparse Matrix

Scientific Computing

Preconditioning Techniques Analysis for CG Method

An exponentially graded mesh for singularly perturbed problems

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

A PARAMETER ROBUST NUMERICAL METHOD FOR A TWO DIMENSIONAL REACTION-DIFFUSION PROBLEM

Iterative Domain Decomposition Methods for Singularly Perturbed Nonlinear Convection-Diffusion Equations

5.1 Banded Storage. u = temperature. The five-point difference operator. uh (x, y + h) 2u h (x, y)+u h (x, y h) uh (x + h, y) 2u h (x, y)+u h (x h, y)

HIGH ORDER VARIABLE MESH EXPONENTIAL FINITE DIFFERENCE METHOD FOR THE NUMERICAL SOLUTION OF THE TWO POINTS BOUNDARY VALUE PROBLEMS

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

Lecture # 20 The Preconditioned Conjugate Gradient Method

Notes for CS542G (Iterative Solvers for Linear Systems)

c 2015 Society for Industrial and Applied Mathematics

AMS526: Numerical Analysis I (Numerical Linear Algebra)

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

A greedy strategy for coarse-grid selection

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION

Numerical Solution Techniques in Mechanical and Aerospace Engineering

INTERGRID OPERATORS FOR THE CELL CENTERED FINITE DIFFERENCE MULTIGRID ALGORITHM ON RECTANGULAR GRIDS. 1. Introduction

Finite Difference Methods for Boundary Value Problems

A MULTIGRID ALGORITHM FOR. Richard E. Ewing and Jian Shen. Institute for Scientic Computation. Texas A&M University. College Station, Texas SUMMARY

The antitriangular factorisation of saddle point matrices

Solving linear systems (6 lectures)

An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84

Sparse Linear Systems. Iterative Methods for Sparse Linear Systems. Motivation for Studying Sparse Linear Systems. Partial Differential Equations

AMG for a Peta-scale Navier Stokes Code

9. Iterative Methods for Large Linear Systems

Preconditioning Techniques for Large Linear Systems Part III: General-Purpose Algebraic Preconditioners

Numerical Linear Algebra Primer. Ryan Tibshirani Convex Optimization /36-725

Comparative Analysis of Mesh Generators and MIC(0) Preconditioning of FEM Elasticity Systems

Numerical Methods I Non-Square and Sparse Linear Systems

The flexible incomplete LU preconditioner for large nonsymmetric linear systems. Takatoshi Nakamura Takashi Nodera

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner

Preconditioning techniques to accelerate the convergence of the iterative solution methods

We first repeat some well known facts about condition numbers for normwise and componentwise perturbations. Consider the matrix

LINEAR SYSTEMS (11) Intensive Computation

Jae Heon Yun and Yu Du Han

Incomplete Cholesky preconditioners that exploit the low-rank property

Fine-grained Parallel Incomplete LU Factorization

Local Mesh Refinement with the PCD Method

Jim Lambers MAT 610 Summer Session Lecture 2 Notes

Remarks on the analysis of finite element methods on a Shishkin mesh: are Scott-Zhang interpolants applicable?

Robust exponential convergence of hp-fem for singularly perturbed systems of reaction-diffusion equations

NUMERICAL SOLUTION OF CONVECTION DIFFUSION EQUATIONS USING UPWINDING TECHNIQUES SATISFYING THE DISCRETE MAXIMUM PRINCIPLE

Some Geometric and Algebraic Aspects of Domain Decomposition Methods

Direct and Incomplete Cholesky Factorizations with Static Supernodes

Math 471 (Numerical methods) Chapter 3 (second half). System of equations

Lecture 18 Classical Iterative Methods

9.1 Preconditioned Krylov Subspace Methods

CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 6

Domain decomposition for the Jacobi-Davidson method: practical strategies

A Block Compression Algorithm for Computing Preconditioners

LAPACK-Style Codes for Pivoted Cholesky and QR Updating

Numerical Linear Algebra

Program Lecture 2. Numerical Linear Algebra. Gaussian elimination (2) Gaussian elimination. Decompositions, numerical aspects

arxiv: v1 [math.na] 11 Jul 2011

Iterative Methods for Linear Systems

V C V L T I 0 C V B 1 V T 0 I. l nk

MODIFICATION AND COMPENSATION STRATEGIES FOR THRESHOLD-BASED INCOMPLETE FACTORIZATIONS

Poisson Solvers. William McLean. April 21, Return to Math3301/Math5315 Common Material.

Scientific Computing WS 2018/2019. Lecture 9. Jürgen Fuhrmann Lecture 9 Slide 1

LU Preconditioning for Overdetermined Sparse Least Squares Problems

A Method for Constructing Diagonally Dominant Preconditioners based on Jacobi Rotations

Applied Mathematics 205. Unit II: Numerical Linear Algebra. Lecturer: Dr. David Knezevic

FEM and Sparse Linear System Solving

Adaptive algebraic multigrid methods in lattice computations

Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning

A generalization of the Gauss-Seidel iteration method for solving absolute value equations

Finite Math - J-term Section Systems of Linear Equations in Two Variables Example 1. Solve the system

J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1. March, 2009

Robust solution of Poisson-like problems with aggregation-based AMG

ACM106a - Homework 2 Solutions

A first-order system Petrov-Galerkin discretisation for a reaction-diffusion problem on a fitted mesh

FEM and sparse linear system solving

Linear Solvers. Andrew Hazel

A Note on Inverse Iteration

Numerical Analysis of Differential Equations Numerical Solution of Elliptic Boundary Value

MATH 425-Spring 2010 HOMEWORK ASSIGNMENTS

Some recent results on positivity-preserving discretisations for the convection-diffusion equation

The behaviour of high Reynolds flows in a driven cavity

Equations Introduction

4.6 Iterative Solvers for Linear Systems

Lecture 3: QR-Factorization

Solving PDEs with Multigrid Methods p.1

A Backward Stable Hyperbolic QR Factorization Method for Solving Indefinite Least Squares Problem

BLOCK ILU PRECONDITIONED ITERATIVE METHODS FOR REDUCED LINEAR SYSTEMS

On the performance of the Algebraic Optimized Schwarz Methods with applications

Incomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices

Lecture 17: Iterative Methods and Sparse Linear Algebra

3.5 Finite Differences and Fast Poisson Solvers

Random walks and anisotropic interpolation on graphs. Filip Malmberg

Iterative Methods and Multigrid

15-780: LinearProgramming

Key words. conjugate gradients, normwise backward error, incremental norm estimation.

OUTLINE 1. Introduction 1.1 Notation 1.2 Special matrices 2. Gaussian Elimination 2.1 Vector and matrix norms 2.2 Finite precision arithmetic 2.3 Fact

Transcription:

Cholesky factorisations of linear systems coming from a finite difference method applied to singularly perturbed problems Thái Anh Nhan and Niall Madden The Boundary and Interior Layers - Computational & Asymptotic Methods Charles University in Prague, September 2014 Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 1 / 24

Outline... 1 The model problem 2 Cholesky factorization on a uniform mesh An application: Incomplete Cholesky with threshold 3 Cholesky factorization on a boundary layer-adapted mesh 4 Conclusion Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 2 / 24

The model problem Problem: We consider the singularly perturbed two dimensional reaction-diffusion problem: ε 2 u + b(x, y)u = f(x, y), on Ω = (0, 1) 2, and u( Ω) = g(x, y), (1) where ε is a perturbation parameter, 0 < ε 1, g(x, y), b(x, y) and f(x, y) are some given functions such that b(x, y) β 2 > 0. It is known that a standard finite difference method applied to (1) on a boundary layer-adapted mesh with N intervals in each direction yields a parameter robust approximation (cf. Clavero et al. (2005) and Linß (2010)). This method leads to a linear system of equations which must be solved. We write the system as AU = F, (2) where A is a banded, symmetric and positive definite (N 1) 2 (N 1) 2 matrix. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 3 / 24

The model problem Motivation We wish to investigate the Cholesky factorization of matrices in which the magnitudes of diagonal entries are dominant compared to the off-diagonal entries such as the coefficient matrix A defined in (2). It is observed by MacLachlan and Madden (2013) that The performance of direct solvers (Cholesky factorization) for sparse matrices of this type depends not only on the location of nonzero of the matrix, but also their values. Factorization process results in many extremely small numbers, so-called subnormal and underflow numbers, in the Cholesky factors. In practice, this affects the computational speed considerably, and so the amount of time required to solve these linear systems depends badly on the perturbation parameter. The fill-in entries in the Cholesky factorisation decay exponentially away from the main diagonal. Thus, for small ε and large N, the Cholesky factors contain many subnormal and underflow numbers. We want to provide a mathematical justification for this phenomenon, and also study how it can be exploited to our advantages. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 4 / 24

Cholesky factorization on a uniform mesh Initially, we consider the standard central finite difference discretization of the model problem (1) on uniform mesh with N intervals on each direction. The equally spaced stepsize is denoted by h = N 1 and we will suppose that ε h, which is reasonable in practice. Then the discrete matrix A can be written as the following 5-point stencil ε 2 ε 2 A = ε 2 4ε 2 + h 2 b i,j ε 2 ε 2 = ε 2 O(h 2 ) ε 2 ε 2, (3) since (4ε 2 + h 2 b i,j ) = O(h 2 ), where we write f( ) = O(g( )) if there exists positive constants C 0, C 1 independent of variables such that C 0 g( ) f( ) C 1 g( ). This problem is related to the region in the interior of a boundary layer-adapted mesh for problems (1) where the local mesh width of order O(N 1 ). Therefore, the Cholesky factorization in the interior region is essential the same as that of the discretized matrix A on a uniform mesh. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 5 / 24

There are several ways to implement Cholesky factorization of a symmetric positive definite matrix. Below, we present in Algorithm 1 a version adapted from Golub and Van Loan, Matrix Computations (1996). It computes a lower triangular matrix L such that A = LL T. We will follow Matlab notation by denoting A = [a(i, j)] and L = [l(i, j)]. Algorithm 1: Cholesky factorization for j = 1 : n if j = 1 for i = j : n a(i, j) l(i, j) = a(j, j) end elseif (j > 1) for i = j : n l(i, j) = a(i, j) j 1 k=1 l(i, k)l(j, k) (line 8) a(j, j) end end end Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 6 / 24

Figures below show the structures of the coefficient matrix A (on the left) and Cholesky factor L (on the right) when N = 8. Figure: The matrix A (left) and Cholesky factor L (right). Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 7 / 24

Figures below show the structures of the coefficient matrix A (on the left) and Cholesky factor L (on the right) when N = 8. Figure: The matrix A (left) and Cholesky factor L (right). The fill-in entries are colored in red. Question: can we estimate the magnitudes of these fill-in entries in terms of ε and h? Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 8 / 24

Terminology (based on Chapter 10, Y. Saad (2003)) To analyse the magnitudes of the fill-in entries, we will denote The distinct sets L [0], L [1],..., L [m] where all entries of the same magnitude belong to the same set in a sense that if l [k] is the magnitude of entry in L [k], then l(i, j) L [k] if and only if l(i, j) is O(l [k] ). We shall see that these sets are quite distinct, meaning that l [k] l [k+1]. All the nonzero a(i, j) of the original matrix A belong to L [0]. The Cholesky process begins with zero matrix L. We define these initial zeros belong to L [ ]. Suppose that l(i, j) L [pi,j] (with p i,j = from the start). At each sweep through the algorithm, new values of l(i, j) are computed, and so p i,j is modified. Then, from line 8 in Algorithm 1, we can see that the p i,j is updated by { 0, if a(i, j) 0, p i,j = (4) min{p i,1 + p j,1 + 1, p i,2 + p j,2 + 1,..., p i,j 1 + p j,j 1 + 1}, otherwise. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 9 / 24

Let us denote m = N 1, the band width of A defined in (3). Then our analysis has shown that, the fill-in entries from Cholesky factorization have the following structure. The first 2m rows of matrix L is shown below in which, for simplicity, the entries belonging to L [k] are denoted by [k] and the entries that corresponding to nonzero entries of original matrix are written in terms of their magnitude. O(h) O(ε 2 /h) O(h) O(ε 2 /h) O(h)...... O(ε 2 /h) O(h) O(ε 2 /h) [1] [2]... [m 1] O(h) O(ε 2 /h) [1]... [m 2] O(ε 2 /h) O(h)......... [3] O(ε 2 /h) O(h)... O(ε 2...... /h) [1]. O(ε 2 /h) [m]... [3] O(ε 2 /h) O(h) Figure: The submatrix L(1 : 2m, 1 : 2m). It turns out that the remaining fill-in entries in L have the same structure as the fill-in entries in submatrix L(m + 1 : 2m, 1 : 2m). Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 10 / 24

We present a closer look on the submatrix L(m + 1 : 2m, 1 : m). O(ε 2 /h) [1] [2] [3]... [m 2] [m 1] O(ε 2 /h) [1] [2]... [m 3] [m 2]............... O(ε 2 /h) [1] [2] [3] O(ε 2 /h) [1] [2] O(ε 2 /h) [1] O(ε 2 /h) Figure: The submatrix L(m + 1 : 2m, 1 : m). Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 11 / 24

And the submatrix L(m + 1 : 2m, m + 1 : 2m). O(h) O(ε 2 /h) O(h) [3] O(ε 2 /h) O(h) [4] [3] O(ε 2 /h) O(h)........... [m 1] [m 2]... [3] O(ε 2 /h) O(h) [m] [m 1]... [4] [3] O(ε 2 /h) O(h) Figure: The partition L(m + 1 : 2m, m + 1 : 2m). Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 12 / 24

We have shown that Theorem 1 Let δ = ε/h. If l(i, j) L [k], then the magnitude l [k] of l(i, j) is given as follows, l [k] = O ( δ 2(k+1) h ). (5) The formulation given in (5) tells us why the values of fill-in entries decay exponentially with respect to k. In practice, for a reaction-dominated problem, we usually have ε h. Hence, when ε decreases and the mesh parameter N increases, the fill-in entries tend to zero rapidly. This fact also suggests that an Incomplete Cholesky factorization would be an excellent preconditioner for iterative solvers (not discussed here). Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 13 / 24

To demonstrate Theorem 1, we plot the magnitude of the entries of the vector L(m + 1, 1 : m) with various values of ε and N = 128. It clearly shows that the magnitude of the fill-in entries decay exponentially, which is in agreement with our result. Furthermore, when ε = 10 6, there is only first 41 entries of the vector L(128, 1 : 127) can be plotted because the magnitudes of the other entries are less than the smallest number in IEEE standard double precision, and so they are flushed to zero. Figure: Semilog plot of the magnitude of L(128, 1 : 127) with various ε. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 14 / 24

Furthermore, we can give the exact number of the fill-in entries associated with their magnitudes. Theorem 2 Let L [k] denote the number of elements in the set L [k]. Then sum of the number of fill-in entries m L [k] (m 1) 3 k=1 m L [k] (m 2)(m 1) 2 k=2 m L [k], p 3 (m 1)(m p + 1) 2 k=p Table: The sum of the number of fill-in entries associated with their magnitudes. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 15 / 24

Numerical experiments To validate Theorem 2, we compare the sum of all fill-in entries which belong to L [k], k p given by Theorem 2 and the actual number of fill-in entries that are less than the magnitude l [p] is computed by Matlab. p δ 2(p+1) h Actual number Theorem 2 1 2.62e-07 238328 238328 2 1.07e-09 234484 234484 3 4.39e-12 230702 230702 4 1.80e-14 223200 223200 5 7.37e-17 215822 215822 6 3.02e-19 208568 208568 Table: The actual and estimated numbers of fill-in entries associated with their magnitudes when ε = 10 3 and N = 2 6. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 16 / 24

An application: Incomplete Cholesky with threshold An application: Incomplete Cholesky with threshold Note that the matrix A is symmetric and positive definite, so there exists a complete Cholesky factorization: A = LL T where L is a lower triangular matrix. Incomplete Cholesky: set some entries of L to zeros (or, rather, don t compute them at all), and so A L L T where L is the Incomplete Cholesky factor. L is quite inexpensive to compute compared to L. Since A L L T, it can be used as a preconditioner. Incomplete Cholesky with threshold: set the fill-in entries to zeros when they are less than some chosen-threshold. It takes the advantage of previous analysis that shows the locations of fill-in entries according to their magnitudes. We would like to see how good the Incomplete Cholesky with threshold can be applied to find the approximate solution. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 17 / 24

An application: Incomplete Cholesky with threshold Remark Let u, U N be the solutions of the continuous and discrete problems respectively. The numerical method yields the discretization error u U N Cg(N), where denotes the discrete maximum norm, we wish to find an approximate solution U N App to UN in a reasonable way. We have It is feasible to find U N App such that u U N App u U N + U N U N App. U N U N App Cg(N). Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 18 / 24

An application: Incomplete Cholesky with threshold Let L p be the Incomplete Cholesky with threshold p = δ 2(p+1) h, and denote U N,p App = (L pl T p) 1 b. p p U N U N,p App b AUN,p App A L pl T p 1 1.342e-08 4.799e-05 1.831e-10 6.984e-16 2 3.518e-11 1.249e-07 4.755e-13 1.812e-18 3 9.223e-14 5.821e-10 1.776e-15 4.712e-21 4 2.418e-16 3.492e-10 9.992e-16 8.631e-22 5 6.338e-19 3.492e-10 9.992e-16 8.570e-22 6 1.662e-21 3.492e-10 9.992e-16 8.570e-22 Table: The Incomplete Cholesky with various thresholds when ε = 10 4 and N = 512. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 19 / 24

Cholesky factorization on a boundary layer-adapted mesh Cholesky factorization on a boundary layer-adapted mesh Boundary layer-adapted meshes for reaction-diffusion problems. Figure: The Shishkin (left) and Bakhvalov (right) meshes for reaction-diffusion problems. Note that the nodes in the interior region is similar to that of uniform mesh. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 20 / 24

Cholesky factorization on a boundary layer-adapted mesh We use the subscripts indicate the block structure of corners, C, edge layers, E and the interior points (not including the nodes along transition points in both directions), I. This gives the partitioned matrix of the form A = A CC A CE 0 A EC A EE A EI 0 A IE A II We note that in the interior region where the grids on both x- and y-directions are uniform with the mesh width H is proportional to N 1. Therefore, the matrix A II shares the same structure like the matrix stencil defined in (3), O(ε 2 ) A II = O(ε ) O(H 2 ) O(ε 2 ). O(ε 2 ) where A II is an (N/2 1) 2 (N/2 1) 2 banded matrix. It inherits all properties of the matrix defined in (3). Therefore, the previous analysis can be applied to the matrix A II.. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 21 / 24

Cholesky factorization on a boundary layer-adapted mesh Again, let L p be the Incomplete Cholesky with threshold p = δ 2(p+1) h, and denote U N,p App = (L pl T p) 1 b. p p U N U N,p App b AUN,p App A L pl T p 1 1.678e-009 2.028e-005 5.624e-013 1.341e-012 2 1.100e-012 2.731e-008 8.684e-016 1.328e-015 3 7.206e-016 1.707e-011 5.353e-019 9.576e-019 4 4.722e-019 2.232e-013 1.084e-019 3.219e-020 5 3.095e-022 2.232e-013 1.084e-019 3.219e-020 6 2.028e-025 2.232e-013 1.084e-019 3.219e-020 Table: The Incomplete Cholesky with various thresholds when ε = 10 4 and N = 256 on a Shishkin mesh. Compared to U N U N,full App = 3.5971 10 14, b AU N,full App = 1.3553 10 19, A LL T = 1.3559 10 20, where L is a complete Cholesky factor of A, and U N,full App = (LL T ) 1 b. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 22 / 24

Conclusion Conclusion We gave a detailed description for the magnitudes of fill-in entries of Cholesky factors for the matrices in which the values of entries have different scales. We also provided an estimate for the sum of number of fill-in entries according to their magnitudes. The Incomplete Cholesky with threshold shown its advantages for finding the approximate solutions in the context of singularly perturbed problems. Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 23 / 24

Conclusion Ali R. Ansari and Alan F. Hegarty, A note on iterative methods for solving singularly perturbed problems using non-monotone methods on Shishkin meshes, Comput. Methods Applied Mech. Eng., 192 (2003), pp. 3673-3687. T. Linß, Layer-Adapted Meshes for Reaction-Convection-Diffusion Problems. Lecture Notes in Mathematics, Vol. 1985, Springer, Heidelberg, Berlin 2010. Scott MacLachlan and Niall Madden, Robust solution of singularly perturbed problems using multigrid methods, SIAM J. Sci. Comput., 35(5):A2225-A2254, 2013. Yousef Saad, Iterative methods for sparse linear systems, Society for Industrial and Applied Mathematics, Philadelphia, PA, second edition, 2003. Thank you! Thái Anh Nhan (NUI Galway, Ireland) Cholesky factorisations for problems SPPs September 2014 24 / 24