From Direct to Iterative Substructuring: some Parallel Experiences in 2 and 3D

Size: px
Start display at page:

Download "From Direct to Iterative Substructuring: some Parallel Experiences in 2 and 3D"

Transcription

1 From Direct to Iterative Substructuring: some Parallel Experiences in 2 and 3D Luc Giraud N7-IRIT, Toulouse MUMPS Day October 24, 2006, ENS-INRIA, Lyon, France

2 Outline 1 General Framework 2 The direct substructuring approach 3 The iterative substructuring approach Description of the preconditioners Variant of Additive Shwarz preconditioner M AS M AS v.s. Neumann-Neumann 4 Parallel numerical experiments 2D experiments in semiconductor device modelling 3D experiments on the academic cube 5 Prospectives

3 Outline 1 General Framework 2 The direct substructuring approach 3 The iterative substructuring approach Description of the preconditioners Variant of Additive Shwarz preconditioner M AS M AS v.s. Neumann-Neumann 4 Parallel numerical experiments 2D experiments in semiconductor device modelling 3D experiments on the academic cube 5 Prospectives

4 Background in 3D The model PDE 8 < : div(k. u) = f in Ω, u = 0 on Ω Dirichlet, (K u, n) = 0 on Ω Neumann. The associated linear system 0 A 11 A 1Γ u f 1 Au = A T 1Γ A (1) Γ + A (2) Γ A T 2Γ u Γ A f Γ A 0 A 2Γ A 22 u 2 f 2

5 Background in 2D The same model PDE but in 2D ( x (a(x, y) v x ) y (b(x, y) v ) y = F (x, y) in Ω, v = 0 on Ω. The associated linear system A 11 0 A 1Γ x 1 b 1 A h x = 0 A 22 A 2Γ x 2 A b 2 A A T 1Γ A T 2Γ A (1) Γ + A (2) x Γ Γ b Γ

6 Algebraic Gaussian elimination Space dimension free exposure Algebraic splitting and block Gaussian elimination 0 A 11 0 A 1Γ x b 0 A 22 A 2Γ x 2 A b 2 A A T 1Γ A T 2Γ A (1) Γ + A (2) Γ {z } x Γ b Γ A h 2X 2X Sx Γ = b Γ i=1 A T iγa 1 ii b i S = i=1 A (i) Γ AT iγa 1 ii A iγ = 2X i=1 S (i) Local unassembled Schur complement associated with Ω i ( ) A (Ω i ) Aii A iγi = S (i) = A (i) Γ i A T iγ i A 1 A T iγ i A (i) Γ i ii A iγi }{{} Contribution matrix

7 Outline 1 General Framework 2 The direct substructuring approach 3 The iterative substructuring approach Description of the preconditioners Variant of Additive Shwarz preconditioner M AS M AS v.s. Neumann-Neumann 4 Parallel numerical experiments 2D experiments in semiconductor device modelling 3D experiments on the academic cube 5 Prospectives

8 The sparse direct multifrontal view point [Amestoy, Duff, L Excellent - 98] [Amestoy, Duff, Koster, L Excellent - 01], [Amestoy, Guermouche, L Excellent, Pralet - 06] Ω 1 Ω 3 Ω 2 Ω 4 Ω 1 Ω 2 Ω 3 Ω 4 The direct substructuring approach on top of MUMPS 1: Call #domains sequential MUMPS in parallel on each A (Ω i ) to form the local unassembled Schur complement (local ordering) 2: Convert the local ordering on the interface into a global ordering (simple pre-processing) 3: Call one parallel MUMPS on the distributed matrix elemental distributed matrices S = #domains X i=1 S (i) {z } feature of MUMPS for redundant coordinates

9 The sparse direct multifrontal view point [Amestoy, Duff, L Excellent - 98] [Amestoy, Duff, Koster, L Excellent - 01], [Amestoy, Guermouche, L Excellent, Pralet - 06] Ω 1 Ω 3 Ω 2 Ω 4 Ω 1 Ω 2 Ω 3 Ω 4 The direct substructuring approach on top of MUMPS 1: Call #domains sequential MUMPS in parallel on each A (Ω i ) to form the local unassembled Schur complement (local ordering) 2: Convert the local ordering on the interface into a global ordering (simple pre-processing) 3: Call one parallel MUMPS on the distributed matrix elemental distributed matrices S = #domains X i=1 S (i) {z } feature of MUMPS for redundant coordinates

10 Outline 1 General Framework 2 The direct substructuring approach 3 The iterative substructuring approach Description of the preconditioners Variant of Additive Shwarz preconditioner M AS M AS v.s. Neumann-Neumann 4 Parallel numerical experiments 2D experiments in semiconductor device modelling 3D experiments on the academic cube 5 Prospectives

11 Elliptic PDEs properties Algebraic splitting and block Gaussian elimination: # domains = N sub-domains case A I1 I A I1 Γ 1. B C 0... A IN I N A IN Γ N A Γ1 I 1... A ΓN I N A ΓΓ Su Γ = i=1 u I1.. u IN u Γ i=1 C A = f I1.. f IN fγ C A! NX NX RΓ T i S (i) R Γi u Γ = f Γ RΓ T i A Γi I i A 1 I i I i f Ii where S (i) = A (i) Γ i Γ i A Γi I i A 1 I i I i A Ii Γ i Spectral properties for elliptic PDE s κ(a) = O(h 2 ) κ(s) = O(h 1 ) e (k) A 2 p κ(a) 1 p κ(a) + 1! k e (0) A

12 A simple mathematical framework Block preconditioners U a algebraic space of vectors associated with unknowns on Γ U l subspaces of U such that U = U U n R l : the canonical pointwise restriction from U U l M = nx l=1 Examples : Rl T M 1 l R l where M l = R l SRl T U l associated with each edge/face: block Jacobi (geometric information needed) U l associated with Ω l : additive Schwarz

13 Structure of the Local Schur Complement Non-Overlapping Domain Decomposition E m Ω i E g Ω j E l E k Ωi = Γ i = E l E k E m E g Distributed Schur Complement 0 1 S (i) mm S mg S mk S ml S (i) S gm S (i) gg S = gk S gl S km S kg S (i) C kk S kl A S lm S lg S lk S (i) ll Sgg = S(i) gg + S (j) gg If A is SPD then S is also SPD CG In a distributed memory environment: S is distributed non-assembled

14 Additive Schwarz preconditioner [Carvalho, Giraud, Meurant - 01] Preconditionner properties U i associated with the entire interface Γ i of sub-domain M AS = #domains i=1 R T i ( S(i) ) 1 Ri 0 1 S mm S mg S mk S ml S (i) = BS gm S gg S gk S gl S km S kg S kk S kl A S lm S lg S lk S ll Assembled local Schur complement S (i) and S (i) are dense matrices... Remarks M AS is SPD if S is SPD 0 1 S (i) mm S mg S mk S ml S (i) S gm S (i) gg S = gk S gl S km S kg S (i) C kk S kl A S lm S lg S lk S (i) ll local Schur complement

15 Cheaper Additive Shwarz preconditioner form Main characteristics Cheaper in memory space Flops reduction Without any additional communication cost S (i) Sparsification strategy for { skl if s ŝ kl = kl ɛ( s kk + s ll ) 0 else We end-up with a sparse variant of the preconditioner M spas = #domains i=1 R T i (Ŝ(i) ) 1 Ri

16 M AS v.s. Neumann-Neumann Neumann-Neumann preconditioner [J.F Bourgat, R. Glowinski, P. Le Tallec and M. Vidrascu - 89] [Y.H. de Roek, P. Le Tallec and M. Vidrascu - 91] S (1) = S (2) = S 2 S 1 = 1 S (1) 1 + S (2) « «««A (i) Aii A iγ I 0 Aii 0 I A 1 = A iγ A (i) = A Γ iγ A 1 ii I 0 S (i) ii A Γi 0 I S (i) 1 ` = 0 I A (i) «1 0 I M NN = #domains X i=1 R T i D i S (i) «#domains 1 X Di R i while M AS = i=1 R T i «S(i) 1 Ri

17 Outline 1 General Framework 2 The direct substructuring approach 3 The iterative substructuring approach Description of the preconditioners Variant of Additive Shwarz preconditioner M AS M AS v.s. Neumann-Neumann 4 Parallel numerical experiments 2D experiments in semiconductor device modelling 3D experiments on the academic cube 5 Prospectives

18 Application to semiconductor device modelling [Giraud, Koster, Marroco, Rioual - 01], [Giraud, Marroco, Rioual - 05] The drift diffusion equations Find (φ, φ n, φ p ) so that div(ɛ φ) + q[n(φ, φ n ) P(φ, φ p ) Dop] = 0, div(qµ n N(φ, φ n ) φ n ) + qgr(φ, φ n, φ p ) = 0, div(qµ p P(φ, φ p ) φ p ) qgr(φ, φ n, φ p ) = 0, with mixed Dirichlet-Neumann boundary conditions.

19 Heterojunction device (mixed finite elements) Mesh size # subdomains Size Schur Medium edges Large edges

20 Explicit Schur complement calculation Experiments on a SGI O2K S = N A (i) Γ i=1 A iγa 1 ii A Γia }{{} implicit = N }{{} S (i) explicit i= Facto Matvec 20 Krylov Facto Matvec 20 Krylov Implicit Explicit Krylov solvers CG on SPD systems (Poisson equations) Full-GMRES on unsymmetric systems (electrons/holes equations)

21 Iterative solvers Embeded iterative schemes: numerical effects Newton Steps ε Krylov Performance of the preconditioners Medium 16 Large 32 M bj M AS M bj M AS Newton its iter CG iter GMRES time (s) Direct v.s. Hybrid iterative solve Medium 16 Large 32 M AS Dss M AS Dss Newton time (s)

22 3D experiments [Giraud, Haidar, Watson - ongoing] 3D Poisson problem: Number of CG iterations where either: H h constant while # sub-domains is varied horizontal view Increasing mesh size H h while # sub-domains kept constant vertical view # sub-domains # processors sub-domains size M AS M SpAS AS M SpAS M AS M SpAS M M AS M SpAS The solved problem size vary from 1.1 up to 42.8 Millions of unknowns The number of iterations increases slightly when increasing # sub-domains This increase is less significant when the local mesh size H h grows With a few hundred processors those runs cannot be performed using a direct substructuring approach with current version of MUMPS

23 Numerical scalability 3D Discontinuous problem : Jumps in diffusion coefficient functions a() = b() = c():

24 Numerical scalability 3D Discontinuous problem : Jumps in diffusion coefficient functions a() = b() = c(): Number of CG iterations where either: H h constant while # sub-domains is varied horizontal view Increasing mesh size H h while # sub-domains kept constant vertical view # sub-domains # processors sub-domains size M AS M SpAS M AS M SpAS M AS M SpAS M AS M SpAS

25 Parallel performance 3D Discontinuous problem: Implementation details: Setup Schur: MUMPS Setup Precond: dense Schur (LAPACK)- sparse Schur (MUMPS) Target computer : System Xserve MAC G5 Parallel elapsed time: 10 3 processors H h vary ɛ = 10 4 Jumps in diffusion coefficient functions a() = b() = c(): Sub-domains size setup Schur setup Precond time per iter total # iter dense local Schur Precond M AS - sparse local Schur Precond M SpAS

26 Local data storage M AS vs M SpAS Memory behaviour M AS M SpAS Subdomains size ɛ = 10 5 ɛ = MB 7.5 MB (10%) 1.8 MB ( 5%) MB 12.7 MB (14%) 2.7 MB ( 3%) MB 19.4 MB (10%) 3.8 MB ( 2%) MB 28.6 MB ( 7%) 10.2 MB ( 2%)

27 Outline 1 General Framework 2 The direct substructuring approach 3 The iterative substructuring approach Description of the preconditioners Variant of Additive Shwarz preconditioner M AS M AS v.s. Neumann-Neumann 4 Parallel numerical experiments 2D experiments in semiconductor device modelling 3D experiments on the academic cube 5 Prospectives

28 Prospectives Objective Control the growth of iterations when increasing the # processors Various possibilities (future work) Numerical remedy: two-level preconditioner - Coarse space correction, ie solve a closed problem on a coarse space - Various choices for the coarse component (eg one d.o.f. per sub-domain) Computer Science remedy : several processors per sub-domain - two-level of parallelism - 2D cyclic data storage MUMPS is back!!! Final comments The advances in sparse directs techniques anables us to consider preconditioning techniques that were out of reach before

29 Prospectives Objective Control the growth of iterations when increasing the # processors Various possibilities (future work) Numerical remedy: two-level preconditioner - Coarse space correction, ie solve a closed problem on a coarse space - Various choices for the coarse component (eg one d.o.f. per sub-domain) Computer Science remedy : several processors per sub-domain - two-level of parallelism - 2D cyclic data storage MUMPS is back!!! Final comments The advances in sparse directs techniques anables us to consider preconditioning techniques that were out of reach before

30 Prospectives Objective Control the growth of iterations when increasing the # processors Various possibilities (future work) Numerical remedy: two-level preconditioner - Coarse space correction, ie solve a closed problem on a coarse space - Various choices for the coarse component (eg one d.o.f. per sub-domain) Computer Science remedy : several processors per sub-domain - two-level of parallelism - 2D cyclic data storage MUMPS is back!!! Final comments The advances in sparse directs techniques anables us to consider preconditioning techniques that were out of reach before

31 Credit to co-workers Co-workers L. M. Carvalho (UFRJ Brasil, mainly during PhD at CERFACS), A. Haidar (CERFACS), J. Koster (Norway ministery of research, mainly when at Parallab), P. Le Tallec (Ecole Polytechnique & Paris Dauphine, mainly when at Paris Dauphine & INRIA), J. C. Rioual (NEC, mainly during PhD at CERFACS), A. Marrocco (INRIA), G. Meurant (CEA), L. Watson (Virginia Tech). and of course to the MUMPS team

32 More information giraud Merci pour votre attention a final personal advert...

33 More information giraud Merci pour votre attention a final personal advert...

34

A High-Performance Parallel Hybrid Method for Large Sparse Linear Systems

A High-Performance Parallel Hybrid Method for Large Sparse Linear Systems Outline A High-Performance Parallel Hybrid Method for Large Sparse Linear Systems Azzam Haidar CERFACS, Toulouse joint work with Luc Giraud (N7-IRIT, France) and Layne Watson (Virginia Polytechnic Institute,

More information

Iterative versus direct parallel substructuring methods in semiconductor device modeling

Iterative versus direct parallel substructuring methods in semiconductor device modeling Iterative versus direct parallel substructuring methods in semiconductor device modeling L. Giraud, A. Marrocco, J.-C. Rioual TR/PA/02/114 Abstract The numerical simulation of semiconductor devices is

More information

ADDITIVE SCHWARZ FOR SCHUR COMPLEMENT 305 the parallel implementation of both preconditioners on distributed memory platforms, and compare their perfo

ADDITIVE SCHWARZ FOR SCHUR COMPLEMENT 305 the parallel implementation of both preconditioners on distributed memory platforms, and compare their perfo 35 Additive Schwarz for the Schur Complement Method Luiz M. Carvalho and Luc Giraud 1 Introduction Domain decomposition methods for solving elliptic boundary problems have been receiving increasing attention

More information

Dirichlet-Neumann and Neumann-Neumann Methods

Dirichlet-Neumann and Neumann-Neumann Methods Dirichlet-Neumann and Neumann-Neumann Methods Felix Kwok Hong Kong Baptist University Introductory Domain Decomposition Short Course DD25, Memorial University of Newfoundland July 22, 2018 Outline Methods

More information

ANR Project DEDALES Algebraic and Geometric Domain Decomposition for Subsurface Flow

ANR Project DEDALES Algebraic and Geometric Domain Decomposition for Subsurface Flow ANR Project DEDALES Algebraic and Geometric Domain Decomposition for Subsurface Flow Michel Kern Inria Paris Rocquencourt Maison de la Simulation C2S@Exa Days, Inria Paris Centre, Novembre 2016 M. Kern

More information

Indefinite and physics-based preconditioning

Indefinite and physics-based preconditioning Indefinite and physics-based preconditioning Jed Brown VAW, ETH Zürich 2009-01-29 Newton iteration Standard form of a nonlinear system F (u) 0 Iteration Solve: Update: J(ũ)u F (ũ) ũ + ũ + u Example (p-bratu)

More information

Solving linear systems for semiconductor device simulations on parallel distributed computers

Solving linear systems for semiconductor device simulations on parallel distributed computers N o Ordre : 1880 THÈSE présentée pour obtenir LE TITRE DE DOCTEUR DE L INSTITUT NATIONAL POLYTECHNIQUE DE TOULOUSE Spécialité : INFORMATIQUE par Jean-Christophe RIOUAL Solving linear systems for semiconductor

More information

Inexactness and flexibility in linear Krylov solvers

Inexactness and flexibility in linear Krylov solvers Inexactness and flexibility in linear Krylov solvers Luc Giraud ENSEEIHT (N7) - IRIT, Toulouse Matrix Analysis and Applications CIRM Luminy - October 15-19, 2007 in honor of Gérard Meurant for his 60 th

More information

Optimal Left and Right Additive Schwarz Preconditioning for Minimal Residual Methods with Euclidean and Energy Norms

Optimal Left and Right Additive Schwarz Preconditioning for Minimal Residual Methods with Euclidean and Energy Norms Optimal Left and Right Additive Schwarz Preconditioning for Minimal Residual Methods with Euclidean and Energy Norms Marcus Sarkis Worcester Polytechnic Inst., Mass. and IMPA, Rio de Janeiro and Daniel

More information

AN INTRODUCTION TO DOMAIN DECOMPOSITION METHODS. Gérard MEURANT CEA

AN INTRODUCTION TO DOMAIN DECOMPOSITION METHODS. Gérard MEURANT CEA Marrakech Jan 2003 AN INTRODUCTION TO DOMAIN DECOMPOSITION METHODS Gérard MEURANT CEA Introduction Domain decomposition is a divide and conquer technique Natural framework to introduce parallelism in the

More information

Preface to the Second Edition. Preface to the First Edition

Preface to the Second Edition. Preface to the First Edition n page v Preface to the Second Edition Preface to the First Edition xiii xvii 1 Background in Linear Algebra 1 1.1 Matrices................................. 1 1.2 Square Matrices and Eigenvalues....................

More information

Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems

Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems Pierre Jolivet, F. Hecht, F. Nataf, C. Prud homme Laboratoire Jacques-Louis Lions Laboratoire Jean Kuntzmann INRIA Rocquencourt

More information

A Numerical Study of Some Parallel Algebraic Preconditioners

A Numerical Study of Some Parallel Algebraic Preconditioners A Numerical Study of Some Parallel Algebraic Preconditioners Xing Cai Simula Research Laboratory & University of Oslo PO Box 1080, Blindern, 0316 Oslo, Norway xingca@simulano Masha Sosonkina University

More information

Parallel scalability of a FETI DP mortar method for problems with discontinuous coefficients

Parallel scalability of a FETI DP mortar method for problems with discontinuous coefficients Parallel scalability of a FETI DP mortar method for problems with discontinuous coefficients Nina Dokeva and Wlodek Proskurowski University of Southern California, Department of Mathematics Los Angeles,

More information

Parallel sparse linear solvers and applications in CFD

Parallel sparse linear solvers and applications in CFD Parallel sparse linear solvers and applications in CFD Jocelyne Erhel Joint work with Désiré Nuentsa Wakam () and Baptiste Poirriez () SAGE team, Inria Rennes, France journée Calcul Intensif Distribué

More information

Incomplete Cholesky preconditioners that exploit the low-rank property

Incomplete Cholesky preconditioners that exploit the low-rank property anapov@ulb.ac.be ; http://homepages.ulb.ac.be/ anapov/ 1 / 35 Incomplete Cholesky preconditioners that exploit the low-rank property (theory and practice) Artem Napov Service de Métrologie Nucléaire, Université

More information

OUTLINE ffl CFD: elliptic pde's! Ax = b ffl Basic iterative methods ffl Krylov subspace methods ffl Preconditioning techniques: Iterative methods ILU

OUTLINE ffl CFD: elliptic pde's! Ax = b ffl Basic iterative methods ffl Krylov subspace methods ffl Preconditioning techniques: Iterative methods ILU Preconditioning Techniques for Solving Large Sparse Linear Systems Arnold Reusken Institut für Geometrie und Praktische Mathematik RWTH-Aachen OUTLINE ffl CFD: elliptic pde's! Ax = b ffl Basic iterative

More information

Efficient domain decomposition methods for the time-harmonic Maxwell equations

Efficient domain decomposition methods for the time-harmonic Maxwell equations Efficient domain decomposition methods for the time-harmonic Maxwell equations Marcella Bonazzoli 1, Victorita Dolean 2, Ivan G. Graham 3, Euan A. Spence 3, Pierre-Henri Tournier 4 1 Inria Saclay (Defi

More information

ITERATIVE METHODS FOR SPARSE LINEAR SYSTEMS

ITERATIVE METHODS FOR SPARSE LINEAR SYSTEMS ITERATIVE METHODS FOR SPARSE LINEAR SYSTEMS YOUSEF SAAD University of Minnesota PWS PUBLISHING COMPANY I(T)P An International Thomson Publishing Company BOSTON ALBANY BONN CINCINNATI DETROIT LONDON MADRID

More information

Toward black-box adaptive domain decomposition methods

Toward black-box adaptive domain decomposition methods Toward black-box adaptive domain decomposition methods Frédéric Nataf Laboratory J.L. Lions (LJLL), CNRS, Alpines Inria and Univ. Paris VI joint work with Victorita Dolean (Univ. Nice Sophia-Antipolis)

More information

Capacitance Matrix Method

Capacitance Matrix Method Capacitance Matrix Method Marcus Sarkis New England Numerical Analysis Day at WPI, 2019 Thanks to Maksymilian Dryja, University of Warsaw 1 Outline 1 Timeline 2 Capacitance Matrix Method-CMM 3 Applications

More information

Parallel Implementation of BDDC for Mixed-Hybrid Formulation of Flow in Porous Media

Parallel Implementation of BDDC for Mixed-Hybrid Formulation of Flow in Porous Media Parallel Implementation of BDDC for Mixed-Hybrid Formulation of Flow in Porous Media Jakub Šístek1 joint work with Jan Březina 2 and Bedřich Sousedík 3 1 Institute of Mathematics of the AS CR Nečas Center

More information

Algebraic two-level preconditioners. for the Schur complement method. L. M. Carvalho, L. Giraud and P. Le Tallec

Algebraic two-level preconditioners. for the Schur complement method. L. M. Carvalho, L. Giraud and P. Le Tallec Algebraic two-level preconditioners for the Schur complement method L. M. Carvalho, L. Giraud and P. Le Tallec June 1998 TR/PA/98/18 Algebraic two-level preconditioners for the Schur complement method

More information

An Efficient Low Memory Implicit DG Algorithm for Time Dependent Problems

An Efficient Low Memory Implicit DG Algorithm for Time Dependent Problems An Efficient Low Memory Implicit DG Algorithm for Time Dependent Problems P.-O. Persson and J. Peraire Massachusetts Institute of Technology 2006 AIAA Aerospace Sciences Meeting, Reno, Nevada January 9,

More information

Robust solution of Poisson-like problems with aggregation-based AMG

Robust solution of Poisson-like problems with aggregation-based AMG Robust solution of Poisson-like problems with aggregation-based AMG Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire Paris, January 26, 215 Supported by the Belgian FNRS http://homepages.ulb.ac.be/

More information

Fast Iterative Solution of Saddle Point Problems

Fast Iterative Solution of Saddle Point Problems Michele Benzi Department of Mathematics and Computer Science Emory University Atlanta, GA Acknowledgments NSF (Computational Mathematics) Maxim Olshanskii (Mech-Math, Moscow State U.) Zhen Wang (PhD student,

More information

Schwarz-type methods and their application in geomechanics

Schwarz-type methods and their application in geomechanics Schwarz-type methods and their application in geomechanics R. Blaheta, O. Jakl, K. Krečmer, J. Starý Institute of Geonics AS CR, Ostrava, Czech Republic E-mail: stary@ugn.cas.cz PDEMAMIP, September 7-11,

More information

Linear Solvers. Andrew Hazel

Linear Solvers. Andrew Hazel Linear Solvers Andrew Hazel Introduction Thus far we have talked about the formulation and discretisation of physical problems...... and stopped when we got to a discrete linear system of equations. Introduction

More information

A Balancing Algorithm for Mortar Methods

A Balancing Algorithm for Mortar Methods A Balancing Algorithm for Mortar Methods Dan Stefanica Baruch College, City University of New York, NY 11, USA Dan Stefanica@baruch.cuny.edu Summary. The balancing methods are hybrid nonoverlapping Schwarz

More information

Master Thesis Literature Study Presentation

Master Thesis Literature Study Presentation Master Thesis Literature Study Presentation Delft University of Technology The Faculty of Electrical Engineering, Mathematics and Computer Science January 29, 2010 Plaxis Introduction Plaxis Finite Element

More information

Solving Ax = b, an overview. Program

Solving Ax = b, an overview. Program Numerical Linear Algebra Improving iterative solvers: preconditioning, deflation, numerical software and parallelisation Gerard Sleijpen and Martin van Gijzen November 29, 27 Solving Ax = b, an overview

More information

BETI for acoustic and electromagnetic scattering

BETI for acoustic and electromagnetic scattering BETI for acoustic and electromagnetic scattering O. Steinbach, M. Windisch Institut für Numerische Mathematik Technische Universität Graz Oberwolfach 18. Februar 2010 FWF-Project: Data-sparse Boundary

More information

Stabilization and Acceleration of Algebraic Multigrid Method

Stabilization and Acceleration of Algebraic Multigrid Method Stabilization and Acceleration of Algebraic Multigrid Method Recursive Projection Algorithm A. Jemcov J.P. Maruszewski Fluent Inc. October 24, 2006 Outline 1 Need for Algorithm Stabilization and Acceleration

More information

Parallel Scalability of a FETI DP Mortar Method for Problems with Discontinuous Coefficients

Parallel Scalability of a FETI DP Mortar Method for Problems with Discontinuous Coefficients Parallel Scalability of a FETI DP Mortar Method for Problems with Discontinuous Coefficients Nina Dokeva and Wlodek Proskurowski Department of Mathematics, University of Southern California, Los Angeles,

More information

Two-level Domain decomposition preconditioning for the high-frequency time-harmonic Maxwell equations

Two-level Domain decomposition preconditioning for the high-frequency time-harmonic Maxwell equations Two-level Domain decomposition preconditioning for the high-frequency time-harmonic Maxwell equations Marcella Bonazzoli 2, Victorita Dolean 1,4, Ivan G. Graham 3, Euan A. Spence 3, Pierre-Henri Tournier

More information

Multispace and Multilevel BDDC. Jan Mandel University of Colorado at Denver and Health Sciences Center

Multispace and Multilevel BDDC. Jan Mandel University of Colorado at Denver and Health Sciences Center Multispace and Multilevel BDDC Jan Mandel University of Colorado at Denver and Health Sciences Center Based on joint work with Bedřich Sousedík, UCDHSC and Czech Technical University, and Clark R. Dohrmann,

More information

Contents. Preface... xi. Introduction...

Contents. Preface... xi. Introduction... Contents Preface... xi Introduction... xv Chapter 1. Computer Architectures... 1 1.1. Different types of parallelism... 1 1.1.1. Overlap, concurrency and parallelism... 1 1.1.2. Temporal and spatial parallelism

More information

Solving Large Nonlinear Sparse Systems

Solving Large Nonlinear Sparse Systems Solving Large Nonlinear Sparse Systems Fred W. Wubs and Jonas Thies Computational Mechanics & Numerical Mathematics University of Groningen, the Netherlands f.w.wubs@rug.nl Centre for Interdisciplinary

More information

Multipréconditionnement adaptatif pour les méthodes de décomposition de domaine. Nicole Spillane (CNRS, CMAP, École Polytechnique)

Multipréconditionnement adaptatif pour les méthodes de décomposition de domaine. Nicole Spillane (CNRS, CMAP, École Polytechnique) Multipréconditionnement adaptatif pour les méthodes de décomposition de domaine Nicole Spillane (CNRS, CMAP, École Polytechnique) C. Bovet (ONERA), P. Gosselet (ENS Cachan), A. Parret Fréaud (SafranTech),

More information

Construction of a New Domain Decomposition Method for the Stokes Equations

Construction of a New Domain Decomposition Method for the Stokes Equations Construction of a New Domain Decomposition Method for the Stokes Equations Frédéric Nataf 1 and Gerd Rapin 2 1 CMAP, CNRS; UMR7641, Ecole Polytechnique, 91128 Palaiseau Cedex, France 2 Math. Dep., NAM,

More information

Solving PDEs with Multigrid Methods p.1

Solving PDEs with Multigrid Methods p.1 Solving PDEs with Multigrid Methods Scott MacLachlan maclachl@colorado.edu Department of Applied Mathematics, University of Colorado at Boulder Solving PDEs with Multigrid Methods p.1 Support and Collaboration

More information

A robust multilevel approximate inverse preconditioner for symmetric positive definite matrices

A robust multilevel approximate inverse preconditioner for symmetric positive definite matrices DICEA DEPARTMENT OF CIVIL, ENVIRONMENTAL AND ARCHITECTURAL ENGINEERING PhD SCHOOL CIVIL AND ENVIRONMENTAL ENGINEERING SCIENCES XXX CYCLE A robust multilevel approximate inverse preconditioner for symmetric

More information

Multilevel spectral coarse space methods in FreeFem++ on parallel architectures

Multilevel spectral coarse space methods in FreeFem++ on parallel architectures Multilevel spectral coarse space methods in FreeFem++ on parallel architectures Pierre Jolivet Laboratoire Jacques-Louis Lions Laboratoire Jean Kuntzmann DD 21, Rennes. June 29, 2012 In collaboration with

More information

PhD Thesis. Sparse preconditioners for dense linear systems from electromagnetic applications

PhD Thesis. Sparse preconditioners for dense linear systems from electromagnetic applications N o Ordre: 1879 PhD Thesis Spécialité : Informatique Sparse preconditioners for dense linear systems from electromagnetic applications présentée le 23 Avril 2002 à l Institut National Polytechnique de

More information

Lecture 8: Fast Linear Solvers (Part 7)

Lecture 8: Fast Linear Solvers (Part 7) Lecture 8: Fast Linear Solvers (Part 7) 1 Modified Gram-Schmidt Process with Reorthogonalization Test Reorthogonalization If Av k 2 + δ v k+1 2 = Av k 2 to working precision. δ = 10 3 2 Householder Arnoldi

More information

Lecture 17: Iterative Methods and Sparse Linear Algebra

Lecture 17: Iterative Methods and Sparse Linear Algebra Lecture 17: Iterative Methods and Sparse Linear Algebra David Bindel 25 Mar 2014 Logistics HW 3 extended to Wednesday after break HW 4 should come out Monday after break Still need project description

More information

Domain Decomposition Preconditioners for Spectral Nédélec Elements in Two and Three Dimensions

Domain Decomposition Preconditioners for Spectral Nédélec Elements in Two and Three Dimensions Domain Decomposition Preconditioners for Spectral Nédélec Elements in Two and Three Dimensions Bernhard Hientzsch Courant Institute of Mathematical Sciences, New York University, 51 Mercer Street, New

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 18 Outline

More information

Multilevel and Adaptive Iterative Substructuring Methods. Jan Mandel University of Colorado Denver

Multilevel and Adaptive Iterative Substructuring Methods. Jan Mandel University of Colorado Denver Multilevel and Adaptive Iterative Substructuring Methods Jan Mandel University of Colorado Denver The multilevel BDDC method is joint work with Bedřich Sousedík, Czech Technical University, and Clark Dohrmann,

More information

Two-scale Dirichlet-Neumann preconditioners for boundary refinements

Two-scale Dirichlet-Neumann preconditioners for boundary refinements Two-scale Dirichlet-Neumann preconditioners for boundary refinements Patrice Hauret 1 and Patrick Le Tallec 2 1 Graduate Aeronautical Laboratories, MS 25-45, California Institute of Technology Pasadena,

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences) AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences) Lecture 19: Computing the SVD; Sparse Linear Systems Xiangmin Jiao Stony Brook University Xiangmin Jiao Numerical

More information

Iterative Methods and Multigrid

Iterative Methods and Multigrid Iterative Methods and Multigrid Part 3: Preconditioning 2 Eric de Sturler Preconditioning The general idea behind preconditioning is that convergence of some method for the linear system Ax = b can be

More information

Large Scale Sparse Linear Algebra

Large Scale Sparse Linear Algebra Large Scale Sparse Linear Algebra P. Amestoy (INP-N7, IRIT) A. Buttari (CNRS, IRIT) T. Mary (University of Toulouse, IRIT) A. Guermouche (Univ. Bordeaux, LaBRI), J.-Y. L Excellent (INRIA, LIP, ENS-Lyon)

More information

Adaptive algebraic multigrid methods in lattice computations

Adaptive algebraic multigrid methods in lattice computations Adaptive algebraic multigrid methods in lattice computations Karsten Kahl Bergische Universität Wuppertal January 8, 2009 Acknowledgements Matthias Bolten, University of Wuppertal Achi Brandt, Weizmann

More information

Multi-Domain Approaches for the Solution of High-Frequency Time-Harmonic Propagation Problems

Multi-Domain Approaches for the Solution of High-Frequency Time-Harmonic Propagation Problems Académie universitaire Wallonie Europe Université de Liège Faculté des Sciences Appliquées Collège de doctorat en Électricité, électronique et informatique Multi-Domain Approaches for the Solution of High-Frequency

More information

Extending the theory for domain decomposition algorithms to less regular subdomains

Extending the theory for domain decomposition algorithms to less regular subdomains Extending the theory for domain decomposition algorithms to less regular subdomains Olof Widlund Courant Institute of Mathematical Sciences New York University http://www.cs.nyu.edu/cs/faculty/widlund/

More information

Some examples in domain decomposition

Some examples in domain decomposition Some examples in domain decomposition Frédéric Nataf nataf@ann.jussieu.fr, www.ann.jussieu.fr/ nataf Laboratoire J.L. Lions, CNRS UMR7598. Université Pierre et Marie Curie, France Joint work with V. Dolean

More information

Lecture 18 Classical Iterative Methods

Lecture 18 Classical Iterative Methods Lecture 18 Classical Iterative Methods MIT 18.335J / 6.337J Introduction to Numerical Methods Per-Olof Persson November 14, 2006 1 Iterative Methods for Linear Systems Direct methods for solving Ax = b,

More information

Parallel Numerics, WT 2016/ Iterative Methods for Sparse Linear Systems of Equations. page 1 of 1

Parallel Numerics, WT 2016/ Iterative Methods for Sparse Linear Systems of Equations. page 1 of 1 Parallel Numerics, WT 2016/2017 5 Iterative Methods for Sparse Linear Systems of Equations page 1 of 1 Contents 1 Introduction 1.1 Computer Science Aspects 1.2 Numerical Problems 1.3 Graphs 1.4 Loop Manipulations

More information

Preconditioning Techniques for Large Linear Systems Part III: General-Purpose Algebraic Preconditioners

Preconditioning Techniques for Large Linear Systems Part III: General-Purpose Algebraic Preconditioners Preconditioning Techniques for Large Linear Systems Part III: General-Purpose Algebraic Preconditioners Michele Benzi Department of Mathematics and Computer Science Emory University Atlanta, Georgia, USA

More information

A Balancing Algorithm for Mortar Methods

A Balancing Algorithm for Mortar Methods A Balancing Algorithm for Mortar Methods Dan Stefanica Baruch College, City University of New York, NY, USA. Dan_Stefanica@baruch.cuny.edu Summary. The balancing methods are hybrid nonoverlapping Schwarz

More information

Domain decomposition on different levels of the Jacobi-Davidson method

Domain decomposition on different levels of the Jacobi-Davidson method hapter 5 Domain decomposition on different levels of the Jacobi-Davidson method Abstract Most computational work of Jacobi-Davidson [46], an iterative method suitable for computing solutions of large dimensional

More information

Implicit Solution of Viscous Aerodynamic Flows using the Discontinuous Galerkin Method

Implicit Solution of Viscous Aerodynamic Flows using the Discontinuous Galerkin Method Implicit Solution of Viscous Aerodynamic Flows using the Discontinuous Galerkin Method Per-Olof Persson and Jaime Peraire Massachusetts Institute of Technology 7th World Congress on Computational Mechanics

More information

Recent advances in HPC with FreeFem++

Recent advances in HPC with FreeFem++ Recent advances in HPC with FreeFem++ Pierre Jolivet Laboratoire Jacques-Louis Lions Laboratoire Jean Kuntzmann Fourth workshop on FreeFem++ December 6, 2012 With F. Hecht, F. Nataf, C. Prud homme. Outline

More information

Multi-source inversion of TEM data: with field applications to Mt. Milligan

Multi-source inversion of TEM data: with field applications to Mt. Milligan Multi-source inversion of TEM data: with field applications to Mt. Milligan D. Oldenburg, E. Haber, D. Yang Geophysical Inversion Facility, University of British Columbia, Vancouver, BC, Canada Summary

More information

The Conjugate Gradient Method

The Conjugate Gradient Method The Conjugate Gradient Method Classical Iterations We have a problem, We assume that the matrix comes from a discretization of a PDE. The best and most popular model problem is, The matrix will be as large

More information

Parallel Iterative Methods for Sparse Linear Systems. H. Martin Bücker Lehrstuhl für Hochleistungsrechnen

Parallel Iterative Methods for Sparse Linear Systems. H. Martin Bücker Lehrstuhl für Hochleistungsrechnen Parallel Iterative Methods for Sparse Linear Systems Lehrstuhl für Hochleistungsrechnen www.sc.rwth-aachen.de RWTH Aachen Large and Sparse Small and Dense Outline Problem with Direct Methods Iterative

More information

Adaptive preconditioners for nonlinear systems of equations

Adaptive preconditioners for nonlinear systems of equations Adaptive preconditioners for nonlinear systems of equations L. Loghin D. Ruiz A. Touhami CERFACS Technical Report TR/PA/04/42 Also ENSEEIHT-IRIT Technical Report RT/TLSE/04/02 Abstract The use of preconditioned

More information

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation Tao Zhao 1, Feng-Nan Hwang 2 and Xiao-Chuan Cai 3 Abstract In this paper, we develop an overlapping domain decomposition

More information

A FETI-DP Method for Mortar Finite Element Discretization of a Fourth Order Problem

A FETI-DP Method for Mortar Finite Element Discretization of a Fourth Order Problem A FETI-DP Method for Mortar Finite Element Discretization of a Fourth Order Problem Leszek Marcinkowski 1 and Nina Dokeva 2 1 Department of Mathematics, Warsaw University, Banacha 2, 02 097 Warszawa, Poland,

More information

Enhancing Scalability of Sparse Direct Methods

Enhancing Scalability of Sparse Direct Methods Journal of Physics: Conference Series 78 (007) 0 doi:0.088/7-6596/78//0 Enhancing Scalability of Sparse Direct Methods X.S. Li, J. Demmel, L. Grigori, M. Gu, J. Xia 5, S. Jardin 6, C. Sovinec 7, L.-Q.

More information

On the design of parallel linear solvers for large scale problems

On the design of parallel linear solvers for large scale problems On the design of parallel linear solvers for large scale problems Journée problème de Poisson, IHP, Paris M. Faverge, P. Ramet M. Faverge Assistant Professor Bordeaux INP LaBRI Inria Bordeaux - Sud-Ouest

More information

A sparse multifrontal solver using hierarchically semi-separable frontal matrices

A sparse multifrontal solver using hierarchically semi-separable frontal matrices A sparse multifrontal solver using hierarchically semi-separable frontal matrices Pieter Ghysels Lawrence Berkeley National Laboratory Joint work with: Xiaoye S. Li (LBNL), Artem Napov (ULB), François-Henry

More information

Block Low-Rank (BLR) approximations to improve multifrontal sparse solvers

Block Low-Rank (BLR) approximations to improve multifrontal sparse solvers Block Low-Rank (BLR) approximations to improve multifrontal sparse solvers Joint work with Patrick Amestoy, Cleve Ashcraft, Olivier Boiteau, Alfredo Buttari and Jean-Yves L Excellent, PhD started on October

More information

An additive average Schwarz method for the plate bending problem

An additive average Schwarz method for the plate bending problem J. Numer. Math., Vol. 10, No. 2, pp. 109 125 (2002) c VSP 2002 Prepared using jnm.sty [Version: 02.02.2002 v1.2] An additive average Schwarz method for the plate bending problem X. Feng and T. Rahman Abstract

More information

Une méthode parallèle hybride à deux niveaux interfacée dans un logiciel d éléments finis

Une méthode parallèle hybride à deux niveaux interfacée dans un logiciel d éléments finis Une méthode parallèle hybride à deux niveaux interfacée dans un logiciel d éléments finis Frédéric Nataf Laboratory J.L. Lions (LJLL), CNRS, Alpines Inria and Univ. Paris VI Victorita Dolean (Univ. Nice

More information

MUMPS. The MUMPS library: work done during the SOLSTICE project. MUMPS team, Lyon-Grenoble, Toulouse, Bordeaux

MUMPS. The MUMPS library: work done during the SOLSTICE project. MUMPS team, Lyon-Grenoble, Toulouse, Bordeaux The MUMPS library: work done during the SOLSTICE project MUMPS team, Lyon-Grenoble, Toulouse, Bordeaux Sparse Days and ANR SOLSTICE Final Workshop June MUMPS MUMPS Team since beg. of SOLSTICE (2007) Permanent

More information

Chapter 7 Iterative Techniques in Matrix Algebra

Chapter 7 Iterative Techniques in Matrix Algebra Chapter 7 Iterative Techniques in Matrix Algebra Per-Olof Persson persson@berkeley.edu Department of Mathematics University of California, Berkeley Math 128B Numerical Analysis Vector Norms Definition

More information

A Novel Aggregation Method based on Graph Matching for Algebraic MultiGrid Preconditioning of Sparse Linear Systems

A Novel Aggregation Method based on Graph Matching for Algebraic MultiGrid Preconditioning of Sparse Linear Systems A Novel Aggregation Method based on Graph Matching for Algebraic MultiGrid Preconditioning of Sparse Linear Systems Pasqua D Ambra, Alfredo Buttari, Daniela Di Serafino, Salvatore Filippone, Simone Gentile,

More information

Shifted Laplace and related preconditioning for the Helmholtz equation

Shifted Laplace and related preconditioning for the Helmholtz equation Shifted Laplace and related preconditioning for the Helmholtz equation Ivan Graham and Euan Spence (Bath, UK) Collaborations with: Paul Childs (Schlumberger Gould Research), Martin Gander (Geneva) Douglas

More information

FEniCS Course. Lecture 0: Introduction to FEM. Contributors Anders Logg, Kent-Andre Mardal

FEniCS Course. Lecture 0: Introduction to FEM. Contributors Anders Logg, Kent-Andre Mardal FEniCS Course Lecture 0: Introduction to FEM Contributors Anders Logg, Kent-Andre Mardal 1 / 46 What is FEM? The finite element method is a framework and a recipe for discretization of mathematical problems

More information

SOLVING ELLIPTIC PDES

SOLVING ELLIPTIC PDES university-logo SOLVING ELLIPTIC PDES School of Mathematics Semester 1 2008 OUTLINE 1 REVIEW 2 POISSON S EQUATION Equation and Boundary Conditions Solving the Model Problem 3 THE LINEAR ALGEBRA PROBLEM

More information

On the choice of abstract projection vectors for second level preconditioners

On the choice of abstract projection vectors for second level preconditioners On the choice of abstract projection vectors for second level preconditioners C. Vuik 1, J.M. Tang 1, and R. Nabben 2 1 Delft University of Technology 2 Technische Universität Berlin Institut für Mathematik

More information

A SHORT NOTE COMPARING MULTIGRID AND DOMAIN DECOMPOSITION FOR PROTEIN MODELING EQUATIONS

A SHORT NOTE COMPARING MULTIGRID AND DOMAIN DECOMPOSITION FOR PROTEIN MODELING EQUATIONS A SHORT NOTE COMPARING MULTIGRID AND DOMAIN DECOMPOSITION FOR PROTEIN MODELING EQUATIONS MICHAEL HOLST AND FAISAL SAIED Abstract. We consider multigrid and domain decomposition methods for the numerical

More information

Multigrid and Domain Decomposition Methods for Electrostatics Problems

Multigrid and Domain Decomposition Methods for Electrostatics Problems Multigrid and Domain Decomposition Methods for Electrostatics Problems Michael Holst and Faisal Saied Abstract. We consider multigrid and domain decomposition methods for the numerical solution of electrostatics

More information

Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains

Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains Martin J. Gander and Felix Kwok Section de mathématiques, Université de Genève, Geneva CH-1211, Switzerland, Martin.Gander@unige.ch;

More information

New constructions of domain decomposition methods for systems of PDEs

New constructions of domain decomposition methods for systems of PDEs New constructions of domain decomposition methods for systems of PDEs Nouvelles constructions de méthodes de décomposition de domaine pour des systèmes d équations aux dérivées partielles V. Dolean?? F.

More information

Preconditioning the Newton-Krylov method for reactive transport

Preconditioning the Newton-Krylov method for reactive transport Preconditioning the Newton-Krylov method for reactive transport Michel Kern, L. Amir, A. Taakili INRIA Paris-Rocquencourt Maison de la Simulation MoMaS workshop Reactive Transport Modeling in the Geological

More information

arxiv: v1 [math.na] 11 Jul 2011

arxiv: v1 [math.na] 11 Jul 2011 Multigrid Preconditioner for Nonconforming Discretization of Elliptic Problems with Jump Coefficients arxiv:07.260v [math.na] Jul 20 Blanca Ayuso De Dios, Michael Holst 2, Yunrong Zhu 2, and Ludmil Zikatanov

More information

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners Eugene Vecharynski 1 Andrew Knyazev 2 1 Department of Computer Science and Engineering University of Minnesota 2 Department

More information

J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1. March, 2009

J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1. March, 2009 Parallel Preconditioning of Linear Systems based on ILUPACK for Multithreaded Architectures J.I. Aliaga M. Bollhöfer 2 A.F. Martín E.S. Quintana-Ortí Deparment of Computer Science and Engineering, Univ.

More information

Numerical Methods in Matrix Computations

Numerical Methods in Matrix Computations Ake Bjorck Numerical Methods in Matrix Computations Springer Contents 1 Direct Methods for Linear Systems 1 1.1 Elements of Matrix Theory 1 1.1.1 Matrix Algebra 2 1.1.2 Vector Spaces 6 1.1.3 Submatrices

More information

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner Olivier Dubois 1 and Martin J. Gander 2 1 IMA, University of Minnesota, 207 Church St. SE, Minneapolis, MN 55455 dubois@ima.umn.edu

More information

Key words. Convection-Diffusion, Domain Decomposition, Preconditioning, Spectral Element Method

Key words. Convection-Diffusion, Domain Decomposition, Preconditioning, Spectral Element Method University of Maryland Department of Computer Science TR-C4931 University of Maryland Institute for Advanced Computer Studies TR-2009-03 FAST ITERATIVE SOLVER FOR CONVECTION-DIFFUSION SYSTEMS WITH SPECTRAL

More information

MARCH 24-27, 2014 SAN JOSE, CA

MARCH 24-27, 2014 SAN JOSE, CA MARCH 24-27, 2014 SAN JOSE, CA Sparse HPC on modern architectures Important scientific applications rely on sparse linear algebra HPCG a new benchmark proposal to complement Top500 (HPL) To solve A x =

More information

EFFICIENT NUMERICAL TREATMENT OF HIGH-CONTRAST DIFFUSION PROBLEMS

EFFICIENT NUMERICAL TREATMENT OF HIGH-CONTRAST DIFFUSION PROBLEMS EFFICIENT NUMERICAL TREATMENT OF HIGH-CONTRAST DIFFUSION PROBLEMS A Dissertation Presented to the Faculty of the Department of Mathematics University of Houston In Partial Fulfillment of the Requirements

More information

From the Boundary Element Domain Decomposition Methods to Local Trefftz Finite Element Methods on Polyhedral Meshes

From the Boundary Element Domain Decomposition Methods to Local Trefftz Finite Element Methods on Polyhedral Meshes From the Boundary Element Domain Decomposition Methods to Local Trefftz Finite Element Methods on Polyhedral Meshes Dylan Copeland 1, Ulrich Langer 2, and David Pusch 3 1 Institute of Computational Mathematics,

More information

A Robust Preconditioned Iterative Method for the Navier-Stokes Equations with High Reynolds Numbers

A Robust Preconditioned Iterative Method for the Navier-Stokes Equations with High Reynolds Numbers Applied and Computational Mathematics 2017; 6(4): 202-207 http://www.sciencepublishinggroup.com/j/acm doi: 10.11648/j.acm.20170604.18 ISSN: 2328-5605 (Print); ISSN: 2328-5613 (Online) A Robust Preconditioned

More information

Matrix square root and interpolation spaces

Matrix square root and interpolation spaces Matrix square root and interpolation spaces Mario Arioli and Daniel Loghin m.arioli@rl.ac.uk STFC-Rutherford Appleton Laboratory, and University of Birmingham Sparse Days,CERFACS, Toulouse, 2008 p.1/30

More information

Accelerating incompressible fluid flow simulations on hybrid CPU/GPU systems

Accelerating incompressible fluid flow simulations on hybrid CPU/GPU systems Accelerating incompressible fluid flow simulations on hybrid CPU/GPU systems Yushan Wang 1, Marc Baboulin 1,2, Karl Rupp 3,4, Yann Fraigneau 1,5, Olivier Le Maître 1,5 1 Université Paris-Sud, France 2

More information