Fast Algorithms for Many-Particle Simulations on Peta-Scale Parallel Architectures

Size: px
Start display at page:

Download "Fast Algorithms for Many-Particle Simulations on Peta-Scale Parallel Architectures"

Transcription

1 Mitglied der Helmholtz-Gemeinschaft Fast Algorithms for Many-Particle Simulations on Peta-Scale Parallel Architectures Godehard Sutmann Institute for Advanced Simulation (IAS) - Jülich Supercomputing Centre (JSC) Research Centre Jülich g.sutmann@fz-juelich.de

2 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 2

3 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 3

4 Atomistic Models of Matter Galaxy Li F in water 4

5 Molecular dynamics Particle systems are described by a Hamiltonian Kinetic Energy Potential Energy Particle trajectories are generated by solving classical equations of motion 5

6 Long range interactions Typical form for a force field (AMBER / Sander) Classification of potentials Examples: Coulomb interactions (charge-charge, dipole-dipole) Gravitational potential 6

7 Long Range Interactions Potential energy calculations 7

8 Long Range Interactions Hierarchical potential energy calculations 8

9 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 9

10 Methods for long range interactions Periodic systems Grid free: Plain Ewald summation O(N3/2) Fast multipole method (FMM) O(N) Tree-algorithms (Barnes-Hut) O(N log(n)) Grid based: Particle Mesh Ewald (PME) O(N log(n)) Particle-Particle Particle-Mesh Method (P3M) O(N log(n)) Open systems Grid free: Tree-algorithms (Barnes-Hut) O(N log(n)) Fast multipole method (FMM) O(N) Grid based: Multigrid based method (P3MG) O(N) List not complete 10

11 Classification of Algorithms Fully Hierarchical Hierarchical Splitting Methods complete subdivision of space complete subdivision of space continuous description from sources to far-field splitting between short- and longrange contributions hierarchical ordering of field contributions fast methods to solve long range contributions mesh-less description (often) grid-based description sources: charges, multipoles sources: pseudo-charges res.: physical potentials, fields res.: lr: pseudo-potentials / -fields sr: potentials + corrections expl: Fast Multipole Method Barnes-Hut Tree Method expl: P3M, SPME, multigrid 11

12 Why different methods? Multigrid Elliptic PDE solver (Poisson, Helmholtz) & non-linear (Poisson-Boltzmann) allows for dielectric shading prescription of desired boundary potentials (capacitors) Barnes-Hut Tree Method strongly inhomogeneous particle distributions very complex system geometries Fast Multipole Method high accuracy calculations full error control All methods are scalable on parallel architectures 12

13 Schematic differences of hierarchical methods Multigrid Barnes-Hut Tree method Fast multipole method 13

14 Acknowlegements FMM Holger Dachsel, Ivo Kabadshow Barnes-Hut Paul Gibbon, Lukas Arnold, Robert Speck, Mathias Winkel Multigrid Matthias Bolten, Bernhard Steffen Parallel Sorting Michael Hofmann (TU Chemnitz) Interface Lidia Westphal, Rene Halver Financial support from BMBF 14

15 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 15

16 Multigrid for Poisson Equation Solve PDE on the finest grid with Dirichlet boundary conditions Use finite differences to approximate PDE Get an approximation for the potential from where an error ε and residuum r is obtained These obey the same differential equation Get correction to approximation: (repeat until convergence) 16

17 Multigrid for Poisson Equation Multigrid: solve for the error onto a hierarchy of coarse grids 2Lh Low frequency components on the fine grid get high frequency components on the coarse grid (fast error reduction) Forward MG: Restrict residuum Backward MG: Prolongate residuum e.g. V cycle 17

18 Complexity Natural complexity of long-range pair-wise interactions is O(N2) Multigrid solver reduces complexity of field calculations to O(N) niter fixed on each level 18

19 Convergence of Multigrid Test system of 4096 particles with (25+1)3 grid points Multigrid calculation 19

20 Convergence of Multigrid Test system of 4096 particles with (25)3 grid points Single grid calculation 20

21 Electrostatics: finite difference solution for point particles In molecular simulations, source terms are point charges, which create a potential, diverging at the origin Sampling problem on a discrete grid Resolution problem depending on grid spacing Large error close to the source term Also far field poisoned by discretization errors 21

22 Electrostatic Regularization Splitting of the potential into near- and far-field (analogon to Ewald summation) 22

23 Scaling 23

24 Multigrid Parallelization Exchange of neighbor grid points on each hierarchy level Data non-locality, examples Ng = 1283, i.e. (27)3, i.e. 7 MG-levels Np = 512, i.e. 83, i.e. 5 local levels (128,64,32,16,8) r ~ 2.5x104 Ng = 20483, i.e. (211)3, i.e. 11 MG-levels Np = 65536, i.e. 323, i.e. 7 local levels (2048,1024,512,256,128,64,32) r ~ 2x106 24

25 Parallel Fast Multipole Method Parallelization: Domain decomposition MPI (cartesian communicators) Asynchronous communications only nearest neighbor communication Strong scaling: N= point stencil Strong scaling: N= unknowns/core 25

26 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 26

27 Tree Algorithms Organize physical space in a hierarchical way (relative to each particle) Interactions are weighted according to their importance Trees may be organized in different ways, e.g. 2d trees: binary-, quad-, oct-trees in 1,2,3 dimensions Principle is to sort particles into cells, where on the lowest level only one particle is inside a cell Original work by Barnes & Hut 27

28 Tree Algorithms: grouping criterion Interaction between individual particles and distant pseudo-particles (multipoles) Which criterion is adopted for grouping? If s/d < θ : internal structure of the pseudoparticle is ignored If s/d > θ : the node is resolved and the criterion is checked for every child At latest this procedure stops, when a leaf node is reached 28

29 Sorted list of keys maps 3D particle coordinates onto 1D space-filling curve Sort keys according to their size in ASCII representation This sequence will be a space filling curve (Morton, Z, Lebesque curve) finish start 29

30 PEPC Barnes-Hut algorithm implemented into PEPC (Pretty Efficient Parallel Coulomb-Solver) developed by Paul Gibbon (JSC) for plasma-laser interaction reduction of the n-body O(N2) complexity to O(N log N) OpenSource, freely available F90 and MPI part of ScaFaCoS library part of DEISA and PRACE benchmark suites 30

31 Scientific Application PEPC is highly flexible and comes in different flavours PEPC-E Coulomb interaction PEPC-B Magnetic fields laser-plasma interaction PEPC-V vortex fluid methods PEPC-G Newtonian gravitation extends to smooth particle hydrodynamics 31

32 Parallel Performance (pure MPI version) PEPC on IBM BG/P 32

33 Hybrid PEPC MPI + POSIX threads reduce metadata overhead communication thread + multiple worker threads overlap of communication and computation first successful runs on 300k cores of jugene system size N > 2x109 33

34 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 34

35 Long range contributions For general orientations of coordinate vectors the expansion is Define 35

36 Extensions: rotation-based FMM Conventional FMM has complexity O(L4) with respect to the multipole length Costs can be reduced to O(L3) if Legendre polynomials are used 36

37 Fast electrostatics Fast Multipole Method FMM is subdivided into 5+1 passes Pass 0: determine optimal FMM parameter set Pass 1: form and shift multipole expansions Pass 2: transform distant multipole expansions Pass 3: shift Taylor like expansions Pass 4: calculate far field energy Pass 5: calculate near field energy 37

38 Fast Multipole Method Pass 0 Determine FMM parameters Seperation criteria ws Depth of the FMM tree Length of the multipole expansion L Optimisation of run time behavior Pass 4 Sum up far field energies Pass 5 Sum up near field energies 38

39 Fast electrostatics Fast Multipole Method Computational complexity: Every Pass consists of operations of O(N), > Fast Multipole Method has optimal scaling with number of particles! 39

40 Periodic Boundary Conditions Implementation straight forward (although involved) for 1d, 2d, 3d-periodicity Scaling of O(N) Full error control 40

41 FMM Parallelization 41

42 Parallel Fast Multipole Method Parallelization: Remote memory access (RMA) Point-to-point: One sided communication Put, Fence, Notify, Notifywait Jugene Contigous data transfer ARMCI / A1 Collective operations MPI Allgather, Allreduce, Barrier Juropa 42

43 Parallel Fast Multipole Method Weak scaling on Jugene and Juropa (Load: 86= particles/core) 43

44 Parallel Fast Multipole Method Strong scaling on Jugene (Load: N = 10243) 44

45 Parallel Fast Multipole Method Strong scaling on Juropa (Load: N = 10243) 45

46 FMM World record (Dec. 2010) After memory optimization: 50 Bytes / particle 512 Mbytes / core x cores / 50 Bytes / particle 3.0 x 1012 particles Largest simulation with N = particles = = 3.0 Trillion 46

47 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 47

48 BMBF Project ScaFaCoS Scalable Library for Fast Coulomb Solver Coordinator: Forschungszentrum Jülich (IAS/JSC) University Partners: Bonn Chemnitz Stuttgart Wuppertal Research Centres: MPI Mainz (Polymer) Industry: BASF Ludwigshafen Cognis Düsseldorf (BASF) IBM Dresden Jülich Cognis Wuppertal SCAI Bonn IBM Chemnitz MPI BASF Stuttgart 48

49 BMBF Project ScaFaCoS Interfacing Programs Scalable Fast Coulomb Solvers liblr Particle2Grid Communication Ewald P3M FMM BHT P3MG Fast summation Long Range Interactions Multipole libsr S c h n i t t s t e l l e Communication Neighbor search Potential fcts. Short Range Interactions Raumf.Kurven Parallel Sorting Parallel NFFT, AFFT Parallel Multigrid Parallel Interface AnwendungsProgramm 1 (MD, MC, BD, ) AnwendungsProgramm N (MD, MC, BD, ) 49

50 Yes Insert essential solver-specific data into a new FCSHandle-object Call fcs_init to set method and MPI communicator Calls fcs_fmm_init Calls fcs_p3m_init Calls fcs_pp3mg_init Calls fcs_nfft_init Calls fcs_pepc_init Calls fcs_direct_init Calls fcs_pepc_tune Calls fcs_direct_tune Calls fcs_pepc_run Calls fcs_direct_run Insert optional solver-specific data into FCSHandle-object Call fcs_tune to calculate solver-specific parameters based on positions / charges Calls fcs_fmm_tune Calls fcs_p3m_tune Calls fcs_pp3mg_tune Calls fcs_nfft_tune No Call fcs_run to calculate a time step with the chosen solver Calls fcs_fmm_run Calls fcs_p3m_run Calls fcs_pp3mg_run Calls fcs_nfft_run Yes No Next time step with same configuration? Change solver? Calculate another time step at all? Call fcshandle_destroy to deallocate FCSHandle-object Simulation ends 50

51 Thank you for your Attention 51

Long-range Interactions in Particle Simulations ScaFaCoS. Olaf Lenz

Long-range Interactions in Particle Simulations ScaFaCoS. Olaf Lenz Long-range Interactions in Particle Simulations ScaFaCoS 1/33 Outline Many-particle Simulations Short-range Interactions Long-range Interactions: ScaFaCoS If time permits: ESPResSo and coarse-graining

More information

The Blue Gene/P at Jülich Case Study & Optimization. W.Frings, Forschungszentrum Jülich,

The Blue Gene/P at Jülich Case Study & Optimization. W.Frings, Forschungszentrum Jülich, The Blue Gene/P at Jülich Case Study & Optimization W.Frings, Forschungszentrum Jülich, 26.08.2008 Jugene Case-Studies: Overview Case Study: PEPC Case Study: racoon Case Study: QCD CPU0CPU3 CPU1CPU2 2

More information

A Fast N-Body Solver for the Poisson(-Boltzmann) Equation

A Fast N-Body Solver for the Poisson(-Boltzmann) Equation A Fast N-Body Solver for the Poisson(-Boltzmann) Equation Robert D. Skeel Departments of Computer Science (and Mathematics) Purdue University http://bionum.cs.purdue.edu/2008december.pdf 1 Thesis Poisson(-Boltzmann)

More information

Fast Methods for Long-Range Interactions in Complex Systems. IAS Series. Volume 6 ISBN

Fast Methods for Long-Range Interactions in Complex Systems. IAS Series. Volume 6 ISBN IAS Series This publication was edited at the Jülich Supercomputing Centre (JSC) which is an integral part of the Institute for Advanced Simulation (IAS). The IAS combines the Jülich simulation sciences

More information

Multipole-Based Preconditioners for Sparse Linear Systems.

Multipole-Based Preconditioners for Sparse Linear Systems. Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Supported by the National Science Foundation. Overview Summary of Contributions Generalized Stokes Problem Solenoidal

More information

Simulation Laboratories at JSC

Simulation Laboratories at JSC Mitglied der Helmholtz-Gemeinschaft Simulation Laboratories at JSC Paul Gibbon Jülich Supercomputing Centre Jülich Supercomputing Centre Supercomputer operation for Centre FZJ Regional JARA Helmholtz &

More information

Treecodes for Cosmology Thomas Quinn University of Washington N-Body Shop

Treecodes for Cosmology Thomas Quinn University of Washington N-Body Shop Treecodes for Cosmology Thomas Quinn University of Washington N-Body Shop Outline Motivation Multipole Expansions Tree Algorithms Periodic Boundaries Time integration Gravitational Softening SPH Parallel

More information

The Fast Multipole Method in molecular dynamics

The Fast Multipole Method in molecular dynamics The Fast Multipole Method in molecular dynamics Berk Hess KTH Royal Institute of Technology, Stockholm, Sweden ADAC6 workshop Zurich, 20-06-2018 Slide BioExcel Slide Molecular Dynamics of biomolecules

More information

Hideyuki Usui 1,3, M. Nunami 2,3, Y. Yagi 1,3, T. Moritaka 1,3, and JST/CREST multi-scale PIC simulation team

Hideyuki Usui 1,3, M. Nunami 2,3, Y. Yagi 1,3, T. Moritaka 1,3, and JST/CREST multi-scale PIC simulation team Hideyuki Usui 1,3, M. Nunami 2,3, Y. Yagi 1,3, T. Moritaka 1,3, and JST/CREST multi-scale PIC simulation team 1 Kobe Univ., Japan, 2 NIFS,Japan, 3 JST/CREST, Outline Multi-scale interaction between weak

More information

Advanced Molecular Molecular Dynamics

Advanced Molecular Molecular Dynamics Advanced Molecular Molecular Dynamics Technical details May 12, 2014 Integration of harmonic oscillator r m period = 2 k k and the temperature T determine the sampling of x (here T is related with v 0

More information

Brad Gibson Centre for Astrophysics & Supercomputing Swinburne University

Brad Gibson Centre for Astrophysics & Supercomputing Swinburne University N-body Simulations: Tree and Mesh Approaches Brad Gibson Centre for Astrophysics & Supercomputing Swinburne University Demystifying the Jargon Trees Collisionless Boltzmann Equation Symplectic Time Integration

More information

Mitglied der Helmholtz-Gemeinschaft. Plasma Simulation at the Jülich Supercomputing Centre

Mitglied der Helmholtz-Gemeinschaft. Plasma Simulation at the Jülich Supercomputing Centre Mitglied der Helmholtz-Gemeinschaft Plasma Simulation at the Jülich Supercomputing Centre 27 October 2008 Paul Gibbon JSC s Mission: Provide supercomputer resources of highest performance and widest scope;

More information

Contents. Preface... xi. Introduction...

Contents. Preface... xi. Introduction... Contents Preface... xi Introduction... xv Chapter 1. Computer Architectures... 1 1.1. Different types of parallelism... 1 1.1.1. Overlap, concurrency and parallelism... 1 1.1.2. Temporal and spatial parallelism

More information

Parallel programming using MPI. Analysis and optimization. Bhupender Thakur, Jim Lupo, Le Yan, Alex Pacheco

Parallel programming using MPI. Analysis and optimization. Bhupender Thakur, Jim Lupo, Le Yan, Alex Pacheco Parallel programming using MPI Analysis and optimization Bhupender Thakur, Jim Lupo, Le Yan, Alex Pacheco Outline l Parallel programming: Basic definitions l Choosing right algorithms: Optimal serial and

More information

Practical in Numerical Astronomy, SS 2012 LECTURE 9

Practical in Numerical Astronomy, SS 2012 LECTURE 9 Practical in Numerical Astronomy, SS 01 Elliptic partial differential equations. Poisson solvers. LECTURE 9 1. Gravity force and the equations of hydrodynamics. Poisson equation versus Poisson integral.

More information

Elliptic Problems / Multigrid. PHY 604: Computational Methods for Physics and Astrophysics II

Elliptic Problems / Multigrid. PHY 604: Computational Methods for Physics and Astrophysics II Elliptic Problems / Multigrid Summary of Hyperbolic PDEs We looked at a simple linear and a nonlinear scalar hyperbolic PDE There is a speed associated with the change of the solution Explicit methods

More information

Partial Differential Equations

Partial Differential Equations Partial Differential Equations Introduction Deng Li Discretization Methods Chunfang Chen, Danny Thorne, Adam Zornes CS521 Feb.,7, 2006 What do You Stand For? A PDE is a Partial Differential Equation This

More information

Particle Methods. Lecture Reduce and Broadcast: A function viewpoint

Particle Methods. Lecture Reduce and Broadcast: A function viewpoint Lecture 9 Particle Methods 9.1 Reduce and Broadcast: A function viewpoint [This section is being rewritten with what we hope will be the world s clearest explanation of the fast multipole algorithm. Readers

More information

Aspects of Multigrid

Aspects of Multigrid Aspects of Multigrid Kees Oosterlee 1,2 1 Delft University of Technology, Delft. 2 CWI, Center for Mathematics and Computer Science, Amsterdam, SIAM Chapter Workshop Day, May 30th 2018 C.W.Oosterlee (CWI)

More information

CS 542G: The Poisson Problem, Finite Differences

CS 542G: The Poisson Problem, Finite Differences CS 542G: The Poisson Problem, Finite Differences Robert Bridson November 10, 2008 1 The Poisson Problem At the end last time, we noticed that the gravitational potential has a zero Laplacian except at

More information

A Jacobi Davidson Method with a Multigrid Solver for the Hermitian Wilson-Dirac Operator

A Jacobi Davidson Method with a Multigrid Solver for the Hermitian Wilson-Dirac Operator A Jacobi Davidson Method with a Multigrid Solver for the Hermitian Wilson-Dirac Operator Artur Strebel Bergische Universität Wuppertal August 3, 2016 Joint Work This project is joint work with: Gunnar

More information

Performance Evaluation of Scientific Applications on POWER8

Performance Evaluation of Scientific Applications on POWER8 Performance Evaluation of Scientific Applications on POWER8 2014 Nov 16 Andrew V. Adinetz 1, Paul F. Baumeister 1, Hans Böttiger 3, Thorsten Hater 1, Thilo Maurer 3, Dirk Pleiter 1, Wolfram Schenck 4,

More information

INTRODUCTION TO FAST MULTIPOLE METHODS LONG CHEN

INTRODUCTION TO FAST MULTIPOLE METHODS LONG CHEN INTRODUCTION TO FAST MULTIPOLE METHODS LONG CHEN ABSTRACT. This is a simple introduction to fast multipole methods for the N-body summation problems. Low rank approximation plus hierarchical decomposition

More information

Differential Algebra (DA) based Fast Multipole Method (FMM)

Differential Algebra (DA) based Fast Multipole Method (FMM) Differential Algebra (DA) based Fast Multipole Method (FMM) Department of Physics and Astronomy Michigan State University June 27, 2010 Space charge effect Space charge effect : the fields of the charged

More information

Numerical Simulations. Duncan Christie

Numerical Simulations. Duncan Christie Numerical Simulations Duncan Christie Motivation There isn t enough time to derive the necessary methods to do numerical simulations, but there is enough time to survey what methods and codes are available

More information

Parallel Simulations of Self-propelled Microorganisms

Parallel Simulations of Self-propelled Microorganisms Parallel Simulations of Self-propelled Microorganisms K. Pickl a,b M. Hofmann c T. Preclik a H. Köstler a A.-S. Smith b,d U. Rüde a,b ParCo 2013, Munich a Lehrstuhl für Informatik 10 (Systemsimulation),

More information

A THE FAST MULTIPLE METHOD FOR GRIDLESS PARTICLEii

A THE FAST MULTIPLE METHOD FOR GRIDLESS PARTICLEii A-89 804 THE FAST MULTIPLE METHOD FOR GRIDLESS PARTICLEii UNCLASSIFIED SIMULATIONSU) YALE UNIV NEW HAVEN CT DEPT OF COMPUTER SCIENCE J AMBROSIANO ET AL SEP 87 YALEU/DCS/RR-565 N888t14-8-k-S F/G 12/1 NL

More information

Iterative Methods and Multigrid

Iterative Methods and Multigrid Iterative Methods and Multigrid Part 1: Introduction to Multigrid 1 12/02/09 MG02.prz Error Smoothing 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Initial Solution=-Error 0 10 20 30 40 50 60 70 80 90 100 DCT:

More information

Particle-Simulation Methods for Fluid Dynamics

Particle-Simulation Methods for Fluid Dynamics Particle-Simulation Methods for Fluid Dynamics X. Y. Hu and Marco Ellero E-mail: Xiangyu.Hu and Marco.Ellero at mw.tum.de, WS 2012/2013: Lectures for Mechanical Engineering Institute of Aerodynamics Technical

More information

MOLECULAR DYNAMICS SIMULATIONS OF BIOMOLECULES: Long-Range Electrostatic Effects

MOLECULAR DYNAMICS SIMULATIONS OF BIOMOLECULES: Long-Range Electrostatic Effects Annu. Rev. Biophys. Biomol. Struct. 1999. 28:155 79 Copyright c 1999 by Annual Reviews. All rights reserved MOLECULAR DYNAMICS SIMULATIONS OF BIOMOLECULES: Long-Range Electrostatic Effects Celeste Sagui

More information

TREECODES FOR POTENTIAL AND FORCE APPROXIMATIONS. A Dissertation KASTHURI SRINIVASAN KANNAN

TREECODES FOR POTENTIAL AND FORCE APPROXIMATIONS. A Dissertation KASTHURI SRINIVASAN KANNAN TREECODES FOR POTENTIAL AND FORCE APPROXIMATIONS A Dissertation by KASTHURI SRINIVASAN KANNAN Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements

More information

A Comparison of Adaptive Algebraic Multigrid and Lüscher s Inexact Deflation

A Comparison of Adaptive Algebraic Multigrid and Lüscher s Inexact Deflation A Comparison of Adaptive Algebraic Multigrid and Lüscher s Inexact Deflation Andreas Frommer, Karsten Kahl, Stefan Krieg, Björn Leder and Matthias Rottmann Bergische Universität Wuppertal April 11, 2013

More information

Efficient Particle Simulation of a Virtual Cathode Using a Grid-Free Treecode Poisson Solver

Efficient Particle Simulation of a Virtual Cathode Using a Grid-Free Treecode Poisson Solver 384 IEEE TRANSACTIONS ON PLASMA SCIENCE, VOL. 32, NO. 2, APRIL 2004 Efficient Particle Simulation of a Virtual Cathode Using a Grid-Free Treecode Poisson Solver Andrew J. Christlieb, Robert Krasny, and

More information

Kinetic Solvers with Adaptive Mesh in Phase Space for Low- Temperature Plasmas

Kinetic Solvers with Adaptive Mesh in Phase Space for Low- Temperature Plasmas Kinetic Solvers with Adaptive Mesh in Phase Space for Low- Temperature Plasmas Vladimir Kolobov, a,b,1 Robert Arslanbekov a and Dmitry Levko a a CFD Research Corporation, Huntsville, AL 35806, USA b The

More information

A Crash Course on Fast Multipole Method (FMM)

A Crash Course on Fast Multipole Method (FMM) A Crash Course on Fast Multipole Method (FMM) Hanliang Guo Kanso Biodynamical System Lab, 2018 April What is FMM? A fast algorithm to study N body interactions (Gravitational, Coulombic interactions )

More information

A Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries

A Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries A Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries SC13, November 21 st 2013 Christian Godenschwager, Florian Schornbaum, Martin Bauer, Harald Köstler, Ulrich

More information

Modelling and implementation of algorithms in applied mathematics using MPI

Modelling and implementation of algorithms in applied mathematics using MPI Modelling and implementation of algorithms in applied mathematics using MPI Lecture 3: Linear Systems: Simple Iterative Methods and their parallelization, Programming MPI G. Rapin Brazil March 2011 Outline

More information

INTEGRAL-EQUATION-BASED FAST ALGORITHMS AND GRAPH-THEORETIC METHODS FOR LARGE-SCALE SIMULATIONS

INTEGRAL-EQUATION-BASED FAST ALGORITHMS AND GRAPH-THEORETIC METHODS FOR LARGE-SCALE SIMULATIONS INTEGRAL-EQUATION-BASED FAST ALGORITHMS AND GRAPH-THEORETIC METHODS FOR LARGE-SCALE SIMULATIONS Bo Zhang A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial

More information

INTRODUCTION TO FINITE ELEMENT METHODS ON ELLIPTIC EQUATIONS LONG CHEN

INTRODUCTION TO FINITE ELEMENT METHODS ON ELLIPTIC EQUATIONS LONG CHEN INTROUCTION TO FINITE ELEMENT METHOS ON ELLIPTIC EQUATIONS LONG CHEN CONTENTS 1. Poisson Equation 1 2. Outline of Topics 3 2.1. Finite ifference Method 3 2.2. Finite Element Method 3 2.3. Finite Volume

More information

Coupling atomistic and continuum modelling of magnetism

Coupling atomistic and continuum modelling of magnetism Coupling atomistic and continuum modelling of magnetism M. Poluektov 1,2 G. Kreiss 2 O. Eriksson 3 1 University of Warwick WMG International Institute for Nanocomposites Manufacturing 2 Uppsala University

More information

ICON. The Icosahedral Nonhydrostatic modelling framework

ICON. The Icosahedral Nonhydrostatic modelling framework ICON The Icosahedral Nonhydrostatic modelling framework Basic formulation, NWP and high-performance computing aspects, and its perspective towards a unified model for seamless prediction Günther Zängl,

More information

5. FVM discretization and Solution Procedure

5. FVM discretization and Solution Procedure 5. FVM discretization and Solution Procedure 1. The fluid domain is divided into a finite number of control volumes (cells of a computational grid). 2. Integral form of the conservation equations are discretized

More information

Analysis of PFLOTRAN on Jaguar

Analysis of PFLOTRAN on Jaguar Analysis of PFLOTRAN on Jaguar Kevin Huck, Jesus Labarta, Judit Gimenez, Harald Servat, Juan Gonzalez, and German Llort CScADS Workshop on Performance Tools for Petascale Computing Snowbird, UT How this

More information

Lecture 12: Solvation Models: Molecular Mechanics Modeling of Hydration Effects

Lecture 12: Solvation Models: Molecular Mechanics Modeling of Hydration Effects Statistical Thermodynamics Lecture 12: Solvation Models: Molecular Mechanics Modeling of Hydration Effects Dr. Ronald M. Levy ronlevy@temple.edu Bare Molecular Mechanics Atomistic Force Fields: torsion

More information

Fast Space Charge Calculations with a Multigrid Poisson Solver & Applications

Fast Space Charge Calculations with a Multigrid Poisson Solver & Applications Fast Space Charge Calculations with a Multigrid Poisson Solver & Applications Gisela Pöplau Ursula van Rienen Rostock University DESY, Hamburg, April 26, 2005 Bas van der Geer Marieke de Loos Pulsar Physics

More information

A particle-in-cell method with adaptive phase-space remapping for kinetic plasmas

A particle-in-cell method with adaptive phase-space remapping for kinetic plasmas A particle-in-cell method with adaptive phase-space remapping for kinetic plasmas Bei Wang 1 Greg Miller 2 Phil Colella 3 1 Princeton Institute of Computational Science and Engineering Princeton University

More information

RWTH Aachen University

RWTH Aachen University IPCC @ RWTH Aachen University Optimization of multibody and long-range solvers in LAMMPS Rodrigo Canales William McDoniel Markus Höhnerbach Ahmed E. Ismail Paolo Bientinesi IPCC Showcase November 2016

More information

Multiscale simulations of complex fluid rheology

Multiscale simulations of complex fluid rheology Multiscale simulations of complex fluid rheology Michael P. Howard, Athanassios Z. Panagiotopoulos Department of Chemical and Biological Engineering, Princeton University Arash Nikoubashman Institute of

More information

Numerical Methods for Partial Differential Equations: an Overview.

Numerical Methods for Partial Differential Equations: an Overview. Numerical Methods for Partial Differential Equations: an Overview math652_spring2009@colorstate PDEs are mathematical models of physical phenomena Heat conduction Wave motion PDEs are mathematical models

More information

Analyzing the Error Bounds of Multipole-Based

Analyzing the Error Bounds of Multipole-Based Page 1 of 12 Analyzing the Error Bounds of Multipole-Based Treecodes Vivek Sarin, Ananth Grama, and Ahmed Sameh Department of Computer Sciences Purdue University W. Lafayette, IN 47906 Email: Sarin, Grama,

More information

N-body simulations. Phys 750 Lecture 10

N-body simulations. Phys 750 Lecture 10 N-body simulations Phys 750 Lecture 10 N-body simulations Forward integration in time of strongly coupled classical particles or (rigid-body composite objects) dissipation Brownian motion Very general

More information

An FPGA Implementation of Reciprocal Sums for SPME

An FPGA Implementation of Reciprocal Sums for SPME An FPGA Implementation of Reciprocal Sums for SPME Sam Lee and Paul Chow Edward S. Rogers Sr. Department of Electrical and Computer Engineering University of Toronto Objectives Accelerate part of Molecular

More information

Robust solution of Poisson-like problems with aggregation-based AMG

Robust solution of Poisson-like problems with aggregation-based AMG Robust solution of Poisson-like problems with aggregation-based AMG Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire Paris, January 26, 215 Supported by the Belgian FNRS http://homepages.ulb.ac.be/

More information

Compact High Order Finite Difference Stencils for Elliptic Variable Coefficient and Interface Problems

Compact High Order Finite Difference Stencils for Elliptic Variable Coefficient and Interface Problems Compact High Order Finite Difference Stencils for Elliptic Variable Coefficient and Interface Problems Daniel Ritter 1, Ulrich Rüde 1, Björn Gmeiner 1, Rochus Schmid 2 Copper Mountain, March 18th, 2013

More information

Chapter 5. Methods for Solving Elliptic Equations

Chapter 5. Methods for Solving Elliptic Equations Chapter 5. Methods for Solving Elliptic Equations References: Tannehill et al Section 4.3. Fulton et al (1986 MWR). Recommended reading: Chapter 7, Numerical Methods for Engineering Application. J. H.

More information

- Part 4 - Multicore and Manycore Technology: Chances and Challenges. Vincent Heuveline

- Part 4 - Multicore and Manycore Technology: Chances and Challenges. Vincent Heuveline - Part 4 - Multicore and Manycore Technology: Chances and Challenges Vincent Heuveline 1 Numerical Simulation of Tropical Cyclones Goal oriented adaptivity for tropical cyclones ~10⁴km ~1500km ~100km 2

More information

Computational Astrophysics 1 Particle Mesh methods

Computational Astrophysics 1 Particle Mesh methods Computational Astrophysics 1 Particle Mesh methods Oscar Agertz Outline - The Vlassov-Poisson equation - The Particle-Mesh method in a nut shell - Symplectic Integrators - Mass assignment and force interpolation

More information

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS

More information

Optimizing GROMACS for parallel performance

Optimizing GROMACS for parallel performance Optimizing GROMACS for parallel performance Outline 1. Why optimize? Performance status quo 2. GROMACS as a black box. (PME) 3. How does GROMACS spend its time? (MPE) 4. What you can do What I want to

More information

Numerically Solving Partial Differential Equations

Numerically Solving Partial Differential Equations Numerically Solving Partial Differential Equations Michael Lavell Department of Applied Mathematics and Statistics Abstract The physics describing the fundamental principles of fluid dynamics can be written

More information

Dynamics of Galaxies: Basics. Frontiers in Numerical Gravitational Astrophysics July 3, 2008

Dynamics of Galaxies: Basics. Frontiers in Numerical Gravitational Astrophysics July 3, 2008 Dynamics of Galaxies: Basics Frontiers in Numerical Gravitational Astrophysics July 3, 2008 c = G = 1 h = 0 Outline (a) Galaxies as physical systems. (b) The Collisionless Boltzmann Equation. (c) N-body

More information

PIConGPU Bringing Large-Scale Laser Plasma Simulations to GPU Supercomputing

PIConGPU Bringing Large-Scale Laser Plasma Simulations to GPU Supercomputing PIConGPU Bringing Large-Scale Laser Plasma Simulations to GPU Supercomputing Michael Bussmann 1, Heiko Burau 1, René Widera 1, Florian Berninger 1, Axel Hübl 1, Thomas Kluge 1, Alexander Debus 1, Ulrich

More information

Molecular Dynamics. A very brief introduction

Molecular Dynamics. A very brief introduction Molecular Dynamics A very brief introduction Sander Pronk Dept. of Theoretical Physics KTH Royal Institute of Technology & Science For Life Laboratory Stockholm, Sweden Why computer simulations? Two primary

More information

AM 205 Final Project The N-Body Problem

AM 205 Final Project The N-Body Problem AM 205 Final Project The N-Body Problem Leah Birch Elizabeth Finn Karen Yu December 14, 2012 Abstract The N-Body Problem can be solved using a variety of numeric integrators. Newton s Law of Universal

More information

An Accurate Fourier-Spectral Solver for Variable Coefficient Elliptic Equations

An Accurate Fourier-Spectral Solver for Variable Coefficient Elliptic Equations An Accurate Fourier-Spectral Solver for Variable Coefficient Elliptic Equations Moshe Israeli Computer Science Department, Technion-Israel Institute of Technology, Technion city, Haifa 32000, ISRAEL Alexander

More information

Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei

Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei 1/20 Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei H. Watanabe ISSP, The M. Suzuki H. Inaoka N. Ito Kyushu University RIKEN AICS The, RIKEN AICS Outline 1. Introduction 2. Benchmark results

More information

Nonlinear Iterative Solution of the Neutron Transport Equation

Nonlinear Iterative Solution of the Neutron Transport Equation Nonlinear Iterative Solution of the Neutron Transport Equation Emiliano Masiello Commissariat à l Energie Atomique de Saclay /DANS//SERMA/LTSD emiliano.masiello@cea.fr 1/37 Outline - motivations and framework

More information

A Trillion Particles: Studying Large-Scale Structure Formation on the BG/Q

A Trillion Particles: Studying Large-Scale Structure Formation on the BG/Q A Trillion Particles: Studying Large-Scale Structure Formation on the BG/Q David Daniel, Patricia Fasel, Hal Finkel, Nicholas Frontiere, Salman Habib, Katrin Heitmann, Zarija Lukic, Adrian Pope July 10,

More information

Long-range interactions: P 3 M, MMMxD, ELC, MEMD and ICC

Long-range interactions: P 3 M, MMMxD, ELC, MEMD and ICC Long-range interactions: P 3 M, MMMxD, ELC, MEMD and ICC Axel Arnold Institute for Computational Physics Universität Stuttgart October 10, 2012 Long-range interactions gravity Coulomb interaction dipolar

More information

Introduction to Multigrid Methods

Introduction to Multigrid Methods Introduction to Multigrid Methods Chapter 9: Multigrid Methodology and Applications Gustaf Söderlind Numerical Analysis, Lund University Textbooks: A Multigrid Tutorial, by William L Briggs. SIAM 1988

More information

Finite Difference Methods (FDMs) 1

Finite Difference Methods (FDMs) 1 Finite Difference Methods (FDMs) 1 1 st - order Approxima9on Recall Taylor series expansion: Forward difference: Backward difference: Central difference: 2 nd - order Approxima9on Forward difference: Backward

More information

An implementation of the Fast Multipole Method without multipoles

An implementation of the Fast Multipole Method without multipoles An implementation of the Fast Multipole Method without multipoles Robin Schoemaker Scientific Computing Group Seminar The Fast Multipole Method Theory and Applications Spring 2002 Overview Fast Multipole

More information

A recovery-assisted DG code for the compressible Navier-Stokes equations

A recovery-assisted DG code for the compressible Navier-Stokes equations A recovery-assisted DG code for the compressible Navier-Stokes equations January 6 th, 217 5 th International Workshop on High-Order CFD Methods Kissimmee, Florida Philip E. Johnson & Eric Johnsen Scientific

More information

18.336J/6.335J: Fast methods for partial differential and integral equations. Lecture 13. Instructor: Carlos Pérez Arancibia

18.336J/6.335J: Fast methods for partial differential and integral equations. Lecture 13. Instructor: Carlos Pérez Arancibia 18.336J/6.335J: Fast methods for partial differential and integral equations Lecture 13 Instructor: Carlos Pérez Arancibia MIT Mathematics Department Cambridge, October 24, 2016 Fast Multipole Method (FMM)

More information

PIC Simulation of Magnetic Reconnection with Adaptive Mesh Refinement

PIC Simulation of Magnetic Reconnection with Adaptive Mesh Refinement PIC Simulation of Magnetic Reconnection with Adaptive Mesh Refinement Keizo Fujimoto National Astronomical Observatory of Japan Overview Introduction Particle-in-Cell (PIC) model with adaptive mesh refinement

More information

Classical Field Theory

Classical Field Theory April 13, 2010 Field Theory : Introduction A classical field theory is a physical theory that describes the study of how one or more physical fields interact with matter. The word classical is used in

More information

What place for mathematicians in plasma physics

What place for mathematicians in plasma physics What place for mathematicians in plasma physics Eric Sonnendrücker IRMA Université Louis Pasteur, Strasbourg projet CALVI INRIA Nancy Grand Est 15-19 September 2008 Eric Sonnendrücker (U. Strasbourg) Math

More information

S Subdivide, Preprocess and Conquer: Micromagnetism FEM/BEM-Simulations on Single-Node/Multi-GPU Systems

S Subdivide, Preprocess and Conquer: Micromagnetism FEM/BEM-Simulations on Single-Node/Multi-GPU Systems S4283 - Subdivide, : Micromagnetism FEM/BEM-Simulations on Single-Node/Multi-GPU Systems Elmar Westphal - Forschungszentrum Jülich GmbH 1 Contents Micromagnetism TetraMag, a FEM/BEM Micromagnetism Simulator

More information

Granularity effects Bas van der Geer

Granularity effects Bas van der Geer Granularity effects Bas van der Geer Marieke de Loos Steinar Wouters Edgar Vredenbregt Jom Luiten Eindhoven University The Netherlands Globular cluster Messier 2 by Hubble Space Telescope.. Located in

More information

Large Scale High Resolution Blood Flow Simulations

Large Scale High Resolution Blood Flow Simulations Large Scale High Resolution Blood Flow Simulations Florian Janoschek Jens Harting Federico Toschi Department of Applied Physics, Eindhoven University of Technology, The Netherlands Institute for Computational

More information

Computer simulation methods (2) Dr. Vania Calandrini

Computer simulation methods (2) Dr. Vania Calandrini Computer simulation methods (2) Dr. Vania Calandrini in the previous lecture: time average versus ensemble average MC versus MD simulations equipartition theorem (=> computing T) virial theorem (=> computing

More information

Transactions on Modelling and Simulation vol 18, 1997 WIT Press, ISSN X

Transactions on Modelling and Simulation vol 18, 1997 WIT Press,  ISSN X Application of the fast multipole method to the 3-D BEM analysis of electron guns T. Nishida and K. Hayami Department of Mathematical Engineering and Information Physics, School of Engineering, University

More information

Improving Dynamical Core Scalability, Accuracy, and Limi:ng Flexibility with the ADER- DT Time Discre:za:on

Improving Dynamical Core Scalability, Accuracy, and Limi:ng Flexibility with the ADER- DT Time Discre:za:on Improving Dynamical Core Scalability, Accuracy, and Limi:ng Flexibility with the ADER- DT Time Discre:za:on Matthew R. Norman Scientific Computing Group National Center for Computational Sciences Oak Ridge

More information

An Overview of Fast Multipole Methods

An Overview of Fast Multipole Methods An Overview of Fast Multipole Methods Alexander T. Ihler April 23, 2004 Sing, Muse, of the low-rank approximations Of spherical harmonics and O(N) computation... Abstract We present some background and

More information

An H-LU Based Direct Finite Element Solver Accelerated by Nested Dissection for Large-scale Modeling of ICs and Packages

An H-LU Based Direct Finite Element Solver Accelerated by Nested Dissection for Large-scale Modeling of ICs and Packages PIERS ONLINE, VOL. 6, NO. 7, 2010 679 An H-LU Based Direct Finite Element Solver Accelerated by Nested Dissection for Large-scale Modeling of ICs and Packages Haixin Liu and Dan Jiao School of Electrical

More information

Performance Analysis of Lattice QCD Application with APGAS Programming Model

Performance Analysis of Lattice QCD Application with APGAS Programming Model Performance Analysis of Lattice QCD Application with APGAS Programming Model Koichi Shirahata 1, Jun Doi 2, Mikio Takeuchi 2 1: Tokyo Institute of Technology 2: IBM Research - Tokyo Programming Models

More information

Case Study: Quantum Chromodynamics

Case Study: Quantum Chromodynamics Case Study: Quantum Chromodynamics Michael Clark Harvard University with R. Babich, K. Barros, R. Brower, J. Chen and C. Rebbi Outline Primer to QCD QCD on a GPU Mixed Precision Solvers Multigrid solver

More information

Parallel Preconditioning Methods for Ill-conditioned Problems

Parallel Preconditioning Methods for Ill-conditioned Problems Parallel Preconditioning Methods for Ill-conditioned Problems Kengo Nakajima Information Technology Center, The University of Tokyo 2014 Conference on Advanced Topics and Auto Tuning in High Performance

More information

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar. Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar. 25, 2002 Molecular Dynamics: Introduction At physiological conditions, the

More information

Some Geometric and Algebraic Aspects of Domain Decomposition Methods

Some Geometric and Algebraic Aspects of Domain Decomposition Methods Some Geometric and Algebraic Aspects of Domain Decomposition Methods D.S.Butyugin 1, Y.L.Gurieva 1, V.P.Ilin 1,2, and D.V.Perevozkin 1 Abstract Some geometric and algebraic aspects of various domain decomposition

More information

White dwarf dynamical interactions. Enrique García-Berro. Jornades de Recerca, Departament de Física

White dwarf dynamical interactions. Enrique García-Berro. Jornades de Recerca, Departament de Física White dwarf dynamical interactions Enrique García-Berro Jornades de Recerca, Departament de Física CONTENTS 1. Introduction 2. Smoothed Particle Hydrodynamics 3. White dwarf mergers 4. White dwarf collisions

More information

Density Functional Modeling of Nanocrystalline Materials

Density Functional Modeling of Nanocrystalline Materials Density Functional Modeling of Nanocrystalline Materials A new approach for modeling atomic scale properties in materials Peter Stefanovic Supervisor: Nikolas Provatas 70 / Part 1-7 February 007 Density

More information

Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains

Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains Martin J. Gander and Felix Kwok Section de mathématiques, Université de Genève, Geneva CH-1211, Switzerland, Martin.Gander@unige.ch;

More information

Block-Structured Adaptive Mesh Refinement

Block-Structured Adaptive Mesh Refinement Block-Structured Adaptive Mesh Refinement Lecture 2 Incompressible Navier-Stokes Equations Fractional Step Scheme 1-D AMR for classical PDE s hyperbolic elliptic parabolic Accuracy considerations Bell

More information

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation Tao Zhao 1, Feng-Nan Hwang 2 and Xiao-Chuan Cai 3 Abstract In this paper, we develop an overlapping domain decomposition

More information

A parallel finite element multigrid framework for geodynamic simulations with more than ten trillion unknowns

A parallel finite element multigrid framework for geodynamic simulations with more than ten trillion unknowns A parallel finite element multigrid framework for geodynamic simulations with more than ten trillion unknowns Dominik Bartuschat, Ulrich Rüde Chair for System Simulation, University of Erlangen-Nürnberg

More information

Chenhan D. Yu The 3rd BLIS Retreat Sep 28, 2015

Chenhan D. Yu The 3rd BLIS Retreat Sep 28, 2015 GSKS GSKNN BLIS-Based High Performance Computing Kernels in N-body Problems Chenhan D. Yu The 3rd BLIS Retreat Sep 28, 2015 N-body Problems Hellstorm Astronomy and 3D https://youtu.be/bllwkx_mrfk 2 N-body

More information

Grid-Free Plasma Simulation Techniques

Grid-Free Plasma Simulation Techniques IEEE TRANSACTIONS ON PLASMA SCIENCE, VOL. 34, NO. 2, APRIL 2006 149 Grid-Free Plasma Simulation Techniques Andrew J. Christlieb, Member, IEEE, Robert Krasny, John P. Verboncoeur, Member, IEEE, Jerold W.

More information

Adaptive numerical simulation of Streamer. Shailendra Singh

Adaptive numerical simulation of Streamer. Shailendra Singh Adaptive numerical simulation of Streamer Shailendra Singh Tools: BDV from Drift diffusion calculation Motivation: Streamer criteria works very well in moderately uniform field. But in highly non uniform

More information

Multigrid Methods and their application in CFD

Multigrid Methods and their application in CFD Multigrid Methods and their application in CFD Michael Wurst TU München 16.06.2009 1 Multigrid Methods Definition Multigrid (MG) methods in numerical analysis are a group of algorithms for solving differential

More information