Fast Algorithms for Many-Particle Simulations on Peta-Scale Parallel Architectures
|
|
- Anissa Rose Parker
- 5 years ago
- Views:
Transcription
1 Mitglied der Helmholtz-Gemeinschaft Fast Algorithms for Many-Particle Simulations on Peta-Scale Parallel Architectures Godehard Sutmann Institute for Advanced Simulation (IAS) - Jülich Supercomputing Centre (JSC) Research Centre Jülich g.sutmann@fz-juelich.de
2 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 2
3 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 3
4 Atomistic Models of Matter Galaxy Li F in water 4
5 Molecular dynamics Particle systems are described by a Hamiltonian Kinetic Energy Potential Energy Particle trajectories are generated by solving classical equations of motion 5
6 Long range interactions Typical form for a force field (AMBER / Sander) Classification of potentials Examples: Coulomb interactions (charge-charge, dipole-dipole) Gravitational potential 6
7 Long Range Interactions Potential energy calculations 7
8 Long Range Interactions Hierarchical potential energy calculations 8
9 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 9
10 Methods for long range interactions Periodic systems Grid free: Plain Ewald summation O(N3/2) Fast multipole method (FMM) O(N) Tree-algorithms (Barnes-Hut) O(N log(n)) Grid based: Particle Mesh Ewald (PME) O(N log(n)) Particle-Particle Particle-Mesh Method (P3M) O(N log(n)) Open systems Grid free: Tree-algorithms (Barnes-Hut) O(N log(n)) Fast multipole method (FMM) O(N) Grid based: Multigrid based method (P3MG) O(N) List not complete 10
11 Classification of Algorithms Fully Hierarchical Hierarchical Splitting Methods complete subdivision of space complete subdivision of space continuous description from sources to far-field splitting between short- and longrange contributions hierarchical ordering of field contributions fast methods to solve long range contributions mesh-less description (often) grid-based description sources: charges, multipoles sources: pseudo-charges res.: physical potentials, fields res.: lr: pseudo-potentials / -fields sr: potentials + corrections expl: Fast Multipole Method Barnes-Hut Tree Method expl: P3M, SPME, multigrid 11
12 Why different methods? Multigrid Elliptic PDE solver (Poisson, Helmholtz) & non-linear (Poisson-Boltzmann) allows for dielectric shading prescription of desired boundary potentials (capacitors) Barnes-Hut Tree Method strongly inhomogeneous particle distributions very complex system geometries Fast Multipole Method high accuracy calculations full error control All methods are scalable on parallel architectures 12
13 Schematic differences of hierarchical methods Multigrid Barnes-Hut Tree method Fast multipole method 13
14 Acknowlegements FMM Holger Dachsel, Ivo Kabadshow Barnes-Hut Paul Gibbon, Lukas Arnold, Robert Speck, Mathias Winkel Multigrid Matthias Bolten, Bernhard Steffen Parallel Sorting Michael Hofmann (TU Chemnitz) Interface Lidia Westphal, Rene Halver Financial support from BMBF 14
15 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 15
16 Multigrid for Poisson Equation Solve PDE on the finest grid with Dirichlet boundary conditions Use finite differences to approximate PDE Get an approximation for the potential from where an error ε and residuum r is obtained These obey the same differential equation Get correction to approximation: (repeat until convergence) 16
17 Multigrid for Poisson Equation Multigrid: solve for the error onto a hierarchy of coarse grids 2Lh Low frequency components on the fine grid get high frequency components on the coarse grid (fast error reduction) Forward MG: Restrict residuum Backward MG: Prolongate residuum e.g. V cycle 17
18 Complexity Natural complexity of long-range pair-wise interactions is O(N2) Multigrid solver reduces complexity of field calculations to O(N) niter fixed on each level 18
19 Convergence of Multigrid Test system of 4096 particles with (25+1)3 grid points Multigrid calculation 19
20 Convergence of Multigrid Test system of 4096 particles with (25)3 grid points Single grid calculation 20
21 Electrostatics: finite difference solution for point particles In molecular simulations, source terms are point charges, which create a potential, diverging at the origin Sampling problem on a discrete grid Resolution problem depending on grid spacing Large error close to the source term Also far field poisoned by discretization errors 21
22 Electrostatic Regularization Splitting of the potential into near- and far-field (analogon to Ewald summation) 22
23 Scaling 23
24 Multigrid Parallelization Exchange of neighbor grid points on each hierarchy level Data non-locality, examples Ng = 1283, i.e. (27)3, i.e. 7 MG-levels Np = 512, i.e. 83, i.e. 5 local levels (128,64,32,16,8) r ~ 2.5x104 Ng = 20483, i.e. (211)3, i.e. 11 MG-levels Np = 65536, i.e. 323, i.e. 7 local levels (2048,1024,512,256,128,64,32) r ~ 2x106 24
25 Parallel Fast Multipole Method Parallelization: Domain decomposition MPI (cartesian communicators) Asynchronous communications only nearest neighbor communication Strong scaling: N= point stencil Strong scaling: N= unknowns/core 25
26 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 26
27 Tree Algorithms Organize physical space in a hierarchical way (relative to each particle) Interactions are weighted according to their importance Trees may be organized in different ways, e.g. 2d trees: binary-, quad-, oct-trees in 1,2,3 dimensions Principle is to sort particles into cells, where on the lowest level only one particle is inside a cell Original work by Barnes & Hut 27
28 Tree Algorithms: grouping criterion Interaction between individual particles and distant pseudo-particles (multipoles) Which criterion is adopted for grouping? If s/d < θ : internal structure of the pseudoparticle is ignored If s/d > θ : the node is resolved and the criterion is checked for every child At latest this procedure stops, when a leaf node is reached 28
29 Sorted list of keys maps 3D particle coordinates onto 1D space-filling curve Sort keys according to their size in ASCII representation This sequence will be a space filling curve (Morton, Z, Lebesque curve) finish start 29
30 PEPC Barnes-Hut algorithm implemented into PEPC (Pretty Efficient Parallel Coulomb-Solver) developed by Paul Gibbon (JSC) for plasma-laser interaction reduction of the n-body O(N2) complexity to O(N log N) OpenSource, freely available F90 and MPI part of ScaFaCoS library part of DEISA and PRACE benchmark suites 30
31 Scientific Application PEPC is highly flexible and comes in different flavours PEPC-E Coulomb interaction PEPC-B Magnetic fields laser-plasma interaction PEPC-V vortex fluid methods PEPC-G Newtonian gravitation extends to smooth particle hydrodynamics 31
32 Parallel Performance (pure MPI version) PEPC on IBM BG/P 32
33 Hybrid PEPC MPI + POSIX threads reduce metadata overhead communication thread + multiple worker threads overlap of communication and computation first successful runs on 300k cores of jugene system size N > 2x109 33
34 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 34
35 Long range contributions For general orientations of coordinate vectors the expansion is Define 35
36 Extensions: rotation-based FMM Conventional FMM has complexity O(L4) with respect to the multipole length Costs can be reduced to O(L3) if Legendre polynomials are used 36
37 Fast electrostatics Fast Multipole Method FMM is subdivided into 5+1 passes Pass 0: determine optimal FMM parameter set Pass 1: form and shift multipole expansions Pass 2: transform distant multipole expansions Pass 3: shift Taylor like expansions Pass 4: calculate far field energy Pass 5: calculate near field energy 37
38 Fast Multipole Method Pass 0 Determine FMM parameters Seperation criteria ws Depth of the FMM tree Length of the multipole expansion L Optimisation of run time behavior Pass 4 Sum up far field energies Pass 5 Sum up near field energies 38
39 Fast electrostatics Fast Multipole Method Computational complexity: Every Pass consists of operations of O(N), > Fast Multipole Method has optimal scaling with number of particles! 39
40 Periodic Boundary Conditions Implementation straight forward (although involved) for 1d, 2d, 3d-periodicity Scaling of O(N) Full error control 40
41 FMM Parallelization 41
42 Parallel Fast Multipole Method Parallelization: Remote memory access (RMA) Point-to-point: One sided communication Put, Fence, Notify, Notifywait Jugene Contigous data transfer ARMCI / A1 Collective operations MPI Allgather, Allreduce, Barrier Juropa 42
43 Parallel Fast Multipole Method Weak scaling on Jugene and Juropa (Load: 86= particles/core) 43
44 Parallel Fast Multipole Method Strong scaling on Jugene (Load: N = 10243) 44
45 Parallel Fast Multipole Method Strong scaling on Juropa (Load: N = 10243) 45
46 FMM World record (Dec. 2010) After memory optimization: 50 Bytes / particle 512 Mbytes / core x cores / 50 Bytes / particle 3.0 x 1012 particles Largest simulation with N = particles = = 3.0 Trillion 46
47 Outline Many-Body simulations Methods for long range interactions in many-body systems Multigrid Method Barnes-Hut Tree Method Fast Multipole Method Coupling Methods to Codes 47
48 BMBF Project ScaFaCoS Scalable Library for Fast Coulomb Solver Coordinator: Forschungszentrum Jülich (IAS/JSC) University Partners: Bonn Chemnitz Stuttgart Wuppertal Research Centres: MPI Mainz (Polymer) Industry: BASF Ludwigshafen Cognis Düsseldorf (BASF) IBM Dresden Jülich Cognis Wuppertal SCAI Bonn IBM Chemnitz MPI BASF Stuttgart 48
49 BMBF Project ScaFaCoS Interfacing Programs Scalable Fast Coulomb Solvers liblr Particle2Grid Communication Ewald P3M FMM BHT P3MG Fast summation Long Range Interactions Multipole libsr S c h n i t t s t e l l e Communication Neighbor search Potential fcts. Short Range Interactions Raumf.Kurven Parallel Sorting Parallel NFFT, AFFT Parallel Multigrid Parallel Interface AnwendungsProgramm 1 (MD, MC, BD, ) AnwendungsProgramm N (MD, MC, BD, ) 49
50 Yes Insert essential solver-specific data into a new FCSHandle-object Call fcs_init to set method and MPI communicator Calls fcs_fmm_init Calls fcs_p3m_init Calls fcs_pp3mg_init Calls fcs_nfft_init Calls fcs_pepc_init Calls fcs_direct_init Calls fcs_pepc_tune Calls fcs_direct_tune Calls fcs_pepc_run Calls fcs_direct_run Insert optional solver-specific data into FCSHandle-object Call fcs_tune to calculate solver-specific parameters based on positions / charges Calls fcs_fmm_tune Calls fcs_p3m_tune Calls fcs_pp3mg_tune Calls fcs_nfft_tune No Call fcs_run to calculate a time step with the chosen solver Calls fcs_fmm_run Calls fcs_p3m_run Calls fcs_pp3mg_run Calls fcs_nfft_run Yes No Next time step with same configuration? Change solver? Calculate another time step at all? Call fcshandle_destroy to deallocate FCSHandle-object Simulation ends 50
51 Thank you for your Attention 51
Long-range Interactions in Particle Simulations ScaFaCoS. Olaf Lenz
Long-range Interactions in Particle Simulations ScaFaCoS 1/33 Outline Many-particle Simulations Short-range Interactions Long-range Interactions: ScaFaCoS If time permits: ESPResSo and coarse-graining
More informationThe Blue Gene/P at Jülich Case Study & Optimization. W.Frings, Forschungszentrum Jülich,
The Blue Gene/P at Jülich Case Study & Optimization W.Frings, Forschungszentrum Jülich, 26.08.2008 Jugene Case-Studies: Overview Case Study: PEPC Case Study: racoon Case Study: QCD CPU0CPU3 CPU1CPU2 2
More informationA Fast N-Body Solver for the Poisson(-Boltzmann) Equation
A Fast N-Body Solver for the Poisson(-Boltzmann) Equation Robert D. Skeel Departments of Computer Science (and Mathematics) Purdue University http://bionum.cs.purdue.edu/2008december.pdf 1 Thesis Poisson(-Boltzmann)
More informationFast Methods for Long-Range Interactions in Complex Systems. IAS Series. Volume 6 ISBN
IAS Series This publication was edited at the Jülich Supercomputing Centre (JSC) which is an integral part of the Institute for Advanced Simulation (IAS). The IAS combines the Jülich simulation sciences
More informationMultipole-Based Preconditioners for Sparse Linear Systems.
Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Supported by the National Science Foundation. Overview Summary of Contributions Generalized Stokes Problem Solenoidal
More informationSimulation Laboratories at JSC
Mitglied der Helmholtz-Gemeinschaft Simulation Laboratories at JSC Paul Gibbon Jülich Supercomputing Centre Jülich Supercomputing Centre Supercomputer operation for Centre FZJ Regional JARA Helmholtz &
More informationTreecodes for Cosmology Thomas Quinn University of Washington N-Body Shop
Treecodes for Cosmology Thomas Quinn University of Washington N-Body Shop Outline Motivation Multipole Expansions Tree Algorithms Periodic Boundaries Time integration Gravitational Softening SPH Parallel
More informationThe Fast Multipole Method in molecular dynamics
The Fast Multipole Method in molecular dynamics Berk Hess KTH Royal Institute of Technology, Stockholm, Sweden ADAC6 workshop Zurich, 20-06-2018 Slide BioExcel Slide Molecular Dynamics of biomolecules
More informationHideyuki Usui 1,3, M. Nunami 2,3, Y. Yagi 1,3, T. Moritaka 1,3, and JST/CREST multi-scale PIC simulation team
Hideyuki Usui 1,3, M. Nunami 2,3, Y. Yagi 1,3, T. Moritaka 1,3, and JST/CREST multi-scale PIC simulation team 1 Kobe Univ., Japan, 2 NIFS,Japan, 3 JST/CREST, Outline Multi-scale interaction between weak
More informationAdvanced Molecular Molecular Dynamics
Advanced Molecular Molecular Dynamics Technical details May 12, 2014 Integration of harmonic oscillator r m period = 2 k k and the temperature T determine the sampling of x (here T is related with v 0
More informationBrad Gibson Centre for Astrophysics & Supercomputing Swinburne University
N-body Simulations: Tree and Mesh Approaches Brad Gibson Centre for Astrophysics & Supercomputing Swinburne University Demystifying the Jargon Trees Collisionless Boltzmann Equation Symplectic Time Integration
More informationMitglied der Helmholtz-Gemeinschaft. Plasma Simulation at the Jülich Supercomputing Centre
Mitglied der Helmholtz-Gemeinschaft Plasma Simulation at the Jülich Supercomputing Centre 27 October 2008 Paul Gibbon JSC s Mission: Provide supercomputer resources of highest performance and widest scope;
More informationContents. Preface... xi. Introduction...
Contents Preface... xi Introduction... xv Chapter 1. Computer Architectures... 1 1.1. Different types of parallelism... 1 1.1.1. Overlap, concurrency and parallelism... 1 1.1.2. Temporal and spatial parallelism
More informationParallel programming using MPI. Analysis and optimization. Bhupender Thakur, Jim Lupo, Le Yan, Alex Pacheco
Parallel programming using MPI Analysis and optimization Bhupender Thakur, Jim Lupo, Le Yan, Alex Pacheco Outline l Parallel programming: Basic definitions l Choosing right algorithms: Optimal serial and
More informationPractical in Numerical Astronomy, SS 2012 LECTURE 9
Practical in Numerical Astronomy, SS 01 Elliptic partial differential equations. Poisson solvers. LECTURE 9 1. Gravity force and the equations of hydrodynamics. Poisson equation versus Poisson integral.
More informationElliptic Problems / Multigrid. PHY 604: Computational Methods for Physics and Astrophysics II
Elliptic Problems / Multigrid Summary of Hyperbolic PDEs We looked at a simple linear and a nonlinear scalar hyperbolic PDE There is a speed associated with the change of the solution Explicit methods
More informationPartial Differential Equations
Partial Differential Equations Introduction Deng Li Discretization Methods Chunfang Chen, Danny Thorne, Adam Zornes CS521 Feb.,7, 2006 What do You Stand For? A PDE is a Partial Differential Equation This
More informationParticle Methods. Lecture Reduce and Broadcast: A function viewpoint
Lecture 9 Particle Methods 9.1 Reduce and Broadcast: A function viewpoint [This section is being rewritten with what we hope will be the world s clearest explanation of the fast multipole algorithm. Readers
More informationAspects of Multigrid
Aspects of Multigrid Kees Oosterlee 1,2 1 Delft University of Technology, Delft. 2 CWI, Center for Mathematics and Computer Science, Amsterdam, SIAM Chapter Workshop Day, May 30th 2018 C.W.Oosterlee (CWI)
More informationCS 542G: The Poisson Problem, Finite Differences
CS 542G: The Poisson Problem, Finite Differences Robert Bridson November 10, 2008 1 The Poisson Problem At the end last time, we noticed that the gravitational potential has a zero Laplacian except at
More informationA Jacobi Davidson Method with a Multigrid Solver for the Hermitian Wilson-Dirac Operator
A Jacobi Davidson Method with a Multigrid Solver for the Hermitian Wilson-Dirac Operator Artur Strebel Bergische Universität Wuppertal August 3, 2016 Joint Work This project is joint work with: Gunnar
More informationPerformance Evaluation of Scientific Applications on POWER8
Performance Evaluation of Scientific Applications on POWER8 2014 Nov 16 Andrew V. Adinetz 1, Paul F. Baumeister 1, Hans Böttiger 3, Thorsten Hater 1, Thilo Maurer 3, Dirk Pleiter 1, Wolfram Schenck 4,
More informationINTRODUCTION TO FAST MULTIPOLE METHODS LONG CHEN
INTRODUCTION TO FAST MULTIPOLE METHODS LONG CHEN ABSTRACT. This is a simple introduction to fast multipole methods for the N-body summation problems. Low rank approximation plus hierarchical decomposition
More informationDifferential Algebra (DA) based Fast Multipole Method (FMM)
Differential Algebra (DA) based Fast Multipole Method (FMM) Department of Physics and Astronomy Michigan State University June 27, 2010 Space charge effect Space charge effect : the fields of the charged
More informationNumerical Simulations. Duncan Christie
Numerical Simulations Duncan Christie Motivation There isn t enough time to derive the necessary methods to do numerical simulations, but there is enough time to survey what methods and codes are available
More informationParallel Simulations of Self-propelled Microorganisms
Parallel Simulations of Self-propelled Microorganisms K. Pickl a,b M. Hofmann c T. Preclik a H. Köstler a A.-S. Smith b,d U. Rüde a,b ParCo 2013, Munich a Lehrstuhl für Informatik 10 (Systemsimulation),
More informationA THE FAST MULTIPLE METHOD FOR GRIDLESS PARTICLEii
A-89 804 THE FAST MULTIPLE METHOD FOR GRIDLESS PARTICLEii UNCLASSIFIED SIMULATIONSU) YALE UNIV NEW HAVEN CT DEPT OF COMPUTER SCIENCE J AMBROSIANO ET AL SEP 87 YALEU/DCS/RR-565 N888t14-8-k-S F/G 12/1 NL
More informationIterative Methods and Multigrid
Iterative Methods and Multigrid Part 1: Introduction to Multigrid 1 12/02/09 MG02.prz Error Smoothing 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Initial Solution=-Error 0 10 20 30 40 50 60 70 80 90 100 DCT:
More informationParticle-Simulation Methods for Fluid Dynamics
Particle-Simulation Methods for Fluid Dynamics X. Y. Hu and Marco Ellero E-mail: Xiangyu.Hu and Marco.Ellero at mw.tum.de, WS 2012/2013: Lectures for Mechanical Engineering Institute of Aerodynamics Technical
More informationMOLECULAR DYNAMICS SIMULATIONS OF BIOMOLECULES: Long-Range Electrostatic Effects
Annu. Rev. Biophys. Biomol. Struct. 1999. 28:155 79 Copyright c 1999 by Annual Reviews. All rights reserved MOLECULAR DYNAMICS SIMULATIONS OF BIOMOLECULES: Long-Range Electrostatic Effects Celeste Sagui
More informationTREECODES FOR POTENTIAL AND FORCE APPROXIMATIONS. A Dissertation KASTHURI SRINIVASAN KANNAN
TREECODES FOR POTENTIAL AND FORCE APPROXIMATIONS A Dissertation by KASTHURI SRINIVASAN KANNAN Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements
More informationA Comparison of Adaptive Algebraic Multigrid and Lüscher s Inexact Deflation
A Comparison of Adaptive Algebraic Multigrid and Lüscher s Inexact Deflation Andreas Frommer, Karsten Kahl, Stefan Krieg, Björn Leder and Matthias Rottmann Bergische Universität Wuppertal April 11, 2013
More informationEfficient Particle Simulation of a Virtual Cathode Using a Grid-Free Treecode Poisson Solver
384 IEEE TRANSACTIONS ON PLASMA SCIENCE, VOL. 32, NO. 2, APRIL 2004 Efficient Particle Simulation of a Virtual Cathode Using a Grid-Free Treecode Poisson Solver Andrew J. Christlieb, Robert Krasny, and
More informationKinetic Solvers with Adaptive Mesh in Phase Space for Low- Temperature Plasmas
Kinetic Solvers with Adaptive Mesh in Phase Space for Low- Temperature Plasmas Vladimir Kolobov, a,b,1 Robert Arslanbekov a and Dmitry Levko a a CFD Research Corporation, Huntsville, AL 35806, USA b The
More informationA Crash Course on Fast Multipole Method (FMM)
A Crash Course on Fast Multipole Method (FMM) Hanliang Guo Kanso Biodynamical System Lab, 2018 April What is FMM? A fast algorithm to study N body interactions (Gravitational, Coulombic interactions )
More informationA Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries
A Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries SC13, November 21 st 2013 Christian Godenschwager, Florian Schornbaum, Martin Bauer, Harald Köstler, Ulrich
More informationModelling and implementation of algorithms in applied mathematics using MPI
Modelling and implementation of algorithms in applied mathematics using MPI Lecture 3: Linear Systems: Simple Iterative Methods and their parallelization, Programming MPI G. Rapin Brazil March 2011 Outline
More informationINTEGRAL-EQUATION-BASED FAST ALGORITHMS AND GRAPH-THEORETIC METHODS FOR LARGE-SCALE SIMULATIONS
INTEGRAL-EQUATION-BASED FAST ALGORITHMS AND GRAPH-THEORETIC METHODS FOR LARGE-SCALE SIMULATIONS Bo Zhang A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial
More informationINTRODUCTION TO FINITE ELEMENT METHODS ON ELLIPTIC EQUATIONS LONG CHEN
INTROUCTION TO FINITE ELEMENT METHOS ON ELLIPTIC EQUATIONS LONG CHEN CONTENTS 1. Poisson Equation 1 2. Outline of Topics 3 2.1. Finite ifference Method 3 2.2. Finite Element Method 3 2.3. Finite Volume
More informationCoupling atomistic and continuum modelling of magnetism
Coupling atomistic and continuum modelling of magnetism M. Poluektov 1,2 G. Kreiss 2 O. Eriksson 3 1 University of Warwick WMG International Institute for Nanocomposites Manufacturing 2 Uppsala University
More informationICON. The Icosahedral Nonhydrostatic modelling framework
ICON The Icosahedral Nonhydrostatic modelling framework Basic formulation, NWP and high-performance computing aspects, and its perspective towards a unified model for seamless prediction Günther Zängl,
More information5. FVM discretization and Solution Procedure
5. FVM discretization and Solution Procedure 1. The fluid domain is divided into a finite number of control volumes (cells of a computational grid). 2. Integral form of the conservation equations are discretized
More informationAnalysis of PFLOTRAN on Jaguar
Analysis of PFLOTRAN on Jaguar Kevin Huck, Jesus Labarta, Judit Gimenez, Harald Servat, Juan Gonzalez, and German Llort CScADS Workshop on Performance Tools for Petascale Computing Snowbird, UT How this
More informationLecture 12: Solvation Models: Molecular Mechanics Modeling of Hydration Effects
Statistical Thermodynamics Lecture 12: Solvation Models: Molecular Mechanics Modeling of Hydration Effects Dr. Ronald M. Levy ronlevy@temple.edu Bare Molecular Mechanics Atomistic Force Fields: torsion
More informationFast Space Charge Calculations with a Multigrid Poisson Solver & Applications
Fast Space Charge Calculations with a Multigrid Poisson Solver & Applications Gisela Pöplau Ursula van Rienen Rostock University DESY, Hamburg, April 26, 2005 Bas van der Geer Marieke de Loos Pulsar Physics
More informationA particle-in-cell method with adaptive phase-space remapping for kinetic plasmas
A particle-in-cell method with adaptive phase-space remapping for kinetic plasmas Bei Wang 1 Greg Miller 2 Phil Colella 3 1 Princeton Institute of Computational Science and Engineering Princeton University
More informationRWTH Aachen University
IPCC @ RWTH Aachen University Optimization of multibody and long-range solvers in LAMMPS Rodrigo Canales William McDoniel Markus Höhnerbach Ahmed E. Ismail Paolo Bientinesi IPCC Showcase November 2016
More informationMultiscale simulations of complex fluid rheology
Multiscale simulations of complex fluid rheology Michael P. Howard, Athanassios Z. Panagiotopoulos Department of Chemical and Biological Engineering, Princeton University Arash Nikoubashman Institute of
More informationNumerical Methods for Partial Differential Equations: an Overview.
Numerical Methods for Partial Differential Equations: an Overview math652_spring2009@colorstate PDEs are mathematical models of physical phenomena Heat conduction Wave motion PDEs are mathematical models
More informationAnalyzing the Error Bounds of Multipole-Based
Page 1 of 12 Analyzing the Error Bounds of Multipole-Based Treecodes Vivek Sarin, Ananth Grama, and Ahmed Sameh Department of Computer Sciences Purdue University W. Lafayette, IN 47906 Email: Sarin, Grama,
More informationN-body simulations. Phys 750 Lecture 10
N-body simulations Phys 750 Lecture 10 N-body simulations Forward integration in time of strongly coupled classical particles or (rigid-body composite objects) dissipation Brownian motion Very general
More informationAn FPGA Implementation of Reciprocal Sums for SPME
An FPGA Implementation of Reciprocal Sums for SPME Sam Lee and Paul Chow Edward S. Rogers Sr. Department of Electrical and Computer Engineering University of Toronto Objectives Accelerate part of Molecular
More informationRobust solution of Poisson-like problems with aggregation-based AMG
Robust solution of Poisson-like problems with aggregation-based AMG Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire Paris, January 26, 215 Supported by the Belgian FNRS http://homepages.ulb.ac.be/
More informationCompact High Order Finite Difference Stencils for Elliptic Variable Coefficient and Interface Problems
Compact High Order Finite Difference Stencils for Elliptic Variable Coefficient and Interface Problems Daniel Ritter 1, Ulrich Rüde 1, Björn Gmeiner 1, Rochus Schmid 2 Copper Mountain, March 18th, 2013
More informationChapter 5. Methods for Solving Elliptic Equations
Chapter 5. Methods for Solving Elliptic Equations References: Tannehill et al Section 4.3. Fulton et al (1986 MWR). Recommended reading: Chapter 7, Numerical Methods for Engineering Application. J. H.
More information- Part 4 - Multicore and Manycore Technology: Chances and Challenges. Vincent Heuveline
- Part 4 - Multicore and Manycore Technology: Chances and Challenges Vincent Heuveline 1 Numerical Simulation of Tropical Cyclones Goal oriented adaptivity for tropical cyclones ~10⁴km ~1500km ~100km 2
More informationComputational Astrophysics 1 Particle Mesh methods
Computational Astrophysics 1 Particle Mesh methods Oscar Agertz Outline - The Vlassov-Poisson equation - The Particle-Mesh method in a nut shell - Symplectic Integrators - Mass assignment and force interpolation
More informationSPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics
SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS
More informationOptimizing GROMACS for parallel performance
Optimizing GROMACS for parallel performance Outline 1. Why optimize? Performance status quo 2. GROMACS as a black box. (PME) 3. How does GROMACS spend its time? (MPE) 4. What you can do What I want to
More informationNumerically Solving Partial Differential Equations
Numerically Solving Partial Differential Equations Michael Lavell Department of Applied Mathematics and Statistics Abstract The physics describing the fundamental principles of fluid dynamics can be written
More informationDynamics of Galaxies: Basics. Frontiers in Numerical Gravitational Astrophysics July 3, 2008
Dynamics of Galaxies: Basics Frontiers in Numerical Gravitational Astrophysics July 3, 2008 c = G = 1 h = 0 Outline (a) Galaxies as physical systems. (b) The Collisionless Boltzmann Equation. (c) N-body
More informationPIConGPU Bringing Large-Scale Laser Plasma Simulations to GPU Supercomputing
PIConGPU Bringing Large-Scale Laser Plasma Simulations to GPU Supercomputing Michael Bussmann 1, Heiko Burau 1, René Widera 1, Florian Berninger 1, Axel Hübl 1, Thomas Kluge 1, Alexander Debus 1, Ulrich
More informationMolecular Dynamics. A very brief introduction
Molecular Dynamics A very brief introduction Sander Pronk Dept. of Theoretical Physics KTH Royal Institute of Technology & Science For Life Laboratory Stockholm, Sweden Why computer simulations? Two primary
More informationAM 205 Final Project The N-Body Problem
AM 205 Final Project The N-Body Problem Leah Birch Elizabeth Finn Karen Yu December 14, 2012 Abstract The N-Body Problem can be solved using a variety of numeric integrators. Newton s Law of Universal
More informationAn Accurate Fourier-Spectral Solver for Variable Coefficient Elliptic Equations
An Accurate Fourier-Spectral Solver for Variable Coefficient Elliptic Equations Moshe Israeli Computer Science Department, Technion-Israel Institute of Technology, Technion city, Haifa 32000, ISRAEL Alexander
More informationHuge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei
1/20 Huge-Scale Molecular Dynamics Simulation of Multi-bubble Nuclei H. Watanabe ISSP, The M. Suzuki H. Inaoka N. Ito Kyushu University RIKEN AICS The, RIKEN AICS Outline 1. Introduction 2. Benchmark results
More informationNonlinear Iterative Solution of the Neutron Transport Equation
Nonlinear Iterative Solution of the Neutron Transport Equation Emiliano Masiello Commissariat à l Energie Atomique de Saclay /DANS//SERMA/LTSD emiliano.masiello@cea.fr 1/37 Outline - motivations and framework
More informationA Trillion Particles: Studying Large-Scale Structure Formation on the BG/Q
A Trillion Particles: Studying Large-Scale Structure Formation on the BG/Q David Daniel, Patricia Fasel, Hal Finkel, Nicholas Frontiere, Salman Habib, Katrin Heitmann, Zarija Lukic, Adrian Pope July 10,
More informationLong-range interactions: P 3 M, MMMxD, ELC, MEMD and ICC
Long-range interactions: P 3 M, MMMxD, ELC, MEMD and ICC Axel Arnold Institute for Computational Physics Universität Stuttgart October 10, 2012 Long-range interactions gravity Coulomb interaction dipolar
More informationIntroduction to Multigrid Methods
Introduction to Multigrid Methods Chapter 9: Multigrid Methodology and Applications Gustaf Söderlind Numerical Analysis, Lund University Textbooks: A Multigrid Tutorial, by William L Briggs. SIAM 1988
More informationFinite Difference Methods (FDMs) 1
Finite Difference Methods (FDMs) 1 1 st - order Approxima9on Recall Taylor series expansion: Forward difference: Backward difference: Central difference: 2 nd - order Approxima9on Forward difference: Backward
More informationAn implementation of the Fast Multipole Method without multipoles
An implementation of the Fast Multipole Method without multipoles Robin Schoemaker Scientific Computing Group Seminar The Fast Multipole Method Theory and Applications Spring 2002 Overview Fast Multipole
More informationA recovery-assisted DG code for the compressible Navier-Stokes equations
A recovery-assisted DG code for the compressible Navier-Stokes equations January 6 th, 217 5 th International Workshop on High-Order CFD Methods Kissimmee, Florida Philip E. Johnson & Eric Johnsen Scientific
More information18.336J/6.335J: Fast methods for partial differential and integral equations. Lecture 13. Instructor: Carlos Pérez Arancibia
18.336J/6.335J: Fast methods for partial differential and integral equations Lecture 13 Instructor: Carlos Pérez Arancibia MIT Mathematics Department Cambridge, October 24, 2016 Fast Multipole Method (FMM)
More informationPIC Simulation of Magnetic Reconnection with Adaptive Mesh Refinement
PIC Simulation of Magnetic Reconnection with Adaptive Mesh Refinement Keizo Fujimoto National Astronomical Observatory of Japan Overview Introduction Particle-in-Cell (PIC) model with adaptive mesh refinement
More informationClassical Field Theory
April 13, 2010 Field Theory : Introduction A classical field theory is a physical theory that describes the study of how one or more physical fields interact with matter. The word classical is used in
More informationWhat place for mathematicians in plasma physics
What place for mathematicians in plasma physics Eric Sonnendrücker IRMA Université Louis Pasteur, Strasbourg projet CALVI INRIA Nancy Grand Est 15-19 September 2008 Eric Sonnendrücker (U. Strasbourg) Math
More informationS Subdivide, Preprocess and Conquer: Micromagnetism FEM/BEM-Simulations on Single-Node/Multi-GPU Systems
S4283 - Subdivide, : Micromagnetism FEM/BEM-Simulations on Single-Node/Multi-GPU Systems Elmar Westphal - Forschungszentrum Jülich GmbH 1 Contents Micromagnetism TetraMag, a FEM/BEM Micromagnetism Simulator
More informationGranularity effects Bas van der Geer
Granularity effects Bas van der Geer Marieke de Loos Steinar Wouters Edgar Vredenbregt Jom Luiten Eindhoven University The Netherlands Globular cluster Messier 2 by Hubble Space Telescope.. Located in
More informationLarge Scale High Resolution Blood Flow Simulations
Large Scale High Resolution Blood Flow Simulations Florian Janoschek Jens Harting Federico Toschi Department of Applied Physics, Eindhoven University of Technology, The Netherlands Institute for Computational
More informationComputer simulation methods (2) Dr. Vania Calandrini
Computer simulation methods (2) Dr. Vania Calandrini in the previous lecture: time average versus ensemble average MC versus MD simulations equipartition theorem (=> computing T) virial theorem (=> computing
More informationTransactions on Modelling and Simulation vol 18, 1997 WIT Press, ISSN X
Application of the fast multipole method to the 3-D BEM analysis of electron guns T. Nishida and K. Hayami Department of Mathematical Engineering and Information Physics, School of Engineering, University
More informationImproving Dynamical Core Scalability, Accuracy, and Limi:ng Flexibility with the ADER- DT Time Discre:za:on
Improving Dynamical Core Scalability, Accuracy, and Limi:ng Flexibility with the ADER- DT Time Discre:za:on Matthew R. Norman Scientific Computing Group National Center for Computational Sciences Oak Ridge
More informationAn Overview of Fast Multipole Methods
An Overview of Fast Multipole Methods Alexander T. Ihler April 23, 2004 Sing, Muse, of the low-rank approximations Of spherical harmonics and O(N) computation... Abstract We present some background and
More informationAn H-LU Based Direct Finite Element Solver Accelerated by Nested Dissection for Large-scale Modeling of ICs and Packages
PIERS ONLINE, VOL. 6, NO. 7, 2010 679 An H-LU Based Direct Finite Element Solver Accelerated by Nested Dissection for Large-scale Modeling of ICs and Packages Haixin Liu and Dan Jiao School of Electrical
More informationPerformance Analysis of Lattice QCD Application with APGAS Programming Model
Performance Analysis of Lattice QCD Application with APGAS Programming Model Koichi Shirahata 1, Jun Doi 2, Mikio Takeuchi 2 1: Tokyo Institute of Technology 2: IBM Research - Tokyo Programming Models
More informationCase Study: Quantum Chromodynamics
Case Study: Quantum Chromodynamics Michael Clark Harvard University with R. Babich, K. Barros, R. Brower, J. Chen and C. Rebbi Outline Primer to QCD QCD on a GPU Mixed Precision Solvers Multigrid solver
More informationParallel Preconditioning Methods for Ill-conditioned Problems
Parallel Preconditioning Methods for Ill-conditioned Problems Kengo Nakajima Information Technology Center, The University of Tokyo 2014 Conference on Advanced Topics and Auto Tuning in High Performance
More informationWhy Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.
Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar. 25, 2002 Molecular Dynamics: Introduction At physiological conditions, the
More informationSome Geometric and Algebraic Aspects of Domain Decomposition Methods
Some Geometric and Algebraic Aspects of Domain Decomposition Methods D.S.Butyugin 1, Y.L.Gurieva 1, V.P.Ilin 1,2, and D.V.Perevozkin 1 Abstract Some geometric and algebraic aspects of various domain decomposition
More informationWhite dwarf dynamical interactions. Enrique García-Berro. Jornades de Recerca, Departament de Física
White dwarf dynamical interactions Enrique García-Berro Jornades de Recerca, Departament de Física CONTENTS 1. Introduction 2. Smoothed Particle Hydrodynamics 3. White dwarf mergers 4. White dwarf collisions
More informationDensity Functional Modeling of Nanocrystalline Materials
Density Functional Modeling of Nanocrystalline Materials A new approach for modeling atomic scale properties in materials Peter Stefanovic Supervisor: Nikolas Provatas 70 / Part 1-7 February 007 Density
More informationOptimal Interface Conditions for an Arbitrary Decomposition into Subdomains
Optimal Interface Conditions for an Arbitrary Decomposition into Subdomains Martin J. Gander and Felix Kwok Section de mathématiques, Université de Genève, Geneva CH-1211, Switzerland, Martin.Gander@unige.ch;
More informationBlock-Structured Adaptive Mesh Refinement
Block-Structured Adaptive Mesh Refinement Lecture 2 Incompressible Navier-Stokes Equations Fractional Step Scheme 1-D AMR for classical PDE s hyperbolic elliptic parabolic Accuracy considerations Bell
More informationA Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation
A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation Tao Zhao 1, Feng-Nan Hwang 2 and Xiao-Chuan Cai 3 Abstract In this paper, we develop an overlapping domain decomposition
More informationA parallel finite element multigrid framework for geodynamic simulations with more than ten trillion unknowns
A parallel finite element multigrid framework for geodynamic simulations with more than ten trillion unknowns Dominik Bartuschat, Ulrich Rüde Chair for System Simulation, University of Erlangen-Nürnberg
More informationChenhan D. Yu The 3rd BLIS Retreat Sep 28, 2015
GSKS GSKNN BLIS-Based High Performance Computing Kernels in N-body Problems Chenhan D. Yu The 3rd BLIS Retreat Sep 28, 2015 N-body Problems Hellstorm Astronomy and 3D https://youtu.be/bllwkx_mrfk 2 N-body
More informationGrid-Free Plasma Simulation Techniques
IEEE TRANSACTIONS ON PLASMA SCIENCE, VOL. 34, NO. 2, APRIL 2006 149 Grid-Free Plasma Simulation Techniques Andrew J. Christlieb, Member, IEEE, Robert Krasny, John P. Verboncoeur, Member, IEEE, Jerold W.
More informationAdaptive numerical simulation of Streamer. Shailendra Singh
Adaptive numerical simulation of Streamer Shailendra Singh Tools: BDV from Drift diffusion calculation Motivation: Streamer criteria works very well in moderately uniform field. But in highly non uniform
More informationMultigrid Methods and their application in CFD
Multigrid Methods and their application in CFD Michael Wurst TU München 16.06.2009 1 Multigrid Methods Definition Multigrid (MG) methods in numerical analysis are a group of algorithms for solving differential
More information