Sustained Petascale Performance of Seismic Simulations with SeisSol

1 SIAM EX Workshop on Exascale Applied Mathematics Challenges and Opportunities
Sustained Petascale Performance of Seismic Simulations with SeisSol
M. Bader, A. Breuer, A. Heinecke, S. Rettenberger, C. Pelties, A.-A. Gabriel
Technische Universität München, Ludwig-Maximilians-Universität München
SIAM EX, July

2 HPC Meets Geoscience
Alexander Breuer, Alice-Agnes Gabriel, Alexander Heinecke, Christian Pelties, Sebastian Rettenberger

3 Overview and Agenda
SeisSol:
- dynamic rupture and seismic wave propagation
- unstructured tetrahedral meshes
- high-order ADER-DG discretisation
Optimisation for Heterogeneous Petascale Platforms:
- code generation to optimise element-local matrix kernels
- hybrid MPI/OpenMP parallelisation
- offload scheme to address multiphysics
Performance on Tianhe-2, Stampede and SuperMUC:
- weak scaling of wave propagation component
- strong scaling for the 1992 Landers M. earthquake

4 Dynamic Rupture and Earthquake Simulation
[Figure: Tohoku subduction zone: CAD model and tetrahedral mesh (C. Pelties)]
Use of Adaptive Tetrahedral Meshes:
- curved subduction zones that meet the surface at shallow angles
- high impact on uplift for tsunamigenic earthquakes
- complicated fault systems with multiple branches
- non-linear multiphysics dynamic rupture simulation
- goal: automated meshing process (incl. CAD generation)

5 Dynamic Rupture and Earthquake Simulation
[Figure: Landers fault system: simulated ground motion and tetrahedral mesh]

6 Seismic Wave Propagation with SeisSol
Elastic Wave Equations (velocity-stress formulation):
\[ \frac{\partial q}{\partial t} + A \frac{\partial q}{\partial x} + B \frac{\partial q}{\partial y} + C \frac{\partial q}{\partial z} = 0 \]
with
\[ q = (\sigma_{11}, \sigma_{22}, \sigma_{33}, \sigma_{12}, \sigma_{23}, \sigma_{13}, u, v, w)^T \]
A, B and C are sparse 9 x 9 Jacobian matrices whose non-zero entries are built from the Lamé parameters \lambda and \mu and the density \rho.
- high order discontinuous Galerkin discretisation
- ADER-DG: high approximation order in space and time
- additional features: local time stepping, high accuracy of earthquake faulting (full frictional sliding)
Dumbser, Käser et al. [,]
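As a rough illustration of the velocity-stress system above, the following Python sketch builds the Jacobian A for a homogeneous isotropic material and checks that its eigenvalues are the P- and S-wave speeds. The function name `jacobian_a`, the material values and the assumed variable order are illustrative choices, not taken from SeisSol.

```python
import numpy as np

def jacobian_a(lam, mu, rho):
    """Jacobian A of the 3D elastic wave equations in velocity-stress form.

    Assumed variable order: q = (s11, s22, s33, s12, s23, s13, u, v, w).
    """
    A = np.zeros((9, 9))
    A[0, 6] = -(lam + 2.0 * mu)  # d(s11)/dt - (lam + 2 mu) du/dx = 0
    A[1, 6] = -lam               # d(s22)/dt - lam du/dx = 0
    A[2, 6] = -lam               # d(s33)/dt - lam du/dx = 0
    A[3, 7] = -mu                # d(s12)/dt - mu dv/dx = 0
    A[5, 8] = -mu                # d(s13)/dt - mu dw/dx = 0
    A[6, 0] = -1.0 / rho         # du/dt - (1/rho) d(s11)/dx = 0
    A[7, 3] = -1.0 / rho         # dv/dt - (1/rho) d(s12)/dx = 0
    A[8, 5] = -1.0 / rho         # dw/dt - (1/rho) d(s13)/dx = 0
    return A

# Hypothetical, non-dimensional material parameters.
lam, mu, rho = 2.0, 1.0, 1.0
A = jacobian_a(lam, mu, rho)

speeds = np.sort(np.linalg.eigvals(A).real)
c_p = np.sqrt((lam + 2.0 * mu) / rho)  # P-wave speed
c_s = np.sqrt(mu / rho)                # S-wave speed
```

Only 8 of the 81 entries are non-zero, which is the sparsity that the kernel optimisations later in the talk exploit.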

7 SeisSol in a Nutshell: ADER-DG
Update scheme:
\[ Q_k^{n+1} = Q_k^n - \frac{|S_k|}{|J_k|} M^{-1} \left( \sum_{i=1}^{4} F^{-,i} \, I(t^n, t^{n+1}, Q_k^n) \, N_{k,i} A_k^{+} N_{k,i}^{-1} + \sum_{i=1}^{4} F^{+,i,j,h} \, I(t^n, t^{n+1}, Q_{k(i)}^n) \, N_{k,i} A_{k(i)}^{-} N_{k,i}^{-1} \right) + M^{-1} K^{\xi} I(t^n, t^{n+1}, Q_k^n) A_k + M^{-1} K^{\eta} I(t^n, t^{n+1}, Q_k^n) B_k + M^{-1} K^{\zeta} I(t^n, t^{n+1}, Q_k^n) C_k \]
Cauchy Kovalewski:
\[ I(t^n, t^{n+1}, Q_k^n) = \sum_{j=0}^{J} \frac{(t^{n+1} - t^n)^{j+1}}{(j+1)!} \frac{\partial^j}{\partial t^j} Q_k(t^n) \]
\[ (Q_k)_t = -M^{-1} \left( (K^{\xi})^T Q_k A_k + (K^{\eta})^T Q_k B_k + (K^{\zeta})^T Q_k C_k \right) \]
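The Cauchy Kovalewski step can be sketched in a few lines of NumPy: each time derivative is obtained from the previous one through the spatial operator, and the derivatives are summed into the Taylor series of the time integral. This is a minimal sketch under assumed array shapes; `ck_time_integral` and its arguments are hypothetical names, not SeisSol's interface.

```python
import math
import numpy as np

def ck_time_integral(Q, M_inv, K, jacobians, dt, n_terms):
    """Integrate the DOFs Q over [t, t + dt] with a truncated Taylor series.

    Q:         (n_basis, n_quantities) degrees of freedom at time t
    M_inv:     (n_basis, n_basis) inverse mass matrix
    K:         the three stiffness matrices (K_xi, K_eta, K_zeta)
    jacobians: the three element Jacobians (A_k, B_k, C_k)
    n_terms:   number of Taylor terms (j = 0 .. n_terms - 1)
    """
    derivative = Q.copy()
    integral = np.zeros_like(Q)
    for j in range(n_terms):
        integral += dt ** (j + 1) / math.factorial(j + 1) * derivative
        # Next time derivative: (Q)_t = -M^-1 (K_xi^T Q A + K_eta^T Q B + K_zeta^T Q C)
        derivative = -M_inv @ sum(Kc.T @ derivative @ Jc for Kc, Jc in zip(K, jacobians))
    return integral

# Scalar toy problem dq/dt = -q, whose exact time integral is 1 - exp(-dt).
one, zero = np.array([[1.0]]), np.array([[0.0]])
K = (np.array([[0.5]]), zero, zero)
jacobians = (np.array([[2.0]]), zero, zero)
result = ck_time_integral(one, one, K, jacobians, dt=0.1, n_terms=8)
```

The toy problem reduces every matrix to 1 x 1 so that the result can be compared with the closed-form integral.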

8 Optimisation of Sparse Matrix Operations
Apply sparse matrices to multiple DOF-vectors Q_k
[Figure: sparsity patterns appearing in the ADER time integration]
Code Generator for Sparse Kernels (Breuer et al. []):
- avoid overhead of CSR (or similar) data structures; store only the CSR elements vector
- full unrolling of all element operations using a code generator
- use intrinsics and apply blocking to improve vectorisation
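The idea of the code generator can be illustrated with a toy version: because the sparsity pattern is known before compilation, every non-zero becomes one hard-coded statement and no index arrays are read at run time. The sketch below emits plain scalar C code only; it does not reproduce the intrinsics or blocking mentioned above, and `generate_sparse_kernel` is a hypothetical name.

```python
def generate_sparse_kernel(name, matrix, n_cols):
    """Emit C source for out += matrix * inp with all non-zeros hard-coded.

    matrix: dense list of rows; zero entries produce no code at all.
    n_cols: number of columns of the row-major dense operand `inp`.
    """
    lines = [f"void {name}(const double *inp, double *out) {{"]
    for i, row in enumerate(matrix):
        for k, value in enumerate(row):
            if value == 0.0:
                continue  # dropped at generation time, so no index lookup at run time
            for j in range(n_cols):
                lines.append(
                    f"  out[{i * n_cols + j}] += {value!r} * inp[{k * n_cols + j}];"
                )
    lines.append("}")
    return "\n".join(lines)

# Hypothetical 3 x 3 sparse operator applied to a 3 x 2 block of DOFs.
K = [[0.0, 2.0, 0.0],
     [0.0, 0.0, 0.0],
     [1.5, 0.0, -1.0]]
source = generate_sparse_kernel("k_times_q", K, n_cols=2)
```

Printing `source` gives a C function with one statement per non-zero and per column of the dense operand.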

9 Optimisation of Sparse Matrix Operations
Apply sparse matrices to multiple DOF-vectors Q_k
[Figure: computation of the first and second time derivatives of Q_k in the recursions of the ADER time integration for a fifth-order method; ivory blocks hit zero blocks]
Dense vs. Sparse Kernels (Breuer et al. []):
- switch to dense kernels depending on achieved time to solution
- exploit zero-blocks generated during recursive CK computation
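The zero blocks can be counted with a short calculation. The sketch assumes a hierarchical polynomial basis on tetrahedra, so that each time derivative in the CK recursion lowers the polynomial degree by one and the rows belonging to higher-degree basis functions become zero. The function names are illustrative.

```python
def n_basis(order):
    """Number of basis functions of polynomial degree < order on a tetrahedron."""
    return order * (order + 1) * (order + 2) // 6

def nonzero_rows_per_derivative(order):
    """Rows of the j-th time derivative that can be non-zero, for j = 0 .. order - 1."""
    return [n_basis(order - j) for j in range(order)]

# Fifth-order method, as in the figure on this slide.
rows = nonzero_rows_per_derivative(5)
```

For the fifth-order case this gives 35, 20, 10, 4 and 1 potentially non-zero rows, so most of the later derivative matrices consist of zero blocks that a kernel can skip.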

10 Mesh Generation and Partitioning
Mesh Generation:
- high-quality meshes required (shallow subduction zones, complicated fault structures)
- with 9 grid cells using SimModeler by Simmetrix
Two-stage approach to provide parallel mesh partitions:
- graph-based partitioning (ParMETIS)
- create customised parallel format (based on netCDF) for mesh partitions
- highly scalable mesh input via netCDF/MPI-IO in SeisSol

11 Optimization for Intel Xeon Phi Platforms
Offload Scheme:
- to address load imbalances of multiphysics simulation
- hides communication with Xeon Phi and between nodes
OpenMP parallelisation:
- to address manycore parallelism with coprocessors
- careful parallelisation of all loops
[Figure: offload timeline across Host, PCIe and Xeon Phi. Labelled steps: MPI comm., receiver output; dynamic rupture fluxes, fault output; download cells for receivers, DR, MPI; upload MPI-received cells; upload dynamic rupture updates; time integration of MPI boundary cells; time integration of non-MPI cells, volume integration; wave propagation fluxes; download all data (if required); apply dynamic rupture updates, pack transfer data; plot wave field (if required)]

12 Supercomputing Platforms
SuperMUC (LRZ, Munich):
- 9 compute nodes ( thin node islands), Intel SNB-EP cores (. GHz)
- InfiniBand FDR interconnect (fat tree)
- # in Top: .9 PFlop/s
Stampede (TACC, Austin):
- compute nodes,, cores
- SNB-EP (c) + Xeon Phi SEP per node
- Mellanox FDR interconnect (fat tree)
- # in Top: . PFlop/s
Tianhe-2 (NSCC, Guangzhou):
- compute nodes used,. Mio cores
- SNB-EP (c) + Xeon Phi SP per node
- TH-Express custom interconnect
- # in Top: . PFlop/s

13 Weak Scaling of Wave Propagation
[Figure: parallel efficiency (%) over number of nodes for SuperMUC (classic), SuperMUC (gr. buff.), Stampede, and Tianhe-2 with different numbers of Xeon Phi cards per node]
- goal: test scalability towards large problem sizes
- cubic domain, uniformly refined tetrahedral cells
- weak scaling: , elements per card/node
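For reference, weak-scaling parallel efficiency can be computed as sketched below: the work per card/node is held fixed, so the ideal run time stays constant as nodes are added. The timings are made-up placeholder values, not measurements from the talk.

```python
def weak_scaling_efficiency(base_time, time):
    """Weak scaling: work per node is fixed, so the ideal run time is constant."""
    return base_time / time

# Hypothetical seconds per time step, keyed by node count.
timings = {1: 2.00, 64: 2.05, 1024: 2.15}
efficiency = {
    nodes: weak_scaling_efficiency(timings[1], t) for nodes, t in timings.items()
}
```

An efficiency of 1.0 means that adding nodes (and proportionally more elements) did not slow down a time step at all.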

14 Weak Scaling of Wave Propagation
- more than 9 % parallel efficiency on Tianhe-2 and Stampede
- % on full SuperMUC (no overlapping)

15 Weak Scaling of Wave Propagation
- . PFlop/s on Tianhe-2 ( nodes)
- . PFlop/s on Stampede ( nodes)
- . PFlop/s on SuperMUC (9 nodes)

16 Weak Scaling Peak Efficiency
[Figure: % of peak performance over number of nodes, shown as "% peak: hardware" and "% peak: non-zero", for SuperMUC (gr. buff.), Stampede, Tianhe-2 and SuperMUC (classic)]
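The two curves can be read as two ways of counting floating point operations. The sketch below assumes that "hardware" counts every operation the machine executes, including multiplications with stored zeros in dense kernels, while "non-zero" counts only operations on non-zero matrix entries. All numbers are hypothetical.

```python
def percent_of_peak(flop_count, seconds, peak_flops_per_second):
    """Sustained performance as a percentage of theoretical peak."""
    return 100.0 * flop_count / (seconds * peak_flops_per_second)

# Hypothetical counts for one run on one node.
hardware_flops = 8.0e12   # all executed operations, zeros included
nonzero_flops = 5.0e12    # operations on non-zero entries only
seconds = 10.0
peak = 2.0e12             # theoretical peak in flop/s

hw_percent = percent_of_peak(hardware_flops, seconds, peak)
nz_percent = percent_of_peak(nonzero_flops, seconds, peak)
```

Under this assumption the non-zero percentage can never exceed the hardware percentage, and the gap between them measures how much work is spent on zeros.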

17 1992 Landers M. Earthquake
- multiphysics simulation of dynamic rupture and resulting ground motion of a M. earthquake
- fault inferred from measured data, regional topography from satellite data
- physically consistent stress and friction parameters
- D velocity structure, low velocity near surface

18 Strong Scaling of Landers Scenario
[Figure: parallel efficiency (%) over number of nodes for Stampede, SuperMUC (classic), SuperMUC (gr. buff.), and Tianhe-2 with different numbers of Xeon Phi cards per node]
- 9 million tetrahedrons; ,9 element faces on fault
- th order, 9 billion degrees of freedom

19 Strong Scaling of Landers Scenario
- more than % parallel efficiency on Stampede and Tianhe-2 (when using only one Xeon Phi per node)
- multiple-Xeon-Phi performance suffers from MPI communication
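Strong-scaling efficiency differs from the weak-scaling case because the total problem size is fixed: ideally, run time drops in proportion to the number of nodes. A minimal sketch with invented timings:

```python
def strong_scaling_efficiency(base_nodes, base_time, nodes, time):
    """Strong scaling: total work is fixed, so the ideal run time scales as 1/nodes."""
    speedup = base_time / time
    ideal_speedup = nodes / base_nodes
    return speedup / ideal_speedup

# Hypothetical run: 100 s on 16 nodes, 30 s on 64 nodes.
eff = strong_scaling_efficiency(base_nodes=16, base_time=100.0, nodes=64, time=30.0)
```

Here the speedup is about 3.33 on four times as many nodes, which corresponds to roughly 83 % parallel efficiency.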

20 Strong Scaling of Landers Scenario
- . PFlop/s on Tianhe-2 ( nodes)
- . PFlop/s on Stampede ( nodes)
- . PFlop/s on SuperMUC (9 nodes)

21 Landers Strong Scaling Peak Efficiency
[Figure: % of peak performance over number of nodes, shown as "% peak: hardware" and "% peak: non-zero", for SuperMUC (gr. buff.), Stampede, Tianhe-2 and SuperMUC (classic)]

22 Landers Earthquake Production Run
On SuperMUC:
- h min computing time
- fault output and receivers
- . PFLOPS sustained performance
Observations:
- complex rupture dynamics (fault branching, etc.)
- high-frequency signals (up to Hz) from rupture propagate directly into wave field
- ground shaking in the engineering frequency band
- s simulated time

23 Multiphysics Dynamic Rupture Simulation
- spontaneous rupturing due to exceeded stress limits
- including rupture jumps, fault branching, etc.
- tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems

38 SeisSol: Earthquake Petascale
Multiphysics Dynamic Rupture Simulations with SeisSol:
- high-order ADER-DG on unstructured adaptive meshes
- focus on complicated geometries and rupture physics
- non-linear interaction of rupture process and seismic waves
Petascale Performance on Heterogeneous Platforms:
- exploits high computational intensity of ADER-DG
- requires careful tuning of the entire simulation pipeline
- code generation to accelerate element kernels
- scalable mesh input for M cells on k cores
- offload scheme for multiphysics with Xeon Phi

39 I Would Like to Thank...
- Intel: Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Pradeep Dubey
- all colleagues from the three supercomputing centers, esp.: Arndt Bode (Leibniz Supercomputing Centre), William Barth (Texas Advanced Computing Center), Xiang-Ke Liao (NSCC and NUDT)
- for financial support: Volkswagen Foundation (project ASCETE), KONWIHR
- the entire SeisSol team and all contributors
- You!

40 References
[] A. Breuer, A. Heinecke, M. Bader, C. Pelties: Accelerating SeisSol by generating vectorized code for sparse matrix operators. In: Advances in Parallel Computing, IOS Press. Proceedings of ParCo.
[] A. Breuer, A. Heinecke, S. Rettenberger, M. Bader, A.-A. Gabriel, C. Pelties: Sustained Petascale Performance of Seismic Simulations with SeisSol on SuperMUC. In: Supercomputing, LNCS. PRACE ISC Award.
[] M. Dumbser, M. Käser: An arbitrary high-order discontinuous Galerkin method for elastic waves on unstructured meshes II. The three-dimensional isotropic case. Geophys. J. Int.
[] A. Heinecke, A. Breuer, S. Rettenberger, M. Bader, A.-A. Gabriel, C. Pelties, A. Bode, W. Barth, X.-K. Liao, K. Vaidyanathan, M. Smelyanskiy, P. Dubey: Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers. Gordon Bell Prize Finalist.
[] M. Käser, M. Dumbser, J. de la Puente, H. Igel: An arbitrary high-order Discontinuous Galerkin method for elastic waves on unstructured meshes III. Viscoelastic attenuation. Geophys. J. Int.
[] C. Pelties, J. de la Puente, J.-P. Ampuero, G. B. Brietzke, M. Käser: Three-dimensional dynamic rupture simulation with a high-order discontinuous Galerkin method on unstructured tetrahedral meshes. J. Geophys. Res.: Solid Earth.


More information

Japan Seismic Hazard Information Station

Japan Seismic Hazard Information Station Japan Seismic Hazard Information Station (J-SHIS) Hiroyuki Fujiwara National Research Institute for Earth Science and Disaster Prevention (NIED) Background of the Project Headquarters for Earthquake Research

More information

Spectral Elements. Introduction recalling the elastic wave equation

Spectral Elements. Introduction recalling the elastic wave equation Spectral Elements Introduction recalling the elastic wave equation The spectral-element method: General concept domain mapping from space-continuous to space-discrete time extrapolation Gauss-Lobatto-Legendre

More information

Strassen s Algorithm for Tensor Contraction

Strassen s Algorithm for Tensor Contraction Strassen s Algorithm for Tensor Contraction Jianyu Huang, Devin A. Matthews, Robert A. van de Geijn The University of Texas at Austin September 14-15, 2017 Tensor Computation Workshop Flatiron Institute,

More information

Universität Dortmund UCHPC. Performance. Computing for Finite Element Simulations

Universität Dortmund UCHPC. Performance. Computing for Finite Element Simulations technische universität dortmund Universität Dortmund fakultät für mathematik LS III (IAM) UCHPC UnConventional High Performance Computing for Finite Element Simulations S. Turek, Chr. Becker, S. Buijssen,

More information

The Memory Intensive System

The Memory Intensive System DiRAC@Durham The Memory Intensive System The DiRAC-2.5x Memory Intensive system at Durham in partnership with Dell Dr Lydia Heck, Technical Director ICC HPC and DiRAC Technical Manager 1 DiRAC Who we are:

More information

ESiWACE. A Center of Excellence for HPC applications to support cloud resolving earth system modelling

ESiWACE. A Center of Excellence for HPC applications to support cloud resolving earth system modelling ESiWACE A Center of Excellence for HPC applications to support cloud resolving earth system modelling Joachim Biercamp, Panagiotis Adamidis, Philipp Neumann Deutsches Klimarechenzentrum (DKRZ) Motivation:

More information

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS

More information

Multiphysics and multiscale earthqauke modeling

Multiphysics and multiscale earthqauke modeling Multiphysics and multiscale earthqauke modeling C. Meng (cmeng@mit.edu) B. Hager ERL, MIT May 29, 2017 Multiphysics Poro(visco-)elasticity, (quasi-)static and dynamic processes. ground overburden reservoir

More information

Spectral Element simulation of rupture dynamics

Spectral Element simulation of rupture dynamics Spectral Element simulation of rupture dynamics J.-P. Vilotte & G. Festa Department of Seismology, Institut de Physique du Globe de Paris, 75252 France ABSTRACT Numerical modeling is an important tool,

More information

Dark Energy and Massive Neutrino Universe Covariances

Dark Energy and Massive Neutrino Universe Covariances Dark Energy and Massive Neutrino Universe Covariances (DEMNUniCov) Carmelita Carbone Physics Dept, Milan University & INAF-Brera Collaborators: M. Calabrese, M. Zennaro, G. Fabbian, J. Bel November 30

More information

MAGMA MIC 1.0: Linear Algebra Library for Intel Xeon Phi Coprocessors

MAGMA MIC 1.0: Linear Algebra Library for Intel Xeon Phi Coprocessors MAGMA MIC 1.0: Linear Algebra Library for Intel Xeon Phi Coprocessors J. Dongarra, M. Gates, A. Haidar, Y. Jia, K. Kabir, P. Luszczek, and S. Tomov University of Tennessee, Knoxville 05 / 03 / 2013 MAGMA:

More information

Performance of the fusion code GYRO on three four generations of Crays. Mark Fahey University of Tennessee, Knoxville

Performance of the fusion code GYRO on three four generations of Crays. Mark Fahey University of Tennessee, Knoxville Performance of the fusion code GYRO on three four generations of Crays Mark Fahey mfahey@utk.edu University of Tennessee, Knoxville Contents Introduction GYRO Overview Benchmark Problem Test Platforms

More information

3D Parallel Elastodynamic Modeling of Large Subduction Earthquakes

3D Parallel Elastodynamic Modeling of Large Subduction Earthquakes D Parallel Elastodynamic Modeling of Large Subduction Earthquakes Eduardo Cabrera 1 Mario Chavez 2 Raúl Madariaga Narciso Perea 2 and Marco Frisenda 1 Supercomputing Dept. DGSCA UNAM C.U. 04510 Mexico

More information

Parallel Preconditioning Methods for Ill-conditioned Problems

Parallel Preconditioning Methods for Ill-conditioned Problems Parallel Preconditioning Methods for Ill-conditioned Problems Kengo Nakajima Information Technology Center, The University of Tokyo 2014 Conference on Advanced Topics and Auto Tuning in High Performance

More information

Lattice Quantum Chromodynamics on the MIC architectures

Lattice Quantum Chromodynamics on the MIC architectures Lattice Quantum Chromodynamics on the MIC architectures Piotr Korcyl Universität Regensburg Intel MIC Programming Workshop @ LRZ 28 June 2017 Piotr Korcyl Lattice Quantum Chromodynamics on the MIC 1/ 25

More information

A parallel exponential integrator for large-scale discretizations of advection-diffusion models

A parallel exponential integrator for large-scale discretizations of advection-diffusion models A parallel exponential integrator for large-scale discretizations of advection-diffusion models L. Bergamaschi 1, M. Caliari 2, A. Martínez 3, and M. Vianello 3 1 Department of Mathematical Methods and

More information

Vibro-acoustic Analysis for Noise Reduction of Electric Machines

Vibro-acoustic Analysis for Noise Reduction of Electric Machines Vibro-acoustic Analysis for Noise Reduction of Electric Machines From Flux to OptiStruct Example : Synchronous Machine Flux 2D coupling to OptiStruct Patrick LOMBARD Application Team Manager Patrick.Lombard@cedrat.com

More information

Overview of the PyLith Finite Element Code

Overview of the PyLith Finite Element Code Overview of the PyLith Finite Element Code www.geodynamics.org Charles Williams (GNS Science) Brad Aagaard (USGS) Matt Knepley (University of Chicago) CIG Software PyLith is a Tool for Crustal Deformation

More information

From here to there. peta exa-scale transient simulation. Lawrence Mitchell 1, 27th March

From here to there. peta exa-scale transient simulation. Lawrence Mitchell 1, 27th March From here to there challenges for peta exa-scale transient simulation Lawrence Mitchell 1, 27th March 2017 1 Departments of Computing and Mathematics, Imperial College London lawrence.mitchell@imperial.ac.uk

More information

Optimization strategy for MASNUM surface wave model

Optimization strategy for MASNUM surface wave model Hillsboro, September 27, 2018 Optimization strategy for MASNUM surface wave model Zhenya Song *, + * First Institute of Oceanography (FIO), State Oceanic Administrative (SOA), China + Intel Parallel Computing

More information
