Sustained Petascale Performance of Seismic Simulations with SeisSol
|
|
- Bryce George
- 5 years ago
- Views:
Transcription
1 SIAM EX Workshop on Exascale Applied Mathematics Challenges and Opportunities Sustained Petascale Performance of Seismic Simulations with SeisSol M. Bader, A. Breuer, A. Heinecke, S. Rettenberger C. Pelties, A.-A. Gabriel Technische Universität München, Ludwig-Maximilians-Universität München SIAM EX, July,
2 HPC Meets Geoscience Alexander Alice-Agnes Alexander Christian Sebastian Breuer Gabriel Heinecke Pelties Rettenberger SIAM EX, July,
3 Overview and Agenda SeisSol: dynamic rupture and seismic wave propagation unstructured tetrahedral meshes high-order ADER-DG discretisation Optimisation for Heterogeneous Petascale Platforms: code generation to optimize element-local matrix kernels hybrid MPI/OpenMP parallelisation offload scheme to address multiphysics Performance on Tianhe-, Stampede and SuperMUC: weak scaling of wave propagation component strong scaling for 99 Landers M. earthquake SIAM EX, July,
4 Dynamic Rupture and Earthquake Simulation Tohoku subduction zone: CAD model and tetrahedral mesh (C. Pelties) Use of Adaptive Tetrahedral Meshes: curved subduction zones that meet surface at shallow angles high impact on uplift for tsunamigenic earthquakes complicated fault systems with multiple branches non-linear multiphysics dynamic rupture simulation goal: automated meshing process (incl. CAD generation) SIAM EX, July,
5 Dynamic Rupture and Earthquake Simulation Landers fault system: simulated ground motion and tetrahedral mesh Use of Adaptive Tetrahedral Meshes: curved subduction zones that meet surface at shallow angles high impact on uplift for tsunamigenic earthquakes complicated fault systems with multiple branches non-linear multiphysics dynamic rupture simulation goal: automated meshing process (incl. CAD generation) SIAM EX, July,
6 + Aq Technische Universität München : Governing x + Bq eq s y + Cq z = Seismic Wave Propagation with SeisSol B C Elastic Wave Equations: B C A formulation) q y + Cq q t + Aq z x + Bq y + A Cq = z. = B C with q = (σ, σ, σ, σ B, σ, σ, u, v, w) T - - µ - - -µ -µ A µ µ - - -µ A = -µ B = -µ -µ - - B - - C A µ - B u high order discontinuous Galerkin discretisation C - - µ -µ = ADER-DG: -µ v A high approximationc order = in space andtime: -µ -µ B additional - features: - local time C stepping, B high accuracy - - w - of C - - A earthquake - faulting - (full frictional sliding) Dumbser, Käser et al. [,] µ M. Bader et al.: Sustained Petascale Performance of Seismic Simulations with SeisSol SIAMC EX=, B July, -µ C = inecke, S. Rettenberger, Optimization of SeisSol
7 Department of Informatics V Technische Universität München Technische Universität München SeisSol in a Nutshell ADER-DG SeisSol in a nutshell: ADER-DG Update scheme Q n+ k = Q k - S X k J k M- F -,i I(t n, t n+, Q n k )N k,i A + k N- k,i + i= X i= +M - K I(t n, t n+, Q n k )A k +M - K I(t n, t n+, Q n k )B k +M - K I(t n, t n+, Q n k )C k F +,i,j,h I(t n, t n+, Q n k(i) )N k,ia - k(i) N- k,i Cauchy Kovalewski I(t n, t n+, Q n k )= JX (t n+ - t n ) j (j + j Q k(t n ) j= (Q k ) t = -M - (K ) T Q k A k +(K ) T Q k B k +(K ) T Q k C k A. M. Breuer, Bader eta. al.: Heinecke, Sustained Petascale S. Rettenberger, Performance of Seismic Optimization Simulations of with SeisSol SeisSol M. SIAM Bader, EX, C. July Pelties,
8 Technische Universität München tiplication with the sparse Jacobians A k, B k and ure shows all involved sparsity patterns of the A Optimisation of(b Sparse = ) scheme. Matrix Operations Apply sparse matrices to multiple DOF-vectors Q k Code Generator for Sparse Kernels: (Breuer et al. []) Figure. Sparsity patterns appearing in the ADER tim avoid overhead of CSR (or similar) data structures; store CSR elements vector, only Volume integration: In the volume integration, g we multiply the sparse stiffness matrices K ξ, K η unknowns I(t n,t n+,q n k ) previously obtained by the left. The intermediate results are again multip full unrolling of all element operations using a code generator use intrinsics and apply blocking to improve vectorisation SIAM EX, July,
9 Technische Universität München Optimisation of Sparse Matrix Operations Computation Apply sparse of the matrices first time to multiple derivative DOF-vectors / t Q k Q. k e) derivative Computation / t of the Q k. second derivative / t Q k. Dense vs. Sparse Kernels: (Breuer et al. []) recursions switch of thetoader densetime kernels integration dependingfor onaachieved fifth order timemethod. to solution erate zero-blocks for sparse in and the derivatives, dense kernels: ivory blocks hit zero blocks. exploit zero-blocks generated during recursive CK computation Evaluation of the ADER Time Integration SIAM EX, July,
10 Mesh Generation and Partitioning Mesh Generation: high-quality meshes required (shallow subduction zones, complicated fault structures) with 9 grid cells using SimModeler by Simmetrix ( Two-stage approach to provide parallel mesh partitions: graph-based partitioning (ParMETIS) create customised parallel format (based on netcdf) for mesh partitions highly scalable mesh input via netcdf/mpi-io in SeisSol SIAM EX, July,
11 Optimization for Intel Xeon Phi Platforms Offload Scheme: Host PCIe Xeon Phi to address load imbalances of multiphysics simulation hides communication with Xeon Phi and between nodes OpenMP parallelisation: MPI comm., receiver output dynamic rupture fluxes, fault output download cells for receivers, DR, MPI upload MPIreceived cells upload dynamic rupture updates time integration of MPI boundary cells time integration of non-mpi cells, volume integration wave propagation fluxes to address manycore parallelism with coprocessors download all data (if required) apply dynamic rupture updates, pack transfer data careful parallelisation of all loops plot wave field (if required) SIAM EX, July, 9
12 Supercomputing Platforms LRZ, Munich 9 compute nodes ( thin node islands), Intel SNB-EP cores (. GHz) Infiniband FDR interconnect (fat tree) # in Top :.9 PFlop/s TACC, Austin compute nodes,, cores SNB-EP (c) + Xeon Phi SEP per node Mellanox FDR interconnect (fat tree) # in Top :. PFlop/s NSCC, Guangzhou compute nodes used,. Mio cores SNB-EP (c) + Xeon Phi SP per node TH-Express custom interconnect # in Top :. PFlop/s SIAM EX, July,
13 Weak Scaling of Wave Propagation SuperMUC, classic Stampede SuperMUC, gr. buff. Tianhe-, card Tianhe-, cards Tianhe-, cards % parallel efficiency 9 9 # nodes goal: test scalability towards large problem sizes cubic domain, uniformly refined tetrahedral cells weak scaling:, elements per card/node SIAM EX, July,
14 Weak Scaling of Wave Propagation SuperMUC, classic Stampede SuperMUC, gr. buff. Tianhe-, card Tianhe-, cards Tianhe-, cards % parallel efficiency # nodes more than 9 % parallel efficiency on Tianhe- and Stampede % on full SuperMUC (no overlapping) SIAM EX, July,
15 Weak Scaling of Wave Propagation SuperMUC, classic Stampede SuperMUC, gr. buff. Tianhe-, card Tianhe-, cards Tianhe-, cards % parallel efficiency 9 9 # nodes. PFlop/s on Tianhe- ( nodes). PFlop/s on Stampede ( nodes). PFlop/s on SuperMUC (9 nodes) SIAM EX, July,
16 Weak Scaling Peak Efficiency SuperMUC, gr. buff. Stampede Tianhe- SuperMUC, classic % peak: hardware % peak: non-zero 9 9 # nodes SIAM EX, July,
17 99 Landers M. Earthquake multiphysics simulation of dynamic rupture and resulting ground motion of a M. earthquake fault inferred from measured data, regional topography from satellite data, physically consistent stress and friction parameters D velocity structure, low velocity near surface SIAM EX, July,
18 Strong Scaling of Landers Scenario Stampede SuperMUC, classic SuperMUC, gr. buff. Tianhe-, card Tianhe-, cards Tianhe-, cards % parallel efficiency # nodes 9 million tetrahedrons;,9 element faces on fault th order, 9 billion degrees of freedom SIAM EX, July,
19 Strong Scaling of Landers Scenario Stampede SuperMUC, classic SuperMUC, gr. buff. Tianhe-, card Tianhe-, cards Tianhe-, cards % parallel efficiency # nodes more than % parallel efficiency on Stampede and Tianhe- (when using only one Xeon Phi per node) multiple-xeon-phi performance suffers from MPI communication SIAM EX, July,
20 Strong Scaling of Landers Scenario Stampede SuperMUC, classic SuperMUC, gr. buff. Tianhe-, card Tianhe-, cards Tianhe-, cards 9 9 % parallel efficiency 9 # nodes. PFlop/s on Tianhe- ( nodes). PFlop/s on Stampede ( nodes). PFlop/s on SuperMUC (9 nodes) SIAM EX, July,
21 Landers Strong Scaling Peak Efficiency SuperMUC, gr. buff. Stampede Tianhe- SuperMUC, classic % peak: hardware % peak: non-zero 9 9 # nodes SIAM EX, July,
22 Landers Earthquake Production Run On SuperMUC: h min computing time fault output and receivers. PFLOPS sustained performance Observations: complex rupture dynamics (fault branching, etc.) high-frequency signals (up to Hz) from rupture propagate directly into wave field ground shaking in the engineering frequency band s simulated time SIAM EX, July,
23 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
24 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
25 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
26 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
27 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
28 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
29 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
30 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
31 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
32 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
33 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
34 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
35 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
36 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
37 Multiphysics Dynamic Rupture Simulation spontaneous rupturing due to exceeded stress limits including rupture jumps, fault branching, etc. tackles fundamental questions on the dynamics of earthquakes in natural fault zone systems SIAM EX, July,
38 SeisSol: Earthquake Petascale Multiphysics Dynamic Rupture Simulations with SeisSol: high-order ADER-DG on unstructured adaptive meshes focus on complicated geometries and rupture physics non-linear interaction of rupture process and seismic waves Petascale Performance on Heterogeneous Platforms: exploits high computational intensity of ADER-DG requires careful tuning of the entire simulation pipeline code generation to accelerate element kernels scalable mesh input for M cells on k cores offload-scheme for multiphysics with Xeon Phi SIAM EX, July,
39 I Would Like to Thank... Intel: Mikhail Smelyanskiy, Karthikeyan Vaidyanathan Pradeep Dubey all colleagues from the three supercomputing centers, esp.: Arndt Bode (Leibniz Supercomputing Centre) William Barth (Texas Advanced Computing Centre) Xiang-Ke Liao (NCSS and NUDT) for financial support: Volkswagen Foundation (project ASCETE) KONWIHR the entire SeisSol team and all contributors You! SIAM EX, July, 9
40 References [] A. Breuer, A. Heinecke, M. Bader, C. Pelties: Accelerating SeisSol by generating vectorized code for sparse matrix operators. In: Advances in Parallel Computing, IOS Press,. Proceedings of ParCo [] A. Breuer, A. Heinecke, S. Rettenberger, M. Bader, A.-A. Gabriel, C. Pelties: Sustained Petascale Performance of Seismic Simulations with SeisSol on SuperMUC. In: Supercomputing, LNCS, p.. PRACE ISC Award. [] M. Dumbser, M. Käser: An arbitrary high-order discontinuous Galerkin method for elastic waves on unstructured meshes II. The three-dimensional isotropic case. Geophys. J. Int. (),. [] A. Heinecke, A. Breuer, S. Rettenberger, M. Bader, A.-A. Gabriel, C. Pelties, A. Bode, W. Barth, X.-K. Liao, K. Vaidyanathan, M. Smelyanskiy, P. Dubey: Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers. Gordon Bell Prize Finalist. [] M. Käser, M. Dumbser, J. de la Puente, H. Igel: An arbitrary high-order Discontinuous Galerkin method for elastic waves on unstructured meshes III. Viscoelastic attenuation. Geophys. J. Int. (),. [] C. Pelties, J. de la Puente, J.-P. Ampuero, G. B. Brietzke, M. Käser: Three-dimensional dynamic rupture simulation with a high-order discontinuous Galerkin method on unstructured tetrahedral meshes. J. Geophys. Res.: Solid Earth, (B),. SIAM EX, July,
Extreme-scale Multi-physics Simulation of the 2004 Sumatra Earthquake
Extreme-scale Multi-physics Simulation of the 2004 Sumatra Earthquake Intel MIC Programming Workshop Michael Bader (and many others!) Technical University of Munich LRZ, 28 June 2017 Co-Authors Current
More informationLarge-scale multi-physics earthquake scenarios with the ADER-DG method on modern supercomputers
Largescale multiphysics earthquake scenarios with the ADERDG method on modern supercomputers Stephanie Wollherr, Dr. AliceAgnes Gabriel, Dr. Betsy Madden, Thomas Ulrich LudwigsMaximilians Universität (LMU)
More informationAspects of a Discontinuous Galerkin Approach for 3D Dynamic Rupture Modeling in the Case of a Complex Fault System
AGU Fall Meeting 2011 S54C-08 Aspects of a Discontinuous Galerkin Approach for 3D Dynamic Rupture Modeling in the Case of a Complex Fault System Christian Pelties Josep de la Puente (BSC), Jean Paul Ampuero
More informationScientific Computing I
Scientific Computing I Module 1: Introduction Michael Bader Lehrstuhl Informatik V Winter 2016/2017 Scientific Computing = Science + Computing? Science on Computers?? Computational Science??? Michael Bader
More informationExtreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust Earthquake
Extreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust Earthquake ABSTRACT Carsten Uphoff Sebastian Rettenberger Michael Bader uphoff@in.tum.de rettenbs@in.tum.de bader@in.tum.de
More informationTowards a highly-parallel PDE-Solver using Adaptive Sparse Grids on Compute Clusters
Towards a highly-parallel PDE-Solver using Adaptive Sparse Grids on Compute Clusters HIM - Workshop on Sparse Grids and Applications Alexander Heinecke Chair of Scientific Computing May 18 th 2011 HIM
More informationA Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries
A Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries SC13, November 21 st 2013 Christian Godenschwager, Florian Schornbaum, Martin Bauer, Harald Köstler, Ulrich
More informationPerformance comparison between hybridizable DG and classical DG methods for elastic waves simulation in harmonic domain
March 4-5, 2015 Performance comparison between hybridizable DG and classical DG methods for elastic waves simulation in harmonic domain M. Bonnasse-Gahot 1,2, H. Calandra 3, J. Diaz 1 and S. Lanteri 2
More informationParallel Simulations of Self-propelled Microorganisms
Parallel Simulations of Self-propelled Microorganisms K. Pickl a,b M. Hofmann c T. Preclik a H. Köstler a A.-S. Smith b,d U. Rüde a,b ParCo 2013, Munich a Lehrstuhl für Informatik 10 (Systemsimulation),
More information591 TFLOPS Multi-TRILLION Particles Simulation on SuperMUC
International Supercomputing Conference 2013 591 TFLOPS Multi-TRILLION Particles Simulation on SuperMUC W. Eckhardt TUM, A. Heinecke TUM, R. Bader LRZ, M. Brehm LRZ, N. Hammer LRZ, H. Huber LRZ, H.-G.
More informationSome thoughts about energy efficient application execution on NEC LX Series compute clusters
Some thoughts about energy efficient application execution on NEC LX Series compute clusters G. Wellein, G. Hager, J. Treibig, M. Wittmann Erlangen Regional Computing Center & Department of Computer Science
More informationCactus Tools for Petascale Computing
Cactus Tools for Petascale Computing Erik Schnetter Reno, November 2007 Gamma Ray Bursts ~10 7 km He Protoneutron Star Accretion Collapse to a Black Hole Jet Formation and Sustainment Fe-group nuclei Si
More informationComputational Seismology: A Practical Introduction. Heiner Igel Computational Seismology 1 / 18
Computational Seismology: A Practical Introduction Heiner Igel Computational Seismology 1 / 18 Introduction Heiner Igel Computational Seismology 2 / 18 Goals of the course Introduction Understand methods
More informationA Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters
A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters ANTONINO TUMEO, ORESTE VILLA Collaborators: Karol Kowalski, Sriram Krishnamoorthy, Wenjing Ma, Simone Secchi May 15, 2012 1 Outline!
More informationScalable Hybrid Programming and Performance for SuperLU Sparse Direct Solver
Scalable Hybrid Programming and Performance for SuperLU Sparse Direct Solver Sherry Li Lawrence Berkeley National Laboratory Piyush Sao Rich Vuduc Georgia Institute of Technology CUG 14, May 4-8, 14, Lugano,
More informationarxiv: v1 [hep-lat] 8 Nov 2014
Staggered Dslash Performance on Intel Xeon Phi Architecture arxiv:1411.2087v1 [hep-lat] 8 Nov 2014 Department of Physics, Indiana University, Bloomington IN 47405, USA E-mail: ruizli AT umail.iu.edu Steven
More informationLattice Boltzmann simulations on heterogeneous CPU-GPU clusters
Lattice Boltzmann simulations on heterogeneous CPU-GPU clusters H. Köstler 2nd International Symposium Computer Simulations on GPU Freudenstadt, 29.05.2013 1 Contents Motivation walberla software concepts
More informationGeophysical Journal International
Geophysical Journal International Geophys. J. Int. (2015) 200, 888 907 GJI Seismology doi: 10.1093/gji/ggu436 On the initiation of sustained slip-weakening ruptures by localized stresses M. Galis, 1 C.
More information上海超级计算中心 Shanghai Supercomputer Center. Lei Xu Shanghai Supercomputer Center San Jose
上海超级计算中心 Shanghai Supercomputer Center Lei Xu Shanghai Supercomputer Center 03/26/2014 @GTC, San Jose Overview Introduction Fundamentals of the FDTD method Implementation of 3D UPML-FDTD algorithm on GPU
More informationNumerical modeling of dynamic rupture propagation
IX. Slovenská geofyzikálna konferencia, 22. - 23. jún 2011 Fakulta matematiky, fyziky a informatiky UK, Bratislava Numerical modeling of dynamic rupture propagation Martin Galis Peter Moczo Jozef Kristek
More informationPhysics plans and ILDG usage
Physics plans and ILDG usage in Italy Francesco Di Renzo University of Parma & INFN Parma The MAIN ILDG USERS in Italy are the ROME groups A (by now) well long track of ILDG-based projects mainly within
More informationLarge-scale Electronic Structure Simulations with MVAPICH2 on Intel Knights Landing Manycore Processors
Large-scale Electronic Structure Simulations with MVAPICH2 on Intel Knights Landing Manycore Processors Hoon Ryu, Ph.D. (E: elec1020@kisti.re.kr) Principal Researcher / Korea Institute of Science and Technology
More informationStatic-scheduling and hybrid-programming in SuperLU DIST on multicore cluster systems
Static-scheduling and hybrid-programming in SuperLU DIST on multicore cluster systems Ichitaro Yamazaki University of Tennessee, Knoxville Xiaoye Sherry Li Lawrence Berkeley National Laboratory MS49: Sparse
More informationLarge-scale MD simulation of heterogeneous systems with ls1 mardyn
Large-scale MD simulation of heterogeneous systems with ls1 mardyn M. T. Horsch, R. Srivastava, S. J. Werth, C. Niethammer, C. W. Glass, W. Eckhardt, A. Heinecke, N. Tchipev, H.-J. Bungartz, S. Eckelsbach,
More informationMassively scalable computing method to tackle large eigenvalue problems for nanoelectronics modeling
2019 Intel extreme Performance Users Group (IXPUG) meeting Massively scalable computing method to tackle large eigenvalue problems for nanoelectronics modeling Hoon Ryu, Ph.D. (E: elec1020@kisti.re.kr)
More informationPerformance evaluation of scalable optoelectronics application on large-scale Knights Landing cluster
Performance evaluation of scalable optoelectronics application on large-scale Knights Landing cluster Yuta Hirokawa Graduate School of Systems and Information Engineering, University of Tsukuba hirokawa@hpcs.cs.tsukuba.ac.jp
More informationWeather Research and Forecasting (WRF) Performance Benchmark and Profiling. July 2012
Weather Research and Forecasting (WRF) Performance Benchmark and Profiling July 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell,
More informationPerformance and Energy Analysis of the Iterative Solution of Sparse Linear Systems on Multicore and Manycore Architectures
Performance and Energy Analysis of the Iterative Solution of Sparse Linear Systems on Multicore and Manycore Architectures José I. Aliaga Performance and Energy Analysis of the Iterative Solution of Sparse
More informationPerformance Analysis of Lattice QCD Application with APGAS Programming Model
Performance Analysis of Lattice QCD Application with APGAS Programming Model Koichi Shirahata 1, Jun Doi 2, Mikio Takeuchi 2 1: Tokyo Institute of Technology 2: IBM Research - Tokyo Programming Models
More informationEfficient multigrid solvers for mixed finite element discretisations in NWP models
1/20 Efficient multigrid solvers for mixed finite element discretisations in NWP models Colin Cotter, David Ham, Lawrence Mitchell, Eike Hermann Müller *, Robert Scheichl * * University of Bath, Imperial
More informationClaude Tadonki. MINES ParisTech PSL Research University Centre de Recherche Informatique
Claude Tadonki MINES ParisTech PSL Research University Centre de Recherche Informatique claude.tadonki@mines-paristech.fr Monthly CRI Seminar MINES ParisTech - CRI June 06, 2016, Fontainebleau (France)
More informationUnravelling earthquake dynamics with SeisSol: Megathrust ruptures, off- fault plas?city and rough faults Elizabeth H.
Unravelling earthquake dynamics with SeisSol: Megathrust ruptures, off- fault plas?city and rough faults Elizabeth H. Madden (Betsy) Stephanie Wollherr, Thomas Ulrich, Alice- Agnes Gabriel 1 Earthquake
More informationUtilisation de la compression low-rank pour réduire la complexité du solveur PaStiX
Utilisation de la compression low-rank pour réduire la complexité du solveur PaStiX 26 Septembre 2018 - JCAD 2018 - Lyon Grégoire Pichon, Mathieu Faverge, Pierre Ramet, Jean Roman Outline 1. Context 2.
More informationNuclear Physics and Computing: Exascale Partnerships. Juan Meza Senior Scientist Lawrence Berkeley National Laboratory
Nuclear Physics and Computing: Exascale Partnerships Juan Meza Senior Scientist Lawrence Berkeley National Laboratory Nuclear Science and Exascale i Workshop held in DC to identify scientific challenges
More information3D Cartesian Transport Sweep for Massively Parallel Architectures on top of PaRSEC
3D Cartesian Transport Sweep for Massively Parallel Architectures on top of PaRSEC 9th Scheduling for Large Scale Systems Workshop, Lyon S. Moustafa, M. Faverge, L. Plagne, and P. Ramet S. Moustafa, M.
More informationAccelerating linear algebra computations with hybrid GPU-multicore systems.
Accelerating linear algebra computations with hybrid GPU-multicore systems. Marc Baboulin INRIA/Université Paris-Sud joint work with Jack Dongarra (University of Tennessee and Oak Ridge National Laboratory)
More informationMassively scalable computing method to tackle large eigenvalue problems for nanoelectronics modeling
2019 Intel extreme Performance Users Group (IXPUG) meeting Massively scalable computing method to tackle large eigenvalue problems for nanoelectronics modeling Hoon Ryu, Ph.D. (E: elec1020@kisti.re.kr)
More informationA parallel finite element multigrid framework for geodynamic simulations with more than ten trillion unknowns
A parallel finite element multigrid framework for geodynamic simulations with more than ten trillion unknowns Dominik Bartuschat, Ulrich Rüde Chair for System Simulation, University of Erlangen-Nürnberg
More informationScalable, performant, and resilient large-scale applications of molecular process engineering
Scalable, performant, and resilient large-scale applications of molecular process engineering M. Horsch,1 P. Gralka,2 C. Niethammer,3 N. Tchipev,4 J. Vrabec,5 H. Hasse1 1 University of Kaiserslautern,
More informationParallel Sparse Tensor Decompositions using HiCOO Format
Figure sources: A brief survey of tensors by Berton Earnshaw and NVIDIA Tensor Cores Parallel Sparse Tensor Decompositions using HiCOO Format Jiajia Li, Jee Choi, Richard Vuduc May 8, 8 @ SIAM ALA 8 Outline
More informationERLANGEN REGIONAL COMPUTING CENTER
ERLANGEN REGIONAL COMPUTING CENTER Making Sense of Performance Numbers Georg Hager Erlangen Regional Computing Center (RRZE) Friedrich-Alexander-Universität Erlangen-Nürnberg OpenMPCon 2018 Barcelona,
More informationLattice QCD with Domain Decomposition on Intel R Xeon Phi TM
Lattice QCD with Domain Decomposition on Intel R Xeon Phi TM Co-Processors Simon Heybrock, Bálint Joó, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Tilo Wettig, and Pradeep Dubey
More informationCrossing the Chasm. On the Paths to Exascale: Presented by Mike Rezny, Monash University, Australia
On the Paths to Exascale: Crossing the Chasm Presented by Mike Rezny, Monash University, Australia michael.rezny@monash.edu Crossing the Chasm meeting Reading, 24 th October 2016 Version 0.1 In collaboration
More informationFPGA Implementation of a Predictive Controller
FPGA Implementation of a Predictive Controller SIAM Conference on Optimization 2011, Darmstadt, Germany Minisymposium on embedded optimization Juan L. Jerez, George A. Constantinides and Eric C. Kerrigan
More informationPiz Daint & Piz Kesch : from general purpose supercomputing to an appliance for weather forecasting. Thomas C. Schulthess
Piz Daint & Piz Kesch : from general purpose supercomputing to an appliance for weather forecasting Thomas C. Schulthess 1 Cray XC30 with 5272 hybrid, GPU accelerated compute nodes Piz Daint Compute node:
More informationParallel Discontinuous Galerkin Method
Parallel Discontinuous Galerkin Method Yin Ki, NG The Chinese University of Hong Kong Aug 5, 2015 Mentors: Dr. Ohannes Karakashian, Dr. Kwai Wong Overview Project Goal Implement parallelization on Discontinuous
More informationCP2K. New Frontiers. ab initio Molecular Dynamics
CP2K New Frontiers in ab initio Molecular Dynamics Jürg Hutter, Joost VandeVondele, Valery Weber Physical-Chemistry Institute, University of Zurich Ab Initio Molecular Dynamics Molecular Dynamics Sampling
More informationOpen-Source Parallel FE Software : FrontISTR -- Performance Considerations about B/F (Byte per Flop) of SpMV on K-Supercomputer and GPU-Clusters --
Parallel Processing for Energy Efficiency October 3, 2013 NTNU, Trondheim, Norway Open-Source Parallel FE Software : FrontISTR -- Performance Considerations about B/F (Byte per Flop) of SpMV on K-Supercomputer
More informationDefmod, an earthquake simulator that adaptively switches between quasi-static and dynamic states
Defmod, an earthquake simulator that adaptively switches between quasi-static and dynamic states C. Meng (cmeng@mit.edu) B. Hager ERL, MIT May 16, 2016 Reservoir production/injection induced seismicity
More informationJapan Seismic Hazard Information Station
Japan Seismic Hazard Information Station (J-SHIS) Hiroyuki Fujiwara National Research Institute for Earth Science and Disaster Prevention (NIED) Background of the Project Headquarters for Earthquake Research
More informationSpectral Elements. Introduction recalling the elastic wave equation
Spectral Elements Introduction recalling the elastic wave equation The spectral-element method: General concept domain mapping from space-continuous to space-discrete time extrapolation Gauss-Lobatto-Legendre
More informationStrassen s Algorithm for Tensor Contraction
Strassen s Algorithm for Tensor Contraction Jianyu Huang, Devin A. Matthews, Robert A. van de Geijn The University of Texas at Austin September 14-15, 2017 Tensor Computation Workshop Flatiron Institute,
More informationUniversität Dortmund UCHPC. Performance. Computing for Finite Element Simulations
technische universität dortmund Universität Dortmund fakultät für mathematik LS III (IAM) UCHPC UnConventional High Performance Computing for Finite Element Simulations S. Turek, Chr. Becker, S. Buijssen,
More informationThe Memory Intensive System
DiRAC@Durham The Memory Intensive System The DiRAC-2.5x Memory Intensive system at Durham in partnership with Dell Dr Lydia Heck, Technical Director ICC HPC and DiRAC Technical Manager 1 DiRAC Who we are:
More informationESiWACE. A Center of Excellence for HPC applications to support cloud resolving earth system modelling
ESiWACE A Center of Excellence for HPC applications to support cloud resolving earth system modelling Joachim Biercamp, Panagiotis Adamidis, Philipp Neumann Deutsches Klimarechenzentrum (DKRZ) Motivation:
More informationSPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics
SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS
More informationMultiphysics and multiscale earthqauke modeling
Multiphysics and multiscale earthqauke modeling C. Meng (cmeng@mit.edu) B. Hager ERL, MIT May 29, 2017 Multiphysics Poro(visco-)elasticity, (quasi-)static and dynamic processes. ground overburden reservoir
More informationSpectral Element simulation of rupture dynamics
Spectral Element simulation of rupture dynamics J.-P. Vilotte & G. Festa Department of Seismology, Institut de Physique du Globe de Paris, 75252 France ABSTRACT Numerical modeling is an important tool,
More informationDark Energy and Massive Neutrino Universe Covariances
Dark Energy and Massive Neutrino Universe Covariances (DEMNUniCov) Carmelita Carbone Physics Dept, Milan University & INAF-Brera Collaborators: M. Calabrese, M. Zennaro, G. Fabbian, J. Bel November 30
More informationMAGMA MIC 1.0: Linear Algebra Library for Intel Xeon Phi Coprocessors
MAGMA MIC 1.0: Linear Algebra Library for Intel Xeon Phi Coprocessors J. Dongarra, M. Gates, A. Haidar, Y. Jia, K. Kabir, P. Luszczek, and S. Tomov University of Tennessee, Knoxville 05 / 03 / 2013 MAGMA:
More informationPerformance of the fusion code GYRO on three four generations of Crays. Mark Fahey University of Tennessee, Knoxville
Performance of the fusion code GYRO on three four generations of Crays Mark Fahey mfahey@utk.edu University of Tennessee, Knoxville Contents Introduction GYRO Overview Benchmark Problem Test Platforms
More information3D Parallel Elastodynamic Modeling of Large Subduction Earthquakes
D Parallel Elastodynamic Modeling of Large Subduction Earthquakes Eduardo Cabrera 1 Mario Chavez 2 Raúl Madariaga Narciso Perea 2 and Marco Frisenda 1 Supercomputing Dept. DGSCA UNAM C.U. 04510 Mexico
More informationParallel Preconditioning Methods for Ill-conditioned Problems
Parallel Preconditioning Methods for Ill-conditioned Problems Kengo Nakajima Information Technology Center, The University of Tokyo 2014 Conference on Advanced Topics and Auto Tuning in High Performance
More informationLattice Quantum Chromodynamics on the MIC architectures
Lattice Quantum Chromodynamics on the MIC architectures Piotr Korcyl Universität Regensburg Intel MIC Programming Workshop @ LRZ 28 June 2017 Piotr Korcyl Lattice Quantum Chromodynamics on the MIC 1/ 25
More informationA parallel exponential integrator for large-scale discretizations of advection-diffusion models
A parallel exponential integrator for large-scale discretizations of advection-diffusion models L. Bergamaschi 1, M. Caliari 2, A. Martínez 3, and M. Vianello 3 1 Department of Mathematical Methods and
More informationVibro-acoustic Analysis for Noise Reduction of Electric Machines
Vibro-acoustic Analysis for Noise Reduction of Electric Machines From Flux to OptiStruct Example : Synchronous Machine Flux 2D coupling to OptiStruct Patrick LOMBARD Application Team Manager Patrick.Lombard@cedrat.com
More informationOverview of the PyLith Finite Element Code
Overview of the PyLith Finite Element Code www.geodynamics.org Charles Williams (GNS Science) Brad Aagaard (USGS) Matt Knepley (University of Chicago) CIG Software PyLith is a Tool for Crustal Deformation
More informationFrom here to there. peta exa-scale transient simulation. Lawrence Mitchell 1, 27th March
From here to there challenges for peta exa-scale transient simulation Lawrence Mitchell 1, 27th March 2017 1 Departments of Computing and Mathematics, Imperial College London lawrence.mitchell@imperial.ac.uk
More informationOptimization strategy for MASNUM surface wave model
Hillsboro, September 27, 2018 Optimization strategy for MASNUM surface wave model Zhenya Song *, + * First Institute of Oceanography (FIO), State Oceanic Administrative (SOA), China + Intel Parallel Computing
More informationScalable and Power-Efficient Data Mining Kernels
Scalable and Power-Efficient Data Mining Kernels Alok Choudhary, John G. Searle Professor Dept. of Electrical Engineering and Computer Science and Professor, Kellogg School of Management Director of the
More informationWeather and Climate Modeling on GPU and Xeon Phi Accelerated Systems
Weather and Climate Modeling on GPU and Xeon Phi Accelerated Systems Mike Ashworth, Rupert Ford, Graham Riley, Stephen Pickles Scientific Computing Department & STFC Hartree Centre STFC Daresbury Laboratory
More informationComputational Numerical Integration for Spherical Quadratures. Verified by the Boltzmann Equation
Computational Numerical Integration for Spherical Quadratures Verified by the Boltzmann Equation Huston Rogers. 1 Glenn Brook, Mentor 2 Greg Peterson, Mentor 2 1 The University of Alabama 2 Joint Institute
More informationParallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors
Parallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors J.I. Aliaga 1 M. Bollhöfer 2 A.F. Martín 1 E.S. Quintana-Ortí 1 1 Deparment of Computer
More informationON NEAR-FIELD GROUND MOTIONS OF NORMAL AND REVERSE FAULTS FROM VIEWPOINT OF DYNAMIC RUPTURE MODEL
1 Best Practices in Physics-based Fault Rupture Models for Seismic Hazard Assessment of Nuclear ON NEAR-FIELD GROUND MOTIONS OF NORMAL AND REVERSE FAULTS FROM VIEWPOINT OF DYNAMIC RUPTURE MODEL Hideo AOCHI
More informationModelling the dynamic behaviour of a ballscrew system taking into account the changing position of the ball-screw nut
Modelling the dynamic behaviour of a ballscrew system taking into account the changing position of the ball-screw nut G. Holroyd, C. Pislaru & D.G. Ford Ultra Precision Engineering Centre, University of
More informationSWE Anatomy of a Parallel Shallow Water Code
SWE Anatomy of a Parallel Shallow Water Code CSCS-FoMICS-USI Summer School on Computer Simulations in Science and Engineering Michael Bader July 8 19, 2013 Computer Simulations in Science and Engineering,
More informationRWTH Aachen University
IPCC @ RWTH Aachen University Optimization of multibody and long-range solvers in LAMMPS Rodrigo Canales William McDoniel Markus Höhnerbach Ahmed E. Ismail Paolo Bientinesi IPCC Showcase November 2016
More informationR. Glenn Brook, Bilel Hadri*, Vincent C. Betro, Ryan C. Hulguin, and Ryan Braby Cray Users Group 2012 Stuttgart, Germany April 29 May 3, 2012
R. Glenn Brook, Bilel Hadri*, Vincent C. Betro, Ryan C. Hulguin, and Ryan Braby Cray Users Group 2012 Stuttgart, Germany April 29 May 3, 2012 * presenting author Contents Overview on AACE Overview on MIC
More informationDeterministic Generation of Broadband Ground Motions! with Simulations of Dynamic Ruptures on Rough Faults! for Physics-Based Seismic Hazard Analysis
Deterministic Generation of Broadband Ground Motions! with Simulations of Dynamic Ruptures on Rough Faults! for Physics-Based Seismic Hazard Analysis Zheqiang Shi and Steven M. Day! Department of Geological
More informationThree Dimensional Simulations of Tsunami Generation and Propagation
Chapter 1 Earth Science Three Dimensional Simulations of Tsunami Generation and Propagation Project Representative Takashi Furumura Authors Tatsuhiko Saito Takashi Furumura Earthquake Research Institute,
More informationParallel Algorithms for Solution of Large Sparse Linear Systems with Applications
Parallel Algorithms for Solution of Large Sparse Linear Systems with Applications Murat Manguoğlu Department of Computer Engineering Middle East Technical University, Ankara, Turkey Prace workshop: HPC
More informationConquest order N ab initio Electronic Structure simulation code for quantum mechanical modelling in large scale
Fortran Expo: 15 Jun 2012 Conquest order N ab initio Electronic Structure simulation code for quantum mechanical modelling in large scale Lianheng Tong Overview Overview of Conquest project Brief Introduction
More informationGeneralized Finite Element Methods for Three Dimensional Structural Mechanics Problems. C. A. Duarte. I. Babuška and J. T. Oden
Generalized Finite Element Methods for Three Dimensional Structural Mechanics Problems C. A. Duarte COMCO, Inc., 7800 Shoal Creek Blvd. Suite 290E Austin, Texas, 78757, USA I. Babuška and J. T. Oden TICAM,
More informationContribution of HPC to the mitigation of natural risks. B. Feignier. CEA-DAM Ile de France Département Analyse, Surveillance, Environnement
Contribution of HPC to the mitigation of natural risks B. Feignier CEA-DAM Ile de France Département Analyse, Surveillance, Environnement Introduction Over the last 40 years, the increase in computational
More informationA framework for detailed multiphase cloud modeling on HPC systems
Center for Information Services and High Performance Computing (ZIH) A framework for detailed multiphase cloud modeling on HPC systems ParCo 2009, 3. September 2009, ENS Lyon, France Matthias Lieber a,
More informationScenario Earthquake and Tsunami Simulations for a Pacific Rim GNSS Tsunami Early Warning System
Scenario Earthquake and Tsunami Simulations for a Pacific Rim GNSS Tsunami Early Warning System Kasey Schultz February 2, 2015 Abstract The first operational prototype for a Pacific Rim Tsunami Early Warning
More informationExploiting In-Memory Processing Capabilities for Density Functional Theory Applications
Exploiting In-Memory Processing Capabilities for Density Functional Theory Applications 2016 Aug 23 P. F. Baumeister, T. Hater, D. Pleiter H. Boettiger, T. Maurer, J. R. Brunheroto Contributors IBM R&D
More informationThe Fast Multipole Method in molecular dynamics
The Fast Multipole Method in molecular dynamics Berk Hess KTH Royal Institute of Technology, Stockholm, Sweden ADAC6 workshop Zurich, 20-06-2018 Slide BioExcel Slide Molecular Dynamics of biomolecules
More informationMimetic Operators for Seismic Exploration
Mimetic Operators for Seismic Exploration Josep de la Puente, Otilio Rojas, Miguel Ferrer and Jean Kormann WS13 A01 Methods and Challenges of Seismic Wave Modelling for Seismic Imaging Outline What is
More informationSupercomputing: Why, What, and Where (are we)?
Supercomputing: Why, What, and Where (are we)? R. Govindarajan Indian Institute of Science, Bangalore, INDIA govind@serc.iisc.ernet.in (C)RG@SERC,IISc Why Supercomputer? Third and Fourth Legs RG@SERC,IISc
More informationTargeting Extreme Scale Computational Challenges with Heterogeneous Systems
Targeting Extreme Scale Computational Challenges with Heterogeneous Systems Oreste Villa, Antonino Tumeo Pacific Northwest Na/onal Laboratory (PNNL) 1 Introduction! PNNL Laboratory Directed Research &
More informationParallelization Strategies for Density Matrix Renormalization Group algorithms on Shared-Memory Systems
Parallelization Strategies for Density Matrix Renormalization Group algorithms on Shared-Memory Systems G. Hager HPC Services, Computing Center Erlangen, Germany E. Jeckelmann Theoretical Physics, Univ.
More informationLogo. A Massively-Parallel Multicore Acceleration of a Point Contact Solid Mechanics Simulation DRAFT
Paper 1 Logo Civil-Comp Press, 2017 Proceedings of the Fifth International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering, P. Iványi, B.H.V Topping and G. Várady (Editors)
More informationParallel Matrix Factorization for Recommender Systems
Under consideration for publication in Knowledge and Information Systems Parallel Matrix Factorization for Recommender Systems Hsiang-Fu Yu, Cho-Jui Hsieh, Si Si, and Inderjit S. Dhillon Department of
More informationSome Geometric and Algebraic Aspects of Domain Decomposition Methods
Some Geometric and Algebraic Aspects of Domain Decomposition Methods D.S.Butyugin 1, Y.L.Gurieva 1, V.P.Ilin 1,2, and D.V.Perevozkin 1 Abstract Some geometric and algebraic aspects of various domain decomposition
More informationThe Finite Cell Method: High order simulation of complex structures without meshing
The Finite Cell Method: High order simulation of complex structures without meshing E. Rank, A. Düster, D. Schillinger, Z. Yang Fakultät für Bauingenieur und Vermessungswesen Technische Universität München,
More informationChapt pt 15 er EARTHQUAKES! BFRB P 215 ages -226
Chapter 15 EARTHQUAKES! BFRB Pages 215-226226 Earthquake causes An earthquake is the shaking of the Earth s crust caused by a release of energy The movement of the Earth s plates causes most earthquakes
More informationElectronic Transactions on Numerical Analysis Volume 49, 2018
Electronic Transactions on Numerical Analysis Volume 49, 2018 Contents 1 Adaptive FETI-DP and BDDC methods with a generalized transformation of basis for heterogeneous problems. Axel Klawonn, Martin Kühn,
More informationWeight-adjusted DG methods for elastic wave propagation in arbitrary heterogeneous media
Weight-adjusted DG methods for elastic wave propagation in arbitrary heterogeneous media Jesse Chan Department of Computational and Applied Math, Rice University ICOSAHOM 2018 July 12, 2018 Chan (CAAM)
More informationRESEARCH HIGHLIGHTS. WAF: Weighted Average Flux Method
RESEARCH HIGHLIGHTS (Last update: 3 rd April 2013) Here I briefly describe my contributions to research on numerical methods for hyperbolic balance laws that, in my view, have made an impact in the scientific
More information