Computational Issues in the Continuum Gyrokinetic Code GYRO

Similar documents
Gyrokinetic Transport Driven by Energetic Particle Modes

TURBULENT TRANSPORT THEORY

Multiscale, multiphysics modeling of turbulent transport and heating in collisionless, magnetized plasmas

Performance of the fusion code GYRO on three four generations of Crays. Mark Fahey University of Tennessee, Knoxville

Gyrokinetics an efficient framework for studying turbulence and reconnection in magnetized plasmas

INTRODUCTION TO GYROKINETIC AND FLUID SIMULATIONS OF PLASMA TURBULENCE AND OPPORTUNITES FOR ADVANCED FUSION SIMULATIONS

Gyrokinetic simulations including the centrifugal force in a strongly rotating tokamak plasma

Research on Structure Formation of Plasmas Dominated by Multiple Hierarchical Dynamics

Microtearing Simulations in the Madison Symmetric Torus

The gyrokinetic turbulence code GENE - Numerics and applications

GTC Simulation of Turbulence and Transport in Tokamak Plasmas

Advances in stellarator gyrokinetics

ITER Predictions Using the GYRO Verified and Experimentally Validated TGLF Transport Model

Global particle-in-cell simulations of Alfvénic modes

Long Time Simulations of Microturbulence in Fusion Plasmas

Coupled radius-energy turbulent transport of alpha particles

Gyrokinetic Turbulence in Tokamaks and Stellarators

Hybrid Discontinuous/Continuous Galerkin Algorithms for (Gyro) Kinetic Simulations of Plasmas

Reduction of Turbulence and Transport in the Alcator C-Mod Tokamak by Dilution of Deuterium Ions with Nitrogen and Neon Injection

Self-consistent particle tracking in a simulation of the entropy mode in a Z pinch

Concurrent Divide-and-Conquer Library

in tokamak plasmas Istvan Pusztai 1 Jeff Candy 2 Punit Gohil 2

Kinetic damping in gyro-kinetic simulation and the role in multi-scale turbulence

Particle-in-cell simulations of electron transport from plasma turbulence: recent progress in gyrokinetic particle simulations of turbulent plasmas

Global gyrokinetic particle simulations with kinetic electrons

Electromagnetic Turbulence Simulations with Kinetic Electrons from the the Summit Framework

NSTX. Investigation of electron gyro-scale fluctuations in the National Spherical Torus Experiment. David Smith. Advisor: Ernesto Mazzucato

Progress and Plans on Physics and Validation

Z. Lin University of California, Irvine, CA 92697, USA. Supported by SciDAC GPS-TTBP, GSEP & CPES

A numerical instability in an ADI algorithm for gyrokinetics

Frank Jenko. Session on Plasma Physics: Petascale Computations of Turbulence in Laboratory and Astrophysical Plasmas

GA A27235 EULERIAN SIMULATIONS OF NEOCLASSICAL FLOWS AND TRANSPORT IN THE TOKAMAK PLASMA EDGE AND OUTER CORE

Hybrid Kinetic-MHD simulations with NIMROD

Validating Simulations of Multi-Scale Plasma Turbulence in ITER-Relevant, Alcator C-Mod Plasmas

Size Scaling and Nondiffusive Features of Electron Heat Transport in Multi-Scale Turbulence

Bounce-averaged gyrokinetic simulations of trapped electron turbulence in elongated tokamak plasmas

Gyrokinetic simulations of magnetic fusion plasmas

International Workshop on the Frontiers of Modern Plasma Physics July On the Nature of Plasma Core Turbulence.

Microturbulence in optimised stellarators

Reduced Electron Thermal Transport in Low Collisionality H-mode Plasmas in DIII-D and the Importance of Small-scale Turbulence

GA A26874 ITER PREDICTIONS USING THE GYRO VERIFIED AND EXPERIMENTALLY VALIDATED TGLF TRANSPORT MODEL

ª 10 KeV. In 2XIIB and the tandem mirrors built to date, in which the plug radius R p. ª r Li

Gyrokinetic Simulations of Tokamak Microturbulence

Plasma instabilities. Dr Ben Dudson, University of York 1 / 37

Numerical Modelling in Fortran: day 10. Paul Tackley, 2016

NumKin, Strasbourg, October 17 th, 2016

Recent Developments in Theory for W7-X

Water-bag reduced gyrokinetic model for the study of the spatial structure of ITG instability linear modes

Energetic Particle Physics in Tokamak Burning Plasmas

Overview of Gyrokinetic Theory & Properties of ITG/TEM Instabilities

Gyrokine.c Analysis of the Linear Ohmic Confinement Regime in Alcator C- Mod *

Validation Study of gyrokinetic simulation (GYRO) near the edge in Alcator C-Mod ohmic discharges

Fine-Scale Zonal Flow Suppression of Electron Temperature Gradient Turbulence

Verification and validation of integrated simulation of energetic particles in fusion plasmas I: linear simulations

Models for Global Plasma Dynamics

Energetic-Ion-Driven MHD Instab. & Transport: Simulation Methods, V&V and Predictions

0 Magnetically Confined Plasma

Turbulence spreading and transport scaling in global gyrokinetic particle simulations

Guiding Center Orbit Studies in a Tokamak Edge Geometry Employing Boozer and Cartesian Coordinates

DisconFnuous Galerkin Algorithms for Edge GyrokineFcs

RESEARCH PROPOSAL This research proposal is devoted to the development and analysis of mathematical and numerical methods, as well as computer

Towards Multiscale Gyrokinetic Simulations of ITER-like Plasmas

Innovative Concepts Workshop Austin, Texas February 13-15, 2006

Verification and validation of integrated simulation of energetic particles in fusion plasmas I: linear simulations

UCLA POSTECH UCSD ASIPP U

arxiv: v1 [math.na] 7 May 2009

HPC Simulations of Magnetic Fusion Plasmas

Full-wave Electromagnetic Field Simulations in the Lower Hybrid Range of Frequencies

Scalable Methods for Kinetic Equations

Enhancing Scalability of Sparse Direct Methods

Numerical methods for eigenvalue problems in the description of drift instabilities in the plasma edge

NIMROD Project Overview

Transport Improvement Near Low Order Rational q Surfaces in DIII D

Scalable Non-Linear Compact Schemes

Low-collisionality density-peaking in GYRO simulations of C-Mod plasmas

Modeling of ELM Dynamics for ITER

Progress in Parallel Implicit Methods For Tokamak Edge Plasma Modeling

Turbulent Transport due to Kinetic Ballooning Modes in High-Beta Toroidal Plasmas

GA A26148 VALIDATING SIMULATIONS OF CORE TOKAMAK TURBULENCE: CURRENT STATUS AND FUTURE DIRECTIONS

THEORY AND SIMULATION OF ROTATIONAL SHEAR STABILIZATION OF TURBULENCE

Multi-scale turbulence, electron transport, and Zonal Flows in DIII-D

GA A25566 COUPLED ITG/TEM-ETG GYROKINETIC SIMULATIONS

David R. Smith UW-Madison

Overview of results from MAST Presented by: Glenn Counsell, for the MAST team

Full f Gyrokinetic Simulation of Tokamak Plasma Turbulence Using ELMFIRE

A Strategy for the Development of Coupled Ocean- Atmosphere Discontinuous Galerkin Models

Benchmark of gyrokinetic, kinetic MHD and gyrofluid codes for the linear calculation of fast particle driven TAE dynamics

Geometric Gyrokinetic Theory and its Applications to Large-Scale Simulations of Magnetized Plasmas

Entropy evolution and dissipation in collisionless particle-in-cell gyrokinetic simulations

High-Order Finite-Volume Methods! Phillip Colella! Computational Research Division! Lawrence Berkeley National Laboratory!

AMSC 663 Project Proposal: Upgrade to the GSP Gyrokinetic Code

Analytic Benchmarking of the 2DX eigenvalue code

Verification of gyrokinetic particle simulation of Alfven eigenmodes excited by external antenna and by fast ions

A Numerical Method for Parallel Particle Motions in Gyrokinetic Vlasov Simulations )

GA A26866 REDUCED ELECTRON THERMAL TRANSPORT IN LOW COLLISIONALITY H-MODE PLASMAS IN DIII-D AND THE IMPORTANCE OF SMALL-SCALE TURBULENCE

Gyrokinetic Simulation of Energetic Particle Turbulence and Transport

What place for mathematicians in plasma physics

Presentation by Herb Berk University of Texas at Austin Institute for Fusion Studies in Vienna, Austria Sept. 1-4, 2015

Damping and drive of low frequency modes in tokamak plasmas

MHD-particle simulations and collective alpha-particle transport: analysis of ITER scenarios and perspectives for integrated modelling

Transcription:

Computational Issues in the Continuum Gyrokinetic Code GYRO presented by: Eric Bass GSEP SciDAC project at General Atomics CScADS Workshop Snowbird, UT July 19, 2010

About GYRO Purpose: To predict transport driven by turbulence in toroidal nuclear fusion devices called tokamaks. J. Candy, R.E. Waltz, JCP 186 545 (2003) http://fusion.gat.com/theory/gyro/

About GYRO Purpose: To predict transport driven by turbulence in toroidal nuclear fusion devices called tokamaks. Method: Solve the gyrokinetic equation on a five-dimensional phase-space grid. 3D real space: 2D grid + 1 spectral (toroidal n number) degree of freedom 2D velocity-space grid comes from averaging out fast cyclotron motion. J. Candy, R.E. Waltz, JCP 186 545 (2003) http://fusion.gat.com/theory/gyro/

About GYRO Purpose: To predict transport driven by turbulence in toroidal nuclear fusion devices called tokamaks. Method: Solve the gyrokinetic equation on a five-dimensional phase-space grid. 3D real space: 2D grid + 1 spectral (toroidal n number) degree of freedom 2D velocity-space grid comes from averaging out fast cyclotron motion. Parallelization: The 2D velocity-space grid and 1 spectral degree of freedom are distributed (separate MPI_COMM groups). Each processor follows a 2D real-space grid. J. Candy, R.E. Waltz, JCP 186 545 (2003) http://fusion.gat.com/theory/gyro/

About GYRO Purpose: To predict transport driven by turbulence in toroidal nuclear fusion devices called tokamaks. Method: Solve the gyrokinetic equation on a five-dimensional phase-space grid. 3D real space: 2D grid + 1 spectral (toroidal n number) degree of freedom 2D velocity-space grid comes from averaging out fast cyclotron motion. Parallelization: The 2D velocity-space grid and 1 spectral degree of freedom are distributed (separate MPI_COMM groups). Each processor follows a 2D real-space grid. Multiple operating modes: GYRO can follow the time evolution from initial conditions or find the eigenvalues of the full system (GKEIGEN, SLEPc/PETSc) or the much-reduced Maxwell dispersion matrix (FIELDEIGEN, spatial degrees only). J. Candy, R.E. Waltz, JCP 186 545 (2003) http://fusion.gat.com/theory/gyro/

Notes about the GYRO time step (initial-value mode) GYRO uses a hybrid implicit-explicit (IMEX) scheme with fixed time steps. It can also be set to use fully explicit Runge-Kutta time steps as well.

Notes about the GYRO time step (initial-value mode) GYRO uses a hybrid implicit-explicit (IMEX) scheme with fixed time steps. It can also be set to use fully explicit Runge-Kutta time steps as well. The (cyclotron averaged) fields are solved for on each processor using (usually sparse) linear algebra.

Notes about the GYRO time step (initial-value mode) GYRO uses a hybrid implicit-explicit (IMEX) scheme with fixed time steps. It can also be set to use fully explicit Runge-Kutta time steps as well. The (cyclotron averaged) fields are solved for on each processor using (usually sparse) linear algebra. An MPI_ALL_REDUCE produces the total field actually used in the equations of motion of the system (linear or non-linear operation).

Notes about the GYRO time step (initial-value mode) GYRO uses a hybrid implicit-explicit (IMEX) scheme with fixed time steps. It can also be set to use fully explicit Runge-Kutta time steps as well. The (cyclotron averaged) fields are solved for on each processor using (usually sparse) linear algebra. An MPI_ALL_REDUCE produces the total field actually used in the equations of motion of the system (linear or non-linear operation). In non-linear operation, MPI calls over the COMM representing the spectral degree of freedom are required to evaluate the non-linear term.

Notes about the GYRO time step (initial-value mode) GYRO uses a hybrid implicit-explicit (IMEX) scheme with fixed time steps. It can also be set to use fully explicit Runge-Kutta time steps as well. The (cyclotron averaged) fields are solved for on each processor using (usually sparse) linear algebra. An MPI_ALL_REDUCE produces the total field actually used in the equations of motion of the system (linear or non-linear operation). In non-linear operation, MPI calls over the COMM representing the spectral degree of freedom are required to evaluate the non-linear term. Actual time advance operations for one entire spatial cross section (at one or more velocity or spectral grid points) are handled by each processor.

GYRO is versatile and scalable. Local, linear simulations quickly give all features of the most-unstable mode even on a desktop machine. 1-32 cores, < 1 hour Waltz standard case flux tube

GYRO is versatile and scalable. Local, linear simulations quickly give all features of the most-unstable mode even on a desktop machine. 1-32 cores, < 1 hour Candy supertorus case Non-linear simulations track multiple toroidal n numbers. 16-512 cores, 1-48 hours Waltz standard case flux tube

GYRO is versatile and scalable. Local, linear simulations quickly give all features of the most-unstable mode even on a desktop machine. 1-32 cores, < 1 hour Candy supertorus case Non-linear simulations track multiple toroidal n numbers. 16-512 cores, 1-48 hours Waltz standard case flux tube Massively multi-scale, non-linear simulations have disparate length and time scales and are only practical on terascale+ machines. ~1000 cores, 10 hours-7 days ITG-ETG flux tube

New GYRO simulations are pushing toward petascale territory Multiple nonlinear instances managed in tgyro can efficiently use 10,000+ cores (talk by Luc Peterson)

New GYRO simulations are pushing toward petascale territory Multiple nonlinear instances managed in tgyro can efficiently use 10,000+ cores (talk by Luc Peterson) Non-linear cases with high spectral resolution are highly parallelizable... φ - <φ> in a nonlinear GYRO run with driven TAE/EPM turbulence.

New GYRO simulations are pushing toward petascale territory Multiple nonlinear instances managed in tgyro can efficiently use 10,000+ cores (talk by Luc Peterson) Non-linear cases with high spectral resolution are highly parallelizable... φ - <φ> in a nonlinear GYRO run with driven TAE/EPM turbulence. max cores=2560 But, for 100% efficiency max cores=1280...but maximum efficiency requires a corresponding increase in real-space grid density because spectral and spatial degrees of freedom are convoluted in the non-linear terms. Total calculation time is increased.

New GYRO simulations are pushing toward petascale territory Multiple nonlinear instances managed in tgyro can efficiently use 10,000+ cores (talk by Luc Peterson) Non-linear cases with high spectral resolution are highly parallelizable... φ - <φ> in a nonlinear GYRO run with driven TAE/EPM turbulence. Even spatially small cases of GKEIGEN solve 25,000 25,000 matrices with parallel processing... max cores=2560 But, for 100% efficiency max cores=1280...but maximum efficiency requires a corresponding increase in real-space grid density because spectral and spatial degrees of freedom are convoluted in the non-linear terms. Total calculation time is increased.

New GYRO simulations are pushing toward petascale territory Multiple nonlinear instances managed in tgyro can efficiently use 10,000+ cores (talk by Luc Peterson) Non-linear cases with high spectral resolution are highly parallelizable... φ - <φ> in a nonlinear GYRO run with driven TAE/EPM turbulence. Even spatially small cases of GKEIGEN solve 25,000 25,000 matrices with parallel processing... max cores=2560 But, for 100% efficiency max cores=1280...but maximum efficiency requires a corresponding increase in real-space grid density because spectral and spatial degrees of freedom are convoluted in the non-linear terms. Total calculation time is increased....but GKEIGEN cases are implemented now with GYRO s parallelization scheme. Total cores 64.

Some bottlenecks limit the largest simulations. Cases involving an energetic particle (EP) species show such bottlenecks.

Some bottlenecks limit the largest simulations. Cases involving an energetic particle (EP) species show such bottlenecks. Stiff term in the gyrokinetic equation: h a,n t iω θ iω d iω E h a,n iω Ψ a,n + qρ s,unit rg r { Ψ a } = C CL a a h ˆ a, ˆ [ ] Curvature drift (ω d ) term is proportional to particle energy. EP requires small time steps.

Some bottlenecks limit the largest simulations. Cases involving an energetic particle (EP) species show such bottlenecks. Stiff term in the gyrokinetic equation: h a,n t iω θ iω d iω E h a,n iω Ψ a,n + qρ s,unit rg r { Ψ a } = C CL a a h ˆ a, ˆ [ ] IMEX time-integration scheme used to mitigate the electron parallel advection (ω q ) term should probably be avoided for ω d term for physics reasons. Curvature drift (ω d ) term is proportional to particle energy. EP requires small time steps.

Some bottlenecks limit the largest simulations. Cases involving an energetic particle (EP) species show such bottlenecks. Stiff term in the gyrokinetic equation: h a,n t iω θ iω d iω E h a,n iω Ψ a,n + qρ s,unit rg r { Ψ a } = C CL a a h ˆ a, ˆ [ ] IMEX time-integration scheme used to mitigate the electron parallel advection (ω q ) term should probably be avoided for ω d term for physics reasons. Curvature drift (ω d ) term is proportional to particle energy. EP requires small time steps. Expensive cyclotron average and field solve: Ψ a,n is a cyclotron-averaged composite field variable. ion cyclotron orbit radial grid

Some bottlenecks limit the largest simulations. Cases involving an energetic particle (EP) species show such bottlenecks. Stiff term in the gyrokinetic equation: h a,n t iω θ iω d iω E h a,n iω Ψ a,n + qρ s,unit rg r { Ψ a } = C CL a a h ˆ a, ˆ [ ] IMEX time-integration scheme used to mitigate the electron parallel advection (ω q ) term should probably be avoided for ω d term for physics reasons. Expensive cyclotron average and field solve: Ψ a,n is a cyclotron-averaged composite field variable. EPs have large cyclotron orbits. Curvature drift (ω d ) term is proportional to particle energy. EP requires small time steps. EP cyclotron orbit ion cyclotron orbit radial grid

Some bottlenecks limit the largest simulations. Cases involving an energetic particle (EP) species show such bottlenecks. Stiff term in the gyrokinetic equation: h a,n t iω θ iω d iω E h a,n iω Ψ a,n + qρ s,unit rg r { Ψ a } = C CL a a h ˆ a, ˆ [ ] IMEX time-integration scheme used to mitigate the electron parallel advection (ω q ) term should probably be avoided for ω d term for physics reasons. Expensive cyclotron average and field solve: Ψ a,n is a cyclotron-averaged composite field variable. EPs have large cyclotron orbits. Curvature drift (ω d ) term is proportional to particle energy. EP requires small time steps. EP cyclotron orbit Many more grid points are coupled and the Ψ a,n field-solve linear algebra becomes dense. ion cyclotron orbit radial grid

Some bottlenecks limit the largest simulations. Cases involving an energetic particle (EP) species show such bottlenecks. Stiff term in the gyrokinetic equation: h a,n t iω θ iω d iω E h a,n iω Ψ a,n + qρ s,unit rg r { Ψ a } = C CL a a h ˆ a, ˆ [ ] IMEX time-integration scheme used to mitigate the electron parallel advection (ω q ) term should probably be avoided for ω d term for physics reasons. Expensive cyclotron average and field solve: Ψ a,n is a cyclotron-averaged composite field variable. EPs have large cyclotron orbits. Curvature drift (ω d ) term is proportional to particle energy. EP requires small time steps. EP cyclotron orbit Many more grid points are coupled and the Ψ a,n field-solve linear algebra becomes dense. ion cyclotron orbit radial grid Dense field-solve linear algebra also occurs in multiscale turbulence cases with a dense radial grid.

Workshop goals Explore strategies to overcome speed bottlenecks:

Workshop goals Explore strategies to overcome speed bottlenecks: Consider alternative to LAPACK linear algebra solvers for dense algebra

Workshop goals Explore strategies to overcome speed bottlenecks: Consider alternative to LAPACK linear algebra solvers for dense algebra Develop a more clever way to set up the field-solve matrix in dense cases

Workshop goals Explore strategies to overcome speed bottlenecks: Consider alternative to LAPACK linear algebra solvers for dense algebra Develop a more clever way to set up the field-solve matrix in dense cases Possibly implement a new IMEX time-integration scheme for new stiff terms

Workshop goals Explore strategies to overcome speed bottlenecks: Consider alternative to LAPACK linear algebra solvers for dense algebra Develop a more clever way to set up the field-solve matrix in dense cases Possibly implement a new IMEX time-integration scheme for new stiff terms Increase parallelizability:

Workshop goals Explore strategies to overcome speed bottlenecks: Consider alternative to LAPACK linear algebra solvers for dense algebra Develop a more clever way to set up the field-solve matrix in dense cases Possibly implement a new IMEX time-integration scheme for new stiff terms Increase parallelizability: Consider strategies to distribute the spatial grid (particularly the radial grid) or the separate kinetic species.

Workshop goals Explore strategies to overcome speed bottlenecks: Consider alternative to LAPACK linear algebra solvers for dense algebra Develop a more clever way to set up the field-solve matrix in dense cases Possibly implement a new IMEX time-integration scheme for new stiff terms Increase parallelizability: Consider strategies to distribute the spatial grid (particularly the radial grid) or the separate kinetic species. For eigenvalue-solving mode, distribute the time-evolution matrix over a greater number of processors (for use by SLEPc) than that suggested by the existing gyro parallelization scheme.