Multi-Objective Stochastic Optimization of Magnetic Fields. David S Bindel, Cornell University

Similar documents
Global, Robust, Multi-Objective Optimization of Stellarators

Optimization Tools in an Uncertain Environment

Knowledge-Gradient Methods for Bayesian Optimization

Multi-Attribute Bayesian Optimization under Utility Uncertainty

Utilizing Adjoint-Based Techniques to Improve the Accuracy and Reliability in Uncertainty Quantification

Scalable Algorithms for Optimal Control of Systems Governed by PDEs Under Uncertainty

Why should you care about the solution strategies?

Non-convex optimization. Issam Laradji

Fully Bayesian Deep Gaussian Processes for Uncertainty Quantification

Modelling plasma scenarios for MAST-Upgrade

Kernel-based Approximation. Methods using MATLAB. Gregory Fasshauer. Interdisciplinary Mathematical Sciences. Michael McCourt.

Scalable algorithms for optimal experimental design for infinite-dimensional nonlinear Bayesian inverse problems

Scalable kernel methods and their use in black-box optimization

Statistical Rock Physics

Contents. Preface. 1 Introduction Optimization view on mathematical models NLP models, black-box versus explicit expression 3

Experiences with Model Reduction and Interpolation

STA414/2104 Statistical Methods for Machine Learning II

At A Glance. UQ16 Mobile App.

Nonlinear Optimization for Optimal Control

LA Support for Scalable Kernel Methods. David Bindel 29 Sep 2018

1 EX/P5-9 International Stellarator/Heliotron Database Activities on High-Beta Confinement and Operational Boundaries

KNOWLEDGE GRADIENT METHODS FOR BAYESIAN OPTIMIZATION

c 2012 Society for Industrial and Applied Mathematics

Numerical Simulations. Duncan Christie

Mathematics for Economics and Finance

Simulation optimization via bootstrapped Kriging: Survey

Inverse Problems and Optimal Design in Electricity and Magnetism

Estimating functional uncertainty using polynomial chaos and adjoint equations

Augmented Reality VU numerical optimization algorithms. Prof. Vincent Lepetit

Rotation and Neoclassical Ripple Transport in ITER

Numerical Optimization Techniques

Parallelised Bayesian Optimisation via Thompson Sampling

Stellarators. Dr Ben Dudson. 6 th February Department of Physics, University of York Heslington, York YO10 5DD, UK

1 Introduction

AM 205: lecture 19. Last time: Conditions for optimality, Newton s method for optimization Today: survey of optimization methods

Applying Bayesian Estimation to Noisy Simulation Optimization

Gaussian Process Approximations of Stochastic Differential Equations

Economics 201b Spring 2010 Solutions to Problem Set 1 John Zhu

MHD equilibrium calculations for stellarators. Antoine Cerfon, MIT PSFC with F. Parra, J. Freidberg (MIT NSE)

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008

Nonlinear Optimization Methods for Machine Learning

Including general scattering in high-fidelity deterministic particle transport calculations for proton therapy applications

Support Vector Regression (SVR) Descriptions of SVR in this discussion follow that in Refs. (2, 6, 7, 8, 9). The literature

MAGNETIC NOZZLE PLASMA EXHAUST SIMULATION FOR THE VASIMR ADVANCED PROPULSION CONCEPT

Multi-fluid Simulation Models for Inductively Coupled Plasma Sources

High Performance Nonlinear Solvers

Modeling of ELM Dynamics for ITER

D.J. Schlossberg, D.J. Battaglia, M.W. Bongard, R.J. Fonck, A.J. Redd. University of Wisconsin - Madison 1500 Engineering Drive Madison, WI 53706

The importance of including XMHD physics in HED codes

On the physics of shear flows in 3D geometry

INITIAL EVALUATION OF COMPUTATIONAL TOOLS FOR STABILITY OF COMPACT STELLARATOR REACTOR DESIGNS

Calibrating Environmental Engineering Models and Uncertainty Analysis

3D toroidal physics: testing the boundaries of symmetry breaking

Stochastic Analogues to Deterministic Optimizers

NON-LINEAR APPROXIMATION OF BAYESIAN UPDATE

D.J. Schlossberg, D.J. Battaglia, M.W. Bongard, R.J. Fonck, A.J. Redd. University of Wisconsin - Madison 1500 Engineering Drive Madison, WI 53706

Chapter 1 Direct Modeling for Computational Fluid Dynamics

Modelling Under Risk and Uncertainty

NIMROD FROM THE CUSTOMER S PERSPECTIVE MING CHU. General Atomics. Nimrod Project Review Meeting July 21 22, 1997

A Rothschild-Stiglitz approach to Bayesian persuasion

ITER operation. Ben Dudson. 14 th March Department of Physics, University of York, Heslington, York YO10 5DD, UK

Linear Regression. CSL603 - Fall 2017 Narayanan C Krishnan

Linear Regression. CSL465/603 - Fall 2016 Narayanan C Krishnan

A picture of the energy landscape of! deep neural networks

Computational and Statistical Aspects of Statistical Machine Learning. John Lafferty Department of Statistics Retreat Gleacher Center

Non-Solenoidal Plasma Startup in

AM 205: lecture 19. Last time: Conditions for optimality Today: Newton s method for optimization, survey of optimization methods

NONLOCALITY AND STOCHASTICITY TWO EMERGENT DIRECTIONS FOR APPLIED MATHEMATICS. Max Gunzburger

Direct drive by cyclotron heating can explain spontaneous rotation in tokamaks

Formation and Long Term Evolution of an Externally Driven Magnetic Island in Rotating Plasmas )

Stochastic Spectral Approaches to Bayesian Inference

A New Trust Region Algorithm Using Radial Basis Function Models

Review for Exam 2 Ben Wang and Mark Styczynski

Neoclassical transport

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

Evidence for Electromagnetic Fluid Drift Turbulence Controlling the Edge Plasma State in Alcator C-Mod

Probabilistic numerics for deep learning

KINETIC DESCRIPTION OF MAGNETIZED TECHNOLOGICAL PLASMAS

Model Calibration under Uncertainty: Matching Distribution Information

Physics and Operations Plan for LDX

STA 4273H: Sta-s-cal Machine Learning

Magnetically Confined Fusion: Transport in the core and in the Scrape- off Layer Bogdan Hnat

Fundamentals of Data Assimila1on

Sampling and low-rank tensor approximation of the response surface

Self-Organization of Plasmas with Flows

Multidomain Design and Optimization based on Comsol Multiphysics: Applications for Mechatronic Devices

Covariance to PCA. CS 510 Lecture #8 February 17, 2014

Nonlinear Functional Analysis and its Applications

Fundamentals of Data Assimila1on

CPSC 540: Machine Learning

QUANTITATIVE INTERPRETATION

Introduction to Optimization

Asynchronous 3D HYPERS Simulations of Compact Toroids and Magnetoplasmas

Numerical Optimization Professor Horst Cerjak, Horst Bischof, Thomas Pock Mat Vis-Gra SS09

Direct Modeling for Computational Fluid Dynamics

Highlights from (3D) Modeling of Tokamak Disruptions

Linear Regression (continued)

Overfitting, Bias / Variance Analysis

Simulation of a simple electron gun. David Moore 1 San Diego Miramar College

Lecture V. Numerical Optimization

Transcription:

Multi-Objective Stochastic Optimization of Magnetic Fields David S Bindel, Cornell University

Current State of the Art (STELLOPT) Optimizer Calculate! " (physics + engineering targets) Adjust plasma boundary (or coil shape) Solve 3D equilibrium $ %, ' + )* %, ' =, -.,/ 0 1(.34/5) 2

STELLOPT Approach Goal: Design MHD equilibrium (coil optimization often separate) Possible parameters for boundary:! R $ Physics / engineering properties: %! R $ R ( Target vector % R ( Minimize +, objective over!: ( +, - = / 0 2, 1 3 1 (-), 3 1 - = % 1 - %, 1 167 Solve via Levenberg-Marquardt, GA, differential evolution (avoids gradient information apart from finite differences) 3

Challenges 1. Costly and black box physics computations Each step: MHD equilibrium solve, transport calculation, coil design Compute several times per step for finite-difference gradient estimates! 2. Managing tradeoffs How do we choose the weights in the! " measure? By gut? Does varying the weights expose tradeoffs in a sensible way? No! 3. Dealing with uncertainties What you simulate what you build will performance suffer? 4. Global search How do we avoid getting stuck in local minima without excessive cost? 4

Challenge 1: Costly Physics Constraints Beltrami field (Taylor state): 5 = 75 on Ω 5 ; = 0 on =Ω 5 = 0 + flux conditions for well-posedness Requires costly physics solves (MHD equilibrium, transport, ) Derivatives require PDE sensitivity / adjoints (not black-box) 5

Physics-Constrained Optimization Beltrami field (Taylor state): 5 = 75 on Ω 5 ; = 0 on =Ω 5 = 0 + flux conditions for well-posedness 6 Key: Exploit PDE properties PDE-constrained: Solves are part of the optimization, not a black box PDE structure influences optimization objective landscape PDE operator properties: compactness, smoothing, near/far field interactions, etc Provides opportunities for dimension reduction in optimization

Challenge 2: Multi-Objective Optimization What makes an optimal stellarator? Approximates field symmetries (which measures?) Satisfies macroscopic and local stability Includes divertor fields for particle and heat exhaust Minimizes collisional and energetic particle transport Minimizes turbulent transport Satisfies basic engineering constraints (cost, size, etc) Each objective involves different approximations, uncertainties, and computational costs. 7

Exploring the Pareto Frontier Minimize $! " + 1 $! # ( dominates ) if! (! ) and 0,! 2 (! 2 ()) Worse Pareto optimal (non-dominated, noninferior, efficient): no ) dominates (. Pareto frontier generally an 6 1 - dimensional manifold with corners.! # Pareto frontier Minimizing $ 2! 2 2 only explores convex hull of Pareto frontier! Better! " 8

Challenge 3: Optimization Under Uncertainty Want performance not to depend on Tiny changes to coil geometry (within engineering tolerance) Changes to control parameters during operation Uncertainty in approximations to physics or model parameters 9

Risk-Neutral and Risk-Averse Optimization Risk-Neutral Objective Risk-Averse Objective 10

Challenge 4: Global Optimization Global optimization is hard! Especially in high-dimensional spaces Effective solvers are tailored to structure (e.g. convexity) More general methods are mainly heuristic Want algorithms that balance Exploration: Evaluating novel designs with unknown properties Exploitation: Refining known designs from previously explored regions Global model-based techniques help (with the right models!) 11

12 Exploration vs Exploitation

General Formulation min * +, &-. /, 1, * +, 2( /, 1, * +, /, 3, 1,, R /, 3, 1, $%&'( Subject to: manufacturing and physics constraints, PDEs relating coils to field /, particle or heat transport 3, etc. Optimize integrability (, &-. ), quasi-symmetry (, 2( ), etc 13 Take into account uncertain parameters 1 Include a risk aversion objective R Find Pareto points vs using weighted sums of objectives

General Formulation min * +, &-. /, 1, * +, 2( /, 1, * +, /, 3, 1,, R /, 3, 1, $%&'( Subject to: manufacturing and physics constraints, PDEs relating coils to field /, particle or heat transport 3, etc. 14 Costs beyond deterministic PDE solves: Stochastic objectives require many deterministic solves each Pareto frontier is an (7 1)-dimensional manifold with corners Non-convex global optimization requires a lot of searching Common issue: the curse of dimensionality

Addressing the Challenges Fast physics solver formulations Efficient optimization under uncertainty Surrogates and multi-fidelity methods 15

Fast Equilibrium Solvers (a) The radial component, B r. (b) The azimuthal component, B f. (c) The z component, B z. Figure 4: The Beltrami field B in the f = 0 plane. 16 Integral equation solver for Taylor states in toroidal geometries [O Neill, Cerfon, 2018] Laplace-Beltrami solver on genus 1 surfaces [Imbert-Gérard, Greengard, 2017]

Fast Equilibrium Solvers Integral equation formulation for coil fields + MHD equilibria Respects underlying physical conditioning of problem Generally only need boundary discretization (vs volume meshing) Fast high-order algorithms exist for required integral operators Associated adjoint solvers to compute sensitivities Fast re-solves in optimizer under low-rank geometry updates 17

Efficient Optimization Under Uncertainty! " # + % =! # +! ' # % + %( ) * # %, %~/(0, 2) 2 Use a quadratic approximation to compute stochastic, possibly risk-averse, objective. [c.f. Alexanderian, Petra, Ghattas, Stadler, 2017]. 18

Efficient Optimization Under Uncertainty Consider objective!(#, %) where # is control and % uncertain Model % as multivariate Gaussian Use local quadratic approximation in stochastic variables Require (!/(% and action of Hessian ( *!/(% * on vectors Assume Hessian is (approximately) low rank dimension reduction Scaling with low intrinsic dimension vs. number of parameters Beyond Gaussian: use approximation as a control variate Lots of remaining challenges (high nonlinearity, turbulence, etc) 19

Surrogate Methods Surrogates (response surfaces) approximate costly functions May also estimate uncertainty (e.g. Gaussian process models) Different variants: fixed, parametric, non-parametric Incorporate function values, gradients, bounds, 20

Surrogate Optimization Example: Single objective Bayesian optimization Sample objective and fit a GP model Use acquisition function to guide further sampling (EI, PI, UCB, KG); goal is to balance exploration vs exploitation Active work on recent variants for Pareto (ParEGO [Knowles 2004], GPareto [Binois, Picheny, 2018]) Multi-fidelity optimization [e.g. March, Willcox, Wang, 2011] Incorporating gradients [Wu, Poloczek, Wilson, Frazier, 2018] Objectives with quadrature [Toscano-Palmerin, Frazier, 2018] Several options in PySOT toolkit [B, Eriksson, Shoemaker] 21

Surrogates with Side Information Problem: Need predictions from limited data Shape surrogate to have known structure (inductive bias) Meaningful mean fields Structured kernels (symmetry, regularity, dimension reduction, etc) Tails that capture known singularities and other features Alternative: Jointly predict! "#$%&' ()) and! "#++ ()) Kernel captures correlation between functions as well as across space Basic idea is old: e.g. co-kriging in geostatistics Use in computational science and engineering is active research [Peherstorfer, Willcox, Gunzburger, others also my sabbatical!] 22

The Bigger Picture Many of the challenges of stellarators are universal in computational science and engineering! Physics is often governed by expensive-to-solve PDEs Physics-agnostic optimization infeasibly hard, even with big computers Need structure to reduce problem dimension / model complexity Stellarator problem involves many common components Mechanisms described by PDEs for transport, diffusion, reaction Methods we develop will impact other areas But success depends on using specific problem structure! 23

Summary Challenge: Multi-objective risk-averse stellarator optimization Approach: Fast equilibrium solvers, scalable optimization under uncertainty, multi-fidelity surrogate methods Specific goals: Test problem formulation (vacuum and positive pressure) Extension to multi-objective programming formulation Scalable risk-averse stochastic programming methods Optimization via physics-sensitive multi-fidelity surrogates Methods to be incorporated into new SIMSOPT code 24