Intro BCS/Low Rank Model Inference/Comparison Summary References. UQTk. A Flexible Python/C++ Toolkit for Uncertainty Quantification

Similar documents
Uncertainty Quantification in Computational Models

Polynomial Chaos Based Uncertainty Propagation

Polynomial chaos expansions for sensitivity analysis

Uncertainty quantification for sparse solutions of random PDEs

Uncertainty Propagation and Global Sensitivity Analysis in Hybrid Simulation using Polynomial Chaos Expansion

Stochastic Collocation Methods for Polynomial Chaos: Analysis and Applications

Stochastic Spectral Approaches to Bayesian Inference

Data-Driven Bayesian Model Selection: Parameter Space Dimension Reduction using Automatic Relevance Determination Priors

Quantifying conformation fluctuation induced uncertainty in bio-molecular systems

Polynomial chaos expansions for structural reliability analysis

Steps in Uncertainty Quantification

Error Budgets: A Path from Uncertainty Quantification to Model Validation

Stochastic Spectral Methods for Uncertainty Quantification

Fast Numerical Methods for Stochastic Computations

Uncertainty analysis of large-scale systems using domain decomposition

PC EXPANSION FOR GLOBAL SENSITIVITY ANALYSIS OF NON-SMOOTH FUNCTIONALS OF UNCERTAIN STOCHASTIC DIFFERENTIAL EQUATIONS SOLUTIONS

Beyond Wiener Askey Expansions: Handling Arbitrary PDFs

Sparse polynomial chaos expansions in engineering applications

Sparse Pseudo Spectral Projection Methods with Directional Adaptation for Uncertainty Quantification

Chaospy: A modular implementation of Polynomial Chaos expansions and Monte Carlo methods

NON-LINEAR APPROXIMATION OF BAYESIAN UPDATE

Stochastic Dimension Reduction

Introduction to Uncertainty Quantification in Computational Science Handout #3

At-A-Glance SIAM Events Mobile App

Uncertainty Quantification

Evaluation of Non-Intrusive Approaches for Wiener-Askey Generalized Polynomial Chaos

Challenges in Uncertainty Quantification in Computational Models

Application and validation of polynomial chaos methods to quantify uncertainties in simulating the Gulf of Mexico circulation using HYCOM.

Predictability of Chemical Systems

Sobol-Hoeffding Decomposition with Application to Global Sensitivity Analysis

Utilizing Adjoint-Based Techniques to Improve the Accuracy and Reliability in Uncertainty Quantification

Identification of multi-modal random variables through mixtures of polynomial chaos expansions

A Polynomial Chaos Approach to Robust Multiobjective Optimization

Adaptive low-rank approximation in hierarchical tensor format using least-squares method

Estimating functional uncertainty using polynomial chaos and adjoint equations

On a Data Assimilation Method coupling Kalman Filtering, MCRE Concept and PGD Model Reduction for Real-Time Updating of Structural Mechanics Model

Original Research. Sensitivity Analysis and Variance Reduction in a Stochastic NDT Problem

Multiscale stochastic preconditioners in non-intrusive spectral projection

Performance Evaluation of Generalized Polynomial Chaos

MCMC Sampling for Bayesian Inference using L1-type Priors

Overview. Bayesian assimilation of experimental data into simulation (for Goland wing flutter) Why not uncertainty quantification?

Photos placed in horizontal position with even amount of white space between photos and header

Spectral Methods for Uncertainty Quantification

Uncertainty Evolution In Stochastic Dynamic Models Using Polynomial Chaos

A Unified Framework for Uncertainty and Sensitivity Analysis of Computational Models with Many Input Parameters

Algorithms for Uncertainty Quantification

Efficient Sampling for Non-Intrusive Polynomial Chaos Applications with Multiple Uncertain Input Variables

Instructions for Matlab Routines

Keywords: Sonic boom analysis, Atmospheric uncertainties, Uncertainty quantification, Monte Carlo method, Polynomial chaos method.

Solving the Stochastic Steady-State Diffusion Problem Using Multigrid

A regularized least-squares method for sparse low-rank approximation of multivariate functions

ICES REPORT A posteriori error control for partial differential equations with random data

A Stochastic Collocation based. for Data Assimilation

Spectral Representation of Random Processes

Integration using Gaussian Quadrature

New Approaches in Process Monitoring for Fuel Cycle Facilities

Dimensionality reduction and polynomial chaos acceleration of Bayesian inference in inverse problems

A Stochastic Projection Method for Fluid Flow

Model Calibration under Uncertainty: Matching Distribution Information

Enabling Advanced Automation Tools to manage Trajectory Prediction Uncertainty

Sampling and low-rank tensor approximation of the response surface

Recent Advances in Bayesian Inference Techniques

Non-Intrusive Solution of Stochastic and Parametric Equations

Parametric Problems, Stochastics, and Identification

Winter 2019 Math 106 Topics in Applied Mathematics. Lecture 1: Introduction

arxiv: v2 [physics.comp-ph] 15 Sep 2015

Hyperbolic Polynomial Chaos Expansion (HPCE) and its Application to Statistical Analysis of Nonlinear Circuits

Approximate Interval Method for Epistemic Uncertainty Propagation using Polynomial Chaos and Evidence Theory

Bifidelity Methods for Polynomial Chaos Expansions

Accepted Manuscript. SAMBA: Sparse approximation of moment-based arbitrary polynomial chaos. R. Ahlfeld, B. Belkouchi, F.

ASSESSMENT OF COLLOCATION AND GALERKIN APPROACHES TO LINEAR DIFFUSION EQUATIONS WITH RANDOM DATA

Quadrature for Uncertainty Analysis Stochastic Collocation. What does quadrature have to do with uncertainty?

arxiv: v3 [math.na] 7 Dec 2018

Multivariate Bayesian Linear Regression MLAI Lecture 11

Xun (Ryan) Huan. My research focuses on uncertainty quantification, data-driven modeling, and optimization for engineering applications.

Bayesian Regression Linear and Logistic Regression

Strain and stress computations in stochastic finite element. methods

Uncertainty Quantification of Radionuclide Release Models using Non-Intrusive Polynomial Chaos. Casper Hoogwerf

Solving the stochastic steady-state diffusion problem using multigrid

Final Report: DE-FG02-95ER25239 Spectral Representations of Uncertainty: Algorithms and Applications

Sparse polynomial chaos expansions as a machine learning regression technique

At A Glance. UQ16 Mobile App.

Downloaded 08/12/13 to Redistribution subject to SIAM license or copyright; see

Kinematic Analysis and Inverse Dynamics-based Control of Nondeterministic Multibody Systems

Polynomial Chaos and Karhunen-Loeve Expansion

Overview of sparse system identification

Parameter Selection Techniques and Surrogate Models

An Empirical Chaos Expansion Method for Uncertainty Quantification

Study Notes on the Latent Dirichlet Allocation

Ensemble Data Assimilation and Uncertainty Quantification

A Framework for Modeling and Optimizing Dynamic Systems Under Uncertainty

Nonparametric Bayesian Methods (Gaussian Processes)

arxiv: v4 [math.st] 13 Jan 2017

Accuracy, Precision and Efficiency in Sparse Grids

EFFICIENT STOCHASTIC GALERKIN METHODS FOR RANDOM DIFFUSION EQUATIONS

arxiv: v1 [math.na] 14 Sep 2017

Chapter 2 Spectral Expansions

Fast Direct Methods for Gaussian Processes

Efficient Solvers for Stochastic Finite Element Saddle Point Problems

Introduction to Probabilistic Graphical Models: Exercises

Transcription:

A Flexible Python/C++ Toolkit for Uncertainty Quantification Bert Debusschere, Khachik Sargsyan, Cosmin Safta, Prashant Rai, Kenny Chowdhary bjdebus@sandia.gov Sandia National Laboratories, Livermore, CA, USA Thursday April 19, 2018 SIAM UQ 18

Acknowledgements Cosmin Safta Khachik Sargsyan Kenny Chowdhary Habib Najm Omar Knio Roger Ghanem Olivier Le Maître and many others... Sandia National Laboratories Livermore, CA, USA KAUST, Saudi Arabia University of Southern California Los Angeles, CA, USA LIMSI-CNRS, Orsay, France This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research (ASCR), Scientific Discovery through Advanced Computing (SciDAC) and Applied Mathematics Research (AMR) programs. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy s National Nuclear Security Administration under contract DE-NA0003525. The views expressed in this talk do not necessarily represent the views of the U.S. Department of Energy or the United States Government.

Outline 1 General Characteristics 2 Surrogate Construction 3 Bayesian Model Inference and Comparison 4 Summary 5 References

UQ is about enabling predictive simulations Experiments Predictive Simulation Theory

UQ methods extract information from all sources to enable predictive simulation Experiments D, P(λ D, I) = Inference P(D λ, I)P(λ, I) P(D) P(λ) (λ, (λ) Predictive Simulation Theory du dt = f (u; λ) Forward Propagation du t P(u) u UQ not just about propagating uncertainties The term UQ covers a wide range of methods

An example workflow Prior p(λ) Model f(λ) Surrogate fc(λ) Data Likelihood D = {y i} Dim. Red. Posterior p(λ D) Forward UQ Inverse UQ g(λ) Any model Prediction p(g(λ) D)

provides tools to build a general UQ workflow Tools for Representation of random variables and stochastic processes Forward uncertainty propagation Inverse problems Sensitivity analysis Dimensionality reduction Bayesian Compressive Sensing Low Rank Tensors Gaussian Processes... Tools can be used stand-alone or combined into a general workflow

is meant to be straightforward to download, install and use Target usage: Rapid prototyping of UQ workflows Algorithmic research in UQ Tutorials / educational Expertise in UQ methods (or a desire to acquire it) helpful Released under the GNU Lesser General Public License http://www.sandia.gov/uqtoolkit/ Current version 3.0.4 Version 3.1.0 planned for later this year No massive third party libaries to download, install, and configure

is used in a variety of applications Direct collaborations US DOE SciDAC FASTMath institute https://fastmath-scidac.llnl.gov/ Variety of US DOE SciDAC partnership projects Part of US DOE BER E3SM climate model analysis tools Many other research groups at universities, National Labs, and industry Always welcome new applications / collaborations Mailing lists uqtk-announce@software.sandia.gov uqtk-users@software.sandia.gov Join at http://www.sandia.gov/uqtoolkit

Downloads 800 600 400 200 0 2012 20132014 Total Downloads 2015 20162017 V 1.x V 2.x V 3.x 2018 Downloads from http://www.sandia.gov/uqtoolkit 900 total downloads 200 downloads of version 3.x

We rely on Polynomial Chaos expansions (PCEs) to represent uncertainty Standard PC Basis types supported: Gauss Hermite Uniform Legendre Gamma Laguerre Beta Jacobi Also support for custom orthogonal polynomials Defined by user-provided three-term recurrence formula Both intrusive and non-intrusive PC tools provided Primarily Galerkin projection methods Some regression approaches offered through Bayesian Compressed Sensing module See also Debusschere, et al. 2004; Sargsyan, et al. 2014

uses a combination of C++ and Python Main libraries in C++ PCBasis and PCSet classes: PC tools (intrusive and non-intrusive) Quad class: quadrature rules (full tensor and sparse tensor product rules) MCMC, Gproc,... Functionality available via Direct linking of C++ code Standalone apps Python interface based on Swig ( version 3.0) Download as tar file and configure with CMake Examples of common workflows provided

Upcoming Features in 3.1.0 New Functionality Canonical Low Rank Tensor (LRT) Representations Data Free Inference (DFI) Library tempered MCMC (tmcmc) Enhanced functionality Python scripts for model evidence computation Python class for Bayesian Compressed sensing Additional examples and tutorials Expected Fall 2018 Sign up for the announcement mailing list uqtk-announce@software.sandia.gov at http://www.sandia.gov/uqtoolkit

Longer Term Plans Coupling with other libraries Better support for user specified third-party libaries, e.g. random number generators, integrators,... Coupling with DAKOTA and MUQ for leveraging functionality Mixed PC basis types More general multi-index specification Data structures amenable to parallelization and GPU acceleration Other developments you would like to see? Let us know at uqtk-users@software.sandia.gov

Outline 1 General Characteristics 2 Surrogate Construction 3 Bayesian Model Inference and Comparison 4 Summary 5 References

Bayesian Compressed Sensing N training data points (x n, u n ) and K basis terms Ψ k ( ) Projection matrix P N K with P nk = Ψ k (x n ) Find regression weights c = (c 0,..., c K 1 ) so that u Pc or u n k c kψ k (x n ) The number of polynomial basis terms grows fast; a p-th order, d-dimensional basis has a total of K = (p + d)!/(p!d!) terms. For limited data and large basis set (N < K ) this is a sparse signal recovery problem need some regularization/constraints. Least-squares argmin c { u Pc 2 2 } The sparsest argmin c { u Pc 2 2 + α c 0 } Compressive sensing argmin c { u Pc 2 2 + α c 1 }

Bayesian Compressed Sensing N training data points (x n, u n ) and K basis terms Ψ k ( ) Projection matrix P N K with P nk = Ψ k (x n ) Find regression weights c = (c 0,..., c K 1 ) so that u Pc or u n k c kψ k (x n ) The number of polynomial basis terms grows fast; a p-th order, d-dimensional basis has a total of K = (p + d)!/(p!d!) terms. For limited data and large basis set (N < K ) this is a sparse signal recovery problem need some regularization/constraints. Least-squares argmin c { u Pc 2 2 } The sparsest argmin c { u Pc 2 2 + α c 0 } Compressive sensing argmin c { u Pc 2 2 + α c 1 } Bayesian Likelihood Prior

BCS removes unnecessary basis terms f (x, y) = cos(x +4y) f (x, y) = cos(x 2 +4y) 0 Order (dim 2) 0 1 2 3 4 5 6 7 8 9 10 2 0 Order (dim 2) 0 1 2 3 4 5 6 7 8 9 10 2 1 2 3 4 6 1 2 3 4 6 Order (dim 1) 4 5 6 7 8 8 10 12 14 Order (dim 1) 4 5 6 7 8 8 10 12 14 9 16 9 16 10 18 10 18 The square (i, j) represents the (log) spectral coefficient for the basis term ψ i (x)ψ j (y).

Iterative Bayesian Compressive Sensing (ibcs) Iterative BCS: iteratively increase the order for the relevant basis terms while maintaining the dimensionality reduction [Sargsyan et al. 2014], [Jakeman et al. 2015]. Combine basis growth and reweighting! Model data Initial Basis Weighted BCS Sparse Basis Final Basis Iterations New Basis Basis Growth Reweighting

Low-Rank Approximations Univariate function representation: p + 1 coefficients u(ξ) = P u k ψ k (ξ) k=0 Multivariate function representation: P + 1 = (n+p)! n!p! u(ξ 1,..., ξ n ) = P k=0 u k n ψ α k i (ξ i ) i=1 Low-rank approximation: r (p 1 + p 2 +... + p n ) u(ξ 1,..., ξ n ) = r k=1 w (1) k (ξ 1 )... w (n) k (ξ n )

Minimization Problem for Coefficients Minimization problem: min u(ξ) ũ(ξ) 2 + λr(ũ(ξ)) ũ M where M is a suitable tensor subset (Canonical, Tensor Train) and R is a regularization function Selection of optimal rank r and regularization coefficient λ using cross validation Pros: Linear increase in parameters with dimension Cons: Non-linear optimization problem. Optimal approximation ũ(ξ) is often not known

Low Rank Tensor and Bayesian Compressed Sensing Applied to Gauss Genz Function Relative approximation error 10 2 10 1 10 0 10 1 10 2 10 3 10 4 Genz Gaussian Model with D = 5 Low rank BCS 10 20 30 40 50 Sample Size u(x) = exp Relative approximation error 10 2 10 1 10 0 10 1 10 2 10 3 Genz Gaussian Model with D = 20 Low rank BCS 100 200 300 400 500 Sample Size ( d i=1 w 2 i (x i b) 2 ), w i = 1, b = 0.5 Box plot over 21 replica tests Low-rank approach performs well on function with inherent low-rank structure

Low Rank Tensor and Bayesian Compressed Sensing Applied to Gauss Genz Function Relative approximation error 10 2 10 1 10 0 10 1 10 2 10 3 10 4 Genz Gaussian Model with D = 5 Low rank BCS 10 20 30 40 50 Sample Size u(x) = exp Relative approximation error 10 1 10 0 10 1 10 2 10 3 10 4 Genz Gaussian Model with D = 20 Low rank BCS 100 200 300 400 500 Sample Size ( d i=1 w 2 i (x i b) 2 ), w i = 1 i 3, b = 0.5 Box plot over 21 replica tests Bayesian compressive sensing does well on function with inherent sparsity and low number of samples

Outline 1 General Characteristics 2 Surrogate Construction 3 Bayesian Model Inference and Comparison 4 Summary 5 References

Bayesian Inference and Model Comparison Model for thermodynamic properties of RedOx active materials Used in design of materials for solar thermochemical Hydrogen production General model form δ = f (p O2, T ) Model A: 4 parameters Model B: 8 parameters Bayesian parameter inference and model comparison Joint work with Dr. Ellen Stechel at Arizona State University Funded by the DOE Office of Energy Efficiency and Renewable Energy (EERE)

Bayesian Inference and Model Comparison Employed Python Bayesian Inference tools to infer parameters and compare the two models Model properties and numerical settings specified via flexible xml input file Python postprocessing and model evidence computation Workflow will be part of 3.1.0 release later this year

Both models agree well with data Model A (left) and Model B (right)

Both models agree well with data Model B (right) has smaller residuals

Intro BCS/Low Rank Model Inference/Comparison Summary References Posterior distributions were sampled with adaptive MCMC Well-defined uni-modal distributions Model B has more dependencies between its parameters

Model evidence favors model B Model evidence computed from posterior samples, using a Gaussian approximation Model A: Ln(evidence) = 1580 Model B: Ln(evidence) = 1939 Despite its higher complexity, model B is clearly favored. For situations with more measurement noise, or fewer data points, a simpler model maybe preferred

Summary provides a powerful set of tools for building general UQ workflows Multiple ways to access functionality Direct linking of C++ code Standalone apps Python interface based on Swig Available at http://www.sandia.gov/uqtoolkit Suggestions for improvements welcome! Do not hesitate to contact us uqtk-users@software.sandia.gov

References B. Debusschere, H. Najm, P. Pébay, O. Knio, R. Ghanem and O. Le Maître, Numerical Challenges in the Use of Polynomial Chaos Representations for Stochastic Processes, SIAM J. Sci. Comp., 26:2, 2004. K. Sargsyan, et al., Dimensionality reduction for complex models via bayesian compressive sensing, Int. J. of Uncertainty Quantification, 4, 1:63 93, 2014. Handbook of Uncertainty Quantification, R. Ghanem, D. Higdon, H. Owhadi (Eds.), Springer, 2016, http://www.springer.com/us/book/9783319123844 Jakeman, J. D., Eldred, M. S., and Sargsyan, K., Enhancing l 1 -minimization estimates of polynomial chaos expansions using basis selection, J. of Computational Physics, 289, 18 34, 2015. UQ Tutorials: http://www.quest-scidac.org/outreach/tutorials/