Simulation optimization via bootstrapped Kriging: Survey

Similar documents
Tilburg University. Simulation-Optimization via Kriging and Bootstrapping Kleijnen, J.P.C. Publication date: Link to publication

Optimization of simulated systems is the goal of many methods, but most methods assume known environments.

Design and Analysis of Simulation Experiments

CONDITIONAL SIMULATION FOR EFFICIENT GLOBAL OPTIMIZATION. Jack P.C. Kleijnen Ehsan Mehdad

Tilburg University. Response Surface Methodology Kleijnen, J.P.C. Publication date: Link to publication

PARAMETRIC AND DISTRIBUTION-FREE BOOTSTRAPPING IN ROBUST SIMULATION-OPTIMIZATION. Carlo Meloni

Simulation. Where real stuff starts

Tilburg University. Efficient Global Optimization for Black-Box Simulation via Sequential Intrinsic Kriging Mehdad, Ehsan; Kleijnen, Jack

DESIGN OF EXPERIMENTS: OVERVIEW. Jack P.C. Kleijnen. Tilburg University Faculty of Economics and Business Administration Tilburg, THE NETHERLANDS

Sequential approaches to reliability estimation and optimization based on kriging

Mustafa H. Tongarlak Bruce E. Ankenman Barry L. Nelson

Covariance function estimation in Gaussian process regression

Better Simulation Metamodeling: The Why, What and How of Stochastic Kriging

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Simulation. Where real stuff starts

Quantifying Stochastic Model Errors via Robust Optimization

BEST ESTIMATE PLUS UNCERTAINTY SAFETY STUDIES AT THE CONCEPTUAL DESIGN PHASE OF THE ASTRID DEMONSTRATOR

ROBUST SIMULATION-OPTIMIZATION USING METAMODELS. Carlo Meloni

ESTIMATION AND OUTPUT ANALYSIS (L&K Chapters 9, 10) Review performance measures (means, probabilities and quantiles).

Chapter 11. Output Analysis for a Single Model Prof. Dr. Mesut Güneş Ch. 11 Output Analysis for a Single Model

Adjustable Robust Parameter Design with Unknown Distributions Yanikoglu, Ihsan; den Hertog, Dick; Kleijnen, J.P.C.

STA414/2104. Lecture 11: Gaussian Processes. Department of Statistics

ISyE 6644 Fall 2014 Test 3 Solutions

FDSA and SPSA in Simulation-Based Optimization Stochastic approximation provides ideal framework for carrying out simulation-based optimization

Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study

11. Bootstrap Methods

Multivariate Versus Univariate Kriging Metamodels for Multi-Response Simulation Models (Revision of ) Kleijnen, J.P.C.

Preliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference

Bootstrap & Confidence/Prediction intervals

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Stochastic Models. Edited by D.P. Heyman Bellcore. MJ. Sobel State University of New York at Stony Brook

Preliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference

Tilburg University. Design and Analysis of simulation experiments Kleijnen, Jack. Document version: Early version, also known as pre-print

REDUCING INPUT PARAMETER UNCERTAINTY FOR SIMULATIONS. Szu Hui Ng Stephen E. Chick

BOOTSTRAPPING AND VALIDATION OF METAMODELS IN SIMULATION

ECE521 week 3: 23/26 January 2017

Least Squares. Ken Kreutz-Delgado (Nuno Vasconcelos) ECE 175A Winter UCSD

Post-exam 2 practice questions 18.05, Spring 2014

Remedial Measures for Multiple Linear Regression Models

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Optimization Tools in an Uncertain Environment

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

Robust Dual-Response Optimization

Particle Filtering for Data-Driven Simulation and Optimization

Formal Statement of Simple Linear Regression Model

STA414/2104 Statistical Methods for Machine Learning II

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

Discrete-Event System Simulation

Some general observations.

2905 Queueing Theory and Simulation PART IV: SIMULATION

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008

Confidence Intervals, Testing and ANOVA Summary

Slides 12: Output Analysis for a Single Model

[y i α βx i ] 2 (2) Q = i=1

Asymptotic Statistics-III. Changliang Zou

Practice Problems Section Problems

Condensed Table of Contents for Introduction to Stochastic Search and Optimization: Estimation, Simulation, and Control by J. C.

LOGNORMAL ORDINARY KRIGING METAMODEL IN SIMULATION OPTIMIZATION

Part 6: Multivariate Normal and Linear Models

Interval Estimation III: Fisher's Information & Bootstrapping

Cross-Validation with Confidence

The Nonparametric Bootstrap

Lecture : Probabilistic Machine Learning

LOGNORMAL ORDINARY KRIGING METAMODEL

Kevin Ewans Shell International Exploration and Production

Statistics & Data Sciences: First Year Prelim Exam May 2018

Using R in Undergraduate and Graduate Probability and Mathematical Statistics Courses*

SOME HISTORY OF STOCHASTIC PROGRAMMING

Contents 1 Introduction 2 Statistical Tools and Concepts

mlegp: an R package for Gaussian process modeling and sensitivity analysis

Cross-Validation with Confidence

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Probability and Statistics Notes

Sensitivity Analysis with Correlated Variables

Chapter 11 Estimation of Absolute Performance

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Machine learning, ALAMO, and constrained regression

p y (1 p) 1 y, y = 0, 1 p Y (y p) = 0, otherwise.

Monitoring Wafer Geometric Quality using Additive Gaussian Process

Bias Variance Trade-off

Outline Introduction OLS Design of experiments Regression. Metamodeling. ME598/494 Lecture. Max Yi Ren

Monte Carlo Simulations

Introduction to Geostatistics

Stat 5101 Lecture Notes

Probability and Information Theory. Sargur N. Srihari

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics

A QUICK ASSESSMENT OF INPUT UNCERTAINTY. Bruce E. Ankenman Barry L. Nelson

Supplement A. The relationship of the proportion of populations that replaced themselves

Covariance Matrix Simplification For Efficient Uncertainty Management

Review of Statistics 101

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

2WB05 Simulation Lecture 7: Output analysis

Probabilistic numerics for deep learning

Defect Detection using Nonparametric Regression

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 19

NON-STATIONARY QUEUE SIMULATION ANALYSIS USING TIME SERIES

Multivariate Regression

What to do if Assumptions are Violated?

Why do we care? Examples. Bayes Rule. What room am I in? Handling uncertainty over time: predicting, estimating, recognizing, learning

Chapter 1 Statistical Inference

Transcription:

Simulation optimization via bootstrapped Kriging: Survey Jack P.C. Kleijnen Department of Information Management / Center for Economic Research (CentER) Tilburg School of Economics & Management (TiSEM) Tilburg University, Tilburg, Netherlands Workshop "Stochastic and noisy simulators" Organizer "GDR MASCOT-NUM, Paris, 17 May 2011

Overview Deterministic & random simulation models Focus: Optimization via Kriging metamodel Analysis of Kriging: Bootstrapping Random simulation: replicates distribution-free bootstrap Deterministic simulation: no replicates parametric bootstrap (multivariate Gaussian); parameters estimated from simulator s I/O 2

Introduction Workshop theme: Stochastic & noisy simulators 3 interpretations: Deterministic simulation with numerical noise Deterministic simulation with random input distribution: Uncertainty propagation Epistemic uncertainty Pseudo-Random Numbers (PRN) in Discrete Event Dynamic Systems (DEDS) Aleatory uncertainty See next slide 3

DEDS: Single-server queue Time w(i + 1) = w(i) + s(i) a(i + 1) if positive; else 0 M/M/1 (s & a times Markov / Poisson / exponential): s(i) = -ln r(2i 1)] / µ with E(s) = 1/µ & pseudo-random number 0 < r < 1 a(i + 1) = -ln r(2i) / λ Start with empty system: w(1) = 0 Stop after (say) 100,000 simulated jobs: i = 100,000 Extensions: Servers in parallel or sequence, feedbacks, priorities Conclusion: DEDS: Discrete Event Dynamic Systems (other type: nonlinear difference equations) Start & stop states of system Inputs: µ, λ (plus seed r(0)) 4

Kriging: Random simulation 1. Geostatistics: Nugget / Measurement error 2. Deterministic simulator: Numerical noise Sub 1 & 2: Y(x) = µ + Z(x) + e with GP Z(x) & white noise e ~ NIID[0,σ(e)] 3. Random simulator: σ(e) σ[x(i)] & CRN Predictor for new point x(n+1): Y^[x(n+1)] = µ +σ²σ (n+1)[σ +Σ(ebar)] ¹(ybar - 1µ) with ybar average of m(i) replicates of point i Σ(ebar) diagonal, unless CRN Not an exact interpolator of n averages 5

Kriging: MLE Plug-in MLE: Non-linear Kriging predictor Y^^[x(n+1)] = µ^ +σ^² r^ [Σ^ +Σ^(ebar)] ¹(ybar - 1µ^) Biased estimator of predictor var., s²{y^^[x(n+1)]} Random simulation: Estimate Σ(e) from replicates CRN: m > n (# replicates > # combinations) s²{y^^(x)} and s²{y^(x)} are not max at same x (see EGO; slide 7) 6

Parametric bootstrap for s²{y^^(x)} 1. Original I/O (X, y) gives original µ^ and Σ^(θ^, σ^) 2. Sample (y*(1),, y*(n), y*(n + 1)) from N(µ^,Σ^) with µ^: all (n + 1) elements equal to µ^ Σ^: (n + 1) x (n + 1) matrix with (see paper) 3. Bootstrapped I/O (X, y*) (see Step 2) gives bootstrap µ*^ and Σ*^(θ*^, σ*^) (see Step 1) 4. Compute bootstrap predictor y*^^(n + 1), using Step 3 5. Compute squared Error SE = [y*^^(n + 1) - y*(n + 1)]² (see Steps 4 resp. 2) 6. Repeat Steps 2-5, B times: s²* = Σ b SE(b) / B Example: Circuit simulator in Sacks et al. (1989) 7

EI / EGO with bootstrap variance Local / global optima: Exploration / exploitation Assume: Deterministic simulation; single output 1.Find y o, minimum among n old outputs 2.Find x o, maximizer x of EI(x) = E[y o - y(x) y(x) < y o ] with y(x) ~ N(y^^, s 2 ) Find x o via candidate set or Genetic Algorithm 3.Simulate x o ; refit Kriging; go to 1 until EI 0 Alternative: Bootstrap estimator s 2 * Result: Better in 3 of 4 test functions; one tie 8

Constrained optimization in random simulation Goal output y(0): Min E[y(0, x)] Other r 1 constrained outputs: E[y(h, x)] c(h) s constraints for d inputs: f(g)[x(1),,x(d)] c(g) Non-negative integer inputs: x(j) N Solution combines (see next slide) Sequential DOE (like EGO) Kriging (like EGO) Integer Non-Linear Programming (INLP) 9

10

Robust optimization Taguchi s worldview: Decision inputs (e.g., service rate, order quantity) Uncertain environmental inputs (e.g., demand) Taguchi s methodology adapted: Kriging replaces polynomial regression Bootstrap to quantify Kriging s variability NLP to estimate Pareto frontier Example: Deterministic simulator with uncertain demand rate Minimize expected cost E(C) such that standard deviation σ(c) T 11

Robust optimization: Methodology 1. Design: Cross Space-filling design for decision inputs d LHS for environmental inputs e with prob. F(e) n(d) conditional means: wbar(i) = w(ij) / n(e) Conditional variance: s(i)² = [(w(ij) wbar(i)]² / [n(e) -1] 2. Metamodel: Kriging metamodel for µ resp. σ 3. Min µ s.t. σ T: Mathematical Programming 4. Vary T: Pareto frontier 5. Quantify frontier s variability: Distribution-free bootstrap (resample n(e) times w (n(d)-dimen.) 12

Monotonic bootstrapped Kriging Practice: I/O function known to be monotonic Example: Queuing simulation s mean & quantile Random simulation: Replication Distribution-free bootstrapped Kriging (next slide) Result: Confidence intervals with higher coverage and similar width Future research: Replace classic Kriging by stochastic Kriging Preserve convexity or nonnegativity Deterministic simulator: Parametric bootstrap 13

Procedure for monotonic Kriging 1. Resample -- with replacement -- the m IID original w(i, r): w*(i, r) [i = 1,, n; r = 1,, m] 2. From w*(i, r) compute the average wbar*(i) 3. From (X, wbar*) compute Kriging y(θ*) 4. Accept only monotonically increasing Kriging: y(i)* > 0 (i = 1,,n): Positive gradients 5. Repeat B times; sort B predictions y*[x(n + 1)] Point estimator: Median of B predictions 90% confidence interval (CI): Lower limit: 5% quantile of B predictions Upper limit: 95% quantile of B Asymmetric CI; positive lower limit 14

Conclusions: General Topic: Simulation optimization via bootstrap Kriging Bootstrapping: 1. Distribution-free: Random simulation with replicates 2. Parametric: Deterministic simulation (no replicates) Multivariate Gaussian with MLE of parameters 15

Conclusions: Specific topics EGO: Parametric bootstrap estimator of variance of Kriging predictor with random par. Constrained opt. in random simulation: Distribution-free bootstrap for validation of Kriging model (giving opt. via INLP) Robust opt. for uncertain environment: Distribution-free bootstrap for variability of Kriging model (giving Pareto frontier via NLP) Monotonic bootstrapped Kriging The End 16

Kriging: Basics Kriging: Global model Y(x) = µ + Z(x) with stationary GP Z(x) with zero mean corr[y{x(i)}, Y{x(j)}] = exp[-θ(k){x(ik) x(jk)}²) Linear predictor for point x(n+1): Y^[x(n+1)] = µ + r R ¹(y-1µ) Exact interpolator: y^[x(i)] = y[x(i)] with i = 1,, n Predictor variance: σ²[1- r R ¹r + {(1-1 R ¹r)²)/(1 R ¹1)}] 17

Parametric bootstrap: Basics Data driven statistical method Examples: Give n IID observations y(i) (i = 1,,n) a.mean E(y) of y(i) ~ Exp(λ) b.skewness: _(y(i) ybar)³/[(n - 1)s³] Sub a: 1.Estimate λ^ = 1/ybar 2.Sample y* from Exp(λ^): Parametric bootstrap 3.Estimate mean: y*bar = y*(i)}/n 4.Repeat Steps 2-3: y*bar(b) (b = 1,, B) 5.Sort y*bar(b): y*bar(1) < < y*bar(b) 6.90% CI for mean: y*bar(0.05b), y*bar(0.95b) 18

Distribution-free bootstrap: Basics 1. Resample with replacement y(i) gives y*(i) Example: y(1) is sampled, 0, 1,, n times 2. Estimate mean: y*bar = y*(i)}/n 3. Repeat Steps 2-3: y*bar(b) (b = 1,, B) 4. Sort y*bar(b): y*bar(1) < < y*bar(b) 5. 90% CI for mean: y*bar(0.05b), y*bar(0.95b) 19

Bootstrap: Applications Bootstrap: simple idea; yet, art of modeling 1. CI for estimated skewness (Example 2) 2. Validation of simulation models 3. Ranking of journals on quality (citations) 4. See next slides 20

Constrained optimization: details Enough replicates per point; see Law (2007) CRN Fit Kriging to averages Global/local: Steps 5 /9 in next flowchart Cross-validate n(cv) points incl. bootstrap: Step 4 Complications in bootstrap: Multiple outputs, non-constant m(i), CRN Studentized prediction error: Divide by of bootstrapped variance + replicate var. estimate Apply Bonferroni s inequality: α/[r n(cv)] 21

Constrained optimization: details Step 5: If Kriging is rejected, then add point halfway worst point and nearest neighbor Adapt for continuous inputs, incl. gradients deterministic outputs Applications: Academic inventory system (s, S) Realistic call center Better than OptQuest (in Arena) 22

Robust optimization: Example 8.92 x 10 4 8.9 8.88 8.86 Regression 1-L Kriging 2-L Kriging 8700 8600 8500 Regression 1-L Kriging 2-L Kriging 8.84 8400 C s C 8.82 8300 8.8 8200 8.78 8.76 8100 8.74 1.5 2 2.5 3 3.5 4 4.5 Q x 10 4 8000 1.5 2 2.5 3 3.5 4 4.5 Q x 10 4 23

Robust optimization: Future research Replace Min µ s.t. σ T by quantile or CVaR Random (s, S): aleatory & epistemic uncertainty Multiple constrained outputs 24

M/M/1 example: Wiggling versus monotonic Kriging m = 5; no CRN; wbar(i) < wbar(i + 1); Gaussian correlation function 25

26