Stability-based generation of scenario trees for multistage stochastic programs

Similar documents
Stability of Stochastic Programming Problems

Stochastic Programming: From statistical data to optimal decisions

Jitka Dupačová and scenario reduction

A note on scenario reduction for two-stage stochastic programs

Airline Network Revenue Management by Multistage Stochastic Programming

Scenario generation in stochastic programming

Scenario tree generation for multi-stage stochastic programs

Scenario Reduction: Old and New Results

Scenario generation in stochastic programming with application to energy systems

GAMINGRE 8/1/ of 7

Stability of optimization problems with stochastic dominance constraints

Scenario Generation in Stochastic Programming with Application to Optimizing Electricity Portfolios under Uncertainty

Scenario generation Contents

Operations Research Letters. On a time consistency concept in risk averse multistage stochastic programming

Conditioning of linear-quadratic two-stage stochastic programming problems

Stochastic optimization, multivariate numerical integration and Quasi-Monte Carlo methods. W. Römisch

Complexity of two and multi-stage stochastic programming problems

arxiv: v2 [math.oc] 18 Nov 2017

On the Scenario-Tree Optimal-Value Error for Stochastic Programming Problems

Introduction to Stochastic Optimization Part 4: Multi-stage decision

Stochastic Programming: Tutorial Part II

Approximation of stochastic optimization problems and scenario generation

Lecture 1. Stochastic Optimization: Introduction. January 8, 2018

CONVERGENCE ANALYSIS OF SAMPLING-BASED DECOMPOSITION METHODS FOR RISK-AVERSE MULTISTAGE STOCHASTIC CONVEX PROGRAMS

Scenario estimation and generation

In Centre, Online Classroom Live and Online Classroom Programme Prices

Optimal scenario generation and reduction in stochastic programming

Birgit Rudloff Operations Research and Financial Engineering, Princeton University

Monthly Trading Report July 2018

GENERALIZED second-order cone complementarity

SYSTEM BRIEF DAILY SUMMARY

Annual Average NYMEX Strip Comparison 7/03/2017

ANOVA decomposition of convex piecewise linear functions

Asymptotics of minimax stochastic programs

MIDAS: A Mixed Integer Dynamic Approximation Scheme

Monthly Trading Report Trading Date: Dec Monthly Trading Report December 2017

Time Series Analysis

SYSTEM BRIEF DAILY SUMMARY

A look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP

Weierstraß-Institut. für Angewandte Analysis und Stochastik. Leibniz-Institut im Forschungsverbund Berlin e. V. Preprint ISSN

Regularized optimization techniques for multistage stochastic programming

Upper bound for optimal value of risk averse multistage problems

Dynamic Generation of Scenario Trees

DEPARTMENT OF THE ARMY MILITARY SURFACE DEPLOYMENT AND DISTRIBUTION COMMAND (SDDC) 1 SOLDIER WAY SCOTT AFB, IL 62225

Stochastic Integer Programming

A. Shapiro Introduction

Average 175, , , , , , ,046 YTD Total 1,098,649 1,509,593 1,868,795 1,418, ,169 1,977,225 2,065,321

Average 175, , , , , , ,940 YTD Total 944,460 1,284,944 1,635,177 1,183, ,954 1,744,134 1,565,640

On a class of minimax stochastic programs

Scenario Grouping and Decomposition Algorithms for Chance-constrained Programs

Quantitative Stability of Two Stage Linear Second-order Conic Stochastic Programs with Full Random Recourse

A Tighter Variant of Jensen s Lower Bound for Stochastic Programs and Separable Approximations to Recourse Functions

Convex Optimization on Large-Scale Domains Given by Linear Minimization Oracles

Technical note on seasonal adjustment for Capital goods imports

Long-term Water Quality Monitoring in Estero Bay

Approximating Fixed-Horizon Forecasts Using Fixed-Event Forecasts

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -12 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Minimax and risk averse multistage stochastic programming

Jayalath Ekanayake Jonas Tappolet Harald Gall Abraham Bernstein. Time variance and defect prediction in software projects: additional figures

SOME HISTORY OF STOCHASTIC PROGRAMMING

2016 Year-End Benchmark Oil and Gas Prices (Average of Previous 12 months First-Day-of-the Month [FDOM] Prices)

Decomposability and time consistency of risk averse multistage programs

Introduction to Course

Valid Inequalities and Restrictions for Stochastic Programming Problems with First Order Stochastic Dominance Constraints

Splitting Techniques in the Face of Huge Problem Sizes: Block-Coordinate and Block-Iterative Approaches

Reformulation of chance constrained problems using penalty functions

On the maximum and minimum response to an impulse in Svars

Risk neutral and risk averse approaches to multistage stochastic programming.

Tight and Compact MILP Formulation for the Thermal Unit Commitment Problem

Optimization Tools in an Uncertain Environment

Robust Process Control by Dynamic Stochastic Programming

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Applied Analysis (APPM 5440): Final exam 1:30pm 4:00pm, Dec. 14, Closed books.

An Effective Heuristic for Multistage Stochastic Linear Programming

Optimization of Dispersed Energy Supply - Stochastic Programming with Recombining Scenario Trees

OPTIMAL SOLUTIONS TO STOCHASTIC DIFFERENTIAL INCLUSIONS

FROM EMPIRICAL OBSERVATIONS TO TREE MODELS FOR STOCHASTIC OPTIMIZATION: CONVERGENCE PROPERTIES

Complexity of stochastic branch and bound methods for belief tree search in Bayesian reinforcement learning

Stochastic Programming Models in Design OUTLINE

3. Problem definition

How are adding integers and subtracting integers related? Work with a partner. Use integer counters to find 4 2. Remove 2 positive counters.

Kernel Method: Data Analysis with Positive Definite Kernels

Forecasting the Canadian Dollar Exchange Rate Wissam Saleh & Pablo Navarro

Risk-Averse Dynamic Optimization. Andrzej Ruszczyński. Research supported by the NSF award CMMI

A Concise Course on Stochastic Partial Differential Equations

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities

Course Handouts: Pages 1-20 ASSET PRICE BUBBLES AND SPECULATION. Jan Werner

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Stochastic Unit Commitment with Topology Control Recourse for Renewables Integration

Stochastic Dual Dynamic Integer Programming

MAS3706 Topology. Revision Lectures, May I do not answer enquiries as to what material will be in the exam.

Probability metrics and the stability of stochastic programs with recourse

GTR # VLTs GTR/VLT/Day %Δ:

Mr. XYZ. Stock Market Trading and Investment Astrology Report. Report Duration: 12 months. Type: Both Stocks and Option. Date: Apr 12, 2011

Risk averse stochastic programming: time consistency and optimal stopping

Stochastic Programming: Basic Theory

Scenario grouping and decomposition algorithms for chance-constrained programs

Revisiting some results on the complexity of multistage stochastic programs and some extensions

Totally Unimodular Stochastic Programs

Stochastic Mathematical Programs with Equilibrium Constraints, Modeling and Sample Average Approximation

Transcription:

Stability-based generation of scenario trees for multistage stochastic programs H. Heitsch and W. Römisch Humboldt-University Berlin Institute of Mathematics 10099 Berlin, Germany http://www.math.hu-berlin.de/~romisch Page 1 of 22 ISMP 2006, Rio de Janeiro, July 30 August 5, 2006

Multistage stochastic programs Let ξ = {ξ t } T t=1 be an IR d -valued discrete-time stochastic process defined on some probability space (Ω, F, IP ) and with ξ 1 deterministic. The stochastic decision x t at period t is assumed to be measurable with respect to the σ-field F t (ξ) := σ(ξ 1,..., ξ t ) (nonanticipativity). Multistage stochastic program: T min IE[ x t X t, b t (ξ t ), x t ] x t is F t (ξ) measurable, t = 1,..., T, t=1 A t,0 x t + A t,1 (ξ t )x t 1 = h t (ξ t ), t = 2,..., T where X t are nonempty and polyhedral sets, A t,0 are fixed recourse matrices and b t ( ), h t ( ) and A t,1 ( ) are affine functions depending on ξ t, where ξ varies in a polyhedral set Ξ. If the process {ξ t } T t=1 has a finite number of scenarios, they exhibit a scenario tree structure. Page 2 of 22

To have the multistage stochastic program well defined, we assume x t L r (Ω, F, IP ; IR m t ) and ξ t L r (Ω, F, IP ; IR d ), where r 1 and r := r r 1, if costs are random r, if only right-hand sides are random, if all technology matrices are random and r = T. Then nonanticipativity may be expressed as x N r (ξ) N r (ξ) = {x T t=1l r (Ω, F, IP ; IR m t ) : x t = IE[x t F t (ξ)], t}, i.e., as a subspace constraint, by using the conditional expectations IE[ F t (ξ)]. Page 3 of 22 For T = 2 we have N r (ξ) = IR m 1 L r (Ω, F, P ; IR m 2 ). infinite-dimensional optimization problem

Data process approximation by scenario trees The process {ξ t } T t=1 is approximated by a process forming a scenario tree being based on a finite set N IN of nodes. n = 1 t = 1 ξ n n n t = 2 N+ (n) N T t(n) T Page 4 of 22 Scenario tree with T = 5, N = 22 and 11 leaves n = 1 root node, n unique predecessor of node n, path(n) = {1,..., n, n}, t(n) := path(n), N + (n) set of successors to n, N T := {n N : N + (n) = } set of leaves, path(n), n N T, scenario with (given) probability π n, π n := ν N + (n) πν probability of node n, ξ n realization of ξ t(n).

Tree representation of the optimization model { } min π n b t(n) (ξ n ), x n xn X t(n), n N, A 1,0 x 1 = h 1 (ξ 1 ) A t(n),0 x n + A t(n),1 x n =h t(n) (ξ n ), n N n N How to solve the optimization model? - Standard software (e.g., CPLEX) - Decomposition methods for (very) large scale models (Ruszczynski/Shapiro (Eds.): Stochastic Programming, Handbook, 2003) Page 5 of 22 Questions: Under which conditions and in which sense do multistage models behave stable with respect to perturbations of ξ? Can such stability results be used to generate (multivariate) scenario trees?

Dynamic programming Theorem: (Evstigneev 76, Rockafellar/Wets 76) Under weak assumptions the multistage stochastic program is equivalent to the (first-stage) convex minimization problem { min f(x 1, ξ)p (dξ) : x 1 X 1 }, Ξ where f is an integrand on IR m 1 Ξ given by f(x 1, ξ):= b 1 (ξ 1 ), x 1 + Φ 2 (x 1, ξ 2 ), Φ t (x 1,..., x t 1, ξ t ):=inf{ b t (ξ t ), x t +IE [ Φ t+1 (x 1,..., x t, ξ t+1 ) F t ] : x t X t, A t,0 x t + A t,1 (ξ t )x t 1 = h t (ξ t )} for t = 2,..., T, where Φ T +1 (x 1,..., x T, ξ T +1 ) := 0. The integrand f depends on the probability measure IP and, thus, also on the probability distribution P = IP ξ 1 of ξ in a nonlinear way! Hence, earlier approaches to stability fail! Page 6 of 22

Quantitative Stability Let us introduce some notations. Let F denote the objective function defined on L r (Ω, F, IP ; IR s ) L r (Ω, F, IP ; IR m ) IR by F (ξ, x) := IE[ T t=1 b t(ξ t ), x t ], let X t (x t 1 ; ξ t ) := {x t X t A t,0 x t + A t,1 (ξ t )x t 1 = h t (ξ t )} denote the t-th feasibility set for every t = 2,..., T and X (ξ) := {x L r (Ω, F, IP ; IR m ) x 1 X 1, x t X t (x t 1 ; ξ t )} the set of feasible elements with input ξ. Then the multistage stochastic program may be rewritten as min{f (ξ, x) : x X (ξ) N r (ξ)}. Let v(ξ) denote its optimal value and, for any α 0, l α (F (ξ, )) := {x X (ξ) N r (ξ) : F (ξ, x) v(ξ) + α} S(ξ) := l 0 (F (ξ, )) denote the α-level set and the solution set of the stochastic program with input ξ. Page 7 of 22

The following conditions are imposed: (A1) ξ L r (Ω, F, IP ; IR s ) for some r 1. (A2) There exists a δ > 0 such that for any ξ L r (Ω, F, IP ; IR s ) with ξ ξ r δ, any t = 2,..., T and any x 1 X 1, x τ X τ (x τ 1 ; ξ τ ), τ = 2,..., t 1, the set X t (x t 1 ; ξ t ) is nonempty (relatively complete recourse locally around ξ). (A3) The optimal values v( ξ) are finite for all ξ L r (Ω, F, IP ; IR s ) with ξ ξ r δ and the objective function F is level-bounded locally uniformly at ξ, i.e., for some α > 0 there exists a δ > 0 and a bounded subset B of L r (Ω, F, IP ; IR m ) such that l α (F ( ξ, )) is nonempty and contained in B for all ξ Lr (Ω, F, IP ; IR s ) with ξ ξ r δ. Page 8 of 22 Norm in L r : ξ r := ( T IE[ ξ t r ]) 1 r t=1

Theorem: (Heitsch/Römisch/Strugarek, SIAM J. Opt. 2006) Let (A1), (A2) and (A3) be satisfied, r > 1 and X 1 be bounded. Then there exist positive constants L and δ such that v(ξ) v( ξ) L( ξ ξ r + D f (ξ, ξ)) holds for all ξ L r (Ω, F, IP ; IR s ) with ξ ξ r δ. Assume that technology matrices are non-random and the solution x of the original problem is unique. If (ξ (n) ) is a sequence in T t=1l r (Ω, F t (ξ), IP ; IR s ) such that ξ (n) ξ r and D f (ξ (n), ξ) converge to 0 and if (x (n) ) is a sequence of solutions of the approximate problems, then the sequence (x (n) ) converges to x with respect to the weak topology in L r. Page 9 of 22 Here, D f (ξ, ξ) denotes the filtration distance of ξ and ξ defined by D f (ξ, ξ)= T 1 inf max{ x t IE[x t F t ( ξ)] r, x t IE[ x t F t (ξ)] r }. x S(ξ) x S( ξ) t=2

Remark: The continuity property of infima in the Theorem can be supplemented by a quantitative stability property of the set S 1 (ξ) of first stage solutions. Namely, there exists a constant ˆL > 0 such that sup x S 1 ( ξ) d(x, S 1 (ξ)) Ψ 1 ξ ( ˆL( ξ ξ r + D f (ξ, ξ))), where Ψ ξ (τ) := inf {IE[f(x 1, ξ)] v(ξ) : d(x 1, S 1 (ξ)) τ, x 1 X 1 } with Ψ 1 ξ (α) := sup{τ IR + : Ψ ξ (τ) α} (α IR + ) is the growth function of the original problem near its solution set S 1 (ξ). Page 10 of 22 Remark: Simple examples show that the filtration distance is indispensable for the stability result to hold.

Generation of scenario trees (i) In most practical situations scenarios ξ i with known probabilities p i, i = 1,..., N, can be generated, e.g., simulation scenarios from (parametric or nonparametric) statistical models of ξ or (nearly) optimal quantizations of the probability distribution of ξ. (ii) Construction of a scenario tree out of the scenarios ξ i with probabilities p i, i = 1,..., N,. Page 11 of 22

Approaches for (ii): (1) Bound-based approximation methods, (Frauendorfer 96, Kuhn 05, Edirisinghe 99, Casey/Sen 05). (2) Monte Carlo-based schemes (inside or outside decomposition methods) (e.g. Shapiro 03, 06, Higle/Rayco/Sen 01, Chiralaksanakul/Morton 04). (3) the use of Quasi Monte Carlo integration quadratures (Pennanen 05, 06). (4) EVPI-based sampling schemes (inside decomposition schemes) (Corvera Poire 95, Dempster 04). (5) Moment-matching principle (Høyland/Wallace 01, Høyland/Kaut/Wallace 03). (6) (Nearly) best approximations based on probability metrics (Pflug 01, Hochreiter/Pflug 02, Mirkov/Pflug 06; Gröwe-Kuska/Heitsch/Römisch 01, 03, Heitsch/Römisch 05). Page 12 of 22 Survey: Dupačová/Consigli/Wallace 00

Constructing scenario trees Let ξ be the original stochastic process on some probability space (Ω, F, IP ) with parameter set {1,..., T } and state space IR d. We aim at generating a scenario tree ξ tr such that and, thus, ξ ξ tr r and D f (ξ, ξ tr ) v(ξ) v(ξ tr ) are small. To determine such a scenario tree, we start with a discrete approximation ξ f consisting of scenarios ξ i = (ξ1, i..., ξt i ) with probabilities p i, i = 1,..., N. ξ f is a fan of individual scenarios. Page 13 of 22 t = 1 t = 2 t = 3 t = 4 t = 5

The fan ξ f is chosen such that it is adapted to the filtration (F t (ξ)) T t=1 and ξ ξ f r ε appr. Algorithms are developed that generate a scenario tree ξ tr by deleting and bundling scenarios of ξ f (that are similar at t) such that it is also adapted to the filtration (F t (ξ)) T t=1 and satisfies Since it holds (1) ξ f ξ tr r ε r T 1 (2) inf x t IE[x t F t (ξ tr )] r ε f. x S(ξ f ) t=2 D f (ξ, ξ tr ) ε appr + T 1 inf x t IE[x t F t (ξ tr )] r, x S(ξ f ) t=2 if ξ f is sufficiently close to ξ, we obtain in case ε appr + ε r δ that v(ξ) v(ξ tr ) L(2ε appr + ε r + ε f ). Page 14 of 22

(1) Forward tree generation Let scenarios ξ i with probabilities p i, i = 1,..., N, fixed root ξ1 IR d, r 1, and tolerances ε r, ε t, t = 2,..., T, be given such that T ε t ε r. t=2 Step 1: Set ˆξ 1 := ξ f and C 1 = {I = {1,..., N}}. Step t: Let C t 1 = {Ct 1, 1..., C K t 1 t 1 }. Determine disjoint index sets It k and Jt k of remaining and deleted scenarios such that It k Jt k = Ct 1, k a mapping α t : I I { i k α t (j) = t (j), j Jt k, k = 1,..., K t 1, j, otherwise, where i k t (j) I k t such that i k t (j) arg min i I k t ˆξ t 1,i ˆξ t 1,j t, Page 15 of 22

a stochastic process ˆξ t ˆξ t,i τ = { ξ α τ (i) τ, τ t,, otherwise, ξ i τ such that ˆξ t ˆξ t 1 r,t ε t. Set I t := K t 1 k=1 Ik t and C t := {α 1 t (i) : i I k t, k = 1,..., K t 1 }. Step T+1: Let C T = {CT 1,..., CK T T }. Construct a stochastic process ξ tr having K T scenarios ξtr k such that ξtr,t k := ξ α t(i) t with probabilities πt i = p j if i CT k, k = 1,..., K T, t = 2,..., T. Proposition: j C k T ξ f ξ tr r T ε t ε r. t=2 Page 16 of 22

t = 1 t = 2 t = 3 t = 4 t = 5 t = 1 t = 2 t = 3 t = 4 t = 5 t = 1 t = 2 t = 3 t = 4 t = 5 Page 17 of 22 t = 1 t = 2 t = 3 t = 4 t = 5 t = 1 t = 2 t = 3 t = 4 t = 5 t = 1 t = 2 t = 3 t = 4 t = 5 Illustration of the forward tree construction for an example including T=5 time periods starting with a scenario fan containing N=58 scenarios <Start Animation>

(2) Bounding approximate filtration distances T 1 Aim: (ξ f, ξ tr ) := inf x t IE[x t F t (ξ tr )] r ε f x S(ξ f ) t=2 Two possibilities: (i) Estimates in terms of some solutions with input ξ f, which would require to solve a two-stage model. (ii) Estimates in terms of the input ξ f. Proposition: Let (A2) and (A3) be satisfied and X 1 be bounded. Assume that the technology matrices A t,1 are non-random, 1 r < and ξ f is sufficiently close to ξ. Then there exists a constant ˆL > 0 such that Condition: (ξ f, ξ tr ) ˆL ( i I 2 i I 2 j I 2,i p j ξ j ξ i r j I 2,i p j ξ j ξ i r ε r f ) 1 r Page 18 of 22

Numerical experience We consider the electricity portfolio management of a municipal power company. Data was available on the electrical load demand and on electricity prices at the market place EEX. A multivariate statistical model is developed for the yearly demandprice process ξ that allowed to generate yearly demand-price scenarios ξ i, with probabilities p i = 1 N, i = 1,..., N. These scenarios are assumed to form the process ξ f. Branching in ξ tr was allowed at most monthly. The tolerances ε t at branching points were chosen such that Page 19 of 22 ε t = ε T [1 + q(1 2 t )], t = 2,..., T, T where the parameter q [0, 1] affects the branching structure of the constructed trees. For the test runs we used q = 0.6. The test runs were performed on a PC with a 3 GHz Intel Pentium CPU and 1 GByte main memory.

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec a) Forward tree construction with relative filtration tolerance ε rel,f = 0.35 Page 20 of 22 Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec b) Forward tree construction with relative filtration tolerance ε rel,f = 0.45 Yearly demand-price scenario trees with relative tolerance ε rel,r = 0.25

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec a) Forward tree construction with relative filtration tolerance ε rel,f = 0.6 Page 21 of 22 Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec b) Forward tree construction with relative filtration tolerance ε rel,f = 0.7 Yearly demand-price scenario trees with relative tolerance ε rel,r = 0.5

ε rel,r ε rel,f Scenarios Nodes Stages Time (sec) 0.10 0.20 98 774 988 6 25.01 0.30 99 774 424 6 25.05 0.15 0.25 94 719 714 12 24.97 0.35 94 723 495 10 24.99 0.20 0.30 90 670 321 9 24.94 0.40 90 670 478 10 24.94 0.25 0.35 85 619 296 9 24.95 0.45 87 620 340 10 24.93 0.30 0.40 80 547 824 11 24.86 0.50 83 567 250 11 24.91 0.35 0.45 72 482 163 11 24.94 0.55 76 498 732 11 24.90 0.40 0.50 67 426 794 8 24.92 0.60 71 444 060 11 24.90 0.45 0.55 60 368 380 7 24.97 0.65 65 383 556 11 24.87 0.50 0.60 50 309 225 6 24.99 0.70 60 319 380 11 24.88 0.55 0.65 44 247 303 6 25.00 0.75 51 265 336 10 24.91 0.60 0.70 37 188 263 6 25.17 0.80 45 203 321 9 24.98 Numerical results for yearly demand-price scenario trees Page 22 of 22