Simple Integer Recourse Models: Convexity and Convex Approximations

Similar documents
Cf "Expected shortage and surplus functions", in Stochastic Programming, by Willem K. Klein Haneveld and Maarten H. van der Vlerk, Dept of

Module 3. Function of a Random Variable and its distribution

Citation for published version (APA): van der Vlerk, M. H. (2002). Convex approximations for complete integer recourse models. s.n.

Introduction and Preliminaries

Introduction to Real Analysis Alternative Chapter 1

3 Integration and Expectation

Metric Spaces and Topology

Estimates for probabilities of independent events and infinite series

Dynkin (λ-) and π-systems; monotone classes of sets, and of functions with some examples of application (mainly of a probabilistic flavor)

ANALYSIS QUALIFYING EXAM FALL 2017: SOLUTIONS. 1 cos(nx) lim. n 2 x 2. g n (x) = 1 cos(nx) n 2 x 2. x 2.

1.1. MEASURES AND INTEGRALS

Convex approximations for a class of mixed-integer recourse models

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

Citation for published version (APA): van der Vlerk, M. H. (1995). Stochastic programming with integer recourse [Groningen]: University of Groningen

Some Background Material

Near-Potential Games: Geometry and Dynamics

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Near-Potential Games: Geometry and Dynamics

ASYMPTOTICALLY NONEXPANSIVE MAPPINGS IN MODULAR FUNCTION SPACES ABSTRACT

Chapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries

Elementary properties of functions with one-sided limits

Lecture 4 Lebesgue spaces and inequalities

be the set of complex valued 2π-periodic functions f on R such that

Concepts and Applications of Stochastically Weighted Stochastic Dominance

STAT 7032 Probability Spring Wlodek Bryc

SUPPLEMENTARY MATERIAL TO IRONING WITHOUT CONTROL

Convergence of generalized entropy minimizers in sequences of convex problems

The Skorokhod problem in a time-dependent interval

On a Class of Multidimensional Optimal Transportation Problems

Appendix B for The Evolution of Strategic Sophistication (Intended for Online Publication)

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Convex Functions and Optimization

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

1 The Observability Canonical Form

1 Lyapunov theory of stability

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by

NOTES ON CALCULUS OF VARIATIONS. September 13, 2012

h(x) lim H(x) = lim Since h is nondecreasing then h(x) 0 for all x, and if h is discontinuous at a point x then H(x) > 0. Denote

Notes on uniform convergence

6.2 Fubini s Theorem. (µ ν)(c) = f C (x) dµ(x). (6.2) Proof. Note that (X Y, A B, µ ν) must be σ-finite as well, so that.

LECTURE 6. CONTINUOUS FUNCTIONS AND BASIC TOPOLOGICAL NOTIONS

An introduction to Mathematical Theory of Control

Optimal stopping for non-linear expectations Part I

GENERALIZED COVARIATION FOR BANACH SPACE VALUED PROCESSES, ITÔ FORMULA AND APPLICATIONS

( f ^ M _ M 0 )dµ (5.1)

Chapter 3 Continuous Functions

Asymptotics of minimax stochastic programs

Lebesgue-Stieltjes measures and the play operator

Convex Analysis and Economic Theory Winter 2018

2. Dual space is essential for the concept of gradient which, in turn, leads to the variational analysis of Lagrange multipliers.

1 Review of di erential calculus

Limits and Continuity

An elementary proof of the weak convergence of empirical processes

Some basic elements of Probability Theory

arxiv: v1 [math.oc] 22 Sep 2016

ON KRONECKER PRODUCTS OF CHARACTERS OF THE SYMMETRIC GROUPS WITH FEW COMPONENTS

Legendre-Fenchel transforms in a nutshell

Random variables. DS GA 1002 Probability and Statistics for Data Science.

n [ F (b j ) F (a j ) ], n j=1(a j, b j ] E (4.1)

Chapter 4. Measure Theory. 1. Measure Spaces

Asymptotic at infinity. g(x) g(x) Tangential at finite point. slope y. slope y

MATH MEASURE THEORY AND FOURIER ANALYSIS. Contents

Observer design for a general class of triangular systems

Convex Optimization Notes

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008

X n D X lim n F n (x) = F (x) for all x C F. lim n F n(u) = F (u) for all u C F. (2)

Convex Analysis and Economic Theory AY Elementary properties of convex functions

Chapter 3a Topics in differentiation. Problems in differentiation. Problems in differentiation. LC Abueg: mathematical economics

Approximation of Minimal Functions by Extreme Functions

The Canonical Gaussian Measure on R

Real Analysis Problems

1 Stochastic Dynamic Programming

Notes on Measure Theory and Markov Processes

An asymptotic ratio characterization of input-to-state stability

Non-linear wave equations. Hans Ringström. Department of Mathematics, KTH, Stockholm, Sweden

Introduction to Real Analysis

A Rothschild-Stiglitz approach to Bayesian persuasion

Inexact cutting planes for two-stage mixed-integer stochastic programs Romeijnders, Ward; van der Laan, Niels

DISTRIBUTION OF THE SUPREMUM LOCATION OF STATIONARY PROCESSES. 1. Introduction

Coupled second order singular perturbations for phase transitions

Propagating terraces and the dynamics of front-like solutions of reaction-diffusion equations on R

at time t, in dimension d. The index i varies in a countable set I. We call configuration the family, denoted generically by Φ: U (x i (t) x j (t))

van Rooij, Schikhof: A Second Course on Real Functions

Lecture 3: Probability Measures - 2

Nonlinear Control Systems

16 1 Basic Facts from Functional Analysis and Banach Lattices

THEOREMS, ETC., FOR MATH 515

arxiv: v1 [math.pr] 14 Aug 2017

A Rothschild-Stiglitz approach to Bayesian persuasion

On Laplace Finite Marchi Fasulo Transform Of Generalized Functions. A.M.Mahajan

SOME MEASURABILITY AND CONTINUITY PROPERTIES OF ARBITRARY REAL FUNCTIONS

FENCHEL DUALITY, FITZPATRICK FUNCTIONS AND MAXIMAL MONOTONICITY S. SIMONS AND C. ZĂLINESCU

+ 2x sin x. f(b i ) f(a i ) < ɛ. i=1. i=1

Economics 204 Summer/Fall 2011 Lecture 5 Friday July 29, 2011

CHEBYSHEV INEQUALITIES AND SELF-DUAL CONES

Economics 574 Appendix to 13 Ways

Random Bernstein-Markov factors

Stochastic programming with simple integer recourse

Course 212: Academic Year Section 1: Metric Spaces

Transcription:

Simple Integer Recourse Models: Convexity and Convex Approximations Willem K. Klein Haneveld Leen Stougie Maarten H. van der Vlerk November 8, 005 We consider the objective function of a simple recourse problem with fixed technology matrix and integer second-stage variables. Separability due to the simple recourse structure allows to study a one-dimensional version instead. Based on an explicit formula for the objective function, we derive a complete description of the class of probability density functions such that the objective function is convex. This result is also stated in terms of random variables. Next, we present a class of convex approximations of the objective function, which are obtained by perturbing the distributions of the right-hand side parameters. We derive a uniform bound on the error of the approximation. Finally, we give a representation of convex simple integer recourse problems as continuous simple recourse problems, so that they can be solved by existing special purpose algorithms. Keywords: Simple integer recourse, convexity, convex approximations 1 Introduction The simple integer recourse SIR) model with fixed technology matrix is defined as { inf cx + Qx) : Ax = b, x R n 1 } x +, 1) where the expected value function Q is Qx) := E ξ vξ T x)], with E ξ ] denoting mathematical expectation with respect to ξ, and v is the value function of the second stage problem vs) := inf y {q+ y + + q y : y + s, y s, y +,y Z m + }, s Rm. Here c,a,b,q +,q ) and T are vectors/matrices of the appropriate size, q +,q 0, q + + q > 0, and ξ is a random vector in R m. As suggested by the name, this model has the same structure as the well-known simple continuous recourse model, in which the second-stage decision variables y = y +,y ) are non-negative reals. The expected value function of the latter problem is Lipschitz continuous and convex, so that in principle i.e., disregarding possible difficulties in evaluating the integrals) it can be solved efficiently by standard techniques from mathematical programming, see 9]. Several special purpose algorithms exist, see e.g. 1, 1,, 10, 1]. The expected value function of the integer problem lacks these favorable properties in general. 1

In 7] and 13] the latter treating complete mixed-integer recourse) it is shown that this function is continuous if the distribution of ξ is continuous. In 3, 4] we discussed the construction of the convex hull of the discontinuous expected value function Q for the simple integer recourse model with discretely distributed right-hand side vector ξ. For an overview of the field of stochastic integer programming we refer to 5, 6, 15] and the website 14]. In the first part of this paper we give a complete description of the class of distributions such that the expected value function Q is convex. Following a review of relevant results in Section, this class will be presented in Section 3. Due to the simple recourse structure, Q is separable in the tender variables z i := T i x, i = 1,...,m. Thus, Q is completely characterized by the one-dimensional generic function Q, given by Qz) := q + gz) + q hz), z R, where q +,q R, with q + 0, q 0, q + + q > 0, gz) := E ξ ξ z + ], z R, hz) := E ξ ξ z ], z R, with ξ a random variable, and s + and s are shorthand notations for s ) + := max{0, s } and s ) := max{0, s }, s R, respectively. Hence, we will derive convexity results for Q by doing so for Q. Next, using the results obtained so far, we present a class of convex approximations of Q. In Section 6 we concentrate on the integer expected surplus 1 function g first, and then extend the results to the expected shortage function h and the one-dimensional expected value function Q. Moreover, using a result from 3], we show in Section 7 that up to a constant) the approximation of Q is the expected value function of a continuous simple recourse problem, with random right-hand side parameter with explicitly given discrete distribution. In Section 8 these results are extended to the n 1 -dimensional expected value function Q. Finally, in Section 9, we summarize our results and put them in a wider perspective. Before starting our detailed exposition on the simple integer recourse model, we mention that the relevance of our results extends beyond this particular model. The results of Sections 5 to 8 on convex approximations are at the basis of a similar approach for the general class of complete integer recourse models 19]. Although the latter class of models is much larger, the results presented in this paper do not follow as a special case, because in the complete recourse model costs are assigned to one-sided deviations only, whereas in the SIR model both shortages as well as surpluses are penalized. Consequently, the results below on convex approximations of the SIR expected value function can be obtained from 19] only for the special cases with either q + = 0 or q = 0. The extension to the general case is non-trivial. Recently, the same approach has yielded first results for mixed-integer recourse problems 0]. Thus, it appears that the results reported in this paper provide the foundation for an original approach to solving mixed-)integer recourse problems. Preliminaries In this section we first review some structural properties of the expected value function of the simple integer recourse problem. Subsequently we present some preliminary lemmas..1 Review Some of the results in this subsection are quoted from 7] where the reader is referred to for proofs. 1 Of ξ over z.

Let F and ˆF be respectively the right- and left-continuous cumulative distribution function cdf) of the random variable ξ, i.e., Fs) := Pr{ξ s} and ˆFs) := Pr{ξ < s}. Obviously, if ξ has a probability density function pdf) then F ˆF.) Then cf. 7]) gz) = hz) = 1 Fz + k)) = Pr{ξ > z + k} ) ˆFz k) = Pr{ξ < z k}. 3) The functions g and h are related by an elementary transformation. Lemma.1 Let ξ be a random variable. Define ζ = ξ. Then hz) = g ζ z), z R, where g ζ z) := E ζ ζ z + ]. The random variable ζ has cdf F ζ s) = 1 ˆF s). If ξ has a pdf f then ζ has a pdf f ζ s) = f s). PROOF. Since s = s +, s R, it follows that hz) = E ξ ξ z ] = E ξ ξ + z + ] = E ζ ζ + z + ] = g ζ z), z R. The relations between the cdf and pdf) of the random variables ξ and ζ are trivial. ξ. Finiteness of the various functions is directly related to finiteness of the first moment of Lemma. For all z R gz) < µ + = E ξ ξ) + ] < ; hz) < µ = E ξ ξ) ] < ; Qz) < < µ = µ + µ <. From now on we assume that µ is finite. Lemma.3 The functions g, h, and Q are continuous on R if and only if the random variable ξ is continuously distributed.. Right derivatives In studying convexity of the functions g, h, and Q we will use a necessary and sufficient condition for convexity of a function in terms of its right derivative: a function ϕ is convex on the interval a,b] if and only if its right derivative ϕ + is non-decreasing on a,b). As we will see, existence of the right derivatives of g, h, and Q depends on the total variation of the density f of ξ. Although we only need finiteness of the series involved we will prove sharp lower and upper bounds. The reason is that these bounds are essential for obtaining an error bound for the convex approximations that we have in mind. For ϕ a real-valued function on a non-empty subset I of R, we denote the total increase on I by + ϕi), the total decrease by ϕi), and the total variation by ϕi). For all I R, ϕi) = + ϕi) + ϕi). For convenience we use the shorthand notations + ϕ, ϕ, and ϕ for the case I = R. A real function ϕ on R is of bounded variation if ϕ < +. We establish a result for probability density functions of bounded variation. It is based on the following lemma, which is proved in Appendix A.1. 3

0.06 0.04 0.0 0 5 0 5 10 15 Figure.1: Illustration of the upper bound 6) in Lemma.5 with z = 5. Each term f z + k), k = 0, 1,..., is represented by a rectangle of size f z + k) 1. The total area of the rectangles is divided in two: the unshaded area is less than or equal to 1 Fz), and the shaded area is less than or equal to fz, )). Lemma.4 Let the real functions ϕ i on R + be nonnegative, nonincreasing and integrable, i = 1,. Then the function ϕ := ϕ 1 ϕ is of bounded variation on 0, ). Moreover, < + ϕ0, )) ϕk) ϕs) ds ϕ0, )) < 4) and < ϕ0, )) ϕk) k=1 0 0 ϕs) ds + ϕ0, )) <. 5) In the following lemma we apply these results to a class of probability density functions on R that are of bounded variation. Lemma.5 Suppose that for i = 1, the functions f i : R 0, 1] satisfy the following conditions: i) f i is nondecreasing, f i ) = 0, f i + ) = 1, i = 1,. ii) f 1 s) f s) for all s R, and {s R : f 1 s) > f s)} has a positive Lebesgue measure. iii) 0 f 1s) ds < and 0 1 f s)) ds <. Then c := f 1s) f s)) ds 0, ), and the function f : R R + defined by fs) = 1 f1 s) f s) ), s R, c is a pdf. Moreover, f has a right-continuous version f + and a left-continuous version f, given by f + s) := limf t), t s s R, f s) := limf t), t s s R, having the same cumulative distribution function as f. Denoting this cdf by F, we have for all z R 1 Fz 1) f z 1, )) fz + k) 1 Fz) + f z, )) 6) 4

and Fz) + f,z]) f z k) Fz 1) + + f,z 1]). 7) k=1 The pdf f is of bounded variation, so that the following uniform bounds hold, too: 1 Fz 1) f f z + k) 1 Fz) + f Fz) f k=1 8) f z k) Fz 1) + f. 9) Remark.1 The conditions on f 1 and f are not very strong, if one wants to describe pdfs that are of bounded variation. Indeed, any non-constant function f of bounded variation that converges to 0 at the tails, can be written as the difference f 1 f of nondecreasing functions f 1 and f with a := f 1 ) = f ) and b := f 1 ) = f ), < a < b <. Then fs) = f 1 s) f s) = b a) f 1 s) f s) ), s R, where f i s) = f i s) a)/b a), i = 1,, satisfy i). In order to assure that f is a pdf, obviously the condition ii) is needed together with b a = 1/c. Hence, the only real assumption is iii), describing that f 1 and f converge fast enough at their tails. PROOF. Obviously, c > 0 because of ii). Moreover, c < because of i) and iii): f1 s) f s) ) ds = 0 0 f1 s) f s) ) ds + f1 s) f s) ) ds f 1 s) ds + 0 0 1 f s) ) ds <. Therefore, f is a pdf. Moreover, since f 1 and f are monotonous, they have all limits from left and right, hence so does f. Since f 1 and f are nondecreasing and bounded, they can only have a countable number of discontinuities, so that f s) = f s) = f + s) for all s R except for a set of measure 0. The upper bound in 6) follows from the upper bound in 4) by taking, for s R +, ϕs) = f z + s), ϕ 1 s) = 1 f z + s) and ϕ s) = 1 f 1z + s). c c Both ϕ 1 and ϕ are nonnegative, nonincreasing and integrable on R + since 0 z 1 f 1 s)) ds z 1 f s)) ds <. The lower bound in 6) follows from the lower bound in 5) by taking ϕs) = f z 1 + s), ϕ 1 s) = 1 f z 1 + s) c using z 1 1 f 1s)) ds in 7) follow from 4) and 5), by taking, for s R +, and and ϕ s) = 1 f 1z 1 + s), c z 1 1 f s)) ds <. Similarly, the upper and lower bound ϕs) = f z 1 s), ϕ 1 s) = 1 f z 1 s) c and ϕ s) = 1 f 1z 1 s), c ϕs) = f z s), ϕ 1 s) = 1 f z s) and ϕ s) = 1 f 1z s), c c respectively. Finally, 8) and 9) follow from 6) and 7) by observing that, for all z R, f z, )) f and + f,z]) + f. Indeed, since + f f = f ) f ) = 0 0 = 0 and + f + f = f, we have that f = + f = f)/. 5

See Figure.1 for an illustration of the upper bounds 6) c.q. 8). With Lemma.5 in mind, let us define the following class of probability density functions. Definition.1 Let F be the class of probability density functions on R with finite mean value, that can be written as the difference of functions f 1 and f satisfying the conditions i) iii) of Lemma.5. It is easy to verify that F is a convex set. As said before, from a practical point of view F contains all pdfs with a bounded variation. For instance, if f is a pdf with s fs) ds < ) such that R can be partitioned in a finite number of intervals on each of which f is either nondecreasing or nonincreasing, f must be in F. The final lemma of this subsection gives formulae for the right derivatives of the functions g, h, and Q = q + g + q h. They follow directly from ) and 3) by interchanging differentiation and summation. This interchange is allowed since the sums converge uniformly in z see 7] for more details). Lemma.6 Let ξ be a random variable with pdf f F. Then the right derivative Q + exists and is finite everywhere, and is given by Q + z) = q+ f + z + k) + q f + z k), z R, where f + is the right-continuous version of f given in Lemma.5. Obviously, corresponding results for the constituting functions g and h are obtained by appropriate choices of q + and q. 3 Convexity properties We will give a complete characterization of the pdfs in F such that the functions g, h, and Q are convex. There is no loss in assuming a continuous distribution, since Lemma.3 shows that if ξ is discretely distributed, then these functions are finite and discontinuous and hence non-convex. Also for continuous distributions of ξ convexity is an exception rather than the rule. However, for any distribution of ξ, the restriction of g, h, and Q to translates of Z is convex. Lemma 3.1 For every α 0, 1), the restriction of g, h, and Q to {α + Z} is convex. PROOF. Using ) we have gα + n + 1) gα + n) = Fα + n) 1 n Z. The result follows from the fact that the cdf F is non-decreasing. Using 3) similarly, the same result holds for h and Q. Next we formally define the set C F of probability density functions such that the corresponding expected value function Q and hence both g and h) is convex. Due to Lemma.6 we have Definition 3.1 Let C denote the set of probability density functions in F such that the corresponding expected value function Q is convex, i.e., { } f + z + k) is a nonincreasing function of z, and C = f F :. f + z k) is a nondecreasing function of z The subset containing all right-continuous versions of elements of C is denoted by C +. 6

Given that F is convex, it is obvious that C and C + are convex sets. The following lemma gives a property shared by all elements of C. Lemma 3. If f C then k= f + z + k) = 1 for all z R. PROOF. By definition, f C if and only if Q is convex when the random variable ξ has pdf f. It holds Qz) Qz) Qz) + max{q +,q } for all z R, where the convex function Q is the one-dimensional expected value function of the continuous relaxation of 1), i.e., Qz) := q + E ξ ξ z) + ] + q E ξ ξ z) ], z R see 16]). Hence, if Q is convex, it has asymptotes at and + with the same slopes as the asymptotes of Q, which are q + and q, respectively. Therefore, in this case Q + is nondecreasing from lim z Q + z) = q+ to lim z Q + z) = q. Using Lemma.6 we have, for z R and n Z, Q + z + n) = q+ f + z + k) + q = k= k=n Since f F, it follows see 8) and 9)) that Sz) = k= f + z + k) n k= is finite for all z R. Therefore we have, for all n Z, k= f + z + k) q+ 1 {k n} + q 1 {k n} ) f+ z + k). 10) q+ 1 {k n} + q 1 {k n} f+ z + k) q + + q )Sz) <, so that by taking n and n in 10) we obtain, using Lebesgue s dominated convergence theorem, that q + = q + Sz) and q = q Sz), respectively, implying Sz) = 1. Note that the condition k= f + z + k) = 1 for all z R is necessary but not sufficient for f C. A counter example is given by the pdf f such that { s, s 1, 1), f + s) = 0, otherwise. It is easily seen that f F and k= f + z + k) = 1 for all z R. However, f C since f + z + k) is strictly increasing on 0, 1). As said before, convexity of the functions g, h, and Q is an exception rather than the rule. Therefore, C is a small subset of F. In the following we first define a subset of C, using a characterization that is attractive from a computational point of view. Next, we show that this subset actually contains all of C. Definition 3. Let C 0 be the set of functions f on R, that can be written as fs) = Gs + 1) Gs), s R, where G is an arbitrary cdf on R with a finite mean value µ G. The following lemma shows that C 0 is a subset of C. 7

Lemma 3.3 Let f be generated by a cdf G with mean value µ G as described in Definition 3.. Then f is continuous from the right, and the following is true. i) For s R f s + k) = 1 Gs), In particular, f C + C. f s k) = Gs), k=1 and f s + k) = 1. k= ii) If F denotes the cdf of f, and η a random variable distributed according to G, we have for all z R Fz) = z+1 z Gs) ds ) ) 1 Fz + k) = 1 Gs) ds = Eη η z) + ] Fz k) = k=1 z z Gs) ds = E η η z) ]. iii) If it exists, the nth moment of f, denoted by ν n, is equal to PROOF. ν n = 1 n + 1 n ) n + 1 1) k n k τ k, where τ k is the kth moment of G. In particular, the mean value µ f of f equals µ G 1, and its variance σ f is equal to σ G + 1 1, where σ G is the variance of G possibly infinite). i) Since f 1 s) := Gs + 1) and f s) := Gs) satisfy all conditions in Lemma.5, we conclude that f F. Indeed, 0 f 1 s) ds = = 1,1] Gs) ds = 1 {,s] { } 1 ds dgt) = t } dgt) ds,1] 1 t) dgt) < since µ G is finite. Similarly 0 1 f s)) ds <. The corresponding value of c is equal to 1, since Moreover, ) { Gs + 1) Gs) ds = N f s + k) = = { t s,s+1] t 1 } dgt) ds } ds dgt) = 1. N ) Gs + k + 1) Gs + k) = Gs + N + 1) Gs), 8

so that f s + k) = 1 Gs). In a similar way we get k=1 f s k) = Gs), so that k= fs + k) = 1 for all s R. Since f = f + because G is a cdf) it follows from the definition of C + that f C + indeed. ii) It is not difficult to see that Fz) = z+1 z Gs) ds for all z R, so that Fz k) = k=1 k=1 z k+1 z k Gs) ds = z Gs) ds = E η η z) ], and similarly 1 Fz + k) ) = Eη η z) + ] for all z R. iii) If it exists, the nth moment of f is given by ν n = = = = 1 n + 1 s n fs) ds = { t s n ds t 1 n 1 n + 1 n n + 1 k s n { } dgt) = s,s+1] } dgt) ds 1 n + 1 ) n + 1 1) k n k t k dgt) ) 1) n k τ k, t n+1 t 1) n+1) dgt) follow by straight- where τ k is the kth moment of G. The expressions for µ f and σf forward computation. Theorem 3.1 C + = C 0. PROOF. We already established that C 0 C +. To prove the reverse inclusion, suppose that f C +. Then we have to show that it can be represented as f s) = Gs+1) Gs), s R, where G is some cdf with finite mean value µ G. We will show that Ḡs) := k=1 f s k), s R, meets the requirements. Since f is continuous from the right, Ḡs) = fs k) = f + s k) = f + s 1 k), s R, k=1 k=1 so that f C implies that Ḡ is a nondecreasing function on R. Lemma 3. shows that n lim Ḡs) = lim Ḡn + 1) = lim f k) = s n n lim Ḡs) = lim s n n k= Moreover, for all z R we have limḡs) = lim s z f s k) = s z k=1 f k) = 0. k=1 k= limf s k) = s z k= f k) = 1 fz k) = Ḡz), k=1 9

where the inner equality is based on Lebesgue s dominated convergence theorem using k=1 f s k) 1 s). We conclude that Ḡ is a cdf indeed. Furthermore, it is easy to see that f s) = Ḡs + 1) Ḡs), s R. It remains to show that µḡ is finite. This follows from { t µḡ 1/ = t 1/) dḡt) = = = which is finite since f F. { } s dḡt) ds = s,s+1] sf s) ds, s ds t 1 } dḡt) s Ḡs + 1) Ḡs) ) ds Corollary 3.1 f C if and only if Gs) = k=1 f + s k), s R, is a cdf with finite mean value. PROOF. Without loss of generality we assume that f = f +, since f C if and only if f + C. The result now follows trivially from Theorem 3.1 and its proof. We conclude from Corollary 3.1 that, disregarding f F, the function Q is convex if and only if the random variable ξ has a pdf f such that k=1 f + s k) is a cdf with finite mean value. Moreover, using Definition 3. and Theorem 3.1, we can approximate any given distribution including discrete or mixed distributions) by a continuous distribution with a pdf f C. Accordingly, we obtain a convex approximation of the expected value function Q, which we recall is non-convex in general. Because of Lemma 3.3 iii) an interesting choice is to let f be generated by Gs) = Fs 1/), where F is the cdf of the given distribution, since f s) = Fs + 1/) Fs 1/), s R, has the same mean value as F and a variance that is increased by 1/1). We proceed by giving a few examples of cdfs F and corresponding pdfs f C. Example 3.1 Let F be a cdf and define fs) := Fs + 1/) Fs 1/), s R. a) If F is the cdf of the degenerated distribution in 0 then { 1, s 1/, 1/), fs) = 0, otherwise, i.e., f is the right-continuous pdf of the uniform distribution on 1/, 1/] notation U 1/, 1/)). b) If F is the cdf of a discrete distribution on Z with Pr F {k} = p k 0, k Z p k = 1, such that < k Z p kk <, then fs) = p s+1/, i.e., f is the right-continuous pdf of a distribution which is uniform with weight p k ) on every interval k 1/,k + 1/), k Z. c) If F is the cdf of the distribution U0, 1) then s + 1/, s 1/, 1/), fs) = 3/ s, s 1/, 3/), 0, otherwise, i.e., f is the right-continuous pdf of the triangular distribution on 1/, 3/]. 10

0.4 0.3 0. 0.1 0 0 5 10.5 1.5 1 0.5 0 1 0.5 0 0.5 1.5 1.5 1 0.5 0 1 0 1 3 0.4 0.3 0. 0.1 0 4 0 4 Figure 3.1: Illustrations to Example 3.1. Mass points are depicted by, the pdf of F by a dashed curve, and the pdf f by a solid curve. Top left: Part b) for a discrete distribution on {1, 3, 5, 7, 9} with probabilities 1/15, 5/15, 3/15, 4/15, /15, respectively. Top right: Part d). Bottom left: Part e) with λ =. Bottom right: Part f) with µ = 0 and σ = 1. d) If F is the cdf of the distribution U0, 1/) then s + 1, s 1/, 0), 1, s 0, 1/), fs) = s, s 1/, 1), 0, otherwise. e) If F is the cdf of the exponential distribution with parameter λ then 1 e λs+1/), s 1/, 1/), fs) = e λ/ e λ/ )e λs, s 1/, ), 0, otherwise. f) If F is the cdf of the normal distribution with mean value µ and variance σ then fs) = 1 s+1/ e t µ) σ dt, s R. πσ s 1/ See Figure 3.1 for illustrations of b), d), e), and f), and Figure 3. for convex approximations of the function Q corresponding to b) and f). The description of C is constructive in the sense that given a cdf G it is straightforward to construct the corresponding pdf f C. On the other hand, for a given pdf f it is in general not easy to determine if it belongs to C, i.e., to decide if k=1 f + s k), s R, is a cdf with finite mean value. The following two results settle this question for a restricted class of probability density functions. Corollary 3. Let the pdf f have support a,a + 1], a R. Then f C if and only if f is a pdf of the uniform distribution on this interval. 11

5 1 11 10 9 8 4.5 4 3.5 3 7 6 5 4 3 0 5 10.5 1.5 1 0.5 3 1 0 1 Figure 3.: Approximations of the function Q q + = 1, q = ) by replacing the distribution F by the distribution f s) = Fs +1/) Fs 1/), s R. The function value in discontinuity points is depicted by, the function Q by a dashed curve, and the approximation by a solid curve. Left: F is the discrete distribution on {1, 3, 5, 7, 9} with probabilities 1/15, 5/15, 3/15, 4/15, /15, respectively. Right: F is the normal distribution with mean value 0 and variance 0.05. PROOF. It follows from Theorem 3.1 and Definition 3. that f C has support b 1,c] if and only if the cdf G, such that f s) = Gs + 1) Gs), has support b,c]. Hence, f has support a, a + 1] if and only if G is degenerated at a + 1. A trivial calculation learns that in this case f is the right-continuous pdf of the uniform distribution on a,a + 1]. Corollary 3.3 Let f be a pdf of the uniform distribution with support a,b], a,b R. Then f C if and only if b a Z +. PROOF. Consider the right-continuous version of f given by f + s) = b a) 1 on a,b), and f + s) = 0 otherwise. By Corollary 3.1, the pdf f is in C if and only if Gs) := k=1 f + s k), s R, is a cdf with finite mean value. It holds Gs) = b a) 1 Ns), s R, where Ns) = {k Z + \{0} : s k a,b)}. It remains to verify under which conditions on a and b the function G is a cdf. Suppose b a = n + ε with n Z + and 0 < ε < 1. Then Na + 1 + n) = n + 1 so that Ga + 1+n) = n+1)/n+ε) > 1. Obviously, G is not a cdf in this case. It remains to consider b a = n with n Z + \ {0}. Then we get Gs) = 0, s,a + 1), k/n, s a + k,a + k + 1), k = 1,...,n 1, 1, s a + n, ), and this is a cdf with obviously a finite mean value), indeed. Finally, we give precise conditions such that a weighted sum possibly non-denumerable) of elements of C belongs to C itself. Lemma 3.4 Let, for any t R, f t be a pdf in C with first absolute moment µt). If is a cdf on R such that µt) d t) <, then the function f, defined by fs) := is a pdf in C too. f t s) d t), s R, 1

PROOF. We will show that f is generated by some cdf G with finite mean value as in Definition 3., so that f is a pdf in C by Theorem 3.1. For any t R, there exists a cdf G t which generates the pdf f t since f t C. Define Gs) := G t s) d t), s R. It is easy to see that G) = 0, G ) = 1 and that G is nondecreasing. It is right-continuous since, for s R, limgu) = lim u s u s G t u) d t) = limg t u) d t) = Gs), u s where the second equality holds by Lebesgue s dominated convergence theorem using that G t 1). Hence G is a cdf. Moreover, following the proof of Theorem 3.1, G has a finite mean value since f does: { } s fs) ds = s f t s) ds d t) = µt) d t) <. Finally, for s R, fs) = f t s) d t) = Gt s + 1) G t s) ) d t) = Gs + 1) Gs), so that f C. 3.1 Reformulation in terms of random variables Here we reformulate some of the results of the preceding section in terms of random variables. Note that the second part of the following lemma was already stated and proved in Lemma 3.3. However, the proof is more straightforward in this setting, and in particular it gives insight in the constant difference between the variances of a pdf f C and its generator cdf G. Lemma 3.5 Let G be an arbitrary cdf with finite mean value, and let η be a random variable distributed according to G. Let ξ be another random variable. Assume that, for each s R, the conditional distribution of ξ given η = s is the uniform distribution on s 1, s]. i) Then ξ is continuously distributed, and its pdf f is generated by G in the sense of Definition 3.. In particular, f C. ii) Denote the mean value and variance of ξ and η by µ f, σ f and µ G, σ G, respectively. PROOF. Then µ f = µ G 1/ and σ f = σ G + 1/1. i) For z R it holds Pr{ξ z} = Pr{ξ z η = s} dgs). The assumption implies Pr{ξ z} = dgs) + z + 1 s) dgs),z] = Gz) + = Gz) + = Gz) + = z+1 z z,z+1] z+1 z z+1 z Gt) dt. z,z+1] { } z+1 dt dgs) { s z,t] } dgs) dt Gt) Gz) ) dt 13

With reference to Lemma 3.3, this completes the proof. ii) Denoting by E ] the expectation with respect to the distribution with cdf, we have and µ f = E G EF G ξ η] ] = E G η 1/] = µ G 1/ σ f = var G E F G ξ η]) + E G varξ η] = varη 1/) + 1/1 = σ G + 1/1. This result implies the following alternative description of the set C: Corollary 3.4 Let ξ be a random variable with pdf f. Then f C if and only if there exists a random variable η with finite mean value, such that for all s R the conditional distribution of ξ given η = s is uniform on s 1,s]. It is not difficult to see sufficiency of this condition. For example, convexity of the function gz) = E ξ ξ z + ], z R, then follows from E ξ ξ z + ] = = z t z ft) dt { z } t z 1 s 1,s] t) dt dgs), where G denotes the cdf of η, since the inner integral is already a convex function of z by Corollary 3.. On the other hand, since the condition is also necessary, we see that convexity of the functions g, h, and Q depends ultimately on the requirement that any conditional distribution of ξ be uniform on some interval of length 1. In this sense, the pdf of the uniform distribution with unit support is the nucleus of the set C. 4 Continuous simple recourse representation of Q Next we consider the representation of convex instances of Q as the one-dimensional expected value function Q of a continuous simple recourse problem. The latter function can be written as Qz) := q + E ξ ξ z) + ] + q E ξ ξ z) ] = q + ) z 1 Ft) dt q z Ft) dt, where F is the cdf of the random variable ξ, see e.g. 1] or 1]. Such a representation exists by Theorem 3.1 in 3]. Theorem 4.1 Let ξ have a pdf f C and cdf F, and let G denote the cdf that generates f according to Definition 3.. Then Qz) = q + E ψg ψg z) +] + q E ψg ψg z) ] + q+ q q + + q, z R, 11) where ψ G is a random variable with cdf W G s) = q + q + + q Gs) + q q + + q Gs + 1), s R. 1) 14

PROOF. For z R we have, see ) and 3), Qz) = q + ) 1 Fz + k) q Fz k) = q + ) z 1 Gt) dt q z where we used that Fs) = s+1 s Gt) dt by Lemma 3.3. It follows that Q + z) = q+ 1 Gz) ) + q Gz + 1), Gt + 1) dt, 13) so that the cdf W G follows trivially from Theorem 3.1 in 3] which states that W G s) = Q + s) + q+ )/q + + q ), s R. The constant in 11) is equal to c = Qz) q + E ψg ψg z) +] + q E ψg ψg z) ]) for an arbitrary value of z R. Substitution of 13) gives c = q + ) z 1 Gt) dt + q z Gt + 1) dt q + 1 WG t) ) z dt q z = q + WG t) Gt) ) z dt + q z W G t) dt Gt + 1) WG t) ) dt. 14) Using 1) to substitute for W G, we find that the integrands of the first and the second term in 14) are equal to q ft)/q + + q ) and q + ft)/q + + q ), respectively. The result now follows. Example 4.1 In the setting of Theorem 4.1, let ξ be uniformly distributed on 0, 1], and q + = 1, q =. Then the cdf G is degenerated at 1, so that Pr{ψ G = 0} = /3 and Pr{ψ G = 1} = 1/3; the constant in 11) equals /3. We conclude that the function Q is equal up to a constant) to the one-dimensional expected value function of a continuous simple recourse problem. In addition to its intrinsic theoretical value, this result is of practical use if the cdf G that generates the pdf f is known actually, if we know G i for all m terms of the n 1 -dimensional expected value function Q). In that case, we know all data of the continuous simple recourse model that is equivalent to our integer simple recourse model, and hence may solve the problem using existing special purpose software. The remainder of this paper is devoted to a class of approximations of the underlying distribution, and hence of Q) for which it is possible to follow this approach. 5 Definition of α-approximations In Section 3 we defined the class C of probability density functions that correspond to convex expected value functions Q of simple integer recourse problems with fixed T matrix and random right-hand side parameters. If an arbitrary cdf F is approximated by a distribution in C, the corresponding one-dimensional expected value function will be a convex approximation of Q. We will study here a subclass of C for approximating a cdf F, consisting of what we call α-approximations. The definition of α-approximations below is independent of the exposition in the previous section. In Corollary 5.1 below it is indicated by which distribution the α-approximation is generated in the sense of the previous section. 15

Definition 5.1 For α 0, 1) and F a cdf of a random variable, the α-approximation F α of F is defined as the piecewise linear function generated by the restriction of F to the lattice α + Z. That is, for any s α + Z, F α s) := F s) + s s) F s + 1) F s) ), s s, s + 1]. Since F α is continuous and non-decreasing with lim s F α s) = 0 and lim s F α s) = 1, it is a cdf itself. If ξ is a random variable with cdf F, any random variable ξ α with cdf F α will be called an α-approximation of ξ. That is, the distribution of any α-approximation ξ α is characterized by the following two properties. For all s α+z, a) Pr{ξ α s, s + 1]} = Pr{ξ s, s + 1]} b) Given ξ α s, s + 1], its conditional distribution is uniform. Notice that ξ α has a pdf even if ξ does not. Remark 5.1 In Definition 5.1 the α-approximation is based on the right continuous cdf F. An alternative definition of the α-approximation is based on the left continuous version ˆF of F, in the following way. For any s α + Z, ) ˆF α s) := ˆF s) + s s) ˆF s + 1) ˆF s), s s, s + 1]. Of course, if Pr{ξ α + Z} = 0, both definitions lead to the same result. In particular this is true if ξ is continuously distributed. However, if Pr{ξ α + Z} > 0, the alternative definition gives a distribution that is different from the one in Definition 5.1. For the calculus with α-approximations we need the following generalization of the concepts of integer round down and integer round up. Definition 5. For s R and α 0, 1), the round down and the round up of s with respect to α + Z are defined as, respectively, s α := max{α + k : α + k s, k Z} s α := min{α + k : α + k s, k Z}, i.e., s α = s α + α and s α = s α + α. Notice that for all s R, α 0, 1), we have that s α s s α. In particular, if s α + Z then s α = s = s α, and if s α + Z then s α < s < s α = s α + 1. Obviously, the usual integer round down and round up correspond to the case α = 0. The next lemma gives formulae for the α-approximation F α of F and its pdf f α that will be used frequently. Lemma 5.1 Let F be the cdf of a random variable. For α 0, 1), its α-approximation is the cdf F α given by F α s) = F s α ) + s s α ) F s α ) F s α ) ), s R. The right-continuous version f α of the pdf of F α is given by the piecewise constant function f α s) = F s α + 1) F s α ), s R, 15) which is constant on every unit interval α + k,α + k + 1), k Z. In particular, f α s) = F s α ) F s α ) = dft), s α + Z, where S α s) = s α, s α ]. S α s) PROOF. The formula for F α is a reformulation of its definition for s α+z, but obviously it is also true for s α + Z. The formula for f α follows from the one for F α since f α is the right derivative of F α. 16

3 1 0 1 3 1 0.5 0.8 0.4 0.6 0.3 0.4 0. 0. 0.1 0 0 3 1 0 1 3 Figure 5.1: Let ξ N0, 1/) and α = 1/4. Left: cdfs F dashed) and F α. Right: pdfs f dashed) and f α. See Figure 5.1 for examples of α-approximations F α and f α. We proceed by showing that f α C if it has a finite mean value. In this context, it is relevant to consider the distribution of the random variables ξ α := ξ α + α and ξ α := ξ α + α for a fixed α 0, 1), where ξ is a random variable with an arbitrary cdf F. Both ξ α and ξ α have values in α + Z, and Pr{ ξ α < s} = Pr{ξ < s α } = ˆF s α ), s R, Pr{ ξ α s} = Pr{ξ s α } = F s α ), s R, where we used that x α < y x < y α and x α y x y α for all x,y R. Corollary 5.1 Let F be a cdf with finite mean value. Then f α C for all α 0, 1). In fact, f α is generated by the cdf G α s) = F s α ), s R of ξ α, where ξ is a random variable with cdf F. PROOF. Immediate from 15) and Lemma 3.3. Obviously, G α has finite mean value since F has. Remark 5.1 continued. It can be verified that the alternative definition of α approximations, based on the left-continuous version ˆF of F, leads to the pdf generated by the distribution of ξ α + 1. In the sequel we restrict ourselves to the case that ξ is continuously distributed. There are two reasons to accept this loss of generality. In the first place, for the case that ξ follows a discrete distribution we can resort to the convex hull of the function Q, see 3, 4]. Because of the favorable properties of the convex hull, there is no need to investigate other approximations for this class. Secondly, the analysis of the general case is more complicated, since it appears that for the α-approximation of the expected shortage function h the alternative definition in Remark 5.1 is appropriate, whereas Definition 5.1 is suitable in case of the expected surplus function g. Consequently, for discretely distributed ξ it is quite possible that different α-approximations are needed in the two constituents of the function Q. So far we have only been able to prove interesting results for continuously distributed ξ. 6 Analysis of α-approximations We first study α-approximations of the expected surplus function g. We derive a bound on the error of the approximations that is uniform in α 0, 1). In Subsection 6. we consider convex combinations of α-approximations. Subsection 6.3 contains the corresponding results for the expected shortage function h. In Subsection 6.4 we first combine the preceding 17

results to obtain convex approximations Q α for the one-dimensional expected value function Q, and then discuss interpretations of the above-mentioned uniform error bound. In Section 7 we consider the representation of Q α as the one-dimensional expected value function of a continuous simple recourse problem. In particular, it will be shown that the distribution of the random variable appearing in the latter problem can be calculated directly. Subsequently, in Section 8 we extend the results to the n 1 -dimensional expected value function Q, and discuss the relation between the optimal values of a simple integer recourse problem and its α-approximations. 6.1 The expected surplus function In the following lemma we define α-approximations of the expected surplus function g. Lemma 6.1 Let ξ be a random variable with E ξ ξ] R. For all α 0, 1), let ξ α be an α-approximation of ξ. For each α 0, 1) define g α z) := E ξα ξα z +], z R. Then g α z) = 1 F α z + k)), z R. For all α 0, 1) it is a convex function. The function g α is completely determined by the expected surplus function gz) = E ξ ξ z + ] in the following way. It is the piecewise linear function generated by the restriction of g to α + Z. That is, g α z) = g z α ) + z z α ) g z α ) g z α ) ) = g z α ) + z z α ) F z α ) 1 ), z R. In particular, g α z) = gz) for z α + Z. PROOF. Since f α, the pdf of ξ α, is in C by Corollary 5.1, we know that g α is a convex function satisfying g α z) = 1 F α z + k)), z R. Similarly, gz) = 1 Fz + k)). Restricting the attention to z α + Z, we see that z + k α + Z for all k Z, so that F α z + k) = Fz + k) for all k Z. Consequently, g α coincides with g on α + Z. Consider next the case z z, z + 1] for any z α + Z. Then for each k Z we have z + k z + k, z + k + 1] with z + k and z + k + 1 two neighboring points in α + Z. Hence, 1 F α z + k) is affine-)linear in z on z, z + 1] for each k Z, and so is g α z) = 1 F α z + k)). Consequently, for z α + Z, g α z) = g α z) + z z) g α z + 1) g α z) ) = g z) + z z) g z + 1) g z) ), z z, z + 1]. Since, for z R, z α z z α, z α α+z, and z α = z α +1 unless z z α = 0, this implies g α z) = g z α ) + z z α ) g z α ) g z α ) ) = g z α ) + z z α ) F z α ) 1 ), z R, where the last equality follows directly from ). Remark 6.1 Instead of using Corollary 5.1 to prove that g α is convex, this can also be seen directly from the relation between the functions g α and g. Convexity follows immediately from the fact that the cdf F is non-decreasing. Alternatively, since g α is piecewise linear and coincides with g on the set α + Z, it is convex by Lemma 3.1. In the following we assume that the random variable ξ in the definition of the expected surplus function g is continuously distributed with pdf f F. Under this assumption we will derive a bound, independent of α, on the uniform distance between g α and g: g α g = sup z R g α z) gz). The supremum norm is the appropriate norm here since it will later provide directly a bound on the difference between the minimum value of Q and that of its α-approximation Q α see Section 8). 18

Remark 6. Independent of the distribution type of ξ we have the trivial error bound g α g 1. This bound follows directly from the monotonicity of g α and g and the observation that, for all z R, both g α z) and gz) are bounded by g z α ) and g z α + 1). Hence g α z) gz) g z α ) g z α + 1) = 1 F z α ) 1 z R, where F is the cdf of ξ. Theorem 6.1 Assume that ξ has a pdf f F. Then, for all z R and all α 0, 1), so that g α z) gz) min { z z α, z α z } f, g α g f 4, where f denotes the total variation of f on R. In particular, if the pdf f is unimodal then g α g f ν)/ for all α 0, 1), where ν is the mode of the distribution. The rather technical proof of Theorem 6.1 is presented in Appendix A.. In many cases, the error bound given in Theorem 6.1 is sharper than the trivial bound g α g 1. Only if f > 4, which for example for normal distributions corresponds to a variance less than 0.04, this is not the case. In Section 6.4 where we present similar results for the function Q, we will discuss interpretations of the error bound. 6. Convex combinations of α-approximations For each α 0, 1) we defined the α-approximation F α of the cdf F of ξ, and derived a uniform upper bound on the difference of the corresponding expected surplus functions g α and g in case F has a pdf f. Moreover, we showed that F α has a pdf f α, and if F has a finite mean value then f α C. We now consider convex combinations of α- approximations n i=1 λ i F αi, with λ i > 0, n i=1 λ i = 1, α i 0, 1). Due to Lemma 3.4, such distributions yield convex expected surplus functions too. We will show that by taking convex combinations the uniform upper bound can be reduced by a factor. First we consider a special case. For any α,β 0, 1), we define f αβ s) = f αs) + f β s), s R, where f γ, γ {α,β}, is the right-continuous version of the pdf of the γ -approximation of F. To facilitate the exposition below, we introduce the following notation. For α 0, 1), define m α z) := min { z z α, z α z }, z R. The function m α is depicted in Figure 6.1. First of all, we note that m α z) 1/ for all z R and all α 0, 1). Moreover, for all z R m α z) + m α z + 1/) = 1/ α 0, 1). 16) This relation is easy to prove for the special case α = 0, and the general case then follows from m α z) = m 0 z α), z R, which is easily derived using Definition 5.. Finally, again using Definition 5. we see that for all α 0, 1/) it holds z + 1/ α = z 1/ α + α + 1 = z α+ 1 + 1/ z R. 19

1/ α β α+1 β+1 0 Figure 6.1: The functions m α and m β dashed) with β α = 1/. It follows that if α β = 1/ then m α z + 1/) = m β z) z R. 17) Corollary 6.1 Let F be a cdf with finite mean value. For α,β 0, 1), let ξ αβ be any random variable with pdf f αβ. Then the corresponding expected surplus function g αβ, defined by g αβ z) = E ξαβ ξαβ z +], z R, is piecewise linear and convex. Moreover, if F has a pdf f with f F and if α β = 1/, then g αβ g f 8. 18) PROOF. It is easy to see that g αβ z) = g αz) + g β z), z R, so that g αβ is piecewise linear and convex by Lemma 6.1. To prove the second statement, we use that for all z R gαβ z) gz) g αz) gz) + g β z) gz) m α z) + m β z) ) f 4 where the second inequality follows from Theorem 6.1. By our choice of α and β it follows from 16) and 17) that m α z) + m β z) 1/, which is illustrated in Figure 6.1. This completes the proof. Given Corollary 6.1, one might expect further improvement of the error bound 18) if the approximation is based on combinations of more than two piecewise constant pdfs f α, α 0, 1). However, we will prove that this is not the case. Since we wish to consider all possible convex combinations of pdfs f α, α 0, 1), it is convenient to interpret α as a random variable with support contained in 0, 1). To avoid possible confusion, we will write α to denote the random variable whereas α is a scalar in 0, 1) as before. Theorem 6. Let A denote the collection of all cdfs of distributions on the half-open interval 0, 1). Then min max m s z) d s) = 1 A z R 4. 0

PROOF. For any A m s z) d s) sup z R = sup z R = sup z R { { max m s z) d s), }} m s z + 1/) d s) { 1 sup m s z) d s) + z R 1 ms z) + m s z + 1/) ) d s) = 1 4, )} m s z + 1/) d s) where the final equality follows from 16). The result now follows, since Corollary 6.1 implies that if the random variable α has support {α,β} such that α β = 1/ and Pr{ α = α} = Pr{ α = β} = 1/ then m s z) d α s) = 1 ms z) + m s z + 1/) ) = 1 z R, 4 where α denotes the cdf of α. We conclude that, compared to the error bound given in Theorem 6.1, the error bound can be reduced by a factor if we allow convex combinations of pdfs f α. However, in that case we lose the property that the approximation coincides with g at all points in α + Z for some α 0, 1). 6.3 The expected shortage function Next we concentrate on approximations of the expected shortage functionhz) = E ξ ξ z ], z R. Recall that by Lemma.1 it holds that hz) = g ζ z) for all z, where g ζ is the function that is obtained from g if the random variable ξ is replaced by ζ = ξ. Moreover, ζ has cdf F ζ s) = 1 ˆF s), s R, where ˆF is the left continuous version of F. Since we assume that ξ is continuously distributed, we have F ζ s) = 1 F s), s R. If ξ has pdf f then ζ has pdf f ζ s) = f s), s R, with total variation f ζ = f. Consequently, all results derived for the approximations g α trivially imply corresponding results for the approximations h α z) = E ξα ξα z ], z R, α 0, 1). Instead of listing them separately, we refer to the results below for the function Q α, which on choosing q + = 0 and q = 1 apply to the function h α. Remark 6.3 Note that the indicated results for the function h are based on the fact that F α s) = Fs) = ˆFs), s α + Z, which is true by the assumption that ξ is a continuous random variable see Remark 5.1). As an example of the problems that arise if ξ is discretely distributed, we note that instead of h α z) = hz) for all z α + Z, we obtain h α z) = hz) + Pr{ξ z Z + } = lim s z hs), for all such z. 6.4 The one-dimensional expected value function By combining the results for g α and h α we obtain the corresponding results for the α- approximations Q α = q + g α + q h α, α 0, 1), of the one-dimensional expected value function Q. Theorem 6.3 Let ξ be a continuous random variable with pdf f. For all α 0, 1), let ξ α be an α-approximation of ξ. For each α 0, 1) define Q α z) := q + E ξα ξα z +] + q E ξα ξα z ], z R, where q + and q are non-negative scalars. Then 1

a) For each α, the function Q α is convex. It is related to the function Qz) = q + E ξ ξ z + ] + q E ξ ξ z ] by the equation Q α z) = Q z α ) + z z α ) Q z α ) Q z α ) ) = Q z α ) + z z α ) q + + q + F z α ) + q F z α ) ), so that Q α is the piecewise linear function that coincides with Q at all points α + k, k Z. b) Independent of the distribution type of ξ it holds Q α Q 1. Assume that the pdf f F. Then, for all z R, so that Q α z) Qz) min { z z α, z α z } q + + q ) f, Q α Q q + + q ) f 4. In particular, if the pdf f is unimodal then Q α Q q + + q ) fν), where ν is the mode of the distribution. c) Let, for α and β different numbers in 0, 1), ξ αβ be a random variable that is equal to ξ α with probability 1/ and to ξ β with the complementary probability. It has a piecewise constant right-continuous pdf f αβ s) = f α s) + f β s))/, s R. Then Q αβ z) = q + E ξαβ ξαβ z +] + q E ξαβ ξαβ z ], z R, is a piecewise linear convex function. If f F and in addition it holds α β = 1/, then Q αβ Q q + + q ) f 8. Moreover, this uniform error bound can not be reduced by using other convex combinations of pdfs of type f α. See Figure 6. for an example of the functions Q and Q α. Now it is time to reflect on the interpretation of the error bound that we have established for the case that ξ follows a continuous distribution. By Theorem 6.3 the error bound is proportional to the total variation of the pdf f of ξ. Computational experiments suggest that for many distributions the total variation of a pdf decreases as the variance of the distribution increases. For example, this correspondence holds if such a change of the distribution means that the probability mass is spread out more evenly over the perhaps wider) support. Consequently, we would expect that the approximation Q α becomes better as the variance in the distribution of ξ becomes higher. This expectation is supported by many examples, including the following.

5 4 3 1 3 1 0 1 3 Figure 6.: The functions Q dashed) and Q α in case ξ N0, 0.05), q + = 1, q = 1.5, and α = 0.5. Example 6.1 Assume that ξ Nµ,σ ) and that q + = q = 1. Then, for all z R and every α 0, 1), Q α Q q + + q ) f 4 = fµ) = 1 πσ, where we used that the pdf of the normal distribution is unimodal with mode µ. Another interpretation of the error bound comes to mind. Let G denote the collection of all finite convex functions on R. Then we may measure the degree of non-convexity of any finite function ϕ : R R by its distance dϕ G) to this set, defined as dϕ G) := inf v G ϕ v. We see that dϕ G) = 0 if a finite function ϕ is convex, whereas dϕ G) > 0 if ϕ is nonconvex. From Theorem 6.3 we conclude that the degree of non-convexity of Q is bounded by a multiple of the total variation of the pdf f : dq G) = inf Q v v G Q Q αβ q + + q ) f 8. Since this upper bound can not be reduced by using other convex combinations of densities of the type f α that are known to generate convex functions, we conjecture that this upper bound is rather sharp. This means that the degree of non-convexity of Q decreases if the total variation of the underlying pdf f decreases. 3

7 Continuous simple recourse representation of Q α Next we consider the representation of Q α as the one-dimensional expected value function of a continuous simple recourse problem, as implied by Theorem 4.1. Corollary 7.1 Let ξ be a continuous random variable with cdf F with finite mean value, and α 0, 1). Denote, as before, the α-approximation of the one-dimensional expected value function by Q α. Then Q α z) = q + E ψα ψα z) +] + q E ψα ψα z) ] + q+ q q + + q, z R, 19) where ψ α is a random variable with cdf W α s) = q + q + + q F s α) + q q + + q F s α + 1), s R. 0) That is, ψ α is a discrete random variable with support in α + Z and Pr{ψ α = α + k} = q + ) Fα + k) Fα + k 1) q + + q + q q + + q Fα + k + 1) Fα + k) ), k Z. PROOF. By Corollary 5.1, f α C since it is generated by F α ). Therefore, the equations 19) and 0) follow from Theorem 4.1, which proves these results for any f C. We conclude that the function Q α is equal up to a constant) to the one-dimensional expected value function of a continuous simple recourse problem, and that the discrete distribution of its right-hand side random variable ψ α can be computed directly from the distribution of ξ. The following corollary is an alternative formulation of the latter result in terms of random variables. It characterizes ψ α as a probabilistic round of ξ with respect to α + Z: Pr{ψ α = ξ α } = q q + + q, Pr{ψ q + α = ξ α } = q + + q. Corollary 7. Assume the setting of Corollary 7.1. Define the discrete random variable η, independent of ξ, such that Pr{η = 0} = q + q + + q, Pr{η = 1} = q Then ξ α η has the same distribution as ψ α. q + + q. PROOF. Obviously, the support of ψ α is a subset of α + Z. Defining p 0 := q + /q + + q ) and p 1 := q /q + + q ), it holds, for k Z, Pr{ ξ α η = α + k} = Pr{ ξ α = α + k η = 0} + Pr{ ξ α = α + k + 1 η = 1} = p 0 Pr{α + k 1 < ξ α + k} + p 1 Pr{α + k < ξ α + k + 1} = p 0 Fα + k) Fα + k 1) ) + p1 Fα + k + 1) Fα + k) ) = Pr{ψ α = α + k}. 4