Proofs, Tables and Figures

Similar documents
1 Hoeffding s Inequality

EVALUATIONS OF EXPECTED GENERALIZED ORDER STATISTICS IN VARIOUS SCALE UNITS

A converse Gaussian Poincare-type inequality for convex functions

Essential Maths 1. Macquarie University MAFC_Essential_Maths Page 1 of These notes were prepared by Anne Cooper and Catriona March.

March 1, Florida State University. Concentration Inequalities: Martingale. Approach and Entropy Method. Lizhe Sun and Boning Yang.

3 Continuous Random Variables

TIGHT BOUNDS FOR THE FIRST ORDER MARCUM Q-FUNCTION

h(x) lim H(x) = lim Since h is nondecreasing then h(x) 0 for all x, and if h is discontinuous at a point x then H(x) > 0. Denote

Machine Learning Theory Lecture 4

Chapter 7. Confidence Sets Lecture 30: Pivotal quantities and confidence sets

1 Delayed Renewal Processes: Exploiting Laplace Transforms

Real option valuation for reserve capacity

Georgian Mathematical Journal 1(94), No. 4, ON THE INITIAL VALUE PROBLEM FOR FUNCTIONAL DIFFERENTIAL SYSTEMS

A732: Exercise #7 Maximum Likelihood

Complexity of two and multi-stage stochastic programming problems

Generalized Geometric Series, The Ratio Comparison Test and Raabe s Test

MATH 217A HOMEWORK. P (A i A j ). First, the basis case. We make a union disjoint as follows: P (A B) = P (A) + P (A c B)

Weak bidders prefer first-price (sealed-bid) auctions. (This holds both ex-ante, and once the bidders have learned their types)

Continuous Random Variables 1

Probability Review. Yutian Li. January 18, Stanford University. Yutian Li (Stanford University) Probability Review January 18, / 27

STAT 7032 Probability Spring Wlodek Bryc

2 Chance constrained programming

Large Sample Theory. Consider a sequence of random variables Z 1, Z 2,..., Z n. Convergence in probability: Z n

The main results about probability measures are the following two facts:

STAT 200C: High-dimensional Statistics

Lecture Notes 3 Convergence (Chapter 5)

learning bounds for importance weighting Tamas Madarasz & Michael Rabadi April 15, 2015

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

Stochastic Dynamic Programming: The One Sector Growth Model

Online Supplementary Appendix B

Chapter 7: Special Distributions

Case study: stochastic simulation via Rademacher bootstrap

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

arxiv: v1 [math.oc] 22 Mar 2018

2 Continuous Random Variables and their Distributions

Tail bound inequalities and empirical likelihood for the mean

The strictly 1/2-stable example

Continuous Random Variables and Continuous Distributions

Lecture 13: Subsampling vs Bootstrap. Dimitris N. Politis, Joseph P. Romano, Michael Wolf

Lecture 16: Sample quantiles and their asymptotic properties

Continuous Probability Spaces

CHAPTER 3: LARGE SAMPLE THEORY

Lecture 2. We now introduce some fundamental tools in martingale theory, which are useful in controlling the fluctuation of martingales.

STA 2201/442 Assignment 2

Preliminary check: are the terms that we are adding up go to zero or not? If not, proceed! If the terms a n are going to zero, pick another test.

University of Luxembourg. Master in Mathematics. Student project. Compressed sensing. Supervisor: Prof. I. Nourdin. Author: Lucien May

A Probability Primer. A random walk down a probabilistic path leading to some stochastic thoughts on chance events and uncertain outcomes.

Math 152. Rumbos Fall Solutions to Exam #2

Continued Fraction Digit Averages and Maclaurin s Inequalities

1 + lim. n n+1. f(x) = x + 1, x 1. and we check that f is increasing, instead. Using the quotient rule, we easily find that. 1 (x + 1) 1 x (x + 1) 2 =

Traveling waves in a one-dimensional heterogeneous medium

THE DVORETZKY KIEFER WOLFOWITZ INEQUALITY WITH SHARP CONSTANT: MASSART S 1990 PROOF SEMINAR, SEPT. 28, R. M. Dudley

McGill University Math 354: Honors Analysis 3

Mathematics for Economists

S1. Proofs of the Key Properties of CIATA-Ph

Math Camp II. Calculus. Yiqing Xu. August 27, 2014 MIT

Convex Functions. Daniel P. Palomar. Hong Kong University of Science and Technology (HKUST)

On the convergence of sequences of random variables: A primer

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Tail probability of linear combinations of chi-square variables and its application to influence analysis in QTL detection

Review of Mathematical Concepts. Hongwei Zhang

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by

Confidence regions for stochastic variational inequalities

STRONG NORMALITY AND GENERALIZED COPELAND ERDŐS NUMBERS

Iowa State University. Instructor: Alex Roitershtein Summer Homework #5. Solutions

Generalized Seasonal Tapered Block Bootstrap

Chapter 1 Mathematical Preliminaries and Error Analysis

Part II Probability and Measure

Nonparametric estimation under Shape Restrictions

7 Influence Functions

2 Functions of random variables

i=1 β i,i.e. = β 1 x β x β 1 1 xβ d

Probability. Lecture Notes. Adolfo J. Rumbos

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1

Lecture 2: Convex Sets and Functions

Extended dynamic programming: technical details

Technical Appendix to "Sequential Exporting"

MAS223 Statistical Inference and Modelling Exercises

Nonparametric estimation of log-concave densities

MAS3706 Topology. Revision Lectures, May I do not answer enquiries as to what material will be in the exam.

X n D X lim n F n (x) = F (x) for all x C F. lim n F n(u) = F (u) for all u C F. (2)

Chapter 4. Continuous Random Variables 4.1 PDF

Nonlife Actuarial Models. Chapter 4 Risk Measures

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued

THEOREMS, ETC., FOR MATH 515

Limiting Distributions

LECTURE-13 : GENERALIZED CAUCHY S THEOREM

HIGH-DIMENSIONAL GRAPHS AND VARIABLE SELECTION WITH THE LASSO

Random Bernstein-Markov factors

Review 1: STAT Mark Carpenter, Ph.D. Professor of Statistics Department of Mathematics and Statistics. August 25, 2015

SUPPLEMENTARY MATERIAL TO IRONING WITHOUT CONTROL

Lecture 1: August 28

Homework 3 Solutions(Part 2) Due Friday Sept. 8

Appendix. Online supplement to Coordinated logistics with a truck and a drone

Course 212: Academic Year Section 1: Metric Spaces

Lecture 1 Measure concentration

Product measures, Tonelli s and Fubini s theorems For use in MAT4410, autumn 2017 Nadia S. Larsen. 17 November 2017.

LECTURE NOTES 57. Lecture 9

be the set of complex valued 2π-periodic functions f on R such that

Transcription:

e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem ec Proofs, Tales and Figures In this electronic companion to the paper, we provide proofs of the theorems and lemmas. This companion also contains several accompanying tales and figures to the paper. Tale EC. Range of critical fractile values where Assumption holds. Distriution When is Assumption satisfied? Notes Normalµ, σ Exponentialλ Lognormalµ, σ Paretox m, UniformA, B Gamma, β Beta, β Power Law Logisticµ, s GEVµ, σ, ξ Chik Chi-squaredk Laplaceµ, β Weiullλ,k +h +h 0 + erf σ +h 0 +h 0 +h γ, +h Γ B +β ;,β +h B,β 0 +h +h erf: error function Γ: gamma function, γ: incomplete gamma function B: eta function +h e ξ for ξ 0 P k, k for k ; P : regularized gamma function +h { Γ k γ k, k, if k +h 0, if k< +h +h { e k k, if k 0 if k< Tale EC. Average errors % with samples from an exponential distriution. a Sample average approximation Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5.39.83.08.4.6 3. 4.05 4.67 7.65 0.87 33.76 50 0.77 0.73 0.8 0.87.35.49.93.38 3.0 7.33 6.89 00 0.54 0.34 0.48 0.60 0.70 0.9 0.96.50.03 3.4 8.56 00 0.7 0.3 0.7 0.9 0.34 0.40 0.49 0.64.. 4.36 Distriution fitting Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5.88.54.54.69.03.60 3.37 4.6 5.8 9.64 40.06 50 0.65 0.64 0.69 0.80 0.99.3.53.90.7 4.88.93 00 0.36 0.34 0.39 0.46 0.57 0.73 0.96.33.9.6 9.03 00 0. 0.0 0. 0.4 0.8 0.34 0.43 0.59 0.94.64 7.5

ec e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem Figure EC. Upper ound for a log-concave distriution with +h quantile q. Tale EC.3 Average errors % with samples from a normal distriution. a Sample average approximation Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5 6.03 3.84 3.8 3..60.95 3.50 4.9 6.3 8.7 4.85 50.3.69.6.58.4.60.59.06 3.6 4.57 3.76 00.63.5 0.9 0.86 0.83 0.75 0.9.08.56.8 5.94 00 0.8 0.45 0.38 0.36 0.30 0.9 0.38 0.47 0.8.4 3.65 Distriution fitting Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5 4.65 3.53 3.07.73.6.77 3.6 3.74 5.48.83 75. 50.9.43.7.0.4.38.60.87.53 4.4 8.77 00.3 0.90 0.78 0.7 0.68 0.69 0.76 0.89.7.75 6.59 00 0.47 0.36 0.8 0.5 0.5 0.7 0.33 0.4 0.63.03 3.9

e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem ec3 Tale EC.4 Average errors % with samples from a Pareto distriution. a Sample average approximation Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5 0.88 0.69 0.83 0.93.4.59.8 3. 6.70 8.33 34.35 50 0.8 0.8 0.3 0.37 0.60 0.73.0.39.8 6. 33.39 00 0.9 0.3 0.9 0.4 0.9 0.39 0.45 0.83.54 3.8 39.74 00 0.09 0.08 0.0 0. 0.4 0.7 0.4 0.35 0.80.97 6.86 Distriution fitting Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5 0.70 0.6 0.69 0.79 0.96.4.68.47 4.83 9.69 40.3 50 0.5 0.5 0.8 0.3 0.39 0.53 0.76.5.5 4.5 8.9 00 0.5 0.4 0.6 0.0 0.4 0.3 0.4 0.6.34.85.7 00 0.08 0.08 0.09 0.0 0. 0.5 0.9 0.30 0.7.65 6.97 Tale EC.5 Average errors % with samples from a Beta distriution. a Sample average approximation Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5 5.5 4.80 4.07 3.07 3.06.63.9.90 4.30 4.7 4.99 50.69.8.5.99.63.4.6.5.34.88.47 00.86.7 0.94 0.8 0.88 0.85 0.73 0.77 0.79 0.89 0.78 00. 0.59 0.40 0.35 0.36 0.35 0.3 0.3 0.3 0.30 0.4 Distriution fitting Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5 5.6 4.4 3.4.70.38.3.34.39 3.40 7.3 35.94 50.90.4.77.50.43.49.6.65.68.88 9.40 00.43.06 0.83 0.69 0.6 0.6 0.6 0.63 0.64 0.99 4.45 00 0.70 0.43 0.30 0.6 0.5 0.5 0.6 0.6 0.4 0.37.9

ec4 e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem Tale EC.6 Average errors % with samples from a mixed normal distriution. a Sample average approximation Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5 3.99.64.89.4 6.7 3.98 5.5.7 5.78 3.84 4.4 50.9 0.86 0.6 0.39 0.38 0.35 0.53 0.79.8.6 4.6 00 0.74 0.43 0.35 0.7 0.39 0.45 0.33 0.39 0.55 0.7.5 00 0.37 0. 0.6 0.3 0.08 0.08 0. 0.9 0. 0.59.47 Distriution fitting Sample size 0. 0. 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.95 0.99 5.96.67.86 3.3 3.09.36 0.46 6.65 6.55 3.74 3.75 50.56.4 0.5 0.47 0.33 0.54 0.49.8 0.4.84 5.8 00 0.90 0.40 0.35.08 0.8 0.8 0.6.85 0.33.9 7.76 00 0.69 0.38 0.5 0.59 0.4 0.48 0.57 0.37 0.94 4.6.97

e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem ec5 EC.. Proof of Theorem As a preliminary for the proof, let us first state a version of Bernstein s inequality Bernstein 97: Theorem EC. Bernstein s inequality. Let X,X,...,X N that X c almost surely, and VarX =σ. Then, for any t>0, N Pr X i E[X ] t N i= Nt. σ +tc/3 For the proof of Theorem, we will require the following proposition. Proposition EC.. Suppose ˆQ N is the +h Then, for any γ > 0, Proof. Pr C ˆQ N γ and + C ˆQ N γ exp e i.i.d. random variales such quantile of a random sample from D with size N. 3Nγ 6h +8γ + h Let F e the complementary cdf of D, i.e., F q=prd q= F q+prd = q. For a random sample {D,...,D N } drawn from D, let ˆQ N e the +h. sample quantile. Define ˆF N q N ˆ F N q N N [ D i q], i= N [ D i q]. i= For simplicity, define γ +h and β. Define the events B [ +h +C ˆQ N < γ]=[f ˆQ N < β ] and L [ C ˆQ N > γ]=[ F ˆQ N < β ]. To prove Proposition EC., we need to find an upper ound for PrB and for PrL. Define the quantile q inf{q : F q β }. SinceF is nondecreasing, we have that B =[ˆQ N < q ]. Consider a monotonically decreasing, nonnegative sequence {τ k } k=, where τ k 0. Define the sequence of events {B k } k=, where B k [ ˆQ N q τ k ]=[ˆF N q τ k β]. Note that since ˆF N q τ k ˆF N q τ k+, then it follows that B k B k+. Thus, we have that B k lim k B k B, which implies PrB k Pr B. NotealsothatB B, thusprb Pr B. From the definition of q, oserve that for every k, there exists ε k > such that F q τ k = β ε k < β. Notethat F q τ k F q τ k < β β + ε k. EC.

ec6 e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem Thus, we have that PrB k =PrˆF N q τ k β, =PrˆF N q τ k F q τ k ε k, Nε k/ F q τ k F q τ k + ε, EC. k 3 Nε k /, EC.3 β β+β + ε k 3 where EC. follows from Bernstein s inequality and EC.3 follows from inequality EC.. Now, since ε k >, forallk, we have that N/ PrB k β β +β, 3 N/ β β+ 4 minβ, β 3 N/ β β+ =exp 4 3, 3Nγ 6h +8γ + h δ. Thus, PrB Pr B δ. In fact, y going through a similar argument, we can show that PrL δ. Thus, y the union ound, we have that Pr C ˆQ N > γ or + C ˆQ N < γ =PrB L PrB+PrL δ, proving Proposition EC.. Q.E.D. We can now proceed with the proof of Theorem. Note that S LRS ϵ consists of all q for which Cq γ and + Cq γ, withγ = ϵ min, h. From Proposition EC., the SAA solution from 3 a random sample with size N lies in Sϵ LRS with proaility at least Nϵ min{, h} exp = exp 8h +8ϵ + hmin{, h} Nϵ exp 8 + 8ϵ EC.. Proof of Theorem 3 Nϵ min{, h} 8 max{, h} +8ϵ + h min{, h}. + h Since C is convex, S f ϵ [q, can e equivalently expressed as {q : C q C q and q q }. Note that, C q= + hf q F q = + h [ q q fq +Oq q ] = ϵh q fq +Oϵ, EC.4. which follows from Taylor series approximation and from the definition of q in 7.

e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem ec7 To prove Theorem 3, note that the event that Q N S f ϵ [q,, where = C q, is equivalent to the intersection of events [ Q N q ] and [C Q N ]. We will prove an upper ound on the proaility of [ Q N <q ] and on the proaility of [C Q N > ]. It follows similar lines to the proof of Lemma 3.5 in Levi et al. 007, except we will use Bernstein s inequality instead of Hoeffding s inequality. Define β and γ. First, let us ound the proaility of B [ Q +h +h N <q ]. For a realvalued sequence {τ k } k= where τ k 0, define B k [ Q [ N q τ k ]= + + h ˆF N q τ k ] =[ˆF N q τ k β + γ]. Note that since ˆF N is monotonically increasing, it follows that B k B k+.thus,if B is the limiting event of the sequence of events {B k } k=, thenb k B, implying that PrB k Pr B. Note also that B B, thusprb Pr B. Therefore, to ound PrB, we only need to find a uniform upper ound for PrB k. Note that for any k, there exists ε k > 0 such that F q τ k =β ε k.thus, F q τ k F q τ k =β ε k β + ε k < β β + ε k. From Bernstein s inequality, we have that PrB k =Pr ˆFN q τ k β + γ =Pr ˆFN q τ k F q τ k γ + ε k Nγ + ε k F q τ k F q τ k + γ + 3 εk Nγ + ε k =exp β γ+ε k εk β γ + β ε k + 3 Nγ + ε k Nγ β β γ+β + β β γ+β + γ+ε k 3 γ 3 where the inequality follows when β γ 0. Hence, for all k, 3N PrB k. 4h +4 + h Since = C q, from EC.4 we have that PrB k 6Nϵh q fq +Oϵ 3/ 4h + Oϵ / Uϵ. EC.5 Now, let us ound the proaility of L [C Q [ ] N > ] = F Q N < h.defineq +h +h 0 { sup q : F } q h.thus,l =[ Q +h +h N >q 0 ]. Note that Q N =sup{q : h + h ˆ FN q }. For a real-valued sequence {τ k } k= where τ k 0, define L k [ Q [ N q 0 + τ k ]= h + h ˆ FN q 0 + τ k ] [ = ˆ F N q 0 + τ k h + h ] [ ] = ˆ FN q 0 + τ k β γ. + h

ec8 e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem Since ˆ FN is nonincreasing, then it follows that L k L k+.thus,if L is the limiting event of the sequence {L k } k=, thenl k L, implying that PrL k Pr L. Note also that L L, implying that PrL Pr L. Therefore, to prove a ound on PrL, it is sufficient to prove a uniform upper ound on PrL k. Note that for some ϵ k > 0, we have that F q 0 + τ k = β γ ϵ k.thus,l k =[ˆ FN q 0 + τ k F q 0 + τ k γ + ϵ k ]. Finally, from Bernstein s inequality, we have that Nγ + ϵ k PrL k F q 0 + τ k F q 0 + τ k + γ + 3 ϵk Nγ + ϵ k =exp Therefore, we have that for all k, β γ ϵ k β +γ + ϵ k + γ+ϵ k 3 Nγ + ϵ k =exp β γ ϵ k β + γ + β γ ϵ k + γ+ϵ k 3 Nγ + ϵ k β γβ + γ + β γ+ γ+ϵ k 3 Nγ β γβ + γ + β γ+ γ 3 Nγ =exp β β γ + 4 β γ+. γ 3 PrL k 3N 4h +47h 5 6. Since = C q, we have from EC.4 that PrL k 6Nϵh q fq +Oϵ 3/ Uϵ. 4h + Oϵ / EC.6 Summarizing from EC.5 and EC.6, we have that PrB Pr B Uϵ and that PrL Pr L Uϵ. Thus, { Pr Q N <q or C Q } N >C q =PrB L PrB+PrL Uϵ exp 4 Nϵ q fq, as ϵ 0. EC.3. Proof of Lemma Denote y gx or + gx the left-side or right-side derivative of a function g at x. The failure rate and reverse hazard rate is given y rx= fx F x fx and rx=.sincef is a log-concave F x

e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem ec9 distriution, it has an increasing failure rate. This implies that log rx = log fx log F x is increasing, and log rx 0 for all x. Thus, γ + γ 0 + h h = γ + fq F q log fq + fq F q = log rq 0. EC.7 A log-concave distriution also has a decreasing reversed hazard rate. This implies that log rx = log fx log F x is decreasing and + log rx 0 for all x. Thus, γ γ 0 + h = γ fq F q + log fq fq F q = + log rq 0. EC.8 Comining EC.7 and EC.8, we have that +h h γ γ 0 +h. EC.4. Proof of Lemma Note that since log f is concave, then log fx log γ 0 + γ x t, for all x such that fx > 0. Taking the exponent on oth sides proves our result. EC.5. Proof of Lemma 3 Note that d dx F x d dx F x y our assumption that f x f x. Moreover, since F t=f t, then F x F x for all x t and F x F x for all x t. Note that ED t D >t= PrD 0 >t+ s D >tds, = F F t 0 t + sds, F F t 0 t + sds, = ED t D >t With the same technique, we can also prove that Et D D t Et D D t. Comining these results proves the lemma. EC.6. Proof of Lemma 4 We first introduce the following notation: G β + log + β + β log β min{β, β}, Uβ β β log β, β Lβ β β log β. β We need to prove that each of the three functions are nonnegative.. Let us prove the result for G. First, we prove the result for the case when β. Note that + β G = log β. β

ec0 e-companion to Levi, Perakis and Uichanco: Data-driven Newsvendor Prolem The derivative is nonnegative if and only if G + βe β βe β 0. Note that for 0, G = β e β + βe β β βe β, β e β + β βe β, β e β e β 0 Note that G 0 = 0, thus, G 0 for all 0. Now define G + βe β βe β. Note that G G if 0. We have G = β e β e β 0, for 0. Note that G 0 = 0, thus, G G 0 for all 0. Thus, G is nondecreasing in 0, and non-increasing in 0. Since at = 0, this function is zero, then G 0 for all. Now we can also prove the result for β, if we define the function β = β Q.E.D. and G=G.. Let us prove the result for U. The result is true if and only if log β β. Note that log β is a convex function of β, thus the linear approximation at β = i.e., the function β ounds it from elow. Q.E.D. 3. Let us prove the result for L. Defining β = β, note that Lβ=U β 0, which follows from. EC.7. Proof of Theorem 4 Recall that if q Sϵ f [q,, then Cq + ϵcq. Also, Sϵ f [q, can e equivalently expressed as {q : C q C q and q q }. Let Q N e defined in 8, ut with = ϵh min{,h} + +h Oϵ. Since, C q= + hf q F q = + h [ q q fq +Oq q ] = ϵh q fq +Oϵ, then it follows from Proposition that C q when the demand distriution is log-concave. This implies that [ ] [ Q N q C Q ] [ ] [ N Q N q C Q ] N C q. Thus, we only need to derive a lower ound on the proaility of the left-hand side event to prove Theorem 4. Modifying the proof of Theorem 3 y letting = ϵh min{,h} + Oϵ, we can prove +h that Pr Q N <q or C Q N > U ϵ exp h} Nϵmin{,, as ϵ 0. 4 + h