Likewise, properties of the optimal policy for equipment replacement & maintenance problems can be used to reduce the computation.

Similar documents
The Mathematical Appendix

Point Estimation: definition of estimators

Solving Constrained Flow-Shop Scheduling. Problems with Three Machines

Feature Selection: Part 2. 1 Greedy Algorithms (continued from the last lecture)

Simulation Output Analysis

Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand DIS 10b

Summary of the lecture in Biostatistics

Lecture 2 - What are component and system reliability and how it can be improved?

X ε ) = 0, or equivalently, lim

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

IS 709/809: Computational Methods in IS Research. Simple Markovian Queueing Model

Analysis of System Performance IN2072 Chapter 5 Analysis of Non Markov Systems

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

Chapter 5 Properties of a Random Sample

Optimal Strategy Analysis of an N-policy M/E k /1 Queueing System with Server Breakdowns and Multiple Vacations

PPCP: The Proofs. 1 Notations and Assumptions. Maxim Likhachev Computer and Information Science University of Pennsylvania Philadelphia, PA 19104

Mu Sequences/Series Solutions National Convention 2014

Chapter 3 Sampling For Proportions and Percentages

MATH 247/Winter Notes on the adjoint and on normal operators.

PTAS for Bin-Packing

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Functions of Random Variables

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

Introduction to local (nonparametric) density estimation. methods

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

Multi Objective Fuzzy Inventory Model with. Demand Dependent Unit Cost and Lead Time. Constraints A Karush Kuhn Tucker Conditions.

Complete Convergence and Some Maximal Inequalities for Weighted Sums of Random Variables

Bayes (Naïve or not) Classifiers: Generative Approach

Chapter 14 Logistic Regression Models

7.0 Equality Contraints: Lagrange Multipliers

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

Waiting Time Distribution of Demand Requiring Multiple Items under a Base Stock Policy

ENGI 3423 Simple Linear Regression Page 12-01

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

A Primer on Summation Notation George H Olson, Ph. D. Doctoral Program in Educational Leadership Appalachian State University Spring 2010

Lecture 4 Sep 9, 2015

Econometric Methods. Review of Estimation

CHAPTER 4 RADICAL EXPRESSIONS

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Applying the condition for equilibrium to this equilibrium, we get (1) n i i =, r G and 5 i

Investigating Cellular Automata

Laboratory I.10 It All Adds Up

,m = 1,...,n; 2 ; p m (1 p) n m,m = 0,...,n; E[X] = np; n! e λ,n 0; E[X] = λ.

Chapter 8: Statistical Analysis of Simulated Data

ρ < 1 be five real numbers. The

3. Basic Concepts: Consequences and Properties

Dimensionality Reduction and Learning

1 Mixed Quantum State. 2 Density Matrix. CS Density Matrices, von Neumann Entropy 3/7/07 Spring 2007 Lecture 13. ψ = α x x. ρ = p i ψ i ψ i.

The Occupancy and Coupon Collector problems

means the first term, a2 means the term, etc. Infinite Sequences: follow the same pattern forever.

CHAPTER VI Statistical Analysis of Experimental Data

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

COMPROMISE HYPERSPHERE FOR STOCHASTIC DOMINANCE MODEL

Coherent Potential Approximation

18.657: Mathematics of Machine Learning

å 1 13 Practice Final Examination Solutions - = CS109 Dec 5, 2018

THE ROYAL STATISTICAL SOCIETY 2009 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 2 STATISTICAL INFERENCE

UNIT 4 CONTROL CHARTS FOR DEFECTS

A tighter lower bound on the circuit size of the hardest Boolean functions

Chapter 9 Jordan Block Matrices

22 Nonparametric Methods.

Idea is to sample from a different distribution that picks points in important regions of the sample space. Want ( ) ( ) ( ) E f X = f x g x dx

International Journal of

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Statistics Descriptive and Inferential Statistics. Instructor: Daisuke Nagakura

Lecture 2: Linear Least Squares Regression

TESTS BASED ON MAXIMUM LIKELIHOOD

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Channel Models with Memory. Channel Models with Memory. Channel Models with Memory. Channel Models with Memory

MS exam problems Fall 2012

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek

CHAPTER 6. d. With success = observation greater than 10, x = # of successes = 4, and

Extend the Borel-Cantelli Lemma to Sequences of. Non-Independent Random Variables

Random Variables and Probability Distributions

F. Inequalities. HKAL Pure Mathematics. 進佳數學團隊 Dr. Herbert Lam 林康榮博士. [Solution] Example Basic properties

Chapter 4 Multiple Random Variables

8.1 Hashing Algorithms

Wu-Hausman Test: But if X and ε are independent, βˆ. ECON 324 Page 1

Lecture 3 Probability review (cont d)

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

9 U-STATISTICS. Eh =(m!) 1 Eh(X (1),..., X (m ) ) i.i.d

Part 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971))

. The set of these sums. be a partition of [ ab, ]. Consider the sum f( x) f( x 1)

10.1 Approximation Algorithms

Class 13,14 June 17, 19, 2015

Homework 1: Solutions Sid Banerjee Problem 1: (Practice with Asymptotic Notation) ORIE 4520: Stochastics at Scale Fall 2015

1 Onto functions and bijections Applications to Counting

Strong Convergence of Weighted Averaged Approximants of Asymptotically Nonexpansive Mappings in Banach Spaces without Uniform Convexity

Exchangeable Sequences, Laws of Large Numbers, and the Mortgage Crisis.

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

CS286.2 Lecture 4: Dinur s Proof of the PCP Theorem

C-1: Aerodynamics of Airfoils 1 C-2: Aerodynamics of Airfoils 2 C-3: Panel Methods C-4: Thin Airfoil Theory

MEASURES OF DISPERSION

Runtime analysis RLS on OneMax. Heuristic Optimization

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67.

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

DIFFERENTIAL GEOMETRIC APPROACH TO HAMILTONIAN MECHANICS

Application of Generating Functions to the Theory of Success Runs

Transcription:

Whe solvg a vetory repleshmet problem usg a MDP model, kowg that the optmal polcy s of the form (s,s) ca reduce the computatoal burde. That s, f t s optmal to replesh the vetory whe the vetory level s, the t s optmal to replesh whe the vetory level s -. Lkewse, propertes of the optmal polcy for equpmet replacemet & mateace problems ca be used to reduce the computato. Uder certa reasoable assumptos, f t s optmal to replace a mache of age, the t s optmal to replace a older mache. MDP- Mootoe Optmal Polces page D.L.Brcker

Defto: Suppose the states ad actos of a MDP are real varables, * ad let a ( ) be the optmal decso rule as a fucto of the state whe perods rema. * The a ( ) s mootoe odecreasg f ( ) ( ) a s a s + γ for each γ > 0 ad s S * * mootoe ocreasg f ( ) ( ) a s a s + γ for each γ > 0 ad s S * * MDP- Mootoe Optmal Polces page 2 D.L.Brcker

Examples: equpmet replacemet problem, where s = age of the equpmet, ad A s = { 0, } where dcates "replace", 0 dcates "keep". The optmal * polcy s mootoe odecreasg, sce f a ( ) s = (the optmal decso s to replace equpmet at age s), the optmal decso s also "replace" for older peces of equpmet. vetory repleshmet problem, where s = vetory level ad a A s s the vetory level after repleshmet,.e., the order-up-to level. The optmal polcy s mootoe ocreasg, sce f the optmal decso s to order up to level s' whe the curret level s s, the f the vetory level were greater, oe would ot order up to a larger quatty. Rather, f the vetory level s exceeds the reorder pot, the order-up-to level s also s, whle f the vetory level falls below the reorder pot, the order-up-to level creases. MDP- Mootoe Optmal Polces page 3 D.L.Brcker

Deftos: A bary decso process (BDP) s a Markov decso process wth a fte umber of states ad acto set As = { 0, } for each s S. A cotrol lmt V for a BDP s a quatty such that the optmal acto s a (s)=0 f ad oly f s V. Examples: Equpmet replacemet, where actos a (s) are 0 "keep" ad "replace". Stoppg problem, where the actos a(s) are 0 "stop" ad "cotue". Processer wth two rates, where the state s the umber of customers watg for processg ad the two actos are "slow" ad "fast" processg rates. MDP- Mootoe Optmal Polces page 4 D.L.Brcker

Cosder the Bary Decso Process where Defe the fucto f s cs psj f j a j S a a ( ) = m + β ( ) { 0,} γ ( sa, ) =.e., the codtoal probablty that, gve acto a selected state s, the ext state of the system s or greater. j p a sj MDP- Mootoe Optmal Polces page 5 D.L.Brcker

Theorem: Suppose 0 c c c c for s S 0 0 s+ s s+ s ad for each, the fuctos γ (,0), (,) γ (,0) γ (,) are all odecreasg. γ & the dfferece The for each there s a cotrol lmt V,.e., a optmal polcy s gve by a 0 f s = f s < V V Referece: Dael P. Heyma & Matthew J. Sobel, Stochastc Models Operatos Research, Volume II: Stochastc Optmzato, McGraw-Hll Book Compay, 984, page 387. MDP- Mootoe Optmal Polces page 6 D.L.Brcker

Iterpretato: The codto 0 0 0 cs+ cs cs+ cs states that the sgle-perod cost creases as s creases, but that the cremet cost s greater f a = 0 tha f a =. The codto that γ (,0) ad (,) γ are odecreasg meas that, regardless of the acto, large values of s ted to be followed by large values of s+, whle the codto that γ (,0) γ (,) ths tedecy s greater f a = 0 tha f a =. s odecreasg meas that MDP- Mootoe Optmal Polces page 7 D.L.Brcker

Example: Equpmet replacemet problem, where state s = ageof the equpmet, ad actos are: a = deotes "replacemet" ad a = 0 deotes "keep". Cosder frst the determstc problem wth o falures,.e., so that p 0 ss, ps,0 + = =, γ ( s,0) = 0 for s ad ( ) γ s,0 = for s >. Furthermore, γ ( s ) = for all s, ad ( ) 0, γ s, = 0 for. It follows that the codtos upo the fucto γ the theorem are satsfed. MDP- Mootoe Optmal Polces page 8 D.L.Brcker

Suppose that form ( r L s ) 0 cs s the operatg cost at age s ad that where r s the cost of the ew pece of equpmet ad L s s the salvage value of the replaced equpmet. The the codtos of the theorem requre that 0 0 0 Ls Ls+ cs+ cs cs has the that s, operatg cost creases wth age whle salvage value decreases, wth operatg costs rsg wth age at a rate at least as great as the reducto salvage value. Gve these reasoable assumptos, the theorem mples exstece of a optmal cotrol lmt,.e., t s optmal to replace the equpmet whe t exceeds a certa age. MDP- Mootoe Optmal Polces page 9 D.L.Brcker

Cosder ow the more realstc case whch radom falures may occur: b s s the probablty of falure of a mache of age s. The p = + b & p = b 0 0 ss, s s,0 s sce falg uts are mmedately replaced, so that γ 0 f s+ < s,0 = bs f 0< s + f = 0 ( ) whch s ot ecessarly odecreasg s as s requred to apply the theorem. MDP- Mootoe Optmal Polces page 0 D.L.Brcker

b b + However, "creasg falure rate",.e., s s s ofte a vald assumpto, ad order to apply the theorem we re-defe the states: Delete state 0 ad let state deote the "hghest" state such that breakdows cause a trasto to ths state, from whch replacemet madatory,.e., A = {}. MDP- Mootoe Optmal Polces page D.L.Brcker

For all s, let p = s, b0 deote the probablty that a ew tem breaks dow. p The s, ad = bs ad p 0 ss, + = bs for s, s, so that Therefore, (,0) ( ) γ s,0 = f s + ( ) γ s,0 = b f > s + s γ s odecreasg f falure rates are b b + odecreasg ( s s Smlarly, γ ( s,) for all s ). s a costat wth respect to s, ad so the codtos of the theorem relatg to γ are satsfed. MDP- Mootoe Optmal Polces page 2 D.L.Brcker

Let cs be defed as before for s ad c = r L. Also, we should crease expected costs uder acto 0 ("keep") to clude the expected replacemet costs,.e., where ( β ) ' ( ) c = b r K + b K 0 s s s s s Ks ' s the salvage value of a broke-dow tem ad Ks s the operatg cost of oe that has ot broke dow. MDP- Mootoe Optmal Polces page 3 D.L.Brcker

The { c k s } s odecreasg s f L b s+ s b L s s+,, ( ) ( ) b K b K, s s s+ s+ b K' b K ', ad L s s s+ s+ L s for all s. Uder these codtos, the theorem mples a optmal cotrol lmt polcy. MDP- Mootoe Optmal Polces page 4 D.L.Brcker

Suppose that the states are defed ot by chroologcal age, but by codto, so that p > 0 for may j (ot merely for j = s+ ad j = ). 0 sj I order for the codtos of the theorem to hold, t s ecessary that j p 0 sj If s be odecreasg s for each. c = r L, the a cotrol lmt polcy s optmal. s MDP- Mootoe Optmal Polces page 5 D.L.Brcker