Optimal Control of an Inventory System with Joint Production and Pricing Decisions

Size: px
Start display at page:

Download "Optimal Control of an Inventory System with Joint Production and Pricing Decisions"

Transcription

1 Optimal Control of an Inventory System with Joint Production and Pricing Decisions Ping Cao, Jingui Xie Abstract In this study, we consider a stochastic inventory system in which the objective of the manufacturer is to maximize the long-run average profit by dynamically offering the selling price and switching the production on or off. The demand process is non-homogeneous Poisson with a price-dependent arrival rate. System costs consist of switching costs, inventory holding and backlogging costs. We show that an s, S, p policy is optimal. Moreover, we characterize the structural properties of the optimal profit function and pricing strategy, and show that the optimal price is a quasiconcave function of the inventory level when the production is off and is a quasiconvex function when the production is on. Index Terms production/inventory systems; average criterion; optimal s, S, p policy; limited capacity. I. INTRODUCTION With the rapid development of e-commerce and new technologies e.g., big data, could computing and electronic funds transfer, a number of industries have begun to adopt dynamic pricing strategies and sell products directly to customers through their websites. For example, Dell sells its products through its website, offering promotions every week and even changing prices daily. The Alibaba Group operates leading online and mobile marketplaces in retail and wholesale trades and provides technology and services to enable buyers and suppliers to conduct commerce more easily. These developments have resulted in an increased interest in the construction of models to integrate production decisions, inventory control, and pricing strategies to improve the profitability of companies. In this study, we consider an inventory model with joint production and pricing decisions, in which the demand process is non-homogeneous Poisson with a price-dependent arrival rate, the production rate is constant and the production can be switched on or off with a fixed cost. When the production capacity is unlimited, this problem becomes an inventory control problem with pricing and replenishment decisions, which has been well studied in the literature. For example, Chen and Simchi-Levi [3] consider a periodic-review model with price-dependent demand and a fixed ordering cost, and they show under some conditions that the optimal policy for the infinite-horizon problem is an s, S, p policy. In [4], Chen and Simchi-Levi consider a continuous-review model in which the inter-arrival time and the demand size depend on the selling price. They show that optimal stationary s, S, p inventory policies exist for both discounted and average profit problems and prove that a higher inventory holding and shortage cost result in a School of Management, University of Science and Technology of China, Hefei, , China smaller selling price for the average profit problem. Chao and Zhou [2] consider an infinite-horizon, continuous-review stochastic inventory system in which the demand process is Poisson with a price-dependent arrival rate. They show that the optimal price is a unimodal function of the inventory level. However, none of them consider the case in which production capacity is limited. Several papers also consider a limited production capacity in inventory models. Federgruen and Zipkin [5] consider a periodic-review inventory model with a finite production capacity and show that under a few additional unrestrictive assumptions, a modified base-stock policy is optimal under the average-cost criterion. Foreest and Wijngarrd [6] consider a continuous review production-inventory system with compound Poisson demand and fixed switching costs and show that s, S policy is optimal if the holding cost is convex. However, they do not incorporate pricing decisions into their models. In this study, we consider a stochastic inventory system in which the manufacturer can dynamically adjust the selling price and switch the production on or off. To the best of our knowledge, this study is the first to incorporate pricing and production decisions into the inventory control. We assume that the production rate is large. Under this assumption, we show that an s, S, p policy is optimal for a discretized model. An s, S, p policy says that if the inventory level falls to s or below then the production will be switched on; if the inventory level reaches to S or above then the production will be switched off; and if the inventory level is x s < x < S, then the status of production will remain unchanged. The price p depends on the current inventory level. Moreover, we characterize the optimal inventory control and pricing strategy. We show that the optimal price is a quasiconcave function of the inventory level when the production is off and is a quasiconvex function when the production is on. The rest of the paper is organized as follows. In Section 2, we describe the inventory model and apply a discretization technique. In Section 3, we show that the s, S, p policy is optimal and characterize the structural properties of the optimal profit function and pricing strategy. In Section 4, we conclude the paper with several possible extensions. II. MODEL Consider a manufacturer who can control its inventory level by production and pricing jointly. The selling price of each product is set as p t at time t. Demand arrives according to a non-homogeneous Poisson process with rate Λ t p t, which c 205 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See for more information.

2 2 depends on the price p t. Demand is met immediately if the inventory is available. Otherwise, it is backlogged. At any time t, the production can be switched on off with cost K K 0. Production rate is constant r, i.e., the manufacturer can produce r products in one unit of time if production is on. This dynamic inventory control problem is analytically challenging. In this paper, we apply a discretization technique and solve the corresponding discretized problem. Let t = /r. Assume r is large enough so that t is very small. Due to industrialization and mass production, this assumption is not unrealistic. According to the property of non-homogeneous Poisson process, there is one arrival in time interval t, t+ t] with probability w.p. Λ t t + o t, more than one arrival w.p. o t, and no arrivals w.p. Λ t t. If t 0, then approximately there is one arrival in time interval t, t + t] with probability w.p. Λ t t, more than one arrival w.p. 0, and no arrivals w.p. Λ t t. Now, we can define the discretized problem. Let the length of each period be t. Suppose that price p P is chosen at the beginning of a period, where P is assumed to be a compact set. Let λp := Λ t t. Then with probability λp that a demand arrives, and with probability λp no demand arrives. Demand is met by end-of-period inventory, and unsatisfied demand is backlogged. Production rate is now, i.e., the manufacturer can produce one product in one period if the production is on. At the beginning of each period the state of production is chosen to be on or off. If production is off on at the end of a period, it can be switched on off at the beginning of the next period at the expense of a setup cost K K 0. When production is on off in a period, keeping it on off is free. We suppose that the production cost per item is c. Inventory costs are accrued at the end of a period according to the function h. A typical form for h is hx = hx + + bx, where x + = max{x, 0}, x = max{ x, 0}, and h and b are the holding cost and shortage cost per item per period, respectively. In the remainder of the paper, we take the following assumptions of hx. Assumption II.. hx is quasiconvex in x with its minimum point x = 0, h0 = 0, and lim x hx =. A function fx x Z is called quasiconvex unimodal if for some value m, it is increasing for x m and decreasing for x m. A function fx is quasiconcave if fx is quasiconvex. The system state can be described by its current inventory level x, and its production status y, thereby denoted by x, y, where y = indicates that the production is on and y = 0 indicates that the production is off. Assumption II.2. λp is strictly decreasing and continuous in p; Moreover, λ max p P λp <. Assumption II.3. There exists a positive number M such that max p P pλp = M <. Assumption II.4. K 0 + K > 0. Given a pricing and production policy u and the initial system state x, y, the long-run average profit can be formulated as V c x, y, u = lim inf T T E { T t= p t D t p t hx t+ cδy t = } K δy t =, y t = 0 K 0 δy t = 0, y t =, where x t, y t and p t are the inventory level, production state and selling price at the beginning of period t, respectively, D t p t is the demand in period t, which is w.p. λp t and 0 w.p. λp t. δa is an indicator function, which is if the event A is true, and 0 if A is false. The manufacturer s objective is to maximize its long-run average profit by dynamically adjusting its price and switching the production on or off, i.e., find an optimal policy u such that V c x, y V c x, y, u = sup V c x, y, u, u U where U is the set of all admissible policies. III. MAIN RESULTS A. Existence of optimal stationary policies Assumption II.4 implies that it is optimal to switch the production at most once at each state. Thus, the problem can be casted as a discrete-time infinite-horizon Markov decision problem under average criterion with state space S = {x, y x Z, y = 0, }, action space A = {p, z p P, z = 0, }, one-step reward function at state x, y under action p, z Rx, y, p, z = K z δz y + pλp cδz = λphx + z λphx + z and one-step transition probability from state x, y to x, z under action p, z λp, if x = x + δz =, P x, y, p, x, z = λp, if x = x + δz =. 0. otherwise. Before we show the existence of an optimal stationary policy, the following lemma is required, which comes from Corollary and Theorem in [9]. Lemma III.. Consider a discrete time Markov decision process {xt : t 0} consisting of four-element tuple {S, Ai, i S, pj i, a, ci, a} under average cost criterion: The state space S is denumerable; Each action space Ai is a subset of the finite action space A; The transition probability pj i, a satisfies pj i, a 0, i, j S, a Ai and j S pj i, a =, i S, a Ai; The one-step cost function ci, a 0, i S, a Ai. There exists an optimal stationary policy if the following set CAV of assumptions hold: CAV There exists a z standard policy d with positive recurrent class R d. d is defined to be a i 0 standard policy if the Markov process induced by d, {x d t : t 0} satisfies that for any i S, the expected time m i,i0 d of a first passage c 205 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See for more information.

3 3 from i to i 0 during which at least one transition occurs is finite and the expected cost c i,i0 d of a first passage from i to i 0 during which at least one transition occurs is finite. CAV2 Given U > 0, the set D U = {i Ci, a U for some a} is finite. CAV3 Given i S R d, there exists a policy θ i R z, i, where R z, i is the class of policies d such that the expected cost c z,i d of a first passage from z to i is finite. Since P is a compact set, it follows from that there exists a positive number M such that Rx, y, p, z M. Thus, the average profit problem can be casted as a discrete time Markov decision process under average cost criterion with one-step cost M Rx, y, p, z. Now we are ready to show the existence of an optimal stationary policy by applying Lemma III. to verify that CAV-3 holds in our model. Proposition III.. There exists an optimal stationary policy for the average profit problem. Proof. Let policy d be an s, S, p policy which is described in the introduction such that λpx, 0 > 0 for x > S, λpx, < for x < s, and 0 < λpx, y < for s x S and y = 0,. It is straightforward to see that for any inventory level z [s, S] Z, the Markov chain induced by d is z, 0 standard with its positive recurrent class R d = {x, y S : s x S}. Thus, CAV holds. Given U, it is straightforward to verify that D U = {x, y S Rx, y, p, z U for some p, z A} is finite by Assumption II. and thus CAV2 holds. Given any system state x 0, y 0 such that x 0, y 0 / R d, define an s, S, p policy denoted by d, such that λp x, 0 > 0 for x > S, λp x, < for x < s, 0 < λp x, y < for s x S and y = 0,, and s minx 0, s and S maxx 0, S. Thus, it is clear that d R z, 0, x 0, y 0 and thus CAV3 holds. B. Optimality of s, S, p policy Based on the results in subsection III-A, we only consider stationary policies, i.e., the decision depends deterministically on the system s current state. Under a stationary policy u, if the system s current state is x, y, then the price is a function p u x, y, and the production decision is y u x, y = +, 0 or y, where y u x, y = + implies that y = 0 and the production is switched on, y u x, y = 0 implies that y = and the production is switched off, and y u x, y = y implies keeping the production status unchanged. Note that the inventory process is two-sided skip free i.e., the inventory level x cannot jump to x + z, z 2 in one period. It follows from [0] that the following result holds. Proposition III.2. There exists an s, S, p policy which is optimal in the class of all stationary policies. Proof. Suppose that a stationary policy u is given and it is optimal. If the initial state is x 0,, define S u = inf{x Z : x x 0, y u x, = 0 } and s u = sup{x Z : x < S u, y u x, 0 = +}. First it is easy to see that S u <. Otherwise, the inventory level will go to and thus the policy u cannot be optimal. Moreover, we have s u >. Otherwise, the inventory level will go to and thus the policy u cannot be optimal. Since the inventory process is two-sided skip free, we know that the inventory level is always between s u and S u, and thus the long-run average profit under the policy u is equal to that under a s u, S u, p policy. If the initial state is x 0, 0, then we define s u = sup{x Z : x x 0, y u x, 0 = +} and S u = inf{x Z : x > s u, y u x, = 0 }. Using a similar argument, we know that the long-run average profit under the policy u is equal to that under a s u, S u, p policy. Therefore, there exists a s, S, p policy which is optimal in the class of all stationary policies. Note that the resulting s u and S u depend on the system s initial state. This proposition tells us that we can restrain our discussion on the s, S, p policies. We mention that the above argument does not work for discounted case. C. Average profit under an s, S, p policy To compute the average profit, we use renewal reward theory. A cycle starts from S,. The manufacturer switches off the production and the inventory level falls from S to s. Afterward, the production is switched on and the inventory level rises from s to S, which ends a circle. Next, we compute the expected cycle length and the expected profit per cycle. Under an s, S, p policy, if the production is off, the length of the time in which the inventory level falls from x to x, denoted by τ 0 x, is geometrically distributed with mean /λpx, 0, i.e., P τ 0 x = k = λpx, 0 λpx, 0 k, k. The expected profit in which the inventory level falls from x to x is R 0 x = E[px, 0 hxτ 0 x hx ] λpx, 0 = px, 0 hx hx. λpx, 0 If the production is on, the length of the time in which the inventory level rises from x to x +, denoted by τ x, is geometrically distributed with mean / λpx,.the expected profit in which the inventory level rises from x to x + is R x = E[px, τ x hxτ x hx + cτ x] λpx, = px, λpx, hx λpx, λpx, c hx + λpx,. It follows from renewal reward theory that the average profit under an s, S, p policy is K + S Rs, S, p = R 0i + R i S /λpi, 0 + / λpi,, where K = K + K 0. By subtracting a dummy profit γ from original profit in each period, we define the following auxiliary total profit function c 205 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See for more information.

4 4 in a cycle Ls, S, p, γ S = K + γ S R 0i + R i /λpi, 0 + / λpi, S = K + pi, 0 hi + γ + hi hi λpi, 0 pi, λpi, c hi γ + + hi hi + λpi, S = K + pi, 0 hi + γ λpi, 0 pi, λpi, c hi γ + λpi,. 2 Define Lγ = max s,s,p Ls, S, p, γ. Then it follows from basic theory of fractional programming [8] that γ is the maximal average profit if and only if Lγ = 0. It follows from the first equality of 2 that Ls, S, p, γ is a strictly decreasing, convex function of γ. Since both the monotonicity and convexity are preserved under maximization, we know that Lγ is also a strictly decreasing, convex function of γ. It follows from the first equality of 2 and the definition of Lγ that if γ is sufficiently large, Lγ < 0 and if γ is sufficiently small, Lγ > 0. In fact, in Proposition III.3 we show that Lγ = + if γ < 0. Since Lγ is strictly decreasing in γ, there exists a unique γ such that Lγ = 0 and γ is the optimal average profit. D. Analysis of the optimal price given s, S and γ We analyze the optimal price given s, S and γ, which is the optimal solution to the problem Ls, S, γ := max Ls, S, p, γ. p It follows from Assumption II.2 that λp has a strictly decreasing, continuous inverse pλ. Therefore, we can take arrival probability λ as the decision variable instead of p. Define f 0 γ, x, λ = pλ hx+γ/λ and f γ, x, λ = λpλ c hx γ λ. It follows from the last equality of 2 that Ls, S, p, γ = K + + S f γ, i, λpi,. f 0 γ, i, λpi, 0 We first compute the optimal arrival probability λ x, 0 = λp x, 0 when the inventory level is x and the production is off. The following assumption is required for our discussion. Assumption III.. For any 0 < λ λ, λp λ + 2p λ < 0. Note that the condition that pλ is concave decreasing in λ is sufficient for λp λ + 2p λ < 0. Moreover, it follows from Assumption III. that the feasible region of λ is the interval [0, λ]. It follows from dp d = d 2 p 2 p, d 2 = 4 p p that Assumption III. implies that p is strictly concave in for >. Thus, by letting = λ it is known that f 0 γ, x, λ = p hx + γ is strictly concave in and its first order derivative is f 0 = λ2 p λ hx + γ. 3 Now we can claim that it suffices to consider the case γ 0 in our paper. Proposition III.3. If γ < 0, then Lγ = +. Proof. Note that for admissible λ, λ 2 p λ 0. Thus, if γ < 0, then f 0 is increasing in and thus decreasing in λ at x = 0. Therefore, max λ [0, λ] f 0 γ, 0, λ = f 0 γ, 0, 0 = + and thus Ls, S, γ = + which implies Lγ = +. Let g 0 λ = λ 2 p λ, then g 0 is strictly increasing in λ for λ > 0 as g 0λ = λλp λ + 2p λ > 0 according to Assumption III.. Thus, g 0 has a strictly increasing inverse denoted by g0. Moreover, g 00 = 0 and g 0 λ = λ 2 p λ. It follows from 3 that the optimal arrival probability at state x, 0 is { λ g x, 0 = 0 hx + γ, if hx + γ g 0 λ λ, otherwise. Next we compute the optimal arrival probability λ x, when the inventory level is x and the production is on. Let = λ. It follows from d p = d 2 p + p, d 2 d 2 p = 4 p p and Assumption III. thatf γ, x, λ = p c+ hx + γ is strictly concave in for >, and its first order derivative is f = λ λp λ + pλ c + hx + γ. 5 Let g λ = λ λp λ + pλ, then g is strictly decreasing in λ for λ > 0 as g λ = λλp λ + 2p λ < 0 according to Assumption III.. Thus, g has a strictly decreasing inverse denoted by g. Moreover, g 0 = lim λ 0 λp λ + p0 and g λ = λ λp λ. It follows from 5 that the optimal arrival probability at state x, is { λ g x, = c + hx + γ, if c + hx + γ g 0 0, otherwise. 6 Define r 0 γ, x = max λ f 0 γ, x, λ = f 0 γ, x, λ x, 0 and r γ, x = max λ f γ, x, λ = f γ, x, λ x,. Obviously, both r 0 γ, x and r γ, x are strictly decreasing, 4, c 205 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See for more information.

5 5 convex functions of γ as both f 0 γ, x, λ and f γ, x, λ are strictly decreasing, convex functions of γ. Moreover. we have Ls, S, γ = K + = K + S r 0 γ, i + r γ, i S r 0 γ, i + r γ, i. 7 The following two propositions characterize the relationship of r 0 γ, x, r γ, x with x, which is crucial in determining the optimal sγ and Sγ. Proposition III.4. For any given γ 0, both r 0 γ, x and r γ, x x R are quasiconcave functions of x, and 0 is the maximum point. Proof. It follows from the envelope theorem [7] that dr 0γ,x dx = h x/λ x, 0 and drγ,x dx = h x/ λ x, 0. Since hx is quasiconvex in x, we have h x 0 when x 0 and h x 0 when x 0. Therefore, both r 0 γ, x and r γ, x are quasiconcave functions of x, and 0 is the maximum point. Proposition III.5. For any given γ 0, r 0 γ, x+r γ, x x Z is a quasiconcave function of x, and 0 or is the maximum point. Proof. It follows from Proposition III.4 that r 0 γ, x + r γ, x is increasing on x 0 and decreasing on x. Since x Z, r 0 γ, x + r γ, x takes its maximum at point 0 or and is quasiconcave in x Z. Note that the constraint x Z is necessary for Proposition III.5 to be held as r 0 γ, x + r γ, x might not be quasiconcave for 0 < x <. E. Analysis of the optimal s and S given γ Now we want to find the optimal s and S given γ, which is the solution of max Ls, S, γ. s<s,s,s Z Proposition III.6. Given γ 0, if the set {x Z : r 0 γ, x+ r γ, x 0, x 0} is non-empty, then the optimal sγ and Sγ are given by sγ = min{x Z : r 0 γ, x + r γ, x 0}, 8 Sγ = max{x Z : r 0 γ, x + r γ, x 0}. 9 Proof. Since r 0 γ, x+r γ, x is a quasiconcave function of x Z, it follows from 7 that the optimal solution for max s,s Ls, S, γ should be the s and S such that r 0 γ, i + r γ, i 0 for all s + i S. Thus, the result holds. The set {x Z : r 0 γ, x + r γ, x 0, x 0} might be empty if γ is quite large. In this case, r 0 γ, x + r γ, x < 0 for all x Z. In this case, if r 0 γ, 0 + r γ, r 0 γ, + r γ, 0, then the optimal sγ, Sγ =, 0. Otherwise, the optimal sγ, Sγ = 0,. Proposition III.7. Consider the domain of γ such that the set {x Z : r 0 γ, x + r γ, x 0, x 0} is non-empty. sγ is increasing in γ and Sγ is decreasing in γ. Proof. Note that r 0 γ, x+r γ, x is decreasing in γ. For γ < γ 2, it follows from 8 that r 0 γ, sγ +r γ, sγ < 0 and thus r 0 γ 2, sγ + r γ 2, sγ < 0. Again, it follows from 8 that sγ < sγ 2 +, or sγ sγ 2. Thus, sγ is increasing in γ. Similarly, it follows from 9 that Sγ is decreasing in γ. Proposition III.8. λ x, 0 is decreasing on x 0, and increasing on x 0; λ x, is increasing on x 0, and decreasing on x 0. Proof. It follows from the definition of λ x, 0 that f 0 γ, x, λ x, 0 f 0 γ, x, λ x, 0 f 0 γ, x, λ x, 0 f 0 γ, x, λ x, 0. Summing the above two inequalities, we obtain λ x, 0 λ hx hx 0. x, 0 It follows from the quasiconvexity of hx that λ x, 0 is decreasing on x 0, and increasing on x 0. Similarly, we have λ x, λ x, hx hx 0. It follows from the quasiconvexity of hx that λ x, is increasing on x 0, and decreasing on x 0. Note that pλ is strictly decreasing in λ and the optimal price p x, y = pλ x, y. Thus, the interpretation of Proposition III.8 is as follows: When the production is off i.e., y = 0, the optimal price increases in the inventory level x if x 0, and decreases in x if x 0. When the production is on i.e., y =, the optimal price decreases in the inventory level x if x 0, and increases in x if x 0. Therefore, the optimal price is a quasiconcave function of the inventory level x when the production is off and is a quasiconvex function of x when the production is on. The result is consistent with our intuition. Intuitively, because of inventory holding costs and backlogging costs, the manufacturer attempts to maintain the inventory level around 0, by keeping the inventory level around 0 as long as possible and moving the inventory level away from S and s as fast as possible. Thus, he will make different pricing strategies according to the current production state. Suppose that the production is off on: if the inventory level is high, the manufacturer should set the selling price low high to attract more demand to sell the product in stock to make the inventory level quickly reach S to switch the production off; if the backlogging level is high, the manufacturer should also set the selling price low high to make the inventory level quickly reach s to switch the production on to meet more backlogged demand c 205 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See for more information.

6 6 F. Comparative statics We discuss the effect of switching cost. Let γ be the optimal average profit and s = sγ, S = Sγ. Proposition III.9. γ is strictly decreasing in K = K 0 + K ; s is decreasing in K and S is increasing in K. Moreover, p x, 0 is increasing in K and p x, is decreasing in K. Both the expected production-on time length and productionoff time length of a cycle are increasing in K. Proof. It follows from 0 = L γ = max Ls, S, s<s,s,s Z γ 0 S = max K + r 0 γ, i + r γ, i s<s,s,s Z and the fact that both r 0 γ, x and r γ, x are strictly decreasing in γ that γ is strictly decreasing in K. Thus, it follows from Proposition III.7 that s is decreasing in K and S is increasing in K. It follows from 4 and 6 that λ x, 0 is decreasing in K and λ x, is increasing in K. Thus, p x, 0 is increasing in K and p x, is decreasing in K. The expected production-on time length of a cycle is S / λ i,, which is increasing in K as λ x, is increasing in K, s is decreasing in K, S is increasing in K and λ i, > 0 for all i Z. Similarly, the expected production-off time length of a cycle S + /λ i, 0 is also increasing in K. The above result indicates that when the switching cost K increases, switching less frequently is preferred. The impact of holding cost on the optimal policy is unclear, although γ decreases as holding costs increase. Consider a special case in which the holding cost hx at each state x is increased by h i.e., hx + h. In this case, the optimal average profit is decreased by h, but the optimal production-pricing policy i.e., s, S and the optimal price remains unchanged. G. A numerical example In this part, we provide a simple numerical example to illustrate the properties of the optimal pricing strategies. In this example, the production rate r =, and λp = e θp with θ = 0.2. The allowable price set P = [, 0]. The switching costs are K 0 = 50 and K = 00. The holding cost function is hx = hx + + bx with h = 0., b = 0.5. By using value iteration algorithm, we find that the optimal average profit is γ = 0.3, s = 2 and S = 7. Fig shows a sample trajectory of inventory level and price under optimal policy, starting from level 0 with production on, which reflects an asymptotic cyclic behavior. When the production is on, the price first decreases and then increases, the inventory level increases from s to S. The price reaches the minimum when the inventory level is close to zero, so that the demand and production can be balanced while keeping low inventory. When the production is off, the price first increases and then decreases, the inventory level decreases from S to s. The price reaches the maximum p = 0 and keep for a while when the inventory level is close to zero. 20 S = s = Inventory Level Price Time Fig.. A sample trajectory of inventory level and price under optimal policy IV. CONCLUSION In this paper, we consider a stochastic inventory system where the manufacturer can dynamically adjust the selling price and switch the production on or off to maximize the long-run average profit. We show that an s, S, p policy is optimal. Moreover, the structural properties of the optimal profit function and pricing strategies are presented. There are a few topics worthy of further discussion. One can consider a compound Poisson process or batch production, formulating the problem as a continuous model with Poisson process directly. The problem under a discounted criterion, in finite horizon, or with lead times can also be studied. ACKNOWLEDGMENT The authors thank Prof. Xin Chen for proposing this research problem. This work was supported by NSFC under grants 72054, 74059, and 75776, and the Fundamental Research Funds for the Central Universities under grants WK and WK REFERENCES [] Dimitri P. Bertsekas 200 Dynamic Programming and Optimal Control, Vol.II, Athena Scientific, Belmont, Massachusetts. [2] X.Chao, S. X. Zhou 2006 Joint inventory-and-pricing strategy for a stochastic continuous-review system, IIE Transactions, 385: [3] X. Chen, D. Simchi-Levi 2004 Coordinating inventory control and pricing strategies with random demand and fixed ordering cost: the infinite horizon case. Mathematics of Operations Research 293: [4] X. Chen, D. Simchi-Levi 2006 Coordinating inventory control and pricing strategies: the continous review models. Operations Research Letters 34: [5] A. Federgruen, P.H. Zipkin 986 An inventory model with limited production capacity and uncertain demands I. The average-cost criterion. Mathematics of Operations Research 2: [6] N.D. V. Foreest, J. Wijngaard 204 On optimal policies for production-inventory systems with compound Poisson demand and setup costs. Mathematics of Operations Research 392: [7] P. Milgrom, I. Segal 2002 Envelope theorems for arbitrary choice sets. Econometrica 70 2: [8] S. Schaible, T.Ibaraki 983 Fractional programming. European Journal of Operational Research, 2: [9] Sennott, L.I Stochastic Dynamic Programming and the Control of Queueing Systems. New York, John Wiley & Sons. [0] M. J. Sobel 969 Optimal Average-Cost Policy for a Queue with Start-Up and Shut-Down Costs. Operations Research 7: c 205 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See for more information.

Coordinating Inventory Control and Pricing Strategies with Random Demand and Fixed Ordering Cost: The Finite Horizon Case

Coordinating Inventory Control and Pricing Strategies with Random Demand and Fixed Ordering Cost: The Finite Horizon Case OPERATIONS RESEARCH Vol. 52, No. 6, November December 2004, pp. 887 896 issn 0030-364X eissn 1526-5463 04 5206 0887 informs doi 10.1287/opre.1040.0127 2004 INFORMS Coordinating Inventory Control Pricing

More information

(s, S) Optimality in Joint Inventory-Pricing Control: An Alternate Approach*

(s, S) Optimality in Joint Inventory-Pricing Control: An Alternate Approach* OPERATIONS RESEARCH Vol. 00, No. 0, Xxxxx 0000, pp. 000 000 issn 0030-364X eissn 1526-5463 00 0000 0001 INFORMS doi 10.1287/xxxx.0000.0000 c 0000 INFORMS (s, S) Optimality in Joint Inventory-Pricing Control:

More information

Markov Decision Processes and Dynamic Programming

Markov Decision Processes and Dynamic Programming Markov Decision Processes and Dynamic Programming A. LAZARIC (SequeL Team @INRIA-Lille) Ecole Centrale - Option DAD SequeL INRIA Lille EC-RL Course In This Lecture A. LAZARIC Markov Decision Processes

More information

Optimality Results in Inventory-Pricing Control: An Alternate Approach

Optimality Results in Inventory-Pricing Control: An Alternate Approach Optimality Results in Inventory-Pricing Control: An Alternate Approach Woonghee Tim Huh, Columbia University Ganesh Janakiraman, New York University May 9, 2006 Abstract We study a stationary, single-stage

More information

(s, S) Optimality in Joint Inventory-Pricing Control: An Alternate Approach

(s, S) Optimality in Joint Inventory-Pricing Control: An Alternate Approach (s, S) Optimality in Joint Inventory-Pricing Control: An Alternate Approach Woonghee Tim Huh, Columbia University Ganesh Janakiraman, New York University May 10, 2004; April 30, 2005; May 15, 2006; October

More information

Point Process Control

Point Process Control Point Process Control The following note is based on Chapters I, II and VII in Brémaud s book Point Processes and Queues (1981). 1 Basic Definitions Consider some probability space (Ω, F, P). A real-valued

More information

Operations Research Letters. Joint pricing and inventory management with deterministic demand and costly price adjustment

Operations Research Letters. Joint pricing and inventory management with deterministic demand and costly price adjustment Operations Research Letters 40 (2012) 385 389 Contents lists available at SciVerse ScienceDirect Operations Research Letters journal homepage: www.elsevier.com/locate/orl Joint pricing and inventory management

More information

Stationary Probabilities of Markov Chains with Upper Hessenberg Transition Matrices

Stationary Probabilities of Markov Chains with Upper Hessenberg Transition Matrices Stationary Probabilities of Marov Chains with Upper Hessenberg Transition Matrices Y. Quennel ZHAO Department of Mathematics and Statistics University of Winnipeg Winnipeg, Manitoba CANADA R3B 2E9 Susan

More information

ADMISSION CONTROL IN THE PRESENCE OF PRIORITIES: A SAMPLE PATH APPROACH

ADMISSION CONTROL IN THE PRESENCE OF PRIORITIES: A SAMPLE PATH APPROACH Chapter 1 ADMISSION CONTROL IN THE PRESENCE OF PRIORITIES: A SAMPLE PATH APPROACH Feng Chen Department of Statistics and Operations Research University of North Carolina at Chapel Hill chenf@email.unc.edu

More information

A Hierarchy of Suboptimal Policies for the Multi-period, Multi-echelon, Robust Inventory Problem

A Hierarchy of Suboptimal Policies for the Multi-period, Multi-echelon, Robust Inventory Problem A Hierarchy of Suboptimal Policies for the Multi-period, Multi-echelon, Robust Inventory Problem Dimitris J. Bertsimas Dan A. Iancu Pablo A. Parrilo Sloan School of Management and Operations Research Center,

More information

Optimal Control of Parallel Make-To-Stock Queues in an Assembly System

Optimal Control of Parallel Make-To-Stock Queues in an Assembly System Optimal Control of Parallel Make-To-Stock Queues in an Assembly System Jihong Ou Heng-Qing Ye Weiyi Ning Department of Decision Sciences, NUS Business School, National University of Singapore, Singapore

More information

Markov Decision Processes and Dynamic Programming

Markov Decision Processes and Dynamic Programming Markov Decision Processes and Dynamic Programming A. LAZARIC (SequeL Team @INRIA-Lille) ENS Cachan - Master 2 MVA SequeL INRIA Lille MVA-RL Course How to model an RL problem The Markov Decision Process

More information

Keywords: inventory control; finite horizon, infinite horizon; optimal policy, (s, S) policy.

Keywords: inventory control; finite horizon, infinite horizon; optimal policy, (s, S) policy. Structure of Optimal Policies to Periodic-Review Inventory Models with Convex Costs and Backorders for all Values of Discount Factors arxiv:1609.03984v2 [math.oc] 28 May 2017 Eugene A. Feinberg, Yan Liang

More information

A Duality-Based Relaxation and Decomposition Approach for Inventory Distribution Systems

A Duality-Based Relaxation and Decomposition Approach for Inventory Distribution Systems A Duality-Based Relaxation and Decomposition Approach for Inventory Distribution Systems Sumit Kunnumkal, 1 Huseyin Topaloglu 2 1 Indian School of Business, Gachibowli, Hyderabad 500032, India 2 School

More information

ON THE STRUCTURE OF OPTIMAL ORDERING POLICIES FOR STOCHASTIC INVENTORY SYSTEMS WITH MINIMUM ORDER QUANTITY

ON THE STRUCTURE OF OPTIMAL ORDERING POLICIES FOR STOCHASTIC INVENTORY SYSTEMS WITH MINIMUM ORDER QUANTITY Probability in the Engineering and Informational Sciences, 20, 2006, 257 270+ Printed in the U+S+A+ ON THE STRUCTURE OF OPTIMAL ORDERING POLICIES FOR STOCHASTIC INVENTORY SYSTEMS WITH MINIMUM ORDER QUANTITY

More information

Dynamic Pricing for Non-Perishable Products with Demand Learning

Dynamic Pricing for Non-Perishable Products with Demand Learning Dynamic Pricing for Non-Perishable Products with Demand Learning Victor F. Araman Stern School of Business New York University René A. Caldentey DIMACS Workshop on Yield Management and Dynamic Pricing

More information

Optimality Inequalities for Average Cost MDPs and their Inventory Control Applications

Optimality Inequalities for Average Cost MDPs and their Inventory Control Applications 43rd IEEE Conference on Decision and Control December 14-17, 2004 Atlantis, Paradise Island, Bahamas FrA08.6 Optimality Inequalities for Average Cost MDPs and their Inventory Control Applications Eugene

More information

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games 6.254 : Game Theory with Engineering Applications Lecture 7: Asu Ozdaglar MIT February 25, 2010 1 Introduction Outline Uniqueness of a Pure Nash Equilibrium for Continuous Games Reading: Rosen J.B., Existence

More information

Asymptotically Optimal Inventory Control For Assemble-to-Order Systems

Asymptotically Optimal Inventory Control For Assemble-to-Order Systems Asymptotically Optimal Inventory Control For Assemble-to-Order Systems Marty Reiman Columbia Univerisity joint work with Mustafa Dogru, Haohua Wan, and Qiong Wang May 16, 2018 Outline The Assemble-to-Order

More information

THIELE CENTRE. The M/M/1 queue with inventory, lost sale and general lead times. Mohammad Saffari, Søren Asmussen and Rasoul Haji

THIELE CENTRE. The M/M/1 queue with inventory, lost sale and general lead times. Mohammad Saffari, Søren Asmussen and Rasoul Haji THIELE CENTRE for applied mathematics in natural science The M/M/1 queue with inventory, lost sale and general lead times Mohammad Saffari, Søren Asmussen and Rasoul Haji Research Report No. 11 September

More information

EFFECTS OF SYSTEM PARAMETERS ON THE OPTIMAL POLICY STRUCTURE IN A CLASS OF QUEUEING CONTROL PROBLEMS

EFFECTS OF SYSTEM PARAMETERS ON THE OPTIMAL POLICY STRUCTURE IN A CLASS OF QUEUEING CONTROL PROBLEMS EFFECTS OF SYSTEM PARAMETERS ON THE OPTIMAL POLICY STRUCTURE IN A CLASS OF QUEUEING CONTROL PROBLEMS Eren Başar Çil, E. Lerzan Örmeci and Fikri Karaesmen Kellogg School of Management Northwestern University

More information

Geometric insights and proofs on optimal inventory control policies

Geometric insights and proofs on optimal inventory control policies Geometric insights and proofs on optimal inventory control policies Nicky D. van Foreest, Onur. A. Kilic Faculty of Economics and Business, University of Groningen, Groningen, The Netherlands {n.d.van.foreest@rug.nl,

More information

Total Expected Discounted Reward MDPs: Existence of Optimal Policies

Total Expected Discounted Reward MDPs: Existence of Optimal Policies Total Expected Discounted Reward MDPs: Existence of Optimal Policies Eugene A. Feinberg Department of Applied Mathematics and Statistics State University of New York at Stony Brook Stony Brook, NY 11794-3600

More information

OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS

OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS Xiaofei Fan-Orzechowski Department of Applied Mathematics and Statistics State University of New York at Stony Brook Stony

More information

On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies

On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies Eugene A. Feinberg Department of Applied Mathematics and Statistics Stony Brook University

More information

A Single-Unit Decomposition Approach to Multiechelon Inventory Systems

A Single-Unit Decomposition Approach to Multiechelon Inventory Systems OPERATIONS RESEARCH Vol. 56, No. 5, September October 2008, pp. 1089 1103 issn 0030-364X eissn 1526-5463 08 5605 1089 informs doi 10.1287/opre.1080.0620 2008 INFORMS A Single-Unit Decomposition Approach

More information

A monotonic property of the optimal admission control to an M/M/1 queue under periodic observations with average cost criterion

A monotonic property of the optimal admission control to an M/M/1 queue under periodic observations with average cost criterion A monotonic property of the optimal admission control to an M/M/1 queue under periodic observations with average cost criterion Cao, Jianhua; Nyberg, Christian Published in: Seventeenth Nordic Teletraffic

More information

Zero-Inventory Conditions For a Two-Part-Type Make-to-Stock Production System

Zero-Inventory Conditions For a Two-Part-Type Make-to-Stock Production System Zero-Inventory Conditions For a Two-Part-Type Make-to-Stock Production System MichaelH.Veatch Francis de Véricourt October 9, 2002 Abstract We consider the dynamic scheduling of a two-part-type make-tostock

More information

Motivation for introducing probabilities

Motivation for introducing probabilities for introducing probabilities Reaching the goals is often not sufficient: it is important that the expected costs do not outweigh the benefit of reaching the goals. 1 Objective: maximize benefits - costs.

More information

Inventory control with partial batch ordering

Inventory control with partial batch ordering Inventory control with partial batch ordering Alp, O.; Huh, W.T.; Tan, T. Published: 01/01/2009 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and volume

More information

On Finding Optimal Policies for Markovian Decision Processes Using Simulation

On Finding Optimal Policies for Markovian Decision Processes Using Simulation On Finding Optimal Policies for Markovian Decision Processes Using Simulation Apostolos N. Burnetas Case Western Reserve University Michael N. Katehakis Rutgers University February 1995 Abstract A simulation

More information

This paper studies the optimization of the S T inventory policy, where T is the replenishment interval and S

This paper studies the optimization of the S T inventory policy, where T is the replenishment interval and S MANUFACTURING & SERVICE OPERATIONS MANAGEMENT Vol. 14, No. 1, Winter 2012, pp. 42 49 ISSN 1523-4614 (print) ISSN 1526-5498 (online) http://dx.doi.org/10.1287/msom.1110.0353 2012 INFORMS Good and Bad News

More information

Dynamic control of a tandem system with abandonments

Dynamic control of a tandem system with abandonments Dynamic control of a tandem system with abandonments Gabriel Zayas-Cabán 1, Jingui Xie 2, Linda V. Green 3, and Mark E. Lewis 4 1 Center for Healthcare Engineering and Patient Safety University of Michigan

More information

Stochastic inventory system with two types of services

Stochastic inventory system with two types of services Int. J. Adv. Appl. Math. and Mech. 2() (204) 20-27 ISSN: 2347-2529 Available online at www.ijaamm.com International Journal of Advances in Applied Mathematics and Mechanics Stochastic inventory system

More information

MATH4406 Assignment 5

MATH4406 Assignment 5 MATH4406 Assignment 5 Patrick Laub (ID: 42051392) October 7, 2014 1 The machine replacement model 1.1 Real-world motivation Consider the machine to be the entire world. Over time the creator has running

More information

A Decomposition Approach for a Class of Capacitated Serial Systems 1

A Decomposition Approach for a Class of Capacitated Serial Systems 1 A Decomposition Approach for a Class of Capacitated Serial Systems 1 Ganesh Janakiraman 2 IOMS-OM Group Stern School of Business New York University 44 W. 4th Street, Room 8-71 New York, NY 10012-1126.

More information

Inventory Control with Convex Costs

Inventory Control with Convex Costs Inventory Control with Convex Costs Jian Yang and Gang Yu Department of Industrial and Manufacturing Engineering New Jersey Institute of Technology Newark, NJ 07102 yang@adm.njit.edu Department of Management

More information

Single-part-type, multiple stage systems

Single-part-type, multiple stage systems MIT 2.853/2.854 Introduction to Manufacturing Systems Single-part-type, multiple stage systems Stanley B. Gershwin Laboratory for Manufacturing and Productivity Massachusetts Institute of Technology Single-stage,

More information

Optimal Control of a Production-Inventory System with both Backorders and Lost Sales

Optimal Control of a Production-Inventory System with both Backorders and Lost Sales Optimal Control of a Production-Inventory System with both Backorders and Lost Sales Saif Benjaafar Mohsen ElHafsi Tingliang Huang 3 Industrial & Systems Engineering, Department of Mechanical Engineering,

More information

1 Markov decision processes

1 Markov decision processes 2.997 Decision-Making in Large-Scale Systems February 4 MI, Spring 2004 Handout #1 Lecture Note 1 1 Markov decision processes In this class we will study discrete-time stochastic systems. We can describe

More information

Robustness of policies in Constrained Markov Decision Processes

Robustness of policies in Constrained Markov Decision Processes 1 Robustness of policies in Constrained Markov Decision Processes Alexander Zadorojniy and Adam Shwartz, Senior Member, IEEE Abstract We consider the optimization of finite-state, finite-action Markov

More information

Coordinated Replenishments at a Single Stocking Point

Coordinated Replenishments at a Single Stocking Point Chapter 11 Coordinated Replenishments at a Single Stocking Point 11.1 Advantages and Disadvantages of Coordination Advantages of Coordination 1. Savings on unit purchase costs.. Savings on unit transportation

More information

ON ESSENTIAL INFORMATION IN SEQUENTIAL DECISION PROCESSES

ON ESSENTIAL INFORMATION IN SEQUENTIAL DECISION PROCESSES MMOR manuscript No. (will be inserted by the editor) ON ESSENTIAL INFORMATION IN SEQUENTIAL DECISION PROCESSES Eugene A. Feinberg Department of Applied Mathematics and Statistics; State University of New

More information

Dynamic Control of a Tandem Queueing System with Abandonments

Dynamic Control of a Tandem Queueing System with Abandonments Dynamic Control of a Tandem Queueing System with Abandonments Gabriel Zayas-Cabán 1 Jungui Xie 2 Linda V. Green 3 Mark E. Lewis 1 1 Cornell University Ithaca, NY 2 University of Science and Technology

More information

Ordering Policies for Periodic-Review Inventory Systems with Quantity-Dependent Fixed Costs

Ordering Policies for Periodic-Review Inventory Systems with Quantity-Dependent Fixed Costs OPERATIONS RESEARCH Vol. 60, No. 4, July August 2012, pp. 785 796 ISSN 0030-364X (print) ISSN 1526-5463 (online) http://dx.doi.org/10.1287/opre.1110.1033 2012 INFORMS Ordering Policies for Periodic-Review

More information

Online Supplement to

Online Supplement to Online Supplement to Pricing Decisions in a Strategic Single Retailer/Dual Suppliers Setting under Order Size Constraints Ali Ekici Department of Industrial Engineering, Ozyegin University, Istanbul, Turkey,

More information

A MODEL FOR THE LONG-TERM OPTIMAL CAPACITY LEVEL OF AN INVESTMENT PROJECT

A MODEL FOR THE LONG-TERM OPTIMAL CAPACITY LEVEL OF AN INVESTMENT PROJECT A MODEL FOR HE LONG-ERM OPIMAL CAPACIY LEVEL OF AN INVESMEN PROJEC ARNE LØKKA AND MIHAIL ZERVOS Abstract. We consider an investment project that produces a single commodity. he project s operation yields

More information

Influence of product return lead-time on inventory control

Influence of product return lead-time on inventory control Influence of product return lead-time on inventory control Mohamed Hichem Zerhouni, Jean-Philippe Gayon, Yannick Frein To cite this version: Mohamed Hichem Zerhouni, Jean-Philippe Gayon, Yannick Frein.

More information

Constrained Inventory Allocation

Constrained Inventory Allocation Constrained Inventory Allocation Phillip G. Bradford Michael N. Katehakis January 25, 2006 SYNOPTIC ABSTRACT Supply constraints may include minimum supply commitments and maximum delivery limits. Given

More information

Preservation of Supermodularity in Two Dimensional Parametric Optimization Problems and its Applications

Preservation of Supermodularity in Two Dimensional Parametric Optimization Problems and its Applications Preservation of Supermodularity in Two Dimensional Parametric Optimization Problems and its Applications Xin Chen Department of Industrial and Enterprise Systems Engineering University of Illinois at Urbana-Champaign

More information

Research Article A Partial Backlogging Inventory Model for Deteriorating Items with Fluctuating Selling Price and Purchasing Cost

Research Article A Partial Backlogging Inventory Model for Deteriorating Items with Fluctuating Selling Price and Purchasing Cost Advances in Operations Research Volume 2012, Article ID 385371, 15 pages doi:10.1155/2012/385371 Research Article A Partial Backlogging Inventory Model for Deteriorating Items with Fluctuating Selling

More information

On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies

On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies Eugene A. Feinberg, 1 Mark E. Lewis 2 1 Department of Applied Mathematics and Statistics,

More information

Designing the Distribution Network for an Integrated Supply Chain

Designing the Distribution Network for an Integrated Supply Chain Designing the Distribution Network for an Integrated Supply Chain Jia Shu and Jie Sun Abstract We consider an integrated distribution network design problem in which all the retailers face uncertain demand.

More information

Procedia Computer Science 00 (2011) 000 6

Procedia Computer Science 00 (2011) 000 6 Procedia Computer Science (211) 6 Procedia Computer Science Complex Adaptive Systems, Volume 1 Cihan H. Dagli, Editor in Chief Conference Organized by Missouri University of Science and Technology 211-

More information

Ergodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R.

Ergodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R. Ergodic Theorems Samy Tindel Purdue University Probability Theory 2 - MA 539 Taken from Probability: Theory and examples by R. Durrett Samy T. Ergodic theorems Probability Theory 1 / 92 Outline 1 Definitions

More information

Section Notes 9. Midterm 2 Review. Applied Math / Engineering Sciences 121. Week of December 3, 2018

Section Notes 9. Midterm 2 Review. Applied Math / Engineering Sciences 121. Week of December 3, 2018 Section Notes 9 Midterm 2 Review Applied Math / Engineering Sciences 121 Week of December 3, 2018 The following list of topics is an overview of the material that was covered in the lectures and sections

More information

MDP Preliminaries. Nan Jiang. February 10, 2019

MDP Preliminaries. Nan Jiang. February 10, 2019 MDP Preliminaries Nan Jiang February 10, 2019 1 Markov Decision Processes In reinforcement learning, the interactions between the agent and the environment are often described by a Markov Decision Process

More information

Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors

Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors DOI 10.1007/s10479-017-2548-6 AVI-ITZHAK-SOBEL:PROBABILITY Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors Eugene A.

More information

JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH BACKORDERS. Lou Caccetta and Elham Mardaneh

JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH BACKORDERS. Lou Caccetta and Elham Mardaneh JOURNAL OF INDUSTRIAL AND doi:10.3934/jimo.2010.6.123 MANAGEMENT OPTIMIZATION Volume 6, Number 1, February 2010 pp. 123 147 JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH

More information

Serial Inventory Systems with Markov-Modulated Demand: Solution Bounds, Asymptotic Analysis, and Insights

Serial Inventory Systems with Markov-Modulated Demand: Solution Bounds, Asymptotic Analysis, and Insights Serial Inventory Systems with Markov-Modulated Demand: Solution Bounds, Asymptotic Analysis, and Insights Li Chen Jing-Sheng Song Yue Zhang Fuqua School of Business, Duke University, Durham, NC 7708 li.chen@duke.edu

More information

Optimal Control of Stochastic Inventory System with Multiple Types of Reverse Flows. Xiuli Chao University of Michigan Ann Arbor, MI 48109

Optimal Control of Stochastic Inventory System with Multiple Types of Reverse Flows. Xiuli Chao University of Michigan Ann Arbor, MI 48109 Optimal Control of Stochastic Inventory System with Multiple Types of Reverse Flows Xiuli Chao University of Michigan Ann Arbor, MI 4809 NCKU Seminar August 4, 009 Joint work with S. Zhou and Z. Tao /44

More information

ON THE POLICY IMPROVEMENT ALGORITHM IN CONTINUOUS TIME

ON THE POLICY IMPROVEMENT ALGORITHM IN CONTINUOUS TIME ON THE POLICY IMPROVEMENT ALGORITHM IN CONTINUOUS TIME SAUL D. JACKA AND ALEKSANDAR MIJATOVIĆ Abstract. We develop a general approach to the Policy Improvement Algorithm (PIA) for stochastic control problems

More information

Markov Decision Processes and their Applications to Supply Chain Management

Markov Decision Processes and their Applications to Supply Chain Management Markov Decision Processes and their Applications to Supply Chain Management Jefferson Huang School of Operations Research & Information Engineering Cornell University June 24 & 25, 2018 10 th Operations

More information

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 43, NO. 3, MARCH

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 43, NO. 3, MARCH IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 43, NO. 3, MARCH 1998 315 Asymptotic Buffer Overflow Probabilities in Multiclass Multiplexers: An Optimal Control Approach Dimitris Bertsimas, Ioannis Ch. Paschalidis,

More information

Replenishment Planning for Stochastic Inventory System with Shortage Cost

Replenishment Planning for Stochastic Inventory System with Shortage Cost Replenishment Planning for Stochastic Inventory System with Shortage Cost Roberto Rossi UCC, Ireland S. Armagan Tarim HU, Turkey Brahim Hnich IUE, Turkey Steven Prestwich UCC, Ireland Inventory Control

More information

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle  holds various files of this Leiden University dissertation Cover Page The handle http://hdl.handle.net/1887/39637 holds various files of this Leiden University dissertation Author: Smit, Laurens Title: Steady-state analysis of large scale systems : the successive

More information

Infinite-Horizon Average Reward Markov Decision Processes

Infinite-Horizon Average Reward Markov Decision Processes Infinite-Horizon Average Reward Markov Decision Processes Dan Zhang Leeds School of Business University of Colorado at Boulder Dan Zhang, Spring 2012 Infinite Horizon Average Reward MDP 1 Outline The average

More information

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle   holds various files of this Leiden University dissertation Cover Page The handle http://hdlhandlenet/1887/39637 holds various files of this Leiden University dissertation Author: Smit, Laurens Title: Steady-state analysis of large scale systems : the successive

More information

Zero-sum semi-markov games in Borel spaces: discounted and average payoff

Zero-sum semi-markov games in Borel spaces: discounted and average payoff Zero-sum semi-markov games in Borel spaces: discounted and average payoff Fernando Luque-Vásquez Departamento de Matemáticas Universidad de Sonora México May 2002 Abstract We study two-person zero-sum

More information

2001, Dennis Bricker Dept of Industrial Engineering The University of Iowa. DP: Producing 2 items page 1

2001, Dennis Bricker Dept of Industrial Engineering The University of Iowa. DP: Producing 2 items page 1 Consider a production facility which can be devoted in each period to one of two products. For simplicity, we assume that the production rate is deterministic and that production is always at full capacity.

More information

Make-to-Stock Production-Inventory Systems. and Lost Sales

Make-to-Stock Production-Inventory Systems. and Lost Sales Make-to-Stock Production-Inventory Systems with Compound Poisson emands, Constant Continuous Replenishment and Lost Sales By Junmin Shi A dissertation submitted to the Graduate School-Newark Rutgers, The

More information

7.1 INTRODUCTION. In this era of extreme competition, each subsystem in different

7.1 INTRODUCTION. In this era of extreme competition, each subsystem in different 7.1 INTRODUCTION In this era of extreme competition, each subsystem in different echelons of integrated model thrives to improve their operations, reduce costs and increase profitability. Currently, the

More information

Abstract Dynamic Programming

Abstract Dynamic Programming Abstract Dynamic Programming Dimitri P. Bertsekas Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology Overview of the Research Monograph Abstract Dynamic Programming"

More information

On the static assignment to parallel servers

On the static assignment to parallel servers On the static assignment to parallel servers Ger Koole Vrije Universiteit Faculty of Mathematics and Computer Science De Boelelaan 1081a, 1081 HV Amsterdam The Netherlands Email: koole@cs.vu.nl, Url: www.cs.vu.nl/

More information

Resource allocation in organizations: an optimality result

Resource allocation in organizations: an optimality result Resource allocation in organizations: an optimality result Arun Sundararajan New York University, Stern School of Business 44 West 4th Street, K-MEC 9-79 New York, NY 10012-1126 asundara@stern.nyu.edu

More information

Some Background Math Notes on Limsups, Sets, and Convexity

Some Background Math Notes on Limsups, Sets, and Convexity EE599 STOCHASTIC NETWORK OPTIMIZATION, MICHAEL J. NEELY, FALL 2008 1 Some Background Math Notes on Limsups, Sets, and Convexity I. LIMITS Let f(t) be a real valued function of time. Suppose f(t) converges

More information

The Impact of Customer Impatience on Production Control

The Impact of Customer Impatience on Production Control The Impact of Customer Impatience on Production Control Michael H. Veatch March 27, 27 Abstract Most analyses of make-to-stock production control assume that either all orders are eventually met (complete

More information

A.Piunovskiy. University of Liverpool Fluid Approximation to Controlled Markov. Chains with Local Transitions. A.Piunovskiy.

A.Piunovskiy. University of Liverpool Fluid Approximation to Controlled Markov. Chains with Local Transitions. A.Piunovskiy. University of Liverpool piunov@liv.ac.uk The Markov Decision Process under consideration is defined by the following elements X = {0, 1, 2,...} is the state space; A is the action space (Borel); p(z x,

More information

Stochastic Dynamic Programming. Jesus Fernandez-Villaverde University of Pennsylvania

Stochastic Dynamic Programming. Jesus Fernandez-Villaverde University of Pennsylvania Stochastic Dynamic Programming Jesus Fernande-Villaverde University of Pennsylvania 1 Introducing Uncertainty in Dynamic Programming Stochastic dynamic programming presents a very exible framework to handle

More information

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance In this technical appendix we provide proofs for the various results stated in the manuscript

More information

Lecture 5: The Bellman Equation

Lecture 5: The Bellman Equation Lecture 5: The Bellman Equation Florian Scheuer 1 Plan Prove properties of the Bellman equation (In particular, existence and uniqueness of solution) Use this to prove properties of the solution Think

More information

Lecture 2. We now introduce some fundamental tools in martingale theory, which are useful in controlling the fluctuation of martingales.

Lecture 2. We now introduce some fundamental tools in martingale theory, which are useful in controlling the fluctuation of martingales. Lecture 2 1 Martingales We now introduce some fundamental tools in martingale theory, which are useful in controlling the fluctuation of martingales. 1.1 Doob s inequality We have the following maximal

More information

Stochastic Analysis of Bidding in Sequential Auctions and Related Problems.

Stochastic Analysis of Bidding in Sequential Auctions and Related Problems. s Case Stochastic Analysis of Bidding in Sequential Auctions and Related Problems S keya Rutgers Business School s Case 1 New auction models demand model Integrated auction- inventory model 2 Optimizing

More information

Simplex Algorithm for Countable-state Discounted Markov Decision Processes

Simplex Algorithm for Countable-state Discounted Markov Decision Processes Simplex Algorithm for Countable-state Discounted Markov Decision Processes Ilbin Lee Marina A. Epelman H. Edwin Romeijn Robert L. Smith November 16, 2014 Abstract We consider discounted Markov Decision

More information

1 Positive Ordering Costs

1 Positive Ordering Costs IEOR 4000: Production Management Professor Guillermo Gallego November 15, 2004 1 Positive Ordering Costs 1.1 (Q, r) Policies Up to now we have considered stochastic inventory models where decisions are

More information

Tutorial: Optimal Control of Queueing Networks

Tutorial: Optimal Control of Queueing Networks Department of Mathematics Tutorial: Optimal Control of Queueing Networks Mike Veatch Presented at INFORMS Austin November 7, 2010 1 Overview Network models MDP formulations: features, efficient formulations

More information

Full terms and conditions of use:

Full terms and conditions of use: This article was downloaded by: [148.251.232.83] On: 13 November 2018, At: 16:50 Publisher: Institute for Operations Research and the Management Sciences (INFORMS) INFORMS is located in Maryland, USA INFORMS

More information

MARKOV DECISION PROCESSES (MDP) AND REINFORCEMENT LEARNING (RL) Versione originale delle slide fornita dal Prof. Francesco Lo Presti

MARKOV DECISION PROCESSES (MDP) AND REINFORCEMENT LEARNING (RL) Versione originale delle slide fornita dal Prof. Francesco Lo Presti 1 MARKOV DECISION PROCESSES (MDP) AND REINFORCEMENT LEARNING (RL) Versione originale delle slide fornita dal Prof. Francesco Lo Presti Historical background 2 Original motivation: animal learning Early

More information

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance In this technical appendix we provide proofs for the various results stated in the manuscript

More information

Markov decision processes

Markov decision processes CS 2740 Knowledge representation Lecture 24 Markov decision processes Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott Square Administrative announcements Final exam: Monday, December 8, 2008 In-class Only

More information

Dynamic Optimization with a Nonsmooth, Nonconvex Technology: The Case of a Linear Objective Function

Dynamic Optimization with a Nonsmooth, Nonconvex Technology: The Case of a Linear Objective Function Dynamic Optimization with a Nonsmooth, Nonconvex Technology: The Case of a Linear Objective Function Takashi Kamihigashi* RIEB Kobe University tkamihig@rieb.kobe-u.ac.jp Santanu Roy Department of Economics

More information

Waiting time characteristics in cyclic queues

Waiting time characteristics in cyclic queues Waiting time characteristics in cyclic queues Sanne R. Smits, Ivo Adan and Ton G. de Kok April 16, 2003 Abstract In this paper we study a single-server queue with FIFO service and cyclic interarrival and

More information

Infinite-Horizon Discounted Markov Decision Processes

Infinite-Horizon Discounted Markov Decision Processes Infinite-Horizon Discounted Markov Decision Processes Dan Zhang Leeds School of Business University of Colorado at Boulder Dan Zhang, Spring 2012 Infinite Horizon Discounted MDP 1 Outline The expected

More information

where u is the decision-maker s payoff function over her actions and S is the set of her feasible actions.

where u is the decision-maker s payoff function over her actions and S is the set of her feasible actions. Seminars on Mathematics for Economics and Finance Topic 3: Optimization - interior optima 1 Session: 11-12 Aug 2015 (Thu/Fri) 10:00am 1:00pm I. Optimization: introduction Decision-makers (e.g. consumers,

More information

Multiple-Item Dynamic Pricing under a Common Pricing Constraint

Multiple-Item Dynamic Pricing under a Common Pricing Constraint Multiple-Item Dynamic Pricing under a Common Pricing Constraint Binbin Liu Joseph Milner Joseph L. Rotman School of Management University of Toronto 105 St. George Street Toronto, ON, Canada M5S 3E6 binbin.liu@rotman.utoronto.ca

More information

Addendum to: Dual Sales Channel Management with Service Competition

Addendum to: Dual Sales Channel Management with Service Competition Addendum to: Dual Sales Channel Management with Service Competition Kay-Yut Chen Murat Kaya Özalp Özer Management Science & Engineering, Stanford University, Stanford, CA. December, 006 1. Single-Channel

More information

MS&E 321 Spring Stochastic Systems June 1, 2013 Prof. Peter W. Glynn Page 1 of 10. x n+1 = f(x n ),

MS&E 321 Spring Stochastic Systems June 1, 2013 Prof. Peter W. Glynn Page 1 of 10. x n+1 = f(x n ), MS&E 321 Spring 12-13 Stochastic Systems June 1, 2013 Prof. Peter W. Glynn Page 1 of 10 Section 4: Steady-State Theory Contents 4.1 The Concept of Stochastic Equilibrium.......................... 1 4.2

More information

Economic lot-sizing games

Economic lot-sizing games Economic lot-sizing games Wilco van den Heuvel a, Peter Borm b, Herbert Hamers b a Econometric Institute and Erasmus Research Institute of Management, Erasmus University Rotterdam, P.O. Box 1738, 3000

More information

Division of the Humanities and Social Sciences. Supergradients. KC Border Fall 2001 v ::15.45

Division of the Humanities and Social Sciences. Supergradients. KC Border Fall 2001 v ::15.45 Division of the Humanities and Social Sciences Supergradients KC Border Fall 2001 1 The supergradient of a concave function There is a useful way to characterize the concavity of differentiable functions.

More information

Random Times and Their Properties

Random Times and Their Properties Chapter 6 Random Times and Their Properties Section 6.1 recalls the definition of a filtration (a growing collection of σ-fields) and of stopping times (basically, measurable random times). Section 6.2

More information