Optimal Control of Parallel Make-To-Stock Queues in an Assembly System

Similar documents
Zero-Inventory Conditions For a Two-Part-Type Make-to-Stock Production System

PROBABILISTIC SERVICE LEVEL GUARANTEES IN MAKE-TO-STOCK MANUFACTURING SYSTEMS

Optimal Control of a Production-Inventory System with both Backorders and Lost Sales

ADMISSION CONTROL IN THE PRESENCE OF PRIORITIES: A SAMPLE PATH APPROACH

Expected Time Delay in Multi-Item Inventory Systems with Correlated Demands

STOCK RATIONING IN A MULTI-CLASS MAKE-TO-STOCK QUEUE WITH INFORMATION ON THE PRODUCTION STATUS. August 2006

Coordinating Inventory Control and Pricing Strategies with Random Demand and Fixed Ordering Cost: The Finite Horizon Case

Tutorial: Optimal Control of Queueing Networks

Optimal Control of an Inventory System with Joint Production and Pricing Decisions

On the Convexity of Discrete (r, Q) and (s, T ) Inventory Systems

New Markov Decision Process Formulations and Optimal Policy Structure for Assemble-to-Order and New Product Development Problems

EFFECTS OF SYSTEM PARAMETERS ON THE OPTIMAL POLICY STRUCTURE IN A CLASS OF QUEUEING CONTROL PROBLEMS

Optimal Policies for an Assemble-to-Order N-System

ON THE STRUCTURE OF OPTIMAL ORDERING POLICIES FOR STOCHASTIC INVENTORY SYSTEMS WITH MINIMUM ORDER QUANTITY

(s, S) Optimality in Joint Inventory-Pricing Control: An Alternate Approach*

Individual, Class-based, and Social Optimal Admission Policies in Two-Priority Queues

THIELE CENTRE. The M/M/1 queue with inventory, lost sale and general lead times. Mohammad Saffari, Søren Asmussen and Rasoul Haji

OPTIMAL CONTROL OF A FLEXIBLE SERVER

Ordering Policies for Periodic-Review Inventory Systems with Quantity-Dependent Fixed Costs

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 43, NO. 3, MARCH

Preservation of Supermodularity in Two Dimensional Parametric Optimization Problems and its Applications

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads

Influence of product return lead-time on inventory control

The Impact of Customer Impatience on Production Control

On service level measures in stochastic inventory control

Near-Optimal Control of Queueing Systems via Approximate One-Step Policy Improvement

No-Holdback Allocation Rules for Continuous-Time Assemble-to-Order Systems

Appendix - E-Companion

On the Partitioning of Servers in Queueing Systems during Rush Hour

Improved Algorithms for Machine Allocation in Manufacturing Systems

Monotone control of queueing networks

Effects of system parameters on the optimal policy structure in a class of queueing control problems

Asymptotically Optimal Inventory Control For Assemble-to-Order Systems

On the static assignment to parallel servers

A cµ Rule for Two-Tiered Parallel Servers

Are Base-stock Policies Optimal in Inventory Problems with Multiple Delivery Modes?

Information Relaxation Bounds for Infinite Horizon Markov Decision Processes

Inventory Control with Convex Costs

A Customer-Item Decomposition Approach to Stochastic Inventory Systems with Correlation

A monotonic property of the optimal admission control to an M/M/1 queue under periodic observations with average cost criterion

Keywords: inventory control; finite horizon, infinite horizon; optimal policy, (s, S) policy.

Discrete Event and Process Oriented Simulation (2)

(s, S) Optimality in Joint Inventory-Pricing Control: An Alternate Approach

A Single-Unit Decomposition Approach to Multiechelon Inventory Systems

MONOTONE CONTROL OF QUEUEING AND PRODUCTION/INVENTORY SYSTEMS. Michael H. Veatch and Lawrence M. Wein

Optimal and Heuristic Echelon ( r, nq, T ) Policies in Serial Inventory Systems with Fixed Costs

Research Article A Partial Backlogging Inventory Model for Deteriorating Items with Fluctuating Selling Price and Purchasing Cost

Coordinated Quality Control in a Two-Stage System

Stationary Probabilities of Markov Chains with Upper Hessenberg Transition Matrices

OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS

Optimality Results in Inventory-Pricing Control: An Alternate Approach

Optimal Policies for a Dual-Sourcing Inventory Problem with Endogenous Stochastic Lead Times

A Hierarchy of Suboptimal Policies for the Multi-period, Multi-echelon, Robust Inventory Problem

Single-stage Approximations for Optimal Policies in Serial Inventory Systems with Non-stationary Demand

Optimal Control of Stochastic Inventory System with Multiple Types of Reverse Flows. Xiuli Chao University of Michigan Ann Arbor, MI 48109

Performance Analysis and Evaluation of Assemble-to-Order Systems with Stochastic Sequential Lead Times

Stochastic Optimization

Optimal and Heuristic Resource Allocation Policies in Serial Production Systems

Single-stage Approximations for Optimal Policies in Serial Inventory Systems with Non-stationary Demand

A New Algorithm and a New Heuristic for Serial Supply Systems.

Central-limit approach to risk-aware Markov decision processes

Dynamic Control of a Tandem Queueing System with Abandonments

This paper studies the optimization of the S T inventory policy, where T is the replenishment interval and S

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

Preservation of Structural Properties in Optimization with Decisions Truncated by Random Variables and Its Applications

Online Appendices: Inventory Control in a Spare Parts Distribution System with Emergency Stocks and Pipeline Information

On the Partitioning of Servers in Queueing Systems during Rush Hour

Dynamic Capacity Management with General Upgrading

A New Approximate Evaluation Method for Two-Echelon Inventory Systems with Emergency Shipments

The Stochastic Knapsack Revisited: Switch-Over Policies and Dynamic Pricing

A Proof of the EOQ Formula Using Quasi-Variational. Inequalities. March 19, Abstract

Point Process Control

Improving Supply Chain Performance: Real-Time Demand Information and Flexible Deliveries

On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies

Distributionally Robust Discrete Optimization with Entropic Value-at-Risk

JOINT PRICING AND PRODUCTION PLANNING FOR FIXED PRICED MULTIPLE PRODUCTS WITH BACKORDERS. Lou Caccetta and Elham Mardaneh

Optimality Inequalities for Average Cost MDPs and their Inventory Control Applications

Dynamic Pricing for Non-Perishable Products with Demand Learning

A Simple Solution for the M/D/c Waiting Time Distribution

Serial Inventory Systems with Markov-Modulated Demand: Solution Bounds, Asymptotic Analysis, and Insights

Multimodularity and Its Applications in Three Stochastic Dynamic Inventory Problems

The Stochastic Economic Lot Sizing Problem for Continuous Multi-Grade Production

Scientiae Mathematicae Japonicae Online, Vol. 5, (2001), POSSIBILITY THAT THE CUSTOMERS GIVE UP PURCHASING THE MERCHANDISE

M/M/1 Queueing systems with inventory

Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors

1 Markov decision processes

Using imperfect advance demand information in ordering and rationing decisions Tan, T.; Güllü, R.; Erkip, N.

Queue Decomposition and its Applications in State-Dependent Queueing Systems

CHARACTERIZING THE PERFORMANCE OF PROCESS FLEXIBILITY STRUCTURES. O. Zeynep Akşin and Fikri Karaesmen

Abstrct. In this paper, we consider the problem of optimal flow control for a production system with one machine which is subject to failures and prod

2. Assumptions and Notation

A PARAMETRIC DECOMPOSITION BASED APPROACH FOR MULTI-CLASS CLOSED QUEUING NETWORKS WITH SYNCHRONIZATION STATIONS

An Uncertain Bilevel Newsboy Model with a Budget Constraint

Maximizing throughput in zero-buffer tandem lines with dedicated and flexible servers

A Primal-Dual Algorithm for Computing a Cost Allocation in the. Core of Economic Lot-Sizing Games

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Stability of the two queue system

Minimizing response times and queue lengths in systems of parallel queues

Equilibrium solutions in the observable M/M/1 queue with overtaking

Appointment Scheduling with Discrete Random Durations

Transcription:

Optimal Control of Parallel Make-To-Stock Queues in an Assembly System Jihong Ou Heng-Qing Ye Weiyi Ning Department of Decision Sciences, NUS Business School, National University of Singapore, Singapore 117592 bizoujh@nus.edu.sg bizyehq@nus.edu.sg g0301025@nus.edu.sg Octorber 2005 NUS Business School Working Paper Abstract This paper considers an assembly system in which components are produced to stock but the final assembly of the product is made to order. Different components are produced on separate dedicated facilities. The production process for each component is modelled as an exponential make-to-stock queue and the assembly process is assumed instantaneous. Demand arrivals are modelled by a Poisson process. At each demand arrival, if all components have inventory in stock, a unit of the product is assembled to satisfy the demand; otherwise the demand is backordered which accrues penalty cost until it is satisfied. On the other hand components in inventory carry holding costs. The problem is to control the component production, based on the inventory levels of all the components in the system, to minimize the discounted total cost. We prove that the optimal control policy is dynamic in that each component should be produced up to an inventory level dependent on the inventory levels of the other components. Furthermore we show that this level is upper bounded so that production of each component is halted when its inventory reaches an absolutely high level independent of the inventory levels of the other components. We also report numerical examples and in particular observe that even the best static base stock policy can perform infinitely worse than the optimal dynamic policy. (Assembly System, Make-to-Stock Queues, Make-to-Order Assembly, Dynamic Optimal Control, Base Stock Policy) 1 Introduction Consider an assembly system that produces multiple components and assembles them into a final product. The components are produced to stock on separate dedicated facilities. The 1

component production processes are random due to uncertainties in production time and/or yield. When all components have inventory in stock it is possible to assemble a unit of finished product by taking one unit from each component inventory. The assembly process is assumed 100% reliable and requiring negligible amount of time. Demand for the finished product arrives in single units. At arrival if all components have inventory in stock the assembly process is executed and a unit of finished product is output to satisfy the demand. Otherwise if some components have zero inventory available in stock the demand is backordered until those components are produced. A profit is realized For each demand satisfied, a penalty cost is levied for demands backordered per unit time. On the other hand the components in inventory carry inventory holding costs per unit time. The problem we study in the paper is to optimally control the component production processes so that the discounted total cost of the assembly system is minimized. The system is a model for some assemble-to-order systems, on which a comprehensive survey of the research literature is done by Song and Zipkin (2003) (see also Cheng, et al. 2002, Lu, et al. 2005, Song and Yao 2002 and Xu 2001). As born out most of the existing works focuses on performance evaluation of some predefined control policies with the assemble-toorder systems. In particular, Song et al. (1999) investigated performances of base stock polices under various order fulfilling schemes for systems more general than what we study in the paper. We are making an attempt at the optimal control problem by considering a relatively simple type of system. The optimal control policy we determine in our work turns out to be a type of dynamic curve control. Namely, each component should be produced up to an inventory level which is dependent on the inventory levels of the other components. Roughly, the higher the other inventory levels of the other components, the higher the produce-up-to level for the concerned component. Furthermore we show that the control curve is upper bounded for each component. Thus for each component there is an inventory level over which production of the component is halted no matter what the inventory levels are of the other components. Also we verify the intuition that, if one component has zero inventory while there are backordered demands and all the other components have positive inventory, the zero inventory component must be produced. Although the dynamic optimal control policy of curve type is not surprising to many researchers, we believe our contributions are two folds. Firstly, we give a formal proof and characterization of the control curve and obtain some interesting asymptotic properties of the policy. Secondly, we learn from an extensive numerical study that the best base stock policy can perform infinitely worse than the optimal dynamic policy. Therefore we provide a cautionary note to the research literature s hitherto attention focus on the base stock policies. 2

Such policies may be easy to manage in practice and easy to analyze in theory, but could lead to significant under performance for the system. The remainder of the paper is organized as follows. Section 2 reviews the literature on the optimal control of make-to-stock queues. Section 3 describes the assembly system and presents the precise formulation of the optimal control problem that we study in the paper. Section 4 characterizes the optimal control policy and proves several asymptotic properties. Section 5 reports results from the numerical study. Section 6 generalizes the optimal control policy to multi-component system. Finally, Section 7 concludes the paper with a summary of results and a discussion on future research. 2 Literature Review Make-to-stock queues were first employed to model production systems by Gavish and Graves (1980) and Soble (1982), who modelled single product single facility production systems by M/D/1 queues and derived base stock optimal control polices. For M/M/1 make-to-stock queues the optimality of the base stock policy was first proved in Li (1988) for the discounted total cost. The result was extended by Feng and Yan (2000) to allow random break downs of the production facility. Later Feng and Xiao (2002) proved the same optimality result under long run average cost. However most of the literature on the optimal control of make-to-stock queues have been concerned with multiple product single facility systems. For these systems the control problem involves an additional issue of product sequencing when the production is on as well as the issue of turning production on and off. Zheng and Zipkin (1990) showed that in the case of two symmetric products the sequencing policy that always produce the product with the longest backorder queue is better than the policy of first-come-first-serve. Wein (1992) investigated the optimal control problem of a general multiclass make-to-stock queue by following the Brownian approximation approach, and proposed a hedging point policy which is a generalization of the base stock policy. Ha (1997a) analytically proved some optimality properties of the hedging point policy. In particular he showed that, for a make-to-stock queue with two products facing Poisson demand processes and requiring identically distributed exponential production times, a dynamic version of the hedging policy is optimal under the objective of minimizing the discounted total cost. The dynamic hedging point policy is of curve control type: one curve determines the on-off region for production and the other curve determines when to switch the product to be produced. Further characterization of the optimal switching curve was obtained by De Vericourt et al. (2000). Our work follows very much the same line of analysis as in Ha (1997a) and establishes similar structure properties of the optimal control. 3

There are also numerical studies on heuristics, e.g., Veatch and Wein (1996) and Peña- Perez and Zipkin (1997), and asymptotic studies on various limiting schemes, e.g., Glasserman and Wang (1998) and Bertsimas and Paschalidis (2001). Other types of control problems for multiclass make-to-stock queues have also been studied. For example, Ha (1997b, 1997c) considered the stock rationing issue, Carr and Duenyas (2000) included the issue of admission control, and Caldenty and Wein (2003) added a new dimension of revenue management. A few works are closely related to ours. One by Chen et al. (1993) concerns the same single product assembly system but in the setting that the assembly is done as soon as there are units in inventory for all the components and all assembled products yield profit. In this case the optimal controls on the component production processes are threshold type with the threshold for each component independent of the others. In Hsu et al. (2005), they also analyze the optimal stocking quantities for components of an assemble-to-order product. In their work, the manufacture faces uncertain and one time demand (and thus no queueing of demand), and need to make pricing for the final product. 3 System Description and Problem Formulation The assembly system we study is illustrated in Figure 1 below for a product with two components. In fact we will provide detailed proofs for the properties of the dynamic optimal control policy only for the case of two components, and will highlight their extensions to multiple components. Figure 1: An Assembly System with Two Parallel Make-To-Stock Queues In the system of Figure 1 the components are produced by two separate facilities. The production processes take exponential amount of time with a rate of µ i for component i, i = 1, 2. The finished product is assembled from one unit of each component. The assembly process is 100% reliable and takes 0 amount of time. Demand for the finished product arrives according to a Poisson process with rate λ. We assume that any unmet demand on arrival is 4

backordered, i.e., there is no rejection of demand. Therefore, we assume µ i > λ, i = 1, 2, to make sure the system s long term stability. At a demand arrival, if both components have inventories in stock, the assembly process is executed to output one unit of finished product to satisfy the demand. Otherwise, the demand is added to a waiting queue which is served in first-come-first-serve order. The halffinished components in inventory carry a holding cost of c + i per unit component per unit time, i = 1, 2. The backordered demands carry a penalty cost of c per waiting demand per unit time. The optimal control problem for the system is to manage the production processes of the two components, based on the component inventories and demand backordering, so that the discounted total cost over infinite time horizon is minimized. We specify a control policy for the system by a production rate function µ(t) = (µ 1 (t), µ 2 (t)) for all time t > 0, where µ i (t) = 0 (production off) or µ i (production on). The policy is said non-anticipatory if, at any t > 0, µ(t) depends only on information prior to t. the collection of all such non-anticipatory control policies. Let U be We use the components inventory positions as the state variables for the system, where a component s inventory position is its inventory in stock minus the number of backordered demands. Given a control policy u = {µ(t) : t > 0} U, we denote the components inventory positions at time t > 0 by x u (t) = (x u 1 (t), xu 2 (t)). Then starting from an initial state x = (x 1, x 2 ) the discounted total cost over infinite time horizon is given by [ ] J u (x) = E e γt c(x(t))dt. 0 Here, γ is the discount factor, and c(x(t)) is the cost function and given as 2 c(x(t)) = c + i w i(t) + c b(t), i=1 where b(t) is the amount of backordered demands at time t, i.e., b(t) = min{x 1 (t), x 2 (t), 0}; and w i (t), i = 1, 2, is the on hands inventory of component i at time t, i.e., w i (t) = x i (t) + b(t). It can be confirmed that the cost c(x 1, x 2 ) is a convex and submodular function. A policy u U is said optimal if it minimizes J u (x) for any starting state x, i.e., it solves the following optimization problem J (x) = inf u U J u (x), for x Z 2. (1) 5

Without loss of generality, we can uniformize the instantaneous rates of changes for the Markov decision process by assuming γ + λ + µ 1 + µ 2 = 1. Then, we obtain the Hamilton-Jacobi condition for this Markov decision process optimal control problem as follows (cf. Chapter 7 of Ross 1970), 2 J(x 1, x 2 ) = c(x 1, x 2 ) + λj(x 1 1, x 2 1) + (µ 1 + µ 2 )J(x 1, x 2 ) + min[0, µ i D i J(x 1, x 2 )], (2) i=1 where, for any function v(x 1, x 2 ), D 1 v(x 1, x 2 ) = v(x 1 + 1, x 2 ) v(x 1, x 2 ), and D 2 v(x 1, x 2 ) = v(x 1, x 2 + 1) v(x 1, x 2 ). 4 Dynamic Optimal Control and Its Properties In this section we characterize the solution to the Hamilton-Jacobi condition (2) and present some of its asymptotic properties. All proofs are collected in the appendix. For notational convenience, we define an operator T on the set of real-valued functions v(x 1, x 2 ) as follows, 2 T v(x 1, x 2 ) = c(x 1, x 2 ) + λv(x 1 1, x 2 1) + (µ 1 + µ 2 )v(x 1, x 2 ) + min[0, µ i D i v(x 1, x 2 )]. (3) i=1 Let V be the set of functions defined on Z 2 such that if v V, then: (a) Submodularity (D 12 v 0): D 1 v(x 1, x 2 ) is nonincreasing in x 2 and D 2 v(x 1, x 2 ) is nonincreasing in x 1 ; (b) Convexity (D 11 v 0, D 22 v 0): D 1 v(x 1, x 2 ) is nondecreasing in x 1 and D 2 v(x 1, x 2 ) is nondecreasing in x 2. The following lemma shows that submodularity and convexity are preserved under the operator T : Lemma 1 If v V, then, a) min[v(x 1 + 1, x 2 ), v(x 1, x 2 )] V, b) min[v(x 1, x 2 + 1), v(x 1, x 2 )] V, and c) T v(x 1, x 2 ) V. Lemma 1 enables us to characterize the solution to the Hamilton-Jacobi condition (2) as in the following theorem: Theorem 1 a) The optimal cost function J(x) is submodular and convex. 6

b) Let B 1 (x 2 ) = min{x 1 : D 1 J(x 1, x 2 ) 0}, and B 2 (x 1 ) = min{x 2 : D 2 J(x 1, x 2 ) 0}. Then, at state x = (x 1, x 2 ), it is optimal to stop production of component 1 if x 1 B 1 (x 2 ), and to stop production of component 2 if x 2 B 2 (x 1 ). c) B 1 (x 2 ) is increasing in x 2, and B 2 (x 1 ) is increasing in x 1. Theorem 1 reveals that the optimal control policy is dynamic and curve type. B 1 (x 2 ) is the produce-up-to level curve for component 1, and it depends on the inventory level of component 2, and vice versa. We state in the following some asymptotic properties of the control curves B 1 (x 2 ) and B 2 (x 1 ). Firstly, we verify the intuition that, if one component s inventory is zero while there are backordered demands and the other component has higher inventory position, then the former component must be produced. Formally this is expressed as in the following theorem: Theorem 2 B 1 (x 2 ) > min(0, x 2 ) and B 2 (x 1 ) > min(0, x 1 ). Next, based on submodularity and convexity, we can conclude that both D 1 J(x 1, x 2 ) and D 1 J(x 1, x 2 ) are monotone over x 1 and x 2, and therefore, the following limits exist: lim D ij(x 1, x 2 ) = f i (x j ), i, j = 1, 2, i j; x i lim D jj(x 1, x 2 ) = g i (x j ), i, j = 1, 2, i j. x i We further show that this convergence is uniform. A function v(x 1, x 2 ) converges to h(x 2 ) uniformly in x 2 as x 1, if for any given ɛ > 0 there exists a constant X 1, independent of X 2, such that for all x 1 > X 1, v(x 1, x 2 ) h(x 2 ) < ɛ. Theorem 3 D i J(x 1, x 2 ) f i (x j ) = c+ i γ uniformly in x j as x i, for i, j = 1, 2, i j; D j J(x 1, x 2 ) g i (x j ) < uniformly in x j as x i, for i, j = 1, 2, i j. Theorem 3 indicates that the value function under the optimal policy is asymptotically linear. In particular, the intuition for the first limit is that, suppose component 1 s inventory is very high, having one more unit of component 1 would incur an additional cost c + 1 In the limit, the net present value of this additional cost is c + 1 0 e γt dt = c+ 1 γ. per unit time. More importantly, Theorem 3 leads to the boundedness of the optimal control curves as stated in the following corollary: 7

Corollary 1 There exists a constant B 1 (independent of x 2 ) such that B 1 (x 2 ) B 1 for all x 2 ; and there exists a constant B 2 (independent of x 1 ) such that B 2 (x 1 ) B 2 for all x 1. same: Finally, we have that for symmetric components the two optimal control curves are the Theorem 4 If c + 1 = c+ 2 and µ 1 = µ 2, then B 1 ( ) = B 2 ( ). However below we will show by a numerical example that, even for symmetric components, it is not necessarily true that B 1 ( ) = B 2 ( ) = const. Thus, a static base stock policy cannot be optimal, and actually we will show that even the best static base stock policy can perform infinitely worse than the optimal dynamic policy. 5 Numerical Examples In this section, we compute the optimal dynamic control policy for two numerical examples, one with symmetric components and one with asymmetric components. For each example we also find through brute force search the best static base stock policy. We remark that the dynamic and static policies differ mostly at the instants when both components are backordered but one is more than the other. The static policy will try to bring both component s inventory up to the predefined base stock levels that are independent of the inventory state of the other component. However, the dynamic policy will first hold production of the less backordered component since the more of it will not reduce the product backordering queue but only increase inventory. As a result, the dynamic policy in the long run incurs less component inventory holding costs. We first evaluate a simple example of a single product with two symmetric components. The product demand arrival rate is λ = 0.29. The two components have the same production rate µ 1 = µ 2 = 0.35 and the same inventory holding cost c + 1 = c+ 2 = 1. The backorder cost is c = 1 and the discount rate γ = 0.01. The optimal dynamic control curves are shown in Figure 2. The best static base stock policy for the example is to set base stock levels at S1 = S2 = 2, under which the system cost is higher than the optimal dynamic base stock policy by 9.52%. Not surprisingly performance difference between the two policies increases with increasing load factor of the system as indicated in Figure 3. Here, we use the same cost parameters as above but increase the system load factor ρ = λ/µ 1 = λ/µ 2. In all instances, we uniformize the rate parameters so that the uniformization equation, i.e., λ + γ + µ 1 + µ 2 = 1, holds. We also expect that the performance gap between the two policies diminish with increasing backorder cost c as indicated in Figure 4, in which all the parameters are the same as in Figure 8

20 15 B 1 (x 2 ) 10 Produce Component 1 5 B 2 (x 1 ) 0 x 2 5 10 15 Produce Both Components Produce Component 2 20 25 30 30 25 20 15 10 5 0 5 10 15 20 x 1 Figure 2: The Dynamic Optimal Control for the Symmetric System 10% 9% Percentage Performance Difference Between Optimal Dynamic and Best Base Stock Policy 8% 7% 6% 5% 4% 3% 2% 1% 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Load Factor ρ Figure 3: Percentage Performance Difference Between the Optimal Dynamic Control and the Best Base Stock Policy at different load levels 9

80% 70% Percentage Performance Difference Between Optimal Dynamic and Best Base Stock Policy 60% 50% 40% 30% 20% 10% 0 10 1 10 0 10 1 10 2 Log(c ) Figure 4: Percentage Performance Difference Between the Optimal Dynamic Control and the Best Base Stock Policy for different backorer costs 1 except that c is varied. This is because for large c the base stock levels set by both the static and dynamic are so high that the chances of having backordering queue are extremely small, therefore the two policies will behave quite similarly. Next, we examine a system with two asymmetric components to illustrate that performance difference between the optimal dynamic policy and the static base stock can be arbitrarily large. The two components production rates are µ 1 = 0.0105, and µ 2 = 0.979, and inventory holding costs are c + 1 = 10 and c+ 2 = 100. The product demand rate is λ = 0.01 with backorder cost c = 100. The discount rate is γ = 0.0005. The optimal dynamic control curves are shown in Figure 5. The best static base stock policy for the example is to set base stock level for component 1 at S 1 1 = 11 and for component 2 at S 2 = 0, under which the system cost is higher than the optimal dynamic base stock policy by 88.2%. We observe that in this system component 2 has a faster production rate, so when both components are backordered the static policy might build up a large queue for component 2 when component 1 is being produced slowly. Thus we experimented with increasing c + 2 and increasing µ 2 /µ 1 while keeping all other parameters the same as in Figure 5. The results are shown in Figure 6, which indicates that indeed performance difference becomes ever larger with larger c + 2. It also confirms the expectation that the difference gap is wider with higher µ 2/µ 1 as in this case faster over producing of less backordered component leads to higher inventory holding costs. 10

20 15 B 1 (x 2 ) 10 Produce Component 1 5 0 B (x ) 2 1 x 2 5 10 Produce Component 2 15 20 Produce Both Components 25 30 30 25 20 15 10 5 0 5 10 15 20 x 1 Figure 5: The Dynamic Optimal Control for the Asymmetric System Percentage Performance Difference Between Optimal Dynamic and Best Base Stock Policy 350% 300% 250% 200% 150% 100% 50% µ 2 / µ 1 = 1.64 µ 2 / µ 1 = 2 µ 2 / µ 1 = 3 µ 2 / µ 1 = 6 high µ 2 low µ 2 0 10 0 10 1 10 2 10 3 Log(c + 2 ) Figure 6: Percentage Performance Difference Between the Optimal Dynamic Control and the Best Base Stock Policy for different µ 2 µ 1 and component 2 s inventory holding cost c + 2 11

6 Extensions to Multiple Component Systems The results of Section 4 can be generalized to multiple component systems due to additivity of the cost terms in the objective function J u (x) over the n components, where x = (x 1,..., x n ). For convenience, denote S i = (x 1,..., x i 1, x i+1,...x n ). We state the extensions but omit their proofs. Theorem 5 b) Let a) The optimal cost function J(x) is submodular and convex; B i (S i ) = min{x i : D i J(x) 0}, at state x, it is always optimal to stop production of component i if its inventory level is at or above B i (S i ); c) B i (S i ) is increasing in all x j, j i. Theorem 6 It is optimal to produce a component that is backlogged and whose inventory position is lower than all the other components. Theorem 7 The optimal cost function J(x) has the following limits, for each component i: D i J(x) c+ i uniformly in S i as x i ; γ D j J(x) g ij (S i ) uniformly in S i as x i, for all j i. Corollary 2 The optimal control curve B i (S i ) is bounded from above for every component i. 7 Conclusion In this paper, the optimal dynamic control policy is characterized for an assembly system in which components are produced to stock but the final assembly of the product is made to order. Not only we prove that the optimal control policy is dynamic in that each component should be produced up to an inventory level dependent on the inventory levels of the other components, but also we show that this level is upper bounded so that production of each component is halted when its inventory reaches an absolutely high level. We also report numerical examples which indicate that comparing with the optimal dynamic control policy even the best static base stock policy can perform infinitely worse. The results are established for systems of a single product and the product assembled from one unit each of the components. A natural next step is to consider systems of multiple products and the products assembled from multiple component units. For those systems there can arise the issues of full or partial fulfillment of the product demands and of priority allocation 12

of components to the product demands. Technically our models are based on assumptions of exponential production times and Poisson demand process which although are standard for optimal control studies, nevertheless are restrictive, might be extended to allow more general production times and demand arrival processes. Acknowledgement. This research is supported in part by the Academic Research Fund under R-314-000-061-112. We would thank Hong Chen, Xiuli Chao and Susan Xu for their comments at the 12th INFORMS Applied Probability Conference, Beijing, China, 23-25 June 2004. References [1] Bertsekas,D.P. 1987. Dynamic Programming: Deterministic and Stochastic Models, Prentice-Hall, Inc. [2] Bertsimas D. and I. Paschalidis. 2001. Probabilistic service level guarantees in make-tostock manufacturing systems, Opns. Res. 49, 119-133. [3] Caldentey, R., L. M. Wein. 2003. Analysis of a decentralized production-inventory system. Manufacturing and Services Opns. Sci. 5, 1-17. [4] Carr, S. and I. Duenyas. 2000. Optimal Admission Control and Sequencing in a Make-to- Stock Make-to-Order Production System. Opns. Res. 48, 709-720. [5] Chen, H., P. Yang, D. Yao, and X. Chao 1993. Optimal Control of a Simple Assembly System. Opns. Res. Letters 14, 199-205. [6] De Vericourt F., F. Karaesmen and Y. Dallery. 2000. Dynamic Scheduling in a Make-to- Stock System: A Partial Characterization of Optimal Policies. Opns. Res. 48, 811-819. [7] Feng,Y., Yan,H. 2000. Optimal Production Control in a Discrete Manufacturing System with Unreliable Machines and Random Demands. IEEE Transactions on Automatic Control, 45, 2280-2296. [8] Feng Y., B. Xiao 2002. Optimal Threshold Control in Discrete Failure-Prone Manufacturing Systems. Forthcoming in IEEE Transaction, Automatic Control. [9] Gavish, B., and S. Graves, 1980. A One-Product Production/Inventory Problem Under Continuous Review Policy. Opns. Res. 28, 1228-1236. 13

[10] Glasserman P., Y. Wang, 1998. Lead-time Inventory Trade-offs in Assemble-to-Order Systems. Opns. Res. 46:6, 858-871. [11] Ha Albert Y. 1997a. Optimal Dynamic Scheduling Policy for a Make-to-Stock Production System. Opns. Res. 45, 42-53. [12] Ha Albert Y. 1997b. Inventory Rationing in a Make-to-Stock Production System with Several Demand Classes and Lost Sales. Management Sci. 43, 1093-1103. [13] Ha Albert Y. 1997c. Optimal Inventory Rationing Policy in a Make-to-Stock Production System with Backordering, Naval Res. Logistics 44, 457-472. [14] Harrison, J. M. 1988. Brownian Models of Queueing Networks with Heterogeneous Customer Populations. Stochastic Differential Systems, Stochastic Control Theory and their Applications, Fleming, W. and P.L.Lions. (eds.). IMA Series, 10, 147-186, Springer. [15] Hsu, V.N., C.Y. Le and K.C. So. 2005. Optimal Component Stocking Policy for Assemble- To-Order Systems with Leadtime-Dependent Component and Product Pricing. Management Science, to appear. [16] Li, L. 1988. A Stochastic Theory of the Firm. Mathematics of Opns. Res. 13, 447-466. [17] Peña Perez A. and P. Zipkin 1997. Dynamic Scheduling Rules For a Multiproduct Maketo-Stock Queue. Opns. Res. 45, 919-930. [18] Porteus, E.L. 1982. Conditions for Characterizing the Structure of Optimal Strategies in Infinite Horizon Dynamic Programs. J. Optim. Theory Appl. 36, 419-432. [19] Ross, P. 1960. Point Processes and Queues, Martingale Dynamics. New York: Springer- Verlag. [20] Ross, S. M. 1970. Applied Probability Models with Optimization Applications. Dover Pubns. [21] Soble M. 1982. The Optimality of Full-Service Policies. Opns. Res. 30, 636-649. [22] Song, J., Xu, H. S., AND Liu, B. 1999. Order-Fulfillment Performance Measures in an Assemble-To-Order System With Stochastic Leadtimes. Opns. Res. 47, 131-149. [23] Cheng, F., M. Ettl, G. Y. Lin, and D. D. Yao. 2003. Inventory-Service Optimization in Configure-to-Order Systems, M&S Opns. Manag. 4, 114-132. [24] Lu, Y., J. S. Song and D. D. Yao. Backorder Minimization in Multiproduct Assemble-to- Order Systems, IIE Trans. forthcoming. [25] Song, J. S. and D. D. Yao. 2002. Performance Analysis and Optimization in Assemble-to- Order Systems with Random Leadtimes, Opns. Res. 50, 889-903. 14

[26] Xu, S. H. 2001. Dependence Analysis of Assemble-to-Order Systems, 359-324 in Supply Chain Structures: Coordination, Information and Optimization, J. S. Song and D. D. Yao, eds, Kluwer Academic. [27] Song, J.-S., AND P. Zipkin. 2003. Supply Chain Operations: Assemble-to-Order Systems, Chapter 11 in Handbooks in Opns. Res. and Management Management Sci., XXX: Supply Chain Management, S. Graves and T. De Kok, eds, North-Holland. [28] Topkis, D. 1978. Minimizing a Submodular Function on a Lattice. Opns. Res. 26, 305-321. [29] Veatch, M.H., L.M. Wein. 1996. Scheduling a make-to-stock queue: index policies and hedging points. Opns. Res. 44:4, 634 647. [30] Wein, L.M. 1992. Dynamic scheduling of a multiclass make-to-stock queue. Opns. Res. 40:4, 724-735. [31] Zheng, Y., P. Zipkin. 1990. A queueing model to analyze the value of centralized inventory information. Opns. Res. 38, 296-307. 15

Appendix Proof of Lemma 1 a) Take any v V and let w be a function on {0, 1} Z 2 defined by w(u, x 1, x 2 ) = (1 u)v(x 1, x 2 ) + uv(x 1 + 1, x 2 ) { v(x1, x 2 ), if u = 0, = v(x 1 + 1, x 2 ), if u = 1. Since v V, for any u, w is submodular in (x 1, x 2 ). Moreover, w(u, x 1, x 2 + 1) w(u, x 1, x 2 ) = { D2 v(x 1, x 2 ), if u = 0; D 2 v(x 1 + 1, x 2 ), if u = 1, and it is decreasing in u. Hence w is submodular in (u, x 2 ) (see Topkis 1978). Next we show that g(x 1, x 2 ) = min[v(x 1 + 1, x 2 ), v(x 1, x 2 )] = min w(u, x 1, x 2 ) u {0,1} is also submodular and convex. To prove submodularity, we assume u 1, u 2 {0, 1} be such that Then, we have either (i) or (ii) as follows. i) u 1 u 2 which yields g(x 1 + 1, x 2 ) = w(u 1, x 1 + 1, x 2 ), and g(x 1, x 2 + 1) = w(u 2, x 1, x 2 + 1). g(x 1 + 1, x 2 + 1) + g(x 1, x 2 ) w(u 1, x 1 + 1, x 2 + 1) + w(u 2, x 1, x 2 ) [w(u 1, x 1 + 1, x 2 ) + w(u 2, x 1 + 1, x 2 + 1) w(u 2, x 1 + 1, x 2 )] +w(u 2, x 1, x 2 ) w(u 1, x 1 + 1, x 2 ) w(u 2, x 1 + 1, x 2 ) + w(u 2, x 1, x 2 ) +[w(u 2, x 1 + 1, x 2 ) + w(u 2, x 1, x 2 + 1) w(u 2, x 1, x 2 )] = w(u 1, x 1 + 1, x 2 ) + w(u 2, x 1, x 2 + 1) = g(x 1 + 1, x 2 ) + g(x 1, x 2 + 1), where the first inequality follows from the definition of g, the second from the submodularity of w in (u, x 2 ), and the third from the submodularity of w in (x 1, x 2 ); or 16

ii) u 1 < u 2, i.e., u 1 = 0, and u 2 = 1, which yields g(x 1 + 1, x 2 + 1) + g(x 1, x 2 ) w(1, x 1 + 1, x 2 + 1) + w(0, x 1, x 2 ) [w(1, x 1 + 1, x 2 ) + w(0, x 1 + 1, x 2 + 1) w(0, x 1 + 1, x 2 )] +w(0, x 1, x 2 ) w(1, x 1 + 1, x 2 ) w(0, x 1 + 1, x 2 ) + w(0, x 1, x 2 ) +[w(0, x 1 + 1, x 2 ) + w(0, x 1, x 2 + 1) w(0, x 1, x 2 )] = w(1, x 1 + 1, x 2 ) + w(0, x 1, x 2 + 1) = g(x 1, x 2 ) + g(x 1, x 2 + 2). Hence in both cases we prove submodularity of g. To prove convexity of g we need to check the following two inequalities: g(x 1, x 2 + 2) + g(x 1, x 2 ) 2g(x 1, x 2 + 1), and (4) g(x 1 + 2, x 2 ) + g(x 1, x 2 ) 2g(x 1 + 1, x 2 ). (5) To check inequality (4) we assume u 1, u 2 {0, 1} be such that g(x 1, x 2 ) = w(u 1, x 1, x 2 ), g(x 1, x 2 + 2) = w(u 2, x 1, x 2 + 2). Then we have either (i) or (ii) as follows. i) u 1 u 2 which yields 2g(x 1, x 2 + 1) w(u 1, x 1, x 2 + 1) + w(u 2, x 1, x 2 + 1) [w(u 2, x 1, x 2 + 1) + w(u 1, x 1, x 2 ) w(u 2, x 1, x 2 )] +w(u 2, x 1, x 2 + 1) [w(u 2, x 1, x 2 + 2) + w(u 2, x 1, x 2 )] +w(u 1, x 1, x 2 ) w(u 2, x 1, x 2 ) = w(u 1, x 1, x 2 ) + w(u 2, x 1, x 2 + 2) = g(x 1, x 2 ) + g(x 1, x 2 + 2), where the first inequality follows from the definition of g, the second from the submodularity of w in (u, x 2 ), and the third from the convexity of w in x 2 ; Or ii) u 1 < u 2, i.e., u 1 = 0 and u 2 = 1, in which case inequality (4) follows straightforwardly. To check inequality (5) we define w(u, x 1, x 2 ) = (1 u)v(x 1, x 2 ) + uv(x 1, x 2 + 1) { v(x1, x 2 ), if u = 0, = v(x 1, x 2 + 1), if u = 1. 17

Then, by applying the above steps to w instead of w, we can verify the inequality (5). b) The proof is similar to that of a) as given above and omitted. c) It is easy to check that c(x 1, x 2 ) V. Since V is closed under addition and multiplication by positive scalars, we obtain that T v(x 1, x 2 ) V. Proof of Theorem 1 Lemma 1 asserts that convexity and submodularity are preserved by operator T. Since the cost function is nonnegative and the control set is finite, by Proposition 13 of Chapter 5 (page 218) in Bertsekas (1987) we can conclude that the optimal cost function J(x) is also submodular and convex. Consequently, for any given x 2, there exists a B 1 (x 2 ) such that, for any x 1 B 1 (x 2 ), D 1 J(x 1, x 2 ) > 0; and for any x 1 < B 1 (x 2 ), D 1 J(x 1, x 2 ) 0. Furthermore, by convexity and submodularity of J(x 1, x 2 ), B 1 (x 2 ) is increasing in x 2. Similar conclusions can be obtained regarding B 2 (x 1 ). Proof of Theorem 2 Let V 1 be the subset of V that contains all the functions with D 2 v(x 1, x 2 ) < 0 for x 1 > x 2 and x 2 < 0. We want to show that V 1 is closed under operator T. Given a function v V 1, we have D 2 T v(x 1, x 2 ) = T v(x 1, x 2 + 1) T v(x 1, x 2 ) = c(x 1, x 2 + 1) + λv(x 1 1, x 2 ) + (µ 1 + µ 2 )v(x 1, x 2 + 1) + min[0, µ 1 D 1 v(x 1, x 2 + 1)] + min[0, µ 2 D 2 v(x 1, x 2 + 1)] c(x 1, x 2 ) λv(x 1 1, x 2 1) (µ 1 + µ 2 )v(x 1, x 2 ) min[0, µ 1 D 1 v(x 1, x 2 )] min[0, µ 2 D 2 v(x 1, x 2 )] = D 2 c(x 1, x 2 ) + λd 2 v(x 1 1, x 2 1) + (µ 1 + µ 2 )D 2 v(x 1, x 2 ) + min[0, µ 1 D 1 v(x 1, x 2 + 1)] + min[0, µ 2 D 2 v(x 1, x 2 + 1)] min[0, µ 1 D 1 v(x 1, x 2 )] min[0, µ 2 D 2 v(x 1, x 2 )] = D 2 c(x 1, x 2 ) + λd 2 v(x 1 1, x 2 1) + min[0, µ 1 D 1 v(x 1, x 2 + 1)] + min[0, µ 2 D 2 v(x 1, x 2 + 1)] { µ1 [D 2 v(x 1, x 2 ) D 1 v(x 1, x 2 )], if 0 > D 1 v(x 1, x 2 ), + µ 1 D 2 v(x 1, x 2 ), if 0 D 1 v(x 1, x 2 ). Note that when x 1 > x 2 and x 2 < 0, having one more unit of x 2 reduces both the backorder cost of the product and the holding cost of component 1, so D 2 c(x 1, x 2 ) < 0. Using the identity D 2 v(x 1, x 2 ) D 1 v(x 1, x 2 ) = D 2 v(x 1 + 1, x 2 ) D 1 v(x 1, x 2 + 1), 18

we obtain D 2 T v(x 1, x 2 ) = D 2 c(x 1, x 2 ) + λd 2 v(x 1 1, x 2 1) + min[0, µ 1 D 1 v(x 1, x 2 + 1)] + min[0, µ 2 D 2 v(x 1, x 2 + 1)] { µ1 [D 2 v(x 1 + 1, x 2 ) D 1 v(x 1, x 2 + 1)], if 0 > D 1 v(x 1, x 2 ), + µ 1 D 2 v(x 1, x 2 ), if 0 D 1 v(x 1, x 2 ). = D 2 c(x 1, x 2 ) + λd 2 v(x 1 1, x 2 1) + min[0, µ 2 D 2 v(x 1, x 2 + 1)] { µ1 D 2 v(x 1 + 1, x 2 ), if 0 > D 1 v(x 1, x 2 ), + min[0, µ 1 D 1 v(x 1, x 2 + 1)] + µ 1 D 2 v(x 1, x 2 ), if 0 D 1 v(x 1, x 2 ). In which the last equality comes from submodular property of V that D 1 v(x 1, x 2 ) is a decreasing function of x 2, so D 1 v(x 1, x 2 + 1) D 1 v(x 1, x 2 ) < 0. Thus all the terms in the last formula are non-positive, which gives D 2 T v(x 1, x 2 ) < 0. Again by Proposition 13 of Chapter 5 (page 218) in Bertsekas (1987) we conclude that the optimal cost function J(x 1, x 2 ) is also inv 1. It implies that B 2 (x 1 ) > x 1 for x 1 < 0. Since B 2 (x 1 ) is an increasing function of x 1, we have B 2 (x 1 ) > min(0, x 1 ). The same conclusion on B 1 (x 2 ) is proved similarly. Proof of Theorem 3 following properties: Let V 2 be the subset of V that contains all the functions with the D i J(x 1, x 2 ) c+ i γ uniformly in x j as x i, for i, j = 1, 2, i j; (6) D j J(x 1, x 2 ) g i (x j v) uniformly in x j as x i, for i, j = 1, 2, i j. (7) Next, we show that V 2 is closed under operator T. To simplify the expressions below we denote vectors (x 1, x 2 ), (1, 0), and (0, 1) as x, e 1, and e 2 respectively. Thus D 1 v(x) = v(x + e 1 ) v(x), for i = 1, 2. Given a function v V 2, lets first consider D 1 T v(x): D 1 T v(x) = T v(x + e 1 ) T v(x) = D 1 c(x) + λd 1 v(x j e j) + j µ jd 1 v(x) + µ 1 [(R 1 v(x + e 1 ) R 1 v(x))] + µ 2 [(R 2 v(x + e 1 ) R 2 v(x))], where R j v(x) = min[0, D j v(x)]. When x 1, by (6), the first five terms converges uniformly; by (7), the last two terms converges uniformly and to the same value min[0, g 1 (x 2 v))]. Thus, we have the following uniform convergence in x 2, as x 1, D 1 T v(x) c + 1 + λc+ 1 j γ + µ jc + 1 γ = c+ 1 γ (γ + λ + j µ j) = c+ 1 γ. 19

Similarly, we have the following uniform convergence in x 1, as x 2, D 1 T v(x) g 2 (x 1 v) + λg 2 (x 1 1 v) + j µ jg 2 (x 1 v) + µ 1 [min[0, g 2 (x 1 + 1 v)] min[0, g 2 (x 1 v)]] := g 2 (x 1 T v). The other cases of (6) and (7) can be verified in the same way. In summary we conclude that operator T preserves the properties (6) and (7). Therefore the optimal cost function J(x 1, x 2 ) is also in V 2. Proof of Corollary 1 Theorem 3 states that D 1 J(x 1, x 2 ) converges to c+ 1 γ uniformly in x 2. Therefore, there exists an X 1 (independent of x 2 ) such that when x 1 > X 1, D 1 J(x 1, x 2 ) > 0 for all x 2, which implies B 1 (x 2 ) X 1. The same goes for B 2 (x 1 ). Proof of Theorem 4 Let V 3 be the subset of V that contains all the functions satisfying D 1 J(x 2, x 1 ) = D 2 J(x 1, x 2 ). We want to show that V 3 is closed under operator T, which is verified below D 2 T v(x 1, x 2 ) = T v(x 1, x 2 + 1) T v(x 1, x 2 ) = D 2 c(x 1, x 2 ) + λd 2 v(x 1 1, x 2 1) + (µ 1 + µ 2 )D 2 v(x 1, x 2 ) + min[0, µ 1 D 1 v(x 1, x 2 + 1)] + min[0, µ 2 D 2 v(x 1, x 2 + 1)] min[0, µ 1 D 1 v(x 1, x 2 )] min[0, µ 2 D 2 v(x 1, x 2 )] = D 1 c(x 2, x 1 ) + λd 1 v(x 2 1, x 1 1) + (µ 1 + µ 2 )D 1 v(x 2, x 1 ) + min[0, µ 1 D 2 v(x 2 + 1, x 1 )] + min[0, µ 2 D 1 v(x 2 + 1, x 1 )] min[0, µ 1 D 2 v(x 2, x 1 )] min[0, µ 2 D 1 v(x 2, x 1 )] = D 1 T v(x 2, x 1 ), for any function v V 3. Therefore, we can conclude that the optimal cost function J(x 1, x 2 ) is also in V 3. Then, by the definitions of B 1 (x 2 ) and B 2 (x 1 ), we obtain B 1 (x 2 ) = min{x 1 : D 1 J(x 1, x 2 ) 0} = min{x 2 : D 2 J(x 2, x 1 ) 0} = B 2 (x 2 ). 20