APPENDIX to Stabilizing Performance in Many-Server Queues with Time-Varying Arrivals and Customer Feedback

Similar documents
ec1 e-companion to Liu and Whitt: Stabilizing Performance

Stabilizing Performance in a Service System with Time-Varying Arrivals and Customer Feedback

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

STABILIZING PERFORMANCE IN NETWORKS OF QUEUES WITH TIME-VARYING ARRIVAL RATES

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates

Making Delay Announcements

Many-Server Loss Models with Non-Poisson Time-Varying Arrivals

The G t /GI/s t + GI Many-Server Fluid Queue: Longer Online Version with Appendix

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Q = (c) Assuming that Ricoh has been working continuously for 7 days, what is the probability that it will remain working at least 8 more days?

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL

ESTIMATING WAITING TIMES WITH THE TIME-VARYING LITTLE S LAW

Author's personal copy

DELAY PREDICTORS FOR CUSTOMER SERVICE SYSTEMS WITH TIME-VARYING PARAMETERS. Rouba Ibrahim Ward Whitt

LIMITS FOR QUEUES AS THE WAITING ROOM GROWS. Bell Communications Research AT&T Bell Laboratories Red Bank, NJ Murray Hill, NJ 07974

Modeling Service Networks with Time-Varying Demand

Wait-Time Predictors for Customer Service Systems with Time-Varying Demand and Capacity

The Performance Impact of Delay Announcements

HITTING TIME IN AN ERLANG LOSS SYSTEM

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS

Choosing Arrival Process Models for Service Systems: Tests of a Nonhomogeneous Poisson Process

What You Should Know About Queueing Models To Set Staffing Requirements in Service Systems

Figure 10.1: Recording when the event E occurs

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science

Non Markovian Queues (contd.)

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY SUPPLEMENTARY MATERIAL

Online Supplement to Are Call Center and Hospital Arrivals Well Modeled by Nonhomogeneous Poisson Processes?

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015

(b) What is the variance of the time until the second customer arrives, starting empty, assuming that we measure time in minutes?

TOWARDS BETTER MULTI-CLASS PARAMETRIC-DECOMPOSITION APPROXIMATIONS FOR OPEN QUEUEING NETWORKS

IEOR 6711: Stochastic Models I, Fall 2003, Professor Whitt. Solutions to Final Exam: Thursday, December 18.

SOLUTIONS IEOR 3106: Second Midterm Exam, Chapters 5-6, November 8, 2012

ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science

USING DIFFERENT RESPONSE-TIME REQUIREMENTS TO SMOOTH TIME-VARYING DEMAND FOR SERVICE

STAFFING A CALL CENTER WITH UNCERTAIN ARRIVAL RATE AND ABSENTEEISM

THE HEAVY-TRAFFIC BOTTLENECK PHENOMENON IN OPEN QUEUEING NETWORKS. S. Suresh and W. Whitt AT&T Bell Laboratories Murray Hill, New Jersey 07974

The G t /GI/s t +GI many-server fluid queue. Yunan Liu & Ward Whitt. Queueing Systems Theory and Applications. ISSN Volume 71 Number 4

Queues and Queueing Networks

Stationary Birth-and-Death Processes Fit to Queues with Periodic Arrival Rate Functions

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

Wait-Time Predictors for Customer Service Systems With Time-Varying Demand and Capacity

Overflow Networks: Approximations and Implications to Call-Center Outsourcing

Designing a Telephone Call Center with Impatient Customers

Since D has an exponential distribution, E[D] = 0.09 years. Since {A(t) : t 0} is a Poisson process with rate λ = 10, 000, A(0.

Delay Announcements. Predicting Queueing Delays for Delay Announcements. IEOR 4615, Service Engineering, Professor Whitt. Lecture 21, April 21, 2015

Motivated by models of tenant assignment in public housing, we study approximating deterministic fluid

Proactive Care with Degrading Class Types

Using a Birth-and-Death Process to Estimate the Steady-State Distribution of a Periodic Queue

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS

ISE/OR 762 Stochastic Simulation Techniques

This paper investigates the impact of dependence among successive service times on the transient and

A Robust Queueing Network Analyzer Based on Indices of Dispersion

Other properties of M M 1

Dynamic Control of Parallel-Server Systems

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

A note on the GI/GI/ system with identical service and interarrival-time distributions

Managing Service Systems with an Offline Waiting Option and Customer Abandonment: Companion Note

2905 Queueing Theory and Simulation PART III: HIGHER DIMENSIONAL AND NON-MARKOVIAN QUEUES

Using a Birth-and-Death Process to Estimate the Steady-State Distribution of a Periodic Queue

1 Delayed Renewal Processes: Exploiting Laplace Transforms

Stationary Birth-and-Death Processes Fit to Queues with Periodic Arrival Rate Functions

Introduction to Markov Chains, Queuing Theory, and Network Performance

We consider how two networked large-scale service systems that normally operate separately, such as call

Markov Chains. X(t) is a Markov Process if, for arbitrary times t 1 < t 2 <... < t k < t k+1. If X(t) is discrete-valued. If X(t) is continuous-valued

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads

Using a Birth-and-Death Process to Estimate the Steady-State Distribution of a Periodic Queue

Modelling the risk process

A Note on the Event Horizon for a Processor Sharing Queue

Queueing Systems: Lecture 3. Amedeo R. Odoni October 18, Announcements

A Diffusion Approximation for the G/GI/n/m Queue

Queueing Theory II. Summary. ! M/M/1 Output process. ! Networks of Queue! Method of Stages. ! General Distributions

Optional Stopping Theorem Let X be a martingale and T be a stopping time such

On the static assignment to parallel servers

Advanced Computer Networks Lecture 3. Models of Queuing

Part I Stochastic variables and Markov chains

SENSITIVITY TO THE SERVICE-TIME DISTRIBUTION IN THE NONSTATIONARY ERLANG LOSS MODEL

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Online Supplement to Creating Work Breaks From Available Idleness

Stochastic-Process Limits

Continuous-time Markov Chains

Introduction to Queueing Theory with Applications to Air Transportation Systems

HEAVY-TRAFFIC LIMITS FOR STATIONARY NETWORK FLOWS. By Ward Whitt, and Wei You November 28, 2018

Waiting time characteristics in cyclic queues

IEOR 3106: Second Midterm Exam, Chapters 5-6, November 7, 2013

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals

A Study on M x /G/1 Queuing System with Essential, Optional Service, Modified Vacation and Setup time

Queueing systems. Renato Lo Cigno. Simulation and Performance Evaluation Queueing systems - Renato Lo Cigno 1

BRAVO for QED Queues

SERVER STAFFING TO MEET TIME-VARYING DEMAND

BIRTH DEATH PROCESSES AND QUEUEING SYSTEMS

Slides 9: Queuing Models

Cross-Selling in a Call Center with a Heterogeneous Customer Population. Technical Appendix

Online Supplement to Creating Work Breaks From Available Idleness

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/6/16. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Transcription:

APPENDIX to Stabilizing Performance in Many-Server Queues with -Varying Arrivals and Customer Feedback Yunan Liu and Ward Whitt Department of Industrial Engineering North Carolina State University Raleigh, NC 7695 Department of Industrial Engineering and Operations Research Columbia University New York, NY 7 yliu48@ncsu.edu, ww4@columbia.edu February 6, 6 Abstract Analytical approximations are developed to determine staffing levels that stabilize performance at designated targets in a many-server queueing model with time-varying arrival rates, customer feedback and abandonment. To provide a flexible model that can be readily fit to system data, Markovian routing is not assumed. Instead, the model has Bernoulli routing, with at most finitely many feedbacks, where the feedback probabilities, service-time and patience distributions all may depend on the visit number. Before returning to receive a new service, the fed-back customers experience delays in an infinite-server or finite-capacity queue, where the parameters may again depend on the visit number. A many-server heavy-traffic FWLLN shows that the performance targets are achieved asymptotically as the scale increases. A new refined modified-offered-load approximation is developed to obtain good results with low waiting-time targets. Simulation experiments confirm that the approximations are effective

Overview This is an appendix to the main paper [9], providing additional supplementary material. In we present additional results supplementing the DIS approximation in of [9]. In 3 we give the proof of the asymptotic effectiveness (Theorem in the main paper). In 4 we elaborate on details of our estimation procedures for the performance functions. In 5 we present additional results from simulation experiments, supplementing 5 and 6 of the main paper. In 6 we present additional estimates of the implied empirical Quality of Service (QoS) function, supplementing 5.3 of [9]. Finally, in 7 we examine the index of dispersion for counts of the various arrival processes in the feedback model to see if we see significant deviation from a nonhomogeneous Poisson process, which might degrade the DIS-MOL approximation. Supplement to the DIS Model in of [9] In this section we present additional material supplementing the DIS approximation in of the main paper [9]. We first give additional results in special cases.. Sinusoidal Arrival Rate Since many service systems have daily cycles, it is natural to consider sinusoidal and other periodic arrival rates, as was done in [, 4, 7]. For periodic arrival processes, we can simply focus on the dynamic steady state if we start the system at the infinite past. Theorem. Consider the DIS approximation for the (M t /GI, GI/s t + GI, GI) + (GI/ ) model specified above, starting in the distant past with specified delay target w > and with sinusoidal arrival-rate function λ(t) = a+b sin(ct). Then Q (t), B (t), O(t), Q (t) and B (t) are independent Poisson random variables having sinusoidal means E[Q (t)] = E[T ](a + γ(t,e )b sin(ct θ(t,e ))), E[B (t)] = F (w)e[s ] (a + γ(s,e )b sin[c(t w) θ(s,e )]), E[O(t)] = p F (w)e[u] (a + γ(s )γ(u e )b sin[c(t w) θ(s ) θ(u e )]), E[Q (t)] = p F (w)e[t ] (a + γ(s )γ(u)γ(t,e )b sin[c(t w) θ(s ) θ(u) θ(t,e )]), E[B (t)] = p F (w) F (w)e[s ] (a + γ(s )γ(u)γ(s,e )b sin[c(t w) θ(s ) θ(u) θ(s,e )]), where θ(x) arctan(φ (X)/φ (X)), γ(x) φ (X) + φ (X), φ (X) E[sin(cX)], φ (X) E[cos(cX)]. The abandonment rates from the two waiting rooms are sinusoidal ξ (t) = af (w) + γ(a)b sin[ct θ(a)], ξ (t) = a p F (w) F (w) + p F (w)γ(s )γ(u) γ(a)b sin[c(t w) θ(s ) θ(u) θ(a)], where θ(x) φ (X)/ φ (X), γ(x) φ (X) + φ (X), φ (X) E[sin(cX) {X<w} ], φ (X) E[cos(cX) {X<w} ]. The rates of entering the two service facilities are sinusoidal β (t) = λ(t w) F (w), β (t) = p F (w) F (w) (a + γ(s )γ(u)b sin[c(t w) θ(s ) θ(u)]), The departure rates from the two service facilities are sinusoidal σ (t) = F (w) (a + γ(s )b sin[c(t w) θ(s )]), σ (t) = p F (w) F (w) ( a + γ(s ) γ(u)b sin[c(t w) θ(s ) θ(u)] ).

The arrival rate to the second waiting room is sinusoidal λ F (t) = p F (w) (a + γ(s )γ(u)b sin[c(t w) θ(s ) θ(u)]). Remark. (extreme values of the sinusoidal performance functions) Note the extreme values of E[Q (t)], E[B (t)], E[O(t)], E[Q (t)] and E[B (t)] occur at t Q = t λ + θ(t,e )/c, t B = t λ + w + θ(s,e )/c, t O = t λ + w + (θ(s ) + θ(u e ))/c, t Q = t λ + w + (θ(s ) + θ(u) + θ(t,e ))/c, t B = t λ + w + (θ(s ) + θ(u) + θ(s,e ))/c, respectively, where t λ = π/c + nπ/c for n integer are times at which the extreme values of λ(t) occurs. Their extreme values are respectively. E[Q (t Q )] = E[T ](a + γ(t,e )b), E[B (t B )] = F (w)e[s ] (a + γ(s,e )b), E[O(t O )] = p F (w)e[u] (a + γ(s )γ(u e )b), E[Q (t Q )] = p F (w)e[t ] (a + γ(s )γ(u)γ(t,e )b), E[B (t B )] = p F (w) E[S] (a + γ(s )γ(u)γ(s,e )b), It is interesting to investigate how the new feature of delayed feedback influence the variation of the OL function. In particular, we want to see if the relative amplitude of the new OL function is flattened or exaggerated compared to the old one. However, the general scheme is complicated because the OL function strongly depends not only on the basic model parameters F i, G i, H and λ, it also depends on the target service level w. For the rest of this section, we assume that F = F = F and G = G = G. Under that condition, we consider two special cases: (i) exponential service (S) and orbit (U) times and (ii) deterministic service and orbit times. Let RA(m) and RA(m ) be the relative amplitude (relative variation around the average) of the new and old OL functions, respectively. We also want to investigate the time lag incurred by the feedback structure. Let the phase difference of the two OL functions be P H(m, m ) P hase(m ) P hase(m). The following result is proved in the appendix. Theorem. Consider the DIS approximation for the (M t /GI, GI/s t + GI, GI) + (GI/ ) model specified above with F = F = F and G = G = G. Let the system start empty in the distant past with specified delay target w > and with sinusoidal arrival-rate function λ(t) = a + b sin(ct). Then the OL function m(t) E[B (t)] + E[B (t)] is sinusoidal m(t) = F ( (w)e[s] a( + p F (w)) + b γ(s e ) u + v sin[c(t w) θ] ), () where θ arctan(u/v), u sin[θ(s e )]+p F (w)γ(s)γ(u) sin( θ), v cos[θ(s e )]+p F (w)γ(s)γ(u) cos( θ), θ cw + θ(s) + θ(u) + θ(s e ), θ(x) φ (X)/φ (X), γ(x) φ (X) + φ (X), φ (X) E[sin(cX)], φ (X) E[cos(cX)]. 3

(i) If both service (S) and orbit (U) times are exponential, then RA(m) < RA(m ) if ) ) ( + ( c µ + c δ >. (ii) If both service and orbit times are deterministic, then RA(m) RA(m ). Furthermore, in both cases lim c RA(m) RA(m ) =, lim P H(m, c m ) =. Proof. First, the offered load formula in (3) within Theorem 3 of [9] can be easily verified by adding up E[B (t)] and E[B (t)] in Theorem of [9]. So the relative amplitude of m (t) and m(t) E[B (t)] are RA(m ) = b γ(s e) a and RA(m) = b γ(s e) u + v a( + p F (w)). Therefore, it remains to determine the ratio RA(m) RA(m ) = = u + v + p F (w) + p F (w) γ(s) γ(u) + p F (w)γ(s)γ(u) cos[cw + θ(s) + θ(u)] + p F. () (w) If S and U are exponentially distributed with rate µ and δ respectively, it is easy to see that φ (S) = c/µ + c /µ and φ (S) = + c /µ, so that Similarly, we have γ(s) = γ(u) = ( ) c and θ(s) = arctan. (3) + c /µ µ ( c ) and θ(u) = arctan. (4) + c /δ δ Plugging (3) and (4) into () yields RA(m) RA(m ) = < p + F (w) (+c /µ )(+c /δ ) + p F (w) cos[cw+θ(s)+θ(u)] (+c /µ )(+c /δ ) + p F (w) + p F (w) + p F (w) + p F =, (w) 4

when the condition in (i) holds. The time lag ( u ) P H(m, m ) = arctan θ(s e ) v [ ( sin arctan c µ = arctan = arctan [ cos as c. ( arctan c µ c + p F [ (w) sin µ + p F [ (w) cos If S and U are deterministic, then Therefore, from () we have )] + p F [ (w) sin )+arctan( c δ ) ] cw+ arctan( c µ (+c /µ )(+c /δ ) )] + p F [ (w) cos cw+ arctan( c µ )+arctan( c δ ) ] (+c /µ )(+c /δ ) cw+ arctan( c µ )+arctan( c δ ) ] ( +c /δ cw+ arctan( c µ )+arctan( c δ ) ] c arctan µ +c /δ γ(s) = γ(u) =, θ(s) = c S = c µ and θ(u) = c U = c δ. RA(m) RA(m ) = + p F (w) + p F (w) cos[c(w + S + U)] + p F (w) + p F (w) + p F (w) + p F =. (w) ( ) c arctan µ Since S = /µ, S e Unif(, /µ), which implies that θ(s e ) = c/µ. Therefore, the time lag ( ) sin c P H(m, m µ + p F ( ) (w) sin cw + c µ + c δ + c µ ) = ( ) cos c µ + p F ( ) c (w) cos cw + c µ + c δ + c µ µ as c.. Additional Results We first consider how the (M t /GI, GI/s t +GI, GI)+(GI/ ) model simplifies when the arrival-rate function is constant. When the arrival rate is constant, i.e., λ(t) = λ, the steady-state performance functions can be easily obtained. This analysis entails simple calculations for a five-queue IS network, which in particular simplifies to five IS queues in series. Here we display the results for the simple case in which G = G = G and F = F = F. Corollary. (steady state performance of DIS-OL) Consider the DIS approximation for the (M t /GI, GI/s t + GI, GI) + (GI/ ) model in Theorem of [9] with constant arrival rate λ, G = G = G, F = F = F and delay target w. The steady-state (as t ) numbers of customers in the waiting rooms, in the service facilities, and in the orbit room, Q ( ), B ( ), O( ), Q ( ), B ( ), are independent Poisson random variables with means E[Q ( )] = λ E[T ], E[B ( )] = F (w) λ E[S], E[O( )] = p F (w) λ E[U], E[Q ( )] = p F (w) λ E[T ], E[B ( )] = p F (w)λ E[S], 5 )

where T A w. Thus, X( ), the steady-state total number of customers in the system is a Poisson random variable with a mean E[X( )] = E[Q ( )] + E[Q ( )] + E[B ( )] + E[B ( )] = λ ( + p F (w) ) ( E[T ] + F (w)e[s] ). If the system is initially in steady state, the processes counting the numbers of customers abandoning from waiting room and are Poisson processes with rates α = λ F (w) and α = λ p F (w) F (w). The processes counting the numbers of customers entering service facility and are Poisson processes with rates β = λ F (w) and β = λ p F (w). The departure processes (counting the number of customers completing service) from service facility and are Poisson processes with rate ( p) σ (t) and σ (t), where σ = λ F (w) and σ = p λ F (w). The process counting the numbers of customers entering the second waiting room is a Poisson process with rate function λ F = p λ F (w). As discussed in [] and [3], simple linear and quadratic approximations derived from Taylor series for general arrival-rate functions can be convenient. These approximations show simple time lags and space shifts. As before, we ignore the fact that these arrival rate functions are necessarily negative on part of the domain. Assuming that the approximations are used with proper care, they can still be very useful. Theorem.3 Consider the DIS approximation for the (M t /GI, GI/s t + GI, GI) + (GI/ ) model specified above with G = G = G, F = F = F, starting in the distant past with specified delay target w and with quadratic arrival-rate function λ(t) = a + bt + ct. Then Q (t), B (t), O(t), Q (t) and B (t) are independent Poisson random variables having quadratic means E[Q (t)] = E[T ](λ(t E[T e ]) + c V ar(t e )), E[B (t)] = F (w) E[S] (λ(t w E[S e ]) + c V ar(s e )), E[O(t)] = p F (w) E[U] (λ(t w E[S] E(U e )) + c(v ar(s) + V ar(u e ))), E[Q (t)] = p F (w) E[T ] (λ(t w E[S] E[U] E[T e ]) + c(v ar(u) + V ar(s) + V ar(t e ))), E[B (t)] = p F (w) E[S] (λ(t w E[S] E[U] E[S e ]) + c(v ar(u) + V ar(s) + V ar(s 3 ))). We next consider a slightly generalized scheme. Suppose the system is not empty at the beginning of the day (at time ) and the initial number of waiting customers in the system along with their elapsed waiting times are observed (not random). For instance, there are n customers waiting in a single line at time and their elapsed waiting times are w w w n. The goal is to design an appropriate staffing function s(t) for t T such that the average customer waiting times can be stabilized in the next period [, T ] (e.g., T = 8 or T = 4). A typical example is the Manhattan DMV office. On a regular morning, by the opening of the office (8: am), a long line of waiting customers may have already formed outside the door. Since the system is initially 6

non-empty, analogously the DIS approximation has an initially non-empty waiting room, i.e., there are n customers in the first waiting room at time. The next Theorem characterizes the OL function of this case. Let S (i) k, A(i) k be the service and patience time of the kth customer in his ith visit, let U k be the orbit time of the kth customer, k n, i =,. Let R k if the kth customer revisits the system and let R k otherwise. Theorem.4 (the initially non-empty DIS model) Consider the DIS approximation for the (M t /GI, GI/s t + GI, GI) + (GI/ ) model model with customer feedback, G = G = G, F = F = F, and delay target w specified above, starting at time. Suppose initially there are n customers waiting in the queue with elapsed waiting times w w w n. The approximation makes W (t) = w with probability and the probability of abandonment F (w) for all arrivals by letting n(w) customers entering service at t =, where n(w) inf{k : w k w}. Moreover, the total numbers of customers in the waiting rooms, in the service facilities, and in the orbit room at time t, Q (t), B (t), Õ(t), Q (t), B (t), can be written as sums of independent random variables: Q (t) = Q (t) + Q (t), B (t) = B (t) + B (t) + B (t), Õ(t) = O(t) + O (t) + O (t), Q (t) = Q (t) + Q (t) + Q (t), B (t) = B (t) + B (t) + B (t), where Q (t), B (t), O(t), Q (t) and B (t) are Poisson random variables with mean in Theorem of [9], B (t), O(t), Q (t) and B (t) are Binomial random variables with parameters n B = n O = n Q = n B = n(w), p B = Ḡ(t), po = pp (S () < t, S () + U > t) = p p Q = p P (S () + U < t, S () + U + A () > t) = p t t p B = p P (S () + U + A () < t, S () + U + A () + S () > t) = p H(t x)dg(x), F (t x)dg H(x), t Ḡ(t x)dg H F (x), and Q (t), B (t), O(t), Q (t) and B (t) can be expressed as sums of independent indicator random variables, in particular, Q (t) = B(t) = O (t) = Q (t) = B(t) = n k=n(w)+ n {wk +t w, A () k >t+w k}, {wk +t>w, A () k >w, S() k >t (w w k)}, k=n(w)+ n {wk +t>w, A () k >w, S() k <t (w w k), U k +S () k >t (w w k), R k =}, k=n(w)+ n {wk +t>w, A () k >w, S() k <t (w w k), t (w w k )<U k +S () k <t (w w k), U k +S () k +A() k >t (w w k), R k =}, k=n(w)+ n {wk +t>w, A () k >w, S() k <t (w w k), U k +S () k <t (w w k), A () k >w, U k+s () k +S() k >t (w w k), R k =}. k=n(w)+ 7

Remark. (interpretations of random variables in Theorem??) In order to stabilize the potential waiting time for t >, we let n(w) customers (who have waited longer than target w by t = ) enter service immediately at time. According to the same policy, the rest n n(w) initial customers have to wait for extra time because their elapsed waiting time w k < w for n(w) + k n. They will enter service after time at different moments t k w w k. It is easy to see that B (t) (O (t), Q (t) and B (t)) denotes the number of initial n(w) customers who entered service at time and are in the first service facility (orbit, second waiting room and the second service facility) at time t. Similarly, Q (t) (B (t), O(t), Q (t) and B (t)) denotes the number of initial n n(w) customers who entered service after time and are in the first waiting room (first service facility, orbit, second waiting room and the second service facility) at time t. 3 Proof of Theorem in [9] We first act as if the service facility can be partitioned into two parts, one dedicated to the new arrivals, with the other dedicated to the fed-back customers. In model n, the capacities of these two parts are s i,n (t) ns i (t) for i =,. For the fluid model, the corresponding capacities are s i (t) = m i (t) E[B i (t)] for i =,. We first discuss the fluid limit and then establish the FWLLN for the partitioned system. Afterwards, we show that the performance in the original system is asymptotically equivalent to the performance in the partitioned system. 3. The Partitioned Fluid Model It is significant that the limit in the FWLLN for the each component of the partitioned system is a deterministic fluid model. The fluid model for the first component also has parameter vectors (λ, s, F, G, w, α ), but they have a different interpretation: Now λ(t) is the arrival rate of the divisible deterministic fluid at time t. A proportion F (x) of the fluid to directly enter the queue from the external input abandons by time x of entering the queue if it has not yet entered service; a proportion G (x) of the fluid to directly enter service from the external input completes service by time x after it has begun service. The staffing function s (t) stabilizes the waiting time in the fluid model at w. We refer to 4 of [5] for a discussion of the connection between the DIS model and the fluid model and of [5] for the explicit performance functions achieving the waiting-time target w. Just as in [5], the content of the two types of fluid in service and queue are described by twoparameter deterministic functions B i (t, y) and Q i (t, y); B i (t, y) is the quantity of type-i fluid in service at time t that has been so for time at most y, while Q i (t, y) is the quantity of type-i fluid in queue at time t that has been so for time at most y. The total content of type-i fluid in service and in queue at time t are thus B i (t) = B i (t, ) and Q i (t) = Q i (t, ), respectively. The overall totals are the sums over the two types. Given the staffing function that we have used, we can verify that the type-i fluid content in service is B i (t) = m i (t) and the overall content is B(t) = m(t) for all t > w, and that all fluid waits exactly time w before entering service if it does not first abandon. We summarize these observations in the following theorem. (We first establish this result for the partitioned model and then the original model.) Theorem 3. (DIS staffing stabilizes the waiting time in the fluid model with feedback) The DIS staffing in of [9] is the unique staffing that stabilizes the waiting time at w and the abandonment probabilities at α i = F i (w) for i =, in the (G t /GI, GI/s t + GI, GI) + (GI/ ) fluid queue with Bernoulli feedback. All fluid waits in queue exactly time w before entering service if it has not 8

abandoned. Just as in Theorem of [9], the abandonment rates of the two kinds of fluid are ξ i (t), the rates that the two kinds of fluid enter service are β i (t), the service-completion rates of the two kinds of fluid are σ i (t) and the feedback arrival rate function is λ F (t). 3. The FWLLN for the Partitioned System For the partitioned system, we can establish the FWLLN recursively, just as we analyze the DIS model in. We first consider the model with staffing functions s,n (t) containing only the external arrivals. For this model, just as in [7], we can apply the established FWLLN in [6] to obtain the desired FWLLN. Since the waiting time target is w, we can use of [5] to uniquely characterize the limiting fluid model, which has staffing function s i (t). We now proceed forward to the next queue. From this initial FWLLN for the first partition of the system, we obtain the limit for the sequence of scaled departure counting processes of these D () customers, denoted by { n : n }. Given that D () in D, we can next obtain the corresponding limit for the sequence of customers fed back after service completion, denoted by (,) { D n : n }. For that purpose, let {X n,,k : k } be a sequence of i.i.d. routing random variables with X n,i,k = if the j th departure in D n () is fed back. Then we can represent D n (,) (t) explicitly as and the associated scaled version as D (,) n (t) = D n () (t) k= D () n {Xn,,k =}, t, (5) D n (,) (t) = i Z n (t) D () n (t), t, (6) where is the composition function and nt Z n (t) n {Xn,,k =} pt in D (7) k= We now apply the continuous mapping theorem in 3.4 of [] for the continuous composition functions appearing in (6), see Theorem 3.. of [], with the established limit for D n () and the FWLLN for partial sums of i.i.d. random variables Z (,) n,i,j. to obtain the limit D n D (,). (,) Given the limit for D n just established, we can apply the FWLLN for the IS orbit queue in [] to obtain the FWLLN for all the processes associated with the orbit queue, including its departure process, which serves as the arrival process to the second part of the partitioned system, serving the fed-back customers. Finally, we obtain a corresponding FWLLN for the second partition of the partitioned system, serving the fed-back customers, using the same reasoning as above. Since the waiting-time target is w for both classes the fluid models are uniquely determined by Theorem 8 in of [5]. Hence all the performance functions are as described. It only remains to show that the partitioned system is asymptotically equivalent to the original system. We first discuss the relation between the corresponding fluid models in the partitioned system. 3.3 Additivity of Fluid Models We now observe that the limiting fluid model in the theorem is actually equivalent to the fluid limit for the partitioned system, because both systems have the common constant waiting time w. This 9

equivalence is a consequence of the following more general theorem about fluid models, which we state without proof. Theorem 3. (additivity of fluid models) Two fluid models with the FCFS discipline indexed by i that are combined into a two-class FCFS fluid queue by having total arrival-rate function λ = λ +λ and staffing s(t) = s (t) + s (t) have additive performance with B(t, x) = B (t, x) + B (t, x) and Q(t, x) = Q (t, x) + Q (t, x) for all t, x (8) if and only if the two boundary waiting functions w i (t) coincide, in which case w(t) = w (t) = w (t) for all t. 3.4 Asymptotic Equivalence Even though the limiting fluid models of the partitioned system and the original system are the same, it remains to show that the established FWLLN for the partitioned system implies a corresponding FWLLN for the original system, with identical limits. The problem is that the two kinds of customers interact in the original system, so that the partitioning is not actually valid for each n. However, we can show that the customers from the different components of the partition interact over an asymptotically small part of the total capacity. Thus, the difference can be shown to be asymptotically negligible. To visually think of the separation, we can think of the servers being numbered, with arrivals from one class taking the smallest numbered free server, while arrivals from the other class taking the largest numbered free server. Then the two classes contend only in the middle, when the system becomes full (which will be the case here after an initial transient period). We will sketch the argument to show the asymptotic equivalence. To do so, we observe from of [5] that a small perturbation of the waiting-time target w in the fluid model yields a controlled uniformly small perturbation of the staffing over any bounded time interval [a, b], where a > w. Let s i (t, w) be the staffing function for the two classes (i = for external input and i = for the fedback fluid) at time t as a function of the constant waiting-time target w. It follows that, for any ɛ >, there exists δ δ(ɛ) > so that B n i s i (t, w + ɛ) δ < s i (t, w) < s i (t, w ɛ) + δ for a t b and i =,. (9) Moreover, by the FWLLN for the partitioned system just established, the scaled content (t, w) can be made arbitrarily close to the staffing s(t, w), i.e, for any a > w >, sup { B i n (t, w) s i (t, w) } as n. () a t b Hence, given w > ɛ >, suppose that the waiting-time target is required to fall in the interval [w ɛ, w + ɛ]. Then, there exists δ δ(ɛ) > and n such that for n n s i (t, w + ɛ) δ < B n i (t, w) < s i (t, w ɛ) + δ for a t b and i =,. () Of course, in our combined system we also have s(t, w) = s (t, w) + s (t, w), but we now have slack so that the content of one class can be too large, while the content of the other class is too small. Since δ(ɛ) as ɛ and we can let ɛ be arbitrarily small, we achieve the fluid limit of the partitioned model for the original model. Hence, the proof of Theorem of [9] is complete. In closing, we remark that an alternative proof can be done by the compactness argument, where we show that the sequence of scaled queueing processes are tight and then uniquely characterize the

limit in terms of the fluid model. Tightness for the sequence of class-i scaled departure counting processes holds because the increments, conditional on any history, are stochastically bounded over any bounded interval by a constant rate Poisson process, with rate equal to the supremum of the staffing function multiplied by the supremum of the service-time hazard-rate function, which is bounded because the system starts empty and the service-time distributions have positive finite densities. 4 Estimation Procedures for the Performance Functions We now provide extra details about our estimation procedure for the following time-dependent performance functions: (i) the mean potential waiting time, E[W (t)], (ii) the class i abandonment probability, P (i) t (Ab), i =,, (iii) the delay probability, P t (Delay), and (iv) the mean queue length for both the main queue and the orbit queue, E[Q(t)] and E[O(t)]. We estimate these performance measures in a time interval [, T ] with T =. At the j th simulation replication, we periodically generate virtual arrivals (of both classes) at deterministic times t, t, 3 t,..., T, with t =.. These virtual customers have the same patience-time distribution; they abandon as if they were real customers, but they will not be removed from the queue if they abandon. They still wait for their turn to enter service so that we can record their virtual waiting times (although we won t let them enter so they don t affect the dynamics in the service facility). We use indicator variables ξ d,j (k) (ξ (i) a,j (k)) to indicate if the kth (type i) virtual customer of the j th replication is delayed (abandons), i =,, k =,..., T/ t. We use W j (k) to record the virtual waiting time of the k th virtual arrival, that is E j (k) A j (k), where E j (k) is the time this virtual customer enters service and A j (k) = k t is its arrival time. For the queue-length processes, we sample the continuoustime queue-length process at discrete time points t, t,..., T, Q j (k) and O j (k). Here we make sure to exclude (not counting) those virtual arrivals in queue. We generate N = independent replications to estimate the delay probability, abandonment probability, mean potential waiting time and mean queue lengths. Specifically, for at t k k t, k =,..., T/ T, we approximate E[W (t k )], P (i) t k (Ab), P tk (Delay), E[Q(t k )] and E[O(t k )] with N N W j (k), j= N N j= ξ (i) a,j (k), N N ξ d,j (k), j= N N Q j (k) j= and N N O j (k). j= 5 Additional Experiments In this section we supplement the main paper by presenting additional results from simulation experiments. We start in 5. by considering a variant of the main example in 5 of [9] with much smaller external arrival rate, with λ reduced from to. In 5.3 we consider a variant of the main example in 5 of [9] with more balanced mean service times, now having E[S ] =. instead of E[S ] = 5. (with E[S ] = ). In 5.4 we consider an example with two feedback opportunities, as shown in Figure 8 of [9]. In all examples we use the arrival rate function λ(t) = λ( + r sin(t)) = ( + r sin(t)), t, () for average arrival rate λ and relative amplitudes r, denoted by M t (r). We usually let λ = and r =..

5. Lower Arrival Rates and Staffing We now supplement 5 by showing in Figures and of the performance functions in the same (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model except that λ is reduced from to. As the scale decreases, the discretization becomes a more and more serious issue. Thus there is a limit to the stabilization that can be achieved with very small scale. Here we increase the number of replications to 5. Figure : Performance functions in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue: four cases of high waiting-time (low QoS) targets (w =.,.,.3 and.4) and simple DIS staffing. Arrival rate 8 4 6 8 4 6 8 queue length 5 orbit number 4 6 8 4 6 8 4 4 6 8 4 6 8 Abandonment probability (new) Abandonment probability (old) Delay probability...4..8.6 4 6 8 4 6 8 4 6 8 4 6 8.4 4 6 8 4 6 8.6 delay.4. 3 4 6 8 4 6 8 Staffing 4 6 8 4 6 8 To show the discretization effect, we display the DIS staffing for this model with λ = with high and low waiting-time targets, respectively, in Figures 3 and 4. Figure 3 shows that a difference of. in the waiting-time target is approximately worth a single server in this context. For comparison with Figure 3, we also show the DIS staffing in the corresponding model with λ further reduced to 5 in Figure 5. 5. Performance in Model with Non-Exponential Patience Distributions We now supplement 5 of [9] by showing in Figures 6 and 7 the analog of Figures and 3 for the same (M t /GI, GI/s t +GI, GI)+(GI/ ) model with Bernoulli feedback after a random delay in an IS orbit queue, but with non-exponential patience distributions. Here all distributions in the model

Figure : Performance functions in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue: four cases of low waiting-time (high QoS) targets (w =.,.,.3 and.4) and DIS-MOL staffing. Arrival rate 8 4 6 8 4 6 8.5 queue length.5 4 6 8 4 6 8 orbit number 4 Abandonment probability (new) 4 6 8 4 6 8.3.. 4 6 8 4 6 8 Abandonment probability (old).4. 4 6 8 4 6 8.3 Delay probability.. delay.4. 4 4 6 8 4 6 8 4 6 8 4 6 8 Staffing 3 4 6 8 4 6 8 are H (m, 4). Otherwise the parameters are the same as before. Figures 6 and 7 show excellent performance just as in 5 of [9]. 3

Figure 3: DIS staffing functions in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue: four cases of high waiting-time (high QoS) targets (w =.,.,.3 and.4) and DIS-MOL staffing. 35 3 5 DIS Staffing 5 w =.4 w =.3 w =. w =. 5 4 6 8 4 6 8 Figure 4: DIS staffing functions in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue: four cases of low waiting-time (high QoS) targets (w =.5,.,.3 and.6) and DIS-MOL staffing. 4 35 3 DIS Staffing 5 5 w =.6 w =.3 w =. w =.5 5 4 6 8 4 6 8 4

Figure 5: DIS staffing functions in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = 5 and r =., Bernoulli feedback with probability p =. and an IS orbit queue: four cases of high waiting-time (high QoS) targets (w =.,.,.3 and.4) and DIS-MOL staffing. 9 8 7 DIS Staffing 6 5 4 w =.4 w =.3 w =. w =. 3 4 6 8 4 6 8 5

Figure 6: Performance functions in the (M t (.)/H (, 4), H (5, 4)/s t + H (, 4), H (, 4)) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue: four cases of high waiting-time (low QoS) targets (w =.,.,.3 and.4) and simple DIS staffing. Arrival rate 9 queue length 4 6 8 4 6 8 5 4 3 4 6 8 4 6 8 orbit number 5 5 Abandonment probability (new) Abandonment probability (old) 4 6 8 4 6 8.3.. 4 6 8 4 6 8.4. 4 6 8 4 6 8 Delay probability delay Staffing.9.8.4.3.. 4 6 8 4 6 8 4 6 8 4 6 8 5 5 4 6 8 4 6 8 6

Figure 7: Performance functions in the (M t (.)/H (, 4), H (5, 4)/s t + H (, 4), H (, 4)) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue: four cases of low waiting-time (high QoS) targets (w =.,.,.3 and.4) and DIS-MOL staffing. Arrival rate 9 queue length orbit number Abandonment probability (new) 6 4 4 6 8 4 6 8 4 6 8 4 6 8 8 6 4 4 6 8 4 6 8.3.. 4 6 8 4 6 8 Abandonment probability (old).4. 4 6 8 4 6 8.6 Delay probability.4. delay 4 6 8 4 6 8.5.4.3.. 4 6 8 4 6 8 Staffing 4 6 8 4 6 8 7

5.3 Performance in Model with Similar Mean Service s We now consider a variant of the same (M t (.)/H (, 4), H (5, 4)/s t +M(), M())+(., H (, 4)/ ) model in Figures and 3 except the mean service time for the fedback customers is E[S ] =. The feedback probability is then increased to.5, so that the two contributions to the overall offered load, by new arrivals and fedback customers, are nearly equal. Figures 8 and 9 show the performance functions for the cases of high and low waiting-time targets, respectively. 8

Figure 8: Performance functions in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =.5, E[S ] =.5 and an IS orbit queue: four cases of high waiting-time (low QoS) targets (w =.,.,.3 and.4) and simple DIS staffing. Arrival rate 9 4 6 8 4 6 8 queue length orbit number 6 4 4 6 8 4 6 8 6 5 4 4 6 8 4 6 8 Abandonment probability (new) Abandonment probability (old) Delay probability.. 4 6 8 4 6 8.4. 4 6 8 4 6 8.9 delay.8 4 6 8 4 6 8.5.4.3.. 4 6 8 4 6 8 Staffing 5 4 6 8 4 6 8 9

Figure 9: Performance functions in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =.5, E[S ] =.5 and an IS orbit queue: four cases of low waiting-time (high QoS) targets (w =.,.,.3 and.4) and DIS-MOL staffing. Arrival rate 9 queue length orbit number Abandonment probability (new) 8 6 4 4 6 8 4 6 8 4 6 8 4 6 8 6 5 4 4 6 8 4 6 8.3.. 4 6 8 4 6 8 Abandonment probability (old).4. 4 6 8 4 6 8 Delay probability.4. 4 6 8 4 6 8 delay.4. 4 6 8 4 6 8 5 Staffing 5 4 6 8 4 6 8

5.4 Performance with Two Feedback Opportunities We now consider an example in which there are two feedback opportunities, as in Figure 8 in 6.3. As before, we let all service-time distributions be hyperexponential H (m, 4), but with different means, and all patience distributions be exponential, M(m). Specifically, the three service-time means are m =., m = /6, m 3 =. and the three patience means are m =., m =., m 3 = /8. Just as in 5, the arrival process is an NHPP with sinusoidal arrival-rate function as in (5) of the main paper for λ = and r =.. The two feedback probabilities are p =.6, p =.5. Figures and show the main performance functions for the cases of high waiting time targets (.,.,.3,.4) with DIS staffing and low waiting time targets (.,.,.3,.4) with DIS-MOL staffing, respectively.

Figure : Performance functions in the (M t /H (m, 4), H (m, 4)/s t + M(m), M(m)) + (H (m, 4)/ ) + (H (m, 4)/ ) model having two delayed customer feedback opportunities, with sinusoidal arrival rate in (5) for r =., mean service times m =., m = /6, m 3 =., mean patience times m =., m =., m 3 = /8 and feedback probabilities p =.6, p =.5: the cases of high waiting-time (low QoS targets) (w =.,.,.3 and.4) and simple DIS staffing. Arrival rate Mean queue length Orbit mean queue length Orbit mean queue length P(Ab) (new) P(Ab) (second time) P(Ab) (third time) Delay probability delay Staffing 9 5 4 6 8 4 6 8 4 6 8 4 6 8 6 4 4 6 8 4 6 8 4 6 8 4 6 8.. 4 6 8 4 6 8.4. 4 6 8 4 6 8.3.. 4 6 8 4 6 8.9.8.5 4 6 8 4 6 8 4 6 8 4 6 8 4 6 8 4 6 8

Figure : Performance functions in the (M t /H (m, 4), H (m, 4)/s t + M(m), M(m)) + (H (m, 4)/ ) + (H (m, 4)/ ) model having two delayed customer feedback opportunities, with sinusoidal arrival rate in (5) for r =., mean service times m =., m = /6, m 3 =., mean patience times m =., m =., m 3 = /8 and feedback probabilities p =.6, p =.5: the cases of low waiting-time (high QoS targets) (w =.,.,.3 and.4) and simple DIS staffing. Arrival rate Mean queue length Orbit mean queue length Orbit mean queue length 9 4 6 8 4 6 8 5 4 6 8 4 6 8 6 5 4 3 4 6 8 4 6 8 4 6 8 4 6 8 P(Ab) (new) P(Ab) (nd time) P(Ab) (3rd time).. 4 6 8 4 6 8.5 4 6 8 4 6 8.4. 4 6 8 4 6 8 Delay probability.5 4 6 8 4 6 8 delay Staffing.4. 4 6 8 4 6 8 4 6 8 4 6 8 3

6 Implied Empirical Quality of Service Functions In this section we elaborate on 5.3 of the main paper, where we presented the implied empirical QoS function, β DIS MOL (t) = s DIS MOL(t) m(t) m(t) (3) associated with DIS-MOL staffing for the base model, denoted by s DIS MOL (t). We now present corresponding plots for the other models. We start with the i= (M t/h (m i, 4) + M(m i )/s t two-class model. The results are plotted in Figure. Figure : The empirical Quality of Service (QoS) provided by the DIS-MOL staffing as a function of the waiting-time target w in the i= (M t/h (m i, 4) + M(m i )/s t two-class model example of 6. in the main paper. Empirical QoS β(t).5.5 w =.5 w =.5 w =. w =. w =.3 w =.4 w =.6 4 6 8 4 6 8 We next consider the example from 6. involving the (M t (.)/H (, 4), H (/6, 4)/s t + M(), M()) + (.6, H (, 4)/s t + M()) model with feedback after a delay in a finite-capacity orbit queue. The results are displayed in Figure 3. We next consider the example with two feedback opportunities from 6.3. Specifically, we consider the (M t /GI, GI/s t + GI, GI) + (GI/ ) + (GI/ ) model with two delayed customer feedback opportunities. The results are displayed in Figure 4. Finally, we show the empirical Quality of Service (QoS) provided by the DIS-MOL staffing in the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (., H (, 4)/ ) model from 5. in this appendix with λ is reduced from, first to, and then to 5. Figures 5 and 6 show the results for and 5, respectively. It should be compared to the corresponding plots for the model with λ = in Figure 4 of [9]. From the greater thickness of the plots here, we see that the discretization now has a bigger impact. But we see that the OL can be useful. 4

Figure 3: The empirical Quality of Service (QoS) provided by the DIS-MOL staffing as a function of the waiting-time target w in the (Mt (.)/H (, 4), H (/6, 4)/st + M (), M ()) + (.6, H (, 4)/st + M ()) example from 6.. Empirical QoS β(t): main queue w =.5.5 w =.5 w =..5 w =. w =.3 w =.4 w =.6 4 6 8 4 6 8 Empirical QoS β(t): orbit queue.5 w =.5 w =.5.5 w =. w =. w =.3 w =.4 w =.6.5 4 6 8 4 6 8 Figure 4: The empirical Quality of Service (QoS) provided by the DIS-MOL staffing as a function of the waiting-time target w in the (Mt (.)/H (, 4), H (5, 4)/st +M (), M ())+(., H (, 4)/ ) example from 6.3. Empirical QoS β(t) w =.5.5 w =.5 w =. w =. w =.3.5 w =.4 w =.6 4 6 8 5 4 6 8

Figure 5: The empirical Quality of Service (QoS) provided by the DIS-MOL staffing as a function of the waiting-time target w in the (M t (.)/H (, 4), H (5, 4)/s t +M(), M())+(., H (, 4)/ ) model from 5. in this appendix with λ is reduced from to. 3.5 Empirical QoS β(t).5.5 w =.5 w =.5 w =. w =. w =.3 w =.4 w =.6 4 6 8 4 6 8 Figure 6: The empirical Quality of Service (QoS) provided by the DIS-MOL staffing as a function of the waiting-time target w in the (M t (.)/H (, 4), H (5, 4)/s t +M(), M())+(., H (, 4)/ ) model from 5. in this appendix with λ is reduced from to 5. 3.5 w =.5 Empirical QoS β(t).5 w =.5 w =. w =. w =.3 w =.4 w =.6.5 4 6 8 4 6 8 6

7 The Index of Disperson for Counts (IDC) of the Flows A statistical analysis of the departure processes was a major component of our recent paper on feed-forward networks of many-server queues [8]. The DIS approximation for high waiting-time targets is valid for general G t arrival processes, but the DIS-MOL approximation for low waiting-time targets depends on the M t NHPP property, in order for the stationary M/GI/s + GI model to yield a reasonable approximation for each t. As discussed in [8], it is possible that there can be significant degradation of the DIS-MOL approximation at a queue if the arrival process is not nearly NHPP. That can be caused by an upstream queue with a high waiting-time (low QoS) target and a service-time distribution that is not nearly exponential, such as the H (m, 4) distributions we have been considering. We did not find such performance degradation to be a serious problem in the context of the present paper, so we did not discuss it in the main paper. However, we did investigate the question for the present feedback model. We discuss the results here. First, we give some background. We assume that the external arrival process is NHPP. Under low waiting-time (high QoS) targets, that will make all queues similar to M t /GI/ IS queues, which have NHPP departure processes. Moreover, independent thinning of an NHPP is again an NHPP. So the NHPP property should propogate forward to all arrival processes in a lightly loaded feedback model. To understand complications in more heavily loaded models, we first observe that the servers are all likely to be busy simultaneously in a more heavily loaded system. Thus, the departure process should behave like the superposition of a random number of i.i.d. renewal processes, where the interrenewal times are the service times. Such superposition processes are well studied for stationary models. For a stationary model, the departure process from the upstream queue will tend to be similar to the superposition of a fixed number i.i.d. renewal processes, with inter-renewal times distributed as the service times in that upstream queue. Such superposition processes approach a Poisson process locally as the number of component processes increases, but they also have the same central-limit-theorem behavior as a single renewal process as time increases for any fixed number; see [8] and 9.8 of [] for more discussion. In [8] we found that the index of dispersion for counts (IDC) revealed when the flow could be regarded as approximately an NHPP as far as performance with the DIS-MOL approximation is concerned. The IDC is the ratio of the variance to the mean of the counting process. That is, if A(t) counts the number of arrivals in [, t], then the IDC is the function I(t) V ar(a(t), t. (4) E[A(t)] For an NHPP, I(t) = for all t. Based on [8], we are motivated to examine the IDC s of the various counting processes arising in our feedback models. We report the results in this section of the appendix. For the most part, we conclude that the flows should be approximately NHPP when it matters. Hence, we did not discuss this feature in the main paper. Our most important finding is that the IDC of the overall departure process from the system could well have an IDC significantly greater than, so that the departure process would have significantly more stochastic variability than an NHPP. That could cause performance degradation at a subsequent queue fed by the departure process from the feedback model. 7

7. The Base Example: an IS Orbit Queue we start by examining the IDC of various counting processes in the base (M t (r)/h (, 4), H (5, 4)/s t + M(), M()) + (p, H (, 4)/ ) model with an IS orbit queue and r = p =. from 5 of [9]. We consider the same models here as before. All service-time distributions are H, while all patience distributions are M, but the means vary, as indicated. Since the departure process from an M t /GI/ IS model is always Poisson, by Theorem of [], we expect no problem with an IS orbit queue. Paralleling Figures and 3 of the main paper, Figure 7 (8) shows the results when both queues have high waiting-time,low QoS, targets and DIS staffing (low waiting-time, high QoS, targets and DIS-MOL staffing), respectively. The entrant arrivals are from the process with rate function λ F (t) coming out of the orbit queue; the total arrival process combined the reentrant arrivals with the external arrivals, having rate function λ(t) + λ F (t); The total departure process is the aggregate departure process from the system, entering the outside world, corresponding to the rate functions ( p)σ (t) + σ (t); and the total orbiting process is the process entering the orbit queue, with rate function pσ (t). From the perspective of the performance of DIS-MOL at the main queue, the most important process is the total arrival process. Figure 8 shows that all the IDC s are consistently near with low waiting-time (high QoS) targets and DIS-MOL staffing, so that all flows are nearly NHPP s. Since the queues are quite lightly loaded with DIS-MOL staffing, they are not too different from M t /GI/ queues. These IDC s are consistent with the theoretical properties that (i) the departure process from an M t /GI/ queue is NHPP and (ii) the independent thinning of an NHPP is again an NHPP. Hence, the NHPP property propagates through the network, approximately. However, when both queues have high waiting-time (low QoS) targets, we see that the total departure process exhibits an IDC approaching, but none of the arrival processes have IDC s that differ significantly from. Evidently the thinning of the initial departure process makes the arrival process at the orbit queue nearly NHPP. These IDC plots are consistent with the excellent performance we saw in 6. of the main paper. However, we emphasize that DIS staffing with high waiting-time (low QoS) targets is effective without requiring the NHPP property. However, it is important to note that the overall departure process does have an IDC that is significantly greater than. Hence, it is possible that this feedback queue, with these high waitingtime (low QoS) targets, could have adverse effect on the performance of a subsequent queue fed by the overall departure process, because the overall departure process is significantly more variable than an NHPP. Experience indicates that the IDC values in the range between and do not affect the performance too much, so this phenomenon appears to be not too serious. 8

Figure 7: Estimates of the mean ED(t), variance V ar(d(t)) and IDC I(t) V ar(d(t))/ed(t) for several counting processes from the (M t (r)/h (, 4), H (5, 4)/s t + M(), M()) + (p, H (, 4)/ ) base model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue, as in 5 of [9]: the cases of high waiting-time (low QoS) targets w =.4 and. and DIS staffing. Reentrant arrival 3 5 5 5 mean: w =.4 variance: w =.4 mean: w =. variance: w =. Mean and variance.5 Variance to mean ratiio w =.4 w =. 6 8 4 6 8.5 6 8 4 6 8.5 Total arrival 5 5 5 6 8 4 6 8.5 6 8 4 6 8 Total departure 5 5.8.6.4. 3 5 6 8 4 6 8.5 6 8 4 6 8 Total orbiting 5 5 6 8 4 6 8.5 6 8 4 6 8 9

Figure 8: Estimates of the mean ED(t), variance V ar(d(t)) and IDC I(t) V ar(d(t))/ed(t) for several counting processes from the (M t (r)/h (, 4), H (5, 4)/s t + M(), M()) + (p, H (, 4)/ ) base model with the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =. and an IS orbit queue, as in 5 of [9]: the cases of low waiting-time (low QoS) targets w =.4 and. and DIS-MOL staffing. Reentrant arrival 3 5 5 5 mean: w =.4 variance: w =.4 mean: w =. variance: w =. Mean and variance.5 Variance to mean ratio w =.4 w =. 5 6 8 4 6 8.5 6 8 4 6 8.5 Total arrival 5 5 Total departure 4 8 6 4 3 5 6 8 4 6 8 6 8 4 6 8.5 6 8 4 6 8.5.5 6 8 4 6 8.5 Total orbiting 5 5 6 8 4 6 8.5 6 8 4 6 8 3

7. A Finite-Capacity Orbit Queue As indicated above, there is a potential problem with a finite-capacity orbit queue. Hence, we now estimate the IDC s of the main flows in the (M t (r)/gi, GI/s t + GI, GI) + (p, GI/s t + GI) model with a finite-capacity orbit queue, as in 6. of [9]. Again we use the same model parameters as in the main paper. We will be presenting IDC s in the two cases corresponding to Figures 7 and 8 in the main paper. As is done there, we let the waiting-time targets be the same in the two queues, but we consider both high and low waiting-time targets. In particular, we consider the (M t (r)/h (, 4), H (5, 4)/s t + M(), M()) + (p, H (, 4)/s t + M()) model with r =. and p =.6. Figures 9 and show the results with high and low waiting-time targets. The same target is used at the finite-capacity orbit queue as at the main queue. Here the feedback probability is p =.6 instead of p =., as for the IS orbit queue. Figure 9: Estimates of the mean ED(t), variance V ar(d(t)) and IDC I(t) V ar(d(t))/ed(t) for several counting processes from the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + (.6, H (, 4)/s t + M()) model with finite-capacity orbit queue, the sinusoidal arrival rate in () for λ = and r =., Bernoulli feedback with probability p =.6, as in 6. of [9]: the cases of identical high waiting-time (low QoS) targets W =. and w =.4 and DIS staffing at both queues. Reentrant arrival 6 5 4 3 Mean: w =.4 Variance: w =.4 Mean: w =. Variance: w =. Mean and variance.5 Variance to mean ratio w =.4 w =. 6 8 4 6 8.5 6 8 4 6 8.5 Total arrival 5 5 6 8 4 6 8 5.5 6 8 4 6 8.5 Total departure 5 5.5 6 8 4 6 8.5 6 8 4 6 8 Total orbiting 8 6 4.4.3.. 6 8 4 6 8 6 8 4 6 8 The results are similar to those for the IS orbit queue before. In both cases the total arrival process appears to be approximately an NHPP. However, as before, in the case of high waiting-time (low QoS) targets W =. and w =.4 and DIS staffing at both queues, the total departure process has an IDC significantly above. Hence, there could well be performance degradation at a subsequent queue fed by the overall departure process. 3

Figure : Estimates of the mean ED(t), variance V ar(d(t)) and IDC I(t) V ar(d(t))/ed(t) for several counting processes from the (M t (.)/H (, 4), H (5, 4)/s t + M(), M()) + +(.6, H (, 4)/s t + M()) model with a finite-capacity orbit queue, the sinusoidal arrival rate in () for λ = and r =. and Bernoulli feedback with probability p =.6, as in 6. of [9]: the cases of identical low waiting-time (high QoS) targets W =. and w =.4 and DIS-MOL staffing at both queues. Reentrant arrival 8 7 6 5 4 3 Mean: w =.4 Variance: w =.4 Mean: w =. Variance: w =. Mean and Variance.5 Variance to mean ratio w =.4 w =. 6 8 4 6 8 35.5 6 8 4 6 8.5 Total arrival 3 5 5 5 6 8 4 6 8 4.5 6 8 4 6 8.5 Total departure 8 6 4 8 6 8 4 6 8.5 6 8 4 6 8.5 Total orbiting 6 4 6 8 4 6 8.5 6 8 4 6 8 3