Fluid Limit of A Many-Server Queueing Network with Abandonment

Similar documents
State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems. J. G. Dai. Tolga Tezcan

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing

Asymptotic Coupling of an SPDE, with Applications to Many-Server Queues

Dynamic Control of Parallel-Server Systems

Motivated by models of tenant assignment in public housing, we study approximating deterministic fluid

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1

STABILITY OF TWO FAMILIES OF REAL-TIME QUEUEING NETWORKS ŁUKASZ K RU K (WARSZAWA)

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

Figure 10.1: Recording when the event E occurs

A Diffusion Approximation for Stationary Distribution of Many-Server Queueing System In Halfin-Whitt Regime

LIMITS FOR QUEUES AS THE WAITING ROOM GROWS. Bell Communications Research AT&T Bell Laboratories Red Bank, NJ Murray Hill, NJ 07974

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015

The Fluid Limit of an Overloaded Processor Sharing Queue

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates

The G/GI/N Queue in the Halfin-Whitt Regime I: Infinite Server Queue System Equations

Minimizing response times and queue lengths in systems of parallel queues

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Queues and Queueing Networks

Fluid Models of Parallel Service Systems under FCFS

Yongjiang Guo & Yunan Liu

Synchronized Queues with Deterministic Arrivals

2009), URL

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/25/17. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Stationary Probabilities of Markov Chains with Upper Hessenberg Transition Matrices

arxiv: v3 [math.pr] 4 Dec 2017

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/6/16. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Designing a Telephone Call Center with Impatient Customers

This paper investigates the impact of dependence among successive service times on the transient and

Queuing Networks: Burke s Theorem, Kleinrock s Approximation, and Jackson s Theorem. Wade Trappe

Analysis of an M/G/1 queue with customer impatience and an adaptive arrival process

Since D has an exponential distribution, E[D] = 0.09 years. Since {A(t) : t 0} is a Poisson process with rate λ = 10, 000, A(0.

Dynamic Control of a Tandem Queueing System with Abandonments

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

Exercises Stochastic Performance Modelling. Hamilton Institute, Summer 2010

Control of Fork-Join Networks in Heavy-Traffic

Erlang-C = M/M/N. agents. queue ACD. arrivals. busy ACD. queue. abandonment BACK FRONT. lost calls. arrivals. lost calls

The G t /GI/s t + GI Many-Server Fluid Queue: Longer Online Version with Appendix

A Note on the Event Horizon for a Processor Sharing Queue

E-Companion to Fully Sequential Procedures for Large-Scale Ranking-and-Selection Problems in Parallel Computing Environments

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals

Queues with Many Servers and Impatient Customers

The Performance Impact of Delay Announcements

HITTING TIME IN AN ERLANG LOSS SYSTEM

Wait-Time Predictors for Customer Service Systems with Time-Varying Demand and Capacity

Topics in queueing theory

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

Solutions to Homework Discrete Stochastic Processes MIT, Spring 2011

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL

Making Delay Announcements

Chapter 1. Introduction. 1.1 Stochastic process

Directional Derivatives of Oblique Reflection Maps

(b) What is the variance of the time until the second customer arrives, starting empty, assuming that we measure time in minutes?

DS-GA 1002 Lecture notes 2 Fall Random variables

Online Supplement to Creating Work Breaks From Available Idleness

Statistics 150: Spring 2007

Monotonicity Properties for Multiserver Queues with Reneging and Finite Waiting Lines

SOLUTIONS IEOR 3106: Second Midterm Exam, Chapters 5-6, November 8, 2012

Online Supplement to Creating Work Breaks From Available Idleness

Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers

Other properties of M M 1

Functional Limit Theorems for A New Class of Non-Stationary Shot Noise Processes

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

M/G/1 and Priority Queueing

Introduction to Queueing Theory with Applications to Air Transportation Systems

Quality of Real-Time Streaming in Wireless Cellular Networks : Stochastic Modeling and Analysis

Positive Harris Recurrence and Diffusion Scale Analysis of a Push Pull Queueing Network. Haifa Statistics Seminar May 5, 2008

ANALYSIS OF THE LAW OF THE ITERATED LOGARITHM FOR THE IDLE TIME OF A CUSTOMER IN MULTIPHASE QUEUES

Fluid Limits for Processor-Sharing Queues with Impatience

A FUNCTIONAL CENTRAL LIMIT THEOREM FOR MARKOV ADDITIVE ARRIVAL PROCESSES AND ITS APPLICATIONS TO QUEUEING SYSTEMS

Heavy Traffic Analysis for EDF Queues with Reneging

Blanchet, J., Dong, J., and Pei, Y.

TOWARDS BETTER MULTI-CLASS PARAMETRIC-DECOMPOSITION APPROXIMATIONS FOR OPEN QUEUEING NETWORKS

Point Process Control

Design, staffing and control of large service systems: The case of a single customer class and multiple server types. April 2, 2004 DRAFT.

Analysis of a Two-Phase Queueing System with Impatient Customers and Multiple Vacations

An Analysis of the Preemptive Repeat Queueing Discipline

CONTINUITY OF A QUEUEING INTEGRAL REPRESENTATION IN THE M 1 TOPOLOGY. Department of Industrial Engineering and Operations Research Columbia University

One billion+ terminals in voice network alone

STABILITY OF MULTICLASS QUEUEING NETWORKS UNDER LONGEST-QUEUE AND LONGEST-DOMINATING-QUEUE SCHEDULING

Appendix. A. Simulation study on the effect of aggregate data

Proactive Care with Degrading Class Types

Chapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries

Modeling Service Networks with Time-Varying Demand

Data analysis and stochastic modeling

Dynamic control of a tandem system with abandonments

Invariant measures for iterated function systems

State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems and Applications. Tolga Tezcan

[4] T. I. Seidman, \\First Come First Serve" is Unstable!," tech. rep., University of Maryland Baltimore County, 1993.

Fork-Join Networks in Heavy Traffic: Diffusion Approximations and Control. M.Sc. Research Proposal

57:022 Principles of Design II Final Exam Solutions - Spring 1997

Random variables. DS GA 1002 Probability and Statistics for Data Science.

Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers

Design and evaluation of overloaded service systems with skill based routing, under FCFS policies

ESTIMATING WAITING TIMES WITH THE TIME-VARYING LITTLE S LAW

Transcription:

Fluid Limit of A Many-Server Queueing Networ with Abandonment Weining Kang Guodong Pang October 22, 213 Abstract This paper studies a non-marovian many-server queueing networ with abandonment, where externally arrived and internally routed customers are served under the non-idling First-Come- First-Serve (FCFS discipline at each station of many parallel servers. Externally arrived and internally routed customers in each queue may have different patience time distributions. The system dynamics is described by the total number of customers in each queue (both waiting and receiving service and a triplet of measure-valued processes, tracing the amount of service time each customer in service has received, the waiting times of externally arrived customers and the waiting times of internally routed customers in queue. We show a functional strong law of large numbers in the many-server regime for these processes, where the limit processes are the unique solution to a system of deterministic measure-valued integral equations, and characterize the invariant states of the fluid limit. 1 Introduction We study a non-marovian many-server queueing networ model with customer abandonment and Marov routing. There are a fixed number of service stations, each of which has either finitely or infinitely many parallel servers, a single queue and its own designated customer class. Customers enter the system at a service station, and receive service immediately if there is a free server at the station, and oin the queue at the station otherwise. Upon service completion, a customer is immediately routed to one of the service stations or leaves the system following a Marovian routing mechanism, independent of other customers. Our networ model allows for customer feedbac. Externally arrived and internally routed customers at each service station are served in the non-idling, First-Come-First-Serve (FCFS discipline. Customers can be out of patience and leave the system (without reentry when they are waiting in the queue before receiving service. The patience-time distributions of externally arrived and internally routed customers may be different. The service and patience time distributions of customers depend on the service station. See 2.1 for a more detailed description of the model. This model has many interesting applications in customer contact centers and patient flow analysis. In particular, recent empirical analysis shows that the patience times of customers first Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, MD 2125 (wang@umbc.edu The Harold and Inge Marcus Department of Industrial and Manufacturing Engineering, Pennsylvania State University, University Par, PA 1682 (gup3@psu.edu 1

visit and reentrants can be very different, see [18 and [38. Thus, it is important to understand the system dynamics of this networ model with this feature of different patience-time distributions of externally arrived and internally routed customers in each service station. However, this causes technical difficulties in the exact analysis as well as asymptotically. In the many-server regime, even for the Marovian model with exponential service and patience times at each station, the conventional approach in [29 does not wor. Here we adapt the measure-valued stochastic-process framewor developed by Kaspi and Ramanan [16 and Kang and Ramanan [15, 14 to obtain fluid approximations of the system dynamics in the many-server regime. In particular, we use a triplet of measure-valued stochastic processes together with the process counting the total number of customers at each service station to provide a Marovian representation of the system evolution dynamics. Here, one measure-valued process eeps trac of the amount of service time each customer in service has received, and the other two measure-valued processes eep trac of the waiting times of externally arrived and internally routed customers in the queue, respectively. It is critical for our analysis to use two measure-valued processes to describe the dynamics of externally arrived and internally routed customers in each queue separately, because a single measure-valued process cannot simultaneously characterize the different waiting (impatient behaviors of externally arrived and internally routed customers. See 2.2 for a more detailed presentation of these state descriptors and dynamical equations. We investigate the approximate system dynamics in the many-server heavy-traffic regime where arrival rates in all service stations and the numbers of servers in those service stations go to infinity together in an appropriate manner while the service and patience time distributions are fixed; specifically, we assume that the external arrival rates (possibly time-varying at each service station grows proportionally to the total number of servers from those service stations in the system as it increases to infinity. The main results of this paper include a functional strong law of large numbers (FSLLN, fluid limit (Theorem 6.1 for the Marovian state descriptor, i.e., the triplet of measurevalued processes and the process counting the number of customers at each queue, uniqueness of a solution to the limit (Theorems 3.4 and the characterization of its invariant states (Theorem 3.9. We also discuss the two special cases, a single station with immediate feedbac and a tandem networ of two many-server queues with abandonment. The proof of the convergence for the FSLLN is based on the arguments in [16, 15, 14, by constructing martingales for the departure processes at each service station. However, characterizations of the limit processes are more involved than the single service station setting [15. One complication is that all the service stations are lined together due to customers internal routing, and the analysis of one service station inevitably involves analyzing other service stations at the same time. Moreover, patience-time differentiation of externally arrived and internally routed customers adds more complication because the two measure-valued processes describing the dynamics of customers in a single FCFS queue at each service station are only lined by the waiting-time process of the head-of-line customer, who can be either an externally arrived or internally routed customer. Unlie fluid limits of single-server queues and networs in the conventional regime, where a Sorohod mapping can be identified to show the uniqueness of its solution and the convergence (due to the server idleness, the fluid limits for many-server queues and networs in the FCFS regimes do not have reflections and cannot be put in the framewor of Sorohod problems. Thus, new arguments are needed in the proofs of uniqueness of a solution to the fluid equations and the characterization of its invariant state to address the complications from the interconnection of service stations as well as from patience-time differentiation. These methods may turn out to be useful for the study of uniqueness of fluid model solutions in service networs with other networ structures, e.g., sill-based routing in [1, 35 and customers switching queues in [3. Literature Review. Many-server queues with abandonment and their networs have been exten- 2

sively used to model large-scale service systems, for example, customer contact centers and patient flows in hospitals; see [11, [1, [27, [12, [19, [1 and [34 and references therein. There is a vast literature on Marovian many-server queueing (networ models with abandonment. We refer the readers to the above cited papers for a complete review of them. Empirical study of call centers and patient flows have shown that customers service and patience times are usually non-exponential; see, e.g., [4, [28, [1 and [34. Thus, it is significant that stochastic models for these systems capture the realistic feature of non-exponential service and patience time distributions. There has been substantial development in the recent years on non-marovian many-server queueing (networ models. Here we review those mostly relevant to our wor. (i For non-marovian many-server queues, we refer to [36, 16, 14, 15, 39, 2, 21, 22, 23, 13 for fluid models using measure-valued and two-parameter processes tracing elapsed or residual times and [17, 32, 6, 7, 26, 8 for approximations in the Halfin-Whitt regime. (ii There is very limited research on non-marovian many-server queuing networ models with abandonment. Atar et al. [3 generalize the measure-valued process approach in [16, 14 to study a multiclass non-marovian many-server model with abandonment, in which customers are served according to a non-preemptive priority policy. Reed and Shai [33 consider the G/GI/N queue with multiple server pools and pool-dependent general service time distributions in the Halfin-Whitt regime, where customers are routed to the server pool with the longest weighted cumulative idle time in order to achieve fairness in the worload among the server pools. Liu and Whitt [24, 25 study a fluid networ model for a non-marovian open queueing networ of many-server queues, where all model elements are time-varying. They generalize their algorithm in [2 to this time-varying fluid networ model. Their model is closest to ours, but they do not consider the differentiation of patience times of external and internal customers. 1.1 Organization of the Paper The notation used in this paper is the same as in [16, 15, 14, so we give a brief description of notation in 1.2. We describe the model precisely and present the measure-valued stochastic-process descriptor for the system dynamics in 2. We present the fluid model in 3.1, and characterize its invariant state in 3.2. Two special cases are discussed in 4.1 and 4.2. We prove the uniqueness of solutions to the fluid equations in 5, and show the convergence of the fluid-scaled processes to the fluid limit in 6. Some additional proofs are collected in the Appendix. 1.2 Notation The following notation will be used throughout the paper. N, R (R +, Z (Z + represent the set of strictly positive integers, real numbers (non-negative, integers (non-negative, respectively. For a, b R, a b = max{a, b}, a b = min{a, b} and a + = a. For a set B, 1 B denotes the indicator function of the set B. For a square matrix P, P denotes its transpose, P 1 denotes its inverse if P is invertible, and P n denotes its nth power. For any metric space E, C b (E and C c (E are, respectively, the space of bounded, continuous functions and the space of continuous real-valued functions with compact support defined on E, while C 1 (E is the space of real-valued, once continuously differentiable functions on E, and Cc 1 (E is the subspace of functions in C 1 (E that have compact support. The subspace of functions in C 1 (E that, together with their first derivatives, are bounded, will be denoted by Cb 1(E. For ϕ : E R, let ϕ. = sup x E ϕ(x and supp(ϕ be the support of ϕ. For H, let L 1 [, H and L 1 loc [, H, respectively, represent the spaces of integrable and locally integrable functions on [, H, where a locally integrable function f on [, H is a measurable function on [, H that satisfies [,a f(x dx < for all a < H. The constant functions f 1 and f will be represented by the symbols 1 and, respectively. 3

. For any càdlàg function f : [, R, f T = sups [,T f(s for every T <. Given a nondecreasing, right continuous function f having left limits on [,, f 1 denotes the left continuous inverse function of f: f 1 (y = inf{x : f(x y} with the convention that infimum over an empty set is. For each differentiable function f defined on R, f denotes the first derivative of f. For each function f(t, x defined on R R n, f t denotes the partial derivative of f with respect to t and f x denotes the partial derivative of f with respect to x R. The space of Radon measures on a Polish space E, endowed with the Borel σ-algebra, is denoted by M(E, while M + (E and M F (E are, respectively, the subspaces of non-negative, finite nonnegative measures in M(E. M(E (M + (E and M F (E, endowed with the vague and wea topologies [31, 37, respectively, are Polish spaces. The symbol δ x denotes the measure with unit mass at the point x. We will also use to denote the identically zero Radon measure on E. When E is an interval, say [, H, for notational conciseness, we will often write M[, H instead of M([, H. We say a measure µ is continuous at x [, H if and only if µ({x} = and µ is continuous on [, H if µ is continuous at each x [, H. When E = [, H and E = [, H R +, for some H (,. we will usually use f to denote generic functions on [, H and ϕ to denote generic functions on [, H R +. For any Borel measurable function f : [, H R that is integrable with respect to ξ M[, H, we often use the short-hand notation f, ξ =. [,H f(x ξ(dx. For each measure µ on [,, let F µ (x =. µ[, x for each x [,. Given a Polish space H, we denote by D H [, T (respectively, D H [, the space of H-valued, càdlàg functions on [, T (respectively, [,, and we endow this space with the usual Sorohod J 1 -topology [31,[37 so that they are Polish. Let I R+ [, be the subset of non-decreasing functions f D R+ [, with f( =. A sequence {X n } of càdlàg, H-valued processes, with X n defined on (Ω n, F n, P n, is said to converge in distribution to a càdlàg H-valued process X defined on (Ω, F, P if, for every bounded, continuous functional F : D H [, R, we have lim n E n [F (X n = E [F (X, where E n and E are the expectation operators with respect to the probability measures P n and P, respectively. Convergence in distribution of X n to X will be denoted by X n X. 2 Descriptions of Model and System Dynamics 2.1 Model Description and Primitive Data Consider a system with K (1 K < service stations and each service station has its own designated customer class. Thus, there are K customer classes for the entire system. Let K =. {1,, K}. For each K, customers of class are served by the th service station with N [1, identical servers, in which arriving customers are served in a non-idling, FCFS manner, that is, a newly arriving customer immediately enters service if there are any idle servers or, if all servers are busy, then the customer oins the end of the queue, and the customer at the head of the queue (if one is present enters service as soon as a server becomes free. Note that here we do allow the possibility that a service station may have infinitely many identical servers. Let N be a positive integer. We assume that, for each K, N = s N, where s is a fixed constant in (, independent of N. Note that for each K, N = if s = and N /N s as N. External Arrivals. We assume that there exists a K-dimensional cumulative external arrival process E (N such that for each K and t >, E (N (t represents the total number of customers of class that have arrived into the system from outside during the time interval (, t. We assume that E (N is a non-decreasing, pure ump process with E (N ( = and a.s., for each t [,, E (N (t < and E (N (t E (N (t {, 1}. Also, for each K, let E (N be an a.s. 4

Z + -valued random variable that represents the number of customers of class that have entered the system by time zero. The set of random variables {E (N, K} are only used for booeeping purposes to eep trac of the indices of customers. Service Times. For each K, the customers of class are either coming externally from the outside of the system, or coming internally from one of the service stations upon service completion due to internal routing. We shall call the first type of customers as external customers and the second type of customers as internal customers. We assume that for each K, there exists two i.i.d. sequences of i.i.d. random variables {v 1, i, i Z} and {v 2, i, i Z}, with common cumulative distribution function G s on [,. For each K and i N, v1, i (resp. v 2, i represents the service requirement of the ith external (resp. internal customer of class to enter the system after time zero, while {v 1, i, i N {}} (resp. {v 2, i, i N {}} represents the service requirements of external (resp. internal customers of class that arrived by time zero (if such customers exist, ordered according to their arrival times (by time zero. We assume that G s has density gs. Let C s. = sup{x [, : G s (x = } and. Hs = sup{x [, : G s (x < 1}. Then C s and Hs denote, respectively, the left end and the right end of the support of gs. We assume that the service time distribution G s has positive finite mean, that is, m s. = (1 G s [,H s (xdx (,. (2.1 Routing. We assume Marovian routing, described as follows. Let e 1,, e K be the K unit coordinate vectors in R K and e be the K-dimensional vector of all zeros. For each K, {φ 1, (i, i Z} and {φ 2, (i, i Z} are two i.i.d. sequences of i.i.d. routing vectors where for = 1, 2 and i Z, φ, (i taes values in the set {e, e 1,, e K }. For each K and i Z, the ith external (resp. internal customer of class to depart from the th service station is next routed to class l if φ 1, (i = e l (resp. φ 2, (i = e l for some l K, or it leaves the system if φ 1, (i = e (resp. φ 2, (i = e. Let P be a K K matrix such that for each, l K and i Z, P l = P(φ 1, (i = e l = P(φ 2, (i = e l. The matrix P is called the routing matrix. Let P. = 1 l K P l for each K. We assume that P satisfies the conditions that I P is invertible and H. = (I P 1 = I + P + (P 2 + (P 3 +. Note that the matrix H has non-negative entries. Reneging. It is assumed that customers are impatient, and that a customer reneges from the queue as soon as the amount of time it has spent in queue reaches its patience limit. Customers do not renege once they have entered service. We assume that external customers and internal customers in queue may have different patience time distributions. For each K, the patience times of external customers of class are given by an i.i.d. sequence, {r 1, i, i Z}, with a common cumulative distribution function G r 1, on [,, where for each i N, r1, i represents the patience time of the ith external customer of class to enter the system after time zero, while {r 1, i, i N {}} represents the patience times of external customers of class that entered the system by time zero, ordered according to their arrival times (by time zero. For each K, the patience times of internal customers of class are given by another i.i.d. sequence, {r 2, i, i Z}, with a common cumulative distribution function G r 2, on [,, where for each i N, r2, i represents the 5

patience time of the ith internal customer of class to reenter the system after time zero, while {r 2, i, i N {}} represents the patience times of internal customers of class arrived by time zero, ordered according to their reentering times (by time zero. We assume that for = 1, 2, G r,, restricted on [,, has density g, r. For = 1, 2, let. Cr, = sup{x [, : G r, (x = } and H, r. = sup{x [, : G r, (x < 1} denote, respectively, the left end and the right end of the support of g, r. For each = 1, 2, the mean of the patience time distribution Gr, is denoted by m r., = (1 G r [,H, r, (xdx [,. (2.2 Independence We assume that the cumulative external arrival processes E (N, K, the sequences of service requirements {v, i, i Z}, = 1, 2, K, the sequences of patience times {r, i, i Z}, = 1, 2, K, and the sequences of feedbac vectors {φ, (i, i N}, = 1, 2, K are mutually independent. 2.2 System Dynamics For each K, we shall use two measure-valued processes to describe the queue dynamics for external and internal customers of class, respectively. We first describe the potential queue dynamics for external customers of class. For each Z, let ζ (N,1, let η (N,1, t denote the arrival time of external customer of class into the system. For t [,, be a non-negative Borel measure on [, H1, r with the representation: η (N,1, t = E (N (t = E (N +1 δ (N,1, w (t 1 (N,1, {w (t<r 1, }, (2.3 where w (N,1, (t = [ t ζ (N,1, r 1, represents the amount of time external customer of class has been in the potential queue by time t. In a similar fashion, we can define another measure-valued process η (N,2, t to describe the potential queue dynamics for internal customers of class. Specifically, let C (N be an a.s. Z + - valued random variable that represents the number of internal customers of class that reentered the system due to internal routing by time zero and ζ (N,2, denote the time at which internal customer of class reenters the system upon service completion. Moreover, let I (N (t denote the cumulative number of internal customers of class routed to the service station in the time interval (, t. Since the service time distributions G s, K, have densities on their supports, then with probability one, there is at most one customer finishes service from those K service stations at any given time t [,, that is, I (N (t I (N (t {, 1} for each t with probability one. Then η (N,2, t on [, H2, r can be defined as η (N,2, t = I (N (t = C (N +1 δ (N,2, w (t 1 (N,2, {w (t<r 2, }, (2.4 where w (N,2, (t = [ t ζ (N,2, r 2, represents the amount of time internal customer of class has been in the potential queue by time t. 6

For each t, let The measure η (N, t t. Note that 1, η (N,1, t η (N, t = η (N,1, t + η (N,2, t. (2.5 eeps trac of the waiting times of all customers in the potential queue at time = η (N,1, t [, represents the total number of external customers waiting in the potential queue at time t, and 1, η (N,2, t = η (N,2, t [, represents the total number of internal customers waiting in the potential queue at time t. Thus, 1, η (N, t represents the total number of customers waiting in the potential queue at time t. = η (N, t [, For t [,, let X (N (t be the total number of customers of class (including external customers and internal customers in the system and Q (N (t be the number of customers of class waiting in queue at time t. Due to the non-idling condition, the queue length process Q (N of customers of class is then given by Q (N (t = [X (N (t N +. (2.6 Moreover, since the head-of-the-line customer (external or internal of class in queue is the customer of class in queue with the longest waiting time, the quantity χ (N (t =. { } inf x > : η (N, t [, x Q (N (t (2.7 represents the waiting time of the head-of-the-line customer of class in the queue at time t. Since this is an FCFS system, any mass in η (N, t that lies to the right of χ (N (t represents a customer that is either in service or has departed by time t. Therefore, the queue length process Q (N admits the following alternative representation in terms of χ (N and η (N, : Q (N (t = η (N, t [, χ (N (t. (2.8 For t [,, in a fashion analogous to (2.3 and (2.4, we can also define ν (N, t to be a discrete non-negative Borel measure on [, H s that has a unit mass at the amount of time each of the customers (either external or internal has spent in service by time t. For each Z, let ς (N,1, denote the time external customer of class enters service and ς (N,2, denote the time internal customer of class enters service. Note that if external (resp. internal customer of class reneged, then ς (N,1, = (resp. ς (N,2, =. Let a (N,1, (t = [ t ς (N,1, v 1, (resp. a (N,2, (t = [ t ς (N,2, v 2, represents the amount of time external (resp. internal customer of class has been in service by time t. Then, the measure ν (N, t is defined as ν (N, t = ν (N,1, t + ν (N,2, t, (2.9 where and ν (N,1, t = ν (N,2, t = E (N (t = E (N +1 I (N (t = C (N +1 δ (N,1, a (t 1{ } (2.1 a (N,1, <v 1, δ (N,2, a (t 1{ }. (2.11 a (N,2, <v 2, 7

Note that 1, ν (N, t = ν (N, t [, represents the total number of customers of class in service at time t. We now introduce some auxiliary processes. Fix K. Let D (N l, l K {}, denote the cumulative routing processes, where for each l K, D (N l (t is the cumulative number of customers of class that have completed the service and oined class l in the time interval [, t, and D (N (t is the cumulative number of customers of class that have completed the service and left the system in the interval [, t. Then D (N l (t has the representation D (N l (t = It is obvious that E (N (t = E (N +1 s [,t + I (N (t = C (N +1 s [,t 1 {φ 1, (=e l } 1 { da (N,1, (s >, 1 {φ 2, (=e l } 1 { da (N,2, (s >, I (N (t = l K da (N,1, } (s+= }. da (N,2, (2.12 (s+= D (N l (t. (2.13 In addition, the departure process D (N, where D (N (t represents the cumulative number of customers of class that have completed service from the service station in the time interval [, t, can be represented in term of D (N l, l K {}, as D (N (t = D (N l (t. (2.14 l K {} Let S (N denote the cumulative potential reneging process, where S (N (t represents the cumulative number of customers of class whose waiting times in the potential queue have reached their patience times in the interval [, t. Thus, S (N admits the representation S (N (t = E (N (t = E (N +1 s [,t + I (N (t = C (N +1 s [,t 1 { dw (N,1, (s >, 1 { dw (N,2, (s >, dw (N,1, } (2.15 (s+= }. dw (N,2, (s+= Let R (N denote the cumulative reneging process, where R (N (t is the cumulative number of customers of class that have reneged in the time interval [, t. Then R (N admit the representation R (N (t = E (N (t = E (N +1 s [,t + I (N (t = C (N +1 s [,t 1 { w (N,1, 1 { w (N,2, (s χ (N (s χ (N (s, dw (N,1, (s, dw (N,2, (s >, (s >, dw (N,1, } (2.16 (s+= }, dw (N,2, (s+= 8

where the additional restrictions w (N,1, (s χ (N (s and w (N,2, (s χ (N (s are imposed so as to only count the reneging of customers of class (including external and internal customers actually in queue. Here, one considers the left limit χ (N (s of χ (N at time s to capture the situation in which χ (N umps down at time s due to the head-of-the-line customer of class reneging from the queue or entering service. Therefore, for each K, mass balances on the total number of customers of class in the system, the number of customers of class waiting in the potential queue, and the number of customers of class in service, show that and X (N ( + E (N + I (N = X (N + R (N + D (N, (2.17 1, η (N, + E (N + I (N = 1, η (N, + S (N, (2.18 1, ν (N, + L (N = 1, ν (N, + D (N, (2.19 where L (N (t represents the cumulative number of customers of class that have entered service in the interval [, t. In addition, it is also clear that X (N = 1, ν (N, + Q (N. (2.2 Combining (2.17, (2.19 and (2.2, we obtain the following mass balance equation for the number of customers in queue: Q (N ( + E (N + I (N = Q (N + R (N + L (N. (2.21 Furthermore, the non-idling condition taes the form N 1, ν (N, = [N X (N +. Note that if N =, then the above non-idling condition holds automatically. 3 The Fluid Model In this section, we present the fluid model that asymptotically describes the system dynamics in 3.1, and characterize the invariant states for the fluid model in 3.2. 3.1 Fluid Model Equations Recall that I R+ [, denote the subset of non-decreasing functions f D R+ [, with f( =. Define (e, x, ν, η 1, η 2 I R+ [, K R K + Π K M F [, H s. S = Π K M + [, H1, r Π KM + [, H2, r : s 1, ν = [s x +, [x s + 1, η 1, + 1, η2,, (3.1 where M + [, H1, r and M +[, H2, r represent the set of positive, locally finite measures on [, Hr 1, and [, H2, r, respectively. Recall that N /N s as N. The set S serves as the space of possible input data for the fluid model equations. In order to state the definition of fluid model equations, for each = 1, 2 and K, define the hazard rate functions of G r, and Gs in the usual manner: h r, (x. g, r (x = 1 G r, (x, x [, Hr,, (3.2 9

h s (x. g s = (x 1 G s (x, x [, Hs. (3.3 It is easy to verify that h r, L1 loc [, Hr, and hs L1 loc [, Hs. Definition 3.1 The càdlàg function (X, ν, η 1, η 2 defined on R + such that X = (X, K R K +, ν = (ν, K Π K M F [, H s, η1 = (η 1,, K Π K M + [, H1, r, and η2 = (η 2,, K Π K M + [, H2, r is said to solve the fluid model equations associated with the data (E, X(, ν, η 1, η2 S and the hazard rate functions h r, and hs, = 1, 2 and K, if and only if for every t [,, = 1, 2 and K, t h r,, η, s ds <, t h s, ν u du <, (3.4 and the following relations are satisfied: for every f C b (R +, f(x ν [,H s t (dx = f(x + t 1 Gs (x + t [,H s 1 G s (x ν (dx + f(t s(1 G s (t s dl (s, (3.5 where where [,H r 1, f(x η 1, t (dx = [,H r 2, f(x η 2, t (dx = [,t L (t = 1, ν t 1, ν + t h s, ν u du; (3.6 f(x + t 1 Gr 1, (x + t [,H1, r 1 G r 1, (x η 1, (dx + f(t s(1 G r 1, (t s de (s; (3.7 I (t = l K [,t f(x + t 1 Gr 2, (x + t [,H2, r 1 G r 2, (x η 2, (dx + t f(t s(1 G r 2, (t s di (s, (3.8 t P l h s l, νl u du; (3.9 Q (t = X (t 1, ν t ; (3.1 Q (t 1, η 1, t + 1, η 2, t ; (3.11 2 ( t R (t = 1 [,χ (s(uh r [,H, r, (uη, s (du ds, (3.12 =1 where χ (s = (F η s 1 (Q (s and η s and. = η 1, s + η 2, s ; X (t = X ( + E (t + I (t t h s, ν u du R (t; (3.13 s 1, ν t = [s X (t +. (3.14 1

It immediately follows from (3.1 and (3.14 that for each t [,, Q (t = [X (t s +. (3.15 For future use, we also observe that (3.6, (3.1 and (3.13, when combined, show that for every t [, and K, Q ( + E (t + I (t = Q (t + L (t + R (t. (3.16 Remar 3.2 It follows from (3.9 and (3.12 that for each K, processes I and R are continuous. If the process E is also continuous for each K, the relation in (3.13 implies that the process X and then Q by (3.1 are continuous. Moreover, for each K, the continuity of E also implies that L is continuous by (3.16. Remar 3.3 For each K, if s =, the non-idling condition (3.14 holds automatically and in this case, Q (t = χ (t = R (t = for all t by (3.15. We now state the uniqueness result for the solutions to the fluid model equations. deferred to 5. Its proof is Theorem 3.4 Suppose that for each K, g2, r is continuously differentiable on its support [C2, r, Hr 2, and hr 1, is locally bounded. Given any (E, X(, ν, η 1, η2 S such that E is continuous and η 1, η2 are continuous measures, there exists at most one solution (X, ν, η1, η 2 to the associated fluid equations (3.4 (3.14. 3.2 Invariant States Given a positive constant vector λ = (λ : K, a state (x, ν, η 1, η2 such that (e λ, x, ν, η 1, η2 S and η 1, η2 are continuous on R + is said to be an invariant state for the fluid model equations in Definition 3.1 if there is a solution (X, ν, η 1, η 2 to the fluid model equations associated with the initial data (e λ, x, ν, η 1, η2 satisfies (X(t, ν t, η 1 t, η 2 t = (x, ν, η 1, η2 for all t, where e λ (t = λt for each t. Let ν and η,, K, = 1, 2, be the non-negative measures defined as follows: ν [, x = η, [, x = x x (1 G s (ydy, x [, Hs, (3.17 (1 G r, (ydy, x [, Hr,. (3.18 Note that (2.1 implies that ν is actually a finite measure. Define the effective/overall arrival rate of customers of class, λ, as the th entry of the vector λ defined by λ = (I P 1 λ = Hλ. (3.19 Define K 1 := { K : λ m s s }, the set of potentially critically loaded or overloaded service stations in the absence of impatience. Note that / K 1 if s =, which says that any service station with infinite servers can not be potentially critically loaded or overloaded. Let z and χ be K-dimensional vectors satisfying the equation z = (λ g(χ + (I G(χP z, (3.2 11

where g(χ is the K-dimensional vector with its th entry λ G r 1, (χ and G is the K K diagonal matrix with its diagonal entry G r 2, (χ. Let I λ be the set of states defined by I λ = {(x, z ν, λη 1, w η 2, z Z, w = P z, x X z }, (3.21 where Z = z RK + : J K 1 such that z m s = s for J, z m s < s for K \ J and there exists χ R K + such that χ = for each K \ J and (z, χ satisfies (3.2, (3.22 and for each z Z, X z = x RK + : z = λ (1 G r 1, ((F η 1 ((x s + +w (1 G r 2, ((F η 1 ((x s + for J ; x = z m s for K \ J, w = P z, η = λη 1 + wη 2. (3.23 It is clear that, if K 1 =, then Z contains only one element z = λ. On the other hand, if K 1, the set J in (3.22 should be non-empty. Note that for each K 1, the presence of customers impatience may reduce the actual arrival rate to service station from internal routing. Then the actual arrival rate to service station may be less than the effective arrival rate λ. As a result, the set J in (3.22 could be a strict subset of K 1. In general, the set Z may contain several elements depending on the choices of J. Let J be a non-empty subset of K 1. Define z J and χ J as follows. Let z J = s /m s for each J and χ J = for each / J. For each / J, we have from (3.2 that z J = λ + l J P l s l /m s l + l / J P l z J l. Let P J c be the principal submatrix of P by removing rows and columns of P with indexes in J. It is clear from the properties of P and Lemma 7.5 of [5 that I P J c is also invertible and its inverse. H J c = (I P J c 1 has the representation H J c = I + P J c + (P J c2 + (P J c3 +. Let zj J and λ c J c be the vectors obtained from z J and λ by removing entries with indexes in J, respectively. Let P J J c be the submatrix of P by removing rows of P with indexes not in J and columns of P with indexes in J and let s J /m s J be a vector whose entries are {s l/m s l, l J }. Then z J J c = H J c(λ J c + (P J J c s J /m s J. Given the z J defined above, for each J, by (3.2, we have λ 2 z J = λ (1 G r 1,(χ + (1 G r 2,(χ (P z J, and let χ J be any solution to the above equation. This completes the definition of χ J. So it is clear that the z J defined above is in Z if zj J < s c J /m s J. For example, consider Z when K = 2, m s 1 = ms 2 = 1, [ [ [ [ [ λ1 1/4 s1 1/6 1/2 1/4 =, =, and P =. 1/4 5/6 1/4 1/2 s 2 12

It is easy to chec that [ λ1 m s 1 λ 2 m s 2 = [ 1 1 > [ s1 s 2. Thus, K 1 = {1, 2}. Choose J = K 1. Let z 1 = 1/6 and z 2 = 5/6. Obviously, z 1 m s 1 s 1, z 2 m s 2 s 2 and (3.2 has a solution for χ 1 and χ 2. Then z = (1/6, 5/6 Z. Next, choose J = {1}. Let z 1 = 1/6 and χ 2 =. Then we can solve (3.2 for z 2 and χ 1 to get z 2 = 7/12 and χ 1 is a solution to the equation 11 48 Gr 2,1 (χ 1 + 1 4 Gr 1,1 (χ 1 = 5 16. Note that z 2m s 2 = 7/12 < 5/6. Thus, z = (1/6, 7/12 Z. At last, choose J = {2}. Let z 2 = 5/6 and χ 1 =. Then we can solve (3.2 for z 1 and χ 2 to get z 1 = 11/12 and χ 2 is a solution to the equation 12G r 1,2 (χ 2 + 31G r 2,2 (χ 2 = 3. Note that z 1 m s 1 = 11/12 > 1/6. Thus, z = (11/12, 5/6 / Z. Therefore, in this example, Z has two elements. Remar 3.5 There are two possible sources for non-uniqueness of the invariant states in (3.21- (3.23 associated with the fluid equations. One source comes from the fact that Z may have several elements, as shown in the above example. The other comes from (3.23, that is, for a given z Z, X z may have more than one element due to the fact that the patience time distributions G r 1, and G r 2,, K, may not be strictly increasing. This second source is the same for the single-station model in [15 where non-uniqueness is solely due to that the single patience distribution is not strictly increasing. Remar 3.6 When customers in the system have infinite patience, that is, G r, (x = for all x [,, K and = 1, 2, and K 1 = K = { K : λ m s > s }, that is, all service stations are overloaded, the invariant state I λ is actually an empty set. This is consistent with the fact that the overloaded system in the absence of impatience is not stable. In fact, suppose that Z. Then, for each z Z, there exists χ R K + such that (z, χ satisfies (3.2. Note that, in this case, (3.2 is reduced to the equation z = λ + P z and this implies that z = λ. Since λ m s > s for each K, J in (3.22 does not exist. This is a contradiction to the fact that z Z. Thus, when all stations are overloaded, the set Z in (3.22 is an empty set, and so is I λ. Remar 3.7 If the system has only one critically loaded or overloaded service station, that is, K 1 has only one element, or every service station in the system is underloaded, that is, K 1 =, then the set Z can only have a single element. If, in addition, the patience time distributions G r 1, and G r 2,, K, are strictly increasing, then the system has only one invariant state. Remar 3.8 If the system has feed-forward routing, that is, the routing matrix P has the property that P i = for each i, then the set Z also has a unique element and hence the system has unique invariant state if the patience time distributions are all strictly increasing. In fact, if K 1, let K 1 be the increasingly ordered set { 1, 2,, n } and J be a non-empty subset of K 1, which is associated with an invariant state. Note that all service stations with indices less than 1 are underloaded, then z i = λ i for each i < 1. Since service station 1 is the first critically loaded or overloaded station, then z 1 = s 1 /m s 1 and 1 J. For any service station 1 < i < 2, it will remain underloaded since the input rate from service station 1 to service station i is less than the effective input rate due to abandonment at service station 1 and then z i = λ i + <i, 1 H i λ +H i1 s 1 /m s 1. For service station 2, we now that the effective arrival rate is λ 2 + < 2 H i2 λ s 2 /m s 2 and the actual arrival rate is λ 2 + < 2, 1 H i λ +H 2 1 s 1 /m s 1. So if λ 2 + < 2, 1 H i λ + H 2 1 s 1 /m s 1 s 2 /m s 2, then 2 J ; otherwise, 2 / J. Note that whether 2 is in J or not depends only on λ, s, m s for K and P. By a similar argument for the rest of the stations in K 1, we can see that the choice of J is unique based on λ, s, m s for 13

K and P. Thus, there is a unique element in Z. If X z also has a unique element, for example, when all patience time distributions are strictly increasing, then the system has a unique invariant state. Theorem 3.9 (Characterization of the Invariant States Given the arrival rate vector λ = (λ : K, the set I λ gives all invariant states associated with the fluid equations (3.4 (3.14. Proof. Fix the arrival rate vector λ = (λ : K and let e λ (t = λt for each t. We brea the argument into the following two claims. Claim 1. The invariant state is a subset of I λ. Let (x, ν, η 1, η2 be an invariant state and (X, ν, η 1, η 2 be the solution to the fluid equations associated with the initial data (e λ, x, ν, η 1, η2 that satisfies (X t, ν t, η 1 t, η 2 t = (x, ν, η 1, η2 for all t. We will show that (i η 1 = λη1, (ii ν = z ν, η 2 = w η, 2 and x = x for z Z, w = P z and x X z. To establish (i, fix K. Since η 1, t = η 1, for each t, by (3.7, we see that for every f C c (R +, f(x η 1, [,H1, r (dx = f(x + t 1 Gr 1, (x + t [,H1, r 1 G r 1, (x η 1, (dx +λ t f(t s(1 G r 1, (t s ds. Letting t and using the fact that f has compact support, we obtain f(x η 1, [,H1, r (dx = λ f(s(1 G r [,H1, r 1, (s ds, which implies that η 1, (dx = λ (1 G r 1, (x dx = λ η 1, (dx. (3.24 This establishes (i. Next, we focus on establishing (ii. Fix K. Since X (t = x for each t, by (3.15, we have Q (t = (x s + for each t. Since η, t = η, for each t and = 1, 2, we also have η t = η 1, t + η 2, t = η 1, + η 2, = η for each t. This implies, in particular, that for each t, χ (t = (F η t 1 (Q (t = (F η 1 ((x s + = χ. (3.25 Note that χ is a constant function, and thus by (3.12, we can express R (t = (c 1, + c 2, t, for t, where by (3.24, c 1, = 1 [,χ [,H1, r (uhr 1, (uη1, (du = λ G r 1, (χ, and Let c 2, = 1 [,χ [,H2, r (uhr 2, (uη2, (du. (3.26 w = l K P l h s l, νl. (3.27 14

Then it follows from (3.9 that I (t = w t, t. (3.28 Thus, by (3.16, we obtain that L (t = (λ + w c 1, c 2, t, t. Let z = λ + w c 1, c 2,. Now by letting ν t = ν for each t in (3.5, we see that for every f C c(r +, f(xν [,H s (dx = f(x + t 1 Gs (x + t [,H s 1 G s (x ν (dx + z t f(u(1 G s (u du. (3.29 Again, letting t on both sides of (3.29 and using the fact that f has compact support, we obtain f(xν [,H s (dx = z f(u(1 G s [,H s (u du. (3.3 This implies that Since 1, ν s, we then have ν (dx = z (1 G s (xdx = z ν (dx. (3.31 z m s s. (3.32 It follows from (3.31 that for each l K, h s l, νl = z l. Thus, plugging into (3.27, we have w = l K P l z l. (3.33 Next, by letting η 2, t = η 2, for each t in (3.8 and using (3.28, we see that for every f C c (R +, f(xη 2, [,H2, r (dx = f(x+t 1 Gr 2, (x + t [,H2, r 1 G r 2, (x η 2, (dx+w and letting t and using the fact that f has compact support, we obtain f(xη 2, [,H2, r (dx = w f(s(1 G r [,H2, r 2, (s ds. This implies that t f(s(1 G r 2, (s ds, (3.34 η 2, (dx = w (1 G r 2, (xdx = w η 2, (dx. (3.35 Combining the above with (3.26, we have c 2, = w G r 2, (χ. It follows that z = λ (1 G r 1, (χ + w (1 G r 2, (χ. (3.36 Thus, from (3.36 and (3.33, we see that z and χ satisfy (3.2, that is, z = (λ g(χ + (I G(χ P z. As a consequence, we have and From (3.38, it is easy to see that (I (I G(χ P z = λ g(χ (3.37 z = (λ g(χ + (I G(χ P z λ + P z. (3.38 z (I P 1 λ = λ. (3.39 15

For each / K 1, the above implies that z m s λ m s < s. Now, let J = { K 1 : z m s = s }. Thus for each K \ J, by (3.32, z m s < s, then 1, ν < s and hence χ =. Therefore, z Z, which is defined in (3.22. Note that for each z Z, we obtain w = P z by (3.33, and x X z by (3.36 and (3.25. Claim 2. The set I λ is a subset of the invariant state. Fix z Z, w = P z and x X z. Choose a process (X, ν, η 1, η 2 to be such that (X(t, ν t, η 1 t, η 2 t = (x, z ν, λη, 1 w η 2 for each t. We now show that (X, ν, η 1, η 2 is a solution to the fluid equations associated with the initial data (e λ, x, z ν, λη, 1 w η. 2 It is evident to see that (3.4 holds for z ν, λη 1 and w η. 2 For each K and t, let Ī (t. = l K t P l h s l, νl u du = P l h s l, z l νl t = (P z t = w t. (3.4 l K and L (t. = t h s, ν u du = h s, z ν t = z t. (3.41 Thus, (3.6 and (3.9 are satisfied by (X, ν, η 1, η 2. Since z Z, there exists J K 1 and χ R K + such that z ms = s for each J, z ms < s and χ = for each K \ J and (z, χ satisfies (3.2. Since x X z, we have that for each K \ J, x = z ms and for each J, z = λ (1 G r 1, ((F η 1 ((x s + + w (1 Gr 2, ((F η 1 ((x s +, (3.42 where η = λη 1 + w η. 2 For each K \ J and t, let Q (t = R (t =.. Since 1, ν t = 1, z ν = z ms for each t, we have that (3.1 (3.12 and (3.14 hold. Notice that for each t, (e λ (t + Ī(t t h s, ν u du = (λ + w z t =, where the last equality holds due to the fact that (z, χ satisfies (3.2. This shows that (3.13 holds. Next for each J and t, let Q (t = x s. Since (z, χ satisfies (3.2, we have that z = λ (1 G r 1, (χ + (1 Gr 2, (χ w. This and (3.42 imply that χ = (F η 1 (x s. For each t, let R (t. = (λ G r 1, (χ + w Gr 2, (χ t. Hence, we have checed that (3.1 (3.14 hold. It remains to show that (3.5, (3.7 and (3.8 hold. But this can be readily verified using the expressions of (ν, η 1, η 2, Ī in (3.4 and K in (3.41. Hence (X, ν, η 1, η 2 is a solution to the fluid equations associated with the initial data (e λ, x, z ν, λη, 1 w η. 2 Then we have that (x, z ν, λη, 1 w η 2 is an invariant state. This established Claim 2 and completes the proof of Theorem 3.9. 4 Two Special Cases 4.1 A Single Station with Immediate Feedbac In this section, we consider the special case of our model where there is a single station of finitely many servers with immediate feedbac. The subscript or superscript is omitted to simplify the 16

notation. The Marovian routing probabilities P 1 = p and P 11 = 1 p, that is, after each service completion, with probability p the customer will leave the system and with probability 1 p the customer will reenter the system. The effective arrival rate λ = (1 P 11 1 λ = λ/p. Without loss of generality, let s 1 = 1. The underloaded, critically loaded and overloaded regimes are determined by λm s < 1, λm s = 1, and λm s > 1, respectively. As a special case of Theorem 3.9, we obtain the following theorem for the invariant states associated with the fluid equations for this model of a single station with immediate feedbac. Theorem 4.1 (Invariant States for A Single Station with Immediate Feedbac (i If the system is underloaded, λm s < p, or critically loaded, λm s = p, then the invariant state has a unique element (x, z ν, λη 1, w η 2 given by x = λm s /p, z = λ/p, w = (1 pz = (1 pλ/p, (4.1 and the head-of-line waiting time χ =. (ii If the system is overloaded, λm s > p, then any invariant state (x, z ν, λη 1, w η 2 has the representation: z = 1/m s, w = (1 pz = (1 p/m s, (4.2 and x = 1 + λ χ (1 G r 1(udu + 1 p m s χ where the head-of-line waiting time χ is a solution to the equation (1 G r 2(udu, (4.3 λm s (1 G r 1( χ = 1 (1 p(1 G r 2( χ. (4.4 (iii In (ii, if, in addition, G r 1 = Gr 2 = Gr and m r 1 = mr 2 = mr, then any invariant state has z and w in (4.2, and x = 1 + (λ + (1 p/m s χ where the head-of-line waiting time χ is a solution to the equation (1 G r (udu, (4.5 G r ( χ = 1 (λm s + (1 p 1. (4.6 Note that the equation (4.6 may not have unique solution for χ. When the service and patience time distributions are all exponential, we obtain the following corollary and as can be easily seen, the invariant states are unique. Corollary 4.2 Assume that G s (x = 1 e µx, and G r (x = 1 e θ x, = 1, 2, with µ, θ (,. (i If the system is underloaded λ/(pµ < 1 or critically loaded λ/(pµ = 1, then the invariant state has a unique element (x, z ν, λη 1, w η 2 given by x = λ/(pµ, z = λ/p, w = (1 pz = (1 pλ/p, (4.7 and the head-of-line waiting time χ =. (ii If the system is overloaded λ/(pµ > 1, then the invariant state has a unique element: z = µ, w = (1 pµ, x = 1 + λ θ 1 (1 e θ 1 χ + 17 (1 pµ θ 2 (1 e θ 2 χ, (4.8

where the head-of-line waiting time χ is the unique solution to the equation (1 (1 pe θ 2 χ µ = λe θ 1 χ. (4.9 (iii In (ii, if, in addition, θ 1 = θ 2 = θ, then the invariant state has a unique element: z and w are in (4.8, and x = 1 + λ pµ. (4.1 θ The head-of-line waiting time χ = θ 1 log(λ/µ + (1 p. (4.11 We give a numerical example in Table 1 to illustrate our results in the model with one service station. In the first two models, the interarrival, service and patience times are all exponential with parameter values λ = 1, µ = 1, p =.4, n = 2, and with identical abandonment rate θ 1 = θ 2 =.5 in the first model and different abandonment rates θ 1 =.5 and θ 2 = 1 in the second model. In the third model, the arrival process is Poisson with rate λ = 1, the service time distribution is H 2 with density f(x = e 2x + 3 1 e 2x/3 and mean 1, the patience time distribution of new customers is H 2 with density g 1 (x =.5e x + 6 1 e x/3 and mean 2, and the patience time distribution of feedbac customers is E 2 with mean 2. In all these models, the systems are in the overloaded regime. We conducted simulations to validate the heavy-traffic approximations in Theorem 4.1 and Corollary 4.2, in particular, we compare the heavy-traffic approximations and the simulated steady-state values of the average queue sizes Q 1 and Q 2 of new customers and feedbac customers, respectively, and the average waiting-time χ of the customer at the head-of-the-line. We see that the approximations match very well with the simulation results, and, more importantly, the impact of differentiated patience time distributions of new and feedbac customers is evident by the results in the first two models. The values below the simulation values are the halfwih for the 95% confidence interval. The simulation results are the estimates from one sample path in the time interval [2, 5, and the halfwihs of the 95% confidence interval are obtained by running four more independent simulations and using Student t-distribution with three degrees of freedom for each model. Table 1: Comparison of the fluid approximations and simulations in many-server models of a single station with immediate feedbac and differentiated patience-time distributions. Model Q 1 Q 2 χ Sim. Approx. Sim. Approx. Sim. Approx. M/M/n + M/M 18.2226 18.1818 21.517 21.8182.1922.196 θ 1 = θ 2 ±.624 ±.63 ±.54 M/M/n + M/M 12.1571 12.197 13.756 13.992.1259.124 θ 1 θ 2 ±.79 ±.691 ±.64 M/H 2 /n + H 2 /E 2 24.9345 24.967 32.1587 32.3598.2748.2726 ±.581 ±.766 ±.65 4.2 A Tandem Networ of Two Many-Server Queues with Abandonment In this section, we consider a second special case: a tandem networ of two service stations of finitely many servers with abandonment, where customers at each service station have patience time distribution of positive finite mean. Here K = 2 and the Marovian routing probabilities 18

P 11 =, P 12 = p, P 1 = 1 p, P 2 = 1, P 21 = and P 22 =, that is, after service completion at the first station, customers move to the second queue with probability p and leave the system with probability 1 p, and after service completion at the second station, customers must leave the system. We assume that p (, 1. When p = 1, all customers completing service from the first station will oin the second station. Let m r, and Gr,,e be the mean and stationary excess distribution function associated with the distribution G r,2 for = 1, 2 and = 1, 2. Recall that = 1 for external arrivals and = 2 for internal customers. As a special case of Theorem 3.9, we give the invariant states associated with the fluid equations for this tandem networ model below. As in Corollary 4.2, it is also easy to verify that the invariant state is unique when the service and patience times are exponential. Theorem 4.3 (Invariant States for a tandem networ of two many-server queues with abandonment (i If λ 1 m s 1 s 1 and (λ 1 p + λ 2 m s 2 s 2, then the invariant state has a unique element (x, z ν, λη 1, w η 2 given by x = (x 1, x 2 = (λ 1 m s 1, (λ 1 p + λ 2 m s 2, z = (z 1, z 2 = (λ 1, λ 1 p + λ 2, (4.12 w = (w 1, w 2 = (, λ 1 p, χ = ( χ 1, χ 2 = (,. (4.13 (ii If λ 1 m s 1 s 1 and (λ 1 p + λ 2 m s 2 > s 2, then any invariant state (x, z ν, λη 1, w η 2 has the representation: z = (z 1, z 2 = (λ 1, s 2 /m s 2, w = (w 1, w 2 = (, λ 1 p, x 1 = λ 1 m s 1, χ 1 =, (4.14 where χ 2 is a solution to the equation x 2 = s 2 + λ 1 pm r 2,2G r 2,2,e( χ 2 + λ 2 m r 1,2G r 1,2,e( χ 2, (4.15 λ 1 p(1 G r 2,2( χ 2 + λ 2 (1 G r 1,2( χ 2 = s 2 /m s 2. (4.16 (iii If λ 1 m s 1 > s 1 and (s 1 p/m s 1 + λ 2m s 2 s 2, then any invariant state (x, z ν, λη 1, w η 2 has the representation: z = (z 1, z 2 = (s 1 /m s 1, s 1 p/m s 1 + λ 2, w = (w 1, w 2 = (, s 1 p/m s 1, (4.17 where χ 1 x 1 = s 1 + λ 1 m r 1,1G r 1,1,e( χ 1, x 2 = (s 1 p/m s 1 + λ 2 m s 2, χ 2 =, (4.18 is a solution to the equation λ 1 (1 G r 1,1( χ 1 = s 1 /m s 1. (4.19 (iv If λ 1 m s 1 > s 1 and (s 1 p/m s 1 + λ 2m s 2 > s 2, then any invariant state (x, z ν, λη 1, w η 2 has the representation: z = (z 1, z 2 = (s 1 /m s 1, s 2 /m s 2, w = (w 1, w 2 = (, s 1 p/m s 1, (4.2 x 1 has the same expression in (4.18 with χ 1 being a solution to the equation (4.19, and x 2 is x 2 = s 2 + s 1 pm r 2,2G r 2,2,e( χ 2/m s 1 + λ 2 m r 1,2G r 1,2,e( χ 2, (4.21 where χ 2 is a solution to the following equation s 1 p(1 G r 2,2( χ 2/m s 1 + λ 2 (1 G r 1,2( χ 2 = s 2 /m s 2. (4.22 19