Overflow Networks: Approximations and Implications to Call-Center Outsourcing

Similar documents
MODELING WEBCHAT SERVICE CENTER WITH MANY LPS SERVERS

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing

Beyond heavy-traffic regimes: Universal bounds and controls for the single-server (M/GI/1+GI) queue

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Designing a Telephone Call Center with Impatient Customers

E-Companion to Fully Sequential Procedures for Large-Scale Ranking-and-Selection Problems in Parallel Computing Environments

Author's personal copy

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers

Managing Service Systems with an Offline Waiting Option and Customer Abandonment: Companion Note

Dynamic Control of Parallel-Server Systems

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

On the Resource/Performance Tradeoff in Large Scale Queueing Systems

Economy of Scale in Multiserver Service Systems: A Retrospective. Ward Whitt. IEOR Department. Columbia University

Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers

A Diffusion Approximation for the G/GI/n/m Queue

TOWARDS BETTER MULTI-CLASS PARAMETRIC-DECOMPOSITION APPROXIMATIONS FOR OPEN QUEUEING NETWORKS

Cross-Selling in a Call Center with a Heterogeneous Customer Population. Technical Appendix

LIMITS FOR QUEUES AS THE WAITING ROOM GROWS. Bell Communications Research AT&T Bell Laboratories Red Bank, NJ Murray Hill, NJ 07974

Stochastic-Process Limits

Load Balancing in Distributed Service System: A Survey

Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers

Asymptotic Coupling of an SPDE, with Applications to Many-Server Queues

A FUNCTIONAL CENTRAL LIMIT THEOREM FOR MARKOV ADDITIVE ARRIVAL PROCESSES AND ITS APPLICATIONS TO QUEUEING SYSTEMS

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

A Robust Queueing Network Analyzer Based on Indices of Dispersion

Modeling Service Networks with Time-Varying Demand

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

Stein s Method for Steady-State Approximations: Error Bounds and Engineering Solutions

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

A Diffusion Approximation for Stationary Distribution of Many-Server Queueing System In Halfin-Whitt Regime

Design, staffing and control of large service systems: The case of a single customer class and multiple server types. April 2, 2004 DRAFT.

This paper investigates the impact of dependence among successive service times on the transient and

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

A Review of Basic FCLT s

Appendix. A. Simulation study on the effect of aggregate data

Stochastic Networks and Parameter Uncertainty

Service Level Agreements in Call Centers: Perils and Prescriptions

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

Classical Queueing Models.

State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems. J. G. Dai. Tolga Tezcan

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

Markovian N-Server Queues (Birth & Death Models)

Queues with Many Servers and Impatient Customers

A Note on the Event Horizon for a Processor Sharing Queue

Stochastic relations of random variables and processes

Convexity Properties of Loss and Overflow Functions

Service-Level Differentiation in Many-Server Service Systems Via Queue-Ratio Routing

Design and evaluation of overloaded service systems with skill based routing, under FCFS policies

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 22 12/09/2013. Skorokhod Mapping Theorem. Reflected Brownian Motion

Motivated by models of tenant assignment in public housing, we study approximating deterministic fluid

A DIFFUSION APPROXIMATION FOR THE G/GI/n/m QUEUE

Positive Harris Recurrence and Diffusion Scale Analysis of a Push Pull Queueing Network. Haifa Statistics Seminar May 5, 2008

Erlang-C = M/M/N. agents. queue ACD. arrivals. busy ACD. queue. abandonment BACK FRONT. lost calls. arrivals. lost calls

Stochastic process. X, a series of random variables indexed by t

Many-Server Loss Models with Non-Poisson Time-Varying Arrivals

Analysis of an M/G/1 queue with customer impatience and an adaptive arrival process

Solution: The process is a compound Poisson Process with E[N (t)] = λt/p by Wald's equation.

Continuous-Time Markov Chain

Designing load balancing and admission control policies: lessons from NDS regime

Making Delay Announcements

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads

Stein s Method for Steady-State Approximations: A Toolbox of Techniques

Simple and explicit bounds for multi-server queues with universal (and better) scaling

Dynamics of Order Positions and Related Queues in a LOB. Xin Guo UC Berkeley

BRAVO for QED Queues

Managing Call Centers with Many Strategic Agents

Proactive Care with Degrading Class Types

HEAVY-TRAFFIC EXTREME-VALUE LIMITS FOR QUEUES

Service-Level Differentiation in Many-Server Service Systems: A Solution Based on Fixed-Queue-Ratio Routing

HEAVY-TRAFFIC LIMITS VIA AN AVERAGING PRINCIPLE FOR SERVICE SYSTEMS RESPONDING TO UNEXPECTED OVERLOADS OHAD PERRY

HEAVY-TRAFFIC LIMITS FOR STATIONARY NETWORK FLOWS. By Ward Whitt, and Wei You November 28, 2018

The Generalized Drift Skorokhod Problem in One Dimension: Its solution and application to the GI/GI/1 + GI and. M/M/N/N transient distributions

Introduction to queuing theory

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY SUPPLEMENTARY MATERIAL

Time Reversibility and Burke s Theorem

The G/GI/N Queue in the Halfin-Whitt Regime I: Infinite Server Queue System Equations

APPENDIX to Stabilizing Performance in Many-Server Queues with Time-Varying Arrivals and Customer Feedback

Transient Analysis of a Series Configuration Queueing System

THE HEAVY-TRAFFIC BOTTLENECK PHENOMENON IN OPEN QUEUEING NETWORKS. S. Suresh and W. Whitt AT&T Bell Laboratories Murray Hill, New Jersey 07974

State-dependent Limited Processor Sharing - Approximations and Optimal Control

Control of Fork-Join Networks in Heavy-Traffic

STAFFING A CALL CENTER WITH UNCERTAIN ARRIVAL RATE AND ABSENTEEISM

TCOM 501: Networking Theory & Fundamentals. Lecture 6 February 19, 2003 Prof. Yannis A. Korilis

Performance Evaluation of Queuing Systems

Other properties of M M 1

Estimation of arrival and service rates for M/M/c queue system

Fair Dynamic Routing in Large-Scale Heterogeneous-Server Systems

Maximum pressure policies for stochastic processing networks

Lecture 20: Reversible Processes and Queues

J. MEDHI STOCHASTIC MODELS IN QUEUEING THEORY

ANALYSIS OF THE LAW OF THE ITERATED LOGARITHM FOR THE IDLE TIME OF A CUSTOMER IN MULTIPHASE QUEUES

Moments of the maximum of the Gaussian random walk

IEOR 8100: Topics in OR: Asymptotic Methods in Queueing Theory. Fall 2009, Professor Whitt. Class Lecture Notes: Wednesday, September 9.

Quality of Real-Time Streaming in Wireless Cellular Networks : Stochastic Modeling and Analysis

Transcription:

Overflow Networks: Approximations and Implications to Call-Center Outsourcing Itai Gurvich (Northwestern University) Joint work with Ohad Perry (CWI)

Call Centers with Overflow λ 1 λ 2 Source of complexity: dependence

Motivating Example 1 C O s (N O ) = capacity cost function for station O. W(t) = virtual waiting time at time t. (Similarly, W I (t) and W O (t)) W(t) = W I (t)1{x I (t) < N I + K I } + W O (t)1{x I (t) = N I + K I } min NO Cs O (N O ) s.t. E [f (W O (t))1{x I (t) = N I + K I }] α K λ θ θ λ 1 N O Z +, NI NO µ I µ O

Motivating Example 2 λ λ 1 λ 2 min i E[ T 0 C i(q π i (s))ds] s.t. π Π.

Motivating Example 2 λ λ 1 λ 2 min i E[ T 0 C i(q π i (s))ds] s.t. π Π. Optimal policy may benefit from state information on in-house

Related Literature Blocking and overflow: Exact characterization: Van Doorn ( 83); Approximations: Whitt ( 83), Koole et. al ( 00, 05); Heavy Traffic: Hunt and Kurtz ( 94), Koçaga and Ward ( 10), Pang et. al ( 07), Whitt ( 04); Outsourcing and optimization: Gans and Zhou ( 07); Chevalier et. al. ( 03, 04); Technical: Whitt ( 91), Bassamboo et. al ( 05), Perry and Whitt ( 10a); Glynn and Whitt ( 93), Perry and Whitt ( 10b);

Basic Model A(t) - number of arrivals to station I by time t: A(t) is a Poisson process with rate λ. A O (t) - number of overflowed calls by time t. K λ θ θ A I (t) = A(t) A O (t) - arrivals entering station I. X I (t), X O (t) - total number in respective system at t. NI NO K I - threshold in station I (K I 0). µ I µ O In isolation: station O is an GI/M/N + M queue.

Main Results A sequence of overflow networks in a many-server heavy-traffic regime 1 Functional Central Limit Theorem (FCLT) for Overflow Process 2 Pointwise Stationarity and Asymptotic Independence

Asymptotic (Heavy Traffic) Analysis We consider a sequence of networks indexed by arrival rate λ, with λ. Main Assumption: 1 Non-negligible overflow: K λ θ θ µ I NI λ + θki λ ν := lim λ λ < 1 NI NO µ I µ O 1 ν interpreted as rough estimate for the (steady-state) blocking probability

Asymptotic (Heavy Traffic) Analysis We consider a sequence of networks indexed by arrival rate λ, with λ. Main Assumption: 1 Non-negligible overflow: K λ θ θ µ I NI λ + θki λ ν := lim λ λ < 1 2 Sufficient capacity in station O: NI NO N λ O = λ µ IN λ I θ I K λ I µ O + o(λ) µ I µ O 1 ν interpreted as rough estimate for the (steady-state) blocking probability

First Result D λ I (t) = N λ I K λ I X λ I (t), D λ I (t) := Dλ I (t) λ, Â λ O(t) := Aλ O (t) (λ µ INI λ θki λ )t, X O(t) λ := Xλ O (t) λ µ INλ I θkλ I µ λ O λ

First Result D λ I (t) = N λ I K λ I X λ I (t), D λ I (t) := Dλ I (t) λ, Â λ O(t) := Aλ O (t) (λ µ INI λ θki λ )t, X O(t) λ := Xλ O (t) λ µ INλ I θkλ I µ λ O λ Theorem If D λ I (0) 0, then ( D λ I, Âλ O ) (0, σb), u.o.c., where B is a standard Brownian motion and σ 2 = 1 + ν.

First Result D λ I (t) = N λ I K λ I X λ I (t), D λ I (t) := Dλ I (t) λ, Â λ O(t) := Aλ O (t) (λ µ INI λ θki λ )t, X O(t) λ := Xλ O (t) λ µ INλ I θkλ I µ λ O λ Theorem If D λ I (0) 0, then ( D λ I, Âλ O ) (0, σb), u.o.c., where B is a standard Brownian motion and σ 2 = 1 + ν. Recall: ν := lim λ µ I N λ I + θk λ I λ

Outline of the proof: D λ I 0 D λ I (t) := Nλ I + KI λ XI λ(t) When close to threshold: rate µ I NI λ + θki λ, rate = λ. Slowing D λ I (t) down gives an M/M/1 with arrival rate = µ IN λ I + θk λ I λ ν < 1 and service rate = λ λ = 1 Let Q b (t) be M/M/1 with arrival rate ν and service rate 1. Then, {D λ I (t) : t [0, T)} {Q b(t) : t [0, λt)}. From extreme-value theory for M/M/1: sup t T D λ I (t) = O(log(λT))

Outline of the proof:  λ O σb A λ O(t) = t 0 1{D λ I (s) = 0}dA λ (s) D λ I completes O(λ) cycles over any time interval [s, t] Functional limits for the cumulative processes t 0 1{D λ I (s) = 0}ds (1 ν)t ( t ) λ 1{D λ I (s) = 0}ds (1 ν)t σb(t) 0 Functional limit for  λ O follows from that for t 0 1{Dλ I (s) = 0}ds

Outline of the proof:  λ O σb A λ O(t) = t 0 1{D λ I (s) = 0}dA λ (s) D λ I completes O(λ) cycles over any time interval [s, t] Functional limits for the cumulative processes t 0 1{D λ I (s) = 0}ds (1 ν)t ( t ) λ 1{D λ I (s) = 0}ds (1 ν)t σb(t) 0 Functional limit for  λ O follows from that for t 0 1{Dλ I (s) = 0}ds An Averaging Principle (AP):  λ O is driven by a process that moves at a different time scale

Implications Approximating (complicated) overflow process with a simple process: A λ O (t) (λ µ IN λ I θk λ I )t + λσb(t), with the approximation being asymptotically exact. A λ O (t) is close to a renewal process with mean inter-arrival time (λ µ I N λ I θ I K λ I ) 1 and squared coefficient of variation (SCV) λσ 2 λ µ I N λ I θk λ I σ2 (1 ν) 1.

Implications Approximating (complicated) overflow process with a simple process: A λ O (t) (λ µ IN λ I θk λ I )t + λσb(t), with the approximation being asymptotically exact. A λ O (t) is close to a renewal process with mean inter-arrival time (λ µ I N λ I θ I K λ I ) 1 and squared coefficient of variation (SCV) λσ 2 λ µ I N λ I θk λ I σ2 (1 ν) 1. Simpler than the original overflow renewal process.

Implications Cont. In isolation, station O is a GI/M/N + M queue. Using overflow convergence and known results for GI/M/N + M, ( D λ I, X O) λ (0, X O ) Asymptotic independence in a trivialized sense

Implications Cont. In isolation, station O is a GI/M/N + M queue. Using overflow convergence and known results for GI/M/N + M, ( D λ I, X O) λ (0, X O ) Asymptotic independence in a trivialized sense What does this limit imply for joint distributions?... not much... E[W λ (t)] = E[W λ I (t)1{x λ I (t) < N λ I + K λ I }] + E[W λ O(t)1{X λ I (t) = N λ I + K λ I }].

Independence of the Limits Independence of limits does not carry over to the pre-limits. Example: 1/ λ, w.p. 1/2 1, if Y λ > 0 Y λ := X λ := 0, w.p. 1/2 0, otherwise 1 w.p. 1/2 (Y λ, X λ ) (0, X), where X = 0 w.p. 1/2 Trivially, the limits 0 and X are independent. However, 1/2 = P{X λ > 0, Y λ > 0} P{X λ > 0}P{Y λ > 0} = 1/4, for all λ, no matter how large.

Asymptotic Independence Natural scale of station I = constant Natural scale of station O = λ Theorem (asymptotic independence) D λ I is asymptotically independent of X O λ, i.e, for all t > 0, { } { } } P D λ I (t) x, X O(t) λ y = P D λ I (t) x P { X O(t) λ y + o(1)

Asymptotic Independence Natural scale of station I = constant Natural scale of station O = λ Theorem (asymptotic independence) D λ I is asymptotically independent of X O λ, i.e, for all t > 0, { } { } } P D λ I (t) x, X O(t) λ y = P D λ I (t) x P { X O(t) λ y + o(1) Note that X λ O is scaled, but Dλ I is not (requires refined analysis).

Main Idea of the Proof Showing asymptotic independence of the sequence in n via asymptotic independence of the process in t

Main Idea of the Proof Showing asymptotic independence of the sequence in n via asymptotic independence of the process in t (*) Recall that {D λ I (s) : t s t + ɛ} {Q b(s) : t s t + λɛ} for λ large, with Q b denoting a M/M/1.

Main Idea of the Proof Showing asymptotic independence of the sequence in n via asymptotic independence of the process in t (*) Recall that {D λ I (s) : t s t + ɛ} {Q b(s) : t s t + λɛ} for λ large, with Q b denoting a M/M/1. (**) Q b (t + λɛ) Q b ( ) as λ for all ɛ > 0.

Main Idea of the Proof Showing asymptotic independence of the sequence in n via asymptotic independence of the process in t (*) Recall that {D λ I (s) : t s t + ɛ} {Q b(s) : t s t + λɛ} for λ large, with Q b denoting a M/M/1. (**) Q b (t + λɛ) Q b ( ) as λ for all ɛ > 0. Steady state Q b ( ) is independent of Q b (t).

Main Idea of the Proof Showing asymptotic independence of the sequence in n via asymptotic independence of the process in t (*) Recall that {D λ I (s) : t s t + ɛ} {Q b(s) : t s t + λɛ} for λ large, with Q b denoting a M/M/1. (**) Q b (t + λɛ) Q b ( ) as λ for all ɛ > 0. Steady state Q b ( ) is independent of Q b (t). X λ O has a continuous limit hence hardly changes within ɛ, X λ O(t + ɛ) X λ O(t).

Pointwise Stationarity The following pointwise stationarity follows from (*) and (**): Theorem (pointwise stationarity) D λ I (t) Q b( ) in R as λ.

Pointwise Stationarity The following pointwise stationarity follows from (*) and (**): Theorem (pointwise stationarity) D λ I (t) Q b( ) in R as λ. Pointwise stationarity and asymptotic independence allow us to obtain performance metrics by treating Station I as a stationary M/M/N/K + M queue Station O as a GI/M/N + M queue (that is independent of station I) P{D λ I (t) d, XO(t) λ q} = P{D λ I ( ) d}p{xo(t) λ q} Overflow approximation simplifies analysis of station O

Waiting Times and Asymptotic ASTA w λ k, wλ I,k, wλ O,k - waiting time of kth arrival to respective station. f is a continuous and bounded function or of the form f (x) := 1{x > τ}. Theorem (asymptotic finite-horizon ASTA) For all t > 0, lim E 1 λ A λ (t) A λ (t) k=1 f (w λ k ) = ν 1 t t 0 [ ] E f (Ŵ I (s)) ds+(1 ν) 1 t t 0 [ ] E f (Ŵ O (s))ds. where Ŵ O is the diffusion limit of the virtual waiting-time process in the GI/M/N + M queue and Ŵ I K I.

Generalizing to multiple classes λ λ 1 λ 2 min i E[ T 0 C i(q π i (s))ds] s.t. π Π. Theorem: Benefit from in-house state information is marginal.

Summary Motivated by an outsourcing problem, we considered an overflow system: from M/M/N I /K I + M to G/M/N O + M. Under a resource pooling condition our heavy traffic analysis: provides a simple approximation for the overflow renewal process, which is asymptotically correct. proves that in-house is asymptotically independent of outsourcer. Proofs build on a separation of time scales and a resulting AP and pointwise stationarity. Results are applied to waiting times and virtual waiting times. Generalized to more complicated systems (if queues are C-tight).

Questions?