Delay Announcements. Predicting Queueing Delays for Delay Announcements. IEOR 4615, Service Engineering, Professor Whitt. Lecture 21, April 21, 2015

Similar documents
Making Delay Announcements

The Performance Impact of Delay Announcements

DELAY PREDICTORS FOR CUSTOMER SERVICE SYSTEMS WITH TIME-VARYING PARAMETERS. Rouba Ibrahim Ward Whitt

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS

Wait-Time Predictors for Customer Service Systems With Time-Varying Demand and Capacity

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS

Wait-Time Predictors for Customer Service Systems with Time-Varying Demand and Capacity

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY SUPPLEMENTARY MATERIAL

Real-Time Delay Prediction in Customer Service Systems

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015

ec1 e-companion to Liu and Whitt: Stabilizing Performance

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates

Q = (c) Assuming that Ricoh has been working continuously for 7 days, what is the probability that it will remain working at least 8 more days?

Managing Call Centers with Many Strategic Agents

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/6/16. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/25/17. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Modeling Service Networks with Time-Varying Demand

ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL

Queueing Systems: Lecture 3. Amedeo R. Odoni October 18, Announcements

ESTIMATING WAITING TIMES WITH THE TIME-VARYING LITTLE S LAW

Dynamic Control of Parallel-Server Systems

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

APPENDIX to Stabilizing Performance in Many-Server Queues with Time-Varying Arrivals and Customer Feedback

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

We consider how two networked large-scale service systems that normally operate separately, such as call

State-Dependent Estimation of Delay Distributions in Fork-Join Networks

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

(b) What is the variance of the time until the second customer arrives, starting empty, assuming that we measure time in minutes?

Service Engineering January Laws of Congestion

Stationary Birth-and-Death Processes Fit to Queues with Periodic Arrival Rate Functions

STAFFING A CALL CENTER WITH UNCERTAIN ARRIVAL RATE AND ABSENTEEISM

Non Markovian Queues (contd.)

What You Should Know About Queueing Models To Set Staffing Requirements in Service Systems

A multi-class fluid model for a contact center with skill-based routing

Designing a Telephone Call Center with Impatient Customers

Stochastic grey-box modeling of queueing systems: fitting birth-and-death processes to data

λ λ λ In-class problems

Data-Based Resource-View of Service Networks: Performance Analysis, Delay Prediction and Asymptotics. PhD Proposal. Nitzan Carmeli.

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

Chapter 6 Queueing Models. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

QUEUING SYSTEM. Yetunde Folajimi, PhD

Many-Server Loss Models with Non-Poisson Time-Varying Arrivals

SERVER STAFFING TO MEET TIME-VARYING DEMAND

SENSITIVITY OF PERFORMANCE IN THE ERLANG-A QUEUEING MODEL TO CHANGES IN THE MODEL PARAMETERS

The Offered-Load Process: Modeling, Inference and Applications. Research Thesis

Stabilizing Performance in a Service System with Time-Varying Arrivals and Customer Feedback

SOLUTIONS IEOR 3106: Second Midterm Exam, Chapters 5-6, November 8, 2012

ANALYTICAL AND EMPIRICAL INVESTIGATIONS OF TICKET QUEUES: IMPLICATIONS FOR SYSTEM PERFORMANCE, CUSTOMER PERCEPTIONS AND BEHAVIOR

STABILIZING PERFORMANCE IN NETWORKS OF QUEUES WITH TIME-VARYING ARRIVAL RATES

IEOR 6711: Stochastic Models I, Fall 2003, Professor Whitt. Solutions to Final Exam: Thursday, December 18.

IEOR 6711, HMWK 5, Professor Sigman

Economy of Scale in Multiserver Service Systems: A Retrospective. Ward Whitt. IEOR Department. Columbia University

GI/M/1 and GI/M/m queuing systems

USING DIFFERENT RESPONSE-TIME REQUIREMENTS TO SMOOTH TIME-VARYING DEMAND FOR SERVICE

Continuous-Time Markov Chain

M/G/1 and Priority Queueing

Analysis of a Two-Phase Queueing System with Impatient Customers and Multiple Vacations

Stochastic Networks and Parameter Uncertainty

A Study on M x /G/1 Queuing System with Essential, Optional Service, Modified Vacation and Setup time

BIRTH DEATH PROCESSES AND QUEUEING SYSTEMS

1 Delayed Renewal Processes: Exploiting Laplace Transforms

Slides 9: Queuing Models

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1

The G t /GI/s t + GI Many-Server Fluid Queue: Longer Online Version with Appendix

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers

THE HEAVY-TRAFFIC BOTTLENECK PHENOMENON IN OPEN QUEUEING NETWORKS. S. Suresh and W. Whitt AT&T Bell Laboratories Murray Hill, New Jersey 07974

Abandonment and Customers Patience in Tele-Queues. The Palm/Erlang-A Model.

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

IEOR 4106: Spring Solutions to Homework Assignment 7: Due on Tuesday, March 22.

Managing Information in Queues: The Impact of Giving Delayed Information to Customers

Link Models for Packet Switching

Erlang-C = M/M/N. agents. queue ACD. arrivals. busy ACD. queue. abandonment BACK FRONT. lost calls. arrivals. lost calls

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing

Systems Simulation Chapter 6: Queuing Models

Online Supplement to Creating Work Breaks From Available Idleness

Figure 10.1: Recording when the event E occurs

Operations Research II, IEOR161 University of California, Berkeley Spring 2007 Final Exam. Name: Student ID:

Since D has an exponential distribution, E[D] = 0.09 years. Since {A(t) : t 0} is a Poisson process with rate λ = 10, 000, A(0.

The M/M/n+G Queue: Summary of Performance Measures

Online Supplement to Creating Work Breaks From Available Idleness

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

The Offered-Load Process: Modeling, Inference and Applications. Research Thesis

Review of Queuing Models

Classical Queueing Models.

Stationary Birth-and-Death Processes Fit to Queues with Periodic Arrival Rate Functions

Stationary remaining service time conditional on queue length

CS 798: Homework Assignment 3 (Queueing Theory)

Readings: Finish Section 5.2

An analysis of queues with delayed information and time-varying arrival rates

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL

An M/M/1/N Queuing system with Encouraged Arrivals

Computer Systems Modelling

Introduction to Queueing Theory with Applications to Air Transportation Systems

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

MODELING WEBCHAT SERVICE CENTER WITH MANY LPS SERVERS

QUEUING MODELS AND MARKOV PROCESSES

Networking = Plumbing. Queueing Analysis: I. Last Lecture. Lecture Outline. Jeremiah Deng. 29 July 2013

A Diffusion Approximation for the G/GI/n/m Queue

BRAVO for QED Queues

Introduction to queuing theory

Transcription:

Delay Announcements Predicting Queueing Delays for Delay Announcements IEOR 4615, Service Engineering, Professor Whitt Lecture 21, April 21, 2015

OUTLINE Delay Announcements: Why, What and When? [Slide 3] Delay Prediction: How? and How Accurate? [Slide 6] Alternative Delay Estimators [Slide 7] Evaluating the Accuracy [Slide 11] Different Cases [Slide 14] A/M/s [Slide 15] A/M/s+M (exponential abandonment) [Slide 21] A/M/s+GI (non-exponential abandonment) [Slide 25] M t /M/s (time-varying arrival rate) [Slide 28] (References at end)

The Purpose of Delay Announcements Why might a service system manager want to tell each customer an estimate of the delay that customer will experience? Improve customer satisfaction. Propositions about the Psychology of Waiting, Maister (1985), Lec. 1 Uncertain Waits feel longer than known finite waits. Anxiety makes waits seem longer. Unexplained Waits feel longer than explained waits. Improve performance for the customers who are served. By inducing some customers to balk or abandon earlier and then retry later when the system is more lightly loaded.

Making Delay Announcements: Some Questions What delay to predict? delay before entering service (assuming will not abandon) response time, i.e., delay until completing service When to predict and announce? immediately upon arrival throughout time in queue (continuously or periodically) What to announce/report? Customer ability to process information? single number w, may be different for each customer full distribution or partial summary, e.g., mean and variance. past prediction accuracy average error or variance. explanation of cause of delay.

Examples of Possible Delay Announcements When the delay will be short: We should be able to serve you soon. The last customer to enter service waited only one minute. (Aims to encourage customer to wait.) When the delay will be long: We are currently experiencing unexpected high demand; the last customer to enter service had to wait w minutes before beginning service. We will do our best to serve you without excessive delay, but you might want to try again later. (Aims to encourage customer to balk or abandon sooner and then retry later. By doing so, aims to provide better service to the customers who are served.)

Problem for Today: Delay Prediction Assume the standard multi-server queueing system with random arrivals, service times and patience times, given system history up to arrival time. Given that we will announce, immediately upon arrival, our best estimate of a single number w to each customer who has to wait before starting service, how should we predict w and how accurate is our prediction? Two Approaches: simplicity versus complexity.

Information to Exploit full model: distributions and parameters no information: steady-state E[Wait Wait > 0] delay history (major focus) queue length with and without customer abandonment queue length and elapsed service times queue length, customer classes and elapsed service times

Alternative Real-Time Delay Predictors Assume the standard multi-server queueing system with random arrivals, service times and patience times, given system history up to arrival time. Mean Steady-State Delay for the model (but model error?) Standard Simple Queue-Length-Based Delay Predictor QL s Delay Experienced by the Last to Enter Service (LES) Elapsed Delay of the Customer at the Head of Line (HOL) Refined Predictors calculate E[Wait Q(t) = n] or E[Wait entire history at t] fluid model refinements

The Standard Simple Queue-Length-Based Delay Predictor Servers Queue New arrival n customers in line upon arrival s = number of agents, and µ 1 = mean service time θ QLs (n) (n + 1) µ 1 s

The Head-of-Line (HOL) Delay Predictor Servers Queue New arrival at t HOL Elapsed delay at t is equal to w w = elapsed delay of HOL customer (similar to LES delay) θ HOL (w) w

Actual Random Delays After Prediction W HOL (w): distributed as the delay of a new arriving customer given that: (i) there is a customer at the HOL upon arrival (non-restrictive) (ii) elapsed delay of HOL customer is equal to w W Q (n): distributed as the delay of a new arriving customer given that: (i) the customer has to wait (ii) the customer finds n customers in line upon arrival For example, with HOL, we announce θ HOL (w) w. w is a single-number prediction of the random variable W HOL (w).

Quantifying The Accuracy of the Predictors Mean Squared Error (MSE) MSE(θ QLs (n)) = E[(W Q (n) θ QLs (n)) 2 ] E[MSE(θ QLs (Q w ))] = MSE(θ QLs (n))p[q w = n] n=0 Q w has the conditional distribution of the steady-state QL upon arrival given that the customer must wait.

How to Evaluate Predictors with Simulation Simulation Estimate of MSE: Average Squared Error (ASE) ASE 1 k k (p i d i ) 2 (k = sample size) i=1 p i = predicted delay for customer i (p i > 0) d i = actual delay (or potential delay with abandonments) Root Relative Average Squared Error (RASE) RASE ASE 1 k k i=1 d i

Different Cases (Models) to Consider Assume the standard multi-server queueing system with random arrivals, service times and patience times, given system history up to arrival time. A/M/s (stationary model without abandonment) abandonment: A/M/s + M non-exponential abandonment: A/M/s + GI time-varying arrivals: M t /M/s and M t /M/s + M

QL s in the GI/M/s Model (or A/M/s) n+1 W Q (n) = i=1 where V i i.i.d. exponential with mean (sµ) 1 n+1 E[W Q (n)] = E[V i ] = i=1 n+1 i=1 V i 1 sµ = n + 1 sµ θ QL s (n) n+1 MSE(θ QLs (n)) = Var[W Q (n)] = Var[V i ] = i=1 n+1 i=1 1 s 2 µ 2 = n + 1 s 2 µ 2 θ QLs (n) is an unbiased estimator. It minimizes the MSE!

Compare QL s to Steady-State Mean in the GI/M/s Model In steady state, 1 + Q W > 0 is geometric on {1, 2,...} with mean 1/(1 ρ) as in M/M/1 with service rate sµ and W W > 0 is exponential with E[W W > 0] = 1 sµ(1 ρ) and Var[W W > 0] = 1 s 2 µ 2 (1 ρ) 2 RSE(W W > 0) Var[W W > 0] E[W W > 0] = 1 In contrast, E[W Q (n)] = n + 1 sµ, Var[W Q(n)] = n + 1 s 2 µ 2 Var[WQ (n)] 1 RMSE(θ QLs (n)) = = 1 E[W Q (n)] n + 1 n

Compare QL s to Steady-State Mean in the GI/M/s Model E[MSE(θ QLs (Q w ))] = = = MSE(θ QLs (n))p[q w = n] n=0 Var(θ QLs (n))p(q w = n) = n=0 1 s 2 µ 2 (1 ρ) n=0 n + 1 s 2 µ 2 P(Qw = n) Hence, it is much better to use the information: Var(W W > 0) E[Var(W Q (Q ))] = 1/s2 µ 2 (1 ρ) 2 1/s 2 µ 2 (1 ρ) = 1 1 ρ.

HOL in the M/M/s Model W HOL (w) = A(w)+2 i=1 V i where V i i.i.d. exponential with mean (sµ) 1 E[W HOL (w)] = E A(w)+2 V i i=1 = E[A(w) + 2]E[V] w θ HOL (w) MSE(θ HOL (w)) depends on Var[A(w)].

Simulations for the GI/M/s Model: Poisson Arrivals In Tables: ASE s in units of 10 3 (RASE in %); ρ = λ/sµ; c 2 a = Var/mean 2. M/M/100 ρ QL s HOL HOL/QL s (c 2 a + 1)/ρ 0.98 5.03 (14%) 10.2 (20%) 2.03 2.04 0.95 2.04 (22%) 4.27 (32%) 2.09 2.11 0.93 1.44 (26%) 3.08 (39%) 2.14 2.15 0.90 0.994 (32%) 2.19 (47%) 2.20 2.22 Similar for other renewal arrival processes: ratio (c 2 a + 1)/ρ.

Simulations for the GI/M/s Model: Deterministic Arrivals In Tables: ASE s in units of 10 3 (RASE in %); ρ = λ/sµ; c 2 a = Var/mean 2. D/M/100 ρ QL s HOL HOL/QL s (c 2 a + 1)/ρ 0.98 2.48 (20%) 2.62 (21%) 1.06 1.02 0.95 1.01 (32%) 1.15 (34%) 1.14 1.05 0.93 0.725 (37%) 0.871 (41%) 1.20 1.08 0.90 0.519 (44%) 0.664 (50%) 1.28 1.11

Abandonments: Simulations for the M/M/s + M Model 4.5 4 HOL QL s s ASE in the M/M/s+M Model with ρ = 1.4 3.5 3 s ASE 2.5 2 1.5 1 0.5 0 100 200 300 400 500 600 700 800 900 1000 s

W Q (n) in the GI/M/s + M Model W Q (n): distributed as the potential delay of a new arriving customer given that: (i) the customer has to wait (ii) the customer finds n customers in line upon arrival W Q (n) = n i=0 X i where X i independent exponential with mean (sµ + iν) 1

Markovian QL Predictor (QL m ) The Markovian Queue-Length Predictor (QL m ) θ QLm (n) = n i=0 1 sµ + iν QL m in the GI/M/s + M Model θ QLm (n) = n 1/(sµ + iν) = E[W Q (n)] i=0 θ QLm (n) minimizes the MSE!

Refined QL Predictor for Same M/M/s + M Model 4.5 4 3.5 QL m HOL QL s s ASE in the M/M/s+M Model with ρ = 1.4 3 s ASE 2.5 2 1.5 1 0.5 0 100 200 300 400 500 600 700 800 900 1000 s

But when the Abandonment Distribution is Not Exponential 7 6 HOL QL m s ASE in the M/M/s+E 10 Model with ρ = 1.4 5 s ASE 4 3 2 1 0 100 200 300 400 500 600 700 800 900 1000 s

s ASE 2 1.8 1.6 1.4 1.2 1 0.8 QL a QL r QL m HOL QL s s ASE in the M/M/s+H 2 Model with ρ = 1.4 0.6 0.4 0.2 0 100 200 300 400 500 600 700 800 900 1000 s

7 s ASE in the M/E 10 /s+e 10 Model with ρ = 1.4 6 5 QL a QL r HOL QL m s ASE 4 3 2 1 0 100 200 300 400 500 600 700 800 900 1000 s

Time-Varying Arrival Rates arrivals per hour to a medium-sized financial-services call center

HOL Delay Prediction in the M t /M/100 Model Potential Delays and HOL Predictions in the M(t)/M/100 Model with α = 0.5 and E[S] = 5 minutes 40 35 Delays HOL 30 25 Delays 20 15 10 5 0 1400 1450 1500 1550 1600 1650 1700 1750 1800 1850 1900 time Arrival process: Nonhomogeneous Poisson with rate λ(t) sinusoidal λ(t) = λ + α λ sin(γt), ρ = λ/sµ = 0.95

HOL Delay Prediction With Constant Arrival Rate 0.8 0.7 Actual Delays and HOL Estimates in the M/M/100 Model with ρ = 0.95 Delays HOL 0.6 0.5 Delays 0.4 0.3 0.2 0.1 0 1400 1410 1420 1430 1440 1450 1460 1470 1480 1490 1500 time Arrival process: homogeneous Poisson with rate λ ρ = λ/sµ = 0.95

Problem: Time Lag in HOL Delay HOL delay was potential delay of arrival in the past. Use fluid model to create refined predictor. v(t) potential delay of new arrival in fluid model at time t w(t) HOL delay in the fluid model at time t θ HOLr (w, t) = v(t) w(t) w, where w is observed HOL delay at time t in actual system.

HOL r Delay Prediction in the M t /M/100 Model Actual Delays, HOL, and HOL r in the M t /M/100 Model with α = 0.5 and E[S] = 5 minutes 40 35 30 Delays HOL r HOL 25 Delays 20 15 10 5 0 1400 1450 1500 1550 1600 1650 1700 1750 1800 1850 1900 time Arrival process: Nonhomogeneous Poisson with rate λ(t) λ(t) = λ + α λ sin(γt), ρ = λ/sµ = 0.95

SUMMARY We have shown how to predict delays to make delay announcements. 1 The simple queue-length estimator θ QLs (n) is optimal for G/M/s, but HOL (and LES) is not too bad. 2 θ QLs (n) can perform poorly with abandonments, while HOL is robust. 3 For the G/M/s + M model, a new Markovian estimator is optimal. 4 But when the patience times are not exponential, it too can perform poorly. New refined estimators can do better. HOL remains robust. 5 With time-varying arrivals, even HOL can perform poorly, but fluid models can be used to refine the HOL estimator.

The End

References, all but first with Rouba Ibrahim Predicting Queueing Delays. Man. Sci. 45 (1999) 870 888. Real-Time Delay Estimation Based on Delay History. Manuf. Serv. Opns. Mgt. 11 (2009) 397 415. Real-Time Delay Estimation in Overloaded Multiserver Queues with Abandonment. Man. Sci. 55 (2009) 1729 1742. Real-Time Delay Estimation Based on Delay History in Many-Server Queues with Time-Varying Arrivals. Prod. Opns. Mgt. 20 (2011) 654-667. Wait-Time Predictors for Cust. Serv. Systems with Time-Varying Demand and Capacity. Oper. Res. 59 (2011) 1106 1118.

More References M. Armony, N. Shimkin and WW. The Impact of Delay Announcements in Many-Server Queues with Abandonment. Operations Res. 57 (2009) 66 81. (See Lecture 21a in Courseworks.) WW. Improving Service By Informing Customers About Anticipated Delays. Management Sci. 45 (1999) 192 207. R. J. Batt and C, Terwiesch. Waiting Patiently: An Empirical Study of Patient Abandonment in an Emergency Department Management Sci. 61 (2015) 39 59. G. Allon and A. Bassamboo. The Impact of Delaying the Delay Announcements. Oper. Res 59 (2011) 1198 1210.

Even More References A. Senderovich1, M. Weidlich, A. Gal1 and A. Mandelbaum1 Queue Mining Predicting Delays in Service Processes. In Advanced Information Systems Engineering, eds. M. Jarke et al., Proceedings of 26 th CAiSE Conference in Greece, 2014, eds. M. Jarke et al.