Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Similar documents
Single-Server Service-Station (G/G/1)

Erlang-C = M/M/N. agents. queue ACD. arrivals. busy ACD. queue. abandonment BACK FRONT. lost calls. arrivals. lost calls

Service Engineering January Laws of Congestion

Classical Queueing Models.

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

Contents Preface The Exponential Distribution and the Poisson Process Introduction to Renewal Theory

M/G/1 and M/G/1/K systems

P (L d k = n). P (L(t) = n),

Link Models for Circuit Switching

GI/M/1 and GI/M/m queuing systems

Designing a Telephone Call Center with Impatient Customers

Non Markovian Queues (contd.)

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

Computer Networks More general queuing systems

Data analysis and stochastic modeling

Dynamic Control of Parallel-Server Systems

Queueing Theory II. Summary. ! M/M/1 Output process. ! Networks of Queue! Method of Stages. ! General Distributions

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/6/16. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Abandonment and Customers Patience in Tele-Queues. The Palm/Erlang-A Model.

Introduction to Queuing Theory. Mathematical Modelling

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/25/17. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

M/G/1 and Priority Queueing

Figure 10.1: Recording when the event E occurs

Introduction to Queueing Theory with Applications to Air Transportation Systems

Queueing systems. Renato Lo Cigno. Simulation and Performance Evaluation Queueing systems - Renato Lo Cigno 1

Economy of Scale in Multiserver Service Systems: A Retrospective. Ward Whitt. IEOR Department. Columbia University

NEW FRONTIERS IN APPLIED PROBABILITY

On Priority Queues with Impatient Customers: Stationary and Time-Varying Analysis

IEOR 6711, HMWK 5, Professor Sigman

QUEUING SYSTEM. Yetunde Folajimi, PhD

Introduction to Markov Chains, Queuing Theory, and Network Performance

Simple queueing models

Exercises Stochastic Performance Modelling. Hamilton Institute, Summer 2010

Intro to Queueing Theory

ECE 3511: Communications Networks Theory and Analysis. Fall Quarter Instructor: Prof. A. Bruce McDonald. Lecture Topic

On Priority Queues with Impatient Customers: Stationary and Time-Varying Analysis

Lecture 7: Simulation of Markov Processes. Pasi Lassila Department of Communications and Networking

Stationary remaining service time conditional on queue length

LIMITS AND APPROXIMATIONS FOR THE M/G/1 LIFO WAITING-TIME DISTRIBUTION

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

A Robust Queueing Network Analyzer Based on Indices of Dispersion

PBW 654 Applied Statistics - I Urban Operations Research

Name of the Student:

Introduction to Queueing Theory

IEOR 6711: Stochastic Models I, Fall 2003, Professor Whitt. Solutions to Final Exam: Thursday, December 18.

TOWARDS BETTER MULTI-CLASS PARAMETRIC-DECOMPOSITION APPROXIMATIONS FOR OPEN QUEUEING NETWORKS

Networking = Plumbing. Queueing Analysis: I. Last Lecture. Lecture Outline. Jeremiah Deng. 29 July 2013

Queueing Systems: Lecture 3. Amedeo R. Odoni October 18, Announcements

Chapter 5: Special Types of Queuing Models

Solutions For Stochastic Process Final Exam

J. MEDHI STOCHASTIC MODELS IN QUEUEING THEORY

Queueing Theory. VK Room: M Last updated: October 17, 2013.

IEOR 8100: Topics in OR: Asymptotic Methods in Queueing Theory. Fall 2009, Professor Whitt. Class Lecture Notes: Wednesday, September 9.

Stochastic Models of Manufacturing Systems

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015

Lecture 20: Reversible Processes and Queues

LIMITS FOR QUEUES AS THE WAITING ROOM GROWS. Bell Communications Research AT&T Bell Laboratories Red Bank, NJ Murray Hill, NJ 07974

(b) What is the variance of the time until the second customer arrives, starting empty, assuming that we measure time in minutes?

Author's personal copy

Exact Simulation of the Stationary Distribution of M/G/c Queues

Other properties of M M 1

Stochastic process. X, a series of random variables indexed by t

Since D has an exponential distribution, E[D] = 0.09 years. Since {A(t) : t 0} is a Poisson process with rate λ = 10, 000, A(0.

Solutions to Homework Discrete Stochastic Processes MIT, Spring 2011

Introduction to Queueing Theory

Stochastic grey-box modeling of queueing systems: fitting birth-and-death processes to data

Buzen s algorithm. Cyclic network Extension of Jackson networks

Solution: The process is a compound Poisson Process with E[N (t)] = λt/p by Wald's equation.

Markov processes and queueing networks

Performance Evaluation of Queuing Systems

Chapter 6 Queueing Models. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

MAT SYS 5120 (Winter 2012) Assignment 5 (not to be submitted) There are 4 questions.

Chapter 2. Poisson Processes. Prof. Shun-Ren Yang Department of Computer Science, National Tsing Hua University, Taiwan

Q = (c) Assuming that Ricoh has been working continuously for 7 days, what is the probability that it will remain working at least 8 more days?

Kendall notation. PASTA theorem Basics of M/M/1 queue

15 Closed production networks

Simple and explicit bounds for multi-server queues with universal (and better) scaling

Part II: continuous time Markov chain (CTMC)

Queueing Theory and Simulation. Introduction

Class 9 Stochastic Markovian Service Station in Steady State - Part I: Classical Queueing Birth & Death Models

The Offered-Load Process: Modeling, Inference and Applications. Research Thesis

15 Closed production networks

Introduction to queuing theory

Queuing Theory. Richard Lockhart. Simon Fraser University. STAT 870 Summer 2011

ec1 e-companion to Liu and Whitt: Stabilizing Performance

Queuing Theory. Using the Math. Management Science

Markovian N-Server Queues (Birth & Death Models)

Chapter 1. Introduction. 1.1 Stochastic process

Chapter 2 Queueing Theory and Simulation

Systems Simulation Chapter 6: Queuing Models

BIRTH DEATH PROCESSES AND QUEUEING SYSTEMS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 22 12/09/2013. Skorokhod Mapping Theorem. Reflected Brownian Motion

THE HEAVY-TRAFFIC BOTTLENECK PHENOMENON IN OPEN QUEUEING NETWORKS. S. Suresh and W. Whitt AT&T Bell Laboratories Murray Hill, New Jersey 07974

On Approximations for the GI/GI/1 Queue and Generalized Jackson Open Queueing Networks Using Indices of Dispersion

Part I Stochastic variables and Markov chains

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing

CS418 Operating Systems

INDEX. production, see Applications, manufacturing

SOLUTIONS IEOR 3106: Second Midterm Exam, Chapters 5-6, November 8, 2012

Solutions to COMP9334 Week 8 Sample Problems

Transcription:

Service Engineering Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis. G/G/1 Queue: Virtual Waiting Time (Unfinished Work). GI/GI/1: Lindley s Equations and Stability. M/GI/1 (=M/G/1): The Khintchine-Pollaczek Formula. G/G/1 and G/G/n: Allen-Cunneen Approximation; Kingman s Exponential Law. Call Centers: The M/G/n+G queue. Queueing Systems with Priorities (Recitation). 1

The G/G/1 Queue Non-Parametric model of a service station. Exact model but Approximate analysis (vs. Markovian queues). We start with Single-Server models. (Will be generalized to Multi-Servers.) Building Blocks: Arrivals: Counting Process A = {A(t), t 0}, with arrivals (jumps) at {A 1, A 2,..., A i,...}. Services: {S 1, S 2,..., S i,...} denote service durations. First Come First Served (FCFS = FIFO here). Work arriving up to time t: A(t) i=1 S i, t 0. Virtual Waiting Time V = {V (t), t 0}: V(t) is the amount of time that a (possibly virtual) arrival at time t would have to wait. (Sometimes referred to as Unfinished Work, but Virtual Waiting Time is more appropriate for multi-server queues.) It is possible to create the sample paths of V from those of Work. We prefer a direct visual (seesaw) construction: Assume V (0) = 0,... 2

GI/GI/1 Number in system is NOT a Markov process (in contrast to Markovian queues). For some analysis need some minimal Assumptions: Arrival times A 1, A 2,..., A n,... are jumps of a renewal process: Inter-arrival times T i = A i A i 1, i 1, are iid (A 0 = 0). E[T 1 ] = 1/λ; C 2 (T 1 ) = C 2 a. Note: λ = Arrival rate. Service durations S 1, S 2,..., S n,... are iid. E[S 1 ] = 1/µ; C 2 (S 1 ) = C 2 s. Note: µ = Service rate. Independence between arrivals and services. Service discipline is First Come First Served. 3

G/G/1: Lindley s Equations (1952) Let W q (n) = Waiting Time of customer number n, n=1,2,... Recursion: W q (n) = In short, 0, A n A n 1 + W q (n 1) + S n 1 A n, departure time of customer (n 1) {}}{ A n 1 + W q (n 1) + S n 1 otherwise. W q (n) = [A n 1 A n + W q (n 1) + S n 1 ] + ( x + = x 0) = [W q (n 1) + X n 1 ] +, where X n 1 = S n 1 T n, n 2 (iid in GI/GI/1). Note: X 1, X 2,... are data, enabling calculation of successive waiting times via Lindley s Equations: W q (n) = [W q (n 1) + X n 1 ] +, n 2, W q (1) = 0. Useful: Recursion amenable for spreadsheet calculations. 4

GI/GI/1 Queue: Stability Explicit representation of W q (n): W q (n) = max(0, X n 1, X n 1 +X n 2,..., X n 1 +X n 2 +...+X 1 ). (Try unfolding the recursion with, say, n = 3.) We have GI/GI/1, in which X i s are iid. Hence, W q (n) d = max(0, X 1, X 1 + X 2,..., X 1 + X 2 +... + X n 1 ). Define the Random Walk Y k = k i=1 X i, Y 0 = 0. Fact: P{sup k 0 Y k < } = 1 E[X 1 ] < 0. Proof: By the Strong Law of Large Numbers, lim k 1 k (X 1 +... X k ) = E[X 1 ] < 0. Hence, lim k Y k =, and their sup = max is finite. Consequence: d W q (n) sup k 0 Y k < as n if and only if λ < µ (ρ = λ/µ < 1). Conclude: GI/GI/1 stable if and only if ρ < 1. From now on, all models are in steady-state. 5

M/GI/1 (=M/G/1) in Steady-State The Khintchine-Pollaczek Formula M/G/1 Queue: Poisson arrivals, generally distributed (iid) service durations. Theorem. (Khintchine-Pollaczek) ρ E(W q ) = E(S) 1 ρ 1 + C 2 (S). 2 Remarks: A remarkable second-moment formula quantifying congestion. Congestion Index = E(W q )/E(S) (unitless). Decomposes Congestion into two multiplicative components (the two congestion-drivers, in our simple M/G/1 context): Server-Utilization: ρ ; Stochastic-Variability, arising from Services: C(S) ; ( Where are the Arrivals? - to be discussed momentarily). Quantifies the effect of the service-time distribution (via its CV); for example, changing from a human-service to a robot. The Number-in-System is not Markov; however at instants of service completions it is an (embedded) Markov-chain. Illuminating derivation, with the ingredients: Little, PASTA, Biased sampling; Wald. 6

Derivation of Khintchine-Pollaczek For customer n = 1, 2,..., denote W q (n) = waiting-time of n-th customer. R(n) = residual service time, at time of the n-th arrival; ( = 0, for arrivals without waiting). L q (n) = # of customers in queue, at time of n-th arrival. {S n } = sequence of service-times. W q (n) = R(n) + n 1 k=n L q (n) S k, n 1. EW q (n) = ER(n) + E(S 1 ) EL q (n), E(W q ) = E(R) + E(S 1 )E(L q ), by Wald, n, assuming limit + PASTA, = E(R) + λe(s 1 )E(W q ), by Little, E(W q ) = E(R) + ρe(w q ), E(W q ) = E(R)/(1 ρ). Left to calculate E[R]? Via Biased Sampling (see next page): ρ < 1 steady-state, - ρ = Prob. of arriving to a busy server. (PASTA+Little) - E(R) = (1 ρ) 0 + ρ E(S) 1 + C 2 (S). q.e.d. 2 7

Biased Sampling (via PASTA) A renewal process is a counting process with iid interarrivals. Descriptions: R = {R(t), t 0} or {T 1, T 2,...} iid, or {S 1, S 2,...} Example: Poisson exponential Erlang Story: Buses arrive to a bus stop according to a renewal process R b = {R b (t), t 0}. Ti b times between arrivals of the buses. Passengers arrive to the bus stop in a completely random fashion (Poisson). S p i arrival times of the passengers. Question: How long, on average, do they wait? Plan service-level. X(t) b b b b 1 2 3 4 T T T T t iid A = {A(t), t 0} = Poisson arrivals of passengers. X = {X(t), t 0} = state = Virtual waiting time. 1 T 1 N PASTA: lim X(t)dt = lim X(S p T T 0 N N n ) = τ n=1 τ = 1 (area under X, over [0, T ]) T 1 ( 1 T 2 (T 1 b ) 2 + 1 2 (T 2 b ) 2 + + 1 ) 2 (T R b b (T )) 2 = R b(t ) 1 T 2 T 1 2 + + TR 2 b (T ) R b (T ) 1 = 2 E(T 1 b ) }{{} Deterministic answer Check Poisson bus arrivals to derive Paradox: 1( stochastic answer) = 1 2 1 T E(T1 b ) 1 2 E(T 1 b ) 2, by SLLN [1 + c 2 (T1 b )], c = σ }{{} E Bias, due to variability ( deterministic answer). coefficient of variation. 8

GI/GI/1 The Allen-Cunneen Approximation Assume General Arrivals (renewal) and General Services (iid): ρ E(W q ) E(S) 1 ρ C 2 (A) + C 2 (S). 2 Mean Service Time Facts: Exact for M/G/1. Upper bound in general. Utilization Availability Stochastic Variability Asymptotically exact as ρ 1 - in Heavy Traffic. (But then can actually say much more - momentarily). Internalize: Assume C 2 (A) = C 2 (S) = 1, as in M/M/1: E(W q ) E(S) = ρ 1 ρ. Now substitute ρ = 0.5 (1), 0.8 (4), 0.9 (10), 0.95 (19). Finally think in terms of 5 minute telephone service-time (or 1 week job-shop processing-time ). Other Measures of (Average) Performance: E(W ) = E(S) + E(W q ), E(L q ) = λe(w q ), E(L) = λe(w ) = E(L q ) + ρ. 9

GI/GI/1 Kingman s Exponential Law Fact (Kingman, 1961): In heavy-traffic, Waiting-Time is Exponential. Get its mean from the Allen-Cunneen approximation. Formally: Kingman s Exponential Law of Congestion: W q E(S) exp mean = 1 1 ρ C 2 (A) + C 2 (S), wp ρ, 2 0, wp 1 ρ, Remarks: Congestion Index = E(W q )/E(S) (unitless): The Allen-Cunneen Approximation. Decomposes Congestion into two multiplicative components (the two congestion-drivers, in our simple G/G/1 context): Server-Utilization: ρ ; Stochastic-Variability, which arises from Arrivals - C(A) and Services - C(S). Both ρ and C(S) effect congestion non-linearly draw congestion curves. M/M/1 Special case in which C 2 (A) = C 2 (S) = 1 : Exact. M/G/1 Only E(W q ) is Exact. 10

Justifying the Law of Congestion: Why W q exp ( mean = 1 1 C 2 a+c 2 s 2 )? via Strong Approximations. (Heavy Traffic Theory) S(t) = t n=1 S n 1 µ t + σ sb s (t) (Donsker for partial sums) A(t) λt + λ 3/2 σ a B a (t) = λt + λ 1/2 C a B a (t) (for renewals) L(t) = S[A(t)] 1 [ λt + λ 3/2 σ a B a (t) ] + σ s B s (λt) (B fluctuations) µ = λ µ t + λ1/2 µ C ab a (t) + λ1/2 µ C sλ 1/2 B s (λt) X(t) = L(t) t (1 ρ)t + λ1/2 µ d = (1 ρ)t + λ1/2 [ ] 1 C a B a (t) + C s λ B s (λt) µ (C2 a + Cs 2 ) 1/2 B(t) sum of independent BM = d BM ; selfsimilarity (both by characterization) = (1 ρ)t + σb(t), where σ 2 = 1 µ ρ(c2 a + C 2 s ) Recall: V obtained from X through reflection, and reflection is Lipshitz continuous. V RBM( (1 ρ), σ) with stationary distribution exp ( mean = ρ 1 ρ ) Ca 2+C2 s Hence, V ( ) d exp ( mean = 1 µ 2 v. significant Generalized P K, for EW q ) σ2 2(1 ρ). Approximation improves as ρ 1 (heavy traffic) EW 1 µ 1 + ρ 1 ρ }{{} utilization cost of congestion 0 Ca 2 + Cs 2 }{{ 2 } stoch. variability strictly convex, increasing in ρ, C a, C s. 12

Approximating G/G/n Stability condition: ρ = λ nµ < 1. Kingman s Exponential Law: W q E(S) exp ( mean = 1 n 1 1 ρ C 2 (A)+C 2 ) (S) 2, wp E 2,n, 0, otherwise. In particular, a popular measure for service-level, used to determine the number-of-servers n, is: 2n(1 ρ) P {W q > x E(S)} E 2,n exp x C 2 (A) + C 2, x > 0. (S) Allen-Cunneen Approximation: E(W q ) E(S) 1 n E2,n 1 ρ C 2 (A) + C 2 (S). 2 or equivalently, E(W q ) E(W q,m/m/n ) C 2 (A) + C 2 (S). 2 - Above accurate in Efficiency-Driven (ED) systems. Rules-of-thumb ED-Characterization: In small systems (few servers), over 75% of the customers are delayed in queue prior to service; in large systems (many 10 s or several 100 s of servers), essentially all customers delayed - more on that in future classes. 11

Performance vs. Availability \ Accessibility 400 350 300 250 200 150 100 50 0 0 0.2 0.4 0.6 0.8 1 Occupancy Average waiting time, sec Queueing Science: Measurements Model Validation 10

M/G/n+G: The Basic Call Center Model Why fundamental? since, in call centers, and elsewhere, Arrivals reasonably-approximated by Poisson, Services typically not Exponential, (Im)Patience typically not Exponential. From M/G/n+G to M/M/n+M (Erlang-A): 1. M/M/n+G: Assume Exponential service times with the same mean (Whitt, 2005, via simulations); 2. M/M/n+M: Assume Exponential (im)patience times; 3. Estimate the patience-parameter θ via P{Ab}/E[W q ] (with Zeltyn, 2005). Possible inaccuracies in the exponential approximation for service times, when Very large or very small C(S); Very patient customers (very small θ). 12

Theoretical Congestion Curves: Staffing Tools (4CallCenters) Economies of Scale Average Waiting Time But Only of Those Who Wait E[W q W q > 0] (Load: 10 per server) 200 No. of Servers = 4 180 5 160 6 140 Waiting Time In Seconds 120 100 80 7 8 9 10 60 11 12 40 13 14 15 16 20 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 Arrivals Rate 12