Stochastic Networks and Parameter Uncertainty

Similar documents
We study a capacity sizing problem in a service system that is modeled as a single-class queue with multiple

Managing Call Centers with Many Strategic Agents

We consider a call center model with m input flows and r pools of agents; the m-vector of instantaneous

Dynamic Control of Parallel-Server Systems

MODELING WEBCHAT SERVICE CENTER WITH MANY LPS SERVERS

Dynamic Control of a Tandem Queueing System with Abandonments

Staffing Call-Centers With Uncertain Demand Forecasts: A Chance-Constrained Optimization Approach

Data analysis and stochastic modeling

STAFFING A CALL CENTER WITH UNCERTAIN ARRIVAL RATE AND ABSENTEEISM

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Figure 10.1: Recording when the event E occurs

Performance Evaluation of Queuing Systems

ec1 e-companion to Liu and Whitt: Stabilizing Performance

Service System Planning in the Presence of a Random Arrival Rate

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015

Designing load balancing and admission control policies: lessons from NDS regime

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates

Little s result. T = average sojourn time (time spent) in the system N = average number of customers in the system. Little s result says that

BIRTH DEATH PROCESSES AND QUEUEING SYSTEMS

Erlang-C = M/M/N. agents. queue ACD. arrivals. busy ACD. queue. abandonment BACK FRONT. lost calls. arrivals. lost calls

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing

Proactive Care with Degrading Class Types

IEOR 6711: Stochastic Models I, Fall 2003, Professor Whitt. Solutions to Final Exam: Thursday, December 18.

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers

Robust and Data-Driven Approaches to Call Centers

Introduction to queuing theory

Chapter 2. Poisson Processes. Prof. Shun-Ren Yang Department of Computer Science, National Tsing Hua University, Taiwan

Since D has an exponential distribution, E[D] = 0.09 years. Since {A(t) : t 0} is a Poisson process with rate λ = 10, 000, A(0.

Queuing Networks: Burke s Theorem, Kleinrock s Approximation, and Jackson s Theorem. Wade Trappe

Point Process Control

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

Design and evaluation of overloaded service systems with skill based routing, under FCFS policies

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/6/16. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

We consider the problem of staffing call centers with multiple customer classes and agent types operating

Lecture 20: Reversible Processes and Queues

Continuous Time Processes

Overflow Networks: Approximations and Implications to Call-Center Outsourcing

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

Queuing Analysis. Chapter Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall

Queueing Theory II. Summary. ! M/M/1 Output process. ! Networks of Queue! Method of Stages. ! General Distributions

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL

Copyright by Jing Zan 2012

Designing a Telephone Call Center with Impatient Customers

USING DIFFERENT RESPONSE-TIME REQUIREMENTS TO SMOOTH TIME-VARYING DEMAND FOR SERVICE

Introduction to Queueing Theory with Applications to Air Transportation Systems

Cover Page. The handle holds various files of this Leiden University dissertation

Near-Optimal Control of Queueing Systems via Approximate One-Step Policy Improvement

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

Efficient Monitoring Algorithm for Fast News Alert

CS418 Operating Systems

Introduction to Markov Chains, Queuing Theory, and Network Performance

Markov Processes and Queues

Peak Load Forecasting

Quantifying Stochastic Model Errors via Robust Optimization

Name of the Student:

COMP9334: Capacity Planning of Computer Systems and Networks

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/25/17. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Chapter 6 Queueing Models. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Networking = Plumbing. Queueing Analysis: I. Last Lecture. Lecture Outline. Jeremiah Deng. 29 July 2013

The Offered-Load Process: Modeling, Inference and Applications. Research Thesis

Advanced Computer Networks Lecture 3. Models of Queuing

Queueing Systems: Lecture 3. Amedeo R. Odoni October 18, Announcements

Systems Simulation Chapter 6: Queuing Models

Slides 9: Queuing Models

State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems. J. G. Dai. Tolga Tezcan

Classical Queueing Models.

Contents Preface The Exponential Distribution and the Poisson Process Introduction to Renewal Theory

Delay Announcements. Predicting Queueing Delays for Delay Announcements. IEOR 4615, Service Engineering, Professor Whitt. Lecture 21, April 21, 2015

Section 3-3 Measures of Variation

A Little Flexibility is All You Need: Optimality of Tailored Chaining and Pairing*

We consider how two networked large-scale service systems that normally operate separately, such as call

λ λ λ In-class problems

Retrial queue for cloud systems with separated processing and storage units

STOCHASTIC MODELS FOR RELIABILITY, AVAILABILITY, AND MAINTAINABILITY

Abandonment and Customers Patience in Tele-Queues. The Palm/Erlang-A Model.

Q = (c) Assuming that Ricoh has been working continuously for 7 days, what is the probability that it will remain working at least 8 more days?

IS 709/809: Computational Methods in IS Research Fall Exam Review

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

COMP9334 Capacity Planning for Computer Systems and Networks

A Semi-parametric Bayesian Framework for Performance Analysis of Call Centers

Author's personal copy

SOLUTIONS IEOR 3106: Second Midterm Exam, Chapters 5-6, November 8, 2012

Queuing Theory. Using the Math. Management Science

Stein s Method for Steady-State Approximations: Error Bounds and Engineering Solutions

A Queueing System with Queue Length Dependent Service Times, with Applications to Cell Discarding in ATM Networks

Non Markovian Queues (contd.)

QUEUING MODELS AND MARKOV PROCESSES

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

Introduction to Queueing Theory

This lecture is expanded from:

What You Should Know About Queueing Models To Set Staffing Requirements in Service Systems

On the Partitioning of Servers in Queueing Systems during Rush Hour

IEOR 3106: Second Midterm Exam, Chapters 5-6, November 7, 2013

ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL

ESTIMATING WAITING TIMES WITH THE TIME-VARYING LITTLE S LAW

BRAVO for QED Queues

Asymptotic Coupling of an SPDE, with Applications to Many-Server Queues

Stochastic Simulation Discrete simulation/event-by-event

Transcription:

Stochastic Networks and Parameter Uncertainty Assaf Zeevi Graduate School of Business Columbia University Stochastic Processing Networks Conference, August 2009 based on joint work with Mike Harrison Achal Bassamboo and Ramandeep Randhawa

Motivation for this talk Much of the work on stochastic processing networks: assumes model structure is known a priori, is accurate and stationary parameters describing system and environment known and static no model misspecification errors 2

Motivation for this talk Much of the work on stochastic processing networks: assumes model structure is known a priori, is accurate and stationary parameters describing system and environment known and static no model misspecification errors In practice: model structure may be only partially known model primitives need to be inferred from historical data in on-line manner model may be misspecified... both system model and environment environment may be changing over time 2-a

Example 1: Design of global delivery centers Arrival rate patterns in medium sized service center 120 100 Regression (non-parametric) 80 60 40 Confidence intervals 20 0 8 10 12 14 16 18 20 Hour (9:00 20:00) how to deal with forecasting errors? 3

Example 2: Price engineering 0 0.5 1 optimal price ˆθ2 1.5 2 2.5 3 0 50 100 150 200 250 300 350 5.5 5 4.5 4 time (days) 3.5 0 50 100 150 200 250 300 350 time (days) 4

Example 2: Price engineering 0 0.5 1 optimal price ˆθ2 1.5 2 2.5 3 0 50 100 150 200 250 300 350 5.5 5 4.5 4 time (days) 3.5 0 50 100 150 200 250 300 350 time (days) how to deal with changing environment? 6

Example 3: Cloud computing 1000 900 800 700 600 number of requests (per minute)1100 500 0 1 2 3 4 5 6 7 8 9 10 Time (hours) 7

What s in this talk Impact of parameter uncertainty on: static capacity / processing rate decisions revisiting the square-root logic... model specification and calibration estimation and testing dynamic control and resource allocation revisiting the static planning problem... 8

Parameter uncertainty and capacity planning Mean call arrivals 8 10AM in medium sized call center Day of Week Mean no. of arriving calls CV [empirical] (%) CV [Poisson] (%) Mon 943 26.5 3.3 Tue 824 22.3 3.5 Wed 807 26.5 3.5 Thu 778 28.5 3.6 Fri 767 33.5 3.6 Sat 293 61.8 5.8 Sun 139 148.1 8.5 CV [empirical] = coefficient of variation (in %) CV [Poisson] = calculated assuming arrival process Poisson 9

Parallel server network arriving calls: rates Λ 1 (t), Λ 2 (t), Λ 3 (t) abandonments 1 2 3 waiting calls routing control X 1 2 server pools staffing: b 1, b 2 completed services arrival process = doubly stochastic with rate Λ 1 (t) 10

System dynamics N i (t) = Arrivals Completed Services Abanbonments N : headcount process N i (t) = # of class i customers present at time t Q : queue length process Q i (t) = # of class i customers not being served at time t X : dynamic control [ (RX) i = rate of service in class i ] X j (t) = # of servers allocated to activity j b : staffing vector (X, N, Q) satisfy AX(t) b, Q(t) = N(t) BX(t) 0, N(t) 0, X(t) 0 11

System dynamics N i (t) = Arrivals rate: Λ i (t) Completed Services rate: (RX) i (t) Abanbonments rate: γ i Q i (t) N : headcount process N i (t) = # of class i customers present at time t Q : queue length process Q i (t) = # of class i customers not being served at time t X : dynamic control [ (RX) i = rate of service in class i ] X j (t) = # of servers allocated to activity j b : staffing vector (X, N, Q) satisfy AX(t) b, Q(t) = N(t) BX(t) 0, N(t) 0, X(t) 0 11-a

Design and control objectives capacity costs minimize: c b + p E [ T γq(s)ds ] 0 E [# of abandonments across classes] s.t. admissible routing control X over [0, T] b = r-dim l vector of staffing levels in agent pools c = personnel cost vector p = penalty cost vector Q(t) = vector of queuelengths at time t in class i [depends on routing...] γ = abandonment rate vector T = planning horizon over which staffing is held fixed 12

Design and control objectives capacity costs minimize: c b + p E [ T γq(s)ds ] 0 E [# of abandonments across classes] s.t. admissible routing control X over [0, T] b = r-dim l vector of staffing levels in agent pools c = personnel cost vector p = penalty cost vector Q(t) = vector of queuelengths at time t in class i [depends on routing...] γ = abandonment rate vector T = planning horizon over which staffing is held fixed Decision variables : capacity vector b and control X 12-a

A simple single-class / single-pool example Solve the simplest problem you don t know the answer to. Mike Harrison 13

A simple single-class / single-pool example Solve the simplest problem you don t know the answer to. Mike Harrison arrival process doubly stochastic w/ rate Λ(t) exponential services w/ rate µ exponential reneging w/ rate γ b statistically identical servers 13-a

A simple single-class / single-pool example Solve the simplest problem you don t know the answer to. Mike Harrison arrival process doubly stochastic w/ rate Λ(t) exponential services w/ rate µ exponential reneging w/ rate γ b statistically identical servers objective: minimize Π(b) := c b + p E [ T 0 γq(s)ds ] b = optimal capacity choice 13-b

Mike s key observation if Λ µ, γ and of reasonable magnitude e.g., 100 s of calls/hour, processing/reneging order of minutes then expect accurate fluid approximation short relaxation times 14

Mike s key observation if Λ µ, γ and of reasonable magnitude e.g., 100 s of calls/hour, processing/reneging order of minutes then expect accurate fluid approximation short relaxation times γq(t) (Λ(t) bµ) + pointwise stationary fluid model (PSFM) 14-a

Mike s key observation if Λ µ, γ and of reasonable magnitude e.g., 100 s of calls/hour, processing/reneging order of minutes then expect accurate fluid approximation short relaxation times γq(t) (Λ(t) bµ) + pointwise stationary fluid model (PSFM) More generally: in each class i = 1,..., m, and all time t ( ) γ i Q i (t) Λ i (t) (RX) i (t) abandonment rate arrival rate processing rate 14-b

Picture proof 200 180 160 simulation asymptotic path 140 Headcount 120 100 80 60 40 20 0 0 50 100 150 200 250 300 350 400 450 500 Time (minutes) for real proof see Bassamboo-Harrison-Z (06a,06b) 15

Consequence original objective fn: Π(b) = c b + p E [ T 0 γq(s)ds ] optimal solution b 16

Consequence original objective fn: Π(b) = c b + p E [ T 0 γq(s)ds ] optimal solution b approximate objective fn: Π(b) = c b + p E [ T (Λ(s) bµ) + ds ] 0 approximate solution b 16-a

Relation to traditional models if Λ = λ (deterministic case) then optimal solution takes form b = λ µ + β λ µ Erlang s square root rule... 17

Relation to traditional models if Λ = λ (deterministic case) then optimal solution takes form b = λ µ + β λ µ Erlang s square root rule... Arrival rate Optimal solution Prescription Difference λ b Π b Π( b) b b Π( b) Π 37.5 40 14.75 37 15.02 3 0.27 75 79 28.17 75 28.45 4 0.28 300 307 106.32 300 106.91 7 0.59 17-a

In pictures... 3.7 3.2 3.6 3.1 3 3.5 2.9 Total daily cost ($10K) 3.4 3.3 3.2 3.1 upper 95% confidence interval simulation fluid approximation lower 95% confidence interval Total daily cost ($10K) 2.8 2.7 2.6 2.5 2.4 fluid approximation simulation 3 90 95 100 105 110 115 120 125 130 135 140 Number of servers (b) Stochastic arrival rate 90 95 100 105 110 115 Number of servers (b) Deterministic arrival rate 18

Relation to traditional models (cont d) arrival rate Λ time homogenous random drawn from distribution F 19

Relation to traditional models (cont d) arrival rate Λ time homogenous random drawn from distribution F objective: minimize Π(b) := c b + p E[N b] + N = number of customers in system in steady-state 19-a

Relation to traditional models (cont d) arrival rate Λ time homogenous random drawn from distribution F objective: minimize Π(b) := c b + p E[N b] + N = number of customers in system in steady-state PSFM approximation: minimize Π(b) := c b + p E[Λ bµ] + simple newsvendor problem with fractile solution b = 1 ( ) µ F c 1 pµ 19-b

Accuracy of the newsvendor-based logic arrival rates constant and random (CV = 19.2%) Arrival rate Optimal solution Prescription Difference distribution b Π b Π( b) b b Π( b) Π U[25,50] 42 16.21 41 16.23 1 0.02 U[50,100] 83 31.47 83 31.47 0 0 U[200,400] 332 122.89 333 122.89 1 0 20

Why is the prescription so accurate under uncertainty? Consider simple case where µ = γ [ just for purposes of intuition ] infinite server queue use normal approximation to Poisson... Π(b) = c b + p E [ T 0 γq(s)ds ] 21

Why is the prescription so accurate under uncertainty? Consider simple case where µ = γ [ just for purposes of intuition ] infinite server queue use normal approximation to Poisson... Π(b) = c b + p E [ T 0 γq(s)ds ] c b + pµ E[Λ/µ b] + + K E [ )] Λ/µ (Λ/µ b)2 exp ( 2(Λ/µ) 21-a

Why is the prescription so accurate under uncertainty? Consider simple case where µ = γ [ just for purposes of intuition ] infinite server queue use normal approximation to Poisson... Π(b) = c b + p E [ T 0 γq(s)ds ] c b + pµ E[Λ/µ b] + + K E [ )] Λ/µ (Λ/µ b)2 exp ( 2(Λ/µ) = approximate objective fn + approximation error 21-b

Why is the prescription so accurate under uncertainty? Consider simple case where µ = γ [ just for purposes of intuition ] infinite server queue use normal approximation to Poisson... Π(b) = c b + p E [ T 0 γq(s)ds ] c b + pµ E[Λ/µ b] + + K E [ )] Λ/µ (Λ/µ b)2 exp ( 2(Λ/µ) = approximate objective fn + approximation error This suggests that performance gap is bounded... performance gap = Π( b) Π is independent of scale of system... 21-c

Rigorous foundations Put EΛ = n and CV n = coefficient of variation Thm. [ Bassamboo-Randhawa-Z (09) ] Uncertainty-driven regime: if CV n 1/ n, then Π( b n ) = Π + O(1/CV n ). Variability-driven regime: if CV n 1/ n, then Π( b n ) = Π + O( n). 22

Rigorous foundations Put EΛ = n and CV n = coefficient of variation Thm. [ Bassamboo-Randhawa-Z (09) ] Uncertainty-driven regime: if CV n 1/ n, then Π( b n ) = Π + O(1/CV n ). Variability-driven regime: if CV n 1/ n, then Π( b n ) = Π + O( n). Cor 1. If CV bounded away from 0 then prescription is O(1)-optimal 22-a

Rigorous foundations Put EΛ = n and CV n = coefficient of variation Thm. [ Bassamboo-Randhawa-Z (09) ] Uncertainty-driven regime: if CV n 1/ n, then Π( b n ) = Π + O(1/CV n ). Variability-driven regime: if CV n 1/ n, then Π( b n ) = Π + O( n). Cor 1. If CV bounded away from 0 then prescription is O(1)-optimal Cor 2. Performance of b n is not sensitive to O( n) perturbations 22-b

Inference and model calibration problem: previous slides assume distribution of arrival rate is known... 23

Inference and model calibration problem: previous slides assume distribution of arrival rate is known... possible approach: [ Bassamboo- Z (2009) ] estimate arrival rate distribution F n [ n = sample size ] form empirical (approximate) objective fn Πn ( ) compute b n [ estimator of b ] evaluate performance of estimator EΠ( b n ) 23-a

Inference and model calibration problem: previous slides assume distribution of arrival rate is known... possible approach: [ Bassamboo- Z (2009) ] estimate arrival rate distribution F n [ n = sample size ] form empirical (approximate) objective fn Πn ( ) compute b n [ estimator of b ] evaluate performance of estimator EΠ( b n ) key ideas in analysis: need Π n ( ) to be amenable to M-estimation theory... e.g., Lipschitz fn guarantees finite bracketing entropy use Talagrand s bounds to establish 1/ n accuracy EΠ( b n ) Π = C/ n + approximation error interaction between approximation bound and estimation bound... 23-b

Picture proof... 5.5 x 10 5 EΠ( bn) Π 4.5 4 3.5 3 C 1 + C 2 / n 2.5 2 50 100 150 200 250 300 350 400 Data Size n 24

Takeaway messages Parameter uncertainty: creates insensitivity in the objective fn makes it easier to achieve near optimal performance simple capacity planning fluid problem very simple control rules estimate/calibrate model off line estimation [ capacity planning ] real-time estimation [ control ] 25