Data-Based Resource-View of Service Networks: Performance Analysis, Delay Prediction and Asymptotics. PhD Proposal. Nitzan Carmeli.

Size: px
Start display at page:

Download "Data-Based Resource-View of Service Networks: Performance Analysis, Delay Prediction and Asymptotics. PhD Proposal. Nitzan Carmeli."

Transcription

1 Data-Based Resource-View of Service Networks: Performance Analysis, Delay Prediction and Asymptotics PhD Proposal Nitzan Carmeli Advisers: Prof. Avishai Mandelbaum Dr. Galit Yom-Tov Technion - Israel Institute of Technology February 6, 217

2 Contents 1 Introduction 3 2 Literature Review Wait Time and Sojourn Time Predictors Steady State Approximations of Waiting Times in Queueing Networks Heavy-Traffic Approximations Real-Time Estimators Delay Announcements Activity networks Wait Time Estimators for Two Queues in Tandem Wait Time Distribution Given Number of Customers in Both Queues: Exact Analysis Sojourn Time Wait Times Many-Server Heavy Traffic Approximations, Time Varying Fluid Approximation Diffusion Approximation Resource-Driven Activity Networks Fluid RDAN: The Static Model Theoretical Example Leading to a Limit with Two Time-Scales Two Coupled Membership Stations Data-Based Creation of Resource-Driven Activity Networks Data-Based Example of Two-Scale Model Research Goals 47 A Data-Based Creation of RDAN 49 1

3 List of Figures 1.1 Two snapshots of customer and server flows in a call center. The left graph shows the customer flow, while the right one shows the server flow. Animation: Customer / Server Networks A snapshots of a resource-driven activity network of a call center, including both customers and servers. Animation: RDAN animation Flow chart of an ED process Sojourn time at the nurse admission station, April 214 to May 215, All days Sojourn time at the doctor station, April 214 to May 215, All days Transition diagram, two M/M/1 queues in tandem Simulation results, CDF of sojourn time in the system Simulation results, CDF of wait time at the second queue. (Simulation (blue), estimation given T 1 < T 2 (red).) The waiting time diffusion term, taken from Mandelbaum et al. [26] Schematic description of the η membership model X η transition rate diagram Modified transition diagram SBR, routing of inbound calls to top-level agent groups. The ellipses represent different types of queues; the rectangles represent different types of top-level agent groups Occupancy profile of an exam room in an outpatient hospital Occupancy profile of an infusion chair in an outpatient hospital

4 Chapter 1 Introduction The proposed research focuses on developing a theory for analyzing queueing networks. Our primary motivation comes from healthcare systems, such as Emergency Departments (EDs); these are complex networks that involve multiple resource types (physicians, nurses, etc.), multiple patient types (differentiated by severity and medical specialty), highly variable and long processes, and a time-varying environment [4]. Such characteristics often result in patients experiencing long waiting times. Patients in emergency departments, clinics and other healthcare facilities often feel confused, stressed and helpless, as a result of the ambiguity of their service processes and waiting durations. This stress inflicts high pressure on personnel as well. Initial research [2, 3, 33] suggests that providing real-time information for patients on process development and durations might improve patient satisfaction and reduce stress. Furthermore, it was shown in [12] that providing delay information influences customer choices, which in turn impact the system performance. In particular, they showed that there is a substantial growth in Internet queries regarding ED wait times, and that patients take this information into consideration when choosing the ED they will turn to. This results in a pooling effect; a balance between the ED loads in different hospitals. Hence, if delay estimations are accurate the average waiting times in all hospitals (in a certain area) should decrease. We aim to develop the required theory and tools in order to provide such information by predicting delays in a complex queueing network. Therefore, our first goal is to develop wait-time predictors which will be based on individual data as well as the system state, all of which will be updated in real-time. This is a part of a multidisciplinary research which aims to test the effects of providing real-time, personalized information to patients on the behavior and feelings of patients, and on the operation efficiency of a hospital Emergency Department (ED). The research includes design of a platform for information transparency regarding medical processes, anticipated waiting time, duration and completion of tests and treatment, which will be implemented in a large Israeli hospital. It is therefore of necessary to develop wait time and sojourn time estimators which will account for the complexity in healthcare systems. Current research probes prediction methods that are either based on classical queuing theory [19 21, 28, 31], process mining techniques [13, 37], statistical learning methods or combinations of the following [3]. We shall focus on queuing theory predictors and how ought to expand these predictors to queueing networks (rather than a single station) with different, and sometimes unknown, priority policies. We are interested in individual, real-time wait time estimation, as opposed to the average wait time of all customers, which is usually based on steady-state assumption. We focus on estimating wait time distributions, rather than merely averages, as it may be beneficial to inform patients with higher percentiles of their expected waiting time, or even present it within a range. For example, in [3] it was shown that patient satisfaction deteriorates sharply as the waiting time exceeds the announced average waiting time, which may result in anger and violence towards the medical staff. Moreover, a preliminary user-study, as part of the multidisciplinary research, revealed that customers prominently prefer delay estimators which are given as a range 3

5 Version I: node 2 Figure 1.1: Two snapshots of customer and server flows in a call center. The left graph shows the customer flow, while the right one shows the server flow. Animation: Customer / Server Networks. ( your wait will be between 45 to 6 minutes ), and find them more understandable and credible than delay estimators which are given as: your wait will be approximately 45 minutes or your wait will be at least 45 minutes. The exploration of complicated service networks, such as healthcare systems, has led to the realization that a new framework for modeling and analyzing queueing networks is greatly needed. Healthcare systems, as well as other service systems (e.g. call centers), have become extremely large in the last decade. For example, one of the healthcare systems which provides the data for our research is a US outpatient hospital. On average, 1 patients visit this hospital every day, and receive treatment by 35 staff members. The essence of a large number of servers that grow with the arrival rate is captured in the many-server heavy traffic theory. In [14] it was shown that in the many-server regime, as opposed to conventional heavy traffic, servers may not always be busy, and the probability that an arriving customer will have to wait is strictly less than one. Therefore, not only customers wait for servers, but also servers wait for customers. This is also true in small scale systems, in which in order to provide a satisfactory level of service there is a need to reduce agent efficiency. Additionally, although servers are present in the system, they might be occupied with other activities, hence, unable to provide service. Consider for example an ED physician which is called to treat a patient in an internal ward, hence becomes unavailable for ED patients. This dynamic was introduced in the Erlang S model ( S for Servers ) presented in [6], and is indeed a natural dynamic in service systems which is also evident through data. An example for that is seen in Figure 1.1, taken from Momčilović et al. [29], which shows a time snapshot of both customer flow within a call center (on the left) as well as server flow (on the right) within that system. 1 (An animation representing the time evolution of the two networks can be found at Customer / Server Networks.) This apparent symmetry between customers and servers is the motivation for our novel framework 1 for 1 Rectangles in both graphs represent activities. Green rectangles correspond to activities that involve both customers and servers. The vertical bars on top of activities represent the number of customers/servers involved in a particular activity at a given time. Individual customers and servers are shown with small discs. A disc moves between two activities while the corresponding customer/server is engaged in the activity that corresponds to the egress node. 4

6 Figure 1.2: A snapshots of a resource-driven activity network of a call center, including both customers and servers. Animation: RDAN animation. queueing networks. Traditionally, when modeling or analyzing queueing networks, we focus on one entity in the service system usually either customers or servers, and analyze the system where each of the entities has its own role. Following [29] we propose a unifying framework under which each entity is equally considered as a resource. The queueing model will then be activity-oriented, where each activity requires a subset of the resources and produces others. We therefore refer to it as a Resource-Driven Activity Network or RDAN. Figure 1.2 is an example of such a network, taken from the same data of Figure 1.1. Here both customers and servers are shown in a single network, and both play a symmetric role. (See RDAN animation for the corresponding animation.) Our proposed framework is also natural when more than one type of server is needed in order to complete a service, which is exactly the case in healthcare systems, such as an ED or the outpatient hospital we are partnering with. For example, in order for an infusion process to take place, a patient, an available nurse, a vacant bed and the drug must all be present; all function as resources for the same activity. We note that there are other natural settings for which our framework would be beneficial; for example, the Uber application which matches customers and taxi drivers based on their location and requirements. Our second goal is therefore to develop methodologies for modeling service networks via data-driven RDANs, which will be then used to support design, staffing, and control of service networks, and will inspire novel prediction methods. Two healthcare systems provide the data for our research: an ED of a large Israeli hospital, and a US outpatient hospital. In the US hospital, each patient, staff member and equipment is fitted with RTLS (Real- Time Location System) device, which provides extraordinary transaction-level data. The Israeli ED database was characterized and established as one of the first steps in our research. It is unique transaction-level data that will become a database which updates in real-time in the future. These two systems are very different as the outpatient hospital processes are all planned in advance while an ED environment is much more stochastic. We will test the methodologies developed in this research on both environments. 5

7 This research proposal is structured as follows: in Chapter 2 we survey existing literature; in Chapter 3 we present two approaches for real-time estimation of wait time distribution - via exact analysis, and by heavy-traffic approximations. Chapter 4 presents the general framework of RDANs, followed by an example of a two time-scale limit arising naturally from the model, and an example of data-based creation of such RDANs. We conclude with Chapter 5 where we summarize our research goals. 6

8 Chapter 2 Literature Review We divide the literature review to two parts. First, we start with a survey, in Section 2.1, of known wait time estimators, and review existing research on the influence of delay announcements on customer behavior and system operation. Then, in Section 2.2, we review the theoretical grounding for our Resource-Driven Activity Networks. 2.1 Wait Time and Sojourn Time Predictors There is ample literature on wait time and sojourn time estimators for service systems. Each of those estimators, or prediction methods are appropriate under different circumstances. We start with steady state approximations for queuing networks, followed by heavy traffic approximations, both conventional heavy traffic and many-server heavy traffic (QED regime). We then survey the literature on real-time waiting estimators for single station, as well as real-time estimators for queuing networks Steady State Approximations of Waiting Times in Queueing Networks A classic result by Reich [34, 35] shows that in a system of M/M/1 queues in tandem, in steady state, the sojourn times (waiting time + service time) in each of the stations are independent and are exponentially distributed with rate (λ µ i ), where λ is the arrival rate and µ i is the service rate at station i. However, Burke et al. [1] showed that the waiting times in a system of M/M/1 queues in tandem are in fact dependent. For example, if W 1 is the waiting time at the first station, and W 2 is the waiting time at the second station, then it was shown that P (W 2 = W 1 = ) > P (W 2 = ). Burke [9] also showed that although the sojourn times in two M/M/s queues in tandem are independent, this does not hold for more than two stations. It was shown that a necessary condition for the independence of the sojourn times in a series of k M/M/s queues in tandem is that the number of servers in stations 2,..., k 1 will be equal to 1. This means that the first and the last stations could have multiple servers; however, the stations in the middle must be single server stations. This is actually the case in the outpatient hospital we are partnering with. A patient path within the hospital usually starts with blood draw, where the patient can receive treatment from a pool of nurses. The second station is a doctor exam; here each patient has her own doctor, and therefore, this station could be modeled as a single server queue. The third station is usually infusion; here the patient is assigned to one of several beds/chairs and can then receive treatment from a pool of nurses. Although the above results are based on the assumption of Poisson arrivals and exponential service times in steady state, it shall be interesting to examine these estimators on our outpatient hospital data with the three stations described here. 7

9 Lemoine [25] developed a set of recursive equations for the Laplace transform of the residual sojourn time within a service network. The model he analyzed is a Jackson network of N single server stations, with a FCFS priority policy. The goal was to derive the sojourn time moments for a customer entering the system at node i when the system is in steady state. Our goal is to expand this kind of analysis, which provides not just the expected value of the remaining sojourn time in the system but also moments of any order (or even better, the distribution), to real-time estimators (rather than steady state estimators) for customers remaining sojourn time in the system. Boxma [7] considered a service network of two M/G/1 queues in tandem, where the service time distribution of the first queue has a support (, b], and the service time distribution of the second queue has a support [b, ). This means that the probability that the service time in the first queue is shorter than the service time in the second queue is equal to 1. For the steady state case, he derived the sojourn time distribution at the second station, the joint distribution of the waiting time in both queues and the total waiting time. It follows from his analysis that when longer services are performed first the waiting time at the second station is zero. While this is true in theory, in practice, we find that also non-bottleneck stations have queues. We note that the results provided here are also presented in an extensive survey of product form on non-product form results for sojourn time in queueing networks given by Boxma and Daduna [8] Heavy-Traffic Approximations Conventional heavy-traffic Reiman [36] studied heavy-traffic diffusion approximations for sojourn time in Jackson networks with K single server stations. Defining λ i as the exogenous arrival rate to node i, µ i as the service rate at node i, P as the routing matrix, and denoting R = (I P ) 1, and γ = λr, the traffic intensity in node i is then defined as ρ i = γ i µ i. The conventional heavy traffic approximation is derived by taking a sequence of Jackson networks indexed by n 1, such that arrival rates, service rates, and the routing matrix depend on n, and ρ i (n) 1, as n. The main result of [36] is the snapshot principle: It is as if the customer takes a snapshot of the network when he enters and all queues remain at the same value during the customer s sojourn through the network. To elaborate on that, let h be a visiting vector - a vector associated with customer route through the system; h i denotes the number of times the customer enters station i during a visit in the system. The snapshot principle states that the diffusion approximation of the stationary sojourn time distribution, for a customer entering the system at node k and following a visiting vector h, is given by, W k,h K h i τ i where τ i are exponentially distributed with mean (µ i γ i ). This implies that the diffusion approximation only depends on the visiting vector h and not on the specific path within the system. It also implies that the customer waiting time at station i is the same for every visit of station i along the path. Snapshot principle-based estimators follow the assumption that changes in queue lengths are negligible during a customer visit within the system; this gave rise to the Last-to-Enter-Service and Head-of-the-Line estimators (see Subsection 2.1.3) which basically approximate the waiting time of an arriving customer by the waiting time of the last customer to enter service, or the elapsed waiting time of the customer who is currently at the head of the queue. Huang et al. [17], for example, used the snapshot principle to approximate queue length, virtual waiting time and sojourn times in a single server multi-class queue. They considered an emergency department physician treating two types of patients: triage (arriving) patients and in-process (IP) patients, where each of i=1 8

10 these types may be further divided into priority classes. The snapshot principle suggests that when a patient of class k completes service, the queue of class k patients approximately equals the number of patients that arrived during this patient waiting time (note that in conventional heavy-traffic, service time is negligible compared to the waiting time, hence there is no need to consider the patient service time). Defining Q k (t) as the queue length of class k patients at time t, λ k as the arrival rate of class k patients, ω k (t) as the virtual waiting time of a patient of class k arriving to the queue at time t, and τ k (t) as the age of the patient at the head of class k queue at time t, then by the snapshot principle it was shown that, and by Little s law, it follows that, Q k (t) λ k τ k (t), ω k (t) τ k (t). If a patient enters the ED at time t, and has a predetermined route which is characterized by the visiting vector h (as defined in [36]), then an approximation for the patient sojourn time in the ED will simply be: K h k τ k (t). k=1 The snapshot principle based estimators are appropriate in cryptically loaded systems (ρ 1) and when the priority policy within each class is FCFS. These criteria may not hold in healthcare systems, which often alternate between overloaded and underloaded periods, and in which the priority policy is rarely FCFS. Furthermore, patients priorities are usually unknown in advance and may change during their stay in the system. Therefore, although the snapshot principle provides simple estimators for wait times and sojourn times in service networks, it may not be adequate in our case. We are now in the process of testing the snapshot estimators on two healthcare data sets we acquired for this research. Many-server heavy-traffic Mandelbaum et al. [27] developed limit theorems for a variety of queueing networks, including classical Jackson networks, queues with abandonment and retrials, priority queues and Jackson networks with state dependent routing. The many-server heavy-traffic approximations are derived by taking a sequence of systems, indexed by n 1, such that arrival rates and the number of servers in the n th system are scaled by n, while service rates remain unscaled. The authors have developed both fluid and diffusion approximations for the queue length processes at each station (queue length was considered as the number of customers in queue and in service). In Section 3.2 we follow their theory, and use their functional strong law of large numbers and functional central limit theorems to derive wait time and sojourn time approximations. In the following papers [26, 28] they focus on a queue with abandonment and retrials and develop fluid and diffusion approximations for the queue length and the waiting time process. We shall elaborate on their approach as we follow it in Section 3.2 to develop wait time estimators for a system of two queues in tandem Real-Time Estimators Single station Whitt [4] presented the value of real-time estimators over steady state estimators, by showing that adding information on the system state (such as the number of customers in the system) can only reduce the variance of the estimator. As stated in the introduction, a preliminary user study suggests that customers perceive wait time estimators which are given as ranges as more credible. Thus, reducing the variance using information on the system state is important as one may provide wait time estimators which include a certain range around the average but are still informative enough. Whitt also emphasized the importance of estimating 9

11 the full CDF (or distribution) of the wait time, as it supports the interesting question of what should be announced to customers, and allows one to provide more information to customers than merely the average waiting time. He also provided exact and approximate predictors for wait time distribution in single station, which we intend to expand in our research for service networks. Ibrahim and Whitt [19, 2, 21] proposed a series of real-time wait time estimators for a single queue with abandonments, time varying arrival rate and capacity, general service, inter-arrival and patience distribution. Their predictors could be categorized as two types: queue-length-based (QL) predictors, and delay-historybased predictors. The simple (QL) predictors state that if a customer observes n customers in queue upon arrival, the mean service duration is m and there are s servers in the system, then the average waiting time can be approximated by: θ QL (n) = (n + 1)m s as the customer will have to wait for the completion of n+1 services before reaching service. This predictor can then be refined to account for abandonments (which will shorten the queue during the customer wait), and time varying capacity. An example of the delay-history-based predictors are the Head-of-the-Line (HOL) predictor and the Last-to-Enter-Service predictor. The simple form of these predictors state that the waiting time of an arriving customer will be as the waiting time of the customer who is currently at the head of the queue, or as the waiting time of the last customer to enter service. These predictors are also refined to account for the case of the time varying arrival rate. The authors also suggested a method for estimating the queue length using the wait time of the head of the line customer, and a method for estimating the wait time of the head of the line by the queue length. Ibrahim et al. [18] assumed that an arriving customer at time t is provided with the LES estimator, and defined the initial conditions which guarantee the asymptotically relative accuracy of this announced delay. They considered both the Quality and Efficiency Driven (QED) regime and the Efficiency Driven (ED) regime. In the QED regime, the condition for the relative accuracy of the LES estimator is that the initial number of idle servers and the number of customers in queues are not too large. In order for the LES delay announcement to be relatively accurate, the difference between the queue length seen at the customer arrival and the queue length seen at the arrival of the last customer to enter service should be negligible. In the ED regime, waiting times are very long; hence the state of the queue may change significantly during a customer s stay in the queue. The main condition, therefore, for the relative accuracy of the LES announcement, is the convergence of the initial number of customers in the system to its fluid limit. Thiongane et al. [39] expand the delay-history-based predictors to a weighted average form, that can coupe with multi-class queue. They estimated the arriving customer wait times using the wait time of customers from the same type that are either currently in queue (can be considered as weighted HOL) or that completed service (can be considered as weighted LES). They also suggested a predictor which is based on the wait times of customers from the same type who found the same queue length at their arrival. Although these real-time predictors cover a wide range of service systems they only account for single stations and provide solely an estimation for the average waiting time. Nakibly [31] focused on estimating wait time distribution given the system state at the time of arrival, in single station service systems with skills-based-routing (their motivation came from call centers). In order to derive the moment generating function of the waiting time in the system, they used first step analysis and solved it using difference equations. They also suggested estimating the wait time distribution using the Phase-type representation of the system and defining the absorbing state such that the wait time of a specific customer type will be given by the time to absorption. We shall use a similar approach in our exact analysis, given in Section 3.1. (2.1) 1

12 Queueing networks Ang et al. [3] focused on predicting the time-to-treatment in an ED. This is the time elapsed between the patient registration and the first encounter with a physician or a nurse practitioner (this time includes the triage process). The authors have shown that incorporating information about the ED state (the number of patients in queue, the processing rate or the number of providers), as well as fluid model estimators (such as given in 2.1) in statistical learning methods (e.g. regularized linear regression) significantly improves the accuracy of the time-to-treatment estimator. The authors suggested that their prediction method can be applied not just to entire time-to-treatment (from registration to first treatment) but also to the remaining time-to-treatment, which is the time between the triage process and the first treatment. After the triage state patients are given their acute level which defines their priority in the system; hence one needs to restrict the training data set to patients of the same triage level. We note here that Senderovich et al. [37, 38] also showed that when applying statistical learning methods, enriching the set of possible predictors with queueing perspective variables, such as the queue-length, results in better prediction models. Gal et al. [13] tackled the issue of predicting buses traveling times given a scheduled bus journey. Their predictors are a function of the source/destination pair and the time at which the prediction is made. The authors have modeled road segments between different bus stops as a network of queues, where the buses are assumed to be customers that travel through the network. Per each segment, they have defined a snapshotbased predictor: Last-Bus-to-Travel-Segment, and then extended it to the network setting as it was shown in [36] that the snapshot principle holds for networks when the route is predetermined. This approach was compared with machine learning techniques such as random forest and extremely randomized trees. These are boosting algorithms which construct sequences of models, each based on the previous one. Thus, the final stage was to combine both methods by using the snapshot-based model as the baseline model for the machine learning techniques. The three types of prediction methods were evaluated against real data, where the latter (combination of snapshot principle and machine learning) proved to be the most promising. The bus traveling time prediction is somewhat similar to the well familiar Waze application. Although the Waze application has the freedom to choose among different routes, it receives a source, a destination, and a time stamp, and produces a traveling time estimation (after choosing the best route). The application algorithms are not published. However, according to their website (Waze) they break the route into segments and use real time current road speed, provided by other Waze users on a specific segment road, to predict its travel time. It is therefore assumed that the speed of another Waze user ahead of you in a specific segment will be the same as your own speed in the segment, which is basically the snapshot principle. If there are no current observations, or not enough observations of travelers along a particular segment, Waze will use historic data, according to the time of the day, day of the week, month, special dates, and so on. This is actually applying machine learning techniques, which uses historic data to formalize a prediction method. We note that our goal of predicting waiting times and sojourn times within a service network is very similar to the goals of the time-to-treatment estimation and the traveling time estimation. In a sense, we are aiming to provide Waze for healthcare systems. However, unlike the proposed methods here, in our case customers routes may not be known in advanced, nor even the destination point. This is a problem that we do not address in this proposal but is part of our research plan. Additionally, the methods specified above results in a single number estimation, while we are interested in the full distribution or at least the first and second moments. As part of our research we shall use our ED and outpatient hospital data to test the methods proposed here, specifically those that are applying machine learning techniques with queueing theory estimators. 11

13 2.1.4 Delay Announcements As stated before, our research on wait time and sojourn time estimation in service networks, and in particular healthcare systems, is part of a multidisciplinary research that aims to test the effect of providing such estimators to patients. One of our goals is to develop estimators for wait/sojourn time distribution which could then provide different delay announcements to patients. We are currently collaborating with a large Israeli hospital in developing a platform which will provide, along with process information, individual real-time delay announcements to patients and their escorts. The effect of these delay announcements on patients feeling and behavior, and on the system operation will be then tested using a field study. Interesting questions are therefore what should be announced to patients? How will the delay announcements influence patients behavior and in turn system operation? These issues will not be addressed in this research proposal. However, we note that there is extensive literature on this subject, of which we only review a few here those that are the most relevant to our study. Dong et al. [12] found that patient are using information provided on the Internet regarding expected ED delays in order to chose the ED they will turn to. It was shown that accurate delay announcements will increase synchronization between near by hospitals, which means that the loads in approximate hospitals will be balanced, hence causing a desirable pooling effect. However, if delay estimations are not accurate, pooling will be partial and hence, network resources will not be fully utilized. Moreover, it was shown that the accuracy of the estimator is not the only parameter influencing the system efficiency, but also the method by which the estimation is made. Using simulation they found that a moving average estimator (based on the last 4 hours) may cause the opposite effect of desynchronization. (An effect that did not appear when using an estimator which is based on the true waiting time plus normally distributed white noise.) Although both estimators has the same accuracy, the former will cause self-osculation. Thus, the accuracy of a delay announcement is not the only measure to be considered, but also its time-bias. Jouini et al. [23] proposed a model in which customers are announced with a certain percentile of the estimated delay distribution, and may either balk or abandon. They presented the tradeoff in choosing the desired percentile. By informing high percentiles more customers are likely to balk, which may cause the system to be less efficient than it could be. On the other hand, announcing a low percentile may reduce balking but cause more abandonments which will be accompanied with dissatisfaction with the organization and a reduction in its perceived reliability. To choose the ideal percentile the authors used numerical analysis. In [22] it was assumed that the cost of overestimating (choosing high percentiles) and underestimating (choosing low percentiles) are not equal; hence, a newsvendor model was used to choose the announced percentile. In Jouini et al. [24] the virtual wait time distribution for multi-class M/M/s queue with two priority classes was found, at first with no delay announcements. Then, it was assumed that customers are provided with delay announcements which are based on a certain percentile of the estimated delay distribution, and that customers have a balking probability given the announced delay. The mean and variance of the virtual delay for both customer priorities, given the balking probability, was then calculated. This was done by turning the virtual delay into the time to absorption in a two-dimensional transient Markov chain. Armony et al. [5] also assumed that customers react to delay announcements in two ways by balking or by adjusting their patience according to the expected delay. However, they noted that the system performance is influenced by customers response to delay announcements; hence the actual delay may change. This leads to a need to find the equilibrium delay announcement, which will be consistent with the actual delay, given customers reaction to it. In this work, the conditions for the existence and the uniqueness of equilibrium delay announcement were provided. The equilibrium delay announcement was obtained for the fluid model of an M/M/s + M queue with exponential balking function and exponential conditional patience distribution. 12

14 Allon et al. [1] assumed that the service organization and the customers are strategic players, gaining (different) rewards from service and paying (differently) for the time spent in the system. The organization chooses a certain delay announcement given the queue length (which is not observed by customers) and customers choose whether to join or balk given the announcement. The organization may provide various types of delay announcements, which may be concrete or intangible, quantifiable or not, and even misleading. Although customers cannot confirm the information they receive, they are not naive, and thus, will make decisions based on their judgment of the information given. They characterized the equilibrium obtained which depends on the queueing dynamics and the strategic nature of the players. It was shown that in equilibrium, if the customers and the organizations are not perfectly aligned (perfectly aligned means that the customers and the organization agree on the preferred action for each state of the system), then the organization will impose an intentional vagueness. The purpose of this vagueness is to convince customers to join the queue when the system congestion is not too high and in cases for which, under full-information, the customers would not have joined it. This vagueness is in line of our approach that wait time announcements should provide customer range of waiting and not a single estimator of the mean. 2.2 Activity networks In Chapter 4 we present our resource-view approach for modeling service networks, which we refer to as RDAN. Our model relies on Harrison s stochastic processing networks (SPNs) and activity analysis [15, 16], which we now review. In SPNs, resources interact with materials via activities. One can think of resources as servers, materials as customers, and activities as services. Materials arrive to the network according to specified arrival rates and follow routes within the networks. The model inputs are k different materials, each with its own arrival rate, n different resources with their given capacity, and m activities. The model output is a plan vector x which specifies the rate at which each activity is conducted. Harrison [15] presented both a static fluid model and a stochastic, dynamic model. To explain the SPN approach and its difference from our RDAN we focus on the static model. The model defines a capacity consumption matrix A, where A i,j represents the rate at which activity j consumes the capacity of resource i. It also defines an input-output matrix R, where R l,j represents the rate at which material l is either produced (R l,j < ) or consumed (R l,j > ) by activity j. Defining b as the vector of resource capacities, and λ as the vector of materials arrival rates, a plan x will therefore be feasible if Ax b, Rx λ, and x. (2.2) There are a few fundamental differences between SPNs and our RDANs; we review them here, as they were presented in [29]. First, the fluid model is based on a conventional heavy-traffic regime, which means that the arrival rates and individual service rates are scaled (exhibit time speed-up) while the number of servers remains unscaled. Thus, resource pools can be considered as single server queues with high processing rates, that is, individual service times are negligibly short. As a result, for example, non-bottleneck nodes can be removed from the network for analysis purposes. In contrast, our model is based on a many-server regime, which means that there is no time speed-up for activity durations. Consequentially, all activities in the network must be considered, and one cannot focus on bottleneck nodes only. Additionally, in our model constraints on activity rates are due to a limited number of resources, which are considered to be both customers and servers, while in SPNs, constraints on activity rates are obtained due to finiteness of processing capacities of servers only. Finally, we are interested in the symmetry between customers and servers, however, we note that there is inherent asymmetry between the two entities. As customers arrive to the system exogenously and leave the system after completing their service (in Figure 1.1 this is depicted as discs moving along edges emanating 13

15 from activities and ending with a dot), servers stay in the system forever. Taking that into account, we focus on closed queueing networks (rather than open ones like SPNs), modeling exogenous arrivals and departures as an activity which produces arriving customers and to which all leaving customers are routed to. Note that analyzing closed networks is both more challenging and more general than analyzing open networks, as every open network can be modeled as a closed one, simply by modeling the outside world as a node in the system. 14

16 Chapter 3 Wait Time Estimators for Two Queues in Tandem Healthcare systems are complex queueing networks. As illustrated in Figure 3.1, a patient visit to an ED will include registration, preliminary exams by a nurse and then a doctor which might be followed by a series of tests (lab tests, imaging, etc.) and sometimes checkups or treatments provided by a nurse. The doctor will then decide whether to release or hospitalize the patient. Figure 3.1 presents a scheme of the ED process. We immediately observe that the resulting queueing network will consist of queues in tandem, queues in parallel, and fork-join queues. While some of the stations are single-server stations (e.g. nurse admission), others have multiple servers (e.g. lab tests). The fact that most patients return to the ED physician more than once imposes yet another challenge, as noted in Yom-Tov and Mandelbaum [41]. Huang et al. [17], for example, distinguished between two types of patients: triage (arriving) patients, and in-process (IP) patients, where each of these types can be divided into several classes according to the medical severity of the patient. They showed that the priority policy the physician should apply depends on the triage score as well as how advanced the patient is in the entire process. Even though most EDs do not apply such a complicated priority scheme, prioritizing patients according to severity is very common. Note that in reality, patients priorities, as well as their paths through the ED process, may not be known in advance, and may change during their ED visit. Another characteristic of this environment is its time-variability, as was shown clearly in Armony et al. [4]. As a result of this variability the system may alternate between overloaded, critically and underloaded periods, or equivalently with high, medium, or no wait for specific activities during one s stay. As discussed in Senderovich et al. [37], different wait time predictors are appropriate under different circumstances. For example, they compared queue-length based predictors with snapshot principle predictors (Last-to-Enter-Service, Head-of-the-Line), and predictors which are based on past delays of served customers (denoted by PTS). They have shown that queue-length based predictors performed poorly in heavily loaded systems, compared to the other predictors. However, in moderate loads, the queue-length based predictors outperform the PTS predictors. The snapshot principle predictors were superior in heavy loads and in moderate loads. It is important to note that the predictors were tested in a call center environment, in which the assumption that the queue length does not change significantly during the wait time of a single customer is fairly reasonable. Our preliminary ED data analysis revealed that the average length of stay in the ED is around 5 hours (and actually depends on the time of arrival, with a substantial decrease for patients arriving around 8:). As it was shown in Armony et al. [4], during such a long time the number of patients in the ED may change significantly; hence the snapshot principle may not be adequate in this case. The aforementioned properties of a healthcare system therefore imply that it is necessary to examine several types of wait time predictors, as there may not be one unified predictor which will be appropriate for 15

17 the whole system, at all times. Following our first goal of developing wait time estimators for service networks, we start with creating predictors for the real-time wait time distribution in two tandem queues (network); those queues may be time-varying and multi-server with different loads (Section 3.2). Other features that were discussed in Chapter 1 (such as different priorities and unknown patients routes) are not considered but will be part of our planned research. We start with a simple example of two queues in tandem. These two queues may correspond to the first two stations in the ED (after registration); the preliminary exams by nurse and then doctor. Exploratory data analysis of our ED data revealed that the sojourn time in both stations is well described by a log-normal distribution, as seen in Figures 3.2, and 3.3 (this is actually also true for the X-Ray station). Nurse Queue Lab tests Queue Nurse Lab Entrance Reception Triage Queue Triage (nurse) Doctor Queue Doctor Examination Imaging Queue Imaging CT/US/X-Ray Interpretation Queue Imaging Interpretation Consultant Queue Consultants External exams Queue External exams Eco/Gastro/ Exercise Stress Test External exam Interpretation Treatment Resource queue Synchronization queue Waiting for Hospitalization Discharge Exit ED Figure 3.1: Flow chart of an ED process 3.1 Wait Time Distribution Given Number of Customers in Both Queues: Exact Analysis In this section we take an exact approach for estimating the waiting time distribution for a customer reaching a network of two M t /M/1 queues in tandem, with FCFS service policy, and encountering x 1 customers in the first station (queue+service) and x 2 customers in the second station (queue+service). We shall call this customer the target customer. Let µ 1 be the service rate at the first station and µ 2 the service rate at the second station. Following the approach in [31], we can describe the system as a Markov jump process, having states (i, j) S, such that i represents the number of customers in station 1, and j represents the number of customers in station 2. Since we are assuming FCFS policy, we can stop all exogenous arrivals 16

18 Figure 3.2: Sojourn time at the nurse admission station, April 214 to May 215, All days. Figure 3.3: Sojourn time at the doctor station, April 214 to May 215, All days. after the target customer arrival. State (x 1 + 1, x 2 ) will be the initial state, and (, ) the absorbing state; all other states are hence transient. The sojourn time in the system will then have a phase-type distribution, which is the absorption time, starting at (x 1 + 1, x 2 ). Let matrix R be the transient part of the infinitesimal generator, and q the initial distribution, q = (1,,..., ). The system moves from state (i, j) to state (i 1, j + 1) when a completion of service in station 1 occurs before a completion of service in station 2. This happens with rate mu 1. The system moves from state (i, j) to state (i, j 1) when a completion of service in station 2 occurs before a completion of service in station 1. Thus, the entries of R will be of the form: R (i,j),(i 1,j+1) = µ 1, i = 1,..., x 1 + 1, j =,..., x 2 + x R (i,j),(i,j 1) = µ 2, i =,..., x 1 + 1, j = 1,..., x 2 + x The resulting transition diagram is given in Figure Sojourn Time Let T be the absorption time, or equivalently, the sojourn time in the system for a target customer arriving when there are x 1 customers in station 1 and x 2 customers in station 2. The cumulative distribution function of T is therefore: F T (t) = 1 q exp tr 1, 17

19 where 1 is a vector of ones. Figure 3.4: Transition diagram, two M/M/1 queues in tandem. We have simulated four scenarios of the two queues in tandem, in all four of them we set λ = 1. We realize that the time spent in each of the queues depends on the service rate in each of them and the number of customers that are already in the system at the time arrival (x 1, x 2 ). Thus, the four scenarios correspond to: Scenario 1 (denoted S1): µ 1 > µ 2, x 1 > x 2. Scenario 2 (denoted S2): µ 1 < µ 2, x 1 > x 2. Scenario 3 (denoted S3): µ 1 > µ 2, x 1 < x 2. Scenario 4 (denoted S4): µ 1 < µ 2, x 1 < x 2. We chose the service rate parameters to be close to the arrival rate (which is 1) in order for the simulation to easily reach reasonable queue lengths in both stations. Hence, we chose µ 1 = 1.2, and µ 2 = 1.5, and vice versa. The number of customers in each of the stations was chosen to be x = 12, and x 2 = 5, and vice versa. These values were chosen in order to create a substantial difference between the two queues, yet simulate a system which is of reasonable for the case of an ED for example. In each of the cases the simulation run for more than 2 iterations. Figure 3.5 presents the CDF of the total sojourn time of the target customer (in blue) and the CDF of the absorption time in the corresponding phase-type distribution (in red). We see that, as expected, in all four cases there is an excellent fit Wait Times Let W 1 be the customer waiting time in the first queue. A customer that finds x 1 customers in front of her, assuming FCFS policy, needs to wait a total of x 1 service times, each exponentially distributed with rate µ 1. Hence, (W 1 Q 1 = x 1, Q 2 = x 2 ) Gamma(x 1, µ 1 ). 18

20 Figure 3.5: Simulation results, CDF of sojourn time in the system. We define T 1 as the time elapsed from the customer s arrival until her exit from the first station (given x 1 customers in the first station at her arrival). It is given by, T 1 Gamma(x 1 + 1, µ 1 ). We define T 2 as the time required for the server in station 2 to serve all the x 2 customer in the second station; then T 2 Gamma(x 2, µ 2 ). Let T 3 be the time required for the server in station 2 to serve all x 1 + x 2 customers in the system, if they were all present in the second station; then T 3 Gamma(x 1 + x 2, µ 2 ). We note that there might be cases in which the server at station 2 is idle but there are still customers in station 1 (in the transition diagram in Figure 3.4, these are states (i, ), i > ). We define K as the number of time lags which occur whenever the server at station 2 is idle and the server at station 1 is busy. We can 19

21 think of K as the number of states of the form (i, ), i >, visited in the path from state (x + 1, x 2 ) to state (, ), thus K x From the memoryless property of the exponential distribution, each of these time lags is exponentially distributed with rate µ 1. We denote by L the sum of these time lags; then L k := (L K = k) Gamma(k, µ 1 ). Let W 2 be the target customer waiting time in the second queue; then P (W 2 > t Q 1 = x 1, Q 2 = x 2 ) = P (T 3 + L T 1 > t) = x 1 +1 k= We note that for each k =, 1,..., x 1 + 1: Hence, P (T 3 + L T 1 > t K = k) P (K = k) = x 1 +1 k= T 1 L k + L x1 +1 k. P (T 3 > t + T 1 L k K = k) P (K = k). P (T 3 > t + T 1 L k K = k) = P (T 3 > t + L x1 +1 k), L x1 +1 k Gamma (x k, µ 1 ), and we get, P (W 2 > t Q 1 = x 1, Q 2 = x 2 ) = = x 1 +1 k= x 1 +x 2 1 i= x 1 +1 k= P (T 3 > t + L x1 +1 k L x1 +1 k = s) f (s)ds Lx1 P (K = k) +1 k ((t + s)µ 2 ) i e ( (t+s)µ 2) µ x 1+1 k 1 i! Γ(x k) sx 1 k e µ 1s ds P (K = k). (3.1) It is left to find the distribution of K and then, by the law of total probability, the calculation of the waiting time distribution at the second queue will be complete. We note that entry (x, y) in the fundamental matrix: N = [I R] 1 represents the probability to visit state x given that we have started at state y. Hence, we can use the fundamental matrix to find the probability to reach each of the states (i, ), i >, starting at (x 1 + 1, x 2 ). Nevertheless, using this method to find the distribution of K may still be somewhat tedious. Our aim is to find a simple expression for the distribution of K. We shall also examine ways to detour the conditioning on the value of K, for example, by replacing it with the expected value of K. In some cases we can greatly simplify the calculations above, and eliminate the need for working with the Phase-Type matrices. By conditioning on the ratio between T 1 and T 2, for example, we get, P (W 2 > t Q 1 = x 1, Q 2 = x 2 ) = P (W 2 > t Q 1 = x 1, Q 2 = x 2, T 1 < T 2 ) P(T 1 < T 2 Q 1 = x 1, Q 2 = x 2 ) + P (W 2 > t Q 1 = x 1, Q 2 = x 2, T 1 > T 2 ) P(T 1 > T 2 Q 1 = x 1, Q 2 = x 2 ). 2

22 Given T 2 > T 1, it is obvious that K =, since this means that while server 2 serves the x 2 customers which are already there, all x 1 customers and the target customer, move from station 1 to station 2; hence there is no idleness in station 2. Therefore, P (W 2 > t Q 1 = x 1, Q 2 = x 2, T 1 < T 2 ) = P (T 3 T 1 > t T 1 = s) f T1 (s) ds. (3.2) Thus, we can first calculate P(T 1 < T 2 Q 1 = x 1, Q 2 = x 2 ) and if it is close to 1 use (3.2), instead of (3.1), to find the wait time distribution at the second queue. Figure 3.6 presents, for the four scenarios specified above, the empirical CDF of the wait time of the target customer at the second queue (in blue), and the results of: 1 P (T 3 T 1 > t T 1 = s) f T1 (s) ds (in red), which is the wait time estimator for the second queue given T 1 < T 2. Since the chosen service rates in both station were quite close, it turns out that the initial number of customers in each station had more effect on the probability that T 1 < T 2. We can see that in scenarios 3 and 4, where x = 5, and x 2 = 12, the probability that T 2 > T 1 is close to one; hence there is a very good fit. However, in the other two scenarios, 1 P (T 3 > t + s) f T1 (s) ds results with higher values than the actual CDF. This is due to the fact that without considering the time lags in which server 2 is idle, the total time it will take server 2 to clear all the customers in the system (x 1 + x 2 ) will be shorter than it actually is. We would have expected that the worst fit will be obtained for scenario 2, where the service rate at station 2 is higher and there are more customers at station 1 at the time of arrival. However, it seems that scenario 1 present the worst fit, although in this case station 1 has a higher service rate. To summarize this section, we note that the exact analysis approach presents some limitations: 1. When the first station has more than 1 server skips and slips during service might occur. This means that the target customer may enter the second queue before customers who were ahead of her at the first queue; in addition, customers arriving after the target customer might enter the second queue before her. One may assume that on average, the number of skips equals the number of slips hence; the above analysis will hold for the case of more than 1 server at the first station. We shall examine that using simulation. 2. When P(T 2 > T 1 ) < 1, one needs to use a matrix exponential in order to either find the phase-type distribution of the time to absorption, or to find the distribution of K (the number of time lags in which server 2 is idle). This approach will have the the curse of dimensionality as the number of stations in the system increases, or even if the initial state (x 1, x 2 ) is large. 3. This analysis allows only the arrivals to be time-varying, it does not support the case of time-varying service rates or staffing. Given the above limitations in the exact analysis approach, we now turn to examine heavy traffic approximations for the system of two queues in tandem. Our goal is to find approximations for the wait time distribution at each of the queues, as well as an approximation for the total sojourn time distribution. 21

23 Figure 3.6: Simulation results, CDF of wait time at the second queue. (Simulation (blue), estimation given T 1 < T 2 (red).) 3.2 Many-Server Heavy Traffic Approximations, Time Varying Following Mandelbaum et al. [27] in analyzing dynamics of queuing networks, we consider a network of two M t /M t /n t queues in tandem, with FCFS policy. We start with fluid analysis, in which we establish the fluid limits of the queue length processes, and derive the fluid limits of the wait time processes, in each of the two queues. The fluid limit will provide an approximation for the expected wait time, at each of the queues, and an approximation for the expected sojourn time. We shall then use diffusion analysis to derive a Gaussian approximation for the wait time in both queues. Let Q 1 (t) and Q 2 (t) be the number of customers at time t, in the first and second station (queue+service) respectively. The following notation will be of use: λ t - Arrival rate at the first station, at time t. µ 1t - Individual service rate at station 1, at time t. µ 2t - Individual service rate at station 2, at time t. 22

State-Dependent Estimation of Delay Distributions in Fork-Join Networks

State-Dependent Estimation of Delay Distributions in Fork-Join Networks State-Dependent Estimation of Delay Distributions in Fork-Join Networks Nitzan Carmeli, Galit B. Yom-Tov Technion Israel Institute of Technology, nitzany@campus.technion.ac.il, gality@technion.ac.il, Onno

More information

Making Delay Announcements

Making Delay Announcements Making Delay Announcements Performance Impact and Predicting Queueing Delays Ward Whitt With Mor Armony, Rouba Ibrahim and Nahum Shimkin March 7, 2012 Last Class 1 The Overloaded G/GI/s + GI Fluid Queue

More information

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis. Service Engineering Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis. G/G/1 Queue: Virtual Waiting Time (Unfinished Work). GI/GI/1: Lindley s Equations

More information

Dynamic Control of Parallel-Server Systems

Dynamic Control of Parallel-Server Systems Dynamic Control of Parallel-Server Systems Jim Dai Georgia Institute of Technology Tolga Tezcan University of Illinois at Urbana-Champaign May 13, 2009 Jim Dai (Georgia Tech) Many-Server Asymptotic Optimality

More information

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL by Rodney B. Wallace IBM and The George Washington University rodney.wallace@us.ibm.com Ward Whitt Columbia University

More information

Wait-Time Predictors for Customer Service Systems with Time-Varying Demand and Capacity

Wait-Time Predictors for Customer Service Systems with Time-Varying Demand and Capacity OPERATIONS RESEARCH Vol. 59, No. 5, September October 211, pp. 116 1118 issn 3-364X eissn 1526-5463 11 595 116 http://dx.doi.org/1.1287/opre.111.974 211 INFORMS Wait-Time Predictors for Customer Service

More information

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY SUPPLEMENTARY MATERIAL

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY SUPPLEMENTARY MATERIAL REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY SUPPLEMENTARY MATERIAL by Rouba Ibrahim and Ward Whitt IEOR Department Columbia University {rei2101, ww2040}@columbia.edu Abstract Motivated by interest

More information

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing

Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing Submitted to Management Science manuscript MS-251-27 Electronic Companion Fluid Models for Overloaded Multi-Class Many-Server Queueing Systems with FCFS Routing Rishi Talreja, Ward Whitt Department of

More information

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement Submitted to imanufacturing & Service Operations Management manuscript MSOM-11-370.R3 Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement (Authors names blinded

More information

The Performance Impact of Delay Announcements

The Performance Impact of Delay Announcements The Performance Impact of Delay Announcements Taking Account of Customer Response IEOR 4615, Service Engineering, Professor Whitt Supplement to Lecture 21, April 21, 2015 Review: The Purpose of Delay Announcements

More information

Delay Announcements. Predicting Queueing Delays for Delay Announcements. IEOR 4615, Service Engineering, Professor Whitt. Lecture 21, April 21, 2015

Delay Announcements. Predicting Queueing Delays for Delay Announcements. IEOR 4615, Service Engineering, Professor Whitt. Lecture 21, April 21, 2015 Delay Announcements Predicting Queueing Delays for Delay Announcements IEOR 4615, Service Engineering, Professor Whitt Lecture 21, April 21, 2015 OUTLINE Delay Announcements: Why, What and When? [Slide

More information

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates Xu Sun and Ward Whitt Department of Industrial Engineering and Operations Research, Columbia University

More information

Session-Based Queueing Systems

Session-Based Queueing Systems Session-Based Queueing Systems Modelling, Simulation, and Approximation Jeroen Horters Supervisor VU: Sandjai Bhulai Executive Summary Companies often offer services that require multiple steps on the

More information

Queues and Queueing Networks

Queues and Queueing Networks Queues and Queueing Networks Sanjay K. Bose Dept. of EEE, IITG Copyright 2015, Sanjay K. Bose 1 Introduction to Queueing Models and Queueing Analysis Copyright 2015, Sanjay K. Bose 2 Model of a Queue Arrivals

More information

MULTIPLE CHOICE QUESTIONS DECISION SCIENCE

MULTIPLE CHOICE QUESTIONS DECISION SCIENCE MULTIPLE CHOICE QUESTIONS DECISION SCIENCE 1. Decision Science approach is a. Multi-disciplinary b. Scientific c. Intuitive 2. For analyzing a problem, decision-makers should study a. Its qualitative aspects

More information

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement

Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement Wyean Chan DIRO, Université de Montréal, C.P. 6128, Succ. Centre-Ville, Montréal (Québec), H3C 3J7, CANADA,

More information

Control of Fork-Join Networks in Heavy-Traffic

Control of Fork-Join Networks in Heavy-Traffic in Heavy-Traffic Asaf Zviran Based on MSc work under the guidance of Rami Atar (Technion) and Avishai Mandelbaum (Technion) Industrial Engineering and Management Technion June 2010 Introduction Network

More information

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1 Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers 1 Mor Armony 2 Avishai Mandelbaum 3 June 25, 2008 Abstract Motivated by call centers,

More information

Dynamic Control of a Tandem Queueing System with Abandonments

Dynamic Control of a Tandem Queueing System with Abandonments Dynamic Control of a Tandem Queueing System with Abandonments Gabriel Zayas-Cabán 1 Jungui Xie 2 Linda V. Green 3 Mark E. Lewis 1 1 Cornell University Ithaca, NY 2 University of Science and Technology

More information

Classical Queueing Models.

Classical Queueing Models. Sergey Zeltyn January 2005 STAT 99. Service Engineering. The Wharton School. University of Pennsylvania. Based on: Classical Queueing Models. Mandelbaum A. Service Engineering course, Technion. http://iew3.technion.ac.il/serveng2005w

More information

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS Abstract Motivated by interest in making delay announcements in service systems, we study real-time

More information

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS

REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS REAL-TIME DELAY ESTIMATION BASED ON DELAY HISTORY IN MANY-SERVER SERVICE SYSTEMS WITH TIME-VARYING ARRIVALS by Rouba Ibrahim and Ward Whitt IEOR Department Columbia University 500 West 120th Street 313

More information

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers

Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers OPERATIONS RESEARCH Vol. 59, No. 1, January February 2011, pp. 50 65 issn 0030-364X eissn 1526-5463 11 5901 0050 informs doi 10.1287/opre.1100.0878 2011 INFORMS Routing and Staffing in Large-Scale Service

More information

1.225 Transportation Flow Systems Quiz (December 17, 2001; Duration: 3 hours)

1.225 Transportation Flow Systems Quiz (December 17, 2001; Duration: 3 hours) 1.225 Transportation Flow Systems Quiz (December 17, 2001; Duration: 3 hours) Student Name: Alias: Instructions: 1. This exam is open-book 2. No cooperation is permitted 3. Please write down your name

More information

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals

Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals OPERATIONS RESEARCH Vol. 6, No. 6, November December 212, pp. 1551 1564 ISSN 3-364X (print) ISSN 1526-5463 (online) http://dx.doi.org/1.1287/opre.112.114 212 INFORMS Stabilizing Customer Abandonment in

More information

THE HEAVY-TRAFFIC BOTTLENECK PHENOMENON IN OPEN QUEUEING NETWORKS. S. Suresh and W. Whitt AT&T Bell Laboratories Murray Hill, New Jersey 07974

THE HEAVY-TRAFFIC BOTTLENECK PHENOMENON IN OPEN QUEUEING NETWORKS. S. Suresh and W. Whitt AT&T Bell Laboratories Murray Hill, New Jersey 07974 THE HEAVY-TRAFFIC BOTTLENECK PHENOMENON IN OPEN QUEUEING NETWORKS by S. Suresh and W. Whitt AT&T Bell Laboratories Murray Hill, New Jersey 07974 ABSTRACT This note describes a simulation experiment involving

More information

Managing Call Centers with Many Strategic Agents

Managing Call Centers with Many Strategic Agents Managing Call Centers with Many Strategic Agents Rouba Ibrahim, Kenan Arifoglu Management Science and Innovation, UCL YEQT Workshop - Eindhoven November 2014 Work Schedule Flexibility 86SwofwthewqBestwCompanieswtowWorkwForqwofferw

More information

The effect of probabilities of departure with time in a bank

The effect of probabilities of departure with time in a bank International Journal of Scientific & Engineering Research, Volume 3, Issue 7, July-2012 The effect of probabilities of departure with time in a bank Kasturi Nirmala, Shahnaz Bathul Abstract This paper

More information

Performance Evaluation of Queuing Systems

Performance Evaluation of Queuing Systems Performance Evaluation of Queuing Systems Introduction to Queuing Systems System Performance Measures & Little s Law Equilibrium Solution of Birth-Death Processes Analysis of Single-Station Queuing Systems

More information

Queues in Hospitals: Queueing Networks with ReEntering Customers in the QED Regime

Queues in Hospitals: Queueing Networks with ReEntering Customers in the QED Regime Queues in Hospitals: Queueing Networks with ReEntering Customers in the QED Regime QED = Quality- and Efficiency-Driven Galit Bracha Yom-Tov Queues in Hospitals: Stochastic Networks with ReEntering Customers

More information

Erlang-S: A Data-Based Model of Servers in Queueing Networks

Erlang-S: A Data-Based Model of Servers in Queueing Networks Erlang-S: A Data-Based Model of Servers in Queueing Networks David Azriel, Paul D. Feigin, Avishai Mandelbaum. January 15, 2017 Abstract Classical queueing theory has typically focused on customers, while

More information

Data analysis and stochastic modeling

Data analysis and stochastic modeling Data analysis and stochastic modeling Lecture 7 An introduction to queueing theory Guillaume Gravier guillaume.gravier@irisa.fr with a lot of help from Paul Jensen s course http://www.me.utexas.edu/ jensen/ormm/instruction/powerpoint/or_models_09/14_queuing.ppt

More information

Queueing Theory. VK Room: M Last updated: October 17, 2013.

Queueing Theory. VK Room: M Last updated: October 17, 2013. Queueing Theory VK Room: M1.30 knightva@cf.ac.uk www.vincent-knight.com Last updated: October 17, 2013. 1 / 63 Overview Description of Queueing Processes The Single Server Markovian Queue Multi Server

More information

CS 798: Homework Assignment 3 (Queueing Theory)

CS 798: Homework Assignment 3 (Queueing Theory) 1.0 Little s law Assigned: October 6, 009 Patients arriving to the emergency room at the Grand River Hospital have a mean waiting time of three hours. It has been found that, averaged over the period of

More information

Q = (c) Assuming that Ricoh has been working continuously for 7 days, what is the probability that it will remain working at least 8 more days?

Q = (c) Assuming that Ricoh has been working continuously for 7 days, what is the probability that it will remain working at least 8 more days? IEOR 4106: Introduction to Operations Research: Stochastic Models Spring 2005, Professor Whitt, Second Midterm Exam Chapters 5-6 in Ross, Thursday, March 31, 11:00am-1:00pm Open Book: but only the Ross

More information

Wait-Time Predictors for Customer Service Systems With Time-Varying Demand and Capacity

Wait-Time Predictors for Customer Service Systems With Time-Varying Demand and Capacity Submitted to Operations Research manuscript (Please, provide the mansucript number!) Wait-Time Predictors for Customer Service Systems With Time-Varying Demand and Capacity Rouba Ibrahim, Ward Whitt Department

More information

Abandonment and Customers Patience in Tele-Queues. The Palm/Erlang-A Model.

Abandonment and Customers Patience in Tele-Queues. The Palm/Erlang-A Model. Sergey Zeltyn February 25 zeltyn@ie.technion.ac.il STAT 991. Service Engineering. The Wharton School. University of Pennsylvania. Abandonment and Customers Patience in Tele-Queues. The Palm/Erlang-A Model.

More information

Queuing Networks: Burke s Theorem, Kleinrock s Approximation, and Jackson s Theorem. Wade Trappe

Queuing Networks: Burke s Theorem, Kleinrock s Approximation, and Jackson s Theorem. Wade Trappe Queuing Networks: Burke s Theorem, Kleinrock s Approximation, and Jackson s Theorem Wade Trappe Lecture Overview Network of Queues Introduction Queues in Tandem roduct Form Solutions Burke s Theorem What

More information

Slides 9: Queuing Models

Slides 9: Queuing Models Slides 9: Queuing Models Purpose Simulation is often used in the analysis of queuing models. A simple but typical queuing model is: Queuing models provide the analyst with a powerful tool for designing

More information

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015

Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits. IEOR 4615: Service Engineering Professor Whitt February 19, 2015 Lecture 9: Deterministic Fluid Models and Many-Server Heavy-Traffic Limits IEOR 4615: Service Engineering Professor Whitt February 19, 2015 Outline Deterministic Fluid Models Directly From Data: Cumulative

More information

Data analytics in the practice of Emergency Medicine

Data analytics in the practice of Emergency Medicine Data analytics in the practice of Emergency Medicine Devashish Das Industrial and Management Systems Engineering University of South Florida November 16, 2018 Outline Healthcare systems engineering in

More information

Proactive Care with Degrading Class Types

Proactive Care with Degrading Class Types Proactive Care with Degrading Class Types Yue Hu (DRO, Columbia Business School) Joint work with Prof. Carri Chan (DRO, Columbia Business School) and Prof. Jing Dong (DRO, Columbia Business School) Motivation

More information

Name of the Student:

Name of the Student: SUBJECT NAME : Probability & Queueing Theory SUBJECT CODE : MA 6453 MATERIAL NAME : Part A questions REGULATION : R2013 UPDATED ON : November 2017 (Upto N/D 2017 QP) (Scan the above QR code for the direct

More information

Design, staffing and control of large service systems: The case of a single customer class and multiple server types. April 2, 2004 DRAFT.

Design, staffing and control of large service systems: The case of a single customer class and multiple server types. April 2, 2004 DRAFT. Design, staffing and control of large service systems: The case of a single customer class and multiple server types. Mor Armony 1 Avishai Mandelbaum 2 April 2, 2004 DRAFT Abstract Motivated by modern

More information

Nitzan Carmeli. MSc Advisors: Prof. Haya Kaspi, Prof. Avishai Mandelbaum Empirical Support: Dr. Galit Yom-Tov

Nitzan Carmeli. MSc Advisors: Prof. Haya Kaspi, Prof. Avishai Mandelbaum Empirical Support: Dr. Galit Yom-Tov Nitzan Carmeli MSc Advisors: Prof. Haya Kaspi, Prof. Avishai Mandelbaum Empirical Support: Dr. Galit Yom-Tov Industrial Engineering and Management Technion 23.06.2014 1 Introduction o Interactive Voice

More information

Waiting Time Analysis of A Single Server Queue in an Out- Patient Clinic

Waiting Time Analysis of A Single Server Queue in an Out- Patient Clinic IOSR Journal of Mathematics (IOSR-JM) e-issn: 2278-5728, p-issn: 2319-765X. Volume 11, Issue 3 Ver. V (May - Jun. 2015), PP 54-58 www.iosrjournals.org Waiting Time Analysis of A Single Server Queue in

More information

DELAY PREDICTORS FOR CUSTOMER SERVICE SYSTEMS WITH TIME-VARYING PARAMETERS. Rouba Ibrahim Ward Whitt

DELAY PREDICTORS FOR CUSTOMER SERVICE SYSTEMS WITH TIME-VARYING PARAMETERS. Rouba Ibrahim Ward Whitt Proceedings of the 21 Winter Simulation Conference B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, eds. DELAY PREDICTORS FOR CUSTOMER SERVICE SYSTEMS WITH TIME-VARYING PARAMETERS Rouba

More information

Introduction to Markov Chains, Queuing Theory, and Network Performance

Introduction to Markov Chains, Queuing Theory, and Network Performance Introduction to Markov Chains, Queuing Theory, and Network Performance Marceau Coupechoux Telecom ParisTech, departement Informatique et Réseaux marceau.coupechoux@telecom-paristech.fr IT.2403 Modélisation

More information

57:022 Principles of Design II Final Exam Solutions - Spring 1997

57:022 Principles of Design II Final Exam Solutions - Spring 1997 57:022 Principles of Design II Final Exam Solutions - Spring 1997 Part: I II III IV V VI Total Possible Pts: 52 10 12 16 13 12 115 PART ONE Indicate "+" if True and "o" if False: + a. If a component's

More information

Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers

Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers OPERATIONS RESEARCH Vol. 6, No., January February 23, pp. 228 243 ISSN 3-364X (print) ISSN 526-5463 (online) http://dx.doi.org/.287/opre.2.29 23 INFORMS Blind Fair Routing in Large-Scale Service Systems

More information

Exercises Stochastic Performance Modelling. Hamilton Institute, Summer 2010

Exercises Stochastic Performance Modelling. Hamilton Institute, Summer 2010 Exercises Stochastic Performance Modelling Hamilton Institute, Summer Instruction Exercise Let X be a non-negative random variable with E[X ]

More information

Figure 10.1: Recording when the event E occurs

Figure 10.1: Recording when the event E occurs 10 Poisson Processes Let T R be an interval. A family of random variables {X(t) ; t T} is called a continuous time stochastic process. We often consider T = [0, 1] and T = [0, ). As X(t) is a random variable

More information

We consider how two networked large-scale service systems that normally operate separately, such as call

We consider how two networked large-scale service systems that normally operate separately, such as call MANAGEMENT SCIENCE Vol. 55, No. 8, August 2009, pp. 1353 1367 issn 0025-1909 eissn 1526-5501 09 5508 1353 informs doi 10.1287/mnsc.1090.1025 2009 INFORMS Responding to Unexpected Overloads in Large-Scale

More information

Non Markovian Queues (contd.)

Non Markovian Queues (contd.) MODULE 7: RENEWAL PROCESSES 29 Lecture 5 Non Markovian Queues (contd) For the case where the service time is constant, V ar(b) = 0, then the P-K formula for M/D/ queue reduces to L s = ρ + ρ 2 2( ρ) where

More information

Queuing Theory. The present section focuses on the standard vocabulary of Waiting Line Models.

Queuing Theory. The present section focuses on the standard vocabulary of Waiting Line Models. Queuing Theory Introduction Waiting lines are the most frequently encountered problems in everyday life. For example, queue at a cafeteria, library, bank, etc. Common to all of these cases are the arrivals

More information

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017 CPSC 531: System Modeling and Simulation Carey Williamson Department of Computer Science University of Calgary Fall 2017 Motivating Quote for Queueing Models Good things come to those who wait - poet/writer

More information

Introduction to Queueing Theory with Applications to Air Transportation Systems

Introduction to Queueing Theory with Applications to Air Transportation Systems Introduction to Queueing Theory with Applications to Air Transportation Systems John Shortle George Mason University February 28, 2018 Outline Why stochastic models matter M/M/1 queue Little s law Priority

More information

Contact Centers with a Call-Back Option and Real-Time Delay Information

Contact Centers with a Call-Back Option and Real-Time Delay Information OPERATIOS RESEARCH Vol. 52, o. 4, July August 2004, pp. 527 545 issn 0030-364X eissn 526-5463 04 5204 0527 informs doi 0.287/opre.040.023 2004 IFORMS Contact Centers with a Call-Back Option and Real-Time

More information

ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL

ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL ENGINEERING SOLUTION OF A BASIC CALL-CENTER MODEL by Ward Whitt Department of Industrial Engineering and Operations Research Columbia University, New York, NY 10027 Abstract An algorithm is developed to

More information

Motivated by models of tenant assignment in public housing, we study approximating deterministic fluid

Motivated by models of tenant assignment in public housing, we study approximating deterministic fluid MANAGEMENT SCIENCE Vol. 54, No. 8, August 2008, pp. 1513 1527 issn 0025-1909 eissn 1526-5501 08 5408 1513 informs doi 10.1287/mnsc.1080.0868 2008 INFORMS Fluid Models for Overloaded Multiclass Many-Server

More information

Online Supplement to Creating Work Breaks From Available Idleness

Online Supplement to Creating Work Breaks From Available Idleness Online Supplement to Creating Work Breaks From Available Idleness Xu Sun and Ward Whitt Department of Industrial Engineering and Operations Research, Columbia University New York, NY, 127 June 3, 217 1

More information

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr. Simulation Discrete-Event System Simulation Chapter 9 Verification and Validation of Simulation Models Purpose & Overview The goal of the validation process is: To produce a model that represents true

More information

Online Supplement to Creating Work Breaks From Available Idleness

Online Supplement to Creating Work Breaks From Available Idleness Online Supplement to Creating Work Breaks From Available Idleness Xu Sun and Ward Whitt Department of Industrial Engineering and Operations Research, Columbia University New York, NY, 127 September 7,

More information

Designing load balancing and admission control policies: lessons from NDS regime

Designing load balancing and admission control policies: lessons from NDS regime Designing load balancing and admission control policies: lessons from NDS regime VARUN GUPTA University of Chicago Based on works with : Neil Walton, Jiheng Zhang ρ K θ is a useful regime to study the

More information

I, A BRIEF REVIEW ON INFINITE QUEUE MODEL M.

I, A BRIEF REVIEW ON INFINITE QUEUE MODEL M. A BRIEF REVIEW ON INFINITE QUEUE MODEL M. Vasuki*, A. Dinesh Kumar** & G. Vijayaprabha** * Assistant Professor, Department of Mathematics, Srinivasan College of Arts and Science, Perambalur, Tamilnadu

More information

PBW 654 Applied Statistics - I Urban Operations Research

PBW 654 Applied Statistics - I Urban Operations Research PBW 654 Applied Statistics - I Urban Operations Research Lecture 2.I Queuing Systems An Introduction Operations Research Models Deterministic Models Linear Programming Integer Programming Network Optimization

More information

TOWARDS BETTER MULTI-CLASS PARAMETRIC-DECOMPOSITION APPROXIMATIONS FOR OPEN QUEUEING NETWORKS

TOWARDS BETTER MULTI-CLASS PARAMETRIC-DECOMPOSITION APPROXIMATIONS FOR OPEN QUEUEING NETWORKS TOWARDS BETTER MULTI-CLASS PARAMETRIC-DECOMPOSITION APPROXIMATIONS FOR OPEN QUEUEING NETWORKS by Ward Whitt AT&T Bell Laboratories Murray Hill, NJ 07974-0636 March 31, 199 Revision: November 9, 199 ABSTRACT

More information

HITTING TIME IN AN ERLANG LOSS SYSTEM

HITTING TIME IN AN ERLANG LOSS SYSTEM Probability in the Engineering and Informational Sciences, 16, 2002, 167 184+ Printed in the U+S+A+ HITTING TIME IN AN ERLANG LOSS SYSTEM SHELDON M. ROSS Department of Industrial Engineering and Operations

More information

5/15/18. Operations Research: An Introduction Hamdy A. Taha. Copyright 2011, 2007 by Pearson Education, Inc. All rights reserved.

5/15/18. Operations Research: An Introduction Hamdy A. Taha. Copyright 2011, 2007 by Pearson Education, Inc. All rights reserved. The objective of queuing analysis is to offer a reasonably satisfactory service to waiting customers. Unlike the other tools of OR, queuing theory is not an optimization technique. Rather, it determines

More information

Real-Time Delay Prediction in Customer Service Systems

Real-Time Delay Prediction in Customer Service Systems Real-Time Delay Prediction in Customer Service Systems Rouba Ibrahim Submitted in partial fulfillment of the Requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences

More information

Recap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks

Recap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks Recap Probability, stochastic processes, Markov chains ELEC-C7210 Modeling and analysis of communication networks 1 Recap: Probability theory important distributions Discrete distributions Geometric distribution

More information

L = λ W time average number in Line or system arrival rate W average Waiting time per customer

L = λ W time average number in Line or system arrival rate W average Waiting time per customer IEOR 4615, Lecture 3, January 27, 2015 L λ Little s Law* L = λ W time average number in Line or system arrival rate W average Waiting time per customer *J. D. C. Little, A proof of the queueing formula:

More information

1 Modelling and Simulation

1 Modelling and Simulation 1 Modelling and Simulation 1.1 Introduction This course teaches various aspects of computer-aided modelling for the performance evaluation of computer systems and communication networks. The performance

More information

Maximizing throughput in zero-buffer tandem lines with dedicated and flexible servers

Maximizing throughput in zero-buffer tandem lines with dedicated and flexible servers Maximizing throughput in zero-buffer tandem lines with dedicated and flexible servers Mohammad H. Yarmand and Douglas G. Down Department of Computing and Software, McMaster University, Hamilton, ON, L8S

More information

Answers to selected exercises

Answers to selected exercises Answers to selected exercises A First Course in Stochastic Models, Henk C. Tijms 1.1 ( ) 1.2 (a) Let waiting time if passengers already arrived,. Then,, (b) { (c) Long-run fraction for is (d) Let waiting

More information

Proceedings of the 2017 Winter Simulation Conference W. K. V. Chan, A. D Ambrogio, G. Zacharewicz, N. Mustafee, G. Wainer, and E. Page, eds.

Proceedings of the 2017 Winter Simulation Conference W. K. V. Chan, A. D Ambrogio, G. Zacharewicz, N. Mustafee, G. Wainer, and E. Page, eds. Proceedings of the 2017 Winter Simulation Conference W. K. V. Chan, A. D Ambrogio, G. Zacharewicz, N. Mustafee, G. Wainer, and E. Page, eds. SIMULATION-BASED PREDICTIVE ANALYTICS FOR DYNAMIC QUEUEING SYSTEMS

More information

Operations Research II, IEOR161 University of California, Berkeley Spring 2007 Final Exam. Name: Student ID:

Operations Research II, IEOR161 University of California, Berkeley Spring 2007 Final Exam. Name: Student ID: Operations Research II, IEOR161 University of California, Berkeley Spring 2007 Final Exam 1 2 3 4 5 6 7 8 9 10 7 questions. 1. [5+5] Let X and Y be independent exponential random variables where X has

More information

Computer Systems Modelling

Computer Systems Modelling Computer Systems Modelling Computer Laboratory Computer Science Tripos, Part II Michaelmas Term 2003 R. J. Gibbens Problem sheet William Gates Building JJ Thomson Avenue Cambridge CB3 0FD http://www.cl.cam.ac.uk/

More information

Chapter 6 Queueing Models. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Chapter 6 Queueing Models. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation Chapter 6 Queueing Models Banks, Carson, Nelson & Nicol Discrete-Event System Simulation Purpose Simulation is often used in the analysis of queueing models. A simple but typical queueing model: Queueing

More information

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads Operations Research Letters 37 (2009) 312 316 Contents lists available at ScienceDirect Operations Research Letters journal homepage: www.elsevier.com/locate/orl Instability of FIFO in a simple queueing

More information

Master thesis. Multi-class Fork-Join queues & The stochastic knapsack problem

Master thesis. Multi-class Fork-Join queues & The stochastic knapsack problem Master thesis Multi-class Fork-Join queues & The stochastic knapsack problem Sihan Ding August 26th, 2011 Supervisor UL: Dr. Floske Spieksma Supervisors CWI: Drs. Chrétien Verhoef Prof.dr. Rob van der

More information

Robustness and performance of threshold-based resource allocation policies

Robustness and performance of threshold-based resource allocation policies Robustness and performance of threshold-based resource allocation policies Takayuki Osogami Mor Harchol-Balter Alan Scheller-Wolf Computer Science Department, Carnegie Mellon University, 5000 Forbes Avenue,

More information

Dynamic control of a tandem system with abandonments

Dynamic control of a tandem system with abandonments Dynamic control of a tandem system with abandonments Gabriel Zayas-Cabán 1, Jingui Xie 2, Linda V. Green 3, and Mark E. Lewis 4 1 Center for Healthcare Engineering and Patient Safety University of Michigan

More information

Queueing Systems with Customer Abandonments and Retrials. Song Deng

Queueing Systems with Customer Abandonments and Retrials. Song Deng Queueing Systems with Customer Abandonments and Retrials by Song Deng A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Engineering - Industrial

More information

Classification of Queuing Models

Classification of Queuing Models Classification of Queuing Models Generally Queuing models may be completely specified in the following symbol form:(a/b/c):(d/e)where a = Probability law for the arrival(or inter arrival)time, b = Probability

More information

Markov Processes and Queues

Markov Processes and Queues MIT 2.853/2.854 Introduction to Manufacturing Systems Markov Processes and Queues Stanley B. Gershwin Laboratory for Manufacturing and Productivity Massachusetts Institute of Technology Markov Processes

More information

A Fluid Approximation for Service Systems Responding to Unexpected Overloads

A Fluid Approximation for Service Systems Responding to Unexpected Overloads OPERATIONS RESEARCH Vol. 59, No. 5, September October 2011, pp. 1159 1170 issn 0030-364X eissn 1526-5463 11 5905 1159 http://dx.doi.org/10.1287/opre.1110.0985 2011 INFORMS A Fluid Approximation for Service

More information

Erlang-C = M/M/N. agents. queue ACD. arrivals. busy ACD. queue. abandonment BACK FRONT. lost calls. arrivals. lost calls

Erlang-C = M/M/N. agents. queue ACD. arrivals. busy ACD. queue. abandonment BACK FRONT. lost calls. arrivals. lost calls Erlang-C = M/M/N agents arrivals ACD queue Erlang-A lost calls FRONT BACK arrivals busy ACD queue abandonment lost calls Erlang-C = M/M/N agents arrivals ACD queue Rough Performance Analysis

More information

Tales of Time Scales. Ward Whitt AT&T Labs Research Florham Park, NJ

Tales of Time Scales. Ward Whitt AT&T Labs Research Florham Park, NJ Tales of Time Scales Ward Whitt AT&T Labs Research Florham Park, NJ New Book Stochastic-Process Limits An Introduction to Stochastic-Process Limits and Their Application to Queues Springer 2001 I won t

More information

Stochastic process. X, a series of random variables indexed by t

Stochastic process. X, a series of random variables indexed by t Stochastic process X, a series of random variables indexed by t X={X(t), t 0} is a continuous time stochastic process X={X(t), t=0,1, } is a discrete time stochastic process X(t) is the state at time t,

More information

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems  M/M/1  M/M/m  M/M/1/K Queueing Theory I Summary Little s Law Queueing System Notation Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K " Little s Law a(t): the process that counts the number of arrivals

More information

The Offered-Load Process: Modeling, Inference and Applications. Research Thesis

The Offered-Load Process: Modeling, Inference and Applications. Research Thesis The Offered-Load Process: Modeling, Inference and Applications Research Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Statistics Michael Reich Submitted

More information

Introduction to Queuing Networks Solutions to Problem Sheet 3

Introduction to Queuing Networks Solutions to Problem Sheet 3 Introduction to Queuing Networks Solutions to Problem Sheet 3 1. (a) The state space is the whole numbers {, 1, 2,...}. The transition rates are q i,i+1 λ for all i and q i, for all i 1 since, when a bus

More information

Chapter 11. Output Analysis for a Single Model Prof. Dr. Mesut Güneş Ch. 11 Output Analysis for a Single Model

Chapter 11. Output Analysis for a Single Model Prof. Dr. Mesut Güneş Ch. 11 Output Analysis for a Single Model Chapter Output Analysis for a Single Model. Contents Types of Simulation Stochastic Nature of Output Data Measures of Performance Output Analysis for Terminating Simulations Output Analysis for Steady-state

More information

Staffing Tandem Queues with Impatient Customers Application in Financial Service Operations

Staffing Tandem Queues with Impatient Customers Application in Financial Service Operations Staffing Tandem Queues with Impatient Customers Application in Financial Service Operations Jianfu Wang, Hossein Abouee-Mehrizi 2, Opher Baron 3, Oded Berman 3 : Nanyang Business School, Nanyang Technological

More information

Design and evaluation of overloaded service systems with skill based routing, under FCFS policies

Design and evaluation of overloaded service systems with skill based routing, under FCFS policies Design and evaluation of overloaded service systems with skill based routing, under FCFS policies Ivo Adan Marko Boon Gideon Weiss April 2, 2013 Abstract We study an overloaded service system with servers

More information

Modelling and Design of Service-Time; Phase-Type Distributions.

Modelling and Design of Service-Time; Phase-Type Distributions. Service Engineering June 97 Last Revised: April 8, 004 Modelling and Design of Service-Time; Phase-Type Distributions. References. Issaev, Eva,: Fitting phase-type distributions to data from a call center,

More information

A Queueing System with Queue Length Dependent Service Times, with Applications to Cell Discarding in ATM Networks

A Queueing System with Queue Length Dependent Service Times, with Applications to Cell Discarding in ATM Networks A Queueing System with Queue Length Dependent Service Times, with Applications to Cell Discarding in ATM Networks by Doo Il Choi, Charles Knessl and Charles Tier University of Illinois at Chicago 85 South

More information

EE 368. Weeks 3 (Notes)

EE 368. Weeks 3 (Notes) EE 368 Weeks 3 (Notes) 1 State of a Queuing System State: Set of parameters that describe the condition of the system at a point in time. Why do we need it? Average size of Queue Average waiting time How

More information

Erlang-R: A Time-Varying Queue with Reentrant Customers, in Support of Healthcare Staffing

Erlang-R: A Time-Varying Queue with Reentrant Customers, in Support of Healthcare Staffing Submitted to Manufacturing & Service Operations Management manuscript (Please, provide the mansucript number! Authors are encouraged to submit new papers to INFORMS journals by means of a style file template,

More information