Scheduling: Queues & Computation

Similar documents
Congestion In Large Balanced Fair Links

Maximum pressure policies for stochastic processing networks

Fairness and Optimal Stochastic Control for Heterogeneous Networks

Load Balancing in Distributed Service System: A Survey

Dynamic Power Allocation and Routing for Time Varying Wireless Networks

Yuan Zhong. A u th o r... Sloan School of Mandement August 17, 2012 Certified by.../ 'D1evavrat Shah

Dynamic resource sharing

Optimal scaling of average queue sizes in an input-queued switch: an open problem

Concave switching in single-hop and multihop networks

Fair Scheduling in Input-Queued Switches under Inadmissible Traffic

Full terms and conditions of use:

Maximum Pressure Policies in Stochastic Processing Networks

On the Resource/Performance Tradeoff in Large Scale Queueing Systems

On the stability of flow-aware CSMA

Information in Aloha Networks

arxiv: v1 [cs.sy] 10 Apr 2014

1 Markov decision processes

On Optimal Routing in Overloaded Parallel Queues

Linear Model Predictive Control for Queueing Networks in Manufacturing and Road Traffic

Utility-Maximizing Scheduling for Stochastic Processing Networks

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads

Understanding the Capacity Region of the Greedy Maximal Scheduling Algorithm in Multi-hop Wireless Networks

Imperfect Randomized Algorithms for the Optimal Control of Wireless Networks

Large number of queues in tandem: Scaling properties under back-pressure algorithm

Congestion Control In The Internet Part 1: Theory. JY Le Boudec 2018

Queueing Networks and Insensitivity

Intro Refresher Reversibility Open networks Closed networks Multiclass networks Other networks. Queuing Networks. Florence Perronnin

Min Congestion Control for High- Speed Heterogeneous Networks. JetMax: Scalable Max-Min

Stability and Asymptotic Optimality of h-maxweight Policies

Convexity Properties of Loss and Overflow Functions

A Starvation-free Algorithm For Achieving 100% Throughput in an Input- Queued Switch

Simplification of Network Dynamics in Large Systems

Utility, Fairness and Rate Allocation

Scheduling for the tail: Robustness versus Optimality

Mean-Field Analysis of Coding Versus Replication in Large Data Storage Systems

Distributed Random Access Algorithm: Scheduling and Congesion Control

A Study on Performance Analysis of Queuing System with Multiple Heterogeneous Servers

Understanding the Capacity Region of the Greedy Maximal Scheduling Algorithm in Multi-hop Wireless Networks

On the Throughput-Optimality of CSMA Policies in Multihop Wireless Networks

Node-based Distributed Optimal Control of Wireless Networks

A Queueing System with Queue Length Dependent Service Times, with Applications to Cell Discarding in ATM Networks

Energy Optimal Control for Time Varying Wireless Networks. Michael J. Neely University of Southern California

Dynamic Matching Models

Performance Evaluation of Queuing Systems

Continuous-Model Communication Complexity with Application in Distributed Resource Allocation in Wireless Ad hoc Networks

A STAFFING ALGORITHM FOR CALL CENTERS WITH SKILL-BASED ROUTING: SUPPLEMENTARY MATERIAL

Linear Programming. Scheduling problems

Designing load balancing and admission control policies: lessons from NDS regime

Chapter 11. Approximation Algorithms. Slides by Kevin Wayne Pearson-Addison Wesley. All rights reserved.

Poly-Symmetry in Processor-Sharing Systems

Input-queued switches: Scheduling algorithms for a crossbar switch. EE 384X Packet Switch Architectures 1

Efficient Nonlinear Optimizations of Queuing Systems

TRANSMISSION STRATEGIES FOR SINGLE-DESTINATION WIRELESS NETWORKS

OPTIMAL CONTROL OF A FLEXIBLE SERVER

UNIVERSITY OF YORK. MSc Examinations 2004 MATHEMATICS Networks. Time Allowed: 3 hours.

Quality of Real-Time Streaming in Wireless Cellular Networks : Stochastic Modeling and Analysis

Tail Inequalities Randomized Algorithms. Sariel Har-Peled. December 20, 2002

Scheduling Coflows in Datacenter Networks: Improved Bound for Total Weighted Completion Time

On Two Class-Constrained Versions of the Multiple Knapsack Problem

OPEN MULTICLASS HL QUEUEING NETWORKS: PROGRESS AND SURPRISES OF THE PAST 15 YEARS. w 1. v 2. v 3. Ruth J. Williams University of California, San Diego

The Impact of Queue Length Information on Buffer Overflow in Parallel Queues

Guaranteeing Solution Quality for SAS Optimization Problems by being Greedy

COT 6936: Topics in Algorithms! Giri Narasimhan. ECS 254A / EC 2443; Phone: x3748

Collaboration and Multitasking in Networks: Architectures, Bottlenecks and Capacity

Introduction to Markov Chains, Queuing Theory, and Network Performance

Efficiency and Braess Paradox under Pricing

arxiv:math/ v1 [math.pr] 5 Jul 2004

On the Pathwise Optimal Bernoulli Routing Policy for Homogeneous Parallel Servers

Gideon Weiss University of Haifa. Joint work with students: Anat Kopzon Yoni Nazarathy. Stanford University, MSE, February, 2009

Wireless Scheduling. Mung Chiang Electrical Engineering, Princeton University. WiOpt Seoul, Korea June 23, 2009

Queueing Theory I Summary! Little s Law! Queueing System Notation! Stationary Analysis of Elementary Queueing Systems " M/M/1 " M/M/m " M/M/1/K "

Low-complexity Scheduling Algorithms for Multi-channel Downlink Wireless Networks

Delay Stability of Back-Pressure Policies in the presence of Heavy-Tailed Traffic

CS 798: Homework Assignment 3 (Queueing Theory)

Performance Analysis of Priority Queueing Schemes in Internet Routers

Stability and Rare Events in Stochastic Models Sergey Foss Heriot-Watt University, Edinburgh and Institute of Mathematics, Novosibirsk

On the Flow-level Dynamics of a Packet-switched Network

The Multi-Path Utility Maximization Problem

CHAPTER 4. Networks of queues. 1. Open networks Suppose that we have a network of queues as given in Figure 4.1. Arrivals

Routing. Topics: 6.976/ESD.937 1

A Generalized Processor Sharing Approach to Flow Control in Integrated Services Networks: The Single Node Case. 1

Computer Networks Fairness

On the Design of Efficient CSMA Algorithms for Wireless Networks

Stochastic Network Calculus

- Well-characterized problems, min-max relations, approximate certificates. - LP problems in the standard form, primal and dual linear programs

Chapter 11. Approximation Algorithms. Slides by Kevin Wayne Pearson-Addison Wesley. All rights reserved.

Flow-level performance of wireless data networks

How to deal with uncertainties and dynamicity?

Information Theory vs. Queueing Theory for Resource Allocation in Multiple Access Channels

Improved Algorithms for Machine Allocation in Manufacturing Systems

Genetic Algorithm. Outline

arxiv: v2 [math.pr] 17 Jun 2015 October 3, 2018

Optimization of Submodular Functions Tutorial - lecture I

Non Markovian Queues (contd.)

Tales of Time Scales. Ward Whitt AT&T Labs Research Florham Park, NJ

AN ALGORITHM TO COMPUTE BLOCKING PROBABILITIES IN MULTI-RATE MULTI-CLASS MULTI-RESOURCE LOSS MODELS. AT&T Bell Laboratories.

STABILITY OF MULTICLASS QUEUEING NETWORKS UNDER LONGEST-QUEUE AND LONGEST-DOMINATING-QUEUE SCHEDULING

Near-Optimal Control of Queueing Systems via Approximate One-Step Policy Improvement

Lecture 7: Simulation of Markov Processes. Pasi Lassila Department of Communications and Networking

Transcription:

Scheduling: Queues Computation achieving baseline performance efficiently Devavrat Shah LIDS, MIT

Outline Two models switched network and bandwidth sharing Scheduling: desirable performance queue-size scaling and computation Both models are equivalent from perspective of desirable scheduling performance Performance of known scheduling algorithms Discussion

Switch 11 Q 11 Input 1 12 21 Q 12 Q 21 Input 2 22 Q 22 Output 1 Output 2 Unit sized packets arrive as per independent Poisson processes

Switch: scheduling constraints Input 1 Input 2! Invalid 0 1 " 0 1 $ % Output 1 Output 2 Each integral time instance each input can send at most one packet each output can receive at most one packet

Switch: scheduling constraints Input 1 Input 2 Valid! " 1 0 0 1 $ % Output 1 Output 2 Each integral time instance each input can send at most one packet each output can receive at most one packet

Switch: resource capacity The set of feasible rates convex combination of possible schedules In a 2-port switch, =[ ij] is feasible iff [ ] 11 12 21 22 [ ] 1 apple + 0 0 0 1 1 [ ] 1 0 s.t. + apple 1,, 0. Load of : the smallest possible + satisfying above

Switch: resource capacity The set of feasible rates convex combination of possible schedules In a 2-port switch, =[ ij] is feasible iff 11 + 12 apple 1 11 + 21 apple 1 21 + 22 apple 1 12 + 22 apple 1 Load of : maximum of the above four terms

Switched network 1 Q 1 Q N 1 S N 1 N Q N Scheduling constraints each time choose schedule 2 S S Z N 0 is a finite monotone set if and then 2 S 0 2 Z N 0, 0 apple 0 2 S

Switched network Feasibility region convex hull of S can be represented as 2 R N 0 : A apple b, A 2 R M N 0, b 2 R M >0 given a feasible, it s load is defined as ( )=min{ apple 1:A apple b}

Maximum weight policy [Tassiulas-Ephremides 92] Current queue-sizes Q =[Q i ] Choose schedule 2 S so that 2 arg max 2S X i iq i

Max weight: example Q 11 Input 1 Q 12 Q 21 Input 2! " 1 0 0 1 $ % Q 22 Output 1 Output 2

Bandwidth sharing network [Roberts-Massoulie 00] 1 2 3 4 Each link has unit capacity Jobs arrive as per independent Poisson processes job sizes have exponential distribution (unit mean)

Bandwidth sharing network 1 x 1 + x 2 apple 1 x 1 + x 3 apple 1 2 3 4 x 3 + x 4 apple 1 x 2 + x 4 apple 1 Rate allocation x =[x i ] such that link capacity constraints are satisfied

Bandwidth sharing network The set of feasible rates that satisfy all link capacity constraints In our example, =[ i] is feasible iff 1 + 2 apple 1 1 + 3 apple 1 3 + 4 apple 1 2 + 4 apple 1 Load of : maximum of the above four terms

Bandwidth sharing network N routes/flows M links capacity of link i is Routing matrix: A 2 R M N 0 b i, 1 apple i apple M A ij : resource on link i consumed by unit flow on route j Rate or bandwidth allocation x 0, Ax apple b

Bandwidth sharing network Feasibility region 2 R N 0 : A apple b, A 2 R M N 0, b 2 R M >0 given a feasible, it s load is defined as ( )=min{ apple 1:A apple b}

Proportional fair policy [Kelly 97, Kelly-Maullo-Tan 97] Current queue-sizes Q =[Q i ] Choose rate allocation x =[x i ] s.t. maximize X i Q i log x i Ax apple b, x 0 x i =0ifQ i =0

Proportional fair: example x 1 + x 2 apple 1 x 1 + x 3 apple 1 x 3 + x 4 apple 1 x 2 + x 4 apple 1 Q i =1, 8 i ) x i =1/2, 8i

Queue-size Scheduling or allocation policy determines queue-sizes we ll restrict to policies with steady-state distribution for all with ( ) < 1 We ll focus on average total queue-size: E X tail probability of total queue-size: i Q i 1 t log P X i Q i >t for large t

Desired Queue-size scaling Baseline performance cf. [Harrison et al 15] average total queue-size: h X E i Q i i apple M (1 ) tail probability of total queue-size: X P Q i t exp (1 )t, for 1, large t. i

Desired Queue-size scaling Baseline performance cf. [Harrison et al 15] average total queue-size: lim sup N!1,!1 sup (1 X )E : ( )= i Q i apple M. tail probability of total queue-size: lim sup N!1,!1 sup : ( )= lim sup t 1 t(1 ) log P X i Q i >t apple const. Note: order of limits! 1 followed by N!1

Desired Queue-size scaling Baseline performance cf. [Harrison et al 15] average total queue-size: N=n and M=2n 2 for switch: (1 h X )E i Q i i apple 2n tail probability of total queue-size: behaves like an M/D/1 queue for any system size!

Computation Scheduling or allocation policy finds allocation or schedule each time instance Tractable computation to find a schedule with queue-sizes Q =[Q i ] total computation should be polynomial in M, N and log kqk 1

Computation Tractable computation to find a schedule with queue-sizes Q =[Q i ] total computation should be polynomial in M, N and log kqk 1 Can not expect exponential better dependence on M while retaining good queue-size scaling [Shah-Tse-Tsitsiklis 12] With baseline performance: tractable computation on average, polynomial in M, N and log(1/(1 ))

Question Does there exist scheduling/allocation policy Baseline performance and tractable computation For any switched and/or bandwidth sharing network How are these networks related? How well do max weight and prop. fair perform? Consider other policy?

Switched and bandwidth sharing networks Feasibility region of a switched network convex hull of S can be represented 2 R N 0 : A apple b, A 2 R M N 0, b 2 R M >0 Corresponding relaxed bandwidth-sharing network N routes/flows M links capacity of link i is b i, 1 apple i apple M Rate allocation from convex hull of S

Relaxation of switch Input 1 Input 2 Output Output

Switched network vs it s relaxation Both are single-hop networks Both have same number of queues with identical arrival processes Only difference is in scheduling constraints switched network operates in discrete time can choose one of finitely many schedules bandwidth sharing network operates in continuous time can choose any of the convex combination of schedules as feasible rate allocation

Switched network vs it s relaxation Any policy in a switched network is a valid policy for it s relaxation but not vice versa Any policy for bandwidth sharing network with baseline performance leads to a policy for corresponding switched network with baseline performance [Shah-Walton-Zhong 12, 14]

Explanation via an example! " Time 1 1 2 1 2 1 2 1 2 $ %! " 1 2 1 2 1 2 1 2 $ % Lag! " 0 0 0 0 $ %

Explanation via an example! " Time 2 1 4 3 4 3 4 1 4 $ %! " 1 2 1 2 1 2 1 2 $! + % " 1 4 3 4 3 4 1 4 $ %

Explanation via an example! " Time 2 1 4 3 4 3 4 1 4 $ %! " 3 4 5 4 >1 5 4 >1 3 4 $ " $ 0 1 % 1 0 % '! " 0 1 1 0 $ %

Explanation via an example! " Time 2 1 4 3 4 3 4 1 4 $ %! " 3 4 1 4 1 4 3 4 $ % Lag! " 0 1 1 0 $ %

Explanation via an example Time 3, etc Lag! " 1 2 1 2 1 2 1 2 $ = 1! 2 % " No matching 1 0 0 1 $ + 1! % 2 " 0 1 1 0 $ %

Explanation via an example Time 3, etc Lag! " 3 4 5 4 5 4 3 4! pick " $ = 3! 4 % " 0 1 1 0 $ % 1 0 0 1 $ + 5! % 4" 0 1 1 0 $ %

Explanation via an example Time 3, etc Lag! " 7 4 5 4 5 4 7 4! pick " $ = 7! 4 % " 1 0 0 1 $ % 1 0 0 1 $ + 5! % 4" 0 1 1 0 $ %

Explanation via an example Time 3, etc Lemma. Total lag is at most n 2. That is, difference in total q-size is at most n 3.

So do we have such a policy? Baseline Performance Tractable Computation Max Weight [Tassiulas-Ephremides 92] Not Clear (may be: Srikant s Talk) Yes Proportional Fair [Kelly 97, KMT 97] Under Local Traffic Condition [Kang-Kelly-Lee-Williams 08] Yes Store-and-Forward [Massoulie, Bonald-Proutiere 03] Yes [Zachary 08] Not Clear

Maximum weight policy Computation using Ellipsoid algorithm can evaluate membership in the convex hull of S using computations polynomial in M, N by Grotchel-Lovasz-Schrijver 79 can solve for maximum weight solution using computation polynomial in M, N and log Q

Maximum weight policy Queue-size scaling Heavy traffic approximation in hope of obtaining crisp characterization Stolyar 04, Dai-Lin 07, Shah-Wischik 06 12, Kang-Williams 06 Recent result by Maguluri and Srikant (talk coming up) For switch, under constant fraction of nodes saturated lim sup N!1,!1 sup (1 X )E : ( )= ij Beyond this special case, it remains open Q ij apple 2n.

Maximum weight policy Tail probability [Stolyar 08][Venkataramanan-Lin 10] established an implicit result max-weight policy optimizes the decay-rate lim t!1 1 t log P X i Q 2 i t = ( ). [Shah-Walton-Zhong 12] characterizes this for a class of networks But not known for total of queue-sizes

Proportional fair policy Average queue-size [Kang-Kelly-Lee-Williams 08] (nearly) achieves base-line performance in heavy traffic limit associate independent exponential variable to each link with appropriate parameter induced by original system queue-size of a route is equal (in dist.) to the (weighted) sum of these link variables on it s route under local traffic condition pretty general, but not satisfied by switch, for example

Store and forward allocation [Massoulie, Bonald-Proutiere 03] An unusual policy for bandwidth sharing network (BSN) derived from an analogy to a multi-class network that has product-form stationary distribution achieves the baseline performance exactly! The insensitivity result holds quite generally [Zachary 08]

Store and forward allocation Analogous multi-class network (MCN) [Massoulie, Bonald-Proutiere 03] each link has a processor sharing queue with it s fixed capacity each route defines a class jobs of a class travel through servers on the route Rate allocation let total jobs in MCN equal to that in BSN allocate rate to route in BSN equal to average departure rate in MCN

Store and forward allocation Product-form stationary distribution for MCN queue associated with links are independent have geometric distribution with parameter given by induced load on the link Insensitivity established by [Zachary 08] For BSN total jobs in BSN equal to (in dist.) total jobs in MCN

Store and forward allocation Queue-size scaling property average total queue-size: lim sup N!1,!1 sup (1 X )E : ( )= i Q i apple M. tail probability of total queue-size: lim sup N!1,!1 sup : ( )= lim sup t 1 t(1 ) log P X i Q i >t apple const.

Store and forward allocation Computation of rate allocation Requires summation of exponentially many terms Just like computation of Permanent has polynomial time approx algorithm [Jerrum-Sinclair-Vigoda 01] Actually, there is a much closer connection between these objects We believe there is a polynomial time approximation algorithm

Summary Desirable scheduling algorithm Baseline performance Tractable computation Exists for switched network iff it does so for bandwidth sharing

Summary Candidates for desirable scheduling algorithm Maximum weight believable, but far for able to establish Proportional fair allocation it achieves for all networks with local traffic condition Store-and-forward computation is unresolved

Summary Baseline performance need is not the best achievable 1. Hierarchical Greedy Performance suggested by [Harrison et al 15] UFOS policy does better than baseline for few examples 2. Independent set model for switched network max weight induced avg queue size O(N 2 ) but M ought be super-polynomial in N therefore baseline performance can be terribly off