immediately, without knowledge of the jobs that arrive later The jobs cannot be preempted, ie, once a job is scheduled (assigned to a machine), it can

Similar documents
bound of (1 + p 37)=6 1: Finally, we present a randomized non-preemptive 8 -competitive algorithm for m = 2 7 machines and prove that this is op

On-line Bin-Stretching. Yossi Azar y Oded Regev z. Abstract. We are given a sequence of items that can be packed into m unit size bins.

S. ABERS Vohra [3] then gave an algorithm that is.986-competitive, for all m 70. Karger, Phillips and Torng [] generalized the algorithm and proved a

Preemptive Online Scheduling: Optimal Algorithms for All Speeds

A lower bound on deterministic online algorithms for scheduling on related machines without preemption

Lower bounds for online makespan minimization on a small

Optimal Online Scheduling. Shang-Hua Teng x. December Abstract. We study the following general online scheduling problem.

A note on semi-online machine covering

Semi-Online Preemptive Scheduling: One Algorithm for All Variants

Minimizing Average Completion Time in the. Presence of Release Dates. September 4, Abstract

A lower bound for scheduling of unit jobs with immediate decision on parallel machines

On-line Scheduling to Minimize Max Flow Time: An Optimal Preemptive Algorithm

More Approximation Algorithms

Semi-Online Multiprocessor Scheduling with Given Total Processing Time

Online interval scheduling on uniformly related machines

Semi-Online Preemptive Scheduling: One Algorithm for All Variants

Partition is reducible to P2 C max. c. P2 Pj = 1, prec Cmax is solvable in polynomial time. P Pj = 1, prec Cmax is NP-hard

Introduction Consider a set of jobs that are created in an on-line fashion and should be assigned to disks. Each job has a weight which is the frequen

A Better Algorithm For an Ancient Scheduling Problem. David R. Karger Steven J. Phillips Eric Torng. Department of Computer Science

Online Coloring of Intervals with Bandwidth

Chapter 1. Comparison-Sorting and Selecting in. Totally Monotone Matrices. totally monotone matrices can be found in [4], [5], [9],

SUM x. 2x y x. x y x/2. (i)

Open Problems in Throughput Scheduling

2 Martin Skutella modeled by machine-dependent release dates r i 0 which denote the earliest point in time when ob may be processed on machine i. Toge

ABSTRACT We consider the classical problem of online preemptive job scheduling on uniprocessor and multiprocessor machines. For a given job, we measur

Colored Bin Packing: Online Algorithms and Lower Bounds

showed that the SMAT algorithm generates shelf based schedules with an approximation factor of 8.53 [10]. Turek et al. [14] proved that a generalizati

A new analysis of Best Fit bin packing

SPT is Optimally Competitive for Uniprocessor Flow

First Fit bin packing: A tight analysis

All-norm Approximation Algorithms

2 PAVEL PUDLAK AND JIRI SGALL may lead to a monotone model of computation, which makes it possible to use available lower bounds for monotone models o

A Robust APTAS for the Classical Bin Packing Problem

Speed is as Powerful as Clairvoyance. Pittsburgh, PA Pittsburgh, PA tive ratio for a problem is then. es the input I.

Approximation Schemes for Scheduling on Uniformly Related and Identical Parallel Machines

Scheduling Parallel Jobs with Linear Speedup

On Machine Dependency in Shop Scheduling

Nader H. Bshouty Lisa Higham Jolanta Warpechowska-Gruca. Canada. (

Scheduling Adaptively Parallel Jobs. Bin Song. Submitted to the Department of Electrical Engineering and Computer Science. Master of Science.

ONLINE SCHEDULING OF MALLEABLE PARALLEL JOBS

On-line Scheduling of Two Parallel Machines. with a Single Server

Minimizing Mean Flowtime and Makespan on Master-Slave Systems

Lecture 9 : PPAD and the Complexity of Equilibrium Computation. 1 Complexity Class PPAD. 1.1 What does PPAD mean?

An improved approximation algorithm for two-machine flow shop scheduling with an availability constraint

Rate-monotonic scheduling on uniform multiprocessors

Improved Bounds for Flow Shop Scheduling

Optimal on-line algorithms for single-machine scheduling

The Minimum Reservation Rate Problem in Digital. Audio/Video Systems. Abstract

Polynomial Time Algorithms for Minimum Energy Scheduling

Multiprocessor EDF and Deadline Monotonic Schedulability Analysis

Completion Time Scheduling and the WSRPT Algorithm

ON THE COMPLEXITY OF SOLVING THE GENERALIZED SET PACKING PROBLEM APPROXIMATELY. Nimrod Megiddoy

The problem comes in three avors: 1. Find a deterministic rule for orienting the edges and analyze it on the the worst input sequence.. Suggest a rule

Lower Bounds for Shellsort. April 20, Abstract. We show lower bounds on the worst-case complexity of Shellsort.

Upper and Lower Bounds on the Number of Faults. a System Can Withstand Without Repairs. Cambridge, MA 02139

Multiprocessor jobs, preemptive schedules, and one-competitive online algorithms

On-line Scheduling with Hard Deadlines. Subhash Suri x. St. Louis, MO WUCS December Abstract

Task Models and Scheduling

SIGACT News Online Algorithms Column 26: Bin packing in multiple dimensions

Weighted flow time does not admit O(1)-competitive algorithms

1 Ordinary Load Balancing

k-degenerate Graphs Allan Bickle Date Western Michigan University

Online Scheduling of Parallel Jobs on Two Machines is 2-Competitive

Optimal analysis of Best Fit bin packing

arxiv: v1 [cs.ds] 6 Jun 2018

APTAS for Bin Packing

Formally, a network is modeled as a graph G, each of whose edges e is presumed to fail (disappear) with some probability p e and thus to survive with

ARobustPTASforMachineCoveringand Packing

Complexity analysis of job-shop scheduling with deteriorating jobs

Restricted b-matchings in degree-bounded graphs

the subset partial order Paul Pritchard Technical Report CIT School of Computing and Information Technology

Multiprocessor Scheduling I: Partitioned Scheduling. LS 12, TU Dortmund

Online Algorithms for 1-Space Bounded Multi Dimensional Bin Packing and Hypercube Packing

Online algorithms for parallel job scheduling and strip packing Hurink, J.L.; Paulus, J.J.

Tight Upper Bounds for Semi-Online Scheduling on Two Uniform Machines with Known Optimum

Online Competitive Algorithms for Maximizing Weighted Throughput of Unit Jobs

to provide continuous buered playback ofavariable-rate output schedule. The

The Weighted Majority Algorithm. Nick Littlestone. Manfred K. Warmuth y UCSC-CRL Revised. October 26, 1992.

Stochastic Online Scheduling Revisited

Equalprocessing and equalsetup time cases of scheduling parallel machines with a single server

On Controllability and Normality of Discrete Event. Dynamical Systems. Ratnesh Kumar Vijay Garg Steven I. Marcus

Approximation Schemes for Scheduling on Parallel Machines

The makespan problem of scheduling multi groups of jobs on multi processors at different speeds

Online Interval Coloring and Variants

Abstract. This paper discusses polynomial-time reductions from Hamiltonian Circuit (HC),

Degradable Agreement in the Presence of. Byzantine Faults. Nitin H. Vaidya. Technical Report #

Reproduced without access to the TeX macros. Ad-hoc macro denitions were used instead. ON THE POWER OF TWO-POINTS BASED SAMPLING

Vector Space Basics. 1 Abstract Vector Spaces. 1. (commutativity of vector addition) u + v = v + u. 2. (associativity of vector addition)

number of reversals in instruction (2) is O(h). Since machine N works in time T (jxj)

Counting and Constructing Minimal Spanning Trees. Perrin Wright. Department of Mathematics. Florida State University. Tallahassee, FL

The 2-valued case of makespan minimization with assignment constraints

Optimal blocking of two-level fractional factorial designs

Memorandum COSOR 97-23, 1997, Eindhoven University of Technology

How much can lookahead help in online single machine scheduling

COSC 341: Lecture 25 Coping with NP-hardness (2)

Non-clairvoyant Scheduling for Minimizing Mean Slowdown

Bounding the End-to-End Response Times of Tasks in a Distributed. Real-Time System Using the Direct Synchronization Protocol.

NP-Complete Scheduling Problems*

Sampling Contingency Tables

Definition 2.3. We define addition and multiplication of matrices as follows.

Transcription:

A Lower Bound for Randomized On-Line Multiprocessor Scheduling Jir Sgall Abstract We signicantly improve the previous lower bounds on the performance of randomized algorithms for on-line scheduling jobs on m identical machines We also show that a natural idea for constructing an algorithm with matching performance does not work Keywords combinatorial problems, on-line algorithms, randomization, scheduling, worst case bounds 1 Introduction We study the model for scheduling introduced [7] and studied recently in [6, 1, 8] This model is essentially a modied version of the game of Tetris We have some xed number of columns Rectangles arrive one by one, each of them is one column wide and extends over one or more rows We have to put each rectangle in one of the columns The goal is to minimize the total number of rows that are at least partially used by the rectangles In this scenario the columns represent the machines, rows represent the time steps and the rectangles represent the jobs with a running time corresponding to the height of a rectangle More precisely, we have m machines and a sequence of jobs arriving one by one; there are no dependencies between the jobs When a job arrives, we know its running time, which is the same on all the machines (ie, the machines are identical) We have to assign the job to one of the machines Mathematical Institute, AV CR, Zitna 25, 115 67 Praha 1, Czech Republic, e-mail: sgall@mathcascz Partially supported by grants A119107 and A1019602 of AV CR; part of this work was done at Carnegie-Mellon University, Pittsburgh, PA, USA 1

immediately, without knowledge of the jobs that arrive later The jobs cannot be preempted, ie, once a job is scheduled (assigned to a machine), it cannot be moved to another machine Our goal is to minimize the makespan, the time when the last job is completed In the randomized model we measure the expected makespan The performance of an on-line algorithm is measured by the competitive ratio a randomized algorithm is -competitive (wrt makespan) if for each input the expected makespan of the generated schedule is at most times larger than the optimal makespan In Section 2 we signicantly improve the previously known lower bounds on the competitive ratio of randomized on-line algorithm for this problem This result was reported in [10] It was also proved independently by Chen, van Vliet and Woeginger [3], however our proof is more explicit and therefore it gives more insight for a possible construction of an improved algorithm This insight leads to a natural invariant that should be preserved by any algorithm matching our lower bound An algorithm based on this invariant would be a natural generalization of the optimal algorithm for two processors from [1]; it would also follow the suggestion from [4] Section 3 we demonstrate that it is impossible to preserve this invariant inductively Therefore for a design of such an algorithm we would have to use a stronger condition; at the present time we do not know if this is possible Let us survey the previous related results While the deterministic case has been studied extensively [7, 6, 1, 8], much less was known about the randomized case; all of the following results are by Bartal, Fiat, Karlo, and Vohra [1] Only the case of m = 2 is solved completely; an optimal 4=3-competitive algorithm is known and provably better than any deterministic algorithm For m = 3 a nontrivial lower bound of 1:4 was proved For m > 3, the best known lower bound was the easy 4=3 bound, not even matching the bound for m = 3 For small values of m 3 randomized algorithm with a better competitive ratio than the best deterministic algorithm were discovered very recently by Seiden [9] For large m no randomized algorithm with a better competitive ratio than the best deterministic algorithm is known, which means that we do not know how to make use of randomization at all For a long time the best deterministic algorithm was the (2? 1 )-competitive Graham's al- m gorithm [7] For m = 2 and m = 3, Graham's algorithm is provably optimal For larger m, it was improved several times during the last few years [6, 1, 8] For suciently large m the best known algorithm is 1:945-competitive [8] For m > 3, an algorithm slightly better than Graham's In 2

is presented in [6] The best lower bound on the competitive ratio of the deterministic scheduling algorithm for large m is approximately 1:837 [2]; an earlier bound of 1 + p 1= 2 1:707 is valid for any m > 3 [5] Interestingly, for a related model, deterministic preemptive on-line scheduling on m identical machines, Chen, van Vliet and Woeginger [4] not only proved the same lower bound as we prove for randomized nonpreemptive algorithms, but also gave an algorithm matching this bound 2 An improved lower bound We prove that the competitive ratio of any randomized on-line scheduling algorithm for m machines is at least 1 +1=(( m m?1 )m? 1) This matches the known tight bound of 4=3 for m = 2 and improves the previous bounds for all m > 2 If m is large, this bound approaches 1 + 1=(e? 1) 1:582 Table 1 compares the bounds for a few values number of machines our bound previous lower bound upper bound 2 4=3 1:333 4=3 4=3 3 27=19 1:421 1:4 1:559 4 256=175 1:463 4=3 1:675 5 3125=2101 1:487 4=3 1:741 6 46656=31031 1:504 4=3 1:788 1 1 + 1=(e? 1) 1:582 4=3 1:945 Table 1: Known bounds on the competitive ratio for randomized on-line scheduling of sequential jobs The intuition for our lower bound can be better understood if we look at the algorithm and the lower bound for two machines from [1] Their algorithm guarantees that the ratio of the expected loads on the two machines is 2 : 1 most of the time (unless there is a job with a very long running time), where the load of a machine is dened as the total running time of all jobs assigned to the machine, also called height in [1] Their lower bound essentially shows that any optimal algorithm 3

has to maintain this ratio of expected loads, and that it is not possible to maintain a better ratio The optimal algorithm can balance the loads exactly, hence the competitive ratio is 2=1:5 = 4=3 In the case of m machines we show that the best ratio of loads which might be possible to maintain is 1 : x : x 2 : : : : : x m?1, where x = m=(m? 1) The optimal schedule can balance the loads to be (1 + x + + x m?1 )=m = (x m? 1)=m(x? 1), hence the competitive ratio is at least mx m?1 (x? 1)=(x m? 1), which gives our bound See Figures 1 and 2 for an illustration? height x 2m?1 T height x m+1 T @R height x m T @R - (x m? 1)x m?1 T (x m? 1)T (x m? 1)T time T - x m?1 T - (x m? 1)x m?1 T? m machines Figure 1: An optimal on-line schedule for the sequence of jobs used for the lower bound? m machines Figure 2: An optimal o-line schedule for the sequence of jobs used for the lower bound We rst prove a general lemma which applies to any sequence of jobs For our proof the last m jobs of the sequence are most important: the m dierent input sequences we use for lower bound are the initial segments of the sequence with fewer than m jobs from the end deleted In fact in the particular sequence that we use later, the only purpose of the other jobs is to pad the sequence so that the optimal schedule can always balance the loads exactly Let a sequence of jobs J be given Denote the last m jobs of J by J 1 ; : : : ; J m and their running times by t 1 ; : : : ; t m Let J i be an initial segment of J ending by J i (ie, J m = J ) Let S i be the sum of the running times of all jobs in J i and let T opt (J i ) be the length of an optimal schedule for J i For a given randomized algorithm A, E[T A (J i )] denotes the expected length of the schedule it 4

generates on J i The denition of the competitive ratio implies that for any choice of the sequences of jobs J i we have P m E[T A (J i )] T opt (J i ) T opt (J i ) T opt (J i ) = : The following lemma gives an upper bound on the expectation in the left-hand side Lemma 21 For any randomized on-line algorithm A for scheduling on m machines we have E[T A (J i )] S m Proof Fix a sequence of random bits used by the algorithm A Let T i be the makespan of the schedule generated by the algorithm A for the jobs in J i with the xed random bits Since the algorithm is on-line, the schedule for J i is obtained from the schedule for J i?1 by adding J i to one of the machines Order the machines so that the load of ith machine does not change after J i is scheduled There always exists such an ordering: Designate a machine to be the ith one, if J i is the last job scheduled on it Clearly at most one machine is designated as the ith one If no machine is designated as the jth one for some j, pick one of the remaining machines arbitrarily; note that no job J i is scheduled on these machines This denes an ordering satisfying the condition P m Let L i be the load of ith machine after scheduling all jobs Obviously L i = S m The load of ith machine is L i already after scheduling J i because of our condition on the order of the machines Therefore T i L i and P m T i P m L i = S m for any choice of the random bits By the linearity of expectation we have mx E[T A (J i )] E[ mx T A (J i )] = E[ mx T i ] S m Let x = m=(m? 1) We choose our sequence of jobs so that the following property is satised If all the jobs preceding J i are scheduled so that the ratio of loads is 1 : x : : : : : x m?1, then after scheduling J i on the least loaded machine the ratio of the loads is preserved Let T = (m? 1) 2m?1 The sequence J consists of (1 + + x m?1 )T jobs of running time 1 followed by m jobs with running times t i = (x m? 1)x i?1 T The value of T is chosen so that 2 5

all running times are integers Optimal on-line and o-line schedules are illustrated on Figures 1 and 2 After scheduling the jobs preceding J i so that the ratio is as described above, the loads of the machines are x i?1 T; : : : ; x i+m?2 T For i = 1 this is obvious It is maintained inductively, since by our choice of running times we have x i?1 T + t i = x i+m?1 T, which is exactly the condition we need to preserve the ratio of the loads The next theorem says that no on-line algorithm can do better, even if it is randomized Theorem 22 For any randomized on-line scheduling algorithm for m machines, the competitive ratio m is at least 1 + 1=(( m m?1 )m? 1) Proof First we prove that T opt (J i ) = t i The total running time of all jobs in J i is S i = (1 + + x m?1 )T + (x m? 1)(1 + + x i?1 )T = (x i + + x m?1 )T + x m (1 + + x i?1 )T = x i (1 + + x m?1 )T = x i xm? 1 x? 1 T = x x? 1 t i = mt i : Since running times of all jobs are integers, at most m of them are greater than 1 and t i is the maximal running time, the loads of the m machines can be balanced perfectly and T opt (J i ) = t i (More explicitely, the schedule will use distinct processors for all jobs with running time greater than 1, and distribute the jobs with running time 1 so that the load of each processor is exactly t i ) Using Lemma 21 and S m = x m (1+ +x m?1 )T from the previous computation, the competitive ratio m for any randomized on-line algorithm A is bounded by m = E[T A (J i )] T opt (J i ) S m = t i x m (1 + + x m?1 )T (1 + + x m?1 )(x m? 1)T = xm x m? 1 = 1 + 1 ( m m?1 )m? 1 : 2 6

3 A note on a possible matching upper bound The lower bound from the previous section suggests a natural generalization of the approach used in [1] in the optimal algorithm for two processors After each sequence of jobs we have a certain probability distribution on the possible congurations of the machines, where the conguration consists of the loads of the individual machines We assume that in each conguration the machines are ordered by their load, starting with the smallest one Now we take the average (expected) load of each machine In [1] it is proved for two processors that the ratio of the average loads can be maintained inductively at 1 : 2, as long as there is no large job That means that if the ratio is 1 : 2 and a job arrives, it can be randomly assigned to one of the machines so that the ratio remains unchanged A natural generalization, also suggested in [3], is to maintain the ratio of the average loads at 1 : x : x 2 : : x m?1 where x = m=(m? 1) and m is the number of processors, as long as the jobs are small In fact, the lower bound shows that this has to be the case at least at all times when the jobs in the sequence together with the jobs used in the lower bound can be balanced exactly on m processors Extending the denitions from [1] we say that a job is critical, if its running time is 1=(m? 1) times the sum of the previous running times; any job with longer running time is a large job Thus, if a large job arrives, we know that the optimal schedule cannot balance the jobs exactly, and the optimal length of the schedule is at least its running time For the rest of the paper we consider only sequences with no large jobs (except the rst few ones); large jobs have to be handled separately from the above invariant, similarly as in [1] We now demonstrate that the invariant ratio above cannot be maintained inductively even for sequences with no large jobs The lower bound proof in Section 2 also implies that when a critical job arrives then in any conguration with non-zero probability the dierence of the loads of the rst and the last machine must be at most the running time of the critical job: if the dierence between the loads is larger, the average load of the most loaded machine increases too much, and the invariant is not preserved Now it is easy to construct a concrete counterexample for three machines Consider a sequence of 2 9 19 = 342 jobs of running time 1 and 1 job of running time 9 19 = 171; note that the 7

last job is critical Assume that after scheduling all jobs with running time 1 the loads of the machines are (2 38; 3 38; 4 38) with probability 18=19 and (0; 0; 9 38) with probability 1=19 The average loads are (4 18; 6 18; 9 18), and thus this distribution satises the invariant After the last job with running time 9 19 is scheduled, the average loads should be (4 27; 6 27; 9 27) to preserve the invariant However, after scheduling the last job in the rst conguration, the largest load will be at least 2 38 + 9 19 = 13 19 and in the second conguration it will be at least 9 38 Thus the average is at least 18 1 13 19 + 9 38 = 9 28 > 9 27 and the invariant 19 19 cannot be preserved; moreover, the schedule for this particular instance does not have the desired competitive ratio To make the example more convincing we shall show that the distribution above can be achieved by a sequence where the expected loads satisfy the invariant at each step, except the rst few ones After scheduling 1, 2, 4, or 5 jobs of running time 1, no distribution satises the invariant 1 A straightforward but somewhat tedious calculation gives a legal sequence of distributions on congurations for a sequence of jobs of running time 1 where the invariant is preserved at all other steps, after i jobs are scheduled the probability of the conguration (0; 0; i) is always 1=19, and after 9i jobs (i integer) the probability of the conguration (2i; 3i; 4i) is 18=19; this includes our desired conguration We conclude that to construct an algortihm matching our lower bound, we would have to use a more rened construction, perhaps with a stronger invariant which would account for the possibility above We still conjecture that an algorithm matching the lower bound from Section 2 exists, however, the most intuitive approach does not work References [1] Y Bartal, A Fiat, H Karlo, and R Vohra New algorithms for an ancient scheduling problem J Comput Syst Sci, 51(3):359{366, 1995 1 For 1 and 2 this is obvious, for 4 and 5 jobs an easy computation is required (In contrast, for two machines after the rst job the desired invariant can be maintained at all times) The fact that for 4 or 5 jobs the invariant is not preserved has no direct implication, since due to the small number of jobs, the optimal solution cannot balance them exactly as is needed in the lower bound proof Thus it would be reasonable to assume that the algorithm can treat a rst few jobs in a special way For that reason we need to present the above example, where the invariant cannot be preserved for more substantial reasons 8

[2] Y Bartal, H Karlo, and Y Rabani A new lower bound for m-machine scheduling Inf Process Lett, 50:113{116, 1994 [3] B Chen, A van Vliet, and G J Woeginger A lower bound for randomized on-line scheduling algorithms Inf Process Lett, 51:219{222, 1994 [4] B Chen, A van Vliet, and G J Woeginger An optimal algorithm for preemptive on-line scheduling Oper Res Lett, 18:127{131, 1995 [5] U Faigle, W Kern, and G Turan On the performance of on-line algorithms for partition problems Acta Cybernetica, 9:107{119, 1989 [6] G Galambos and G J Woeginger An on-line scheduling heuristic with better worst case ratio than Graham's list scheduling SIAM J Comput, 22(2):349{355, 1993 [7] R L Graham Bounds for certain multiprocessor anomalies Bell System Technical J, 45:1563{ 1581, Nov 1966 [8] D R Karger, S J Phillips, and E Torng A better algorithm for an ancient scheduling problem In Proc of the 5th Ann ACM-SIAM Symp on Discrete Algorithms, pages 132{140 ACM-SIAM, 1994 [9] S Seiden A randomized algorithm for that ancient scheduling problem Manuscript, 1996 [10] J Sgall On-Line Scheduling on Parallel Machines PhD thesis, Technical Report CMU-CS- 94-144, Carnegie-Mellon University, Pittsburgh, PA, USA, 1994 9