The unbounded single machine parallel batch scheduling problem with family jobs and release dates to minimize makespan

Similar documents
Scheduling jobs with agreeable processing times and due dates on a single batch processing machine

Multi-agent scheduling on a single machine to minimize total weighted number of tardy jobs

Scheduling in an Assembly-Type Production Chain with Batch Transfer

An FPTAS for parallel-machine scheduling under a grade of service provision to minimize makespan

Complexity analysis of job-shop scheduling with deteriorating jobs

A polynomial-time approximation scheme for the two-machine flow shop scheduling problem with an availability constraint

Batch delivery scheduling with simple linear deterioration on a single machine 1

Batching Identical Jobs

Scheduling Linear Deteriorating Jobs with an Availability Constraint on a Single Machine 1

Single Machine Scheduling with a Non-renewable Financial Resource

Parallel machine scheduling with batch delivery costs

University of Twente. Faculty of Mathematical Sciences. Scheduling split-jobs on parallel machines. University for Technical and Social Sciences

Polynomially solvable and NP-hard special cases for scheduling with heads and tails

Scheduling Parallel Jobs with Linear Speedup

Scheduling linear deteriorating jobs with an availability constraint on a single machine

Research Article Batch Scheduling on Two-Machine Flowshop with Machine-Dependent Setup Times

Scheduling chains of operations on a batching machine with disjoint sets of operation compatibility

Complexity and Algorithms for Two-Stage Flexible Flowshop Scheduling with Availability Constraints

This means that we can assume each list ) is

Lecture 4 Scheduling 1

Minimizing Total Weighted Completion Time on Uniform Machines with Unbounded Batch

A note on proving the strong NP-hardness of some scheduling problems with start time dependent job processing times

Common-Deadline Lazy Bureaucrat Scheduling Problems

Single machine batch scheduling with release times

Complexity of flow shop scheduling problems with transportation constraints

An analysis of the LPT algorithm for the max min and the min ratio partition problems

Minimizing Mean Flowtime and Makespan on Master-Slave Systems

Optimal on-line algorithms for single-machine scheduling

Polynomial time solutions for scheduling problems on a proportionate flowshop with two competing agents

Machine Minimization for Scheduling Jobs with Interval Constraints

Machine Scheduling with Deliveries to Multiple Customer Locations

Lecture 2: Scheduling on Parallel Machines

Approximation complexity of min-max (regret) versions of shortest path, spanning tree, and knapsack

Improved Bounds for Flow Shop Scheduling

Machine scheduling with resource dependent processing times

A Note on the Density of the Multiple Subset Sum Problems

Partition is reducible to P2 C max. c. P2 Pj = 1, prec Cmax is solvable in polynomial time. P Pj = 1, prec Cmax is NP-hard

Scheduling a Bounded Parallel-Batching Machine with Incompatible Job Families and Rejection

Approximation schemes for parallel machine scheduling with non-renewable resources

8 Knapsack Problem 8.1 (Knapsack)

Gearing optimization

Preemptive Scheduling of Independent Jobs on Identical Parallel Machines Subject to Migration Delays

Single machine scheduling with forbidden start times

Ideal preemptive schedules on two processors

STABILITY OF JOHNSON S SCHEDULE WITH LIMITED MACHINE AVAILABILITY

SUPPLY CHAIN SCHEDULING: ASSEMBLY SYSTEMS. Zhi-Long Chen. Nicholas G. Hall

On Machine Dependency in Shop Scheduling

Deciding Emptiness of the Gomory-Chvátal Closure is NP-Complete, Even for a Rational Polyhedron Containing No Integer Point

Partial job order for solving the two-machine flow-shop minimum-length problem with uncertain processing times

Approximation Algorithm of Minimizing Makespan in Parallel Bounded Batch Scheduling

A half-product based approximation scheme for agreeably weighted completion time variance

Throughput Optimization in Single and Dual-Gripper Robotic Cells

Discrete Applied Mathematics. Tighter bounds of the First Fit algorithm for the bin-packing problem

Online Over Time Scheduling on Parallel-Batch Machines: A Survey

HYBRID FLOW-SHOP WITH ADJUSTMENT

HEURISTICS FOR TWO-MACHINE FLOWSHOP SCHEDULING WITH SETUP TIMES AND AN AVAILABILITY CONSTRAINT

A Robust APTAS for the Classical Bin Packing Problem

arxiv: v1 [cs.ds] 15 Sep 2018

Approximation Schemes for Scheduling on Parallel Machines

Single processor scheduling with time restrictions

Algorithms. Outline! Approximation Algorithms. The class APX. The intelligence behind the hardware. ! Based on

1 Ordinary Load Balancing

MINIMIZING TOTAL TARDINESS FOR SINGLE MACHINE SEQUENCING

Scheduling with resource allocation and past-sequence-dependent setup times including maintenance

Complexity analysis of the discrete sequential search problem with group activities

Online algorithms for parallel job scheduling and strip packing Hurink, J.L.; Paulus, J.J.

Marjan van den Akker. Han Hoogeveen Jules van Kempen

Improved Bounds on Relaxations of a Parallel Machine Scheduling Problem

Combinatorial Algorithms for Minimizing the Weighted Sum of Completion Times on a Single Machine

Low-Complexity Algorithms for Sequencing Jobs with a Fixed Number of Job-Classes

M 2 M 3. Robot M (O)

NP-Completeness. NP-Completeness 1

On the Robust Single Machine Scheduling Problem

Fixed interval scheduling problems under uncertainty

The effect of machine availability on the worst-case performance of LPT

A hybrid two-stage transportation and batch scheduling problem

Single Machine Scheduling with Generalized Total Tardiness Objective Function

Heuristics for two-machine flowshop scheduling with setup times and an availability constraint

Single Machine Scheduling with Job-Dependent Machine Deterioration

On a Quadratic Euclidean Problem of Vector Subset Choice: Complexity and Algorithmic Approach

NP-Completeness. f(n) \ n n sec sec sec. n sec 24.3 sec 5.2 mins. 2 n sec 17.9 mins 35.

On-line Scheduling of Two Parallel Machines. with a Single Server

Combinatorial Structure of Single machine rescheduling problem

Using column generation to solve parallel machine scheduling problems with minmax objective functions

Complexity of the min-max and min-max regret assignment problems

The NP-Hardness of the Connected p-median Problem on Bipartite Graphs and Split Graphs

Scheduling on Unrelated Parallel Machines. Approximation Algorithms, V. V. Vazirani Book Chapter 17

An improved approximation algorithm for two-machine flow shop scheduling with an availability constraint

Minimizing the number of tardy jobs for the single machine scheduling problem: MIP-based lower and upper bounds

CS264: Beyond Worst-Case Analysis Lecture #18: Smoothed Complexity and Pseudopolynomial-Time Algorithms

Approximation algorithms for scheduling problems with a modified total weighted tardiness objective

arxiv: v1 [math.oc] 3 Jan 2019

arxiv: v1 [cs.cg] 29 Jun 2012

Tensor Complementarity Problem and Semi-positive Tensors

Scheduling Coflows in Datacenter Networks: Improved Bound for Total Weighted Completion Time

Using column generation to solve parallel machine scheduling problems with minmax objective functions

CS264: Beyond Worst-Case Analysis Lecture #15: Smoothed Complexity and Pseudopolynomial-Time Algorithms

On-line Bin-Stretching. Yossi Azar y Oded Regev z. Abstract. We are given a sequence of items that can be packed into m unit size bins.

The Multiple Traveling Salesman Problem with Time Windows: Bounds for the Minimum Number of Vehicles

An Integer Batch Scheduling Model for a Single Machine with Simultaneous Learning and Deterioration Effects to Minimize Total Actual Flow Time

Transcription:

The unbounded single machine parallel batch scheduling problem with family jobs and release dates to minimize makespan J.J. YUAN 1,, Z.H. Liu 2,, C.T. NG and T.C.E. CHENG 1 Department of Mathematics, Zhengzhou University, Zhengzhou, Henan 450052, People s Republic of China 2 Department of Mathematics, East China University of Science and Technology, Shanghai 20027, People s Republic of China Department of Logistics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, People s Republic of China ABSTRACT In this paper we consider the unbounded single machine parallel batch scheduling problem with family jobs and release dates to minimize makespan. We show that this problem is strongly NP-hard, and give an O n n m +1 m time dynamic programming algorithm and an O(mk k+1 P 2k 1 ) time dynamic programming algorithm, where n is the number of jobs, m is the number of families, k is the number of distinct release dates and P is the sum of the processing times of all families. We further give a heuristic with a performance ratio 2. We also give a polynomial-time approximation scheme (PTAS) for the problem. Keywords: Scheduling; Family; Batching; Makespan 1 Introduction and Problem Formulation Let n jobs J 1,J 2,..., J n and a single machine that can process the jobs concurrently in batches be given. Each job J j has a processing time p j and a release date r j. The jobs are processed in batches, where a batch is a subset of the jobs and we require that the batches form a partition of the set of all jobs. The processing time of a batch is equal to Corresponding author 1

the maximum processing time among the jobs in the batch, i.e., the processing time of a batch B is defined as p B =max{p j : J j B}. Suppose that a batch sequence (which indicates the processing order of a certain batch partition of the jobs) is given and we will process the batched jobs according to this batch sequence. We require that the starting time of a batch is at least the maximum release date of the jobs in it. So, we define the release date of a batch B as r B =max{r j : J j B}. When the objective function considered is regular, we suppose that all the jobs in the same batch start simultaneously at the earliest possible starting time. Consequently, the starting time of each batch is determined by the batch sequence. For a batch B, ifthe starting time of B is s B, then the completion time of B and all the jobs in B is simply s B + p B. Following [] and [9], we call this model the parallel batch scheduling problem and denote it by 1 p-batch; r j f, where f is the objective function to be minimized, and p-batch means that the jobs containedinthesamebatchareprocessedinparallel,i.e.,concurrently,sothatthe processing time of a batch is equal to the maximum processing time among the jobs in the batch. The parallel batch scheduling problem is one of the important modern scheduling models that has received much attention in the literature. The fundamental model of the parallel batch scheduling problem was first introduced by Lee et al. [10] with the restriction that the number of jobs in each batch is bounded by a number b, which is denoted by 1 p-batch; b < n f. This bounded model is motivated by the burn-in operations in semiconductor manufacturing [10]. For example, a batch of integrated circuits (jobs) may be put inside an oven of limited size to test for their thermal standing ability. The circuits are heated inside the oven until all circuits are burned. The burn-in time of the circuits (job processing times) may be different. When a circuit is burned, it has to wait inside the oven until all circuits are burned. Therefore, the processing time of a batch of circuits is the processing time of the longest job in the batch. An extensive discussion of the unbounded version of the parallel batch scheduling problem was provided in [1]. This unbounded model can be applied, for example, to situations where the batch contents are heated in a sufficiently large kiln and so the batch size is not restricted [1]. Recent developments of this topic can be found in [2] and from the web site []. In addition, [4], [6], [7], [11] and [12] presented new complexity results and approximation algorithms for the parallel batch scheduling problem subject to release dates. In this paper we consider a generalization of the unbounded single machine parallel batch scheduling problem. Suppose that we have m families of jobs F 1, F 2,..., F m,which partition the job set {J 1,J 2,..., J n }. Jobs from the same family are processed in batches while jobs from different families cannot be contained in the same batch. We call this problem the unbounded single machine parallel batch scheduling with family jobs. The objective we consider in this paper is to minimize the makespan with release dates. In 2

the sequel, this problem will be denoted by 1 family-jobs; p-batch; r j C max. We show in this paper that the problem 1 family-jobs; p-batch; r j C max is strongly NP-hard. We give an O n n m +1 m time dynamic programming algorithm and an O(mk k+1 P 2k 1 ) time dynamic programming algorithm for the problem, where n is the number of jobs, m is the number of families, k is the number of distinct release dates and P is the sum of the processing times of all families. We further give a heuristic with a performance ratio 2. We also give a polynomial-time approximation scheme (PTAS) for the problem. 2 A Useful Lemma We first give an easy lemma, which will be used in the following sections. Lemma 2.1 For the problem 1 family-jobs; p-batch; r j C max, there is an optimal batch sequence BS =(B 1,B 2,..., B b ) such that if two jobs J i and J j belong to the same family, where J i B x and J j B y with x<y,thenp i >p j. Proof Let BS =(B 1,B 2,..., B b ) be an optimal batch sequence for which the property of Lemma 2.1 does not hold. Then there are two jobs J i and J j that belong to the same family F f such that J i B x and J j B y with x < y,butp i p j. We obtain a new batch sequence BS by shifting the job J i from B x to B y. It is easy to see that C max (BS ) C max (BS), and so BS is optimal, too. Continuing this procedure, we eventually obtain an optimal batch sequence with the required property. 2 Corollary 2.2 There is an optimal batch sequence BS =(B 1,B 2,..., B b )forthe problem 1 family-jobs; p-batch; r j C max such that each batch B x of family F f is in the form B x = {J j F f : l p j u} for some numbers l and u. NP-hardness Proof We need the following strongly NP-complete -Partition problem. -Partition Problem: Given a set of t integers a 1,a 2,..., a t, each of size between B/4 andb/2, such that t i=1 a i = tb, is there a partition of the a i s into t groups of, each summing exactly to B? By Garey and Johnson [8], we have

Lemma.1 The -Partition problem is strongly NP-complete. Theorem.2 The problem 1 family-jobs; p-batch; r j C max is strongly NP-hard. Proof: The decision version of the problem is clearly in NP. To prove its NPcompleteness, we use the strongly NP-complete -Partition problem for our reduction. For a given instance of the -Partition problem with a 1,a 2,..., a t,where 1 t t i=1 a i = B, we construct an instance of the decision version of the problem 1 family-jobs; p-batch; r j C max as follows. Three key numbers are used in the construction: t 2 jobs: J (i,j),, 1 j t; t families F 1, F 2,..., F t,where = t 2 B +1, Z = t B + 1 2 (t2 t)b +1, X = t B + 1 2 (t2 t)(z + B)+1; F i = {J (i,j) :1 j t}, 1 i t; Processing times of the jobs are defined as p (i,j) =(t +1 j)(z + a i ), 1 i t, 2 j t, and p (i,1) = X + a i, 1 i t; Release dates of the jobs are defined as r (i,j) =(j 1)X +(j 1) B, 1 i t, 1 j t; Threshold value of the makespan is defined as Y =tx + t B + 1 2 (t2 t)(z + B). The decision version of the problem 1 family-jobs; P-batch; r j C max asks whether there is a batch sequence BS such that the makespan C max (BS) Y. Clearly, the construction can be done in polynomial time. We show in the sequel that the instance of the -Partition problem has a solution if and only if there is a batch sequence BS for the constructed instance of the scheduling problem such that the makespan C max (BS) Y. Set r (j) =(j 1)X +(j 1) B, 1 j t. Thenr (i,j) = r (j), i.e., r (i,j) is independent of i. We will call J (i,j) the j-th job of family F i. 4

If the -Partition problem has a solution, we can re-lable the indices of a 1,a 2,..., a t such that a i 2 + a i 1 + a i = B, for 1 i t. We construct a batch sequence BS of the scheduling problem as follows. Each family F f,1 f (t 1) is divided into two batches B f and A f such that B f = {J (f,j) :1 j 1 f} and A f = {J (f,j) : 1 f <j t}. Each family F f, f =t 2, t 1, t, actsasabatch. WewriteB f = F f for f = t 2, t 1, t in the following. The batches are processed according to the following order under BS: B 1, B 2, B,..., B i 2, B i 1, B i,..., B t 2, B t 1, B t, A 1, A 2, A,..., A t 5, A t 4, A t. It is not hard to verify that, under the above batch sequence BS, C max (BS) =Y. Hence, the scheduling problem has the required batch sequence. Now suppose that the scheduling problem has a required batch sequence. We need to show that the -Partition problem has a solution. By Lemma 2.1, we have the following claim. Claim 1 There is a required batch sequence BS =(B 1,B 2,...B m ) of the scheduling problem such that (1) for every two jobs J (f,i) and J (f,j) of any family F f with i<j,eitherj (f,i) and J (f,j) are included in the same batch, or J (f,i) is included in a batch with an index smaller than that of the batch that contains J (f,j), i.e., C (f,i) (BS) C (f,j) (BS); (2) the job indices in each batch are consecutive, i.e., if B is a batch of family F f under BS, then for every two jobs J (f,i),j (f,j) B with i<j, {J (f,k) : i k j} B. Let BS =(B 1,B 2,...B m ) be a batch sequence of the scheduling problem that satisfies the properties in Claim 1. We need more properties of BS. For each i, 1 i t, letb i be the batch in BS such that J (i,1) B i.then,b i F i. Let m i = B i.then,byclaim1,wehave B i = {J (i,j) :1 j m i }. For convenience, we re-enumerate the families of jobs such that Claim 2 m i i for 1 i t. m 1 m 2... m t. 5

Suppose to the contrary that m i i +1 for some i, 1 i t. Then the earliest starting time of the batches in {B x :i x t} is at least r (i+1) =ix + i B >ix. Hence, the makespan is estimated as C max (BS) >r (i+1) +(t i +1)X tx + X>Y. This contradicts our assumption. Claim m i = 1 i for 1 i t. In fact, by Claim 2, each family F i with m i (t 1) is splitted into at least two batches such that the sum of the processing times of these batches is at least (X + a i )+ (t m i )(Z + a i ) >X+(t m i )Z. So, the makespan can be estimated as C max (BS) > tx +t 2 Z m i Z. Since C max (BS) Y =tx + t B + 1 2 (t2 t)(z + B), we have i.e., So, tx +t 2 Z m i Z<tX + t B + 1 2 (t2 t)(z + B), m i Z> 2 (t2 + t)z t B 1 2 (t2 t)b = 2 (t2 + t)z Z +1. By the integrality of m i,wededucethat Since we then have or equavilently, m i > 2 (t2 + t) 1+ 1 Z. m i 2 (t2 + t). 2 (t2 + t) = m i 1 i, 1 i, (m i 1 i) 0. By Claim 2, m i 1i, andsom i 1 i 0for each i. This implies that the only possiblity is that m i = 1 i for each i, as required. 6

Claim 4 Each family F i,1 i (t 1), is divided into just two batches under π. Otherwise, some family F k (1 k (t 2)) is divided into at least three batches under π. By Claim, two of these batches have processing times X + a k >Xand (t m k )(Z + a k ) > (t m k )Z, and each of the remaining batches has at least a processing time Z + a k Z. So, the sum of the processing times of the batch F k is greater than Z + X +(t m k )Z. The sum of the processing times of the batches of any other family F i is at least (X + ta i )+(t m i )(Z + a i ) >X+(t m i )Z. Hence,themakespanofBS is estimated as C max (BS) > Z + (X +(t m i )Z) = Z +tx +t 2 Z 1 iz = Z +tx + 2 (t2 t)z > tx + t B + 1 2 (t2 t)(z + B) = Y, a contradiction. Set A i = F i \B i, for 1 i t. By Claim 1(2), Claim and Claim 4, each family F i, 1 i (t 1), is divided into two batches B i and A i under BS, andwehave B i = {J (i,j) :1 j i }, for 1 i t, and A i = {J (i,j) : i +1 j t}, for 1 i (t 1). Now, for 1 k t, writeα k = a k 2 + a k 1 + a k. Then, t k=1 α k = tb. Wenotice the following facts: (1) The common release date of B k 2, B k 1 and B k is r (k) =(k 1)X +(k 1) B. (2)The common release date of A i,1 i (t 1), is r (t) >r (k),1 k t 1. () B k 2 = B k 1 = B k = k for 1 k t; and A k 2 = A k 1 = A k = t k for 1 k t 1. For each k with 1 k t, we consider the batches B k 2, B k 1,..., B t, A 1, A 2,..., A (t 1). Since the minimum release date of these batches is r (k),themakespanc max (BS) isgreater than or equal to the value obtained by summing up r (k) and the processing times of these batches. Now, r (k) =(k 1)X +(k 1) B, and the sum of the processing times of the jobs in these batches is (t k +1)X + α i + i(z + α i ). k i t 7 1 i t

Hence, C max (BS) Xt +(k 1) B + k i t α i + t(t 1)Z + (t i)α i. 2 1 i t By the assumption of C max (BS) Y =tx + t B + t(t 1)Z + 1 t(t 1)B, wededuce 2 2 that ( ) : α i (t k +1)B + (t i)(b α i ). This implies that or equivalently k i t k i t 1 i t α i (t k +1)B + 1, 1 k t, k i t α i (t k +1)B +1 1. By the integrality of α i, we deduce the following t inequalities (I k ), 1 k t: (I k ): k i t By setting k = 1 in the inequality ( ), we obtain This can be rewriten as i.e., 1 i t α i (t k +1)B. (t i)(b α i ) 0, 1 k t. 1 i t 1 k t k i t iα i α i 1 k t 1 i t ib, (t k +1)B. Combining this with the t inequalities I k (1 k t), we deduce that k i t α i =(t k +1)B, for 1 k t. Consequently, α k = B, for 1 k t. Hence, the -Partition problem has a solution. The result follows. 2 Recall the following NP-complete Equal-size 2-Partition problem [8]. Equal-size 2-Partition Given a set of 2t positive integers a 1,a 2,..., a 2t such that 2t i=1 a i =2B, is there a partition of the a i s into 2 groups of t, each summing exactly to B? 8

By using the NP-complete Equal-size 2-Partition problem for the reduction, we can further prove the following two results. Theorem. The problem 1 family-jobs; p-batch; r j C max is NP-hard even when the jobs have at most 2 distinct release dates. Proof Similar to the proof of Theorem.2. 2 4 Algorithms Consider the problem 1 family-jobs; p-batch; r j C max. By Corollary 2.2, if there are two jobs J i and J j in the same family such that r i r j and p i p j, we can delete the job J i from the job system. This procedure requires only O(n) time. Hence, in the sequel, we can suppose that any two jobs J i and J j in the same family have different release dates and different processing times, and furthermore r i <r j implies p i >p j. The algorithms presented in this section include: an O n n m +1 m time dynamic programming algorithm; an O(mk k+1 P 2k 1 ) time dynamic programming algorithm, where n is the number of jobs, m is the number of families, k is the number of distinct release dates and P is the sum of the processing times of all families; a heuristic with a performance ratio 2; and a polynomial time approximation scheme. 4.1 A genaral dynamic programming algorithm Suppose that we have m families F 1, F 2,...,F m, and each family is in the form F i = {J (i,1),j (i,2),..., J (i,ni )}, 1 i m. Suppose further that the jobs are enumerated such that r (i,1) <r (i,2) <... < r (i,ni ), 1 i m, and p (i,1) >p (i,2) >... > r (i,ni ), 1 i m. For a nonnegative integer x, write F (x) i = {J (i,j) :1 j x}, 1 i m. For m integers x 1,x 2,..., x m with 0 x i n i,1 i m, letr(x 1,x 2,..., x m )be the minimum makespan of the problem 1 family-jobs; p-batch; r j C max restricted to the m subfamilies of jobs F (x 1) 1, F (x 2) 2,..., F (x m) m. 9

Consider an optimal batch sequence BS for the problem 1 family-jobs; p-batch; r j C max restricted to the fimilies F (x 1) 1, F (x 2) 2,..., F m (xm) such that BS satisfies the property described in Lemma 2.1. If the last batch B b in BS is a subset of F i and the maximum release date of the jobs in B b is r Bb {r (i,j) :1 j x i },thenwehave R(x 1,x 2,...,x m ) = max{r Bb,R(x 1,..., x i 1,x i B b,x i+1,..., x m )} + P Bb, where r Bb = r (i,xi ) is the release date of the last job in B b and P Bb = p (i,xi B b +1) is the processing time of the first job in B b. Hence, our dynamic programming recursion can be expressed as R(x 1,x 2,..., x m )= min max{r (i,x 1 i m,1 y i x i 1 i ),R(x 1,...,x i 1,y i,x i+1,..., x m )} + p (i,yi +1). The initial condition is given by R(0, 0,...,0) = 0. The dynamic programming function has at most n m (n 1 +1)(n 2 +1)...(n m +1) m +1 states. Each recursion runs only O(n) time,sincewehaveatmosto(n) choices for (i, y i ) with 1 i m and 1 y i n i. Hence, the overall complexity of the above dynamic programming recursion is O n n m +1 m. One interesting corollary of the above discussion is that, when m =1,theproblem becomes the unbouned parallel batch scheduling problem to minimize makespan, i.e., 1 p-batch,r j C max [2, 1], and can be solved in O(n 2 ) time. The complexity of this problem was posed as open in [], but in fact, an O(n 2 ) time algorithm was implied in [11]. 4.2 A pseudopolynomial dynamic programming formulation under fixed number of release dates Let k distinct release dates be given: R 1,R 2,...,R k, satisfying R 1 <R 2 < <R k. Then, the interval [R 1, + ) is divided into k segments [R 1,R 2 ), [R 2,R ),...,[R k,r k+1 ), where R k+1 =+. Let g(a 1,...,a k ) denote the minimum makespan of the schedules for the n jobs, subject to the constraint that the first batch starting in [R i,r i+1 ) (if it exists) starts at time a i (i =1, 2,...,k). Clearly, we may require a 1 = R 1 and R i a i < min{r i+1,r i + P } (2 i k), where P = m f=1 max n f j=1 p (f,j). Then, (a 1,...,a k ) has at most O(P k 1 ) configurations. 10

Now assume that some (a 1,...,a k ) is given. To compute g(a 1,...,a k ), we further introduce h(j; l 1,...,l k )(0 j m) as the minimum makespan to schedule the jobs of families F 1, F 2,...,F j, subject to the constraints that, for i =1, 2,...,k, (i) the first batch starting in [R i,r i+1 ) (if it exists) starts at a i ; (ii) the total processing time of the batches starting in [R i,r i+1 )isl i. Then, g(a 1,...,a k )=min (l1,...,l k ) h(m; l 1,...,l k ), where (l 1,...,l k )satisfies (i) 0 l i P (1 i k); (ii) a i + l i a i+u if l i+1 = l i+2 = = l i+u 1 =0andl i+u =0(1 i k 1, 1 u k i). h(m; l 1,...,l k ) can be computed recursively. Initially, we define h(0; 0,...,0) = a k and for other cases, h(0; l 1,...,l k )=+. Then, for j =1, 2,...,m, h(j; l 1,...,l k )= min {h(j 1; l 1 δ 1,...,l k δ k )+δ k }, (δ 1,...,δ k ) where δ i (i =1, 2,...,k) is the processing time of the batch of F j starting in [R i,r i+1 )and satisfies 0 l i δ i R i+1 a i if δ i = 0. Since there are at most O(n k j ) O(k k )waysto partition the jobs of F j into k sets, i.e., (δ 1,...,δ k ) has at most O(k k )configurations, and each recursion requires O(k k+1 ) time. The size of the domain of h(j; l 1,...,l k )iso(mp k ). Thus, g(a 1,...,a k ) is obtained in O(mk k+1 P k ) time. The globally optimal schedule is obtained by considering all configurations of (a 1,...,a k ), which requires O(mk k+1 P 2k 1 ) time. 4. A heuristic Consider the following heuristic for the problem 1 family-jobs; p-batch; r j C max. Algorithm 4..1 Family Batchng Rule: Each family acts as a batch. First, we re-enumerate the families F 1,..., F m such that r F1 r F2... r Fm. Then, set BS =(F 1,..., F m ). Theorem 4..2 Family Batching Rule is a 2-approximation algorithm for the problem 1 family-jobs; p-batch; r j C max. Proof Let Cmax opt be the minimum makespan for the problem 1 family-jobs; p-batch; r j C max. Two obvious lower bounds for C opt max are C opt max max{r j :1 j n} =max{r Fi :1 i m} 11

and C opt max 1 i m p Fi. Furthermore, an obvious upper bound for C max (BS) is C max (BS) max{r j :1 j n} + 1 i m where BS is the batch sequence obtained by the Family Batching Rule (Algorithm 4..1). It follows that C max (BS) 2C opt max. 2 p Fi, The result of Theorem 4..2 is the best possible. To see this, let ε > 0beanysmallpositive number. We will construct an instance I of the problem 1 family-jobs; p-batch; r j C max such that the batch sequence obtained by the Family Batching Rule on I is not a (2 ε)- approximation solution. We have m families F 1,..., F m,wherem 2. Each family F ε i has two jobs, i.e., F i = {J (i,1),j (i,2) }. The proceesing times of the jobs are defined as The release dates of the jobs are defined as p (i,1) = m and p (i,2) =1, 1 i m. r (i,1) =0andr (i,2) = m 2, 1 i m. One can verify that one of the optimal batch sequences is ({J (1,1) }, {J (1,1) },..., {J (m,1) }, {J (1,2) }, {J (2,2) },..., {J (m,2) }, and the minimum makespan is given by C opt max = m 2 +m. But the batch sequence obtained by the Family Batching Rule is and the makespan of BS is BS =(F 1,..., F m ), C max (BS) =2m 2 > (2 ε)(m 2 + m) =(2 ε)c opt max. Hence, the result of Theorem 4..2 is the best possible. 12

4.4 A polynomial time approximation scheme In this section we will derive a polynomial time approximation scheme for the scheduling problem. First, we give a lemma that allows us to focus on the special case with a constant number of distinct release dates. Lemma 4.4.1 Given a PTAS for the special case with a constant number of distinct release dates, then there exists a PTAS for the general problem. Proof Let > 0begiven. Define r max =max 1 i n r i and δ = r max /2. Note that δ Cmax/2, opt since r max Cmax, opt wherecmax opt denotes the optimal objective value. Round each release date r i down to the nearest multiple of δ, i.e., r i = δr i/δ (i =1, 2,...,n). Clearly, the number of distinct ri is no more than 1+r max/δ =1+2/, which is a constant number for a given. LetCmax denote the optimal objective value for the problem with the scaled release dates ri. Consider a (1+/2)-approximation solution to the problem with the scaled release dates. Add δ to each batch s start time in the solution. Then we get a feasible schedule with respect to the release dates r i,thec max of which is bounded by (1 + /2)Cmax + δ (1 + )Cmax opt, where Cmax Cmax opt is applied. 2 In the following, we present an FPTAS for the special case with a constant number k of distinct release dates. This is done by applying the well-known rounding technique to the dynamic programming formulation in Section 4.2. Given > 0, we define ν = P/(n +1). Let ri = r i/ν, p i = p i/ν (i =1, 2,...,n). Suppose that we have found an optimal schedule and its objective value Cmax for the problem with the scaled parameters ri and p i by the dynamic program in Section 4.2. Increase each processing time p i by p i /ν p i, which increases the objective value by at most n. Now consider 1/ν to be a unit of time. Then we can get a schedule with respect to r i and p i,anditsobjectivevalueisgivenby C max νc max + nν. Also, consider an optimal schedule with respect to r i and p i, the objective value of which is denoted by Cmax. opt Delay the starting of each batch by ν units of time in the schedule, and consider ν to be a unit of time. We obtain a schedule with the objective value C max =1+C opt max/ν. 1

Obviously, C max C max holds. Thus, C max C opt max +(n +1)ν = C opt max + P (1 + )C opt max. The time complexity of the approximation scheme is dominated by the step to solve the problem with the scaled parameters. Let P = m f=1 max n f j=1 p (f,j). Clearly, it holds that P P ν = n +1. Thus, the running time of the approximation scheme is bounded by O mk k+1 P 2k 1 O mk k+1 n +1 2k 1 which is polynomial for given k and 1/. In other words, the approximation scheme is an FPTAS., Acknowledgements This research was supported in part by the Research Grant Council of Hong Kong under grant numbers PolyU B.2.7.Q454 and G-YW59. The first two authors were also supported by the National Natural Science Foundation of China under grant numbers 1071112 and 10101007. References [1] P. Brucker, A. Gladky, H. Hoogeveen, M.Y. Kovalyov, C.N. Potts, T. Tautenhahn and S.L. van de Velde, Scheduling a batching machine, Journal of Scheduling, 1(1998), 1-54. [2] P. Brucker, Scheduling Algorithms, Springer-Verlag, Berlin, 2001. [] P. Brucker and S. Knust, Complexity results for scheduling problems, http://www. mathematik.uni-osnabrueck.de/research/or/class/, 200. [4] T.C.E. Cheng, Z.H. Liu and W.C. Yu, Scheduling jobs with release dates and deadlines on a batch processing machine, IIE Transactions, (2001), 685-690. [5] E.G. Coffman, M. Yannakakis, M.J. Magazine and C. Santos, Batch sizing and sequencing on a single machine, Annals of Operations Research, 26(1990), 15-147. [6] X. Deng and Y.Z. Zhang, Minimizing mean response time for batch processing systems, Lecture Notes in Computer Science, 1627(1999), 21-240. 14

[7] X. Deng, C.K. Poon and Y.Z. Zhang, Approximation algorithms in batch processing, Lecture Notes in Computer Science, 1741(2000), 15-162. [8] M.R. Garey and D.S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, Freeman, San Francisco, CA, 1979. [9] J.K. Lenstra, A.H.G. Rinnooy Kan and P. Brucker, Complexity of machine scheduling problems, Annals of Discrete Mathematics, 1(1977), 4-62. [10] C.-Y. Lee, R. Uzsoy and L.A. Martin-Vega, Efficient algorithms for scheduling semiconductor burn-in operations, Operations Research, 40(1992), 764-775. [11] Z.H. Liu and W.C. Yu, Scheduling one batch processor subject to job release dates, Discrete Applied Mathematics, 105(2000), 129-16. [12] Z.H. Liu, J.J. Yuan and T.C.E. Cheng, On scheduling an unbounded batch machine, Operation Research Letters, 1(200), 42-48. [1] C.N. Potts and M.Y. Kovalyov, Scheduling with batching: a review, European Journal of Operational Research, 120(2000), 228-249. 15