Chapter 11. Approximation Algorithms. Slides by Kevin Wayne. Copyright © 2005 Pearson-Addison Wesley. All rights reserved.

P and NP. P: the family of problems that can be solved in polynomial time. NP: the family of problems for which a proposed solution can be verified in polynomial time. Remark. NP stands for nondeterministic polynomial time.

Certifiers and Certificates: 3-Satisfiability. SAT. Given a CNF formula Φ, is there a satisfying assignment? Certificate. An assignment of truth values to the n boolean variables. Certifier. Check that each clause in Φ has at least one true literal. Ex. Instance s: (¬x_1 ∨ x_2 ∨ x_3) ∧ (x_1 ∨ ¬x_2 ∨ x_3) ∧ (x_1 ∨ x_2 ∨ x_4) ∧ (¬x_1 ∨ ¬x_3 ∨ ¬x_4). Certificate t: x_1 = 1, x_2 = 1, x_3 = 0, x_4 = 1. Conclusion. SAT is in NP.
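To make the certifier concrete, here is a minimal runnable sketch in Python (not part of the original slides; the integer encoding of literals is an assumed convention):

# A clause is a list of literals: +i means x_i, -i means not-x_i.
# An assignment maps each variable index i to True/False.
def certify_sat(clauses, assignment):
    # Poly-time check: every clause must contain at least one true literal.
    return all(any(assignment[abs(lit)] == (lit > 0) for lit in clause)
               for clause in clauses)

# The slide's instance s and certificate t (x1 = 1, x2 = 1, x3 = 0, x4 = 1):
phi = [[-1, 2, 3], [1, -2, 3], [1, 2, 4], [-1, -3, -4]]
t = {1: True, 2: True, 3: False, 4: True}
print(certify_sat(phi, t))   # True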

Certifiers and Certificates: Hamiltonian Cycle. HAM-CYCLE. Given an undirected graph G = (V, E), does there exist a simple cycle C that visits every node? Certificate. A permutation of the n nodes. Certifier. Check that the permutation contains each node in V exactly once, and that there is an edge between each pair of adjacent nodes in the permutation. Conclusion. HAM-CYCLE is in NP. [Figure: instance s and certificate t.]

P, NP, EXP. P. Decision problems for which there is a poly-time algorithm. EXP. Decision problems for which there is an exponential-time algorithm. NP. Decision problems for which there is a poly-time certifier. Claim. P ⊆ NP. Pf. Consider any problem X in P. By definition, there exists a poly-time algorithm A(s) that solves X. Certificate: t = ε; certifier: C(s, t) = A(s). Claim. NP ⊆ EXP. Pf. Consider any problem X in NP. By definition, there exists a poly-time certifier C(s, t) for X. To solve input s, run C(s, t) on all strings t with |t| ≤ p(|s|). Return yes if C(s, t) returns yes for any of these.

The Main Question: P Versus NP. Does P = NP? [Cook 1971, Edmonds, Levin, Yablonski, Gödel] Is the decision problem as easy as the certification problem? Clay $1 million prize. [Figure: if P ≠ NP, then P ⊂ NP ⊂ EXP; if P = NP, then P = NP ⊆ EXP.]

NP-Completeness. Observation. All problems below are NP-complete and polynomial-time reduce to one another! [Figure: tree of reductions rooted at CIRCUIT-SAT (NP-complete by definition of NP-completeness), with 3-SAT below it; 3-SAT reduces to INDEPENDENT SET, DIR-HAM-CYCLE, GRAPH 3-COLOR, and SUBSET-SUM, and these in turn to VERTEX COVER, SET COVER, HAM-CYCLE, TSP, PLANAR 3-COLOR, and SCHEDULING.]

Some NP-Complete Problems. Six basic genres of NP-complete problems and paradigmatic examples. Packing problems: SET-PACKING, INDEPENDENT SET. Covering problems: SET-COVER, VERTEX-COVER. Constraint satisfaction problems: SAT, 3-SAT. Sequencing problems: HAMILTONIAN-CYCLE, TSP. Partitioning problems: 3D-MATCHING, 3-COLOR. Numerical problems: SUBSET-SUM, KNAPSACK. Practice. Most NP problems are either known to be in P or NP-complete. Notable exceptions. Factoring, graph isomorphism, Nash equilibrium.

I can't find an efficient algorithm, I guess I'm just too dumb.

I can't find an efficient algorithm, because no such algorithm is possible.

I can't find an efficient algorithm, but neither can all these famous people.


Approximation Algorithms. Q. Suppose I need to solve an NP-hard problem. What should I do? A. Theory says you're unlikely to find a poly-time algorithm. Must sacrifice one of three desired features: solve the problem to optimality; solve the problem in poly-time; solve arbitrary instances of the problem. ρ-approximation algorithm. Guaranteed to run in poly-time. Guaranteed to solve arbitrary instances of the problem. Guaranteed to find a solution within ratio ρ of the true optimum. Challenge. Need to prove a solution's value is close to optimum, without even knowing what the optimum value is!

11.1 Load Balancing

Load Balancing. Input. m identical machines; n jobs; job j has processing time t_j. Job j must run contiguously on one machine. A machine can process at most one job at a time. Def. Let J(i) be the subset of jobs assigned to machine i. The load of machine i is L_i = Σ_{j ∈ J(i)} t_j. Def. The makespan is the maximum load on any machine, L = max_i L_i. Load balancing. Assign each job to a machine to minimize makespan.

Load Balancing: List Scheduling. List-scheduling algorithm. Consider the n jobs in some fixed order. Assign job j to the machine whose load is smallest so far.

List-Scheduling(m, n, t_1, t_2, ..., t_n) {
   for i = 1 to m {
      L_i ← 0        // load on machine i
      J(i) ← ∅       // jobs assigned to machine i
   }
   for j = 1 to n {
      i = argmin_k L_k       // machine i has smallest load
      J(i) ← J(i) ∪ {j}      // assign job j to machine i
      L_i ← L_i + t_j        // update load of machine i
   }
}

Implementation. O(n log n) using a priority queue.
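For concreteness, a minimal runnable sketch in Python (not part of the original slides), with heapq playing the role of the priority queue:

import heapq

def list_schedule(m, times):
    # Greedy list scheduling: assign each job, in the given order,
    # to the machine with the smallest current load.
    heap = [(0, i) for i in range(m)]       # (load, machine) pairs
    heapq.heapify(heap)
    loads = [0] * m
    assignment = [[] for _ in range(m)]
    for j, t in enumerate(times):
        load, i = heapq.heappop(heap)       # machine with smallest load
        assignment[i].append(j)
        loads[i] = load + t
        heapq.heappush(heap, (loads[i], i))
    return max(loads), assignment           # makespan and per-machine job lists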

Load Balancing: List Scheduling. [Animation: jobs A through J are assigned one at a time, each to the machine (of three) with the smallest current load.]

[Figure: the optimal schedule vs. the list schedule for the same ten jobs A through J on three machines.]

Load Balancing: List Scheduling Analysis. Theorem. [Graham, 1966] Greedy algorithm is a 2-approximation. This was the first worst-case analysis of an approximation algorithm. We need to compare the resulting solution with the optimal makespan L*. Lemma 1. The optimal makespan L* ≥ max_j t_j. Pf. Some machine must process the most time-consuming job. Lemma 2. The optimal makespan L* ≥ (1/m) Σ_j t_j. Pf. The total processing time is Σ_j t_j, and one of the m machines must do at least a 1/m fraction of the total work.

Load Balancing: List Scheduling Analysis. Theorem. Greedy algorithm is a 2-approximation. Pf. Consider the load L_i of the bottleneck machine i. Let j be the last job scheduled on machine i. When job j was assigned to machine i, machine i had the smallest load; its load before the assignment was L_i − t_j, so L_i − t_j ≤ L_k for all 1 ≤ k ≤ m. [Figure: jobs scheduled before j on machine i; the mark at L_i − t_j is the load just before j is assigned, and L = L_i.]

Load Balancing: List Scheduling Analysis. Theorem. Greedy algorithm is a 2-approximation. Pf. (continued) Sum the inequalities L_i − t_j ≤ L_k over all k and divide by m: L_i − t_j ≤ (1/m) Σ_k L_k = (1/m) Σ_k t_k ≤ L* (Lemma 2). Now L_i = (L_i − t_j) + t_j ≤ L* + L* = 2L*, bounding the first term by Lemma 2 as above and the second by Lemma 1 (t_j ≤ max_k t_k ≤ L*).

Load Balancing: List Scheduling Analysis. Q. Is our analysis tight? A. Essentially yes. Ex: m machines, m(m−1) jobs of length 1, plus one job of length m. [Figure: m = 10; machines 2 through 10 are idle at the end while machine 1 finishes the length-m job; list scheduling makespan = 19.]

Load Balancing: List Scheduling Analysis. Q. Is our analysis tight? A. Essentially yes. Ex: m machines, m(m−1) jobs of length 1, plus one job of length m. [Figure: m = 10; optimal makespan = 10.]
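Running the list_schedule sketch from above on this tight instance reproduces the gap (an illustration, not from the slides):

m = 10
jobs = [1] * (m * (m - 1)) + [m]     # m(m-1) unit jobs, then one job of length m
makespan, _ = list_schedule(m, jobs)
print(makespan)                      # 19 = 2m - 1, while the optimal makespan is m = 10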

Load Balancing: LPT Rule. Longest processing time (LPT). Sort the n jobs in descending order of processing time, then run the list-scheduling algorithm.

LPT-List-Scheduling(m, n, t_1, t_2, ..., t_n) {
   Sort jobs so that t_1 ≥ t_2 ≥ ... ≥ t_n
   for i = 1 to m {
      L_i ← 0        // load on machine i
      J(i) ← ∅       // jobs assigned to machine i
   }
   for j = 1 to n {
      i = argmin_k L_k       // machine i has smallest load
      J(i) ← J(i) ∪ {j}      // assign job j to machine i
      L_i ← L_i + t_j        // update load of machine i
   }
}
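In code, LPT is one sorting step on top of the earlier list_schedule sketch (again an assumed helper, not from the slides):

def lpt_schedule(m, times):
    # Sort job indices by descending processing time, then list-schedule.
    order = sorted(range(len(times)), key=lambda j: -times[j])
    makespan, packed = list_schedule(m, [times[j] for j in order])
    # Map positions in the sorted order back to the original job indices.
    return makespan, [[order[p] for p in machine] for machine in packed]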

Load Balancing: LPT Rule. Observation. If there are at most m jobs, then list scheduling is optimal. Pf. Each job is put on its own machine. Lemma 3. If there are more than m jobs, L* ≥ 2 t_{m+1}. Pf. Consider the first m+1 jobs t_1, ..., t_{m+1}. Since the t_i's are in descending order, each takes at least t_{m+1} time. There are m+1 jobs and m machines, so by the pigeonhole principle, at least one machine gets two of them. Theorem. LPT rule is a 3/2-approximation algorithm. Pf. Same basic approach as for list scheduling: L_i = (L_i − t_j) + t_j ≤ L* + ½ L* = (3/2) L*, where t_j ≤ ½ L* follows from Lemma 3 (by the observation, we can assume the number of jobs exceeds m, so the last job j on the bottleneck machine satisfies t_j ≤ t_{m+1} ≤ ½ L*).

Load Balancing: LPT Rule. Q. Is our 3/2 analysis tight? A. No. Theorem. [Graham, 1969] LPT rule is a 4/3-approximation. Pf. More sophisticated analysis of the same algorithm. Q. Is Graham's 4/3 analysis tight? A. Essentially yes. Ex: m machines, n = 2m+1 jobs: two jobs each of length m+1, m+2, ..., 2m−1, and three jobs of length m.

11.2 Center Selection

Center Selection Problem. Input. Set of n sites s_1, ..., s_n. Center selection problem. Select k centers C so that the maximum distance from a site to the nearest center is minimized. [Figure: k = 4 centers covering the sites within radius r(C).]

Center Selection Problem. Input. Set of n sites s_1, ..., s_n. Center selection problem. Select k centers C so that the maximum distance from a site to the nearest center is minimized. Notation. dist(x, y) = distance between x and y. dist(s_i, C) = min_{c ∈ C} dist(s_i, c) = distance from s_i to the closest center. r(C) = max_i dist(s_i, C) = smallest covering radius. Goal. Find a set of centers C that minimizes r(C), subject to |C| = k. Distance function properties: dist(x, x) = 0 (identity); dist(x, y) = dist(y, x) (symmetry); dist(x, y) ≤ dist(x, z) + dist(z, y) (triangle inequality).

Center Selection Example. Ex: each site is a point in the plane, a center can be any point in the plane, and dist(x, y) = Euclidean distance. Remark: the search space can be infinite! [Figure: sites in the plane with centers and covering radius r(C).]

Greedy Algorithm: A False Start. Greedy algorithm. Put the first center at the best possible location for a single center, then keep adding centers so as to reduce the covering radius each time by as much as possible. Remark: arbitrarily bad! [Figure: k = 2; the sites form two clusters, but greedy puts center 1 between them, so the covering radius stays large.]

Center Selection: Greedy Algorithm. Greedy algorithm. Repeatedly choose the next center to be the site farthest from any existing center.

Greedy-Center-Selection(k, n, s_1, s_2, ..., s_n) {
   C ← ∅
   repeat k times {
      Select a site s_i with maximum dist(s_i, C)    // site farthest from any center
      Add s_i to C
   }
   return C
}

Observation. Upon termination, all centers in C are pairwise at least r(C) apart. Pf. By construction of the algorithm.
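A short Python sketch of this greedy rule (an illustration, not from the slides; sites are assumed to be points, with math.dist as the metric):

from math import dist

def greedy_center_selection(k, sites, d=dist):
    # First center: an arbitrary site. Each later center: the site
    # farthest from the current set of centers.
    centers = [sites[0]]
    while len(centers) < k:
        farthest = max(sites, key=lambda s: min(d(s, c) for c in centers))
        centers.append(farthest)
    return centers

# e.g. greedy_center_selection(2, [(0, 0), (1, 0), (10, 0)]) -> [(0, 0), (10, 0)]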

Center Selection: Analysis of Greedy Algorithm. Theorem. Let C* be an optimal set of centers. Then r(C) ≤ 2r(C*). Pf. (by contradiction) Assume r(C*) < ½ r(C). For each site c_i in C, consider the ball of radius ½ r(C) around it; these balls are disjoint since the centers in C are pairwise at least r(C) apart. Exactly one c_i* lies in each ball; let c_i be the site paired with c_i*. Consider any site s and its closest center c_i* in C*. Then dist(s, C) ≤ dist(s, c_i) ≤ dist(s, c_i*) + dist(c_i*, c_i) ≤ 2r(C*), by the triangle inequality, since dist(s, c_i*) ≤ r(C*) (c_i* is the closest center to s) and dist(c_i*, c_i) ≤ r(C*). Thus r(C) ≤ 2r(C*), contradicting the assumption. [Figure: disjoint balls of radius ½ r(C) around the sites of C, each containing exactly one optimal center c_i*.]

Center Selection. Theorem. Let C* be an optimal set of centers. Then r(C) ≤ 2r(C*). Theorem. The greedy algorithm is a 2-approximation for the center selection problem. Remark. The greedy algorithm always places centers at sites, but is still within a factor of 2 of the best solution that is allowed to place centers anywhere (e.g., at arbitrary points in the plane). Question. Is there hope of a 3/2-approximation? 4/3? Theorem. Unless P = NP, there is no ρ-approximation for the center selection problem for any ρ < 2.

11.4 The Pricing Method: Vertex Cover

Weighted Vertex Cover. Weighted vertex cover. Given a graph G with vertex weights, find a vertex cover of minimum weight. [Figure: two example graphs with vertex weights 2, 4, 2, 9; one cover has weight = 2 + 2 + 4, the other weight = 9.]

Weighted Vertex Cover. Pricing method. Each edge must be covered by some vertex. Edge e pays a price p_e ≥ 0 to use its endpoints. Fairness. Edges incident to vertex i should pay at most w_i in total: for each vertex i, Σ_{e = (i, j)} p_e ≤ w_i. Claim. For any vertex cover S and any fair prices p_e: Σ_e p_e ≤ w(S). Proof. Σ_{e ∈ E} p_e ≤ Σ_{i ∈ S} Σ_{e = (i, j)} p_e ≤ Σ_{i ∈ S} w_i = w(S). The first inequality holds because each edge e is covered by at least one node in S; the second sums the fairness inequalities over the nodes in S.

Pricing Method. Pricing method. Set prices and find a vertex cover simultaneously. Say node i is tight when Σ_{e = (i, j)} p_e = w_i.

Weighted-Vertex-Cover-Approx(G, w) {
   foreach e in E
      p_e ← 0
   while (∃ edge i-j such that neither i nor j is tight) {
      select such an edge e
      increase p_e as much as possible without violating fairness
   }
   S ← set of all tight nodes
   return S
}
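A single-pass Python sketch (an illustration, not from the slides): visiting each edge once and raising its price until an endpoint goes tight implements the while loop, since an edge with a tight endpoint is never selected again.

def pricing_vertex_cover(edges, w):
    # edges: list of (i, j) pairs; w: dict mapping vertex -> weight.
    paid = {i: 0 for i in w}                     # total price charged to each vertex
    for i, j in edges:
        if paid[i] < w[i] and paid[j] < w[j]:    # neither endpoint tight yet
            delta = min(w[i] - paid[i], w[j] - paid[j])
            paid[i] += delta                     # raise p_e by delta; fairness holds
            paid[j] += delta
    return {i for i in w if paid[i] == w[i]}     # tight nodes form the cover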

Pricing Method. [Figure 11.8: a sample execution of the pricing method, showing the price of each edge (e.g., edge a-b) and each vertex weight.]

Pricing Method: Analysis. Theorem. The pricing method is a 2-approximation. Pf. The algorithm terminates, since at least one new node becomes tight after each iteration of the while loop. Let S = set of all tight nodes upon termination. S is a vertex cover: if some edge i-j were uncovered, then neither i nor j would be tight, and the while loop would not have terminated. Let S* be an optimal vertex cover. We show w(S) ≤ 2w(S*): w(S) = Σ_{i ∈ S} w_i = Σ_{i ∈ S} Σ_{e = (i, j)} p_e ≤ Σ_{i ∈ V} Σ_{e = (i, j)} p_e = 2 Σ_{e ∈ E} p_e ≤ 2w(S*). The steps hold because, in order: all nodes in S are tight; S ⊆ V and prices are ≥ 0; each edge is counted twice; and the fairness lemma applies to S*.

11.6 LP Rounding: Vertex Cover

Weighted Vertex Cover. Weighted vertex cover. Given an undirected graph G = (V, E) with vertex weights w_i ≥ 0, find a minimum-weight subset of nodes S such that every edge is incident to at least one vertex in S. [Figure: an example graph on nodes A through J with vertex weights; the highlighted cover has total weight = 55.]

Weighted Vertex Cover: IP Formulation. Weighted vertex cover. Given an undirected graph G = (V, E) with vertex weights w_i ≥ 0, find a minimum-weight subset of nodes S such that every edge is incident to at least one vertex in S. Integer programming formulation. Model the inclusion of each vertex i using a 0/1 variable x_i: x_i = 0 if vertex i is not in the vertex cover, and x_i = 1 if it is. Vertex covers are in 1-1 correspondence with 0/1 assignments: S = {i ∈ V : x_i = 1}. Objective function: minimize Σ_i w_i x_i. Must take either i or j: x_i + x_j ≥ 1.

Weighted Vertex Cover: IP Formulation. Weighted vertex cover. Integer programming formulation. (ILP) min Σ_{i ∈ V} w_i x_i s.t. x_i + x_j ≥ 1 for all (i, j) ∈ E, and x_i ∈ {0, 1} for all i ∈ V. Observation. If x* is an optimal solution to (ILP), then S = {i ∈ V : x*_i = 1} is a min-weight vertex cover.

Integer Programming. INTEGER-PROGRAMMING. Given integers a_ij and b_i, find integers x_j that satisfy: max c^T x s.t. Ax ≥ b, x integral; that is, Σ_{j=1}^{n} a_ij x_j ≥ b_i for 1 ≤ i ≤ m, with x_j ≥ 0 and x_j integral for 1 ≤ j ≤ n. Observation. The vertex cover formulation proves that integer programming is an NP-hard search problem, even if all coefficients are 0/1 and at most two variables appear per inequality.

Linear Programming. Linear programming. Max/min a linear objective function subject to linear inequalities. Input: integers c_j, b_i, a_ij. Output: real numbers x_j. (P) max c^T x s.t. Ax ≤ b, x ≥ 0; equivalently, max Σ_{j=1}^{n} c_j x_j s.t. Σ_{j=1}^{n} a_ij x_j ≤ b_i for 1 ≤ i ≤ m and x_j ≥ 0 for 1 ≤ j ≤ n. Linear. No x², xy, arccos(x), x(1−x), etc. Simplex algorithm. [Dantzig 1947] Can solve LP in practice. Ellipsoid algorithm. [Khachian 1979] Can solve LP in poly-time.

LP Feasible Region. LP geometry in 2D. [Figure: feasible region bounded by the lines x_1 = 0, x_2 = 0, 2x_1 + x_2 = 6, and x_1 + 2x_2 = 6.]

Weighted Vertex Cover: LP Relaxation. Weighted vertex cover. Linear programming formulation. (LP) min Σ_{i ∈ V} w_i x_i s.t. x_i + x_j ≥ 1 for all (i, j) ∈ E, and x_i ≥ 0 for all i ∈ V. Observation. The optimal value of (LP) is ≤ the optimal value of (ILP). Pf. The LP has fewer constraints. Note. The LP is not equivalent to vertex cover. [Figure: a triangle with x_i = ½ at every node is LP-feasible, yet selects no vertex.] Q. How can solving the LP help us find a small vertex cover? A. Solve the LP and round the fractional values.

Weighted Vertex Cover. Theorem. If x* is an optimal solution to (LP), then S = {i ∈ V : x*_i ≥ ½} is a vertex cover whose weight is at most twice the minimum possible weight. Pf. [S is a vertex cover] Consider an edge (i, j) ∈ E. Since x*_i + x*_j ≥ 1, either x*_i ≥ ½ or x*_j ≥ ½, so (i, j) is covered. Pf. [S has desired cost] Let S* be an optimal vertex cover. Then Σ_{i ∈ S*} w_i ≥ Σ_{i ∈ V} w_i x*_i ≥ ½ Σ_{i ∈ S} w_i, where the first inequality holds because the LP is a relaxation and the second because x*_i ≥ ½ for every i ∈ S.
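As an end-to-end illustration (assuming SciPy is available; not part of the slides), solving the LP relaxation and rounding at ½ might look like this:

import numpy as np
from scipy.optimize import linprog

def lp_round_vertex_cover(n, edges, w):
    # LP relaxation: min w.x s.t. x_i + x_j >= 1 per edge, 0 <= x_i <= 1,
    # encoded for linprog as -x_i - x_j <= -1.
    A = np.zeros((len(edges), n))
    for row, (i, j) in enumerate(edges):
        A[row, i] = A[row, j] = -1.0
    res = linprog(c=w, A_ub=A, b_ub=-np.ones(len(edges)),
                  bounds=[(0, 1)] * n, method="highs")
    return {i for i in range(n) if res.x[i] >= 0.5}   # round fractional values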

Weighted Vertex Cover. Theorem. The LP rounding algorithm is a 2-approximation for weighted vertex cover. Theorem. [Dinur-Safra 2001] If P ≠ NP, then there is no ρ-approximation for any ρ < 1.3607 (= 10√5 − 21), even with unit weights. Open research problem. Close the gap.

* 11.7 Load Balancing Reloaded

Generalized Load Balancing. Input. Set of m machines M; set of n jobs J. Job j must run contiguously on an authorized machine in M_j ⊆ M. Job j has processing time t_j. Each machine can process at most one job at a time. Def. Let J(i) be the subset of jobs assigned to machine i. The load of machine i is L_i = Σ_{j ∈ J(i)} t_j. Def. The makespan is the maximum load on any machine, L = max_i L_i. Generalized load balancing. Assign each job to an authorized machine to minimize makespan.

Generalized Load Balancing: Integer Linear Program and Relaxation. ILP formulation. x_ij = time machine i spends processing job j.

(IP) min L
s.t. Σ_i x_ij = t_j for all j ∈ J;
     Σ_j x_ij ≤ L for all i ∈ M;
     x_ij ∈ {0, t_j} for all j ∈ J and i ∈ M_j;
     x_ij = 0 for all j ∈ J and i ∉ M_j.

LP relaxation.

(LP) min L
s.t. Σ_i x_ij = t_j for all j ∈ J;
     Σ_j x_ij ≤ L for all i ∈ M;
     x_ij ≥ 0 for all j ∈ J and i ∈ M_j;
     x_ij = 0 for all j ∈ J and i ∉ M_j.

Generalized Load Balancing: Lower Bounds. Lemma 1. Let L be the optimal value of the LP. Then the optimal makespan L* ≥ L. Pf. The LP has fewer constraints than the IP formulation. Lemma 2. The optimal makespan L* ≥ max_j t_j. Pf. Some machine must process the most time-consuming job.

Generalized Load Balancing: Structure of LP Solution. Lemma 3. Let x be a solution to the LP. Let G(x) be the bipartite graph with an edge between machine i and job j if x_ij > 0. Then we can assume G(x) is acyclic. Pf. (deferred) We can transform x into another LP solution where G(x) is acyclic, if the LP solver doesn't return such an x. [Figure: bipartite graphs on job and machine nodes, one acyclic and one cyclic.]

Generalized Load Balancing: Rounding. Rounded solution. Find an LP solution x where G(x) is a forest. Root the forest G(x) at some arbitrary machine node r. If job j is a leaf node, assign j to its parent machine i. If job j is not a leaf node, assign j to one of its children. Lemma 4. The rounded solution only assigns jobs to authorized machines. Pf. If job j is assigned to machine i, then x_ij > 0, and the LP solution can only assign positive values to authorized machines.

Generalized Load Balancing: Analysis. Lemma 5. If job j is a leaf node and machine i = parent(j), then x_ij = t_j. Pf. Since j is a leaf, x_kj = 0 for all machines k ≠ parent(j); the LP constraint Σ_i x_ij = t_j then forces x_ij = t_j. Lemma 6. At most one non-leaf job is assigned to a machine. Pf. The only possible non-leaf job assigned to machine i is parent(i).

Generalized Load Balancing: Analysis. Theorem. The rounded solution is a 2-approximation. Pf. Let J(i) be the jobs assigned to machine i. By Lemma 6, the load L_i on machine i has two components: leaf jobs and possibly the job parent(i). Leaf jobs: Σ_{j ∈ J(i), j a leaf} t_j = Σ_{j ∈ J(i), j a leaf} x_ij ≤ Σ_{j ∈ J} x_ij ≤ L ≤ L*, using Lemma 5, then the LP load constraint for machine i, then Lemma 1 (the LP is a relaxation of the optimal makespan). Parent: t_parent(i) ≤ L* by Lemma 2. Thus the overall load is L_i ≤ 2L*.

Generalized Load Balancing: Flow Formulation. Flow formulation of LP. Σ_i x_ij = t_j for all j ∈ J; Σ_j x_ij ≤ L for all i ∈ M; x_ij ≥ 0 for all j ∈ J and i ∈ M_j; x_ij = 0 for all j ∈ J and i ∉ M_j. Observation. Solutions to the feasible flow problem with value L are in one-to-one correspondence with LP solutions of value L.

Generalized Load Balancing: Structure of Solution. Lemma 3. Let (x, L) be a solution to the LP. Let G(x) be the graph with an edge between machine i and job j if x_ij > 0. We can find another solution (x', L) such that G(x') is acyclic. Pf. Let C be a cycle in G(x). Augment the flow along the cycle C; flow conservation is maintained. At least one edge of C is removed (and none are added). Repeat until G(x') is acyclic. [Figure: flow values on a cycle C in G(x) before and after augmenting along C; one edge of C drops to zero.]

Conclusions. Running time. The bottleneck operation in our 2-approximation is solving one LP with mn + 1 variables. Remark. Can solve the LP using flow techniques on a graph with m + n + 1 nodes: given L, find a feasible flow if it exists, and binary search to find L*. Extension: unrelated parallel machines. [Lenstra-Shmoys-Tardos 1990] Job j takes t_ij time if processed on machine i. There is a 2-approximation algorithm via LP rounding, and no 3/2-approximation algorithm unless P = NP.

11.8 Knapsack Problem

Polynomial Time Approximation Scheme. PTAS. A (1 + ε)-approximation algorithm for any constant ε > 0. Load balancing. [Hochbaum-Shmoys 1987] Euclidean TSP. [Arora 1996] Consequence. A PTAS produces an arbitrarily high-quality solution, but trades off accuracy for time. This section. PTAS for the knapsack problem via rounding and scaling.

Knapsack Problem. Knapsack problem. Given n objects and a "knapsack." Item i has value v_i > 0 and weight w_i > 0 (we'll assume w_i ≤ W). The knapsack can carry weight up to W. Goal: fill the knapsack so as to maximize total value. Ex: W = 11; items 1 through 5 have values 1, 6, 18, 22, 28 and weights 1, 2, 5, 6, 7. The set {3, 4} has value 40.

Knapsack is NP-Complete. KNAPSACK: Given a finite set X, nonnegative weights w_i, nonnegative values v_i, a weight limit W, and a target value V, is there a subset S ⊆ X such that Σ_{i ∈ S} w_i ≤ W and Σ_{i ∈ S} v_i ≥ V? SUBSET-SUM: Given a finite set X, nonnegative values u_i, and an integer U, is there a subset S ⊆ X whose elements sum to exactly U? Claim. SUBSET-SUM ≤_P KNAPSACK. Pf. Given an instance (u_1, ..., u_n, U) of SUBSET-SUM, create a KNAPSACK instance with v_i = w_i = u_i and V = W = U; then Σ_{i ∈ S} u_i ≤ U and Σ_{i ∈ S} u_i ≥ U together force the subset to sum to exactly U.

Knapsack Problem: Dynamic Programming I. Def. OPT(i, w) = max value of a subset of items 1, ..., i with weight limit w. Case 1: OPT does not select item i; OPT selects the best of 1, ..., i−1 using weight limit w. Case 2: OPT selects item i; the new weight limit is w − w_i, and OPT selects the best of 1, ..., i−1 using weight limit w − w_i. OPT(i, w) = 0 if i = 0; OPT(i−1, w) if w_i > w; max{ OPT(i−1, w), v_i + OPT(i−1, w − w_i) } otherwise. Running time. O(n W), where W = weight limit. Not polynomial in input size!
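The recurrence translates directly into a table-filling Python sketch (an illustration, not from the slides):

def knapsack_max_value(values, weights, W):
    # O(n W) dynamic program over weight limits (DP I).
    n = len(values)
    opt = [[0] * (W + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for w in range(W + 1):
            opt[i][w] = opt[i - 1][w]            # case 1: skip item i
            if weights[i - 1] <= w:              # case 2: take item i
                opt[i][w] = max(opt[i][w],
                                values[i - 1] + opt[i - 1][w - weights[i - 1]])
    return opt[n][W]

# The slide's example: items {3, 4} give the optimum, 40.
print(knapsack_max_value([1, 6, 18, 22, 28], [1, 2, 5, 6, 7], 11))   # 40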

Knapsack Problem: Dynamic Programming II. Def. OPT(i, v) = min weight of a subset of items 1, ..., i that yields value exactly v. Case 1: OPT does not select item i; OPT selects the best of 1, ..., i−1 that achieves value exactly v. Case 2: OPT selects item i; this consumes weight w_i, and the new value needed is v − v_i. OPT(i, v) = 0 if v = 0; ∞ if i = 0 and v > 0; OPT(i−1, v) if v_i > v; min{ OPT(i−1, v), w_i + OPT(i−1, v − v_i) } otherwise. Running time. O(n V*) = O(n² v_max), where V* = optimal value = maximum v such that OPT(n, v) ≤ W, and V* ≤ n v_max. Not polynomial in input size!

Knapsack: FPTAS. Intuition for the approximation algorithm. Round all values up to lie in a smaller range, run the dynamic programming algorithm on the rounded instance, and return the optimal items of the rounded instance. Original instance (W = 11): items 1 through 5 have values 134,221; 656,342; 1,810,013; 22,217,800; 28,343,199 and weights 1, 2, 5, 6, 7. Rounded instance (W = 11): the same weights, with values 2, 7, 19, 23, 29.

Knapsack: FPTAS Knapsack FPTAS. Round up all values: v i = " # v i! $ %!, ˆ " v i = v i #! $ % v max = largest value in original instance ε = precision parameter θ = scaling factor = ε v max / n Observation. Optimal solution to problems with or are equivalent. Intuition. close to v so optimal solution using is nearly optimal; v ˆ v small and integral so dynamic programming algorithm is fast. Running time. O(n 3 / ε). Dynamic program II running time is O(n 2 v ˆ max ), where ˆ v max = " # v max! $ % = " # n & $ % v v v ˆ 79

Knapsack: FPTAS Knapsack FPTAS. Round up all values: v i = " # v i! $ %! Theorem. If S is solution found by our algorithm and S* is any other feasible solution then $ (1+!) v i " v i i # S Pf. Let S* be any feasible solution satisfying weight constraint. $ i # S* v i i! S* " # " v i i! S* # " v i i! S # " (v i + $) i! S always round up solve rounded instance optimally never round up by more than θ # " v i + n$ i! S S n DP alg can take v max # (1+%) " v i i! S n θ = ε v max, v max Σ i S v i 80

Extra Slides

Load Balancing on 2 Machines. Claim. Load balancing is hard even with only 2 machines. Pf. NUMBER-PARTITIONING ≤_P LOAD-BALANCE, and NUMBER-PARTITIONING is NP-complete by Exercise 8.26. [Figure: jobs a through g of various lengths split across machines 1 and 2 so that both finish at time L; answer: yes.]

Center Selection: Hardness of Approximation. Theorem. Unless P = NP, there is no ρ-approximation algorithm for the metric k-center problem for any ρ < 2. Pf. We show how we could use a (2 − ε)-approximation algorithm for k-center to solve DOMINATING-SET in poly-time (DOMINATING-SET is NP-complete; see Exercise 8.29). Let G = (V, E), k be an instance of DOMINATING-SET. Construct an instance G' of k-center with sites V and distances d(u, v) = 1 if (u, v) ∈ E and d(u, v) = 2 if (u, v) ∉ E. Note that G' satisfies the triangle inequality. Claim: G has a dominating set of size k iff there exist k centers C* with r(C*) = 1. Thus, if G has a dominating set of size k, a (2 − ε)-approximation algorithm on G' must find a solution C* with r(C*) = 1, since it cannot use any edge of distance 2.