AN ANALYSIS OF THE GREEDY HEURISTIC FOR INDEPENDENCE SYSTEMS

Similar documents
Matroid Representation of Clique Complexes

The Greedy Algorithm for the Symmetric TSP

The minimum G c cut problem

Perfect matchings in highly cyclically connected regular graphs

A combinatorial algorithm minimizing submodular functions in strongly polynomial time

Decomposing dense bipartite graphs into 4-cycles

Worst Case Performance of Approximation Algorithm for Asymmetric TSP

The Multiple Traveling Salesman Problem with Time Windows: Bounds for the Minimum Number of Vehicles

arxiv: v1 [math.co] 28 Oct 2016

Packing of Rigid Spanning Subgraphs and Spanning Trees

Constrained Maximization of Non-Monotone Submodular Functions

The cocycle lattice of binary matroids

NP-Complete Problems. More reductions

Bounds on the Traveling Salesman Problem

ON SIZE, CIRCUMFERENCE AND CIRCUIT REMOVAL IN 3 CONNECTED MATROIDS

Arithmetic Progressions with Constant Weight

A Randomized Rounding Approach to the Traveling Salesman Problem

Minimal Decompositions of Hypergraphs into Mutually Isomorphic Subhypergraphs

arxiv: v1 [math.co] 13 May 2016

Polynomial-time Reductions

Parity Versions of 2-Connectedness

The Complexity of Maximum. Matroid-Greedoid Intersection and. Weighted Greedoid Maximization

4 CONNECTED PROJECTIVE-PLANAR GRAPHS ARE HAMILTONIAN. Robin Thomas* Xingxing Yu**

ACO Comprehensive Exam March 17 and 18, Computability, Complexity and Algorithms

Preliminaries. Graphs. E : set of edges (arcs) (Undirected) Graph : (i, j) = (j, i) (edges) V = {1, 2, 3, 4, 5}, E = {(1, 3), (3, 2), (2, 4)}

Lifting theorems and facet characterization for a class of clique partitioning inequalities

arxiv: v2 [math.co] 19 Jun 2018

Eulerian Subgraphs in Graphs with Short Cycles

On non-hamiltonian circulant digraphs of outdegree three

On the matrix-cut rank of polyhedra

Diskrete Mathematik und Optimierung

Maximum union-free subfamilies

Decomposition of random graphs into complete bipartite graphs

Graphs and Combinatorics

NP-complete problems. CSE 101: Design and Analysis of Algorithms Lecture 20

Decomposing oriented graphs into transitive tournaments

Compatible Circuit Decompositions of 4-Regular Graphs

The Turán number of sparse spanning graphs

ACO Comprehensive Exam October 14 and 15, 2013

Paths and cycles in extended and decomposable digraphs

Decompositions of graphs into cycles with chords

Compatible Circuit Decompositions of Eulerian Graphs

GRAPH ALGORITHMS Week 7 (13 Nov - 18 Nov 2017)

Packing and decomposition of graphs with trees

Acyclic and Oriented Chromatic Numbers of Graphs

NP-Complete Problems and Approximation Algorithms

Jian Cheng, Cun-Quan Zhang & Bao- Xuan Zhu

A simple LP relaxation for the Asymmetric Traveling Salesman Problem

Graph coloring, perfect graphs

Spanning Paths in Infinite Planar Graphs

Scheduling and Optimization Course (MPRI)

Chapter 3: Proving NP-completeness Results

On judicious bipartitions of graphs

Containment restrictions

Decomposition of random graphs into complete bipartite graphs

Partial cubes: structures, characterizations, and constructions

Induced Saturation of Graphs

Probabilistic Proofs of Existence of Rare Events. Noga Alon

HAMILTONIAN PROPERTIES OF TRIANGULAR GRID GRAPHS. 1. Introduction

Chapter 34: NP-Completeness

THE COMPLEXITY OF DISSOCIATION SET PROBLEMS IN GRAPHS. 1. Introduction

10.3 Matroids and approximation

Group Colorability of Graphs

arxiv: v1 [math.co] 22 Jan 2018

Some Nordhaus-Gaddum-type Results

Connectedness of Efficient Solutions in Multiple. Objective Combinatorial Optimization

Nowhere-zero Unoriented Flows in Hamiltonian Graphs

Approximation algorithms and hardness results for the clique packing problem. October, 2007

ON COST MATRICES WITH TWO AND THREE DISTINCT VALUES OF HAMILTONIAN PATHS AND CYCLES

On the number of cycles in a graph with restricted cycle lengths

Rainbow Matchings of Size δ(g) in Properly Edge-Colored Graphs

Independent Transversals in r-partite Graphs

Decomposing planar cubic graphs

Longest paths in strong spanning oriented subgraphs of strong semicomplete multipartite digraphs

Dual Consistent Systems of Linear Inequalities and Cardinality Constrained Polytopes. Satoru FUJISHIGE and Jens MASSBERG.

Approximation Algorithms for Maximum. Coverage and Max Cut with Given Sizes of. Parts? A. A. Ageev and M. I. Sviridenko

An Improved Approximation Algorithm for Maximum Edge 2-Coloring in Simple Graphs

Cycles with consecutive odd lengths

Lecture 11 October 7, 2013

Matroid Optimisation Problems with Nested Non-linear Monomials in the Objective Function

UNAVOIDABLE INDUCED SUBGRAPHS IN LARGE GRAPHS WITH NO HOMOGENEOUS SETS

Group connectivity of certain graphs

On disjoint common bases in two matroids

1 Submodular functions

Technische Universität München, Zentrum Mathematik Lehrstuhl für Angewandte Geometrie und Diskrete Mathematik. Combinatorial Optimization (MA 4502)

Volume 35, Issue 4. Note on goodness-of-fit measures for the revealed preference test: The computational complexity of the minimum cost index

Zero-Sum Flows in Regular Graphs

Graphs with large maximum degree containing no odd cycles of a given length

Approximation Algorithms for Asymmetric TSP by Decomposing Directed Regular Multigraphs

Antoni Marczyk A NOTE ON ARBITRARILY VERTEX DECOMPOSABLE GRAPHS

Proof methods and greedy algorithms

arxiv: v3 [cs.dm] 18 Oct 2017

CS675: Convex and Combinatorial Optimization Fall 2014 Combinatorial Problems as Linear Programs. Instructor: Shaddin Dughmi

Induced subgraphs of prescribed size

Sergey Norin Department of Mathematics and Statistics McGill University Montreal, Quebec H3A 2K6, Canada. and

SOME GRAPHIC USES OF AN EVEN NUMBER OF ODD NODES (to appear in Ann. Inst. Fourier, Grenoble, 49, special number (1999), 1-13.)

Approximation Hardness for Small Occurrence Instances of NP-Hard Problems

Combinatorial Optimisation, Problems I, Solutions Term /2015

Neighborly families of boxes and bipartite coverings

MINIMALLY NON-PFAFFIAN GRAPHS

Transcription:

Annals of Discrete Mathematics 2 (1978) 65-74. @ North-Holland Publishing Company AN ANALYSIS OF THE GREEDY HEURISTIC FOR INDEPENDENCE SYSTEMS Bernhard KORTE and Dirk HAUSMANN Institut fur Okonometrie und Operations Research, Uniuersitiit Bonn, D-53 Bonn, W. Germany The worst case behaviour of the greedy heuristic for independence systems is analyzed by deriving lower bounds for the ratio of the greedy solution value to the optimal value. For two special independence systems, this ratio can be bounded by 1/2, for two other independence systems, it converges with increasing problem size to zero. The main theorem states that for every independence system (E, 9) the ratio is bounded by Ilk, k such that (E, 9) can be represented as the intersection of k matroids. 1. Introduction Since Cook [2] and Karp [8] have shown that many notoriously hard problems in combinatorial optimization are equivalent in the sense that either all or none of them can be solved in polynomial time, the study and analysis of fast heuristic algorithms has become more interesting again. A great number of these heuristics are variants of the well-kn0w.n greedy heuristic for the problem of finding an independent set with maximum weight of a given independence system. Consider an independence system (E, 9), E being an arbitrary finite set and 9 a system of subsets of E with the property: F c G E 9 =+FEY. The elements of 9 are called 9-independent sets or simply independent sets. Moreover, consider a weight function c : E + R' and the optimization problem where c(f) = CeEFc(e). Let {el, e2,..., en} be a numbering of E with i Sj += c(e,)>c(e,). Let E, = {el,..., e,} for 1 S i S n. A subset F c E is called greedy solution of (1) if, for 1 C i C n, F n E, is a maximal independent subset of E,. It is easy to see by induction that a greedy solution is just the set F yielded by the following greedy heuristic : 65

66 B. Korte, D. Hausmann begin F = 8; for i = 1 to n do begin if F U{e,}E 9 then F = F U{e,}; end; end. Recall that an independence system (E, 9) is called a matroid iff, for any subset S C E, all maximal independent subsets of S have the same cardinality. It is well known that, for a matroid (E, S), every greedy solution F is an optimal solution of (1). In this paper, we consider arbitrary independence systems (E, 9) which are not necessarily matroids. We want to characterize the quality of greedy solutions by deriving lower bounds for the ratio c(f,)/c(fo) where F, is a greedy solution and F,, an optimal solution. A first step towards such a characterization is the following basic theorem. Several researchers have worked on this theorem. It was conveyed to us by Edmonds [5]. To our knowledge it was first conjectured by Nemhauser; Jenkyns attacked the theorem in his Ph.D. thesis [6]. Independently Baumgarten [l] found some other proof. Because of the great interest the theorem has received we will below give a very short new proof of it. Let (E, 9) be an independence system and S C E an arbitrary subset; we define: lower rank of S = Ir (S) = min { I FJ : F a maximal independent subset of S} upper rank of S = ur (S) = max { 1 FI : F a maximal independent subset of S}. Obviously, (E, 9) is a matroid iff Ir ( S) = ur(s) for any S C E. Hence, the so-called rank quotient Ir (S) min SCE ur(s) can be interpreted as a measure of how much (E, 9) differs from being a matroid. Theorem 1.1. Let (E, 9) be an independence system, F, a greedy solution and E, an optimum solution of (1). Then for any weight function c Proof. Setting c(en+,): = 0 one obtains through a suitable summation that c(f,)= 2 IF, ne,i(c(e,)-c(e,+,)),=i c(fd = 2 IF,, n EL,=I I (c(e,) - c(e,+j). As Fo n E, c Fo E 9 we have IF,, n E, 1 s ur (E,).

An analysis of the greedy heurisric for independence systems 67 By the definition of a greedy solution, F, n E, is a maximal independent subset of E,, hence 1 F, fl E, 1 2 lr (E,) and thus: From (2), (3), and (4) it follows that c(f,) 3 minsgelr(s)/ur(s). c(fo). Corollary. The bound in Theorem 1.1 is sharp in the following sense: For every independence system (E, 9), there exists a weight function c : E + R' and a greedy solution F, with Proof. Let = Ir(S) ur(so) SCE ur(s) c(e): = 1, e E So 0, e@ So. Let (el, e2,..., e,) be a numbering of E such that e, E F,, ej E So - F,, ek E E - So i < j < k. Obviously such a numbering satisfies ( i S j + c (ei) 3 c (e,)) and Fl is a greedy solution of (1). Hence Applying Theorem 1.I, the corollary follows. Having Theorem 1.1 and its corollary, it is enough to inspect the rank quotient because every sharp bound for the rank quotient is a sharp bound for c(fg)/c(fo) also. In Section 2, we calculate the rank quotient for four special independence systems, the independence systems of the matching problem, the symmetrical travelling salesman problem, the stable set problem, and the acyclic subgraph problem. In Section 3, we show that every independence system can be represented as the intersection of some matroids. The following main theorem states that, for an intersection of k matroids, the rank quotient and hence the quotient c(fg)/c(fl,), too, is bounded below by Ilk. We conclude the paper with some remarks about the sharpness of this bound.

68 B. Korte, D. Hausmann 2. The rank quotient for some special independence systems Let G = (V, E) be a finite undirected graph without loops or multiple edges. A subset F C E is called a matching of G, iff no two edges in F are adjacent, i.e. have a vertex in common. Let 9 be the set of all matchings of G. Obviously, (E, 9) is an independence system. For the so defined 9, (1) is called the matching problem. Theorem 2.1. Let (E, 9) be the independence system of the matching problem for the graph G. In rhe trivial case where every connected component of G is a triangle K,, a path P2, a single edge KZ, or a single vertex K,, we have min Ire= 1. SSE ur(s) Otherwise Proof. We show first that min u 2 1 SSE ur(s) 2 Let S E be any subset and Fl, Fz maximal independent subsets of S. It is enough to show that IF,l/lF2/31/2. Let e EFz \ F,. Since F, U{e}c S and F, is maximal, the set F, U{e} is not independent, i.e. is not a matching of G. Hence there exists an edge 4(e) E F, that is adjacent to e. Since F, is a matching, @(e)e FI\F2. Thus we have defined a mapping @ : F2\Fl + F1\F2. Obviously, for any edge in FI\F2, there are at most two edges in Fz\FI adjacent to it. Hence, 4 maps at most two edges of F2\Fl onto one edge of FI \ F,. It follows: hence I Fi \ Fz I 3 1 +(F2 \ FI) I 3 f I Fz\ Fi 1, 1 F, I = I Fi\ Fz I + I Fi n F zi 3 4 I F2\ Fi 1 + I I Fi n Fz1 = f 1 Fz 1, whence /Fl///F2/21/2. If every connected component of G is isomorphic to K,, Pzr Kz, or K,, obviously lr(s)/ur(s) = 1 for any S C E. Otherwise G contains a subgraph (V',S) isomorphic to P,. Since clearly lr(s)= 1, ur(s)= 2, we have the upper bound min Ires 1/2 SSE ur(s) which together with the lower bound (5) proves the theorem. Now let (V, E) be a complete undirected graph. Let 9 be the set of all subsets of Hamiltonian cycles in (V,E). Since the Hamiltonian cycles are the feasible solutions of the symmetrical travelling salesman problem (TSP), (E, 9) is called the

An analysis of rhe greedy heuristic for independence systems 69 independence system of the symmetrical TSP. Obviously, a subset F C E belongs to 9 iff: (i) every vertex u E V is incident to at most two edges of F and (ii) the partial graph (V, F) contains no non-hamiltonian cycle. Theorem 2.2. Let (E, 9) be the independence system of the symmetrical TSP for the complete graph (V, E). Then Proof. The proof of the lower bound is analogous to that in the proof of Theorem 2.1. Again we construct a mapping 4 : F2\ FI + FI \ Fz which maps as few as possible elements of F2\FI onto the same element of FI\F2. Since the complete proof does not contain new ideas but a rather complicated construction, we omit the details. For the upper bound, we assume V = {vl, uzr..., u,} and define the following three sets Fz = {U,U,+~: 16 i s n - I} U{u,uI}, S = FI U Fz. Obviously, FI and FZ are maximal independent subsets of S, and hence ur(s) = IFz/ = n, The next two theorems show that for some independence systems (E, 9) the rank quotient converges to 0 as the "problem size" I E 1 tends to infinity. Consequently, for these independence systems, the greedy heuristic for large problems can be 'arbitrarily bad.

70 B. Korte, D. Hausmann Let G = (V, E ) be agraph. A subset F c V is called a stable set (or independent set, vertex packing) if no two distinct vertices in F are adjacent. Let 9 be the system of all stable sets of G. Since then (1) is the vertex packing problem, (V, 9) is called the independence system of the vertex packing problem in G. Theorem 2.3. Let G = (V, E) be a graph containing an induced subgraph isomorphic to the star K,,k. Let (V, 9) be the independence system of the vertex packing problem in G. Then Fm min Iro(JO. -- scv ur(s) Proof. Let S : = {u, u,,..., vk} be the set of vertices that induces the subgraph isomorphic to Kl.k and let WEE, lsisk, but u,u,ee, lsi,jsk. FI = {u} and F2 = {v,,..., uk} are maximal independent subsets of S and Theorem 2.4. Let G = (V, E) be a complete directed graph, i.e. E = V x V. Let 9 be the system of (the edge sets of) all its acyclic subgraphs, i.e. 9 = {F c E : F contains no directed cycle}. For the independence system (E, 9), we have Proof. Let n : = I V( = dm, V = {v],..., v,} and F, = {u,u,+,: 1 s i s n - l}, Fz = {au, : 1 S j < i s n}, S = F, U Fa. Clearly, F, and F2 are maximal independent subsets of S and 3. The rank quotient for an arbitrary independence system Having inspected the rank quotient for some special independence systems, we

An analysis of the greedy heuristic for independence systems 71 now turn to an arbitrary independence system. We start with the following easy lemma: Lemma 3.1. For any independence system (E, S), there exist k matroids (E, 9 ), 1 s i s k, with k 9= n 9,=I 1. Proof. Let (E, 9) {F C E : Fff 9}, the so-called circuits of (E, 9). It is easy to see that k be an independence system and cl,..., ck the minimal sets in 9= 0 9 where 9 : ={FCE:C,EF}., = I Clearly, any (E, 9 ) is an independence system. Let S C E. If C,P S, S itself is the only maximal independent subset of S. Otherwise, if C, G S, every maximal independent subset of S consists of all elements of S except one element of C,. Hence all maximal independent subsets of S have the same cardinality, whence (E, 9 ) is a matroid. By the proof of Lemma 3.1, the minimum number k = k(e, 9), for which an independence system (E, 9) can be represented as the intersection of k matroids, is bounded by the number of its circuits. Of course this bound is far from being sharp; it is easy to see (cf. Edmonds [3]) that e.g. the independence system of bipartite matchings is an intersection of two matroids and the independence system of the asymmetric TSP is an intersection of three matroids. The importance of this minimum number k (E, 9) lies in the following main theorem: Theorem 3.2. Let (E, S ), 1 s i s k, be matroids and 9 : = n:=, 9. For the independence system (E,9) we have min Ir(Sr*I SCE ur(s) k Proof. Let S C E be any subset and F,, F2 maximal independent subsets of S. It is enough to show IFII/IFzJ 3 l/k. For i = 1,..., k and j = 1,2, let Ff be a maximal $ -independent subset of FI U F2 containing 6. If there were an element e E F2\Fl with e E n:=, F;\FI, then F, U {e} C n:=, FI 9, a contradiction to the maximality of F,. Hence each e E F2\FI can be an element of F,\F, for at most k - 1 indices i. It follows By the definition of a matroid, we have IF;/= IF;\ for any i, 1s i S k.

12 B. Korte, D. Hausmann Hence the above inequality implies IF2)C(i IF;JklF,I)+lF~( \ i=l k =c i=l IF; s k IF,) The theorem follows. As for the question of sharpness of the bound in Theorem 3.2, we can state the following easy theorem. Theorem 3.3. For every integer k > 1, there exists an independence system (E, 9) which is an intersection of k matroids but not an intersection of less than k matroids and which has the property Proof. Let G = (V, E ) be a graph isomorphic to Kl.k, say v = {U, UI,..., Uk}, E = {UU,: 1 i C k}. By the proof of Theorem 2.3, the independence system (V, 9) of the stable sets of G has the property min IrO& SGV ur(s) k' Obviously, the k edges {u, u,}, 1 S i C k, are the circuits of (V,9). Setting 9' : = {F C V :{u, u,} g F}, it follows from the proof of Lemma 3.1 that (V, 9) is the intersection of the k matroids 9', 1 s i c k. Hence, by Theorem 3.2 and (6), min Wl=L SGV ur(s) k ' and (V, 9) cannot be represented as the intersection of less than k matroids. Theorem 3.3 says that, for every integer k, there exists an independence system for which the bound in Theorem 3.2 is sharp. More interesting is the question if the bound is sharp for every independence system. The next theorem gives a (negative) answer to this question. Theorem 3.4. Let (E, 9) be the independence system of the symmetrical TSP for a complete graph (V, E) with 1 V I z= 4. Then there is no integer k such that (E, 9) can be represented as the intersection of k matroids and that

An analysis of the greedy heuristic for independence systems 73 Proof. Assume there is such an integer k. Then, by Theorem 2.2, k = 2 and (E, 9) can be represented as the intersection of two matroids Mi = (E, Si), i = 1,2. Let V = {ul, u2,..., un}, n 24. Then F: = {uiu2, u22)3,..., unul}e 9, but F U {ulu3} 9. Obviously, every "circuit" of (E, 9), (i.e. every minimal dependent subset of E) contained in F U {ulu3} is also a circuit of MI or of M2. Now it is well known (cf. e.g. [lo]) that, for a matroid, the union of an independent set and a singleton contains at most one circuit. Hence F U {ulu,} contains at most two circuits of (E, 9). But in fact, F U {ulu3} contains even four circuits, namely: c1 = {UIUZ, UIU3, UlU"), c3 = (UIU2, UZV3, U3UI}, c2 = {U3UI, u3u2, u3u41, c. = (UIU3, u3u4, u4us,...) unu,}. We conclude the paper with some final remarks. Remark 3.5. Theorem 3.3 and its proof say that there are families of independence systems (En, 9"), n = 1,2,..., with IE. I = II and the property that the minimum number k (En, 9") for which (En, 9") can be represented as the inters%- tion of k matroids is at least a linear function of n. In particular, there is no general constant k' with k(e, 9) S k' for all independence systems. We conjecture that there are even families (E", 9") where k (En, 9,,) is a super-linear - perhaps even exponential - function of n. This conjecture cannot be proven by means of Theorem 3.2 because for every independence system (E, 9) 1 = max -,/El Ir(S) SCE Ir(S) min SCE ur(s) Remark 3.6. Up to now, we inspected the greedy heuristic for the maximum problem (1) only. What about a greedy heuristic for a minimum problem? Let us consider the problem c(f)=min! F maximal in 9 (7) where (E, 9) is an independence system and c : E + R' is a weight function. The greedy heuristic for (7) starts with the independent set F = 0 and, in every step of the algorithm, adds a new element e E E \F to the current independent set F such that the new set F U {e} is again independent and that, subject to this condition, e has minimum weight c,. For the maximum problem (l), it follows from Lemma 3.1, Theorem 3.3, and Theorem 1.1, that for any independence system (E,9) there exists a bound Ilk such that

74 B. Korte, D. Hausmann for any greedy solution F, and optima1 - solution F, and for all weight functions c. This is not true for the minimum problem (7). E.g. consider the independence system (V, 9) of the vertex packing problem in the graph K,, 7 5 5 and the weight function c(v) = v = v, M>2, v = ~ 3. The optimal solution of the corresponding minimum problem (7) is Fo = {vz} and the greedy solution is F, = {v,, v3}. The quotient c(fo)/c (FB) = 2/M converges to 0 as M tends to infinity. Acknowledgement After finishing this paper we became aware of a working paper by Jenkyns (71, which coincides with some of our results. We greatfully acknowledge some comments of T.A. Jenkyns on this paper especially for shortening the proof of Theorem 3.2. References [ 11 U. Baumgarten, Worst-case-Abschatzung fur die Greedy-Heuristic auf monotonen Mengensystemen, Working Paper, Institut fur Okonometrie und Operations Research, University Bonn (1976). [2] S.A. Cook, The complexity of theorem-proving procedures, Proc. 3rd ACM Symp. Theory of Computing (1971) 151-158. [3] J. Edmonds, Matroids and the greedy algorithm, Math. Programming 1 (1971) 127-136. [4] J. Edmonds, Matroid intersection, unpublished working paper (1976). [S] J. Edmonds, private communication (1976). [6] T.A. Jenkyns, Matchoids: a generalization of matchings and matroids, Ph.D. Thesis, University of Waterloo (1974). [7] T.A. Jenkyns, The efficacy of the greedy algorithm, Proc. 7th S-E Conf. Combinatorics, Graph Theory and Computing, Utilitas Math. Winnipeg (1976) 341-350. [8] R.M. Karp, Reducibility among combinatorial problems, in: R.E. Miller and J.W. Thatcher, eds., Complexity of Computer Computations (Plenum Press, New York, 1972) 85-104. [9] R.M. Karp, The fast approximate solution of hard combinatorial problems, Proc. 6th S-E Conf. Combinatorics, Graph Theory and Computing, Winnipeg (1976) 15-31. [lo] R. von Randow, Introduction to the Theory of Matroids (Springer, Berlin, 1975).