On the minimum neighborhood of independent sets in the n-cube

Similar documents
An Approximation Algorithm for Constructing Error Detecting Prefix Codes

Paulo E. D. Pinto 1,Fábio Protti 2 and Jayme L. Szwarcfiter 3

Ma/CS 6b Class 25: Error Correcting Codes 2

Run-length & Entropy Coding. Redundancy Removal. Sampling. Quantization. Perform inverse operations at the receiver EEE

EMBEDDED PATHS AND CYCLES IN FAULTY HYPERCUBES

CSCI 2570 Introduction to Nanocomputing

On improving matchings in trees, via bounded-length augmentations 1

1. Basics of Information

Applicable Analysis and Discrete Mathematics available online at HAMILTONIAN PATHS IN ODD GRAPHS 1

Information Theory and Statistics Lecture 2: Source coding

Fault Tolerance Technique in Huffman Coding applies to Baseline JPEG

Locating-Total Dominating Sets in Twin-Free Graphs: a Conjecture

On the Dynamic Chromatic Number of Graphs

Hanna Furmańczyk EQUITABLE COLORING OF GRAPH PRODUCTS

Robust Network Codes for Unicast Connections: A Case Study

FRACTIONAL PACKING OF T-JOINS. 1. Introduction

Information Theory with Applications, Math6397 Lecture Notes from September 30, 2014 taken by Ilknur Telkes

ON DOMINATING THE CARTESIAN PRODUCT OF A GRAPH AND K 2. Bert L. Hartnell

Graph coloring, perfect graphs

10.4 The Kruskal Katona theorem

Parity Dominating Sets in Grid Graphs

Intro to Information Theory

Nowhere-zero Unoriented Flows in Hamiltonian Graphs

Image and Multidimensional Signal Processing

Geometric Steiner Trees

Binary Decision Diagrams. Graphs. Boolean Functions

FINAL EXAM PRACTICE PROBLEMS CMSC 451 (Spring 2016)

Partial cubes: structures, characterizations, and constructions

CS264: Beyond Worst-Case Analysis Lecture #11: LP Decoding

1 Introduction to information theory

EECS 229A Spring 2007 * * (a) By stationarity and the chain rule for entropy, we have

Lecture 1 : Data Compression and Entropy

SIGNAL COMPRESSION Lecture 7. Variable to Fix Encoding

exercise in the previous class (1)

17.1 Binary Codes Normal numbers we use are in base 10, which are called decimal numbers. Each digit can be 10 possible numbers: 0, 1, 2, 9.

Near-domination in graphs

Maximising the number of induced cycles in a graph

The domination game played on unions of graphs

ACO Comprehensive Exam 19 March Graph Theory

On the Cost of Worst-Case Coding Length Constraints

On the diameter of the Cayley. graphs H l,p.

Pascal Ochem 1 and Elise Vaslet Introduction REPETITION THRESHOLDS FOR SUBDIVIDED GRAPHS AND TREES

Perfect matchings in highly cyclically connected regular graphs

STRUCTURE OF THE SET OF ALL MINIMAL TOTAL DOMINATING FUNCTIONS OF SOME CLASSES OF GRAPHS

Coloring. Basics. A k-coloring of a loopless graph G is a function f : V (G) S where S = k (often S = [k]).

Algorithms and Theory of Computation. Lecture 22: NP-Completeness (2)

Error Detection and Correction: Hamming Code; Reed-Muller Code

Partial characterizations of clique-perfect graphs II: diamond-free and Helly circular-arc graphs

Chapter 3 Source Coding. 3.1 An Introduction to Source Coding 3.2 Optimal Source Codes 3.3 Shannon-Fano Code 3.4 Huffman Code

Notes on Graph Theory

Transducers for bidirectional decoding of codes with a finite deciphering delay

IOSR Journal of Mathematics (IOSR-JM) e-issn: , p-issn: x. Volume 9, Issue 2 (Nov. Dec. 2013), PP

Colourings of cubic graphs inducing isomorphic monochromatic subgraphs

( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r C h a p t e r 1 7 : I n f o r m a t i o n S c i e n c e P a g e 1

Lecture 4 : Introduction to Low-density Parity-check Codes

NOTE ON CYCLIC DECOMPOSITIONS OF COMPLETE BIPARTITE GRAPHS INTO CUBES

On decomposing graphs of large minimum degree into locally irregular subgraphs

On Minimal Words With Given Subword Complexity

Low-Density Parity-Check Codes

The Pure Parsimony Problem

The Number of Non-Zero (mod k) Dominating Sets of Some Graphs

On graphs having a unique minimum independent dominating set

EXACT DOUBLE DOMINATION IN GRAPHS

Module 5 EMBEDDED WAVELET CODING. Version 2 ECE IIT, Kharagpur

Computability and Complexity Theory: An Introduction

Efficient Reassembling of Graphs, Part 1: The Linear Case

Lecture 6: Expander Codes

Colourings of cubic graphs inducing isomorphic monochromatic subgraphs

On two conjectures about the proper connection number of graphs

AALBORG UNIVERSITY. Total domination in partitioned graphs. Allan Frendrup, Preben Dahl Vestergaard and Anders Yeo

Group connectivity of certain graphs

MATH 433 Applied Algebra Lecture 21: Linear codes (continued). Classification of groups.

Zero-Sum Flows in Regular Graphs

Lecture 4 : Adaptive source coding algorithms

NP-complete Problems

Coding of memoryless sources 1/35

Modular Monochromatic Colorings, Spectra and Frames in Graphs

Analogies and discrepancies between the vertex cover number and the weakly connected domination number of a graph

A Mathematical Theory of Communication

Antimagic Properties of Graphs with Large Maximum Degree

Lecture 16. Error-free variable length schemes (contd.): Shannon-Fano-Elias code, Huffman code

HAMBURGER BEITRÄGE ZUR MATHEMATIK

Intersecting curves (variation on an observation of Maxim Kontsevich) by Étienne GHYS

1.6: Solutions 17. Solution to exercise 1.6 (p.13).

Maximal and Maximum Independent Sets In Graphs With At Most r Cycles

CSE 202 Homework 4 Matthias Springer, A

1.3 Vertex Degrees. Vertex Degree for Undirected Graphs: Let G be an undirected. Vertex Degree for Digraphs: Let D be a digraph and y V (D).

Algorithm Design and Analysis

Codes on graphs. Chapter Elementary realizations of linear block codes

Chapter 11. Min Cut Min Cut Problem Definition Some Definitions. By Sariel Har-Peled, December 10, Version: 1.

Equational Logic. Chapter Syntax Terms and Term Algebras

More on NP and Reductions

arxiv: v2 [math.co] 7 Jan 2016

Binary Decision Diagrams

Analysis of Absorbing Sets and Fully Absorbing Sets of Array-Based LDPC Codes

4-coloring P 6 -free graphs with no induced 5-cycles

Packing and decomposition of graphs with trees

ON THE NUMBERS OF CUT-VERTICES AND END-BLOCKS IN 4-REGULAR GRAPHS

Even Pairs and Prism Corners in Square-Free Berge Graphs

Bounds on the generalised acyclic chromatic numbers of bounded degree graphs

Transcription:

Matemática Contemporânea, Vol. 44, 1 10 c 2015, Sociedade Brasileira de Matemática On the minimum neighborhood of independent sets in the n-cube Moysés da S. Sampaio Júnior Fabiano de S. Oliveira Luérbio Faria Paulo E. D. Pinto Abstract Let n, l be positive integers. This paper considers the problem of finding an independent set L having size l of the n-cube graph Q n such that the cardinality of the neighborhood of L is minimum, i.e., N(L) = min{ N(L ) : L = l and L is an independent set of Q n }, denoted by Opt(n, l). Such a problem is related to a more restricted version of that of constructing optimal Hamming-Huffman trees for t symbols uniformly distributed, an open problem since the eighties. We present a lower and an upper bound for Opt(n, l). These bounds are found to have the same values for a large number of instances, encouraging us to conjecture that either bound is actually Opt(n, l). 1 Introduction In data transmission processes, two tasks are usually done separately from each other: data compression and preparation for error detection. Paradoxically, the two phases have conflicting goals: while data compression shrinks the message as much as possible, data preparation for error 2000 AMS Subject Classification: 68R10. Key Words and Phrases: data compression, error detection, hypercube, independent set. Supported by CAPES and FAPERJ.

On the minimum neighborhood of independent sets in the n-cube 2 detection adds redundancy to messages so that a receiver can detect (or even better, fix) corrupted ones. Data compression can be achieved with the use of Huffman trees [Huf52]. In 1980, Hamming proposed the union of both compression and error detection through the use of a structure called Hamming-Huffman tree [Ham80]. This data structure compresses data similarly to Huffman trees with the additional feature of enabling the detection of any 1-bit error due to error transmission. A Huffman tree (HT) T is a rooted binary tree in which each edge (u, v), v being a left (resp. right) child of u, is labeled by 0 (resp. 1) and there is a one-to-one mapping between the set of leaves and Σ, the set of all distinct symbols from which a message M to be sent is consisted of. Given T, each symbol a of M is sequentially encoded into a binary string c(a), namely, the sequence of 0 s and 1 s found on the edges of the directed path from the root of T to the leaf corresponding to a. An optimal HT for M is a tree T such that a Σ p(a) c(a) is minimized, where p(a) stands for the frequency of a (probability of occurrence) and c(a) is the length of string c(a). A Hamming-Huffman tree (HHT) T is a HT except for the fact that, for each leaf labeled with a Σ, there must exist leaves e 1,..., e k such that each c(e i ), 1 i k, differs from c(a) for exactly one position. Therefore, k = c(a). Leaves e 1,..., e k are called error leaves of a. Under the assumption that when data is transmitted at most 1 bit can inadvertently be flipped, when c(e) is identified in the decoding process, where e is an error leaf, a transmission error is detected. Optimal HHTs are defined exactly the same as optimal HTs. Figure 1 depicts two HHTs associated with messages using symbols Σ = {a, b}. Error leaves are colored with black. In the tree on the right, leaves having encodings 000 and 111 are chosen as symbol leaves, resulting in 6 error leaves, whereas in the tree on the left, leaves having encodings 000 and 011 are the ones chosen as symbol leaves, resulting in 4 error nodes. Although optimal HTs are possible to be built efficiently in a greedy

On the minimum neighborhood of independent sets in the n-cube 3 fashion, the construction of HHTs are open since defined by Hamming in the eighties [Ham80]. Since different HHTs may have different number of error leaves, the problem of minimizing the number of error leaves seems to be related to the problem of building an optimal HHT, since error leaves being numerous may force symbol leaves to be chosen from higher levels of the tree, increasing the size of encodings. Figure 1: Examples of Hamming-Huffman trees. The problem considered in this paper is the following: given natural numbers n and l 2 n 1, determine the smallest number f(n, l) of error leaves in a HHT having height n and in which all the l symbol leaves are at level n. To illustrate the relation between computing f(n, l) and computing optimal HHTs, let us consider the constrained problem of building an optimal HHT for n symbols such that the leaves lie on at most two levels. The general idea of a dynamic programming that builds such a tree is as follows. A full binary tree having height lg n + 1 is enough to build a

On the minimum neighborhood of independent sets in the n-cube 4 HHT in which all symbols correspond to leaves at the last level (choose any n-size subset from the set of leaves encoded with an even number of 1 s). Therefore, for an optimal HHT having leaves only at levels h 1 and h 2, with h 1 h 2, we have h 1 lg n + 1. For each 1 h 1 lg n + 1, 1 l 1 min{n, 2 h 1 1 }, we consider mapping l 1 leaves to symbols on height h 1. The least number possible of error leaves is f(h 1, l 1 ) and thus r = 2 h 1 (l 1 + f(h 1, l 1 )) are free leaves (neither mapped to a symbol nor error leaves). If r > 0 and n > l 1, allocate the remaining n l 1 symbols in r trees (whose roots are the free leaves) and find a HHT for l 2 = (n l 1 )/r symbols having the minimum height h 2 possible. This second problem can be reduced to that of minimizing the number of error leaves, i.e., such h 2 is the smallest natural for which f(h 2, l 2 ) is defined. The cost of each HHT generated is therefore h 1 c(a) =h 1 p(a) + h 2 c(a) =h 2 p(a), and an optimal HHT is any of those having the smallest computed cost. 2 Neighborhood of independent sets in Q n Let G be a simple graph. The neighborhood of u V (G) is N G (u) = {v V (G) (u, v) E(G)}. Let U V (G) and define N G (U) = u U N G(u). When G is clear in the context, it may be omitted from the notation. A n-cube, denoted by Q n, is the graph having as vertex set all the binary numbers with size n (2 n vertices therefore) where two distinct vertices are adjacent if their binary representations differ only in one position. We can make a one-to-one mapping between the leaves of a HHT which is a full binary tree having height n to the vertices of Q n, and among those possible mappings, we shall assume the more natural one in which leaf a corresponds to c(a) V (Q n ). We define the parity of a binary representation v as the parity of the number of 1 s in v. It is shown in [Föl77] that Q n is a bipartite graph with partition E O, where E and O are the subsets of vertices of V (Q n ) with even parity and odd parity, respectively.

On the minimum neighborhood of independent sets in the n-cube 5 The problem of finding the minimum number of error leaves in a HHT which is a full binary tree T of height n with l symbol leaves is equivalent to that of finding, over all independent sets L of size l in n-cubes, one that minimizes N(L). This is so because U can be the set of symbol leaves in T if and only if c(u) and c(v) differ at least in two positions for all distinct u, v U if and only if L = {c(u) : u U} is an independent set of Q n. Thus, the set of errors leaves associated with U is precisely N(L) in Q n. Shifting to this perspective, given two integers n and l, the problem considered in this paper is alternatively to find an independent set L of Q n with L = l having the least neighborhood cardinality over all the possible independent sets with the same size. Such optimal cardinality will be denoted by Opt(n, l). Lemma 2.1. If the distance between u, v V (Q n ) is 2, then N(u) N(v) = 2. Lemma 2.2. If L is an independent set of Q n with L = l such that N(L) = Opt(n, l), then all vertices of L belong to a same part of the partition E O of V (Q n ), where E (resp. O) consists of vertices with even (resp. odd) parity. 3 Lower bound for Opt(n, l) This section presents a lower bound for Opt(n, l). Given a vertex v Q n, let v(i) denote the i-th binary digit of v and 0v (resp. 1v) be the binary number obtained from v by prefixing a 0 (resp. 1). Let (V 0, V 1 ) be a bipartition of V (Q n ) such that V 0 = {v V (Q n ) v(1) = 0} and V 1 = {v V (Q n ) v(1) = 1}. Let Q 0 = Q n [V 0 ] and Q 1 = Q n [V 1 ]. Clearly, Q 0 and Q 1 are isomorphic to Q n 1, with v V (Q n 1 ) mapped to vertices 0v V 0 and 1v V 1, respectively. Furthermore, for E 01 = {(0v, 1v) : v V (Q n 1 )}, E(Q n ) = E(Q 0 ) E(Q 1 ) E 01 holds. Theorem 1. Opt(n, l) min{max{l 1, Opt(n 1, l 0 )}+max{l 0, Opt(n 1, l 1 )} : l 0 + l 1 = l}.

On the minimum neighborhood of independent sets in the n-cube 6 Proof. Let L V (Q n ), L = l, be an independent set of Q n such that N(L) = Opt(n, l). As V 0 V 1 is a bipartition of V (Q n ), then (L V 0 ) (L V 1 ) is a bipartition of L and (N(L) V 0 ) (N(L) V 1 ) is a bipartition of N(L). Let l 0 = L V 0 and l 1 = L V 1. Since N(L) V 0 N(L V 0 ) V 0 whose cardinality is at least Opt(n 1, l 0 ), and N(L) V 0 N(L V 1 ) V 0 whose cardinality is at least l 1 because N(v) V 0 = 1 for all v V 1, then N(L) V 0 max{l 1, Opt(n 1, l 0 )} and, by symmetric argument, N(L) V 1 max{l 0, Opt(n 1, l 1 )}. So, we have that N(L) max{l 1, Opt(n 1, l 0 )}+max{l 0, Opt(n 1, l 1 )} min{max{l 1, Opt(n 1, l 0 )} + max{l 0, Opt(n 1, l 1 )} : l 0 + l 1 = l}. 4 Upper bound for Opt(n, l) We define an extension of the Breadth-First Search (BFS) in Q n, called Unrestricted Breadth-First Search and show some properties related to this search useful in helping to find an upper limit for Opt(n, l). In UBFS, each vertex becomes available to be visited again after its exploration is completed. A similar variation of DFS, called Unrestricted Depth-First Search, is described in [Szw88]. The UBFS tree differs from the classical BFS tree by the fact that some vertices appear more than once. Contrary to UDFS, in which even allowing multiple visits to a same vertex the search comes to an end in finite time, UBFS must impose some termination condition to avoid infinite search. In the context of this paper, the UBFS algorithm stops when the corresponding tree has n levels. For this section, we consider the levels numbered from 0 to n 1. Figure 2 shows a search tree for Q 5 starting with the vertex 0 (vertices are named in their decimal representation). The tree edges are represented by solid lines and cross edges are represented by dashed lines. In the case of UBFS in the Q n, cross edges always connect two nodes in consecutive levels. For the sake of clearness, the figure only depicts the cross edges incident to the second node of each level of the tree.

On the minimum neighborhood of independent sets in the n-cube 7 Figure 2: UBFS tree for the Q 5. It is easy to see that, in any UBFS tree for the Q n, all the nodes at even levels have the same parity of the root while nodes at odd levels have opposite parity. Theorem 2. The set of vertices at the level (n 1) of an UBFS tree for Q n is precisely either E or O, the parts of the partition E O of V (Q n ), where E (resp. O) consists of vertices with even (resp. odd) parity. The previous theorem justifies the termination condition of stopping the search at level n 1: the search beyond the (n 1)-th level will alternate the vertices of E and O for each new level. Lemma 4.1. Let S = s 1, s 2, s 3,, s n be an ordering of the n vertices of N(u), u Q n. Then N(s i ) N({s 1,, s i 1 }) = i, for 2 i n. Let R(i, j) be the number of leaves which are descendants of the node v = (i, j), which is the j-th node of i-the level of a particular UBFS tree, for all 1 j n, 0 i n 1. The value R(i, j) is said to be the number of leaves represented by v. We define R(i, j) = 0 for all 1 < j n when i = 0. Theorem 3 helps to partially evaluate the function R for the first n nodes of each level of the tree. In the sequence it is shown how to extend the function to all nodes.

On the minimum neighborhood of independent sets in the n-cube 8 Theorem 3. For all 0 i n 1 and 1 j n, we have: 0, if i = 0 and j > 1 1, if i = n 1 R(i, j) = 1 k n R(i + 1, k), if i < n 1, j = 1 R(i + 1, k), if i < n 1, j > 1 j+1 k n (1) For each node (i, j) in the tree, a(i, j) is its affiliation order, defined in terms of the position of (i, j) relatively to its siblings and the number of cross edges of its parent node. If (i, j) is the k-th child node and its parent has c cross edges, than a(i, j) = k + c. It can be shown that we can extend the function R to all nodes (i, j), j > n, of the tree, by considering R(i, j) = R(i, a(i, j)). Now let us consider the cardinality of the neighborhood of the set L composed by the first l leaves of the tree. This value can be obtained by expanding the search one level ahead and counting the descendants of those nodes. However, this expansion is not necessary. If L corresponds exactly to the set of leaves descendants of the first node of a level i > 0 of the tree, that is, node (i, 1), then such a value is the number of vertices with distance n i to (i, 1), which is R(i 1, 1). If the set L is fully represented by the first j < n nodes of level i, a similar argument shows that N(L) = 1 k j R(i 1, k). In general, the cardinality of the neighborhood for the leaves represented by node (i, j) that is not common to other leaves is R(i 1, a(i, j)). Now we describe a method to evaluate the cardinality of the neighborhood L. We search the tree looking for a minimal set S of nodes (i, j) whose union of its representation is L. For each node (i, j) of S, we accumulate R(i 1, a(i, j)). This evaluation can be made in time O(n) in a top-down fashion using the matrix R. Such a process is as follows. We start by searching the lowest line i of R having R(i, 1) l. Mark the cell (i, 1) and continue sequentially right in this line marking cells, while the sum of R(i, j) of marked cells (i, j) does not exceed l. When this happens (in column k + 1), continue the process in the following line,

On the minimum neighborhood of independent sets in the n-cube 9 column k + 1. Repeat the process in the next lines until the sum of R(i, j) of all marked cells (i, j) equals l. Then the sum of R(i 1, j) related to all marked cells is f(n, l) = N(L). Clearly, f(n, l) is an upper bound for Opt(b, l). 5 Conclusion This work was motivated by the problem of determining optimal Hamming-Huffman trees, an open problem for 30 years now. A particular case of the problem, namely when Hamming-Huffman trees allow symbol leaves lying at most at two levels, reduces to the problem of finding an independent set having a given size l of a n-cube with the smallest neighborhood cardinality over all such independent sets. We provide lower and upper bounds for the problem. Experimental results show that the lower and upper bounds are equal for all 1 n 22, 1 l 2 n 1. This fact may suggest that both bounds are the actual Opt(n, l) values in general. Acknowledgment: the authors are thankful to the referees for their thorough reports which improved substantially the final version. References [Föl77] S. Földes, A characterization of hypercubes, Discrete Math. 17 (1977), no. 2, 155 159. MR 0450136 [Ham80] Richard Wesley Hamming, Coding and information theory, Prentice-Hall, Inc., Englewood Cliffs, N.J., 1980. MR 555735 [Huf52] [Szw88] D. A. Huffman, A method for the construction of minimumredundancy codes, Proceedings of the IRE 40 (1952), no. 9, 1098 1101. J. L. Szwarcfiter, Grafos e algoritmos computacionais, 2 ed., Ed. Campus Ltda, 1988.

On the minimum neighborhood of independent sets in the n-cube 10 Moysés da S. Sampaio Júnior Universidade do Estado do Rio de Janeiro (UERJ) Rio de Janeiro, Brazil moyses.sampaio@gmail.com Fabiano de S. Oliveira Universidade do Estado do Rio de Janeiro (UERJ) Rio de Janeiro, Brazil fabiano.oliveira@ime.uerj.br Luérbio Faria Universidade do Estado do Rio de Janeiro (UERJ) Rio de Janeiro, Brazil luerbio@cos.ufrj.br Paulo E. D. Pinto Universidade do Estado do Rio de Janeiro (UERJ) Rio de Janeiro, Brazil pauloedp@ime.uerj.br