On the Readability of Monotone Boolean Formulae

Similar documents
On the Readability of Monotone Boolean Formulae

A Study on Monotone Self-Dual Boolean Functions

A Collection of Problems in Propositional Logic

On the Gap Between ess(f) and cnf size(f) (Extended Abstract)

R u t c o r Research R e p o r t. Relations of Threshold and k-interval Boolean Functions. David Kronus a. RRR , April 2008

arxiv: v2 [cs.ds] 17 Sep 2017

MATHEMATICAL ENGINEERING TECHNICAL REPORTS. Deductive Inference for the Interiors and Exteriors of Horn Theories

f 0 (a,b,c,w,x,y,z) = ay cxy bw bz is a read-once function since it can be factored into the expression

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Evaluation of DNF Formulas

Polynomial Certificates for Propositional Classes

8.1 Polynomial Threshold Functions

On the Complexity of Some Enumeration Problems for Matroids

Factoring and Recognition of Read-Once Functions using Cographs and Normality and the Readability of Functions Associated with Partial k-trees

[3] R.M. Colomb and C.Y.C. Chung. Very fast decision table execution of propositional

Comp487/587 - Boolean Formulas

A Lower Bound of 2 n Conditional Jumps for Boolean Satisfiability on A Random Access Machine

Specifying a positive threshold function via extremal points

Minimization of Matched Formulas

Complexity Theory VU , SS The Polynomial Hierarchy. Reinhard Pichler

Outline. Complexity Theory EXACT TSP. The Class DP. Definition. Problem EXACT TSP. Complexity of EXACT TSP. Proposition VU 181.

Logical Design of Digital Systems

Generating Maximal Independent Sets for Hypergraphs with Bounded Edge-Intersections

On the Sensitivity of Cyclically-Invariant Boolean Functions

Read-Once Threshold Formulas, Justifying Assignments, Nader H. Bshouty 1. Thomas R. Hancock 2. Lisa Hellerstein 3. Marek Karpinski 4 TR

Efficient Approximation for Restricted Biclique Cover Problems

Self-duality of bounded monotone boolean functions and related problems

A Lower Bound for Boolean Satisfiability on Turing Machines

Notations. For any hypergraph H 2 V we have. H + H = and H + H = 2 V.

Complexity of DNF and Isomorphism of Monotone Formulas

CS 151 Complexity Theory Spring Solution Set 5

Part V. Intractable Problems

A note on unsatisfiable k-cnf formulas with few occurrences per variable

NP-Completeness of Refutability by Literal-Once Resolution

A necessary and sufficient condition for the existence of a spanning tree with specified vertices having large degrees

Out-colourings of Digraphs

On the correlation of parity and small-depth circuits

Database Theory VU , SS Complexity of Query Evaluation. Reinhard Pichler

CSE 555 HW 5 SAMPLE SOLUTION. Question 1.

Complete problems for classes in PH, The Polynomial-Time Hierarchy (PH) oracle is like a subroutine, or function in

Non-Deterministic Time

Normal Forms of Propositional Logic

i times p(p(... (p( n))...) = n ki.

NP-completeness. Chapter 34. Sergey Bereg

Lecture 6,7 (Sept 27 and 29, 2011 ): Bin Packing, MAX-SAT

arxiv: v1 [cs.ds] 20 Nov 2016

Trichotomy Results on the Complexity of Reasoning with Disjunctive Logic Programs

Solving SAT for CNF formulas with a one-sided restriction on variable occurrences

R u t c o r Research R e p o r t. Uniform partitions and Erdös-Ko-Rado Theorem a. Vladimir Gurvich b. RRR , August, 2009

CS Introduction to Complexity Theory. Lecture #11: Dec 8th, 2015

Classes of Boolean Functions

Dual-Bounded Generating Problems: Weighted Transversals of a Hypergraph

1 The Low-Degree Testing Assumption

1 Basic Combinatorics

On the Gap between the Complexity of SAT and Minimization for Certain Classes of Boolean Formulas

ON SENSITIVITY OF k-uniform HYPERGRAPH PROPERTIES

An Intersection Inequality for Discrete Distributions and Related Generation Problems

On the Structure and the Number of Prime Implicants of 2-CNFs

A Tutorial on Computational Learning Theory Presented at Genetic Programming 1997 Stanford University, July 1997

Inference of A Minimum Size Boolean Function by Using A New Efficient Branch-and-Bound Approach From Examples

Counting Peaks and Valleys in a Partition of a Set

Partitions and Covers

Sensitivity, Block Sensitivity and Certificate Complexity of Boolean Functions (Master s Thesis)

INSTITUTE OF MATHEMATICS THE CZECH ACADEMY OF SCIENCES. A trade-off between length and width in resolution. Neil Thapen

Nash-solvable bidirected cyclic two-person game forms

Approximation Algorithms for Stochastic Boolean Function Evaluation and Stochastic Submodular Set Cover

Welsh s problem on the number of bases of matroids

1 PSPACE-Completeness

Resolution for Predicate Logic

The Strength of Multilinear Proofs

CSC 2429 Approaches to the P vs. NP Question and Related Complexity Questions Lecture 2: Switching Lemma, AC 0 Circuit Lower Bounds

On the Complexity of Enumerating Pseudo-intents

Proclaiming Dictators and Juntas or Testing Boolean Formulae

THE POSTDOC VARIANT OF THE SECRETARY PROBLEM 1. INTRODUCTION.

Evaluation of Monotone DNF Formulas

CS6840: Advanced Complexity Theory Mar 29, Lecturer: Jayalal Sarma M.N. Scribe: Dinesh K.

A Note on Maxflow-Mincut and Homomorphic Equivalence in Matroids

Approximating maximum satisfiable subsystems of linear equations of bounded width

Endre Boros b Vladimir Gurvich d ;

Maximum 3-SAT as QUBO

arxiv: v1 [cs.cc] 16 Mar 2017

Factorised Representations of Query Results: Size Bounds and Readability

Yale University Department of Computer Science

Lecture 5: February 21, 2017

Space complexity of cutting planes refutations

ON POSITIVE, LINEAR AND QUADRATIC BOOLEAN FUNCTIONS

Nondeterminism LECTURE Nondeterminism as a proof system. University of California, Los Angeles CS 289A Communication Complexity

ON IMPLICATIONAL BASES OF CLOSURE SYSTEMS WITH UNIQUE CRITICAL SETS

De Morgan s a Laws. De Morgan s laws say that. (φ 1 φ 2 ) = φ 1 φ 2, (φ 1 φ 2 ) = φ 1 φ 2.

Exercises 1 - Solutions

On the P versus NP intersected with co-np question in communication complexity

CS 395T Computational Learning Theory. Scribe: Mike Halcrow. x 4. x 2. x 6

COL 352 Introduction to Automata and Theory of Computation Major Exam, Sem II , Max 80, Time 2 hr. Name Entry No. Group

Projective DNF Formulae and Their Revision

The Lefthanded Local Lemma characterizes chordal dependency graphs

MULTIPLICITIES OF MONOMIAL IDEALS

On Minimum Representations of Matched Formulas

Tutorial Obtain the principal disjunctive normal form and principal conjunction form of the statement

Part 1: Propositional Logic

Finding Low Degree Annihilators for a Boolean Function Using Polynomial Algorithms

Transcription:

On the Readability of Monotone Boolean Formulae Khaled Elbassioni Kazuhisa Maino Imran Rauf January 30, 2009 Abstract Golumbic et al. [Discrete Applied Mathematics 1542006 1465-1477] defined the readability of a monotone Boolean function f to be the minimum integer such that there exists an -formula equivalent to f in which each variable appears at most times. They ased whether there exists a polynomial-time algorithm, which given a monotone Boolean function f, in CNF or DNF form, checs whether f is a read- function, for a fixed. In this paper, we paratially answer this question already for = 2 by showing that it is NP-hard to decide if a given monotone formula represents a read-twice function. It follows also from our reduction that it is NP-hard to approximate the readability of a given montone Boolean function f : {0, 1} n {0, 1} within a factor of On. We also give tight subinear upper bounds on the readability of a montone Boolean function given in CNF or DNF form, paramterized by the number of terms in the CNF and the maximum size in each term, or more generally the maximum number of variabels in the inetrsection of any constant number of terms. When the varaibles of the DNF can be ordered so that each term consists of a sset oc consecutive variables, we give much tgher polylogarithmic bounds on the readability. 1 Introduction Let f : {0, 1} n {0, 1} be a monotone Boolean function, i.e. for any x, x {0, 1} n, x x implies fx fx. One property of such functions is that they can be represented by negation-free Boolean formulae. A minterm maxterm of monotone Boolean function fx 1,..., x n is a minimal set of variables which, if assigned the value 1 resp., value 0, forces the function to tae the value 1 resp., value 0 regardless of the values assigned to the remaining variables. It is wellnown that the irredundant disjunctive normal form DNF and conjunctive Max-Planc-Institut für Informati, Saarbrücen, Germany; {elbassio,irauf}@mpiinf.mpg.de Department of Mathematical Informatics, University of Toyo, Toyo, Japan; maino@mist.i.u-toyo.ac.jp 1

normal form CNF of monotone Boolean function f consist respectively of all of its minterms and maxterms cf. [Weg87]. A monotone read- formula is a Boolean formula over the operators {, } in which each variable occurs at most times. The readability of f, denoted by readabilityf, is the minimum such that f can be represented by a monotone read- formula. We also call f a read- function when it has readability. Finding the readability of an arbitrary Boolean function and computing a formula which achieves this readability has applications in circuit design among others and therefore is one of the earliest problems considered in Computer Science. Given a monotone Boolean function in one of the normal forms CNF/DNF, a complete combinatorial characterization for it to be read-once was given by Gurvich [Gur77, Gur91]. A polynomial-time algorithm based on this criterion is given by Golumbic et al. [GMR06] to decide whether a given CNF or DNF is read-once. The algorithm also computes the unique read-once representation when a read-once function is given as input. For 2, no characterization is nown for a given monotone Boolean CNF or DNF to be read-, and in fact, Golumbic et al. ased in [GG08] whether there exists a polynomial-time algorithm, which given a normal monotone Boolean function f in CNF or DNF form, checs whether f is a a read- function, for a fixed. The case when the function is given by an oracle has also been considered in the machine learning community. It is shown in [AHK93] that given a read-once function by a membership oracle, we can compute its read-once representation in polynomial time. However, the correctness of the algorithm is based on the assumption that the function provided as an oracle is read-once. If its not read-once then the algorithm terminates with incorrect output. In this paper, we show that, given an vee-formula, it is NP-hard to chec if it represents a read-twice function f. This partailly answers the question of Golumbic et al. [GMR06], but leaves open the case when f is given by the CNF or DNF normal form. It follows also from our reduction that it is NP-hard to approximate the readability of a given montone Boolean function f : {0, 1} n {0, 1} within a factor of On. It follows from a result in [Weg87] that almost all monotone Boolean functions on n variables, in which each minterm has size exactly, have readability Ωn 1 log 1 n. Assuming that the function is given by its irredundant DNF or CNF of m minterms, this implies a lower bound of Omegam 1 1 on the readability. This naturally raises the question whether this bound is tight, i.e for any monotone CNF formula of m terms, there exists an equivalent read- Om 1 1 representation. In this paper, we show that this indeed the case, and moreover that such a represnetation can be found in polynomila time. In fact, we prove a more general result. For integers p, d 0, let us say a monotone CNF f has p, d-bounded intersection [] if every at d terms intersect in at most p vairables. We show that any such CNF has read-op + d + 1m 1 1 p+1 representation which can be found in polynomial-time. Confronted with this almost tight sublinear bound on readability, an intersting question is whether it can be improved for interesting special cases. For the class of interval DNF s, i.e. those 2

for which there is an ordering on the variables such that each term contains only consecutive variables in that ordering, we show that readability is at most Olog 2 m. The paper is organized as follows. In the next section, we point out that the characterization of [Gur77, Gur91] for read-once functions does not carry over to read- functions already for = 2. In Section 3, we present upper bounds on the readability of some classes monotone Boolean DNF resp. CNF that depends only on the number of terms in the normal form. In Section 4 we show that the finding the readability in general is hard when the input formula is not a DNF or CNF. We also give an On inapproximability result in this case. 2 On Generalization of Read-once Functions An elegant characterization of read-once functions is provided by the following theorem of Gurvich. Theorem 1 [Gur77, Gur91] For any monotone Boolean function f the following two statements are equivalent: 1. f is read-once. 2. Every minterm and maxterm of f intersect in exactly c = 1 variable. However, this result does not generalizes to read-twice functions as the following example shows. Consider the read-twice formula gx 1,..., x n, y 1,..., y n = x i y i x 1... x n. 1 i n It is easy to see that the g has a minterm x 1... x n which intersects with the maxterm x 1... x n in n variables. Hence hypergraphs corresponding to read-twice functions do not necessarily satisfy Condition 2 of Theorem 1 for any constant c. Conversely, a generalization of Condition 3 to any constant c is also not sufficient for a function to be read-c, as implied by the following result on the shortest possible size of -homogeneous DNF where the size of each term is exactly and hence each minterm and maxterm intersect in at most. Theorem 2 cf. [Weg87] For an integer, let H n be the class of monotone Boolean functions on n variables such that size of every minterm is exactly. The monotone formula size of almost all h H is Ωn log 1 n. Theorem 2 implies that the readability of almost all h H is Ωn 1 log 1 n, since otherwise the formula achieving a smaller readability has smaller then shortest possible size. 3

3 Upper Bounds In this section, we consider various classes of monotone Boolean DNF s and give upper bounds on their readability. First we consider Interval DNF s whose terms correspond to consecutive variables, given some ordering on variables. Next, we consider p-intersecting DNF where every two of its terms intersect in at most p variables and give an almost tight upper bound on their readability. Finally, we consider a special case of teh latter class, namely -DNF, where the size of each term is bounded by and again give a tight upper bound on their readability. Even though we get the same upper bound implied by the more genral case, the formula computed by our algorithm has only depth 2 in this case. In our description of the algorithms, we use set-theoretic notations to describe various operations on the structure of DNF s. In this sense, we treat the DNF φ = t i as its corresponding hypergraph {t i t i is a term in φ}. For example, we write t φ when t is term of φ and similarly by x t we mean that the term t contains variable x. Let us denote the degree of a variable in φ by deg φ x, which is the number of terms in φ containing x V. For a Boolean formula f and a literal x resp. set of literals S in f, we denote by f x=1 resp. f S=1 the resulting f after replacing every occurrence of x resp. x S in f with 1. 3.1 Interval DNF A monotone Boolean DNF I = I I x I x is called interval DNF if there is an ordering of variables V = {x 1, x 2,..., x n } such that each I I contains only consecutive elements from the ordering. We show that an interval DNF containing m terms is Olog 2 m-readable. For a variable x j V, let I <xj = {I I : I {x 1,..., x j 1 }}, I >xj = {I I : I {x j+1,..., x n }} and I xj = {I I : x j I}. For a term I = x i x i+1... x j in interval DNF I, we call x i and x j its left and right end-points, and denote them with LI and RI respectively. We also denote the first resp. last term in the ordering of terms of I with respect to their left end point as firsti and lasti respectively. The algorithm is given in Figure 1. It proceeds by choosing a variable x j such that at most half of the intervals are completely on the left I l and half on the right I r. The formulae for I l and I r are computed recursively and the remaining terms in I xj are divided into two halves I 1 and I 2 by considering them in order with respect to their left end-point. The algorithm then factors out common variables from I 1 and I 2 and computes their equivalent formulae recursively. Theorem 3 Let I be an interval DNF containing m terms. Then I is Olog 2 m- readable. Proof. We show that the procedure REDUCE1I returns a formula with Olog 2 m-readability given an interval DNF. Let r 1 m and r 2 m be the readability of the formulae generated by the procedures REDUCE1I and REDUCE2I, 4

Procedure REDUCE1I: Input: A monotone Boolean interval DNF I = m j=1 I j on variables ordering x 1,..., x n Output: A Olog 2 m readable formula ψ equivalent to I 1. if I 2 then return the read-once formula representing I 2. Let x j V such that I <xj m 2 and I >x j m 2 3. return REDUCE1I <xj REDUCE2I xj REDUCE1I >xj Procedure REDUCE2I: Input: A monotone Boolean interval DNF I s.t. m j=1 I j Output: A Olog m readable formula ψ equivalent to I 1. if I 2 then return the read-once formula representing I 2. Consider terms in I in order of their left end points, let I 1 resp. I 2 be first half resp. remaining half elements of I. 3. Let φ 1 resp. φ 2 be maximum set of variables that occur in every term of I 1 resp. I 2 4. t 1 := firsti 1, t 2 := lasti 1, t 3 := firsti 2, t 4 := lasti 2 5. ψ 1 = REDUCE2I 1 \ {t 1, t 2 } φ1=1 6. ψ 2 = REDUCE2I 2 \ {t 3, t 4 } φ2=1 7. return φ 1 ψ 1 t 1 φ1=1 t 2 φ1=1 φ 2 ψ 2 t 3 φ2=1 t 4 φ2=1 Figure 1: An algorithm to find an Olog 2 m-readable formula for interval DNF consisting of m intersecting terms 5

respectively, when given an interval DNF I containing m terms as input. Let x j be the variable chosen in Step 2 of the algorithm. Since the subproblems I <xj and I >xj are disjoint and have size at most m/2, the recurrence for readability computed by REDUCE1I is r 1 m r 1 m/2 + r 2 m. Similarly, given an intersecting interval DNF I the procedure REDUCE2I divides the problem into subproblems I 1 and I 2 respectively. Note that the subproblems in the recursive call i.e. I 1 \ {firsti 1, lasti 1 } φ1=1 and I 2 \ {firsti 2, lasti 2 } φ2=1 are again intersecting since I 1 and I 2 are irredundant. For calculating the readability of the formula computed by REDUCE2I, consider the case when a variable x i occurs in both subproblems. We show that if x i does not occur in φ 1 resp. φ 2 then it is necessarily be the case that it appears in φ 2 resp. φ 1 and thus occuring only once in at least one of the subproblem. Note that since I is irredundant, the set φ 2 forms the interval [LlastI 2, RfirstI 2 ]. Also observe that since x i occurs in both subproblems and not in φ 1, it must lie in the interval [RfirstI 1, RlastI 1 ]. It is easy to see that the later interval is the subset of φ 2 since RlastI 1 appears before RfirstI 2 in the ordering of variables because of the definition of I 1 and I 2. Also because of the assumption that I is intersecting, LlastI 2 appears before RfirstI 1 in the ordering. So the maximum readability of the formula generated by REDUCE2I where I consists of m terms satisfies r 2 m 2 + r 2 m/2. Solving the recurrenses yields the stated bound on the readability of I. 3.2 p-intersecting DNF A monotone Boolean DNF is called p-intersecting if every two of its distinct terms intersect in at most p variables. For instance, a quadratic DNF is 1- intersecting and -DNF, i. e., DNF where the size of each term is bounded by is 1-intersecting. In this section, we give a p + 1m 1 1 p+1 bound on the readability of p-intersecting DNF containing m-terms. Theorem 2 implies that this bound is almost tight. By considering p + 1-homogeneous DNF containing m = Θn p+1 terms we get, Corollary 1 For a constant p, let G p be the class of monotone Boolean DNF on n variables with m terms such that size of every minterm is exactly p + 1. The readability of almost all g G p is Ωm 1 1 p+1 log 1 n. Let φ be a p-intersecting monotone Boolean DNF on variables V = {x 1,..., x n }. The algorithm is given in Figure 2. It wors by picing a variable x with high degree in φ and recursively compute a formula equivalent to the part of φ where x occurs. The algorithm stops when every variable has low degree in the remaining expression. More precisely, for a variable x V, let φ x be the DNF consist of terms of φ which contain x, i.e. φ x = t φ,x t t. Note that if φ is p-intersecting then φ x x=1 is p 1-intersecting DNF, so the algorithm recurs when p > 1 and otherwise it returns the read-once formula x φ x. The next Theorem bound the readability of the formula generated by the algorithm. 6

Procedure REDUCE3φ, p: Input: A monotone Boolean p-intersecting DNF φ on variables set V Output: A p + 1m 1 1 p+1 readable formula ψ equivalent to φ 1. ψ := 0, m := φ 2. while x V s.t. deg φ x m 1 1 p+1 3. let φ x = t φ,x t t 4. φ := φ \ φ x 5. if p > 1 then 6. ψ := ψ x REDUCE2φ x x=1, r 1 7. else 8. ψ := ψ x φ x x=1 9. return φ ψ Figure 2: An algorithm to find p+1m 1 1 p+1 readable formula for p-intersecting DNF consists of m terms Theorem 4 Given a monotone Boolean DNF µ which is p-intersecting for p 1. The formula µ = REDUCE2µ, p is p + 1m 1 1 p+1 readable and it is equivalent to µ. Proof. The proof is by induction on p. When p = 1, the while loop in Step 2 ensure that every variable in φ has degree less then m after the loop ends. Moreover, a read-once formula is added to ψ in each iteration of while loop. Since there are at most m iterations, the formula φ ψ in Step 9 has readability at most m + m. Now assume that the claim is true for p 1 intersecting DNF, where p 2. We prove it for p-intersecting DNF using similar arguments as in the previous paragraph. After the while loop ends, every variable in the remaining φ has degree less then m 1 1 p+1. Let m1,..., m q be number of terms removed from φ in each iteration of while loop, where q is the number of iterations. Note that q can be bounded from above by m 1 p+1 since each mi is at least m 1 1 p+1. Now, 7

denoting the readability of p-intersecting DNF on m terms by r p m, we have, q r p m m 1 1 p+1 + r p 1 m i 1 i=1 q m 1 1 p+1 + pm 1 1 p i i=1 2 q m 1 1 p+1 + p i=1 m 1 1 p i q 3 q m 1 1 p+1 + p m 1 1 p q 1 p 4 m 1 1 p+1 + p m 1 1 p+1 5 where we apply induction hypothesis to get Equation 2 and use Jensen s inequality to get Equation 3. The correctness of the procedure is straightforward since the invariant that φ ψ is equal to µ holds after completion of every iteration. Note that the algorithm produce a depth p formula for a p-intersecting DNF. In the next section we will see that we can do much better in this regard for the a subclass of p-intersecting DNF, namely the class of DNF where the size of each term is bounded by p + 1. 3.3 -DNF A monotone Boolean DNF is called -DNF if every term in it has size at most. In this section, we give an algorithm to compute 2m 1 1/ readable formula of depth two and equivalent to given -DNF. We need the following definitions. A sunflower with p petals and a core Y is a collection of sets S 1,..., S p such that S i S j = Y for all i j and none of the sets S i Y is empty. We allow the core Y to be empty however, so every pairwise disjoint family of sets constitutes a sunflower. Lemma 1 Sunflower Lemma [ER60] Let H 2 V be a hypergraph with m = H and size of each edge is bounded by. If m >!p then H contains a sunflower with p + 1 petals. Since a sunflower has a straightforward read-once representation, the above lemma immediately gives an upper bound on the readability of -DNF with m terms. The algorithm wors by finding a sunflower with certain minimal size, representing them as read-once formula and recurse on the remaining edges. Theorem 5 Let f be a monotone Boolean DNF with m terms such that the size of each term in f is bounded by then f is 2m 1 1/ -readable. 6 8

Proof. Any -DNF with m terms contains a sunflower of size at least m/! 1/ which we remove and recurse on the remaining terms. Let rm denote the readability of boolean -DNF with m terms then the readability of f can be bounded by the recurrence rm 1 + rm m/! 1/ with r2 = r1 = 1. By using the inequality! and substituting rm = 2m 1 1/ in the 1 1 above recurrence we get g, m = 2m 1 1 1 1 m 1 1 1. In the appendix we prove the follwoing claim: Claim 1 For 2 and m 1, the function g, m is monotonacally decreasing in m and monotonacally increasing in. Thus the minimum of g is attained when = 2 and m approaches infinity. The minimum value is 1 and hence rm 2m 1 1/. We note that the proof of Lemma 1 is constructive and a sunflower of desired size can be computed in time polynomial in number of variables and terms of a DNF. 4 Hardness and Inapproximability In this section, we show that finding the readability of a given monotone Boolean formula is NP hard. The reduction we use is gap-introducing and so it also gives hardness of approximating readability unless P = NP. Our reduction is from the well-nown NP-complete problem of deciding satisfiability of a given Boolean 3-CNF Φx 1... x n = m j=1 Φ j. For all i [n] and j [m], let us define new variables y ij, y ij, z ij for a literal x i in clause Φ j and variables z ij, z ij, y ij for a literal x i in clause Φ j. Let φy 11... y nm, z 11... z nm be the monotone CNF we get from Φx 1... x n by substituting y ij for x i in Φ j and z i j for x i in Φ j such that φy, z Φx, for y ij = x i and z ij = x i, i [n], j [m]. 7 Furthermore, let I i = {j : x i Φ j } {j : x i Φ j }, we define, n ρy, z = y ij z ij 8 i=1 j I i j I i and ψy, z, y, z = x i Φ j y ij z ij x i Φ j y ijz ij 9 Now consider the following Boolean function fy, z, y, z = φy, z ρy, z ψy, z, y, z 10 9

y y y 12 22 32 y 11 y 13 y 21 y 23 y 31 y 33 y 11 z 12 y 13 s z 11 z 13 z 21 z 23 z 31 z 33 z 12 z 22 z 32 y 11 y 21 y 22 z 31 y 32 z 23 z 33 t z 11 y 33 z 33 y 11 z 11 y 33 z 33 Figure 3: Applying reduction in Equation 10 to 3-CNF Φ = x 1 x 2 x 3 x 1 x 2 x 3 x 1 x 2 x 3. Minimal s t paths in the figure correspond to minterms of f, whereas minimal s t cuts are maxterms of f. Note that the size of f is 15m, where m is number of clauses in Φ. The next lemma shows that finding the readability of Boolean formula f defined in Equation 10 is equivalent to solving satisfiability for Φx. Lemma 2 The monotone Boolean function f in Equation 10 is read-2 if and only if Φx is satisfiable. Its read-once otherwise. Proof. Denote the two disjuncts in f by f 1 = φy, z ρy, z and f 2 = ψy, z, y, z. We first show that the minterms of f 1 which are not absorbed by minterms of f 2 correspond precisely to the satisfiable assignments of Φ and so f = ψ is clearly a read-once function if Φ is not satisfiable. Let ˆx be a satisfiable assignment of Φx. Since ˆx maes atleast one literal true in each clause of Φx, the set t φ = {y ij ˆx i = 1} {z ij ˆx i = 0} contains a minterm t φ of φy, z. Similarly, note that the set t ρ = {y ij ˆx i = 1} {z ij ˆx i = 0} defines a minterm of ρy, z, and so the set t = t φ t ρ is a minterm of f 1. It is easy to chec that t does not contain any minterm of f 2 since for all i [n] and j [m], atmost one from each pair y ij, z ij and y ij, z ij are members of t. Conversely, any minterm t of f 1 contains one of y i1... y im or z i1... z im for all i [n] to cover the conjunct ρ. Assume t is not absorbed by any term of f 2. Consequently, t does not contain both y ij z ij or y ij z ij for all i [n] and j [m]. Therefore it must contain from each clause φ j, at least one of the variable y ij or z ij consistent with the primed variable selected from ρ. Hence the assignment x i = 1 if y ij t and x i = 0 if z ij t satisfies Φx. It only remains to prove that f is not a read-once function when Φx is satisfiable. Assume wlog that variable x 1 appears in clause Φ 1. Let us define a maxterm c of f by c = {y 11, z 11} i [n],j [m] {y ij, y ij } and consider the minterm t of f corresponding to a satisfiable assignment ˆx of Φ as defined above. It is easy to see that t c > 1 since for any literal x i appears in clause Φ j such that ˆx i = 1, t would contain both y ij and y ij. Hence f is not a read-once function because of Theorem 1. Note that it is read-2 since we have Equation 10 as its read-2 representation. 10

Since f in Equation 10 is compose of two read-once formulae, Lemma 2 also implies the hardness of determining if a given monotone formula is disjunction of two read-once formulae. Corollary 2 It is NP-hard to now whether a given monotone Boolean formula is a read-once function or a disjunction of two monotone read-once functions. Another interesting problem for which we get a hardness result as a corollary of Lemma 2 is the problem of generating all minterms or maxterms of given monotone Boolean formula. Note that the problem is trivial when the input formula is read-once since all minterms or all maxterms can be generated without repetition by straightforward multiplication. Lemma 3 Let F be the class of monotone Boolean formulae in which each variable appears at most twice. For a formula f F, let C and D denote the sets of the maxterms and the minterms of f, respectively. i Given a formula f F and a subset C of C, it is conp-complete to decide whether C = C. ii Similarly, for a formula f F and a subset D of D, it is conp-complete to decide whether D = D. Proof. Note that since the class F is closed under duality, both parts of the theorem are equivalent. The hardness of ii implied immediately from Lemma 2 by setting D = {t t is a term in ψ}. The possibly remaining minterms in D \ D correspond to satisfiable assignments of Φ. In the following, we generalize the reduction introduced in Equation 10 and get an inapproximability result for the problem of determining readability of given monotone Boolean formula. We use a result of Gál [G 02] that gives an explicit monotone Boolean function α on s variables such that the size of the shortest monotone formula representing α is s Ωlog s, moreover its irredundant monotone DNF has size s Olog s. Note that the readability of α is also s Ωlog s, since otherwise we could represent α by a formula with smaller then shortest possible size. We define the following reduction, f w, y, z, y, z = φy, z ρy, z αw ψy, z, y, z 11 where the size of f is 15m + s Olog s. Note that if Φ is satisfiable, f has readability s Θlog s by applying the same reasoning as in Lemma 2. By choosing s and m such that m = s c1 log s and m = c 2 n for a suitable constants c 1, c 2, we get the following, Corollary 3 There is no polynomial-time algorithm to approximate the readability of given monotone Boolean formula f within factor of On, unless P = NP. 11

References [AHK93] Dana Angluin, Lisa Hellerstein, and Mare Karpinsi. Learning readonce formulas with queries. J. ACM, 401:185 210, 1993. [ER60] P. Erdös and R. Rado. Intersection theorems for systems of sets. J. London Math. Soc., 35:85 90, 1960. [G 02] Anna Gál. A characterization of span program size and improved lower bounds for monotone span programs. Comput. Complex., 104:277 296, 2002. [GG08] Martin Charles Golumbic and Vladimir Gurvich. Read-once functions. In Yves Crama and Peter Ladislaw Hammer, editors, Boolean Functions: Theory, Algorithms and Applications. Cambridge University Press, 2008. [GMR06] Martin Charles Golumbic, Aviad Mintz, and Udi Rotics. Factoring and recognition of read-once functions using cographs and normality and the readability of functions associated with partial -trees. Discrete Applied Mathematics, 15410:1465 1477, 2006. [Gur77] [Gur91] [Weg87] Vladimir Gurvich. On repetition-free boolean functions. Uspehi Mat. Nau. Russian Math. Surveys, 32:183 184, 1977. in Russian. Vladimir Gurvich. Criteria for repetition-freeness of functions in the algebra of logic. Soviet Math. Dol., 433:721 726, 1991. Ingo Wegener. The complexity of Boolean functions. John Wiley & Sons, Inc., New Yor, NY, USA, 1987. Appendix: Proof of Claim 1. Assume for the rest of the proof that 2 and m 1. We first observe that the derivative of g with respect to is positive 12

and so g is increasing in. Substituting x = m 1+ 1 /, we obtain, 1 1 x 1 1 g 2 =2 1 1 x 1 1 m 1 1 + 2 1 x 1 1 1 1 x + x log m 2 + 1 x =2m 1 1 1 1 x 1 1 1 + log m 1 x 1 1 x 1 1 1 + log m + 1 x =2m 1 1 1 + log m 1 1 x 1 1 =2m 1 1 1 + log m x1 1 1 x 1 1 1 2x + x 1 x 1 logmm 1 1 log 1 x 2 log 1 x 2 m 1 1 + log1 x1 x1 1 1 + log m log1 x1 x1 1 1 + log m Only the last factor in the above equation has negative terms. Note that x = m 1+ 1 / is always positive and at most 1/2 when 2 and m 1. Consequently, the term 1 2x + x /1 x 1 has value at most 1 and so resulted in positive quanitity when subtracted from 1. Furthermore, the factor log1 x is negative and therefore maing the last term positive as well. Simililarly, the derivative of g with respect to m is, g m =2 1 1 1 = m 2 1 1 1 m 1 1 1 1 m 1 1 m 1 1 which is negative since the term m 1 1 1 1 m 1/ + 1 m 1 1 1 1 1 m 1 1 m 1 + 1 m 1 1 + 1 1 m 1 1 1 1/ 1 + 1 m 1 1 is increasing and approaches 1 1 as m goes to infinity. 13