Corrigendum to On the undecidability of implications between embedded multivalued database dependencies [Inform. and Comput. 122 (1995) ]

Similar documents
Axiomatizing Conditional Independence and Inclusion Dependencies

CS632 Notes on Relational Query Languages I

Guaranteeing No Interaction Between Functional Dependencies and Tree-Like Inclusion Dependencies

Stat 451: Solutions to Assignment #1

Another algorithm for nonnegative matrices

A MODEL-THEORETIC PROOF OF HILBERT S NULLSTELLENSATZ

Functional Dependencies

A CORRECTED 5NF DEFINITION FOR RELATIONAL DATABASE DESIGN. Millist W. Vincent ABSTRACT

Relational Database Design

On the Interaction of Inclusion Dependencies with Independence Atoms

Schema Refinement: Other Dependencies and Higher Normal Forms

Herbrand Theorem, Equality, and Compactness

arxiv: v1 [math.ra] 11 Jan 2019

Jónsson posets and unary Jónsson algebras

arxiv: v1 [cs.db] 17 Apr 2014

Boundedly complete weak-cauchy basic sequences in Banach spaces with the PCP

Löwenheim-Skolem Theorems, Countable Approximations, and L ω. David W. Kueker (Lecture Notes, Fall 2007)

More Model Theory Notes

FREE PRODUCTS AND BRITTON S LEMMA

On the logical Implication of Multivalued Dependencies with Null Values

Optimal global rates of convergence for interpolation problems with random design

A Comparative Study of Noncontextual and Contextual Dependencies

Discrete Mathematics: Lectures 6 and 7 Sets, Relations, Functions and Counting Instructor: Arijit Bishnu Date: August 4 and 6, 2009

Journal of Computer and System Sciences

On a problem of Fagin concerning multivalued dependencies in relational databases

The continuity method

Automata Theory and Formal Grammars: Lecture 1

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Global Database Design based on Storage Space and Update Time Minimization

9. Birational Maps and Blowing Up

The universal algebra generated by a power partial isometry

5 Set Operations, Functions, and Counting

Some hard families of parameterised counting problems

The Banach-Tarski paradox

Lattices, closure operators, and Galois connections.

A finite basis theorem for product varieties of groups

int cl int cl A = int cl A.

Automorphism Groups of Simple Moufang Loops over Perfect Fields

Homological Decision Problems for Finitely Generated Groups with Solvable Word Problem

Linear Algebra March 16, 2019

Valence automata as a generalization of automata with storage

TROPICAL SCHEME THEORY

Foundations of Mathematics

October 15, 2014 EXISTENCE OF FINITE BASES FOR QUASI-EQUATIONS OF UNARY ALGEBRAS WITH 0

The Emptiness Problem for Valence Automata or: Another Decidable Extension of Petri Nets

THE CORONA FACTORIZATION PROPERTY AND REFINEMENT MONOIDS

Mathematics Course 111: Algebra I Part I: Algebraic Structures, Sets and Permutations

Characterising FS domains by means of power domains

Tuples of Disjoint NP-Sets

General Topology. Summer Term Michael Kunzinger

1 xa 2. 2 xan n. + c 2 x α 2

ON MATCHINGS IN GROUPS

Linear Algebra and its Applications

Homogeneous Structures and Ramsey Classes

Journal of Combinatorial Theory, Series A

Problem set 2. Math 212b February 8, 2001 due Feb. 27

0 Sets and Induction. Sets

Economics 204 Fall 2011 Problem Set 1 Suggested Solutions

Induced Saturation of Graphs

Axioms of Kleene Algebra

Partial cubes: structures, characterizations, and constructions

Introductory Analysis I Fall 2014 Homework #5 Solutions

Three Classes of Orthomodular Lattices 3

Solution. 1 Solution of Homework 7. Sangchul Lee. March 22, Problem 1.1

IDEMPOTENT n-permutable VARIETIES

INDEPENDENCE THEORIES AND GENERALIZED ZERO-ONE LAWS

James J. Madden LSU, Baton Rouge. Dedication: To the memory of Paul Conrad.

arxiv: v1 [math.lo] 24 May 2012

Relational Design Theory

Math 421, Homework #9 Solutions

Algebraic Topology M3P solutions 1

Lecture Notes on Combinatory Modal Logic

The basic feasible functionals in computable analysis

EXISTENCE OF FINITE BASES FOR QUASI-EQUATIONS OF UNARY ALGEBRAS WITH 0

7 RC Simulates RA. Lemma: For every RA expression E(A 1... A k ) there exists a DRC formula F with F V (F ) = {A 1,..., A k } and

Logic: Propositional Logic Truth Tables

Section 2: Classes of Sets

Note On Parikh slender context-free languages

group Jean-Eric Pin and Christophe Reutenauer

Extremal Graphs Having No Stable Cutsets

Chapter 2 Metric Spaces

Equivalence of SQL Queries In Presence of Embedded Dependencies

Large subsets of semigroups

Binary operations derived from symmetric permutation sets and applications to absolute geometry

Topological automorphisms of modified Sierpiński gaskets realize arbitrary finite permutation groups

Possible numbers of ones in 0 1 matrices with a given rank

Pattern Logics and Auxiliary Relations

Databases Lecture 8. Timothy G. Griffin. Computer Laboratory University of Cambridge, UK. Databases, Lent 2009

The Structure of Inverses in Schema Mappings

Design theory for relational databases

V (v i + W i ) (v i + W i ) is path-connected and hence is connected.

:= {(x,y) K 2 y 2 = x 3 +ax+b} {O}

Universal homogeneous causal sets

TESTING FOR A SEMILATTICE TERM

The Query Containment Problem: Set Semantics vs. Bag Semantics. Phokion G. Kolaitis University of California Santa Cruz & IBM Research - Almaden

The Complexity of Computing the Behaviour of Lattice Automata on Infinite Trees

Desirable properties of decompositions 1. Decomposition of relational schemes. Desirable properties of decompositions 3

Existence, stability and instability for Einstein-scalar field Lichnerowicz equations by Emmanuel Hebey

A PASCAL-LIKE BOUND FOR THE NUMBER OF NECKLACES WITH FIXED DENSITY

DENSELY k-separable COMPACTA ARE DENSELY SEPARABLE

Transcription:

Information and Computation 204 (2006) 1847 1851 www.elsevier.com/locate/ic Corrigendum Corrigendum to On the undecidability of implications between embedded multivalued database dependencies [Inform. and Comput. 122 (1995) 221 235] Christian Herrmann * Technische Universität Darmstadt, Fachbereich Mathematik, Schlossgartenstrasse 7, D64289 Darmstadt, Germany Available online 30 October 2006 By an implication for database dependencies we mean an expression H F, where H is a conjunction of dependencies and F a single dependency. Fixing a class of such implications, a solution of the (finite) implication problem consists in an algorithmic procedure deciding for every implication in the class whether or not it holds in all (finite) databases (in which it is to be interpreted). In [3] this problem was studied for dependencies which are functional (fd) or embedded multivalued (emvd). As pointed out by Luc Segoufin, what was really shown is the following. Theorem 1. The implication problem and the finite implication problem for implications H F, where F is an emvd and H a conjunction of emvds and fds, are unsolvable. The claimed extension to emvds, alone, relied on an elimination of fds from the problem which was attributed to Beeri and Vardi [1, Lemma 4]. However, this reference does not supply the elimination claimed in Theorem 16 of [3]. On the other hand, the arguments given in [3] do not provide a proof those for Lemma 18 are vacuous. Both facts have been observed by Luc Segoufin. The purpose of this Corrigendum is to provide a proof for this elimination and so for the result stated in [3]: Theorem 2. The implication problem and the finite implication problem for emvds are unsolvable. The proof of Theorem 2 will be self contained but relying on Theorem l and on Theorem 4, below, which recalls the result of Beeri and Vardi [2], crucial for the elimination of fds. DOI of original article: 10.1006/inco.1995.1148 Fax: +49 9151 193317. E-mail address: herrmann@mathematik.tu-darmstadt.de 0890-5401/$ - see front matter 2006 Elsevier Inc. All rights reserved. doi:10.1016/j.ic.2006.09.002

1848 C. Herrmann / Information and Computation 204 (2006) 1847 1851 A, B, C will be variables to denote pairwise distinct attributes (also the singleton sets), X, Y variables for sets of attributes and XY = X Y. We refer to the relational database model with a single relation I over a finite universe U of attributes (cf. Beeri and Vardi [2]). As such, we call a U-database. By convention, U will always denote a finite set so that it makes sense to consider U the attribute set of a database. Given t I and X U we denote by t[x ] the restriction of t to X. Fds and emvds are written in the form X Y and [X, Y ], respectively. If X, Y U, then [X, Y ] holds in I if and only if for all t 1, t 2 I such that t 1 [X Y ]=t 2 [X Y ] there is t I such that t[x ]=t 1 [X ] and t[y ]=t 2 [Y ]. ByaU-dependency, respectively, implication we mean one with all attributes in U. AU-mvd is an emvd [X, Y ] such that XY = U. For conjunctions H and G of dependencies in the attribute set U we say that HU-implies G if G holds in all U-databases in which H holds. Given U # U, we say that a U-implication H F is U U # -similar to the U # implication H # F provided that H F holds in all U-databases if and only if H # F holds in all U # - databases and provided that the same takes place for finite databases. The following result relies heavily on Beeri and Vardi [2] and has been known to them, yet we were not able to find an explicit reference. Theorem 3. With each U-implication H F, where F is an emvd and H a conjunction of fds and emvds, one can effectively associate U # U and a U U # -similar U # -implication H # F, where H # is a conjunction of emvds. Actually, U # will depend on U, only, and H # will arise from H replacing the fds by conjunctions of U # -mvds. Proof of Theorem 2. Any decision procedure for the emvd implication problem could be converted into one solving the problem referred to in Theorem 1. Indeed, given an emvd F and conjunction H of emvds and fds, choose U to comprise all attributes in H F, form U # and H # according to Theorem 3, and apply the decision procedure. It remains to prove Theorem 3. In [2], Beeri and Vardi associate with an fd B A over the universe U the conjunction (B 4) U of two total tuple generating dependencies (ttgds). Let C 1,..., C m be a listing of U AB with no repetitions. Choose pairwise distinct variables for values: a i, b i, c ij for i = 0, 1, 2 and j = 1,..., m. Let the two ttgds be given by Table 1. The first tuple stands for the conclusion, the others for the premise. By definition, the first ttgd is valid in a relation I over U if the statement described in Table 2 is valid for each map h from the set of variables in the set of values of I. Similarly, for the second ttgd. Table 1 Translating an fd into ttgds A B C 1... C m A B C 1... C m a 0 b 1 c 21... c 2m a 1 b 1 c 21... c 2m a 0 b 0 c 01... c 0m a 0 b 0 c 01... c 0m a 1 b 0 c 11... c 1m a 1 b 0 c 11... c 1m a 1 b 1 c 21... c 2m a 0 b 1 c 21... c 2m

C. Herrmann / Information and Computation 204 (2006) 1847 1851 1849 Table 2 Semantics of ttgds And And Jointly imply (h(a 0 ), h(b 0 ), h(c 01 ),..., h(c 0m )) I (h(a 1 ), h(b 0 ), h(c 11 ),..., h(c 1m )) I (h(a 1 ), h(b 1 ), h(c 21 ),..., h(c 2m )) I (h(a 0 ), h(b 1 ), h(c 21 ),..., h(c 2m )) I Given a conjunction H of U-dependencies, let HU denote the conjunction arising from H if each fd B A is replaced by (B A) U. As a special case of Theorem 7 in Beeri and Vardi [2], one obtains the following. Theorem 4. Let F be an emvd and H a conjunction of emvds and fds of the form B A. Assume that H F is a U-implication. Then H F is U U-similar to H U F. The idea of proof is now as follows. In Lemma 17 of [3] it was shown that fds can be replaced by mvds if one uses a second copy of the attribute set. The equivalence of an attribute A and its copy  can, of course, be captured by the fds A  and  A. In the context of similarity, the latter can be replaced by ttgds according to Theorem 4. Moreover, the fds imply the ttgds. The key is to interpolate such implications with mvds. In order to do so, we use a third copy of U. A similar technique is used in the coordinatization of lattices and relation algebras and in commutator theory of algebraic structures. The following auxiliary definitions and results are formulated for all U-databases and pairwise distinct A, B, C U. We write A B := A B B A A B C := A B A C B C and consider the following conjunctions U (A, B, C) of U-mvds: [ACD, AB] [BCD, AB] [BCD, AC] where D = U ABC. Lemma 5. A B CU-implies U (A, B, C). Proof. A B implies [AB, ACD]. Indeed, if t 1 [A] =t 2 [A], then let t = t 2. Then remaining cases follow by symmetry. Lemma 6. U (A, B, C) U-implies (A B) U, (A B) U, (B A) U, (A C) U, (C A) U, (B C) U, and (C B) U. Proof of (B A) U by the tuple generating chases in Table 3. D stands for U ABC, the d i for corresponding parts of tuples. We name the mvds used and the tuples involved, first the one which is kept except changing the value of a single attribute. The remaining cases follow by symmetry. Preparing for the proof of Theorem 3, fix a countably infinite set U (think of its members as possible original attributes). Let Û and Ũ be disjoint copies of U (providing the copy attributes) and A  and A à bijections from U onto Û and Ũ, respectively. For U U let

1850 C. Herrmann / Information and Computation 204 (2006) 1847 1851 Table 3 Tuple generating chases A B C D 1 a 0 b 0 c 0 d 0 2 a 1 b 0 c 1 d 1 3 a 1 b 1 c 2 d 2 4 a 1 b 0 c 2 d 2 3,2, [ACD, AB] 5 a 0 b 0 c 2 d 2 4,1, [BCD, AB] a 0 b 1 c 2 d 2 3,5, [BCD, AC] A B C D 1 a 0 b 0 c 0 d 0 2 a 1 b 0 c 1 d 1 3 a 0 b 1 c 2 d 2 4 a 0 b 0 c 2 d 2 3,1, [ACD, AB] 5 a 1 b 0 c 2 d 2 4,2, [BCD, AB] a 1 b 1 c 2 d 2 5,3, [BCD, AC] Û and Ũ denote the images under these maps and U = U Û Ũ. Let I U be the conjunction of all A Â Ã, A U. For an fd X A, let (X A) U be defined as the U -mvd [U A, XA]. Lemma 7. Let F be an emvd and H a conjunction of emvds and fds of the form X A with A not in X. Assume that H F is a U-implication and let H arise from H by replacing X A with (X A) U and adding the conjunct I U. Then H F and H F are U U -similar. This is basically Lemma 17 in [3]. Proof. The dependencies X A and (X A) U are equivalent for all U -databases which satisfy I U. Namely, consider a U -model J of I U and [U A, XA] and t, u J such that t[x ]=u[x ]. By the mvd one has w J such that w[xa] =u[xa], w[u A] =t[u A]. In particular, w(a) = u(a) and w(â) = t(â), whence w(a) = t(a) by I U and t(a) = u(a). The converse (that the fd implies the mvd) is trivial. Now, given a U-model J of H, choose the domains for the new attributes such that for each A U there are bijections A : DOM(A) DOM(Â), A : DOM(A) DOM(Â) and define the U database J to consist of all t such that t[u] J, t(â) = A (t(a)), t(ã) = A (t(a)) for all A U. Then J is a model of H. Conversely, from a U -model J of H pass to J just by restricting J to U to obtain a model of H. In both directions, the status of the emvd F remains unchanged.

C. Herrmann / Information and Computation 204 (2006) 1847 1851 1851 Proof of Theorem 3. We may assume that the fds in H are of the form X A with A not in X, omitting the trivial ones. Form U and H according to Lemma 7. Let U # = U and H # be the conjunction of emvds which arises from H replacing A Â Ã by U (A, Â, Ã). In view of Lemma 7 it suffices to show that H F and H # F are U U -similar. Now, applying Lemma 5 to the attribute set U, we have that H U -implies H #. Hence, H # FU - implies H F. In particular, if H # F holds for all (finite) U -databases, then so does H F. To prove the converse, assume there is a U -model J # of H # which is not a model of F. Since H # arises from H U, replacing (A Â Ã) U, with U (A, Â, Ã), from Lemma 6 we have that H # U -implies H U. It follows that J # is also a model of H U. Now, by Theorem 4 there is a U -model J of H which is not a model of F. And J can be chosen finite if J # is finite. The author must thank Luc Segoufin and Moshe Vardi for pointing out the error and for support in the repair effort. References [1] C. Beeri, M.Y. Vardi, The implication problem for data dependencies, in: S. Even, O. Kariv (Eds.), ICALP 81, LNCS, vol. 115, Springer, Berlin, 1981, pp. 73 85. [2] C. Beeri, M.Y. Vardi, A proof procedure for data dependencies, JACM 31 (1984) 718 741. [3] C. Herrmann, On the undecidability of implications between embedded multivalued database dependencies, Inform. and Comput. 122 (1995) 221 235.