Query Evaluation on Probabilistic Databases. CSE 544: Wednesday, May 24, 2006

Size: px
Start display at page:

Download "Query Evaluation on Probabilistic Databases. CSE 544: Wednesday, May 24, 2006"

Transcription

1 Query Evaluation on Probabilistic Databases CSE 544: Wednesday, May 24, 2006

2 Problem Setting Queries: Tables: Review A(x,y) :- Review(x,y), Movie(x,z), z > 1991 name rating p Movie Monkey Love good.5 title year p Twelve Monkeys Monkey Love fair.2 fair.6 poor.9 Answers: Top k Monkey Love title rating p Monkey Love Pl Twelve Monkeys fair.53 Problem: complexity Monkey Love good.42 Monkey Love Pl fair.15 of query evaluation 2

3 Two Problems Fixed schema S, conjunctive query Q(x,y) Query evaluation problem Fix answer tuple (a,b) Given database I, compute Pr(Q(a,b)) Top-k answering problem Fix k > 0 Given database I, find k answer tuples with highest probabilities 3

4 Related Work: DB Cavallo&Pitarelli:1987 Barbara,Garcia-Molina, Porter:1992 Lakshmanan,Leone,Ross&Subrahmanian:1997 Fuhr&Roellke:1997 Dalvi&S:2004 Widom:2005 4

5 Related Work: Logic Query reliability [Gradel,Gurevitch,Hirsch 98] Degrees of belief [Bacchus,Grove,Halpern,Koller 96] Probabilistic Logic [Nielson] Probabilistic model checking [Kwiatkowska 02] Probabilistic Relational Model [Taskar,Abbeel,Koller 02] 5

6 Probabilistic Database Schema S, Domain D, Set of instances Inst Definition Probabilistic database is a probability distribution Pr : Inst [0,1], I Pr[I] = 1 If Pr[I] > 0 then I is called possible world 6

7 Probabilistic Database Representation: Independent tuples: I-database DB over some schema S i Independent and disjoint tuples: ID-database DB over some schema S id Semantics: DB means probability distribution Pr over schema S 7

8 Independent Events A tuple is in the database with probability p Any two tuples are independent events 8

9 Representation I-Databases Reviews i (M,S,p) Movie Score P m42 good p 1 m99 good p 2 Reviews(M,S) m76 poor p 3 Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor m42 good m99 good m42 good m76 poor m42 good m m42 good Pr[I 1 ]= (1-p 1 )*(1-p 2 )*(1-p 3 ) Pr[I 4 ]= m99 good m76 poor p 1 *p 2 *(1-p 3 ) Pr[I 1 ] + Pr[I 2 ] Pr[I 8 ] = 1 m76 poor Pr[I 8 ]= m99 good m76 poor p 1 *p 2 *p 3 Possible worlds semantics 9

10 Disjoint Events Needed in Many-to-1 matchings Possible values for attributes [Barbara 92] Name Age Name Age P John Mary (0.3) 43 (0.7) John John Mary

11 ID-Databases Activities id Time d Activity P t walk p 1 t run p 2 Activities t+1 walk p 3 Time Act Time Act Time Act Pr[I 1 ]= t (1-p 1 -p 2 )*(1-p 3 ) walk Pr[I 3 ]= t run p 2 *(1-p 3 ) Time Act t+1 walk Pr[I 5 ]= Time Pr[I 1 ] + Pr[I 2 ] Pr[I 6 ] = 1 11 t Act walk t+1 walk p 1 *p 3 Time Act t run t+1 walk

12 ID subsumes I Reviews id Movie d Score d P m42 good p 1 m99 good p 2 m76 poor p 3 = Reviews i Movie Score P m42 good p 1 m99 good p 2 m76 poor p 3 Note: Movie Score P Reviews id m42 good p 1 m99 good p 2 means all tuples are m76 poor p 3 disjoint 12

13 Queries Syntax: conjunctive queries over schema S Q(y) :- Movie(x,y), Review(x,z), z >= 3 Movie i id year P m m m m Review i mid rating p m m m m m

14 Two Query Semantics Possible answer sets Given set A: Pr[{t I Q(t)} = A] Used for views This Possible tuples talk Given tuple t: Pr[I Q(t)] Used for query evaluation and top-k 14

15 Query Semantics id year p 1 id year p 2 id year p 3 id year p 4 m m m m m m m m m m Q(y) :- Movie(x,y), Review(x,z) year p Tuple probabilities 1935 p 2 + p 3 = p 1 + p 3 = p 3 = top k 15

16 Summary on Data Model Data Model: Semantics = possible worlds Syntax = I-databases or ID-databases Queries: Syntax = unchanged (conjunctive queries) Semantics = tuple probabilities 16

17 Problem Definition Fix schema S, query Q, answer tuple t Problem: given I/ID-database DB, compute Pr[I Q(t)] notation: Pr[Q(t)] Conventions: For upper bounds (P or #P): probabilities are rationals For lower bounds (#P): probabilities are 1/2 17

18 Query Evaluation on I-Databases Outline Intuition Extensional plans: PTIME case Hard queries: #P-complete case Dichotomy Theorem 18

19 Q(y) :- Movie(x,y), Review(x,z) Movie i Intuition Review i Answer id year p m p 1 m p 2 m p 3 m p 4 mid rate p m42 4 q 1 m42 2 q 2 m42 3 q 3 m99 1 q 4 m99 3 q 5 m76 5 q 6 Year 1995 p p 1 (1 - (1 - q 1 ) (1 - q 2 ) (1 - q 3 )) (1 - p 2 (1 - (1 - q 4 ) (1 - q 5 ))) (1 - p 3 q 6 ) 19

20 I-Extensional Plans [Barbara92,Lakshmanan97] Add p Join p = p 1 * p 2 Projection p = 1-(1-p 1 )(1-p 2 )...(1-p n ) Selection σ p = p Note: data complexity is PTIME 20

21 Extensional Query Plans x x pq x 1-(1-p1)(1-p2)(1-p3) x p σ x p x q x x x p1 p2 p3 x p 21

22 Extensional Query Plans Each tuple t has a probability t.p Algebra operators compute t.p Data complexity: PTIME 22

23 (1-pq 1 )(1-pq 2 )(1-pq 3 ) Q(y) :- Movie(x,y), Review(x,z) 1995 m1 pq m1 p(1-(1-q 1 )(1-q 2 )(1-q 3 )) 1995 m1 pq m1 pq 3 m1 1 - (1-q 1 )(1-q 2 )(1-q 3 ) Movie 1995 p Review m1 q 1 Movie Review m1 q p m1 q 1 m1 q 3 INCORRECT! 23 m1 q 2 m1 q 3 CORRECT

24 #P-Complete Queries R i S T i A p A B B p p 1 q 1 p 2 q 2 p 3 q 3 p 4 q 4 Q bad :- R i (x), S(x,y), T i (y) Theorem: Data complexity is #P-complete

25 Proof: Theorem [Provan&Ball83] Counting the number of satisfying assignments for bipartite DNF is #P-complete R i S T i Reduction: A p x 1 1/2 A B x 2 y 3 B p y 1 1/2 x 2 y 3 V x 1 y 2 V x 4 y 3 V x 3 y 1 x 2 1/2 x 1 y 2 y 2 1/2 x 3 1/2 x 4 y 3 y 3 1/2 x 4 1/2 x 3 y 1 Q bad :- R i (x), S(x,y), T i (y) 25

26 I-Dichotomy Q = boolean conjunctive query Definition 1. For each variable x: goals(x) = set of goals that contain x Definition 2. Q is hierarchical if forall x, y: (a) goals(x) goals(y) =, or (b) goals(x) goals(y), or (c) goals(y) goals(x)

27 Q :- R(x),S(x,y),T(x,y,z),K(x,v) x y z S R T v K hierarchical Q :- R(x), S(x,y), T(y) x y non-hierarchical R S T 27

28 [Dalvi&S. 04] I-Dichotomy Schema S i = {R 1i, R 2i,..., R mi } Theorem Let Q = conjunctive query w/o self-joins. Then one of the following holds: Q is in PTIME Q has a correct extensional plan or: Q is hierarchical Q is #P-complete Q has subgoals R(x,...),S(x,y,...),T(y,...) 28

29 Proof Lemma 1. If Q is non-hierarchical, then #P-complete Proof: v x y R S T K z Q :- R i (v,x), S i (x,y), T i (y,z), K i (z) rest is like for Q bad 29

30 Proof Lemma 2. If Q is hierarchical, then PTIME Proof: Case 1: has no root Pr(Q) = Pr(Q 1 ) Pr(Q 2 ) Pr(Q 3 ) This is extensional join 30

31 Proof Case 2: has root x x Dom={a 1, a 2,..., a n } Pr(Q) = 1 - (1-Pr(Q(a 1 /x))(1-pr(q(a 2 /x))...(1-pr(q(a n /x))) This is an extensional projection: QED 31

32 Query Evaluation on ID-Databases ID-extensional plans #P-complete queries Dichotomoy Theorem 32

33 Extensional Plans for ID-DBs Only difference: two kinds of projections: independent 1-(1-p 1 )...(1-p n ) disjoint p p n 33

34 #P-Complete Queries Q 1 :- R i (x), S i (x,y), T i (y) Q 2 :- R d (x d,y), S d (y d ) Q 3 :- R d (x d,y), S d (z d,y) 34

35 [Dalvi&S. 04] I-DB Dichotomy Schema S id s.t. each table is either R i or R id Theorem Let Q = conjunctive query w/o self-joins. Then one of the following holds: Q is in PTIME Q has a correct extensional plan or: Q is #P-complete Q has one of Q 1, Q 2, Q 3 as subqueries 35

36 Extensions Extensions of the dichotomoy theorem exists for: Mixed schemas (some relations are deterministic) Functional dependencies 36

37 Summary on Query Evaluation Extensional plans: popular, efficient, BUT Equivalent plans lead to different results Some queries admit correct plans Some simple queries: #P-complete complexity Dichotomy theorem Future work: remove no-self-join restriction 37

38 Conclusions Strong motivation from practical applications Merge query and search technologies Probabilistic DB s are hard! Hacks don t work (yet). Need principled approach. 38

39 Thank you! Questions?

A Dichotomy. in in Probabilistic Databases. Joint work with Robert Fink. for Non-Repeating Queries with Negation Queries with Negation

A Dichotomy. in in Probabilistic Databases. Joint work with Robert Fink. for Non-Repeating Queries with Negation Queries with Negation Dichotomy for Non-Repeating Queries with Negation Queries with Negation in in Probabilistic Databases Robert Dan Olteanu Fink and Dan Olteanu Joint work with Robert Fink Uncertainty in Computation Simons

More information

Topics in Probabilistic and Statistical Databases. Lecture 2: Representation of Probabilistic Databases. Dan Suciu University of Washington

Topics in Probabilistic and Statistical Databases. Lecture 2: Representation of Probabilistic Databases. Dan Suciu University of Washington Topics in Probabilistic and Statistical Databases Lecture 2: Representation of Probabilistic Databases Dan Suciu University of Washington 1 Review: Definition The set of all possible database instances:

More information

Database design and implementation CMPSCI 645

Database design and implementation CMPSCI 645 Database design and implementation CMPSCI 645 Lectures 20: Probabilistic Databases *based on a tutorial by Dan Suciu Have we seen uncertainty in DB yet? } NULL values Age Height Weight 20 NULL 200 NULL

More information

Queries and Materialized Views on Probabilistic Databases

Queries and Materialized Views on Probabilistic Databases Queries and Materialized Views on Probabilistic Databases Nilesh Dalvi Christopher Ré Dan Suciu September 11, 2008 Abstract We review in this paper some recent yet fundamental results on evaluating queries

More information

Database Design and Implementation

Database Design and Implementation Database Design and Implementation CS 645 Data provenance Provenance provenance, n. The fact of coming from some particular source or quarter; origin, derivation [Oxford English Dictionary] Data provenance

More information

Brief Tutorial on Probabilistic Databases

Brief Tutorial on Probabilistic Databases Brief Tutorial on Probabilistic Databases Dan Suciu University of Washington Simons 206 About This Talk Probabilistic databases Tuple-independent Query evaluation Statistical relational models Representation,

More information

A Toolbox of Query Evaluation Techniques for Probabilistic Databases

A Toolbox of Query Evaluation Techniques for Probabilistic Databases 2nd Workshop on Management and mining Of UNcertain Data (MOUND) Long Beach, March 1st, 2010 A Toolbox of Query Evaluation Techniques for Probabilistic Databases Dan Olteanu, Oxford University Computing

More information

Efficient Query Evaluation on Probabilistic Databases

Efficient Query Evaluation on Probabilistic Databases Efficient Query Evaluation on Probabilistic Databases Nilesh Dalvi and Dan Suciu April 4, 2004 Abstract We describe a system that supports arbitrarily complex SQL queries with uncertain predicates. The

More information

The Trichotomy of HAVING Queries on a Probabilistic Database

The Trichotomy of HAVING Queries on a Probabilistic Database VLDBJ manuscript No. (will be inserted by the editor) The Trichotomy of HAVING Queries on a Probabilistic Database Christopher Ré Dan Suciu Received: date / Accepted: date Abstract We study the evaluation

More information

Outline. Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC

Outline. Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC Outline Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC Part 5: Completeness of Lifted Inference Part 6: Query Compilation Part 7:

More information

Query answering using views

Query answering using views Query answering using views General setting: database relations R 1,...,R n. Several views V 1,...,V k are defined as results of queries over the R i s. We have a query Q over R 1,...,R n. Question: Can

More information

GAV-sound with conjunctive queries

GAV-sound with conjunctive queries GAV-sound with conjunctive queries Source and global schema as before: source R 1 (A, B),R 2 (B,C) Global schema: T 1 (A, C), T 2 (B,C) GAV mappings become sound: T 1 {x, y, z R 1 (x,y) R 2 (y,z)} T 2

More information

Provenance Semirings. Todd Green Grigoris Karvounarakis Val Tannen. presented by Clemens Ley

Provenance Semirings. Todd Green Grigoris Karvounarakis Val Tannen. presented by Clemens Ley Provenance Semirings Todd Green Grigoris Karvounarakis Val Tannen presented by Clemens Ley place of origin Provenance Semirings Todd Green Grigoris Karvounarakis Val Tannen presented by Clemens Ley place

More information

Efficient Top-k Query Evaluation on Probabilistic Data

Efficient Top-k Query Evaluation on Probabilistic Data Efficient Top-k Query Evaluation on Probabilistic Data Extended Version TR: #2006-06-05 Christopher Ré, Nilesh Dalvi and Dan Suciu University of Washington Dept. of Computer Science and Engineering E-mail:

More information

Probabilistic Databases

Probabilistic Databases Probabilistic Databases Amol Deshpande University of Maryland Goal Introduction to probabilistic databases Focus on an overview of: Different possible representations Challenges in using them Probabilistic

More information

Tractable Lineages on Treelike Instances: Limits and Extensions

Tractable Lineages on Treelike Instances: Limits and Extensions Tractable Lineages on Treelike Instances: Limits and Extensions Antoine Amarilli 1, Pierre Bourhis 2, Pierre enellart 1,3 June 29th, 2016 1 Télécom ParisTech 2 CNR CRItAL 3 National University of ingapore

More information

arxiv: v1 [cs.ai] 21 Sep 2014

arxiv: v1 [cs.ai] 21 Sep 2014 Oblivious Bounds on the Probability of Boolean Functions WOLFGANG GATTERBAUER, Carnegie Mellon University DAN SUCIU, University of Washington arxiv:409.6052v [cs.ai] 2 Sep 204 This paper develops upper

More information

Logic and Databases. Phokion G. Kolaitis. UC Santa Cruz & IBM Research Almaden. Lecture 4 Part 1

Logic and Databases. Phokion G. Kolaitis. UC Santa Cruz & IBM Research Almaden. Lecture 4 Part 1 Logic and Databases Phokion G. Kolaitis UC Santa Cruz & IBM Research Almaden Lecture 4 Part 1 1 Thematic Roadmap Logic and Database Query Languages Relational Algebra and Relational Calculus Conjunctive

More information

Query Evaluation with Soft-Key Constraints

Query Evaluation with Soft-Key Constraints Query Evaluation with Soft-Key Constraints Abhay Jha CSE, University of Washington Seattle WA, 98195-2350 abhaykj@cs.washington.edu Vibhor Rastogi CSE, University of Washington Seattle WA, 98195-2350 vibhor@cs.washington.edu

More information

Representing and Querying Correlated Tuples in Probabilistic Databases

Representing and Querying Correlated Tuples in Probabilistic Databases Representing and Querying Correlated Tuples in Probabilistic Databases Prithviraj Sen Dept. of Computer Science, University of Maryland, College Park, MD 20742. sen@cs.umd.edu Amol Deshpande Dept. of Computer

More information

Semantics of Ranking Queries for Probabilistic Data and Expected Ranks

Semantics of Ranking Queries for Probabilistic Data and Expected Ranks Semantics of Ranking Queries for Probabilistic Data and Expected Ranks Graham Cormode AT&T Labs Feifei Li FSU Ke Yi HKUST 1-1 Uncertain, uncertain, uncertain... (Probabilistic, probabilistic, probabilistic...)

More information

CSE 562 Database Systems

CSE 562 Database Systems Outline Query Optimization CSE 562 Database Systems Query Processing: Algebraic Optimization Some slides are based or modified from originals by Database Systems: The Complete Book, Pearson Prentice Hall

More information

The Veracity of Big Data

The Veracity of Big Data The Veracity of Big Data Pierre Senellart École normale supérieure 13 October 2016, Big Data & Market Insights 23 April 2013, Dow Jones (cnn.com) 23 April 2013, Dow Jones (cnn.com) 23 April 2013, Dow Jones

More information

Towards Deterministic Decomposable Circuits for Safe Queries

Towards Deterministic Decomposable Circuits for Safe Queries Towards Deterministic Decomposable Circuits for Safe Queries Mikaël Monet 1 and Dan Olteanu 2 1 LTCI, Télécom ParisTech, Université Paris-Saclay, France, Inria Paris; Paris, France mikael.monet@telecom-paristech.fr

More information

Computing Query Probability with Incidence Algebras Technical Report UW-CSE University of Washington

Computing Query Probability with Incidence Algebras Technical Report UW-CSE University of Washington Computing Query Probability with Incidence Algebras Technical Report UW-CSE-10-03-02 University of Washington Nilesh Dalvi, Karl Schnaitter and Dan Suciu Revised: August 24, 2010 Abstract We describe an

More information

Probabilistic Databases

Probabilistic Databases Probabilistic Databases Dan Olteanu (Oxford) London, November 2017 The National Archives Probabilistic Databases For the purpose of this introductory talk: Probabilistic data = Relational data + Probabilities

More information

Relational Database Design

Relational Database Design Relational Database Design Jan Chomicki University at Buffalo Jan Chomicki () Relational database design 1 / 16 Outline 1 Functional dependencies 2 Normal forms 3 Multivalued dependencies Jan Chomicki

More information

Data Dependencies in the Presence of Difference

Data Dependencies in the Presence of Difference Data Dependencies in the Presence of Difference Tsinghua University sxsong@tsinghua.edu.cn Outline Introduction Application Foundation Discovery Conclusion and Future Work Data Dependencies in the Presence

More information

Top-k Query Processing in Probabilistic Databases with Non-Materialized Views. Maximilian Dylla Iris Miliaraki Martin Theobald

Top-k Query Processing in Probabilistic Databases with Non-Materialized Views. Maximilian Dylla Iris Miliaraki Martin Theobald Top-k Query Processing in Probabilistic Databases with Non-Materialized Views Maximilian Dylla Iris Miliaraki Martin Theobald MPI I 2012 5-002 June 2012 Authors Addresses Maximilian Dylla Max-Planck-Institut

More information

Outline. Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC

Outline. Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC Outline Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC Part 5: Completeness of Lifted Inference Part 6: Query Compilation Part 7:

More information

Knowledge Compilation Meets Database Theory : Compiling Queries to Decision Diagrams

Knowledge Compilation Meets Database Theory : Compiling Queries to Decision Diagrams Knowledge Compilation Meets Database Theory : Compiling Queries to Decision Diagrams Abhay Jha Computer Science and Engineering University of Washington, WA, USA abhaykj@cs.washington.edu Dan Suciu Computer

More information

Database Applications (15-415)

Database Applications (15-415) Database Applications (15-415) Relational Calculus Lecture 6, January 26, 2016 Mohammad Hammoud Today Last Session: Relational Algebra Today s Session: Relational calculus Relational tuple calculus Announcements:

More information

Scalable Uncertainty Management

Scalable Uncertainty Management Scalable Uncertainty Management 05 Query Evaluation in Probabilistic Databases Rainer Gemulla Jun 1, 2012 Overview In this lecture Primer: relational calculus Understand complexity of query evaluation

More information

arxiv: v1 [cs.db] 21 Feb 2017

arxiv: v1 [cs.db] 21 Feb 2017 Answering Conjunctive Queries under Updates Christoph Berkholz, Jens Keppeler, Nicole Schweikardt Humboldt-Universität zu Berlin {berkholz,keppelej,schweika}@informatik.hu-berlin.de February 22, 207 arxiv:702.06370v

More information

StarAI Full, 6+1 pages Short, 2 page position paper or abstract

StarAI Full, 6+1 pages Short, 2 page position paper or abstract StarAI 2015 Fifth International Workshop on Statistical Relational AI At the 31st Conference on Uncertainty in Artificial Intelligence (UAI) (right after ICML) In Amsterdam, The Netherlands, on July 16.

More information

Schema Refinement: Other Dependencies and Higher Normal Forms

Schema Refinement: Other Dependencies and Higher Normal Forms Schema Refinement: Other Dependencies and Higher Normal Forms Spring 2018 School of Computer Science University of Waterloo Databases CS348 (University of Waterloo) Higher Normal Forms 1 / 14 Outline 1

More information

QSQL: Incorporating Logic-based Retrieval Conditions into SQL

QSQL: Incorporating Logic-based Retrieval Conditions into SQL QSQL: Incorporating Logic-based Retrieval Conditions into SQL Sebastian Lehrack and Ingo Schmitt Brandenburg University of Technology Cottbus Institute of Computer Science Chair of Database and Information

More information

Answering Queries from Statistics and Probabilistic Views. Nilesh Dalvi and Dan Suciu, University of Washington.

Answering Queries from Statistics and Probabilistic Views. Nilesh Dalvi and Dan Suciu, University of Washington. Answering Queries from Statistics and Probabilistic Views Nilesh Dalvi and Dan Suciu, University of Washington. Background Query answering using Views problem: find answers to a query q over a database

More information

Factorised Representations of Query Results: Size Bounds and Readability

Factorised Representations of Query Results: Size Bounds and Readability Factorised Representations of Query Results: Size Bounds and Readability Dan Olteanu and Jakub Závodný Department of Computer Science University of Oxford {dan.olteanu,jakub.zavodny}@cs.ox.ac.uk ABSTRACT

More information

Data Bases Data Mining Foundations of databases: from functional dependencies to normal forms

Data Bases Data Mining Foundations of databases: from functional dependencies to normal forms Data Bases Data Mining Foundations of databases: from functional dependencies to normal forms Database Group http://liris.cnrs.fr/ecoquery/dokuwiki/doku.php?id=enseignement: dbdm:start March 1, 2017 Exemple

More information

P Q1 Q2 Q3 Q4 Q5 Tot (60) (20) (20) (20) (60) (20) (200) You are allotted a maximum of 4 hours to complete this exam.

P Q1 Q2 Q3 Q4 Q5 Tot (60) (20) (20) (20) (60) (20) (200) You are allotted a maximum of 4 hours to complete this exam. Exam INFO-H-417 Database System Architecture 13 January 2014 Name: ULB Student ID: P Q1 Q2 Q3 Q4 Q5 Tot (60 (20 (20 (20 (60 (20 (200 Exam modalities You are allotted a maximum of 4 hours to complete this

More information

7 RC Simulates RA. Lemma: For every RA expression E(A 1... A k ) there exists a DRC formula F with F V (F ) = {A 1,..., A k } and

7 RC Simulates RA. Lemma: For every RA expression E(A 1... A k ) there exists a DRC formula F with F V (F ) = {A 1,..., A k } and 7 RC Simulates RA. We now show that DRC (and hence TRC) is at least as expressive as RA. That is, given an RA expression E that mentions at most C, there is an equivalent DRC expression E that mentions

More information

Fundamentos lógicos de bases de datos (Logical foundations of databases)

Fundamentos lógicos de bases de datos (Logical foundations of databases) 20/7/2015 ECI 2015 Buenos Aires Fundamentos lógicos de bases de datos (Logical foundations of databases) Diego Figueira Gabriele Puppis CNRS LaBRI About the speakers Gabriele Puppis PhD from Udine (Italy)

More information

6 The Relational Data Model: Algebraic operations on tabular data

6 The Relational Data Model: Algebraic operations on tabular data 6 The Relational Data Model: Algebraic operations on tabular data 6.1 Basic idea of relational languages 6.2 Relational Algebra operations 6.3 Relational Algebra: Syntax and Semantics 6.4. More Operators

More information

Querying Big Data by Accessing Small Data

Querying Big Data by Accessing Small Data Querying Big Data by Accessing Small Data Wenfei Fan 1,3 Floris Geerts 2 Yang Cao 1,3 Ting Deng 3 Ping Lu 3 1 University of Edinburgh 2 University of Antwerp 3 Beihang University {wenfei@inf, yang.cao@}.ed.ac.uk,

More information

On the Semantics and Evaluation of Top-k Queries in Probabilistic Databases

On the Semantics and Evaluation of Top-k Queries in Probabilistic Databases On the Semantics and Evaluation of Top-k Queries in Probabilistic Databases Xi Zhang Jan Chomicki SUNY at Buffalo September 23, 2008 Xi Zhang, Jan Chomicki (SUNY at Buffalo) Topk Queries in Prob. DB September

More information

Lightweight Description Logics: DL-Lite A and EL ++

Lightweight Description Logics: DL-Lite A and EL ++ Lightweight Description Logics: DL-Lite A and EL ++ Elena Botoeva 1 KRDB Research Centre Free University of Bozen-Bolzano January 13, 2011 Departamento de Ciencias de la Computación Universidad de Chile,

More information

Efficient query evaluation

Efficient query evaluation Efficient query evaluation Maria Luisa Sapino Set of values E.g. select * from EMPLOYEES where SALARY = 1500; Result of a query Sorted list E.g. select * from CAR-IMAGE where color = red ; 2 Queries as

More information

Profiling Sets for Preference Querying

Profiling Sets for Preference Querying Profiling Sets for Preference Querying Xi Zhang and Jan Chomicki Department of Computer Science and Engineering University at Buffalo, SUNY, U.S.A. {xizhang,chomicki}@cse.buffalo.edu Abstract. We propose

More information

Databases Lecture 8. Timothy G. Griffin. Computer Laboratory University of Cambridge, UK. Databases, Lent 2009

Databases Lecture 8. Timothy G. Griffin. Computer Laboratory University of Cambridge, UK. Databases, Lent 2009 Databases Lecture 8 Timothy G. Griffin Computer Laboratory University of Cambridge, UK Databases, Lent 2009 T. Griffin (cl.cam.ac.uk) Databases Lecture 8 DB 2009 1 / 15 Lecture 08: Multivalued Dependencies

More information

Approximate Lineage for Probabilistic Databases. University of Washington Technical Report UW TR: #TR

Approximate Lineage for Probabilistic Databases. University of Washington Technical Report UW TR: #TR Approximate Lineage for Probabilistic Databases University of Washington Technical Report UW TR: #TR2008-03-02 Christopher Ré and Dan Suciu {chrisre,suciu}@cs.washington.edu Abstract In probabilistic databases,

More information

Computational Logic. Relational Query Languages with Negation. Free University of Bozen-Bolzano, Werner Nutt

Computational Logic. Relational Query Languages with Negation. Free University of Bozen-Bolzano, Werner Nutt Computational Logic Free University of Bozen-Bolzano, 2010 Werner Nutt (Slides adapted from Thomas Eiter and Leonid Libkin) Computational Logic 1 Queries with All Who are the directors whose movies are

More information

The Calculus of Computation: Decision Procedures with Applications to Verification. Part I: FOUNDATIONS. by Aaron Bradley Zohar Manna

The Calculus of Computation: Decision Procedures with Applications to Verification. Part I: FOUNDATIONS. by Aaron Bradley Zohar Manna The Calculus of Computation: Decision Procedures with Applications to Verification Part I: FOUNDATIONS by Aaron Bradley Zohar Manna 1. Propositional Logic(PL) Springer 2007 1-1 1-2 Propositional Logic(PL)

More information

On Factorisation of Provenance Polynomials

On Factorisation of Provenance Polynomials On Factorisation of Provenance Polynomials Dan Olteanu and Jakub Závodný Oxford University Computing Laboratory Wolfson Building, Parks Road, OX1 3QD, Oxford, UK 1 Introduction Tracking and managing provenance

More information

Representing and Querying Correlated Tuples in Probabilistic Databases

Representing and Querying Correlated Tuples in Probabilistic Databases Representing and Querying Correlated Tuples in Probabilistic Databases Prithviraj Sen Amol Deshpande Department of Computer Science University of Maryland, College Park. International Conference on Data

More information

Functional Dependencies. Getting a good DB design Lisa Ball November 2012

Functional Dependencies. Getting a good DB design Lisa Ball November 2012 Functional Dependencies Getting a good DB design Lisa Ball November 2012 Outline (2012) SEE NEXT SLIDE FOR ALL TOPICS (some for you to read) Normalization covered by Dr Sanchez Armstrong s Axioms other

More information

Relational Database: Identities of Relational Algebra; Example of Query Optimization

Relational Database: Identities of Relational Algebra; Example of Query Optimization Relational Database: Identities of Relational Algebra; Example of Query Optimization Greg Plaxton Theory in Programming Practice, Fall 2005 Department of Computer Science University of Texas at Austin

More information

Database Applications (15-415)

Database Applications (15-415) Database Applications (15-415) Relational Calculus Lecture 5, January 27, 2014 Mohammad Hammoud Today Last Session: Relational Algebra Today s Session: Relational algebra The division operator and summary

More information

Language of Propositional Logic

Language of Propositional Logic Logic A logic has: 1. An alphabet that contains all the symbols of the language of the logic. 2. A syntax giving the rules that define the well formed expressions of the language of the logic (often called

More information

Homework Assignment 2. Due Date: October 17th, CS425 - Database Organization Results

Homework Assignment 2. Due Date: October 17th, CS425 - Database Organization Results Name CWID Homework Assignment 2 Due Date: October 17th, 2017 CS425 - Database Organization Results Please leave this empty! 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 2.10 2.11 2.12 2.15 2.16 2.17 2.18 2.19 Sum

More information

On the Optimal Approximation of Queries Using Tractable Propositional Languages

On the Optimal Approximation of Queries Using Tractable Propositional Languages On the Optimal Approximation of Queries Using Tractable Propositional Languages Robert Fink and Dan Olteanu Oxford University Computing Laboratory {robert.fink,dan.olteanu}@comlab.ox.ac.uk ABSTRACT This

More information

QQL: A DB&IR Query Language

QQL: A DB&IR Query Language The VLDB Journal (2008) 7:39 56 DOI 0.007/s00778-007-0070- SPECIAL ISSUE PAPER QQL: A DB&IR Query Language Ingo Schmitt Received: 8 September 2006 / Revised: 4 June 2007 / Accepted: 8 July 2007 / Published

More information

Automata-Theoretic Model Checking of Reactive Systems

Automata-Theoretic Model Checking of Reactive Systems Automata-Theoretic Model Checking of Reactive Systems Radu Iosif Verimag/CNRS (Grenoble, France) Thanks to Tom Henzinger (IST, Austria), Barbara Jobstmann (CNRS, Grenoble) and Doron Peled (Bar-Ilan University,

More information

Languages. Theory I: Database Foundations. Relational Algebra. Paradigms. Projection. Basic Operators. Jan-Georg Smaus (Georg Lausen)

Languages. Theory I: Database Foundations. Relational Algebra. Paradigms. Projection. Basic Operators. Jan-Georg Smaus (Georg Lausen) Languages Theory I: Database Foundations Jan-Georg Smaus (Georg Lausen) Paradigms 1. Languages: Relational Algebra Projection Union and Difference Summary 26.7.2011 Relational algebra Relational calculus

More information

Probabilistic Representation and Reasoning

Probabilistic Representation and Reasoning Probabilistic Representation and Reasoning Alessandro Panella Department of Computer Science University of Illinois at Chicago May 4, 2010 Alessandro Panella (CS Dept. - UIC) Probabilistic Representation

More information

Database Design: Normal Forms as Quality Criteria. Functional Dependencies Normal Forms Design and Normal forms

Database Design: Normal Forms as Quality Criteria. Functional Dependencies Normal Forms Design and Normal forms Database Design: Normal Forms as Quality Criteria Functional Dependencies Normal Forms Design and Normal forms Design Quality: Introduction Good conceptual model: - Many alternatives - Informal guidelines

More information

Journal of Computer and System Sciences

Journal of Computer and System Sciences Journal of Computer and System Sciences 78 (2012) 1026 1044 Contents lists available at SciVerse ScienceDirect Journal of Computer and System Sciences www.elsevier.com/locate/jcss Characterisations of

More information

The Assignment Axiom (Hoare)

The Assignment Axiom (Hoare) The Assignment Axiom (Hoare) Syntax: V := E Semantics: value of V in final state is value of E in initial state Example: X:=X+ (adds one to the value of the variable X) The Assignment Axiom {Q[E/V ]} V

More information

First-order logic Syntax and semantics

First-order logic Syntax and semantics 1 / 43 First-order logic Syntax and semantics Mario Alviano University of Calabria, Italy A.Y. 2017/2018 Outline 2 / 43 1 Motivation Why more than propositional logic? Intuition 2 Syntax Terms Formulas

More information

Prof. Dr. Ralf Möller Dr. Özgür L. Özçep Universität zu Lübeck Institut für Informationssysteme. Tanya Braun (Exercises)

Prof. Dr. Ralf Möller Dr. Özgür L. Özçep Universität zu Lübeck Institut für Informationssysteme. Tanya Braun (Exercises) Prof. Dr. Ralf Möller Dr. Özgür L. Özçep Universität zu Lübeck Institut für Informationssysteme Tanya Braun (Exercises) Slides taken from the presentation (subset only) Learning Statistical Models From

More information

Propositional Logic Part 1

Propositional Logic Part 1 Propositional Logic Part 1 Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department University of Wisconsin, Madison [Based on slides from Louis Oliphant, Andrew Moore, Jerry Zhu] slide 1 5 is even

More information

Chapter 4: Classical Propositional Semantics

Chapter 4: Classical Propositional Semantics Chapter 4: Classical Propositional Semantics Language : L {,,, }. Classical Semantics assumptions: TWO VALUES: there are only two logical values: truth (T) and false (F), and EXTENSIONALITY: the logical

More information

Sebastian Link University of Auckland, Auckland, New Zealand Henri Prade IRIT, CNRS and Université de Toulouse III, Toulouse, France

Sebastian Link University of Auckland, Auckland, New Zealand Henri Prade IRIT, CNRS and Université de Toulouse III, Toulouse, France CDMTCS Research Report Series Relational Database Schema Design for Uncertain Data Sebastian Link University of Auckland, Auckland, New Zealand Henri Prade IRIT, CNRS and Université de Toulouse III, Toulouse,

More information

CSE 544: Principles of Database Systems

CSE 544: Principles of Database Systems CSE 544: Principles of Database Systems Conjunctive Queries (and beyond) CSE544 - Spring, 2013 1 Outline Conjunctive queries Query containment and equivalence Query minimization Undecidability for relational

More information

CS632 Notes on Relational Query Languages I

CS632 Notes on Relational Query Languages I CS632 Notes on Relational Query Languages I A. Demers 6 Feb 2003 1 Introduction Here we define relations, and introduce our notational conventions, which are taken almost directly from [AD93]. We begin

More information

Countability Sets of Measure Zero Random Reals Normal Numbers There s a Bear in There. The Real Thing. Paul McCann. Wednesday, 3 August, 2011

Countability Sets of Measure Zero Random Reals Normal Numbers There s a Bear in There. The Real Thing. Paul McCann. Wednesday, 3 August, 2011 Wednesday, 3 August, 2011 Outline Countability 1 Countability 2 3 4 5 Counting with the Natural Numbers A set S is countable if there is a function f from N onto S. That is, we can label each s S by a

More information

Computing Query Probability with Incidence Algebras

Computing Query Probability with Incidence Algebras Computing Query Probability with Incidence Algebras Nilesh Dalvi Yahoo Research Santa Clara, CA, USA ndalvi@yahoo-inc.com Karl Schnaitter UC Santa Cruz Santa Cruz, CA, USA karlsch@soe.ucsc.edu Dan Suciu

More information

Bounded Query Rewriting Using Views

Bounded Query Rewriting Using Views Bounded Query Rewriting Using Views Yang Cao 1,3 Wenfei Fan 1,3 Floris Geerts 2 Ping Lu 3 1 University of Edinburgh 2 University of Antwerp 3 Beihang University {wenfei@inf, Y.Cao-17@sms}.ed.ac.uk, floris.geerts@uantwerpen.be,

More information

Expiration of Historical Databases

Expiration of Historical Databases Expiration of Historical Databases (Extended Abstract) David Toman Department of Computer Science, University of Waterloo Waterloo, Ontario, Canada N2L 3G1 E-mail: david@uwaterloo.ca Abstract We present

More information

1 CHAPTER 1 INTRODUCTION 1.1 Background One branch of the study of descriptive complexity aims at characterizing complexity classes according to the l

1 CHAPTER 1 INTRODUCTION 1.1 Background One branch of the study of descriptive complexity aims at characterizing complexity classes according to the l viii CONTENTS ABSTRACT IN ENGLISH ABSTRACT IN TAMIL LIST OF TABLES LIST OF FIGURES iii v ix x 1 INTRODUCTION 1 1.1 Background : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1 1.2 Preliminaries

More information

The Meaning of Entity-Relationship Diagrams, part II

The Meaning of Entity-Relationship Diagrams, part II (1/23) Logic The Meaning of Entity-Relationship Diagrams, part II Enrico Franconi franconi@inf.unibz.it http://www.inf.unibz.it/ franconi Faculty of Computer Science, Free University of Bozen-Bolzano (2/23)

More information

From Constructibility and Absoluteness to Computability and Domain Independence

From Constructibility and Absoluteness to Computability and Domain Independence From Constructibility and Absoluteness to Computability and Domain Independence Arnon Avron School of Computer Science Tel Aviv University, Tel Aviv 69978, Israel aa@math.tau.ac.il Abstract. Gödel s main

More information

Annotated revision programs

Annotated revision programs Annotated revision programs Victor Marek Inna Pivkina Miros law Truszczyński Department of Computer Science, University of Kentucky, Lexington, KY 40506-0046 marek inna mirek@cs.engr.uky.edu Abstract Revision

More information

A Knowledge-Based Approach to Entity Resolution

A Knowledge-Based Approach to Entity Resolution A Knowledge-Based Approach to Entity Resolution Klaus-Dieter Schewe Software Competence Center Hagenberg and Johannes-Kepler-University Linz Austria kd.schewe@scch.at, kd.schewe@faw.at Qing Wang Research

More information

Relational Calculus. Dr Paolo Guagliardo. University of Edinburgh. Fall 2016

Relational Calculus. Dr Paolo Guagliardo. University of Edinburgh. Fall 2016 Relational Calculus Dr Paolo Guagliardo University of Edinburgh Fall 2016 First-order logic term t := x (variable) c (constant) f(t 1,..., t n ) (function application) formula ϕ := P (t 1,..., t n ) t

More information

Advanced DB CHAPTER 5 DATALOG

Advanced DB CHAPTER 5 DATALOG Advanced DB CHAPTER 5 DATALOG Datalog Basic Structure Syntax of Datalog Rules Semantics of Nonrecursive Datalog Safety Relational Operations in Datalog Recursion in Datalog The Power of Recursion A More

More information

Correlated subqueries. Query Optimization. Magic decorrelation. COUNT bug. Magic example (slide 2) Magic example (slide 1)

Correlated subqueries. Query Optimization. Magic decorrelation. COUNT bug. Magic example (slide 2) Magic example (slide 1) Correlated subqueries Query Optimization CPS Advanced Database Systems SELECT CID FROM Course Executing correlated subquery is expensive The subquery is evaluated once for every CPS course Decorrelate!

More information

Logic. (Propositional Logic)

Logic. (Propositional Logic) Logic (Propositional Logic) 1 REPRESENTING KNOWLEDGE: LOGIC Logic is the branch of mathematics / philosophy concerned with knowledge and reasoning Aristotle distinguished between three types of arguments:

More information

Pseudorandom functions and permutations

Pseudorandom functions and permutations Introduction Pseudorandom functions and permutations 15-859I Spring 2003 Informally, a Pseudorandom function family (PRF is a collection of functions which are indistinguishable from random functions PRFs

More information

arxiv: v2 [cs.db] 16 Jun 2008

arxiv: v2 [cs.db] 16 Jun 2008 Conditioning Probabilistic Databases Christoph Koch Department of Computer Science Cornell University Ithaca, NY 14853, USA koch@cs.cornell.edu Dan Olteanu Computing Laboratory Oxford University Oxford,

More information

The semijoin algebra and the guarded fragment

The semijoin algebra and the guarded fragment The semijoin algebra and the guarded fragment Dirk Leinders (dirk.leinders@luc.ac.be) Limburgs Universitair Centrum, Universitaire Campus, Diepenbeek, Belgium Maarten Marx (marx@science.uva.nl) Informatics

More information

The Query Containment Problem: Set Semantics vs. Bag Semantics. Phokion G. Kolaitis University of California Santa Cruz & IBM Research - Almaden

The Query Containment Problem: Set Semantics vs. Bag Semantics. Phokion G. Kolaitis University of California Santa Cruz & IBM Research - Almaden The Query Containment Problem: Set Semantics vs. Bag Semantics Phokion G. Kolaitis University of California Santa Cruz & IBM Research - Almaden PROBLEMS Problems worthy of attack prove their worth by hitting

More information

Y. Xiang, Inference with Uncertain Knowledge 1

Y. Xiang, Inference with Uncertain Knowledge 1 Inference with Uncertain Knowledge Objectives Why must agent use uncertain knowledge? Fundamentals of Bayesian probability Inference with full joint distributions Inference with Bayes rule Bayesian networks

More information

02 Propositional Logic

02 Propositional Logic SE 2F03 Fall 2005 02 Propositional Logic Instructor: W. M. Farmer Revised: 25 September 2005 1 What is Propositional Logic? Propositional logic is the study of the truth or falsehood of propositions or

More information

α-acyclic Joins Jef Wijsen May 4, 2017

α-acyclic Joins Jef Wijsen May 4, 2017 α-acyclic Joins Jef Wijsen May 4, 2017 1 Motivation Joins in a Distributed Environment Assume the following relations. 1 M[NN, Field of Study, Year] stores data about students of UMONS. For example, (19950423158,

More information

Finding Frequent Items in Probabilistic Data

Finding Frequent Items in Probabilistic Data Finding Frequent Items in Probabilistic Data Qin Zhang, Hong Kong University of Science & Technology Feifei Li, Florida State University Ke Yi, Hong Kong University of Science & Technology SIGMOD 2008

More information

Differential Privacy

Differential Privacy CS 380S Differential Privacy Vitaly Shmatikov most slides from Adam Smith (Penn State) slide 1 Reading Assignment Dwork. Differential Privacy (invited talk at ICALP 2006). slide 2 Basic Setting DB= x 1

More information

cse541 LOGIC FOR COMPUTER SCIENCE

cse541 LOGIC FOR COMPUTER SCIENCE cse541 LOGIC FOR COMPUTER SCIENCE Professor Anita Wasilewska Spring 2015 LECTURE 2 Chapter 2 Introduction to Classical Propositional Logic PART 1: Classical Propositional Model Assumptions PART 2: Syntax

More information

A Magic Approach to Optimizing Incremental Relational Expressions

A Magic Approach to Optimizing Incremental Relational Expressions A Magic Approach to Optimizing Incremental Relational Expressions Andreas Behrend Institute of Computer Science III, University of Bonn Römerstr. 164, D-53117 Bonn, Germany behrend@cs.uni-bonn.de ABSTRACT

More information

Lower Bound Techniques for Multiparty Communication Complexity

Lower Bound Techniques for Multiparty Communication Complexity Lower Bound Techniques for Multiparty Communication Complexity Qin Zhang Indiana University Bloomington Based on works with Jeff Phillips, Elad Verbin and David Woodruff 1-1 The multiparty number-in-hand

More information