Query Evaluation on Probabilistic Databases. CSE 544: Wednesday, May 24, 2006
|
|
- Noreen Watson
- 5 years ago
- Views:
Transcription
1 Query Evaluation on Probabilistic Databases CSE 544: Wednesday, May 24, 2006
2 Problem Setting Queries: Tables: Review A(x,y) :- Review(x,y), Movie(x,z), z > 1991 name rating p Movie Monkey Love good.5 title year p Twelve Monkeys Monkey Love fair.2 fair.6 poor.9 Answers: Top k Monkey Love title rating p Monkey Love Pl Twelve Monkeys fair.53 Problem: complexity Monkey Love good.42 Monkey Love Pl fair.15 of query evaluation 2
3 Two Problems Fixed schema S, conjunctive query Q(x,y) Query evaluation problem Fix answer tuple (a,b) Given database I, compute Pr(Q(a,b)) Top-k answering problem Fix k > 0 Given database I, find k answer tuples with highest probabilities 3
4 Related Work: DB Cavallo&Pitarelli:1987 Barbara,Garcia-Molina, Porter:1992 Lakshmanan,Leone,Ross&Subrahmanian:1997 Fuhr&Roellke:1997 Dalvi&S:2004 Widom:2005 4
5 Related Work: Logic Query reliability [Gradel,Gurevitch,Hirsch 98] Degrees of belief [Bacchus,Grove,Halpern,Koller 96] Probabilistic Logic [Nielson] Probabilistic model checking [Kwiatkowska 02] Probabilistic Relational Model [Taskar,Abbeel,Koller 02] 5
6 Probabilistic Database Schema S, Domain D, Set of instances Inst Definition Probabilistic database is a probability distribution Pr : Inst [0,1], I Pr[I] = 1 If Pr[I] > 0 then I is called possible world 6
7 Probabilistic Database Representation: Independent tuples: I-database DB over some schema S i Independent and disjoint tuples: ID-database DB over some schema S id Semantics: DB means probability distribution Pr over schema S 7
8 Independent Events A tuple is in the database with probability p Any two tuples are independent events 8
9 Representation I-Databases Reviews i (M,S,p) Movie Score P m42 good p 1 m99 good p 2 Reviews(M,S) m76 poor p 3 Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor Mov Scor m42 good m99 good m42 good m76 poor m42 good m m42 good Pr[I 1 ]= (1-p 1 )*(1-p 2 )*(1-p 3 ) Pr[I 4 ]= m99 good m76 poor p 1 *p 2 *(1-p 3 ) Pr[I 1 ] + Pr[I 2 ] Pr[I 8 ] = 1 m76 poor Pr[I 8 ]= m99 good m76 poor p 1 *p 2 *p 3 Possible worlds semantics 9
10 Disjoint Events Needed in Many-to-1 matchings Possible values for attributes [Barbara 92] Name Age Name Age P John Mary (0.3) 43 (0.7) John John Mary
11 ID-Databases Activities id Time d Activity P t walk p 1 t run p 2 Activities t+1 walk p 3 Time Act Time Act Time Act Pr[I 1 ]= t (1-p 1 -p 2 )*(1-p 3 ) walk Pr[I 3 ]= t run p 2 *(1-p 3 ) Time Act t+1 walk Pr[I 5 ]= Time Pr[I 1 ] + Pr[I 2 ] Pr[I 6 ] = 1 11 t Act walk t+1 walk p 1 *p 3 Time Act t run t+1 walk
12 ID subsumes I Reviews id Movie d Score d P m42 good p 1 m99 good p 2 m76 poor p 3 = Reviews i Movie Score P m42 good p 1 m99 good p 2 m76 poor p 3 Note: Movie Score P Reviews id m42 good p 1 m99 good p 2 means all tuples are m76 poor p 3 disjoint 12
13 Queries Syntax: conjunctive queries over schema S Q(y) :- Movie(x,y), Review(x,z), z >= 3 Movie i id year P m m m m Review i mid rating p m m m m m
14 Two Query Semantics Possible answer sets Given set A: Pr[{t I Q(t)} = A] Used for views This Possible tuples talk Given tuple t: Pr[I Q(t)] Used for query evaluation and top-k 14
15 Query Semantics id year p 1 id year p 2 id year p 3 id year p 4 m m m m m m m m m m Q(y) :- Movie(x,y), Review(x,z) year p Tuple probabilities 1935 p 2 + p 3 = p 1 + p 3 = p 3 = top k 15
16 Summary on Data Model Data Model: Semantics = possible worlds Syntax = I-databases or ID-databases Queries: Syntax = unchanged (conjunctive queries) Semantics = tuple probabilities 16
17 Problem Definition Fix schema S, query Q, answer tuple t Problem: given I/ID-database DB, compute Pr[I Q(t)] notation: Pr[Q(t)] Conventions: For upper bounds (P or #P): probabilities are rationals For lower bounds (#P): probabilities are 1/2 17
18 Query Evaluation on I-Databases Outline Intuition Extensional plans: PTIME case Hard queries: #P-complete case Dichotomy Theorem 18
19 Q(y) :- Movie(x,y), Review(x,z) Movie i Intuition Review i Answer id year p m p 1 m p 2 m p 3 m p 4 mid rate p m42 4 q 1 m42 2 q 2 m42 3 q 3 m99 1 q 4 m99 3 q 5 m76 5 q 6 Year 1995 p p 1 (1 - (1 - q 1 ) (1 - q 2 ) (1 - q 3 )) (1 - p 2 (1 - (1 - q 4 ) (1 - q 5 ))) (1 - p 3 q 6 ) 19
20 I-Extensional Plans [Barbara92,Lakshmanan97] Add p Join p = p 1 * p 2 Projection p = 1-(1-p 1 )(1-p 2 )...(1-p n ) Selection σ p = p Note: data complexity is PTIME 20
21 Extensional Query Plans x x pq x 1-(1-p1)(1-p2)(1-p3) x p σ x p x q x x x p1 p2 p3 x p 21
22 Extensional Query Plans Each tuple t has a probability t.p Algebra operators compute t.p Data complexity: PTIME 22
23 (1-pq 1 )(1-pq 2 )(1-pq 3 ) Q(y) :- Movie(x,y), Review(x,z) 1995 m1 pq m1 p(1-(1-q 1 )(1-q 2 )(1-q 3 )) 1995 m1 pq m1 pq 3 m1 1 - (1-q 1 )(1-q 2 )(1-q 3 ) Movie 1995 p Review m1 q 1 Movie Review m1 q p m1 q 1 m1 q 3 INCORRECT! 23 m1 q 2 m1 q 3 CORRECT
24 #P-Complete Queries R i S T i A p A B B p p 1 q 1 p 2 q 2 p 3 q 3 p 4 q 4 Q bad :- R i (x), S(x,y), T i (y) Theorem: Data complexity is #P-complete
25 Proof: Theorem [Provan&Ball83] Counting the number of satisfying assignments for bipartite DNF is #P-complete R i S T i Reduction: A p x 1 1/2 A B x 2 y 3 B p y 1 1/2 x 2 y 3 V x 1 y 2 V x 4 y 3 V x 3 y 1 x 2 1/2 x 1 y 2 y 2 1/2 x 3 1/2 x 4 y 3 y 3 1/2 x 4 1/2 x 3 y 1 Q bad :- R i (x), S(x,y), T i (y) 25
26 I-Dichotomy Q = boolean conjunctive query Definition 1. For each variable x: goals(x) = set of goals that contain x Definition 2. Q is hierarchical if forall x, y: (a) goals(x) goals(y) =, or (b) goals(x) goals(y), or (c) goals(y) goals(x)
27 Q :- R(x),S(x,y),T(x,y,z),K(x,v) x y z S R T v K hierarchical Q :- R(x), S(x,y), T(y) x y non-hierarchical R S T 27
28 [Dalvi&S. 04] I-Dichotomy Schema S i = {R 1i, R 2i,..., R mi } Theorem Let Q = conjunctive query w/o self-joins. Then one of the following holds: Q is in PTIME Q has a correct extensional plan or: Q is hierarchical Q is #P-complete Q has subgoals R(x,...),S(x,y,...),T(y,...) 28
29 Proof Lemma 1. If Q is non-hierarchical, then #P-complete Proof: v x y R S T K z Q :- R i (v,x), S i (x,y), T i (y,z), K i (z) rest is like for Q bad 29
30 Proof Lemma 2. If Q is hierarchical, then PTIME Proof: Case 1: has no root Pr(Q) = Pr(Q 1 ) Pr(Q 2 ) Pr(Q 3 ) This is extensional join 30
31 Proof Case 2: has root x x Dom={a 1, a 2,..., a n } Pr(Q) = 1 - (1-Pr(Q(a 1 /x))(1-pr(q(a 2 /x))...(1-pr(q(a n /x))) This is an extensional projection: QED 31
32 Query Evaluation on ID-Databases ID-extensional plans #P-complete queries Dichotomoy Theorem 32
33 Extensional Plans for ID-DBs Only difference: two kinds of projections: independent 1-(1-p 1 )...(1-p n ) disjoint p p n 33
34 #P-Complete Queries Q 1 :- R i (x), S i (x,y), T i (y) Q 2 :- R d (x d,y), S d (y d ) Q 3 :- R d (x d,y), S d (z d,y) 34
35 [Dalvi&S. 04] I-DB Dichotomy Schema S id s.t. each table is either R i or R id Theorem Let Q = conjunctive query w/o self-joins. Then one of the following holds: Q is in PTIME Q has a correct extensional plan or: Q is #P-complete Q has one of Q 1, Q 2, Q 3 as subqueries 35
36 Extensions Extensions of the dichotomoy theorem exists for: Mixed schemas (some relations are deterministic) Functional dependencies 36
37 Summary on Query Evaluation Extensional plans: popular, efficient, BUT Equivalent plans lead to different results Some queries admit correct plans Some simple queries: #P-complete complexity Dichotomy theorem Future work: remove no-self-join restriction 37
38 Conclusions Strong motivation from practical applications Merge query and search technologies Probabilistic DB s are hard! Hacks don t work (yet). Need principled approach. 38
39 Thank you! Questions?
A Dichotomy. in in Probabilistic Databases. Joint work with Robert Fink. for Non-Repeating Queries with Negation Queries with Negation
Dichotomy for Non-Repeating Queries with Negation Queries with Negation in in Probabilistic Databases Robert Dan Olteanu Fink and Dan Olteanu Joint work with Robert Fink Uncertainty in Computation Simons
More informationTopics in Probabilistic and Statistical Databases. Lecture 2: Representation of Probabilistic Databases. Dan Suciu University of Washington
Topics in Probabilistic and Statistical Databases Lecture 2: Representation of Probabilistic Databases Dan Suciu University of Washington 1 Review: Definition The set of all possible database instances:
More informationDatabase design and implementation CMPSCI 645
Database design and implementation CMPSCI 645 Lectures 20: Probabilistic Databases *based on a tutorial by Dan Suciu Have we seen uncertainty in DB yet? } NULL values Age Height Weight 20 NULL 200 NULL
More informationQueries and Materialized Views on Probabilistic Databases
Queries and Materialized Views on Probabilistic Databases Nilesh Dalvi Christopher Ré Dan Suciu September 11, 2008 Abstract We review in this paper some recent yet fundamental results on evaluating queries
More informationDatabase Design and Implementation
Database Design and Implementation CS 645 Data provenance Provenance provenance, n. The fact of coming from some particular source or quarter; origin, derivation [Oxford English Dictionary] Data provenance
More informationBrief Tutorial on Probabilistic Databases
Brief Tutorial on Probabilistic Databases Dan Suciu University of Washington Simons 206 About This Talk Probabilistic databases Tuple-independent Query evaluation Statistical relational models Representation,
More informationA Toolbox of Query Evaluation Techniques for Probabilistic Databases
2nd Workshop on Management and mining Of UNcertain Data (MOUND) Long Beach, March 1st, 2010 A Toolbox of Query Evaluation Techniques for Probabilistic Databases Dan Olteanu, Oxford University Computing
More informationEfficient Query Evaluation on Probabilistic Databases
Efficient Query Evaluation on Probabilistic Databases Nilesh Dalvi and Dan Suciu April 4, 2004 Abstract We describe a system that supports arbitrarily complex SQL queries with uncertain predicates. The
More informationThe Trichotomy of HAVING Queries on a Probabilistic Database
VLDBJ manuscript No. (will be inserted by the editor) The Trichotomy of HAVING Queries on a Probabilistic Database Christopher Ré Dan Suciu Received: date / Accepted: date Abstract We study the evaluation
More informationOutline. Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC
Outline Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC Part 5: Completeness of Lifted Inference Part 6: Query Compilation Part 7:
More informationQuery answering using views
Query answering using views General setting: database relations R 1,...,R n. Several views V 1,...,V k are defined as results of queries over the R i s. We have a query Q over R 1,...,R n. Question: Can
More informationGAV-sound with conjunctive queries
GAV-sound with conjunctive queries Source and global schema as before: source R 1 (A, B),R 2 (B,C) Global schema: T 1 (A, C), T 2 (B,C) GAV mappings become sound: T 1 {x, y, z R 1 (x,y) R 2 (y,z)} T 2
More informationProvenance Semirings. Todd Green Grigoris Karvounarakis Val Tannen. presented by Clemens Ley
Provenance Semirings Todd Green Grigoris Karvounarakis Val Tannen presented by Clemens Ley place of origin Provenance Semirings Todd Green Grigoris Karvounarakis Val Tannen presented by Clemens Ley place
More informationEfficient Top-k Query Evaluation on Probabilistic Data
Efficient Top-k Query Evaluation on Probabilistic Data Extended Version TR: #2006-06-05 Christopher Ré, Nilesh Dalvi and Dan Suciu University of Washington Dept. of Computer Science and Engineering E-mail:
More informationProbabilistic Databases
Probabilistic Databases Amol Deshpande University of Maryland Goal Introduction to probabilistic databases Focus on an overview of: Different possible representations Challenges in using them Probabilistic
More informationTractable Lineages on Treelike Instances: Limits and Extensions
Tractable Lineages on Treelike Instances: Limits and Extensions Antoine Amarilli 1, Pierre Bourhis 2, Pierre enellart 1,3 June 29th, 2016 1 Télécom ParisTech 2 CNR CRItAL 3 National University of ingapore
More informationarxiv: v1 [cs.ai] 21 Sep 2014
Oblivious Bounds on the Probability of Boolean Functions WOLFGANG GATTERBAUER, Carnegie Mellon University DAN SUCIU, University of Washington arxiv:409.6052v [cs.ai] 2 Sep 204 This paper develops upper
More informationLogic and Databases. Phokion G. Kolaitis. UC Santa Cruz & IBM Research Almaden. Lecture 4 Part 1
Logic and Databases Phokion G. Kolaitis UC Santa Cruz & IBM Research Almaden Lecture 4 Part 1 1 Thematic Roadmap Logic and Database Query Languages Relational Algebra and Relational Calculus Conjunctive
More informationQuery Evaluation with Soft-Key Constraints
Query Evaluation with Soft-Key Constraints Abhay Jha CSE, University of Washington Seattle WA, 98195-2350 abhaykj@cs.washington.edu Vibhor Rastogi CSE, University of Washington Seattle WA, 98195-2350 vibhor@cs.washington.edu
More informationRepresenting and Querying Correlated Tuples in Probabilistic Databases
Representing and Querying Correlated Tuples in Probabilistic Databases Prithviraj Sen Dept. of Computer Science, University of Maryland, College Park, MD 20742. sen@cs.umd.edu Amol Deshpande Dept. of Computer
More informationSemantics of Ranking Queries for Probabilistic Data and Expected Ranks
Semantics of Ranking Queries for Probabilistic Data and Expected Ranks Graham Cormode AT&T Labs Feifei Li FSU Ke Yi HKUST 1-1 Uncertain, uncertain, uncertain... (Probabilistic, probabilistic, probabilistic...)
More informationCSE 562 Database Systems
Outline Query Optimization CSE 562 Database Systems Query Processing: Algebraic Optimization Some slides are based or modified from originals by Database Systems: The Complete Book, Pearson Prentice Hall
More informationThe Veracity of Big Data
The Veracity of Big Data Pierre Senellart École normale supérieure 13 October 2016, Big Data & Market Insights 23 April 2013, Dow Jones (cnn.com) 23 April 2013, Dow Jones (cnn.com) 23 April 2013, Dow Jones
More informationTowards Deterministic Decomposable Circuits for Safe Queries
Towards Deterministic Decomposable Circuits for Safe Queries Mikaël Monet 1 and Dan Olteanu 2 1 LTCI, Télécom ParisTech, Université Paris-Saclay, France, Inria Paris; Paris, France mikael.monet@telecom-paristech.fr
More informationComputing Query Probability with Incidence Algebras Technical Report UW-CSE University of Washington
Computing Query Probability with Incidence Algebras Technical Report UW-CSE-10-03-02 University of Washington Nilesh Dalvi, Karl Schnaitter and Dan Suciu Revised: August 24, 2010 Abstract We describe an
More informationProbabilistic Databases
Probabilistic Databases Dan Olteanu (Oxford) London, November 2017 The National Archives Probabilistic Databases For the purpose of this introductory talk: Probabilistic data = Relational data + Probabilities
More informationRelational Database Design
Relational Database Design Jan Chomicki University at Buffalo Jan Chomicki () Relational database design 1 / 16 Outline 1 Functional dependencies 2 Normal forms 3 Multivalued dependencies Jan Chomicki
More informationData Dependencies in the Presence of Difference
Data Dependencies in the Presence of Difference Tsinghua University sxsong@tsinghua.edu.cn Outline Introduction Application Foundation Discovery Conclusion and Future Work Data Dependencies in the Presence
More informationTop-k Query Processing in Probabilistic Databases with Non-Materialized Views. Maximilian Dylla Iris Miliaraki Martin Theobald
Top-k Query Processing in Probabilistic Databases with Non-Materialized Views Maximilian Dylla Iris Miliaraki Martin Theobald MPI I 2012 5-002 June 2012 Authors Addresses Maximilian Dylla Max-Planck-Institut
More informationOutline. Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC
Outline Part 1: Motivation Part 2: Probabilistic Databases Part 3: Weighted Model Counting Part 4: Lifted Inference for WFOMC Part 5: Completeness of Lifted Inference Part 6: Query Compilation Part 7:
More informationKnowledge Compilation Meets Database Theory : Compiling Queries to Decision Diagrams
Knowledge Compilation Meets Database Theory : Compiling Queries to Decision Diagrams Abhay Jha Computer Science and Engineering University of Washington, WA, USA abhaykj@cs.washington.edu Dan Suciu Computer
More informationDatabase Applications (15-415)
Database Applications (15-415) Relational Calculus Lecture 6, January 26, 2016 Mohammad Hammoud Today Last Session: Relational Algebra Today s Session: Relational calculus Relational tuple calculus Announcements:
More informationScalable Uncertainty Management
Scalable Uncertainty Management 05 Query Evaluation in Probabilistic Databases Rainer Gemulla Jun 1, 2012 Overview In this lecture Primer: relational calculus Understand complexity of query evaluation
More informationarxiv: v1 [cs.db] 21 Feb 2017
Answering Conjunctive Queries under Updates Christoph Berkholz, Jens Keppeler, Nicole Schweikardt Humboldt-Universität zu Berlin {berkholz,keppelej,schweika}@informatik.hu-berlin.de February 22, 207 arxiv:702.06370v
More informationStarAI Full, 6+1 pages Short, 2 page position paper or abstract
StarAI 2015 Fifth International Workshop on Statistical Relational AI At the 31st Conference on Uncertainty in Artificial Intelligence (UAI) (right after ICML) In Amsterdam, The Netherlands, on July 16.
More informationSchema Refinement: Other Dependencies and Higher Normal Forms
Schema Refinement: Other Dependencies and Higher Normal Forms Spring 2018 School of Computer Science University of Waterloo Databases CS348 (University of Waterloo) Higher Normal Forms 1 / 14 Outline 1
More informationQSQL: Incorporating Logic-based Retrieval Conditions into SQL
QSQL: Incorporating Logic-based Retrieval Conditions into SQL Sebastian Lehrack and Ingo Schmitt Brandenburg University of Technology Cottbus Institute of Computer Science Chair of Database and Information
More informationAnswering Queries from Statistics and Probabilistic Views. Nilesh Dalvi and Dan Suciu, University of Washington.
Answering Queries from Statistics and Probabilistic Views Nilesh Dalvi and Dan Suciu, University of Washington. Background Query answering using Views problem: find answers to a query q over a database
More informationFactorised Representations of Query Results: Size Bounds and Readability
Factorised Representations of Query Results: Size Bounds and Readability Dan Olteanu and Jakub Závodný Department of Computer Science University of Oxford {dan.olteanu,jakub.zavodny}@cs.ox.ac.uk ABSTRACT
More informationData Bases Data Mining Foundations of databases: from functional dependencies to normal forms
Data Bases Data Mining Foundations of databases: from functional dependencies to normal forms Database Group http://liris.cnrs.fr/ecoquery/dokuwiki/doku.php?id=enseignement: dbdm:start March 1, 2017 Exemple
More informationP Q1 Q2 Q3 Q4 Q5 Tot (60) (20) (20) (20) (60) (20) (200) You are allotted a maximum of 4 hours to complete this exam.
Exam INFO-H-417 Database System Architecture 13 January 2014 Name: ULB Student ID: P Q1 Q2 Q3 Q4 Q5 Tot (60 (20 (20 (20 (60 (20 (200 Exam modalities You are allotted a maximum of 4 hours to complete this
More information7 RC Simulates RA. Lemma: For every RA expression E(A 1... A k ) there exists a DRC formula F with F V (F ) = {A 1,..., A k } and
7 RC Simulates RA. We now show that DRC (and hence TRC) is at least as expressive as RA. That is, given an RA expression E that mentions at most C, there is an equivalent DRC expression E that mentions
More informationFundamentos lógicos de bases de datos (Logical foundations of databases)
20/7/2015 ECI 2015 Buenos Aires Fundamentos lógicos de bases de datos (Logical foundations of databases) Diego Figueira Gabriele Puppis CNRS LaBRI About the speakers Gabriele Puppis PhD from Udine (Italy)
More information6 The Relational Data Model: Algebraic operations on tabular data
6 The Relational Data Model: Algebraic operations on tabular data 6.1 Basic idea of relational languages 6.2 Relational Algebra operations 6.3 Relational Algebra: Syntax and Semantics 6.4. More Operators
More informationQuerying Big Data by Accessing Small Data
Querying Big Data by Accessing Small Data Wenfei Fan 1,3 Floris Geerts 2 Yang Cao 1,3 Ting Deng 3 Ping Lu 3 1 University of Edinburgh 2 University of Antwerp 3 Beihang University {wenfei@inf, yang.cao@}.ed.ac.uk,
More informationOn the Semantics and Evaluation of Top-k Queries in Probabilistic Databases
On the Semantics and Evaluation of Top-k Queries in Probabilistic Databases Xi Zhang Jan Chomicki SUNY at Buffalo September 23, 2008 Xi Zhang, Jan Chomicki (SUNY at Buffalo) Topk Queries in Prob. DB September
More informationLightweight Description Logics: DL-Lite A and EL ++
Lightweight Description Logics: DL-Lite A and EL ++ Elena Botoeva 1 KRDB Research Centre Free University of Bozen-Bolzano January 13, 2011 Departamento de Ciencias de la Computación Universidad de Chile,
More informationEfficient query evaluation
Efficient query evaluation Maria Luisa Sapino Set of values E.g. select * from EMPLOYEES where SALARY = 1500; Result of a query Sorted list E.g. select * from CAR-IMAGE where color = red ; 2 Queries as
More informationProfiling Sets for Preference Querying
Profiling Sets for Preference Querying Xi Zhang and Jan Chomicki Department of Computer Science and Engineering University at Buffalo, SUNY, U.S.A. {xizhang,chomicki}@cse.buffalo.edu Abstract. We propose
More informationDatabases Lecture 8. Timothy G. Griffin. Computer Laboratory University of Cambridge, UK. Databases, Lent 2009
Databases Lecture 8 Timothy G. Griffin Computer Laboratory University of Cambridge, UK Databases, Lent 2009 T. Griffin (cl.cam.ac.uk) Databases Lecture 8 DB 2009 1 / 15 Lecture 08: Multivalued Dependencies
More informationApproximate Lineage for Probabilistic Databases. University of Washington Technical Report UW TR: #TR
Approximate Lineage for Probabilistic Databases University of Washington Technical Report UW TR: #TR2008-03-02 Christopher Ré and Dan Suciu {chrisre,suciu}@cs.washington.edu Abstract In probabilistic databases,
More informationComputational Logic. Relational Query Languages with Negation. Free University of Bozen-Bolzano, Werner Nutt
Computational Logic Free University of Bozen-Bolzano, 2010 Werner Nutt (Slides adapted from Thomas Eiter and Leonid Libkin) Computational Logic 1 Queries with All Who are the directors whose movies are
More informationThe Calculus of Computation: Decision Procedures with Applications to Verification. Part I: FOUNDATIONS. by Aaron Bradley Zohar Manna
The Calculus of Computation: Decision Procedures with Applications to Verification Part I: FOUNDATIONS by Aaron Bradley Zohar Manna 1. Propositional Logic(PL) Springer 2007 1-1 1-2 Propositional Logic(PL)
More informationOn Factorisation of Provenance Polynomials
On Factorisation of Provenance Polynomials Dan Olteanu and Jakub Závodný Oxford University Computing Laboratory Wolfson Building, Parks Road, OX1 3QD, Oxford, UK 1 Introduction Tracking and managing provenance
More informationRepresenting and Querying Correlated Tuples in Probabilistic Databases
Representing and Querying Correlated Tuples in Probabilistic Databases Prithviraj Sen Amol Deshpande Department of Computer Science University of Maryland, College Park. International Conference on Data
More informationFunctional Dependencies. Getting a good DB design Lisa Ball November 2012
Functional Dependencies Getting a good DB design Lisa Ball November 2012 Outline (2012) SEE NEXT SLIDE FOR ALL TOPICS (some for you to read) Normalization covered by Dr Sanchez Armstrong s Axioms other
More informationRelational Database: Identities of Relational Algebra; Example of Query Optimization
Relational Database: Identities of Relational Algebra; Example of Query Optimization Greg Plaxton Theory in Programming Practice, Fall 2005 Department of Computer Science University of Texas at Austin
More informationDatabase Applications (15-415)
Database Applications (15-415) Relational Calculus Lecture 5, January 27, 2014 Mohammad Hammoud Today Last Session: Relational Algebra Today s Session: Relational algebra The division operator and summary
More informationLanguage of Propositional Logic
Logic A logic has: 1. An alphabet that contains all the symbols of the language of the logic. 2. A syntax giving the rules that define the well formed expressions of the language of the logic (often called
More informationHomework Assignment 2. Due Date: October 17th, CS425 - Database Organization Results
Name CWID Homework Assignment 2 Due Date: October 17th, 2017 CS425 - Database Organization Results Please leave this empty! 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 2.10 2.11 2.12 2.15 2.16 2.17 2.18 2.19 Sum
More informationOn the Optimal Approximation of Queries Using Tractable Propositional Languages
On the Optimal Approximation of Queries Using Tractable Propositional Languages Robert Fink and Dan Olteanu Oxford University Computing Laboratory {robert.fink,dan.olteanu}@comlab.ox.ac.uk ABSTRACT This
More informationQQL: A DB&IR Query Language
The VLDB Journal (2008) 7:39 56 DOI 0.007/s00778-007-0070- SPECIAL ISSUE PAPER QQL: A DB&IR Query Language Ingo Schmitt Received: 8 September 2006 / Revised: 4 June 2007 / Accepted: 8 July 2007 / Published
More informationAutomata-Theoretic Model Checking of Reactive Systems
Automata-Theoretic Model Checking of Reactive Systems Radu Iosif Verimag/CNRS (Grenoble, France) Thanks to Tom Henzinger (IST, Austria), Barbara Jobstmann (CNRS, Grenoble) and Doron Peled (Bar-Ilan University,
More informationLanguages. Theory I: Database Foundations. Relational Algebra. Paradigms. Projection. Basic Operators. Jan-Georg Smaus (Georg Lausen)
Languages Theory I: Database Foundations Jan-Georg Smaus (Georg Lausen) Paradigms 1. Languages: Relational Algebra Projection Union and Difference Summary 26.7.2011 Relational algebra Relational calculus
More informationProbabilistic Representation and Reasoning
Probabilistic Representation and Reasoning Alessandro Panella Department of Computer Science University of Illinois at Chicago May 4, 2010 Alessandro Panella (CS Dept. - UIC) Probabilistic Representation
More informationDatabase Design: Normal Forms as Quality Criteria. Functional Dependencies Normal Forms Design and Normal forms
Database Design: Normal Forms as Quality Criteria Functional Dependencies Normal Forms Design and Normal forms Design Quality: Introduction Good conceptual model: - Many alternatives - Informal guidelines
More informationJournal of Computer and System Sciences
Journal of Computer and System Sciences 78 (2012) 1026 1044 Contents lists available at SciVerse ScienceDirect Journal of Computer and System Sciences www.elsevier.com/locate/jcss Characterisations of
More informationThe Assignment Axiom (Hoare)
The Assignment Axiom (Hoare) Syntax: V := E Semantics: value of V in final state is value of E in initial state Example: X:=X+ (adds one to the value of the variable X) The Assignment Axiom {Q[E/V ]} V
More informationFirst-order logic Syntax and semantics
1 / 43 First-order logic Syntax and semantics Mario Alviano University of Calabria, Italy A.Y. 2017/2018 Outline 2 / 43 1 Motivation Why more than propositional logic? Intuition 2 Syntax Terms Formulas
More informationProf. Dr. Ralf Möller Dr. Özgür L. Özçep Universität zu Lübeck Institut für Informationssysteme. Tanya Braun (Exercises)
Prof. Dr. Ralf Möller Dr. Özgür L. Özçep Universität zu Lübeck Institut für Informationssysteme Tanya Braun (Exercises) Slides taken from the presentation (subset only) Learning Statistical Models From
More informationPropositional Logic Part 1
Propositional Logic Part 1 Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department University of Wisconsin, Madison [Based on slides from Louis Oliphant, Andrew Moore, Jerry Zhu] slide 1 5 is even
More informationChapter 4: Classical Propositional Semantics
Chapter 4: Classical Propositional Semantics Language : L {,,, }. Classical Semantics assumptions: TWO VALUES: there are only two logical values: truth (T) and false (F), and EXTENSIONALITY: the logical
More informationSebastian Link University of Auckland, Auckland, New Zealand Henri Prade IRIT, CNRS and Université de Toulouse III, Toulouse, France
CDMTCS Research Report Series Relational Database Schema Design for Uncertain Data Sebastian Link University of Auckland, Auckland, New Zealand Henri Prade IRIT, CNRS and Université de Toulouse III, Toulouse,
More informationCSE 544: Principles of Database Systems
CSE 544: Principles of Database Systems Conjunctive Queries (and beyond) CSE544 - Spring, 2013 1 Outline Conjunctive queries Query containment and equivalence Query minimization Undecidability for relational
More informationCS632 Notes on Relational Query Languages I
CS632 Notes on Relational Query Languages I A. Demers 6 Feb 2003 1 Introduction Here we define relations, and introduce our notational conventions, which are taken almost directly from [AD93]. We begin
More informationCountability Sets of Measure Zero Random Reals Normal Numbers There s a Bear in There. The Real Thing. Paul McCann. Wednesday, 3 August, 2011
Wednesday, 3 August, 2011 Outline Countability 1 Countability 2 3 4 5 Counting with the Natural Numbers A set S is countable if there is a function f from N onto S. That is, we can label each s S by a
More informationComputing Query Probability with Incidence Algebras
Computing Query Probability with Incidence Algebras Nilesh Dalvi Yahoo Research Santa Clara, CA, USA ndalvi@yahoo-inc.com Karl Schnaitter UC Santa Cruz Santa Cruz, CA, USA karlsch@soe.ucsc.edu Dan Suciu
More informationBounded Query Rewriting Using Views
Bounded Query Rewriting Using Views Yang Cao 1,3 Wenfei Fan 1,3 Floris Geerts 2 Ping Lu 3 1 University of Edinburgh 2 University of Antwerp 3 Beihang University {wenfei@inf, Y.Cao-17@sms}.ed.ac.uk, floris.geerts@uantwerpen.be,
More informationExpiration of Historical Databases
Expiration of Historical Databases (Extended Abstract) David Toman Department of Computer Science, University of Waterloo Waterloo, Ontario, Canada N2L 3G1 E-mail: david@uwaterloo.ca Abstract We present
More information1 CHAPTER 1 INTRODUCTION 1.1 Background One branch of the study of descriptive complexity aims at characterizing complexity classes according to the l
viii CONTENTS ABSTRACT IN ENGLISH ABSTRACT IN TAMIL LIST OF TABLES LIST OF FIGURES iii v ix x 1 INTRODUCTION 1 1.1 Background : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1 1.2 Preliminaries
More informationThe Meaning of Entity-Relationship Diagrams, part II
(1/23) Logic The Meaning of Entity-Relationship Diagrams, part II Enrico Franconi franconi@inf.unibz.it http://www.inf.unibz.it/ franconi Faculty of Computer Science, Free University of Bozen-Bolzano (2/23)
More informationFrom Constructibility and Absoluteness to Computability and Domain Independence
From Constructibility and Absoluteness to Computability and Domain Independence Arnon Avron School of Computer Science Tel Aviv University, Tel Aviv 69978, Israel aa@math.tau.ac.il Abstract. Gödel s main
More informationAnnotated revision programs
Annotated revision programs Victor Marek Inna Pivkina Miros law Truszczyński Department of Computer Science, University of Kentucky, Lexington, KY 40506-0046 marek inna mirek@cs.engr.uky.edu Abstract Revision
More informationA Knowledge-Based Approach to Entity Resolution
A Knowledge-Based Approach to Entity Resolution Klaus-Dieter Schewe Software Competence Center Hagenberg and Johannes-Kepler-University Linz Austria kd.schewe@scch.at, kd.schewe@faw.at Qing Wang Research
More informationRelational Calculus. Dr Paolo Guagliardo. University of Edinburgh. Fall 2016
Relational Calculus Dr Paolo Guagliardo University of Edinburgh Fall 2016 First-order logic term t := x (variable) c (constant) f(t 1,..., t n ) (function application) formula ϕ := P (t 1,..., t n ) t
More informationAdvanced DB CHAPTER 5 DATALOG
Advanced DB CHAPTER 5 DATALOG Datalog Basic Structure Syntax of Datalog Rules Semantics of Nonrecursive Datalog Safety Relational Operations in Datalog Recursion in Datalog The Power of Recursion A More
More informationCorrelated subqueries. Query Optimization. Magic decorrelation. COUNT bug. Magic example (slide 2) Magic example (slide 1)
Correlated subqueries Query Optimization CPS Advanced Database Systems SELECT CID FROM Course Executing correlated subquery is expensive The subquery is evaluated once for every CPS course Decorrelate!
More informationLogic. (Propositional Logic)
Logic (Propositional Logic) 1 REPRESENTING KNOWLEDGE: LOGIC Logic is the branch of mathematics / philosophy concerned with knowledge and reasoning Aristotle distinguished between three types of arguments:
More informationPseudorandom functions and permutations
Introduction Pseudorandom functions and permutations 15-859I Spring 2003 Informally, a Pseudorandom function family (PRF is a collection of functions which are indistinguishable from random functions PRFs
More informationarxiv: v2 [cs.db] 16 Jun 2008
Conditioning Probabilistic Databases Christoph Koch Department of Computer Science Cornell University Ithaca, NY 14853, USA koch@cs.cornell.edu Dan Olteanu Computing Laboratory Oxford University Oxford,
More informationThe semijoin algebra and the guarded fragment
The semijoin algebra and the guarded fragment Dirk Leinders (dirk.leinders@luc.ac.be) Limburgs Universitair Centrum, Universitaire Campus, Diepenbeek, Belgium Maarten Marx (marx@science.uva.nl) Informatics
More informationThe Query Containment Problem: Set Semantics vs. Bag Semantics. Phokion G. Kolaitis University of California Santa Cruz & IBM Research - Almaden
The Query Containment Problem: Set Semantics vs. Bag Semantics Phokion G. Kolaitis University of California Santa Cruz & IBM Research - Almaden PROBLEMS Problems worthy of attack prove their worth by hitting
More informationY. Xiang, Inference with Uncertain Knowledge 1
Inference with Uncertain Knowledge Objectives Why must agent use uncertain knowledge? Fundamentals of Bayesian probability Inference with full joint distributions Inference with Bayes rule Bayesian networks
More information02 Propositional Logic
SE 2F03 Fall 2005 02 Propositional Logic Instructor: W. M. Farmer Revised: 25 September 2005 1 What is Propositional Logic? Propositional logic is the study of the truth or falsehood of propositions or
More informationα-acyclic Joins Jef Wijsen May 4, 2017
α-acyclic Joins Jef Wijsen May 4, 2017 1 Motivation Joins in a Distributed Environment Assume the following relations. 1 M[NN, Field of Study, Year] stores data about students of UMONS. For example, (19950423158,
More informationFinding Frequent Items in Probabilistic Data
Finding Frequent Items in Probabilistic Data Qin Zhang, Hong Kong University of Science & Technology Feifei Li, Florida State University Ke Yi, Hong Kong University of Science & Technology SIGMOD 2008
More informationDifferential Privacy
CS 380S Differential Privacy Vitaly Shmatikov most slides from Adam Smith (Penn State) slide 1 Reading Assignment Dwork. Differential Privacy (invited talk at ICALP 2006). slide 2 Basic Setting DB= x 1
More informationcse541 LOGIC FOR COMPUTER SCIENCE
cse541 LOGIC FOR COMPUTER SCIENCE Professor Anita Wasilewska Spring 2015 LECTURE 2 Chapter 2 Introduction to Classical Propositional Logic PART 1: Classical Propositional Model Assumptions PART 2: Syntax
More informationA Magic Approach to Optimizing Incremental Relational Expressions
A Magic Approach to Optimizing Incremental Relational Expressions Andreas Behrend Institute of Computer Science III, University of Bonn Römerstr. 164, D-53117 Bonn, Germany behrend@cs.uni-bonn.de ABSTRACT
More informationLower Bound Techniques for Multiparty Communication Complexity
Lower Bound Techniques for Multiparty Communication Complexity Qin Zhang Indiana University Bloomington Based on works with Jeff Phillips, Elad Verbin and David Woodruff 1-1 The multiparty number-in-hand
More information