Epistemic and Statistical Probabilistic Ontologies

Similar documents
Logic Programming Techniques for Reasoning with Probabilistic Ontologies

Semantics and Inference for Probabilistic Ontologies

A Distribution Semantics for Probabilistic Ontologies

Probabilistic Description Logics under the Distribution Semantics

Probabilistic Description Logics under the Distribution Semantics

Distributed Parameter Learning for Probabilistic Ontologies

Bayesian Description Logics

Probabilistic Ontologies in Datalog+/-

arxiv: v3 [cs.ai] 29 Jan 2015

Probabilistic Datalog+/- under the Distribution Semantics

The PITA System for Logical-Probabilistic Inference

PR-OWL Bridging the gap to OWL semantics

A Possibilistic Extension of Description Logics

Reasoning in the Description Logic BEL using Bayesian Networks

Probabilistic Logic Languages

Semantics and Inference for Probabilistic Description Logics

Tight Complexity Bounds for Reasoning in the Description Logic BEL

Optimization Techniques for Fuzzy Description Logics

Semantic Web Uncertainty Management

Probabilistic Hybrid Knowledge Bases under the Distribution Semantics

MCINTYRE: A Monte Carlo Algorithm for Probabilistic Logic Programming

Inference in Probabilistic Ontologies with Attributive Concept Descriptions and Nominals URSW 2008

Optimizing Inference for Probabilistic Logic Programs Exploiting Independence and Exclusiveness

Probabilistic Ontologies: Logical Approach

A Crisp Representation for Fuzzy SHOIN with Fuzzy Nominals and General Concept Inclusions

Tractable Reasoning with Bayesian Description Logics

Probabilistic Inductive Constraint Logic

How to Contract Ontologies

Structure Learning of Probabilistic Logic Programs by MapReduce

On Correspondences between Probabilistic First-Order and Description Logics

On Integrating Ontologies with Relational Probabilistic Models

Completing Description Logic Knowledge Bases using Formal Concept Analysis

Revision of DL-Lite Knowledge Bases

Learning Terminological Naïve Bayesian Classifiers Under Different Assumptions on Missing Knowledge

Improved Algorithms for Module Extraction and Atomic Decomposition

Modular Reuse of Ontologies: Theory and Practice

An Introduction to Description Logics

TURAMBAR: An Approach to Deal with Uncertainty in Semantic Environments

Inference in Probabilistic Ontologies with Attributive Concept Descriptions and Nominals

Next Steps for Description Logics of Minimal Knowledge and Negation as Failure

UNCERTAINTY. In which we see what an agent should do when not all is crystal-clear.

Chapter 2 Background. 2.1 A Basic Description Logic

Pengju XJTU 2016

Mapping Validation by Probabilistic Reasoning

HANDLING UNCERTAINTY IN INFORMATION EXTRACTION

CSC384: Intro to Artificial Intelligence Reasoning under Uncertainty-II

A Formal Investigation of Mapping Language for Terminological Knowledge

Root Justifications for Ontology Repair

Just: a Tool for Computing Justifications w.r.t. ELH Ontologies

Reasoning with Uncertainty

DESCRIPTION LOGICS. Paula Severi. October 12, University of Leicester

Revising General Knowledge Bases in Description Logics

A New Approach to Knowledge Base Revision in DL-Lite

An Introduction to Description Logic III

Probabilistic and Truth-Functional Many-Valued Logic Programming

Representation of Parsimonious Covering Theory in OWL-DL

Modeling and reasoning with uncertainty

A Zadeh-Norm Fuzzy Description Logic for Handling Uncertainty: Reasoning Algorithms and the Reasoning System

I N F S Y S R E S E A R C H R E P O R T AN OVERVIEW OF UNCERTAINTY AND VAGUENESS IN DESCRIPTION LOGICS INSTITUT FÜR INFORMATIONSSYSTEME

Logic and Reasoning in the Semantic Web (part II OWL)

Contraction and Revision over DL-Lite TBoxes

Symmetry Breaking for Relational Weighted Model Finding

Probabilistic and Bayesian Analytics

Bayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins.

OntoMerge: A System for Merging DL-Lite Ontologies

Using the Dempster-Shafer theory of evidence to resolve ABox inconsistencies. Andriy Nikolov Victoria Uren Enrico Motta Anne de Roeck

Encoding formulas with partially constrained weights in a possibilistic-like many-sorted propositional logic

Reasoning with Inconsistent and Uncertain Ontologies

Structured Descriptions & Tradeoff Between Expressiveness and Tractability

22c:145 Artificial Intelligence

CS:4420 Artificial Intelligence

OntoMerge: A System for Merging DL-Lite Ontologies

Reasoning Under Uncertainty: Introduction to Probability

Uncertainty. Logic and Uncertainty. Russell & Norvig. Readings: Chapter 13. One problem with logical-agent approaches: C:145 Artificial

Role-depth Bounded Least Common Subsumers by Completion for EL- and Prob-EL-TBoxes

Probabilistic Reasoning

Reasoning the FMA Ontologies with TrOWL

The Bayesian Ontology Language BEL

How to determine if a statement is true or false. Fuzzy logic deal with statements that are somewhat vague, such as: this paint is grey.

Uncertainty and Bayesian Networks

PROBABILITY AND INFERENCE

Knowledge Base Revision in Description Logics

Optimizing the Crisp Representation of the Fuzzy Description Logic SROIQ

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

Artificial Intelligence Programming Probability

Fuzzy Description Logics

Relationships between Probabilistic Description and First-Order Logics

Probabilistic Reasoning. (Mostly using Bayesian Networks)

Tableau-Based ABox Abduction for Description Logics: Preliminary Report

Artificial Intelligence

arxiv: v1 [cs.ai] 4 Oct 2011

Probabilistic Logical Inference On the Web

PROBABILISTIC LOGIC. J. Webster (ed.), Wiley Encyclopedia of Electrical and Electronics Engineering Copyright c 1999 John Wiley & Sons, Inc.

Efficient approximation in DL-Lite of OWL 2 ontologies

Uncertainty and Rules

Model Checking. Boris Feigin March 9, University College London

The Consistency of the CADIAG-2 Knowledge Base: A Probabilistic Approach

Pengju

A Refined Tableau Calculus with Controlled Blocking for the Description Logic SHOI

Knowledge Representation and Description Logic Part 2

Transcription:

Epistemic and Statistical Probabilistic Ontologies Fabrizio Riguzzi, Elena Bellodi, Evelina Lamma, Riccardo Zese ENDIF University of Ferrara, Italy fabrizio.riguzzi@unife.it, elena.bellodi@unife.it, evelina.lamma@unife.it, riccardo.zese@student.unife.it Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 1 / 20

Outline 1 Representing Uncertainty 2 Probabilistic Logic Languages 3 Distribution Semantics 4 Probabilistic Ontologies under DISPONTE semantics 5 Epistemic vs Statistical Probability 6 Inference in Probabilistic Ontologies 7 Examples 8 Query answering for Probabilistic OWL DL Ontologies 9 Related works 10 Conclusions 11 References Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 2 / 20

Representing Uncertainty Uncertainty Representation Semantic Web Incompleteness or uncertainty are intrinsic of much information on the World Wide Web Most common approaches: probability theory, Fuzzy Logic Logic Programming Uncertain relationships among entities characterize many complex domains Most common approaches: probability theory Distribution Semantics (Sato,1995)[6] It underlies Probabilistic Logic Languages (ICL,PRISM, ProbLog, LPADs),... They define a probability distribution over normal logic programs The distribution is extended to a joint distribution over worlds and queries The probability of a query is obtained from this distribution by marginalization Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 3 / 20

Probabilistic Logic Languages Probabilistic Program Example (ProbLog) Example: Program T, development of an epidemic or pandemic, if somebody has the flu and the climate is cold. C 1 = epidemic : flu(x), epid(x), cold. C 2 = pandemic : flu(x), not epid(x), pand(x), cold. C 3 = flu(david). C 4 = flu(robert). F 1 = 0.7 :: cold. F 2 = 0.6 :: epid(x). F 3 = 0.3 :: pand(x). Distributions over facts Worlds obtained by selecting or not every grounding of each probabilistic fact Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 4 / 20

Distribution Semantics Distribution Semantics Case of no function symbols: finite set of groundings of each probabilistic fact F a ProbLog fact p :: F is interpreted as F : p null : 1 p. Atomic choice: selection of a value for a grounding of a probabilistic fact F: (F i, θ j, k), where θ j is a substitution grounding F i and k {0, 1}. Composite choice κ: consistent set of atomic choices κ = {(F 2, {X/david}, 1), (F 2, {X/david}, 0)} not consistent Boolean random variable X ij, for each (F i, θ j, k) Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 5 / 20

Distribution Semantics Distribution Semantics Selection σ: a total composite choice (one atomic choice for every grounding of each probabilistic fact) σ = {(F 1, {}, 1), (F 2, {X/david}, 1), (F 3, {X/david}, 1), (F 2, {X/robert}, 0), (F 3, {X/robert}, 0)} A selection σ identifies a logic program w σ called world: w σ = T C {F i θ j (F i, θ j, 1) σ}, where T C is the set of certain rules of T (a normal logic program) The probability of w σ is P(w σ ) = P(σ) = (F i,θ j,1) κ p i (F i,θ j,0) κ (1 p i) For the example above: P(w σ ) = 0.7 0.6 0.3 (1 0.6) (1 0.3) Finite set of worlds: W T = {w 1,..., w m } P W distribution over worlds: w W T P(w) = 1 Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 6 / 20

Distribution Semantics Distribution Semantics Conditional probability of a query Q: P(Q w) = 1 if w = Q and 0 otherwise Joint distribution of the worlds and queries P(Q,w): P(Q, w) = P(Q w)p(w) P(Q) = w W T P(Q, w) = w W T P(Q w)p(w) = w W T :w =Q P(w) In the example T has 5 Boolean random variables F 1 X 11 (1 grounding) F 2 X 21 and X 22 (2 groundings) F 3 X 31 and X 32 (2 groundings) and thus 32 worlds. The query epidemic is true in 5 of them. By the sum of their probability, we otain P(epidemic) = 0.588. Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 7 / 20

Probabilistic Ontologies under DISPONTE semantics DISPONTE: DIstribution Semantics for Probabilistic ONTologiEs Idea: annotate each axiom of an ontology with a probability and assume that each axiom is independent of the others (see URSW2011) DISPONTE semantics exploits the translation of a probabilistic ontology into a first order logic theory A probabilistic ontology defines thus a distribution over normal theories (worlds) obtained by including an axiom in a world with a probability given by the annotation The probability of a query is again computed from this distribution with marginalization: P(Q) = w P(Q, w) = w P(Q w)p(w) = w:w =Q P(w) What s new w.r.t. URSW2011? Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 8 / 20

Probabilistic Ontologies under DISPONTE semantics Probabilistic Ontologies under DISPONTE semantics We can specify two kinds of probability for OWL DL axioms, under the DISPONTE semantics: 1 p :: e E : epistemic probability where p [0, 1] and E is any (TBox, RBox or ABox) axiom p represents our degree of belief in axiom E e.g., p :: e C D represents the fact that we believe in the truth of C D with probability p. 2 p :: s E : statistical probability where p [0, 1] and E is a TBox or RBox axiom p represents information regarding random individuals from certain populations e.g., p :: s C D means instead that a random individual of class C has probability p of belonging to D. 3 Any unannotated axiom E is certain. Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 9 / 20

Observations Epistemic vs Statistical Probability 1 Epistemic probability p :: e C D represents the fact that we believe in the truth of C D with probability p If two individuals i and j belong to class C, the probability that they both belong to D under the epistemic probability is p 2 Statistical probability p :: s C D means that a random individual of class C has probability p of belonging to D If two individuals i and j belong to class C, thus the probability that they both belong to D under statistical probability interpretation is p p. Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 10 / 20

Inference in Probabilistic Ontologies Explanations for a query Each atomic choice is a triple (F i, θ j, k) F i is the formula obtained by translating the i-th axiom E i θ j is a substitution k {0, 1}. k indicates whether (F i, θ j, k) is chosen to be included in a world (k = 1) or not (k = 0) If F i is obtained from an unannotated axiom, then θ j = and k = 1 If F i is obtained from an axiom of the form p :: e E i, then θ j = If F i is obtained from an axiom of the form p :: s E i, then θ j instantiates the variables occurring in the logical translation of axiom E i. Boolean random variables (X ij ) are, again, associated to (instantiations of) logical formulas (F i ) by substitution θ j Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 11 / 20

Inference in Probabilistic Ontologies Inference and Query answering Similarly to the case of probabilistic logic programming, the probability of a query Q given a probabilistic ontology O can be computed by first finding the explanations for Q in O Explanation: subset of axioms of O that is sufficient for entailing Q All the explanations for Q must be found, corresponding to all ways of proving Q Probability of Q probability of the DNF formula F(Q) = X ij ) ( X ij e E Q (F i,θ j,1) e (F i,θ j,0) e where E Q is the set of explanations and X ij is a random variable with k = 1 and probability p i (and X ij is a random variable with k = 0 and probability (1 p i )) We exploit an underlying DL reasoner for computing explanations, and Binary Decision Diagrams for making these explanations mutually incompatible. Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 12 / 20

Examples Example 1.1 - people+pets ontology fluffy is a Cat with (epistemic) probability 0.4 and tom is a Cat with probability 0.3; Cats are Pets with (epistemic) probability 0.6 0.4 :: e fluffy : Cat (1) 0.3 :: e tom : Cat (2) 0.6 :: e Cat Pet (3) Everyone who has a pet animal (hasanimal.pet) is a PetOwner, kevin has two animals, fluffy and tom hasanimal.pet PetOwner (4) (kevin, fluffy) : hasanimal (5) (kevin, tom) : hasanimal (6) Q = kevin : PetOwner has two (mutually exclusive) explanations: {(1), (3), not (2)} and {(2),(3)} P(Q) = 0.4 0.6 (1 0.3) + 0.3 0.6 = 0.348 Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 13 / 20

Examples Example 1.2 - people+pets ontology If we replace epistemic with statistical probability in axiom: 0.6 :: s Cat Pet (7) then for Q = kevin : PetOwner we have instances of axiom (7) in (mutually exclusive) explanations: {(1), (7)/fluffy, not (2)}, {(1), (7)/fluffy, (2), not ((7)/tom) } and {(2),(7)/tom} P(Q) = 0.4 0.6 (1 0.3)+0.4 0.6 0.3 (1 0.6)+0.3 0.6 = 0.3768 Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 14 / 20

Query answering for Probabilistic OWL DL Ontologies BUNDLE system Binary decision diagrams for Uncertain reasoning on Description Logic theories BUNDLE performs inference over probabilistic OWL DL ontologies that follow the DISPONTE semantics It exploits an underlying ontology reasoner able to return all explanations for a query, such as Pellet [7] Explanations for a query in the form of a set of sets of axioms Pellet has been extended to record not only used axioms, but their instantiations too, in order to correctly handle statistical probability BUNDLE performs a double loop over the set of explanations and over the set of (instantiated) axioms in each explanation, in which it builds a BDD representing the set of explanations JavaBDD library for the manipulation of BDDs BUNDLE has been implemented in Java and will be available for download from http://sites.unife.it/bundle Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 15 / 20

Related works Related works (Laskey, and da Costa,2005) [4] proposed PR-OWL, an upper ontology that provides a framework for building probabilistic ontologies and allows to use the first-order probabilistic logic MEBN ; instead we tried to provide a minimal extension to DL (Koller et al.,1997) [3] present a probabilistic description logic based on Bayesian networks that deals with statistical terminological knowledge, but, differently from us, does not allow probabilistic assertional knowledge about concept and role instances (Jaeger, 1994)[2] allows assertional knowledge about concept and role instances together with statistical terminological knowledge. We can also represent epistemic information with terminological knowledge. Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 16 / 20

Related works Related works (Ding, and Peng, 2004)[1] propose a probabilistic extension of OWL that admits a translation into Bayesian networks. The semantics assigns a probability distribution P(i) over individuals and a probability to a class C as P(C) = i C P(i), while we assign a probability distribution over theories In (Nilsson, 1986) s probabilistic logic [5]: a probabilistic interpretation Pr defines a probability distribution over the set of interpretations I. The probability of a logic formula φ according to Pr, denoted Pr(φ), is the sum of all Pr(I) such that I I and I = φ while a probabilistic knowledge base may have multiple models that are probabilistic interpretations, a probabilistic ontology under the distribution semantics defines a single distribution over interpretations Worth to mention also alternative approaches to modeling imperfect knowledge in ontologies, based on fuzzy logic Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 17 / 20

Conclusions Conclusions and future works DISPONTE semantics for probabilistic ontologies inspired by the distribution semantics of probabilistic logic programming two ways (epistemic and statistical) to specify the probability of the axioms of an ontology The problem of inference in DISPONTE remains decidable if it was so in the underlying description logic BUNDLE system able to compute the probability of queries from an uncertain OWL DL ontology Computing explanations of a query is exponential in time Computing the probability of a DNF formula of independent Boolean random variables is a #P-complete problem (#P over the number of computed explanations) Future works Extension to treat different degrees of statistical probability, by choosing which variables in the logical translation are subject to instantiation and which not (as proposed by (Halpern,1990])) Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 18 / 20

Conclusions Thanks. Questions? Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 19 / 20

References References I Zhongli Ding and Yun Peng. A probabilistic extension to ontology language OWL. In Hawaii International Conference On System Sciences. IEEE, 2004. Manfred Jaeger. Probabilistic reasoning in terminological logics. In International Conference on Principles of Knowledge Representation and Reasoning, pages 305 316, 1994. Daphne Koller, Alon Y. Levy, and Avi Pfeffer. P-classic: A tractable probablistic description logic. In National Conference on Artificial Intelligence, pages 390 397, 1997. Kathryn B. Laskey and Paulo Cesar G. da Costa. Of starships and klingons: Bayesian logic for the 23rd century. In Conference in Uncertainty in Artificial Intelligence, pages 346 353. AUAI Press, 2005. Nils J. Nilsson. Probabilistic logic. Artif. Intell., 28(1):71 87, 1986. Taisuke Sato. A statistical learning method for logic programs with distribution semantics. In International Conference on Logic Programming, pages 715 729. MIT Press, 1995. Evren Sirin, Bijan Parsia, Bernardo Cuenca Grau, Aditya Kalyanpur, and Yarden Katz. Pellet: A practical OWL-DL reasoner. J. Web Sem., 5(2):51 53, 2007. Riguzzi, Bellodi, Lamma, Zese (ENDIF) DISPONTE 20 / 20