Data-Driven Logical Reasoning

Similar documents
Relations to first order logic

Logics for Data and Knowledge Representation

Apriori algorithm. Seminar of Popular Algorithms in Data Mining and Machine Learning, TKK. Presentation Lauri Lahti

Assignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran

Fuzzy Description Logics

Principles of Knowledge Representation and Reasoning

Description Logics. an introduction into its basic ideas

Decidability of SHI with transitive closure of roles

Reasoning with Inconsistent and Uncertain Ontologies

AN EXTENSION OF THE PROBABILITY LOGIC LP P 2. Tatjana Stojanović 1, Ana Kaplarević-Mališić 1 and Zoran Ognjanović 2

Detecting Anomalous and Exceptional Behaviour on Credit Data by means of Association Rules. M. Delgado, M.D. Ruiz, M.J. Martin-Bautista, D.

A Crisp Representation for Fuzzy SHOIN with Fuzzy Nominals and General Concept Inclusions

Mathematical Logic. Reasoning in First Order Logic. Chiara Ghidini. FBK-IRST, Trento, Italy

Introduction to Kleene Algebras

Lightweight Description Logics: DL-Lite A and EL ++

CS 6110 Lecture 21 The Fixed-Point Theorem 8 March 2013 Lecturer: Andrew Myers. 1 Complete partial orders (CPOs) 2 Least fixed points of functions

A Zadeh-Norm Fuzzy Description Logic for Handling Uncertainty: Reasoning Algorithms and the Reasoning System

Mathematical Logics Description Logic: Introduction

Bayesian Description Logics

Decidable Description Logics of Context with Rigid Roles

Data Mining. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of Computer Science

Mining Positive and Negative Fuzzy Association Rules

All psychiatrists are doctors All doctors are college graduates All psychiatrists are college graduates

An Introduction to Description Logics: Techniques, Properties, and Applications. NASSLLI, Day 2, Part 2. Reasoning via Tableau Algorithms.

Complexity of Reasoning in Entity Relationship Models

OPPA European Social Fund Prague & EU: We invest in your future.

Modularity in DL-Lite

On Terminological Default Reasoning about Spatial Information: Extended Abstract

Reductionist View: A Priori Algorithm and Vector-Space Text Retrieval. Sargur Srihari University at Buffalo The State University of New York

Applied Logic. Lecture 1 - Propositional logic. Marcin Szczuka. Institute of Informatics, The University of Warsaw

A Survey of Temporal Knowledge Representations

Description Logics. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca

Predicate Logic: Introduction and Translations

Phase 1. Phase 2. Phase 3. History. implementation of systems based on incomplete structural subsumption algorithms

Propositional Logic Language

A Logical Formulation of the Granular Data Model

A Framework for Representing Ontology Mappings under Probabilities and Inconsistency

Algebraic Trace Theory

An Introduction to Description Logics

Reusing and Modifying Rulebases by Predicate Substitution

Database Theory VU , SS Complexity of Query Evaluation. Reinhard Pichler

Augmenting Subsumption Propagation in Distributed Description Logics

REDUNDANCY, DEDUCTION SCHEMES, AND MINIMUM-SIZE BASES FOR ASSOCIATION RULES

Classification Based on Logical Concept Analysis

Chapter 2 Background. 2.1 A Basic Description Logic

Algebraic Trace Theory

Towards a formal language for systemic requirements

Improved Algorithms for Module Extraction and Atomic Decomposition

Statistical Privacy For Privacy Preserving Information Sharing

An Introduction to Description Logic III

Adding Context to Tableaux for DLs

Parts 3-6 are EXAMPLES for cse634

Bridging the Gap Between OWL and Relational Databases

LTCS Report. Blocking and Pinpointing in Forest Tableaux. LTCS-Report 08-02

4 Quantifiers and Quantified Arguments 4.1 Quantifiers

CHAPTER 1. Preliminaries. 1 Set Theory

FUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH

The DL-Lite Family of Languages A FO Perspective

Natural Deduction. Formal Methods in Verification of Computer Systems Jeremy Johnson

On Axiomatic Rejection for the Description Logic ALC

Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results

Meelis Kull Autumn Meelis Kull - Autumn MTAT Data Mining - Lecture 05

UBGI and Address Standards

Developing Modal Tableaux and Resolution Methods via First-Order Resolution

Redundancy, Deduction Schemes, and Minimum-Size Bases for Association Rules

Just: a Tool for Computing Justifications w.r.t. ELH Ontologies

A Refined Tableau Calculus with Controlled Blocking for the Description Logic SHOI

CSE-4412(M) Midterm. There are five major questions, each worth 10 points, for a total of 50 points. Points for each sub-question are as indicated.

Quasi-Classical Semantics for Expressive Description Logics

Completing Description Logic Knowledge Bases using Formal Concept Analysis

Role-depth Bounded Least Common Subsumers by Completion for EL- and Prob-EL-TBoxes

Introduzione alle logiche descrittive

A Formal Investigation of Mapping Language for Terminological Knowledge

1 INFO 2950, 2 4 Feb 10

DESCRIPTION LOGICS. Paula Severi. October 12, University of Leicester

Interpreting Low and High Order Rules: A Granular Computing Approach

Mining chains of relations

Entity-Relationship Diagrams and FOL

Database Design and Normalization

Loop Formulas for Description Logic Programs

Automated Synthesis of Tableau Calculi

First-order logic Syntax and semantics

Fuzzy Answer Set semantics for Residuated Logic programs

Extended Decision Procedure for a Fragment of HL with Binders

The Bayesian Ontology Language BEL

Bound and Free Variables. Theorems and Proofs. More valid formulas involving quantifiers:

A Tableau Calculus for Minimal Modal Model Generation

Logic and Propositional Calculus

Privacy-preserving Data Mining

Modular Reuse of Ontologies: Theory and Practice

Query Answering in DL-Lite with Datatypes: A Non-Uniform Approach

A tableaux calculus for ALC + T min R

CEX and MEX: Logical Diff and Semantic Module Extraction in a Fragment of OWL

Using C-OWL for the Alignment and Merging of Medical Ontologies

Probabilistic Abductive Logic Programming: a Joint Approach of Logic and Probability

Machine Learning: Pattern Mining

Bisimulation for conditional modalities

Review CHAPTER. 2.1 Definitions in Chapter Sample Exam Questions. 2.1 Set; Element; Member; Universal Set Partition. 2.

Nonmonotonic Reasoning in Description Logic by Tableaux Algorithm with Blocking

Description Logics. Logics and Ontologies. franconi. Enrico Franconi

Transcription:

Data-Driven Logical Reasoning Claudia d Amato Volha Bryl, Luciano Serafini November 11, 2012 8 th International Workshop on Uncertainty Reasoning for the Semantic Web 11 th ISWC, Boston (MA), USA.

Heterogeneous Data Heterogeneous resources of knowledge about the same domain: Non or simply structured data (e.g., sensor data, signals, DB tracks, texts, bag of words, etc) containing (alpha)numeric feature based data Structured knowledge (e.g., ontologies = T-box + A-box) describing the entity types, the relations and the objects of a particular domain.

Heterogeneous Data Heterogeneous resources of knowledge about the same domain: Non or simply structured data (e.g., sensor data, signals, DB tracks, texts, bag of words, etc) containing (alpha)numeric feature based data For instance a database of a company containing anagraphic data, salaries, evaluations, performances, and other relevant information about the employees. Structured knowledge (e.g., ontologies = T-box + A-box) describing the entity types, the relations and the objects of a particular domain. For instance the ontology for the organizational structure of a company describing, the roles, the activities, the responsibilities, etc. and the instantiation to the set of the company employees.

Heterogeneous Inferences Different type of data support different type of inference Non or simply structured data enables the discovering of new knowledge as regularity patterns of data via inductive reasoning. Structured knowledge allows to discover new knowledge via inferencing of classes or properties of objects based on logical deductive reasoning.

Heterogeneous Inferences Different type of data support different type of inference Non or simply structured data enables the discovering of new knowledge as regularity patterns of data via inductive reasoning. Usually older employees earn more than their younger colleagues Structured knowledge allows to discover new knowledge via inferencing of classes or properties of objects based on logical deductive reasoning. If a person is a project leader than he or she coordinates the work of all the people allocated to the project

Combining heterogeneous Inferences Non or simply structured data + Structured knowledge will enable the combination of knowledge induced from data and knowledge encoded in an ontology in a new form of mixed reasoning, that we call data drive inference. Usually older employees earn more than their younger colleagues + If a person is a project leader than he or she coordinates the work of all the people allocated to the project

Combining heterogeneous Inferences Non or simply structured data + Structured knowledge will enable the combination of knowledge induced from data and knowledge encoded in an ontology in a new form of mixed reasoning, that we call data drive inference. Usually older employees earn more than their younger colleagues + If a person is a project leader than he or she coordinates the work of all the people allocated to the project = Since project coordinators receive always the highest salary within a project team, the oldest person of project team, is most probably the project coordinator

objectives Abstract objectives 1. To define a reference framework capable to represent quantitative data and logical knowledge in an integrated way 2. extend machine learning algorithms to support the induction of new knowledge from quantitative data integrated with logical knowledge 3. extend the logical reasoning algorithms to support logical inference in presence of knowledge induced from logical data integrated with logical knowledge

objectives Concrete contributions (EKAW 2012) 1. define the notion of ontology integrated with a dataset, called grounded ontologies 2. introduce semantically enriched association rules to represent knowledge induced from grounded ontologies 3. define a simple algorithm for learning semantically enriched association rules from grounded ontologies.

objectives Concrete contributions (EKAW 2012) 1. define the notion of ontology integrated with a dataset, called grounded ontologies 2. introduce semantically enriched association rules to represent knowledge induced from grounded ontologies 3. define a simple algorithm for learning semantically enriched association rules from grounded ontologies. Concrete contributions (URSW 2012) 4. define a new reasoning task, called the most plausible model that computes the most common model of an ontology w.r.t., a set of semantically enriched association rules 5. propose a first approximated tableaux algorithm to compute the most plausible model

Grounded Ontology Dataset D is a non empty set of objects f 1,..., f n are n feature functions defined on every element of D, with f i : D D i.

Grounded Ontology Dataset D is a non empty set of objects f 1,..., f n are n feature functions defined on every element of D, with f i : D D i. Knowledge base K on an alphabet Σ, composed of three disjoint ets of symbols, Σ C, Σ R and Σ I, is a set K of DL inclusion axioms and DL assertions.

Grounded Ontology Dataset D is a non empty set of objects f 1,..., f n are n feature functions defined on every element of D, with f i : D D i. A grounding is a total function g : D Σ I Knowledge base K on an alphabet Σ, composed of three disjoint ets of symbols, Σ C, Σ R and Σ I, is a set K of DL inclusion axioms and DL assertions.

Grounded Ontology Dataset D is a non empty set of objects f 1,..., f n are n feature functions defined on every element of D, with f i : D D i. A grounding is a total function g : D Σ I for every d D there is a constant a Σ I with g(d) = a. Knowledge base K on an alphabet Σ, composed of three disjoint ets of symbols, Σ C, Σ R and Σ I, is a set K of DL inclusion axioms and DL assertions.

Grounded Ontology Dataset D is a non empty set of objects f 1,..., f n are n feature functions defined on every element of D, with f i : D D i. Knowledge base K on an alphabet Σ, composed of three disjoint ets of symbols, Σ C, Σ R and Σ I, is a set K of DL inclusion axioms and DL assertions. A grounding is a total function g : D Σ I for every d D there is a constant a Σ I with g(d) = a. intuitively g(d) = a represents the fact that data d is an observation about the object a of the knowledge base;

Grounded Ontology Dataset D is a non empty set of objects f 1,..., f n are n feature functions defined on every element of D, with f i : D D i. Knowledge base K on an alphabet Σ, composed of three disjoint ets of symbols, Σ C, Σ R and Σ I, is a set K of DL inclusion axioms and DL assertions. A grounding is a total function g : D Σ I for every d D there is a constant a Σ I with g(d) = a. intuitively g(d) = a represents the fact that data d is an observation about the object a of the knowledge base; there can be more than one observation for a in D. This implies that g(d) = a and g(d ) = a for d d

Grounded Ontology - Example ID E ID salary year 001 E01 30,000 2010 002 E01 35,000 2011 003 E01 40,000 2012 004 E02 28,000 2011 005 E02 33,000 2012 006 E03 24,000 2012 007 E04 25,000 2011 008 E04 25,000 2012 009 E05 40,000 2012

Grounded Ontology - Example ID E ID salary year 001 E01 30,000 2010 002 E01 35,000 2011 003 E01 40,000 2012 004 E02 28,000 2011 005 E02 33,000 2012 006 E03 24,000 2012 007 E04 25,000 2011 008 E04 25,000 2012 009 E05 40,000 2012 Person Prj Person leads.prj worksfor.prj Prj (= 1)leads Prj worksfor dom(worksfor) Person dom(leads) Person leads(alice, P) worksfor(bob, P) worksfor(chris, P) leads(bob, Q) worksfor(chris, Q) worksfor(dan, Q) leads(alice, R) worksfor(dan, R) alldifferent(alice, Bob, Chris, Dan) (Eva), leads (Chris) leads (Dan)

Grounded Ontology - Example ID E ID salary year 001 E01 30,000 2010 002 E01 35,000 2011 003 E01 40,000 2012 004 E02 28,000 2011 005 E02 33,000 2012 006 E03 24,000 2012 007 E04 25,000 2011 008 E04 25,000 2012 009 E05 40,000 2012 Person Prj Person leads.prj worksfor.prj Prj (= 1)leads Prj worksfor dom(worksfor) Person dom(leads) Person leads(alice, P) worksfor(bob, P) worksfor(chris, P) leads(bob, Q) worksfor(chris, Q) worksfor(dan, Q) leads(alice, R) worksfor(dan, R) alldifferent(alice, Bob, Chris, Dan) (Eva), leads (Chris) leads (Dan) g 001 Alice 002 Alice 003 Alice 004 Bob 005 Bob 006 Chris 007 Dan 008 Dan 009 Eva

Grounding allow to join data with knowledge Let (K, D, g) be a grounded ontology, the semantic enrichment of D w.r.t., K, denoted by D +K is dataset containing the same data as D with the following additional semantic attributes:

Grounding allow to join data with knowledge Let (K, D, g) be a grounded ontology, the semantic enrichment of D w.r.t., K, denoted by D +K is dataset containing the same data as D with the following additional semantic attributes: Conceptual attributes for every primitive concept C Σ c the attribute f C is defined as follows: 1 if K = C(g(d)) f C (d) = 0 if K = C(g(d)) unknown otherwise

Grounding allow to join data with knowledge Let (K, D, g) be a grounded ontology, the semantic enrichment of D w.r.t., K, denoted by D +K is dataset containing the same data as D with the following additional semantic attributes: Conceptual attributes for every primitive concept C Σ c the attribute f C is defined as follows: 1 if K = C(g(d)) f C (d) = 0 if K = C(g(d)) unknown otherwise Relational attributes for every relation R Σ r the attribute f R is defined as follows: 1 if K = R(g(d)) f R (d) = 0 if K = R(g(d)) unknown otherwise

Join data with knowledge - example ID E ID salary year Person Prj leads worksfor 001 E01 30,000 2010 1 0 1-002 E01 35,000 2011 1 0 1-003 E01 40,000 2012 1 0 1-004 E02 28,000 2011 1 0 1 1 005 E02 32,000 2012 1 0 1 1 006 E03 24,000 2012 1 0 0 1 007 E04 25,000 2011 1 0 0 1 008 E04 25,000 2012 1 0 0 1 009 E05 40,000 2012 - - - -

Association Rules Definition Given a dataset D made by a set of attributes {A 1,..., A n } an itemset of D is an expression f i1 = v 1 f ik = v k (1) the support of an itemset is the number of tuples (rows) in D that match the itemset. Association rule is an expression f i1 = v 1 f ik = v k f ik +1 = v k+1 f in = v n (2) The confidence of (2) is the fraction of cases in D that match the conclusion amongs the one that matches the premises. conf (θ ϕ) = support(θ ϕ) support(θ)

Semantically Enriched Association Rules (SEAR) Semantically enriched association rules Let (K, D, g) be a grounded ontology A semantically enriched association rule is an association rule defined on the semantically enriched dataset D +K.

Learning association rules Association rules are learned by finding the frequent itemsets w.r.t. a given support threshold, extracting the rules from the frequent itemsets satisfying a given confidence threshold. The first subproblem is the most challenging/expensive We use the standard Apriori algorithm key assumption: a set of variables is frequent only if all its subsets are frequent, itemsets are built iteratively, incrementing the length at each step.

Learing SEAR: example ID E ID salary year Person Prj leads worksfor 001 E01 30 40K 2010 1 0 1-002 E01 30 40K 2011 1 0 1-003 E01 30 40K 2012 1 0 1-004 E02 20 30K 2011 1 0 1 1 005 E02 30 40K 2012 1 0 1 1 006 E03 20 30K 2012 1 0 0 1 007 E04 20 30K 2011 1 0 0 1 008 E04 20 30K 2012 1 0 0 1 009 E05 30 40K 2012 - - 0 - Examples of SEARs f leads = 1, f year = 2011 f Salary = 30 40K conf = 4/4 = 1.00 f Person = 1 conf = 8/9 = 0.88 f Project = 0 conf = 8/9 = 0.88 f Salary = 30 40K f leads = 1 conf = 4/5 = 0.80 f leads = 1 f Salary = 30 40K conf = 4/5 = 0.80 f Salary = 20 30K f leads = 0 conf = 3/4 = 0.75 f year = 2012, f Salary = 30 40K f leads = 1 conf = 2/3 = 0.66

objectives Concrete contributions 1. defined the notion of ontology grounded on a set of data, also called grounded ontologies 2. introduced semantically enriched association rules to represent knowledge induced from grounded ontologies 3. defined a simple algorithm for learning semantically enriched association rules from grounded ontologies.

objectives Concrete contributions 1. defined the notion of ontology grounded on a set of data, also called grounded ontologies 2. introduced semantically enriched association rules to represent knowledge induced from grounded ontologies 3. defined a simple algorithm for learning semantically enriched association rules from grounded ontologies. Concrete contributions 4. defined a new reasoning task, called the most plausible model that computes the most common model of an ontology w.r.t., a set of semantically enriched association rules 5. proposed a first approximated tableaux algorithm to compute the most plausible model

The reasoning task Definition (Inference Problem) Given: D, K, the set R of SEARs, an data item d 0, a concept C 0, and the grounding g(d 0 ) = x 0, Determine: the model I r for K {C 0 (x 0 )} representing the most plausible model for K {C 0 (x 0 )} w.r.t., g and R.

Plausibility ordering Definition (Plausibility ordering (first attempt)) Let (K, {d}, g) be a grounded ontology: I, d, g = f i = a if f i (d) = a I, d, g = f C = 1 if g(d) I C I I, d, g = f C = 0 if g(d) I C I I, d, g = f R = 1 if g(d) I ( R. ) I I, d, g = f R = 0 if g(d) I ( R. ) I I, d, g = φ 1 φ n if I, d, g = φ i for 1 i n I, d, g = φ ψ if I, d, g = φ or I, d, g = ψ

Plausibility ordering Definition (Plausibility ordering (first attempt)) Let (K, {d}, g) be a grounded ontology: I, d, g = f i = a if f i (d) = a I, d, g = f C = 1 if g(d) I C I I, d, g = f C = 0 if g(d) I C I I, d, g = f R = 1 if g(d) I ( R. ) I I, d, g = f R = 0 if g(d) I ( R. ) I I, d, g = φ 1 φ n if I, d, g = φ i for 1 i n I, d, g = φ ψ if I, d, g = φ or I, d, g = ψ p(i, d, g, α) = { 0 If I, d, g = α 2 conf (α) 1 Otherwise

Plausibility ordering Definition (Plausibility ordering (first attempt)) Let (K, {d}, g) be a grounded ontology: I, d, g = f i = a if f i (d) = a I, d, g = f C = 1 if g(d) I C I I, d, g = f C = 0 if g(d) I C I I, d, g = f R = 1 if g(d) I ( R. ) I I, d, g = f R = 0 if g(d) I ( R. ) I I, d, g = φ 1 φ n if I, d, g = φ i for 1 i n I, d, g = φ ψ if I, d, g = φ or I, d, g = ψ { 0 If I, d, g = α p(i, d, g, α) = 2 conf (α) 1 Otherwise I J iff p(i, d, g, α) p(j, d, g, α) α AR α AR

ALC Tableaux Tableaux Let C 0 be an ALC-concept in NNF. In order to test satisfiability of C 0, the ALC tableaux algorithm starts with A 0 := {C 0 (x 0 )}, and applies the following rules: Rule Condition Effect C D(x) A A := A {C(x), D(x)} C D(x) A A := A {C(x)} or A {D(x)} R.C(x) A A := A {R(x, y), C(y)} R.C(x), R(x, y) A A := A {C(y)}.........

ALC Tableaux Tableaux Let C 0 be an ALC-concept in NNF. In order to test satisfiability of C 0, the ALC tableaux algorithm starts with A 0 := {C 0 (x 0 )}, and applies the following rules: Rule Condition Effect C D(x) A A := A {C(x), D(x)} C D(x) A A := A {C(x)} or A {D(x)} R.C(x) A A := A {R(x, y), C(y)} R.C(x), R(x, y) A A := A {C(y)}......... Indeterministic Choices In applying rule we have to do a choice, either we expand A with C(x) or with D(x). Depending on the choice we can generate a more or less plausible model.

-rule - a simple example Example (Simple) C(x 0) C A B f Salary (d) = 27 f Salary = 10 20 A conf = 0.77 f Salary = 20 30 B conf = 0.66 g(d) = x 0 C(x 0) A B(x 0) A(x 0) C(x 0) A B(x 0) C(x 0) A B(x 0) B(x 0)

-rule - a simple example Example (Simple) C(x 0) C A B f Salary (d) = 27 f Salary = 10 20 A conf = 0.77 f Salary = 20 30 B conf = 0.66 g(d) = x 0 There are three possible models: C(x 0) A B(x 0) A(x 0) C(x 0) A B(x 0) C(x 0) A B(x 0) B(x 0) I A I B I C I α AR p(i, d, g, α) I 1 {x 0} {x 0} {x 0} 0.32 I 2 {x 0} {x 0} {x 0} 0.00 I 3 {x 0} {x 0} {x 0} {x 0} 0.00 The plausibility ordering is I 2 I 1, I 3 I 1

A complete simple example Example (The most plausible model for Eva) ID E ID salary year Person Prj leads worksfor 009 E05 30 40K 2012 - - - - (Eva)

A complete simple example Example (The most plausible model for Eva) ID E ID salary year Person Prj leads worksfor 009 E05 30 40K 2012 - - - - Person Prj (Eva) Person Prj(Eva)

A complete simple example Example (The most plausible model for Eva) ID E ID salary year Person Prj leads worksfor 009 E05 30 40K 2012 - - - - Person Prj f Person = 1[0.88] (Eva) Person Prj(Eva) Person(Eva)

A complete simple example Example (The most plausible model for Eva) ID E ID salary year Person Prj leads worksfor 009 E05 30 40K 2012 - - - - Person Prj f Person = 1[0.88] Person worksfor.prj leads.prj (Eva) Person Prj(Eva) Person(Eva) worksfor.prj leads.prj

A complete simple example Example (The most plausible model for Eva) ID E ID salary year Person Prj leads worksfor 009 E05 30 40K 2012 - - - - Person Prj f Person = 1[0.88] Person worksfor.prj leads.prj f Salary (Eva) = 30 40 f Salary = 30 40 f leads = 1 (Eva) Person Prj(Eva) Person(Eva) worksfor.prj leads.prj leads.prj(eva)

A complete simple example Example (The most plausible model for Eva) ID E ID salary year Person Prj leads worksfor 009 E05 30 40K 2012 - - - - Person Prj f Person = 1[0.88] Person worksfor.prj leads.prj f Salary (Eva) = 30 40 f Salary = 30 40 f leads = 1 (Eva) Person Prj(Eva) Person(Eva) worksfor.prj leads.prj leads.prj(eva) leads(eva, p), Prj(p)

Conclusions & Future Work Conclusions: Proposed a framework for learning association rules from hybrid sources of information Preliminary ideas on exploitation of the learnt association rules during the deductive reasoning process Future works: Experimental evaluation of the proposed preliminary methodology Extension and consolitation of the theoretical framework to include also relations between objects.