HANDLING UNCERTAINTY IN INFORMATION EXTRACTION

Similar documents
Epistemic and Statistical Probabilistic Ontologies

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Informatics 2D Reasoning and Agents Semester 2,

Mobility Analytics through Social and Personal Data. Pierre Senellart

2. Probability. Chris Piech and Mehran Sahami. Oct 2017

Properties of Sequences

Reductionist View: A Priori Algorithm and Vector-Space Text Retrieval. Sargur Srihari University at Buffalo The State University of New York

Pengju XJTU 2016

Uncertainty and Bayesian Networks

The GapVis project and automated textual geoparsing

Expert Systems! Knowledge Based Systems!

Lecture 1: Review of Probability

Beyond points: How to turn SMW into a complete Geographic Information System

AHMED, ALAA HASSAN, M.S. Fusing Uncertain Data with Probabilities. (2016) Directed by Dr. Fereidoon Sadri. 86pp.

CS 4649/7649 Robot Intelligence: Planning

Uncertain Knowledge and Bayes Rule. George Konidaris

DISTRIBUTIONAL SEMANTICS

Recovering randomness from an asymptotic Hamming distance

SYDE 372 Introduction to Pattern Recognition. Probability Measures for Classification: Part I

Fundamentals of Probability CE 311S

The Veracity of Big Data

Modeling and reasoning with uncertainty

Lessons From the Trenches: using Mobile Phone Data for Official Statistics

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

INFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from

Introduction to Statistical Hypothesis Testing

COMP9414/ 9814/ 3411: Artificial Intelligence. 14. Uncertainty. Russell & Norvig, Chapter 13. UNSW c AIMA, 2004, Alan Blair, 2012

Lecture Overview [ ] Introduction to Artificial Intelligence COMP 3501 / COMP Lecture 6. Motivation. Logical Agents

Bayesian Networks BY: MOHAMAD ALSABBAGH

UnSAID: Uncertainty and Structure in the Access to Intensional Data

Information Retrieval and Web Search Engines

Artificial Intelligence

Learning Terminological Naïve Bayesian Classifiers Under Different Assumptions on Missing Knowledge

An AI-ish view of Probability, Conditional Probability & Bayes Theorem

10/18/2017. An AI-ish view of Probability, Conditional Probability & Bayes Theorem. Making decisions under uncertainty.

Discrete Probability and State Estimation

Probabilistic Reasoning. (Mostly using Bayesian Networks)

Manager: Scribe: Reporter: Per. Significant Zeros. Which zeros are significant in a measurement, and which are simply important?

Uncertainty. Logic and Uncertainty. Russell & Norvig. Readings: Chapter 13. One problem with logical-agent approaches: C:145 Artificial

Algorithms for Nearest Neighbors

Boolean and Vector Space Retrieval Models

Expert Systems! Knowledge Based Systems!

A Tutorial on Learning with Bayesian Networks

POGIL: Significant Zeros Why?

Toponym Disambiguation using Ontology-based Semantic Similarity

Ranked Retrieval (2)

Uncertain Knowledge and Reasoning

Bayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins.

Reasoning under Uncertainty: Intro to Probability

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016

Global Machine Learning for Spatial Ontology Population

Outline. Uncertainty. Methods for handling uncertainty. Uncertainty. Making decisions under uncertainty. Probability. Uncertainty

DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS

ToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database

The PITA System for Logical-Probabilistic Inference

Uncertainty. Outline. Probability Syntax and Semantics Inference Independence and Bayes Rule. AIMA2e Chapter 13

Knowledge Expansion over Probabilistic Knowledge Bases. Yang Chen, Daisy Zhe Wang

Prof. Dr. Ralf Möller Dr. Özgür L. Özçep Universität zu Lübeck Institut für Informationssysteme. Tanya Braun (Exercises)

Uncertainty. Chapter 13. Chapter 13 1

Information Extraction from Text

DATA MINING AND MACHINE LEARNING

Probabilistic and Bayesian Analytics

Replicated Softmax: an Undirected Topic Model. Stephen Turner

Probabilistic Reasoning

2011 Pearson Education, Inc

GIS Visualization Support to the C4.5 Classification Algorithm of KDD

COMPUTER SCIENCE TRIPOS

CLRG Biocreative V

Basic Probabilistic Reasoning SEG

INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR. NPTEL National Programme on Technology Enhanced Learning. Probability Methods in Civil Engineering

Discrete Probability and State Estimation

Elements of probability theory

Hidden Markov Models. x 1 x 2 x 3 x N

Quantifying Uncertainty

Probabilistic Reasoning

A General Framework for Conflation

Online Videos FERPA. Sign waiver or sit on the sides or in the back. Off camera question time before and after lecture. Questions?

Probabilistic Databases

Machine Learning. CS Spring 2015 a Bayesian Learning (I) Uncertainty

Coding, Information Theory (and Advanced Modulation) Prof. Jay Weitzen Ball 411

Fuzzy Systems. Introduction

A Probability Primer. A random walk down a probabilistic path leading to some stochastic thoughts on chance events and uncertain outcomes.

Uncertainty. Chapter 13

Lecture 3: Probabilistic Retrieval Models

STA Module 4 Probability Concepts. Rev.F08 1

Conditional Random Fields

Introduction to Artificial Intelligence (AI)

Origins of Probability Theory

REASONING UNDER UNCERTAINTY: CERTAINTY THEORY

Probabilistic Partial Evaluation: Exploiting rule structure in probabilistic inference

DIGITAL CIRCUIT LOGIC BOOLEAN ALGEBRA (CONT.)

What is the relationship between number of constraints and number of possible solutions?

Bayesian Description Logics

Testing a Hash Function using Probability

TEACHING INDEPENDENCE AND EXCHANGEABILITY

CS 188: Artificial Intelligence Fall 2011

arxiv: v3 [cs.lg] 17 Nov 2017

Privacy in Statistical Databases

Water spectroscopy with a Distributed Information System

Fuzzy Systems. Introduction

Transcription:

HANDLING UNCERTAINTY IN INFORMATION EXTRACTION Maurice van Keulen and Mena Badieh Habib URSW 23 Oct 2011

INFORMATION EXTRACTION Unstructured Text Information extraction Web of Data Inherently imperfect process Word Paris First name? City? We humans happily deal with doubt and misinterpretation City => over 60 every cities day; Paris Why shouldn t computers? Toponyms: 46% >2 refs source: GeoNames Goal: Technology to support the development of domain specific information extractors Uncertainty Reasoning for the Semantic Web, Bonn, Germany 23 Oct 2011 2

SHERLOCK HOLMES-STYLE INFORMATION EXTRACTION when you have eliminated the impossible, whatever remains, however improbable, must be the truth Information extraction is about gathering enough evidence to decide upon a certain combination of annotations among many possible ones Evidence comes from ML + developer (generic) + end user (instances) Annotations are uncertain Maintain alternatives + probabilities throughout process (incl. result) Unconventional starting point Not no annotations, but no knowledge, hence anything is possible Developer interactively defines information extractor until good enough Iterations: Add knowledge, apply to sample texts, evaluate result Scalability for storage, querying, manipulation of annotations From my own field (databases): Probabilistic databases? Uncertainty Reasoning for the Semantic Web, Bonn, Germany 23 Oct 2011 3

SHERLOCK HOLMES-STYLE INFORMATION EXTRACTION EXAMPLE: NAMED ENTITY RECOGNITION (NER) when you have eliminated the impossible, whatever remains, however improbable, must be the truth Paris Hilton stayed in the Paris Hilton a1 a2 a3 a4 a5 a6 a7 a8 a9 a10 a11 a12 a13 a14 a15 a16 a17 a18 a19 a20 a21 a22 a23 a24 a25 a26 a27 a28 Person Toponym City inter-actively defined isa Uncertainty Reasoning for the Semantic Web, Bonn, Germany 23 Oct 2011 4

SHERLOCK HOLMES-STYLE INFORMATION EXTRACTION EXAMPLE: NAMED ENTITY RECOGNITION (NER) Paris Hilton stayed in the Paris Hilton a1 a2 a3 a4 a5 a6 a7 a8 a9 a10 a11 a12 a13 a14 a15 a16 a17 a18 a19 a20 a21 a22 a23 a24 a25 a26 a27 a28 A =O(klt) linear?!? k: length of string Although l: maximum conceptual/theoretical, length phrases considered t: number it doesn t of entity seem types to be a severe challenge for a probabilistic database Here: 28 * 3 = 84 possible annotations The problem is not in the amount of URSW call for alternative papers about 1300 annotations! words say 20 types say max length 6 (I saw one with 5) = roughly 1300 * 20 * 6 = roughly 156,000 possible annotations Uncertainty Reasoning for the Semantic Web, Bonn, Germany 23 Oct 2011 5

ADDING KNOWLEDGE = CONDITIONING Paris Hilton stayed in the Paris Hilton a and b independent P(a)=0.6 P(b)=0.8 b a 0.32 0.08 a b 0.12 a b 0.48 Person --- --- City x 1 2 ( Paris is a City) [a] x 8 1 ( Paris Hilton is a Person) [b] become mutually exclusive a and b mutually exclusive (a b is not possible) 0.15 b a 0.62 a b 0.23 Uncertainty Reasoning for the Semantic Web, Bonn, Germany 23 Oct 2011 6

ADDING KNOWLEDGE CREATES DEPENDENCIES NUMBER OF DEPS MAGNITUDES IN SIZE SMALLER THAN POSSIBLE COMBINATIONS Paris Hilton stayed Person City Person City neq 8 +8 +15 Uncertainty Reasoning for the Semantic Web, Bonn, Germany 23 Oct 2011 7

PROBLEM AND SOLUTION DIRECTIONS I m looking for a scalable approach to reason and redistribute probability mass considering all these dependencies to find the remaining possible interpretations and their probabilities Feasibility approach hinges on efficient representation and conditioning of probabilistic dependencies Solution directions (in my own field): Koch etal VLDB 2008 (Conditioning in MayBMS) Getoor etal VLDB 2008 (Shared correlations) This is not about only learning a joint probability distribution. Here I d like to estimate a joint probability distribution based on initial independent observations and then batch-by-batch add constraints/dependencies and recalculate Techniques out there that fit this problem? Questions / Suggestions? Uncertainty Reasoning for the Semantic Web, Bonn, Germany 23 Oct 2011 8