B. Piwowarski P. Gallinari G. Dupret

Similar documents
Applying the Divergence From Randomness Approach for Content-Only Search in XML Documents

Applying the Divergence From Randomness Approach for Content-Only Search in XML Documents

A Multi-Layered Bayesian Network Model for Structured Document Retrieval

Language Models and Smoothing Methods for Collections with Large Variation in Document Length. 2 Models

Language Models, Smoothing, and IDF Weighting

Luis M. de Campos, Juan M. Fernández-Luna, and Juan F. Huete

Advanced Click Models & their Applications to IR

Information Retrieval

Information Retrieval Basic IR models. Luca Bondi

CS 237: Probability in Computing

1 Information retrieval fundamentals

Scoring (Vector Space Model) CE-324: Modern Information Retrieval Sharif University of Technology

Lecture 12: Link Analysis for Web Retrieval

Scoring (Vector Space Model) CE-324: Modern Information Retrieval Sharif University of Technology

Information Retrieval

The Maximum Entropy Method for Analyzing Retrieval Measures. Javed Aslam Emine Yilmaz Vilgil Pavlu Northeastern University

Outline for today. Information Retrieval. Cosine similarity between query and document. tf-idf weighting

Ranking-II. Temporal Representation and Retrieval Models. Temporal Information Retrieval

Scoring (Vector Space Model) CE-324: Modern Information Retrieval Sharif University of Technology

CS47300: Web Information Search and Management

Information Retrieval and Web Search

IR Models: The Probabilistic Model. Lecture 8

Markov Precision: Modelling User Behaviour over Rank and Time

INFO 4300 / CS4300 Information Retrieval. IR 9: Linear Algebra Review

Information Retrieval

Introduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model

Learning Hierarchical Bayesian Networks for human skill modelling

Boolean and Vector Space Retrieval Models

Extensions of Bayesian Networks. Outline. Bayesian Network. Reasoning under Uncertainty. Features of Bayesian Networks.

Lecture 9: Probabilistic IR The Binary Independence Model and Okapi BM25

PV211: Introduction to Information Retrieval

Announcements. Assignment 5 due today Assignment 6 (smaller than others) available sometime today Midterm: 27 February.

Counterfactual Model for Learning Systems

Part A. P (w 1 )P (w 2 w 1 )P (w 3 w 1 w 2 ) P (w M w 1 w 2 w M 1 ) P (w 1 )P (w 2 w 1 )P (w 3 w 2 ) P (w M w M 1 )

Fall CS646: Information Retrieval. Lecture 6 Boolean Search and Vector Space Model. Jiepu Jiang University of Massachusetts Amherst 2016/09/26

RETRIEVAL MODELS. Dr. Gjergji Kasneci Introduction to Information Retrieval WS

2 : Directed GMs: Bayesian Networks

Probabilistic Reasoning with Graphical Security Models

Causal Discovery. Beware of the DAG! OK??? Seeing and Doing SEEING. Properties of CI. Association. Conditional Independence

quan col... Probabilistic Forecasts of Bike-Sharing Systems for Journey Planning

Ranked Retrieval (2)

Chap 2: Classical models for information retrieval

Evaluation Metrics. Jaime Arguello INLS 509: Information Retrieval March 25, Monday, March 25, 13

Probabilistic Partial Evaluation: Exploiting rule structure in probabilistic inference

Intelligent Data Analysis. PageRank. School of Computer Science University of Birmingham

Information Retrieval

Information Retrieval

A set theoretic view of the ISA hierarchy

Evaluation Strategies

Term Weighting and the Vector Space Model. borrowing from: Pandu Nayak and Prabhakar Raghavan

CS6840: Advanced Complexity Theory Mar 29, Lecturer: Jayalal Sarma M.N. Scribe: Dinesh K.

Probabilistic Near-Duplicate. Detection Using Simhash

PageRank algorithm Hubs and Authorities. Data mining. Web Data Mining PageRank, Hubs and Authorities. University of Szeged.

OPTIMIZING SEARCH ENGINES USING CLICKTHROUGH DATA. Paper By: Thorsten Joachims (Cornell University) Presented By: Roy Levin (Technion)

Can Vector Space Bases Model Context?

Part I: Web Structure Mining Chapter 1: Information Retrieval and Web Search

Business Process Technology Master Seminar

Learning causal network structure from multiple (in)dependence models

Computational Complexity of Bayesian Networks

Probabilistic Aspects of Voting

What is (certain) Spatio-Temporal Data?

A Stochastic Framework for Quantitative Analysis of Attack-Defense Trees

Motivation. Bayesian Networks in Epistemology and Philosophy of Science Lecture. Overview. Organizational Issues

Outline. CSE 573: Artificial Intelligence Autumn Agent. Partial Observability. Markov Decision Process (MDP) 10/31/2012

Term Weighting and Vector Space Model. Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze

Variable Latent Semantic Indexing

On Ordering Descriptions in a Description Logic

High-Dimensional Indexing by Distributed Aggregation

boolean queries Inverted index query processing Query optimization boolean model January 15, / 35

CS276A Text Information Retrieval, Mining, and Exploitation. Lecture 4 15 Oct 2002

The Static Absorbing Model for the Web a

Fast LSI-based techniques for query expansion in text retrieval systems

3. Basics of Information Retrieval

INFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from

Measuring Retrieval Effectiveness Based on User Preference of Documents

CAT L4: Quantum Non-Locality and Contextuality

H2 Spring E. Governance, in principle, applies to the entire life cycle of an IT system

Machine Learning for Data Science (CS4786) Lecture 19

Collaborative Filtering. Radek Pelánek

Introduction to Probabilistic Reasoning. Image credit: NASA. Assignment

Data Mining and Matrices

Information Retrieval and Topic Models. Mausam (Based on slides of W. Arms, Dan Jurafsky, Thomas Hofmann, Ata Kaban, Chris Manning, Melanie Martin)

CS 570: Machine Learning Seminar. Fall 2016

Online Learning, Mistake Bounds, Perceptron Algorithm

INFO 630 / CS 674 Lecture Notes

Some Formal Analysis of Rocchio s Similarity-Based Relevance Feedback Algorithm

INFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from

Likelihood Weighting and Importance Sampling

BLAST: Target frequencies and information content Dannie Durand

4 : Exact Inference: Variable Elimination

Boolean and Vector Space Retrieval Models CS 290N Some of slides from R. Mooney (UTexas), J. Ghosh (UT ECE), D. Lee (USTHK).

The Axiometrics Project

Information Retrieval and Web Search Engines

PROBABILISTIC LATENT SEMANTIC ANALYSIS

Performance Metrics for Machine Learning. Sargur N. Srihari


15-780: Graduate Artificial Intelligence. Bayesian networks: Construction and inference

Building Bayesian Networks. Lecture3: Building BN p.1

NCDREC: A Decomposability Inspired Framework for Top-N Recommendation

Lecture 3: Probabilistic Retrieval Models

Transcription:

PRUM Precision Recall with User Modelling Application to XML B. Piwowarski P. Gallinari G. Dupret

Outline Why measuring? XML Information Retrieval Current metrics GPR, PRng, XCG PRUM A new user model A new metric Comparison

Why measuring? The evaluation methodology started early (Cleverdon, 1967) Justification of pragmatic/theoretic developments In new IR paradigms (XML, video, etc.), measures have the same role to play but none of the proposed metrics are satisfying.

XML Retrieval (and INEX) more precise access by giving more specific answers (ie. XML nodes) Assessments: a new scale Specificity (S): The extent to which a document component is focused on the information (4 values: 0 to 3) Exhaustivity (E): The extent to which the information contained in a document component satisfies the user's query need (4 values: 0 to 3)

Precision Recall E0S0 E3S2 E3S3 E0S0 Only exact answers are rewarded Too strict! Overlap problem in recall base E2S3 E1S3

Generalised Precision Recall E0S0 1 0.5 E2S3 0.75 E3S2 E3S3 E0S0 0.25 E1S3 q ExSy = Assign a relevance value to near misses (quantisation) Recall base is expanded Overlap is a problem

PRng (N. Gövert) Exhaustivity and specificity based on the notion of an ideal concept space upon which precision and recall are defined. Considers overlap in retrieval results Very sensitive to list order Does not consider overlap in recall-base (not completely true)

PRng (N. Gövert) 0.75 a (30) Recall(a,b,c) = 0.75 + 0 x 1 + 0 x 0.5 = 0.75 0.5 1 b(20) c (10) 0.25 Recall(c,b,a) = 0.5 +.5 x 1 + 1/3 x 0.75 = 1.25 Recall(c,b,a) = 0.5 +.5 x 1 + 1/3 x 0.75 = 1.25

XCG (G. Kazai) Based on cumulated gain measure for IR Accumulates gain obtained by retrieving elements at fixed ranks Based on the construction of an ideal recallbase Consider overlap in both retrieval results and recall-base

XCG (G. Kazai) The most consistent metric Normalisation maximises the gain of near misses Normalisation to ensure that performance is bounded by an ideal system but No precise user model No precision dimension

Towards user models Web and others Quintana Dunlop leads to: Tolerance To Irrelevance (A. de Vries) The first real user model in XML IR Some theoretic problems

PRUM: main ideas Construction of an ideal recall base without overlap (like XCG) Definition of a precise user model (which includes T2I as a special case) Sound formal grounds (based on Raghavan's probabilistic precision-recall)

The PRUM user model Retrieved list a b c d e f g h j b a c k e d g h i f The user wants to see 4 relevant elements

The PRUM user model Consulted elements a Seen elements a b c k e d P=0 f g h i f The user has seen 0 relevant element

The PRUM user model Consulted elements a Seen elements a b b c k leads to an unseen relevant e d P=1/2 j g h i f The user has seen 1 relevant element

The PRUM user model Consulted elements a Seen elements a b c b c k e d j g h i f P=2/3 The user has seen 2 relevant elements

The PRUM user model Consulted elements a Seen elements a b c d b c k e d j g h i f P=2/4 The user has seen 2 relevant elements

The PRUM user model Consulted elements a Seen elements a b c d e e b c k d j g h i f P=3/5 The user has seen 4 relevant elements

PRUM stochastic user model x 1 2 x 1 2 x 1 3 2 x 1 x 2/4 1 5 3 x 1 2

PRUM stochastic user model x P(x->) 0 1 0.7 0.5

PRUM definition Extends Raghavan's definition P Lur Cs, L=l,Q=q The element Leads to an Unseen Relevant The element is Consulted by the user The user wants to see l % of the relevant elements

Computing PRUM Two probabilities have to be computed at each rank in the consulted list: The probability that the user has seen s distinct relevant elements P F i =s The probability that the user sees a new relevant element (knowing that ) P F i F i 1 F i 1 =s

Experiments (INEX 2004) 5 hypothetic systems 1) ideal (I) = ancestors (A) 2) parent (P) ~ biggest child (B) 3) document (D) Orders GRP: A >> I > P > D >> B RPng: A > I ~ P > D >> B PRUM: I = A >> B >> P >> D

Conclusion PRUM Extension of Precision-Recall (probabilistic) Explicit parameters ( GPR, PRng, XCG) Good behaviour of the metric Adapted to several IR formalisms Work to be done More realistic user model parameters Shortcomings Graded scale Complexity