A Formalization of Logical Imaging for IR using QT

Similar documents
The troubles with using a logical model of IR on a large collection of documents

Geometric and Quantum Methods for Information Retrieval

Information Retrieval Basic IR models. Luca Bondi

DRAFT. The geometry of information retrieval and the dimensions of perception. C.J. van Rijsbergen

Probability Kinematics in Information. Retrieval: a case study. F. Crestani. Universita' di Padova, Italy. C.J. van Rijsbergen.

arxiv:quant-ph/ v1 23 Mar 2007

Zuccon, Guido (2012) Document ranking with quantum probabilities. PhD thesis. Copyright and moral rights for this thesis are retained by the author

A Quantum Query Expansion Approach for Session Search

The Static Absorbing Model for the Web a

End-to-End Quantum-Like Language Models with Application to Question Answering

Introduction to Quantum Probability. for Social and Behavioral Scientists

Towards modeling implicit feedback with quantum entanglement

Linear Algebra and Dirac Notation, Pt. 1

The Liar-paradox in a Quantum Mechanical Perspective

Comparison of Quantum and Bayesian Inference Models

Retrieval by Content. Part 2: Text Retrieval Term Frequency and Inverse Document Frequency. Srihari: CSE 626 1

HELP US REMOVE OUTDATED BELIEFS?

On updates with integrity constraints

Introduction to Quantum Computing

Can Vector Space Bases Model Context?

Quantum Probability in Cognition. Ryan Weiss 11/28/2018

S (16) DOI: Reference: PHYSA 16966

arxiv: v1 [cs.ir] 26 Oct 2018

Query Performance Prediction: Evaluation Contrasted with Effectiveness

Lecture Notes in Artificial Intelligence 5494

Motivation. User. Retrieval Model Result: Query. Document Collection. Information Need. Information Retrieval / Chapter 3: Retrieval Models

Deposited on: 1 November 2007 Glasgow eprints Service

Modeling Quantum Entanglements in Quantum Language Models

The quantum state as a vector

Spooky Memory Models Jerome Busemeyer Indiana Univer Univ sity er

Conditional probabilistic reasoning without conditional logic

Ranked Retrieval (2)

On the Probabilistic Logical Modelling of Quantum and Geometrically-Inspired IR

arxiv:cs.ai/ v2 23 Oct 2006

Basics on quantum information

Chapter 5. Density matrix formalism

IR Models: The Probabilistic Model. Lecture 8

Phil Introductory Formal Logic

Short Course in Quantum Information Lecture 2

arxiv: v2 [quant-ph] 23 Aug 2018

The Kochen-Specker Theorem

In Newcomb s problem, an agent is faced with a choice between acts that

Algebra C Numerical Linear Algebra Sample Exam Problems

2. Introduction to quantum mechanics

Chap 2: Classical models for information retrieval

An Empirical Study on Dimensionality Optimization in Text Mining for Linguistic Knowledge Acquisition

Linear Algebra and Dirac Notation, Pt. 2

Stochastic Processes

The Quantum Challenge in Concept Theory and Natural Language Processing

Density Matrices. Chapter Introduction

Quantum Computing Lecture 2. Review of Linear Algebra

Stochastic Quantum Dynamics I. Born Rule

A Multi-Layered Bayesian Network Model for Structured Document Retrieval

Math Camp II. Basic Linear Algebra. Yiqing Xu. Aug 26, 2014 MIT

Introduction to Quantum Information Hermann Kampermann

1 Mathematical preliminaries

Toward a Quantum Theory of Cognition: History, Development and Perspectives

Contextual equivalence

Ensembles and incomplete information

In the beginning God created tensor... as a picture

Basics on quantum information

Towards Collaborative Information Retrieval

Incompatibility Paradoxes

Lecture 4: Postulates of quantum mechanics

INFO 4300 / CS4300 Information Retrieval. IR 9: Linear Algebra Review

Stochastic Histories. Chapter Introduction

Techniques. Contextual equivalence

Connecting the categorical and the modal logic approaches to Quantum Mech

Mathematical foundations - linear algebra

Harvard CS 121 and CSCI E-207 Lecture 6: Regular Languages and Countability

Linear Algebra using Dirac Notation: Pt. 2

Lecture 2: Linear operators

arxiv: v1 [cs.ir] 15 Sep 2018

By allowing randomization in the verification process, we obtain a class known as MA.

Conflict-Based Belief Revision Operators in Possibilistic Logic

Probabilistic Information Retrieval

Belief revision: A vade-mecum

Counterfactuals and comparative similarity

Information Retrieval and Web Search

Response Selection Using Neural Phase Oscillators

Lecture notes on Quantum Computing. Chapter 1 Mathematical Background

Latent Semantic Indexing (LSI) CE-324: Modern Information Retrieval Sharif University of Technology

02 Propositional Logic

Ph 219/CS 219. Exercises Due: Friday 20 October 2006

If you d wiggled A, then B would ve changed Causality and counterfactual conditionals

Reasoning Under Uncertainty: Introduction to Probability

Propositional logic. First order logic. Alexander Clark. Autumn 2014

Foundations of non-commutative probability theory

Probabilistic Latent Semantic Analysis

PHY305: Notes on Entanglement and the Density Matrix

1. Let A = (a) 2 (b) 3 (c) 0 (d) 4 (e) 1

Modeling User s Cognitive Dynamics in Information Access and Retrieval using Quantum Probability (ESR-6)

Language Models and Smoothing Methods for Collections with Large Variation in Document Length. 2 Models

Chapter 2 The Density Matrix

Elementary linear algebra

Boolean and Vector Space Retrieval Models

Computer Science CPSC 322. Lecture 18 Marginalization, Conditioning

THE NUMBER OF ORTHOGONAL CONJUGATIONS

Correlated Information: A Logic for Multi-Partite Quantum Systems

Timed Automata VINO 2011

Transcription:

A Formalization of Logical Imaging for IR using QT September 1st, 2008 guido@dcs.gla.ac.uk http://www.dcs.gla.ac.uk/ guido Information Retrieval Group University of Glasgow

QT in IR Why QT? General guidelines for the use of QT in IR

Why QT? General guidelines for the use of QT in IR How could QT be useful to IR? In the last years there have been some attempts to model classical systems using QT: Cognitive processes such as concepts formation and combination [AG05] Modeling of semantics [BKNM08, BW08, CCS08] Modeling IR processes and techniques [vr04, AvR07, HRAvR08] QT trait d union between logics, probability and geometry QT naturally models the contextual behaviour of complex systems [Kit08] Inspires the invention of new models and techniques

Why QT? General guidelines for the use of QT in IR How IR can be translated in QT Correspondences between concepts in IR and QT: Information space Hilbert space H Document d vector d Relevance an observable, that is a Hermitian operator (R) The value of an observable is given by the eigenvalues of the operator (always real). A state of the Quantum system corresponds to a vector; we can associate a density operator to each state, e.g. ρ x = x x

In this section: Considerations and Future directions

Motivation for using Logical Imaging in IR How do we estimate the probability of relevance of a document given a query? P(R q, d) P(d q) LUP by van Rijsbergen: Given any two sentences x and y; a measure of the uncertainty of y x relative to a given data set, is determined by the MINIMAL EXTENT to which we have to add information to the data set, to establish the truth of y x. [van89] Logical Imaging (LI) addresses such minimal belief revision

Logical Imaging: a technique for belief revision Then, what is LI? LI is a technique for belief revision, based on Possible Worlds Semantics (PWS) Introduced by Stalnaker [HSP81], extended and refined by Lewis [Lew73, Lew76], investigated by Gärdenfors [Gär88], applied to IR by Crestani [CvR95a, CvR95b] and Amati [AK92]

The intuition behind LI A probability is assigned to each term, There is a measure of similarity between terms (e.g. EMIM, cosine, NGD,...) Probability mass is moved between similar terms: this generates a kinematics of probabilities

A first LI model The LI model for IR by Crestani [CvR95a, CvR95b]: Each term has a probability For every document in the collection, we evaluate P(d q) by imaging on the document Uses EMIM as similarity measure The kinematics: The probabilities associated to a term NOT IN the document are transfered to the most similar term IN the document

Formal definition of the model (1/2) A formal definition of Crestani s model: T terms t i, probability P(t i ) associated, t i T P(t i) = 1 P(d q) = T P(t i)τ(t d, q) t d is the most similar term to t i for which d is true; τ is a truth function

Formal definition of the model (2/2) night bat baseball d 1 hit ball cricket

Variations on the theme The variations affect the kinematics of probabilities: Transfer of probabilities towards more than a single term in the document: proportional opinionated The transfer leaves a small probability mass associated to the term not in the document: Jeffrey s conditionalization

Logical Imaging in QT Represent a term t i with a vector of document e.g. t i = [2 0 1] T P(t) be a probability distribution over the set T of terms (thus, k i 1 P(t i ) = 1 and 0 P(t i ) 1) Note: We use the Dirac notation, where the following correspondences hold: column vector x x inner product y x y x row vector x x outer product y x y x

Logical Imaging in QT A document d is represented by d, a query q by q. e.g. d i = t i W d λ i t i where W d is the set of terms which are present in d Projector associated to document: P d = ti d t i t i Projector associated to query: P d = ti q t i t i

Density operator and its evolution We can associate a density operator to document d: ρ d = t i d α i( n j=1 λ2 i,j e j e j ) Thus, the density operator associated to d after imaging is: ρ d = t i W d α i( n j=1 λ i,j 2 e j ej ), Note: To simplify the notation, we assume P(t i ) = α i

Computing P(d q) We can now compute the probability P(d q) after Imaging, or, more precisely, the probability induced by P d P q. In order to do so we make use of the Gleason s Theorem: Let H be a Hilbert space with dimension n (finite or countably infinite) larger than 2. Let P be a probability measure defined upon the closed subspaces of H that is additive for a finite or infinite number of mutually orthogonal subspaces. Then there exists a trace class positive operator ρ with unit trace such that P(L) = tr(ρp L), P L being the projector upon a closed subspace L."

Computing P(d q) Thanks to Gleason s Theorem, P(d q) = tr(ρ d P R), where P R is the projector associated to P d P q Note, in the case that P d and P q are not compatible (they do not commute, P d P q P q P d ): P R = I lim n ((I P d )(I lim n P d P q P d ) n (I P d )) n

(1/2) How do we move the probabilities? Let us assume that the probability associated to each term is stored in an entry of a diagonal matrix, eg: 0.3 0 0 0 0 0.2 0 0 0 0 0.1 0 0 0 0 0.4 and t 1 t 3 0 0 1 0 Applying transformation A = K T AK with K = 0 1 0 0 0 0 1 0... 0 0 0 1

(2/2)...we obtain 0 0 0 0 0 0.2 0 0 0 0 0.4 0 0 0 0 0.4 remember t 1 t 3 The Kinematics operator corresponds to the evolution operator U of the Schrödinger Picture. The evolution of the density operator D from time t 1 to time t 2 is governed by the equation: D(t 2 ) = U (t 2 )D(t 1 )U(t 2 )

Recap LI moves probabilities from terms not in the document to similar terms into the document QT is capable to model LI and its kinematics, by means of a particular operator, the Kinematics Operator

(1/2) This formulation of LI is not effective: MAP and P@R are lower than baselines values obtained with simple weightings as TF IDF or IDF on standard TREC collections (e.g. WSJ, AP). The current formulation of LI is more effective than simple baselines on small collections, e.g. CACM, NPL. The LI theory is soundness, but the current formulation is poor (higher rank to documents that contain not discriminative information)

(2/2) We are investigating a reformulation of LI considering an intra-document kinematics We also want to take into account the context where a term appears: Context Based Logical Imaging (CBLI) using Voronoi Tessellation in the QT framework

A Voronoi Tessellation of a 2D space, and its use to guide the Kinematics ball hit cricket bat night baseball Voronoi Tessellation generated by d = {night, bat,...}

A Voronoi Tessellation of a 2D space, and its use to guide the Kinematics ball hit cricket bat night baseball Voronoi Tessellation generated by d = {bat, hit,...}

A Voronoi Tessellation of a 2D space, and its use to guide the Kinematics hit ball cricket ball hit cricket bat bat night baseball night baseball The difference in the Kinematics produced by two different Voronoi Tessellations

I [AG05] [AK92] [AvR07] [BKNM08] D. Aerts and L. Gabora. A Theory of Concepts and their Combinations I. Kybernetes, 34:151 175, 2005. G. Amati and S. Kerpedjev. An Information Retrieval Model: Implementation and Experiments. Technical Report 5B04892, Fondazione Ugo Bordoni, Roma, IT, 1992. S. Arafat and C. J. van Rijsbergen. Quantum Theory and the Nature of Search. In Proc. of the 1st AAAI Quantum Interaction Symposium, pages 114 121, Stanford, CA, USA, 2007. P. Bruza, K. Kitto, D. Nelson, and K. McEvoy. Entangling Words and Meaning. In Proc. of the 2nd AAAI Quantum Interaction Symposium, pages 118 124, Oxford, UK, 2008.

II [BW08] [CCS08] [CvR95a] [CvR95b] P. Bruza and J. Woods. Quantum Collapse in Semantic Space: Interpreting Natural Language Argumentation. In Proc. of the 2nd AAAI Quantum Interaction Symposium, pages 141 147, Oxford, UK, 2008. S. Clark, B. Coecke, and M. Sadrzadeh. A Compositional Distributional Model of Meaning. In Proc. of the 2nd AAAI Quantum Interaction Symposium, pages 133 140, Oxford, UK, 2008. F. Crestani and C. J. van Rijsbergen. Information Retrieval by Logical Imaging. Journal of Documentation, 51(1):3 17, Mar 1995. F. Crestani and C. J. van Rijsbergen. Probability Kinematics in Information Retrieval. Proc. ACM SIGIR 95, pages 291 299, Jan 1995.

III [Gär88] P. Gärdenfors. Knowledge in Flux. Modeling the Dynamics of Epistemic States. The MIT Press, Cambridge, MA, USA, 1988. [HRAvR08] A. Huertas-Rosero, L. A. Azzopardi, and C.J. van Rijsbergen. Characterising Through Erasing A Theoretical Framework for Representing Documents Inspired by Quantum Theory. In Proc. of the 2nd AAAI Quantum Interaction Symposium, pages 160 163, Oxford, UK, 2008. [HSP81] [Kit08] W. L. Harper, R. Stalnaker, and G. Pearce, editors. Ifs. Conditionals, Belief, Decision, Chance and Time. Reidel, Dordrecht, NL, 1981. K. Kitto. Why Quantum Theory? In Proc. of the 2nd AAAI Quantum Interaction Symposium, pages 11 18, Oxford, UK, 2008.

IV [Lew73] [Lew76] [van89] [vr04] D. K. Lewis. Counterfactuals. Basil Blackwell, Oxford, UK, 1973. D. K. Lewis. Probabilities of Conditionals and Conditional Probabilities. The Philosophical Review, 85:297 315, 1976. C. J. van Rijsbergen. Towards an Information Logic. In Proc. ACM SIGIR 89, pages 77 86, Cambridge, MA, USA, 1989. C. J. van Rijsbergen. The Geometry of Information Retrieval. CUP, 2004.

Questions? Thanks for the attention Any question? For further discussion do not hesitate to contact me on email (guido@dcs.gla.ac.uk)