Random Field Models for Applications in Computer Vision

Similar documents
Conditional Random Fields and beyond DANIEL KHASHABI CS 546 UIUC, 2013

Undirected Graphical Models

Machine Learning for Structured Prediction

Conditional Random Field

Discriminative Fields for Modeling Spatial Dependencies in Natural Images

Probabilistic Graphical Models

Intelligent Systems (AI-2)

Introduction to Graphical Models

Diagram Structure Recognition by Bayesian Conditional Random Fields

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

Intelligent Systems (AI-2)

A graph contains a set of nodes (vertices) connected by links (edges or arcs)

3 : Representation of Undirected GM

Lecture 9: PGM Learning

Intelligent Systems (AI-2)

Undirected Graphical Models: Markov Random Fields

CS Lecture 4. Markov Random Fields

Discriminative Random Fields: A Discriminative Framework for Contextual Interaction in Classification

Pattern Recognition and Machine Learning

Probabilistic Graphical Models

Conditional Random Fields: An Introduction

Structure Learning in Sequential Data

Brief Introduction of Machine Learning Techniques for Content Analysis

Sequential Supervised Learning

Lecture 6: Graphical Models

Introduction to Machine Learning Midterm, Tues April 8

Sequence Modelling with Features: Linear-Chain Conditional Random Fields. COMP-599 Oct 6, 2015

Probabilistic Graphical Models

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Conditional Independence and Factorization

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016

STAT 518 Intro Student Presentation

Graphical Models for Collaborative Filtering

Graphical models for part of speech tagging

Parameter learning in CRF s

Chris Bishop s PRML Ch. 8: Graphical Models

1 Undirected Graphical Models. 2 Markov Random Fields (MRFs)

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

Part I. C. M. Bishop PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS

A brief introduction to Conditional Random Fields

STA 4273H: Statistical Machine Learning

Bayesian Machine Learning - Lecture 7

10 : HMM and CRF. 1 Case Study: Supervised Part-of-Speech Tagging

6.867 Machine learning, lecture 23 (Jaakkola)

Bayesian Networks (Part II)

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning

Information Extraction from Text

13: Variational inference II

Directed and Undirected Graphical Models

Computational Genomics

An Introduction to Bayesian Machine Learning

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016

Undirected graphical models

Towards Maximum Geometric Margin Minimum Error Classification

Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning

Lecture 15. Probabilistic Models on Graph

Overview of Statistical Tools. Statistical Inference. Bayesian Framework. Modeling. Very simple case. Things are usually more complicated

Bayesian Conditional Random Fields

Markov Random Fields

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

Nonparametric Bayesian Methods (Gaussian Processes)

arxiv: v1 [stat.ml] 17 Nov 2010

Probabilistic Models for Sequence Labeling

Chapter 10: Random Fields

Probabilistic Graphical Models

Machine Learning Lecture 14

Graphical Models and Kernel Methods

Intelligent Systems:

Density Propagation for Continuous Temporal Chains Generative and Discriminative Models

Directed and Undirected Graphical Models

CPSC 540: Machine Learning

Bayesian Networks Introduction to Machine Learning. Matt Gormley Lecture 24 April 9, 2018

Chapter 16. Structured Probabilistic Models for Deep Learning

Bayesian Networks Inference with Probabilistic Graphical Models

COMP90051 Statistical Machine Learning

Statistical and Learning Techniques in Computer Vision Lecture 2: Maximum Likelihood and Bayesian Estimation Jens Rittscher and Chuck Stewart

Support Vector Machines. CAP 5610: Machine Learning Instructor: Guo-Jun QI

Fast Inference and Learning with Sparse Belief Propagation

Probabilistic Graphical Models: MRFs and CRFs. CSE628: Natural Language Processing Guest Lecturer: Veselin Stoyanov

Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science Algorithms For Inference Fall 2014

Logistic Regression: Online, Lazy, Kernelized, Sequential, etc.

Machine Learning Overview

Graphical Models Another Approach to Generalize the Viterbi Algorithm

Machine Learning Lecture 5

Recent Advances in Bayesian Inference Techniques

Probabilistic Machine Learning. Industrial AI Lab.

Course 16:198:520: Introduction To Artificial Intelligence Lecture 9. Markov Networks. Abdeslam Boularias. Monday, October 14, 2015

Predicting Sequences: Structured Perceptron. CS 6355: Structured Prediction

Advanced statistical methods for data analysis Lecture 2

Hidden Markov Models

HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems

ECE521 Tutorial 11. Topic Review. ECE521 Winter Credits to Alireza Makhzani, Alex Schwing, Rich Zemel and TAs for slides. ECE521 Tutorial 11 / 4

Probabilistic Graphical Models

Probabilistic Graphical Models

Probabilistic classification CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2016

ECE521 Lecture 19 HMM cont. Inference in HMM

STA 4273H: Statistical Machine Learning

Alternative Parameterizations of Markov Networks. Sargur Srihari

2.2 Structured Prediction

Transcription:

Random Field Models for Applications in Computer Vision Nazre Batool Post-doctorate Fellow, Team AYIN, INRIA Sophia Antipolis

Outline Graphical Models Generative vs. Discriminative Classifiers Markov Random Field Model Conditional Random Field Model Discriminative Random Field Model - An Example

Graphical Models 1 Graphical Models Graph Theory + Probability Models A graphical model is a family of probability distributions that factorizes according to an underlying graph Factorization -> incorporate contextual constraints -> simplify computation A distribution over a large number of random variables -> product of local functions of small number of variables only P ( Y ) V c ( Y ) c [1] Christopher M. Bishop, Pattern Recognition and Machine Learning. Springer, 2006

Graphical Models 1 How is Factorization achieved in a graphical model? Conditional Independence in a graphical model The absence of an edge between two nodes shows the conditional independence of corresponding random variables Conditional independence/absence of edges allows a complex probability distribution to be decomposed into factors Example : Directed Graphical Model (Bayesian Network) [1] Christopher M. Bishop, Pattern Recognition and Machine Learning. Springer, 2006

Graphical Models 1 Factor Graphs A bipartite graph to represent factorization /conditional independence Directed Graph Undirected Graph [1] Christopher M. Bishop, Pattern Recognition and Machine Learning. Springer, 2006

Generative vs. Discriminative Models for Classification 1 Overview of relationship between Generative and Conditional/Discriminative Models 1 [1] Charles Sutton, Andrew McCallum, An Introduction to Conditional Random Fields for Relational Learning. Introduction to Statistical Relational Learning, MIT Press, 2006. Nazre Batool April 23, 2014

[1] T. Jebara, Machine Learning : Discriminative and Generative (The Kluwer International Series in Engineering and Computer Science). Springer, December 2003. Nazre Batool April 23, 2014 Generative vs. Discriminative Models for Classification 1 Generative Models Separately estimate a probability model for each class generative term Classify by comparing probabilities under these models to find the class with highest probability Requires modeling of joint probability distribution of all variables The joint distribution can be used for classification by conditioning and using marginalization, Bayes rule, etc. Examples: Gaussian, naïve Bayes, hidden Markov Model, Bayesian Network, MRF

Generative vs. Discriminative Models for Classification 1 Discriminative Models No modeling of underlying probability distribution of classes Try to learn a classifier that performs well on the training data Process examples from several classes to find the optimal boundaries among classes Examples: Logistic Regression, Neural Networks, Support Vector Machines, Conditional Random Field [1] T. Jebara, Machine Learning : Discriminative and Generative (The Kluwer International Series in Engineering and Computer Science). Springer, December 2003.

Markov Random Field Markov random field is defined on an undirected graph Nodes of graph are random variables having Markov property A random field is a Markov random field when: The joint probability factorizes because of Markovianity property and can be represented as a Gibbs distribution (Hammersely- Clifford theorem) i.e. in terms of cliques, normalization constant and temperature parameter. 1 1 P ( Y) exp( U ( Y)) exp ( V c ( Y)) Z Z c C

Markov Random Field How is MRF a generative model? 1 Likelihood/Generative Term P( observations labels) P( labels) P( labels observations) P( observations labels) P( labels) P( observations) Prior probability model on labels An MRF : P( labels observations) U ( observations labels) U ( labels) An MRF : 1 P( labels) exp ( V Z c C c ( labels)) U ( labels) P( labels observations) P( observations labels) P( labels) Any exponential probability function : 1 P( observations labels) exp U ( observations labels) Z [1] S. Z. Li, Markov Random Field Modeling in Image Analysis, Springer 2009.

Nazre Batool April 23,2014 Generative vs. Discriminative Models for Classification Both types of classifiers have pros and cons Why Conditional Random Field (instead of MRF)? Modeling of joint probability distribution (Generative Model) is not always tractable/efficient Independence assumptions are made for conditional, easy to compute, distributions/likelihood Prior distributions are incorporated to constrain the joint distribution Modeling joint probability, P(observations labels)p(labels), is only an intermediate step towards the ultimate goal of classification Why not use discriminative classifiers directly?

Conditional Random Field 1,2 Definition: Let G = (S, E) be a graph such that x is indexed by the vertices of G. Then (x, y) is said to be a conditional random field if, when conditioned on x, the random variables {y} obey the Markov property with respect to the graph. Markovianity only after labels have been conditioned on observations in some features No Markovianity on label priors No generative terms/likelihood [1] Charles Sutton, Andrew McCallum, An Introduction to Conditional Random Fields for Relational Learning. Introduction to Statistical Relational Learning, MIT Press, 2006. [2] J. Lafferty, A. McCallum, F. Pereira, Conditional random elds: Probablistic models for segmenting and labeling sequence data.proc. Int. Conf. on Machinge Learning, 2001. Nazre Batool April 23, 2014

Conditional Random Field A discriminative counterpart to generative Hidden Markov Model (HMM) HMM: Independence assumptions are made for tractability Each observation variable depends only on the current state Each state depends only on its immediate state Likelihood/Generative Term

Conditional Random Field CRF: Condition HMM on observations (instead of the generative term) Introduce feature functions (flexible may not be log probabilities) and Normalization constant to sum to 1

Conditional Random Field CRF: Contrary to HMM, independence assumption between observations/hidden variables is relaxed

Discriminative Random Field (DRF) 1 An example of CRF 2D extension of CRF discriminate model in 2D Globally conditioned on observation data Not linear-chain -> has neighborhood relationships among labels [1] S. Kumar and M. Herbert, Discriminative Fields for Modeling Spatial Dependencies in Natural Images, Proc. 18th Ann. Conf. Neural Information Processing Systems, 2004

DRF for Object Detection DRF for detection of Man-made structures in images Features h i (y) and µ ij (y) are based on Histograms of gradients Please look at cited work for MAP optimization algorithms!

Conclusion Take-home Messages Graphical Models conditional independence -> Factorization in graphs -> tractability Generative vs. Discriminative Classifiers/Models Markovianity is more general than characteristic of being generative MRF vs. CRF

Thank You! Nazre Batool April 23, 2014