Social Saliency Prediction. Hyun Soo Park and Jianbo Shi

Size: px
Start display at page:

Download "Social Saliency Prediction. Hyun Soo Park and Jianbo Shi"

Transcription

1 Social Saliency Prediction Hyun Soo Park and Jianbo Shi

2 Nonverbal social signals 67% Hogan and Stubbs, Can t Get Through 8 Barriers to Communication, Vinciarelli et al., Social Signal Processing: Survey of an Emerging Domain, Image and Vision Computing, 2009.

3 Nonverbal social signals 67% Location of joint attention Hogan and Stubbs, Can t Get Through 8 Barriers to Communication, Vinciarelli et al., Social Signal Processing: Survey of an Emerging Domain, Image and Vision Computing, 2009.

4 Geometric Localization of Joint Attention Gaze ray Head location Head orientation Location of joint attention Patron-Perez et al., Structured Learning of Human Interactions in TV Shows, PAMI, 2012 Fathi et al., Social Interactions: A First Person Perspective, CVPR, 2012 Park et al., 3D Social Saliency from Head-mounted Cameras, NIPS, 2012

5 Challenges in Social Scenes

6 Challenges in Social Scenes Gaze detection Marin-Jimenez et al., Detecting People Looking at Each Other in Videos, IJCV 2014

7 Social Scene Can we localize joint attention without measuring gaze directions? Challenges in Social Scenes True positive head detection Marin-Jimenez et al., Detecting People Looking at Each Other in Videos, IJCV 2014

8 Input Video

9 Structure from Motion

10 Children sitting area Camera trajectory

11 Children Social saliency Social saliency: likelihood of joint attention Output

12 Children Social saliency Social saliency: likelihood of joint attention Output

13 Critani et Cristani al., BMVC et al., 2011 BMVC 2011 Fathi et Fathi al., CVPR et al., 2012 CVPR 2012 Park et al., Park NIPS et 2012 al., NIPS 2012 Rodriguez et al., ICCV 2011 Lan et al., PAMI 2012 Chakraborty et al., CVPR 2013 Yang et al., CVPR 2011 Alahi et al., CVPR 2014 Choi et al., ECCV 2014 Li et al., ICCV 2013 Ryoo et al., CVPR 2013 Pusiol et al., CogSci 2014 Arev et al., SIGGRAPH 2014 Third person view Distance between face and camera First person view

14 Joint Attention from First Person Cameras Croquet : Joint attention : Head direction Park et al., 3D Social Saliency from Head-mounted Cameras, NIPS, 2012.

15 Critani et Cristani al., BMVC et al., 2011 BMVC 2011 Fathi et Fathi al., CVPR et al., 2012 CVPR 2012 Park et al., Park NIPS et 2012 al., NIPS D estimation error<10cm Noninvasiveness Measurement accuracy Third person view Distance between face and camera First person view

16 Critani et Cristani al., BMVC et al., 2011 BMVC 2011 Fathi et Fathi al., CVPR et al., 2012 CVPR 2012 Park et al., Park NIPS et 2012 al., NIPS D estimation error<10cm Noninvasiveness Prediction Learning Measurement accuracy Third person view Distance between face and camera First person view

17 : Ground truth joint attention : Head location Location of performer: Ground truth joint attention Children g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members.

18 : Ground truth joint attention : Head location g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members. cf. g ({ } N ) i i 1 = = Geometric localization: triangulation

19 : Ground truth joint attention : Head location g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members.

20 : Ground truth joint attention : Head location : Center of mass g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members. ex. g ({ } N ) i i = 1 1 N Θ = Θ= N i = 1 Geometric localization: center of mass

21 : Ground truth joint attention : Head location : Center of mass : Center of circumcircle g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members. cf. g ({ } N ) i i = 1 1 N Θ = Θ= N i = 1 Geometric localization: center of circumcircle

22 : Ground truth joint attention : Head location : Center of mass : Center of circumcircle g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members.

23 First Person Social Interaction Data Scene N T(sec) F B-boy I B-boy II B-boy III B-boy IV Surprise party Class Croquet Busker I Busker II Card game Hide and seek Block building Social game Meeting I Meeting II Picnic Musical Dance way party Snowman Total 49,490 social formations

24 : Ground truth joint attention : Head location g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members.

25 g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members. Scale variation : Ground truth joint attention : Head location

26 g ({ } N ) i i = 1 Θ = Social formation where N is the number of social members. Scale variation Orientation variation : Joint attention : Head location

27 Representation: Social Dipole Moment

28 Water molecule, H2O

29 104.5 Water molecule, H2O N q = ( e p ) e i i Electric dipole moment

30 H H O Water molecule, H2O N q = ( e p ) e i i Electric dipole moment

31 H H O Water molecule, H2O N q = ( e p ) e i i Electric dipole moment

32 O O O H H O H H H H O H H H H Water molecule, H2O N q = ( e p ) e i i Electric dipole moment

33 q s : Ground truth joint attention p i c = 1 N N i p i : Head location : Center of mass : Social dipole moment N 1 q= s- pi = s-c N i Social dipole moment

34 Orientation Normalization q p i s c : Ground truth joint attention : Head location : Center of mass : Social dipole moment N 1 q= s- p = s-c i N i Social dipole moment

35 Scale Normalization q p i s c : Ground truth joint attention : Head location : Center of mass : Social dipole moment 1 N N i p i c = 1

36 Social Formation Feature q s p i c

37 Social Formation Feature q s p i c

38 Learning Likelihood of Joint Attention q s x p i c ( f c, f s ; x s) Φ = =1

39 Learning Likelihood of Joint Attention x q x p i x x x s x c x x x ( f c, f s ; x s) Φ = =1 ( f c, f s ; x s) Φ =0 AdaBoost binary classifier

40 Joint Attention Prediction x f s p i c f c ( f c, f s ; x s) Φ = =1 ( f c, f s ; x s) Φ =0 AdaBoost binary classifier

41 Joint Attention Prediction x f s p i c f c ( f c, f s ; x s) Φ = =1 ( f c, f s ; x s) Φ =0 AdaBoost binary classifier Social saliency: Likelihood of joint attention

42 Joint Attention Prediction f s p i s c f c ( f c, f s ; x s) Φ = =1 ( f c, f s ; x s) Φ =0 AdaBoost binary classifier Social saliency: Likelihood of joint attention

43 Input video Human detection

44 Detected social group

45 Input video Human detection Group detection Learning via FPCs Social saliency prediction

46 Result

47 Social member location Joint attention location Center of mass (COM) Center of circumcircle (CC)

48 Group meeting CF: Context feature (Lan et al., PAMI 2012) Social member location Joint attention location Center of mass (COM) Center of circumcircle (CC) Street performance Class interactions

49 Group meeting Street performance Class interactions CF: Context feature (Lan et al., PAMI 2012) Social member location Joint attention location Center of mass (COM) Center of circumcircle (CC)

50 Group meeting Street performance Class interactions CF: Context feature (Lan et al., PAMI 2012) SSF: Social formation feature RF: Random forests Scenes SFF+Boosting SFF+RF CC COM CF Social member location Joint attention location Center of mass (COM) Center of circumcircle (CC) Dance Meeting I B-boy I Class Busker Picnic Social game Mean average precision

51

52

53

54

55 Basketball Scene Result

56

57 University team of Northwestern Polytechnical University (China)

58

59 Dynamic Joint Attention Prediction q s p i c ( f c, f s ; x s) Φ = =1 ( f c, f s ; x s) Φ =0

60 Dynamic Joint Attention Prediction q s p i c v com ( c s f, f, v ; x s) ( c s f, f, v ; x s) Φ = = com 1 Φ = = com 0

61 Top view Social saliency Person detector: Yang and Ramanan, Articulated Human Detection with Flexible Mixtures of Parts, PAMI 2003.

62 Can we predict social saliency without measuring gaze directions? Social formation Social saliency

63 Social Saliency Prediction Hyun Soo Park and Jianbo Shi Project website: Poster #36

Mathematical Definition of Group Behaviors Inspired by Granger Causality

Mathematical Definition of Group Behaviors Inspired by Granger Causality Social Dynamics Mathematical Definition of Group Behaviors Inspired by Granger Causality Behaviors of X and Y are a group behavior if spatial/temporal predictions of Y based on X and Y jointly are more

More information

Pictorial Structures Revisited: People Detection and Articulated Pose Estimation. Department of Computer Science TU Darmstadt

Pictorial Structures Revisited: People Detection and Articulated Pose Estimation. Department of Computer Science TU Darmstadt Pictorial Structures Revisited: People Detection and Articulated Pose Estimation Mykhaylo Andriluka Stefan Roth Bernt Schiele Department of Computer Science TU Darmstadt Generic model for human detection

More information

Human Pose Tracking I: Basics. David Fleet University of Toronto

Human Pose Tracking I: Basics. David Fleet University of Toronto Human Pose Tracking I: Basics David Fleet University of Toronto CIFAR Summer School, 2009 Looking at People Challenges: Complex pose / motion People have many degrees of freedom, comprising an articulated

More information

End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation

End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation Wei Yang, Wanli Ouyang, Hongsheng Li, and Xiaogang Wang Department of Electronic Engineering,

More information

Uncertainty Models in Quasiconvex Optimization for Geometric Reconstruction

Uncertainty Models in Quasiconvex Optimization for Geometric Reconstruction Uncertainty Models in Quasiconvex Optimization for Geometric Reconstruction Qifa Ke and Takeo Kanade Department of Computer Science, Carnegie Mellon University Email: ke@cmu.edu, tk@cs.cmu.edu Abstract

More information

The state of the art and beyond

The state of the art and beyond Feature Detectors and Descriptors The state of the art and beyond Local covariant detectors and descriptors have been successful in many applications Registration Stereo vision Motion estimation Matching

More information

Feedback Loop between High Level Semantics and Low Level Vision

Feedback Loop between High Level Semantics and Low Level Vision Feedback Loop between High Level Semantics and Low Level Vision Varun K. Nagaraja Vlad I. Morariu Larry S. Davis {varun,morariu,lsd}@umiacs.umd.edu University of Maryland, College Park, MD, USA. Abstract.

More information

CSC487/2503: Foundations of Computer Vision. Visual Tracking. David Fleet

CSC487/2503: Foundations of Computer Vision. Visual Tracking. David Fleet CSC487/2503: Foundations of Computer Vision Visual Tracking David Fleet Introduction What is tracking? Major players: Dynamics (model of temporal variation of target parameters) Measurements (relation

More information

Advances in Computer Vision. Prof. Bill Freeman. Image and shape descriptors. Readings: Mikolajczyk and Schmid; Belongie et al.

Advances in Computer Vision. Prof. Bill Freeman. Image and shape descriptors. Readings: Mikolajczyk and Schmid; Belongie et al. 6.869 Advances in Computer Vision Prof. Bill Freeman March 3, 2005 Image and shape descriptors Affine invariant features Comparison of feature descriptors Shape context Readings: Mikolajczyk and Schmid;

More information

Level-Set Person Segmentation and Tracking with Multi-Region Appearance Models and Top-Down Shape Information Appendix

Level-Set Person Segmentation and Tracking with Multi-Region Appearance Models and Top-Down Shape Information Appendix Level-Set Person Segmentation and Tracking with Multi-Region Appearance Models and Top-Down Shape Information Appendix Esther Horbert, Konstantinos Rematas, Bastian Leibe UMIC Research Centre, RWTH Aachen

More information

Higher-order Statistical Modeling based Deep CNNs (Introduction) Peihua Li Dalian University of Technology

Higher-order Statistical Modeling based Deep CNNs (Introduction) Peihua Li Dalian University of Technology Higher-order Statistical Modeling based Deep CNNs (Introduction) Peihua Li Dalian University of Technology http://peihuali.org Outline What is Higher-order? Why We Study High-order Overview of Speaker

More information

Distinguish between different types of scenes. Matching human perception Understanding the environment

Distinguish between different types of scenes. Matching human perception Understanding the environment Scene Recognition Adriana Kovashka UTCS, PhD student Problem Statement Distinguish between different types of scenes Applications Matching human perception Understanding the environment Indexing of images

More information

Corners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros

Corners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros Corners, Blobs & Descriptors With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros Motivation: Build a Panorama M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 How do we build panorama?

More information

Solution: There are 30 choices for the first person to leave, 29 for the second, etc. Thus this exodus can occur in. = P (30, 8) ways.

Solution: There are 30 choices for the first person to leave, 29 for the second, etc. Thus this exodus can occur in. = P (30, 8) ways. Math-2320 Assignment 7 Solutions Problem 1: (Section 7.1 Exercise 4) There are 30 people in a class learning about permutations. One after another, eight people gradually slip out the back door. In how

More information

Global Behaviour Inference using Probabilistic Latent Semantic Analysis

Global Behaviour Inference using Probabilistic Latent Semantic Analysis Global Behaviour Inference using Probabilistic Latent Semantic Analysis Jian Li, Shaogang Gong, Tao Xiang Department of Computer Science Queen Mary College, University of London, London, E1 4NS, UK {jianli,

More information

Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video

Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video In Proceedings of the European Conference on Computer Vision (ECCV), 26 Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video Yu-Chuan Su and Kristen Grauman

More information

Fisher Vector image representation

Fisher Vector image representation Fisher Vector image representation Machine Learning and Category Representation 2014-2015 Jakob Verbeek, January 9, 2015 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.14.15 A brief recap on kernel

More information

Lecture 13: Tracking mo3on features op3cal flow

Lecture 13: Tracking mo3on features op3cal flow Lecture 13: Tracking mo3on features op3cal flow Professor Fei- Fei Li Stanford Vision Lab Lecture 14-1! What we will learn today? Introduc3on Op3cal flow Feature tracking Applica3ons Reading: [Szeliski]

More information

Discriminative part-based models. Many slides based on P. Felzenszwalb

Discriminative part-based models. Many slides based on P. Felzenszwalb More sliding window detection: ti Discriminative part-based models Many slides based on P. Felzenszwalb Challenge: Generic object detection Pedestrian detection Features: Histograms of oriented gradients

More information

Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context

Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Silvio Savarese Lecture 10-16-Feb-15 From the 3D to 2D & vice versa

More information

Compressed Fisher vectors for LSVR

Compressed Fisher vectors for LSVR XRCE@ILSVRC2011 Compressed Fisher vectors for LSVR Florent Perronnin and Jorge Sánchez* Xerox Research Centre Europe (XRCE) *Now with CIII, Cordoba University, Argentina Our system in a nutshell High-dimensional

More information

Probability Theory and Simulation Methods

Probability Theory and Simulation Methods Feb 28th, 2018 Lecture 10: Random variables Countdown to midterm (March 21st): 28 days Week 1 Chapter 1: Axioms of probability Week 2 Chapter 3: Conditional probability and independence Week 4 Chapters

More information

A Generative Perspective on MRFs in Low-Level Vision Supplemental Material

A Generative Perspective on MRFs in Low-Level Vision Supplemental Material A Generative Perspective on MRFs in Low-Level Vision Supplemental Material Uwe Schmidt Qi Gao Stefan Roth Department of Computer Science, TU Darmstadt 1. Derivations 1.1. Sampling the Prior We first rewrite

More information

Anticipating Visual Representations from Unlabeled Data. Carl Vondrick, Hamed Pirsiavash, Antonio Torralba

Anticipating Visual Representations from Unlabeled Data. Carl Vondrick, Hamed Pirsiavash, Antonio Torralba Anticipating Visual Representations from Unlabeled Data Carl Vondrick, Hamed Pirsiavash, Antonio Torralba Overview Problem Key Insight Methods Experiments Problem: Predict future actions and objects Image

More information

Video Indexing OptionTM. Lecture 2 Jenny Benois-Pineau LABRI UMR 5800/Université Bordeaux

Video Indexing OptionTM. Lecture 2 Jenny Benois-Pineau LABRI UMR 5800/Université Bordeaux Video Indexing OptionTM. Lecture 2 Jenny Benois-Pineau LABRI UMR 5800/Université Bordeaux Summary Spatio-temporal saliency modeling Robust motion estimation Global robust motion estimation for the foreground

More information

Graphical Object Models for Detection and Tracking

Graphical Object Models for Detection and Tracking Graphical Object Models for Detection and Tracking (ls@cs.brown.edu) Department of Computer Science Brown University Joined work with: -Ying Zhu, Siemens Corporate Research, Princeton, NJ -DorinComaniciu,

More information

Juergen Gall. Analyzing Human Behavior in Video Sequences

Juergen Gall. Analyzing Human Behavior in Video Sequences Juergen Gall Analyzing Human Behavior in Video Sequences 09. 10. 2 01 7 Juer gen Gall Instit u t e of Com puter Science III Com puter Vision Gr oup 2 Analyzing Human Behavior Analyzing Human Behavior Human

More information

Collective Activity Detection using Hinge-loss Markov Random Fields

Collective Activity Detection using Hinge-loss Markov Random Fields Collective Activity Detection using Hinge-loss Markov Random Fields Ben London, Sameh Khamis, Stephen H. Bach, Bert Huang, Lise Getoor, Larry Davis University of Maryland College Park, MD 20742 {blondon,sameh,bach,bert,getoor,lsdavis}@cs.umd.edu

More information

Activity Forecasting. Research CMU. Carnegie Mellon University. Kris Kitani Carnegie Mellon University

Activity Forecasting. Research CMU. Carnegie Mellon University. Kris Kitani Carnegie Mellon University Carnegie Mellon University Research Showcase @ CMU Robotics Institute School of Computer Science 8-2012 Activity Forecasting Kris Kitani Carnegie Mellon University Brian D. Ziebart Carnegie Mellon University

More information

Deep Learning for Computer Vision

Deep Learning for Computer Vision Deep Learning for Computer Vision Spring 2018 http://vllab.ee.ntu.edu.tw/dlcv.html (primary) https://ceiba.ntu.edu.tw/1062dlcv (grade, etc.) FB: DLCV Spring 2018 Yu-Chiang Frank Wang 王鈺強, Associate Professor

More information

2D Image Processing (Extended) Kalman and particle filter

2D Image Processing (Extended) Kalman and particle filter 2D Image Processing (Extended) Kalman and particle filter Prof. Didier Stricker Dr. Gabriele Bleser Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz

More information

Probabilistic Logic CNF for Reasoning

Probabilistic Logic CNF for Reasoning Probabilistic Logic CNF for Reasoning Song- Chun Zhu, Sinisa Todorovic, and Ales Leonardis At CVPR, Providence, Rhode Island June 16, 2012 Goal input tracking parsing reasoning activity recognition in

More information

6.8 The Pigeonhole Principle

6.8 The Pigeonhole Principle 6.8 The Pigeonhole Principle Getting Started Are there two leaf-bearing trees on Earth with the same number of leaves if we only consider the number of leaves on a tree at full bloom? Getting Started Are

More information

Large-Margin Thresholded Ensembles for Ordinal Regression

Large-Margin Thresholded Ensembles for Ordinal Regression Large-Margin Thresholded Ensembles for Ordinal Regression Hsuan-Tien Lin (accepted by ALT 06, joint work with Ling Li) Learning Systems Group, Caltech Workshop Talk in MLSS 2006, Taipei, Taiwan, 07/25/2006

More information

Geographic Knowledge Discovery Using Ground-Level Images and Videos

Geographic Knowledge Discovery Using Ground-Level Images and Videos This work was funded in part by a DOE Early Career award, an NSF CAREER award (#IIS- 1150115), and a PECASE award. We gratefully acknowledge NVIDIA for their donated hardware. Geographic Knowledge Discovery

More information

Human Action Recognition under Log-Euclidean Riemannian Metric

Human Action Recognition under Log-Euclidean Riemannian Metric Human Action Recognition under Log-Euclidean Riemannian Metric Chunfeng Yuan, Weiming Hu, Xi Li, Stephen Maybank 2, Guan Luo, National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing,

More information

Joint probabilistic pedestrian head and body orientation estimation

Joint probabilistic pedestrian head and body orientation estimation 214 IEEE Intelligent Vehicles Symposium (IV) June 8-11, 214. Dearborn, Michigan, USA Joint probabilistic pedestrian head and body orientation estimation Fabian Flohr 1,2, Madalin Dumitru-Guzu 1,3, Julian

More information

Fantope Regularization in Metric Learning

Fantope Regularization in Metric Learning Fantope Regularization in Metric Learning CVPR 2014 Marc T. Law (LIP6, UPMC), Nicolas Thome (LIP6 - UPMC Sorbonne Universités), Matthieu Cord (LIP6 - UPMC Sorbonne Universités), Paris, France Introduction

More information

Asaf Bar Zvi Adi Hayat. Semantic Segmentation

Asaf Bar Zvi Adi Hayat. Semantic Segmentation Asaf Bar Zvi Adi Hayat Semantic Segmentation Today s Topics Fully Convolutional Networks (FCN) (CVPR 2015) Conditional Random Fields as Recurrent Neural Networks (ICCV 2015) Gaussian Conditional random

More information

Introduction to Mobile Robotics Probabilistic Sensor Models

Introduction to Mobile Robotics Probabilistic Sensor Models Introduction to Mobile Robotics Probabilistic Sensor Models Wolfram Burgard 1 Sensors for Mobile Robots Contact sensors: Bumpers Proprioceptive sensors Accelerometers (spring-mounted masses) Gyroscopes

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Dan Oneaţă 1 Introduction Probabilistic Latent Semantic Analysis (plsa) is a technique from the category of topic models. Its main goal is to model cooccurrence information

More information

Transformation of Markov Random Fields for Marginal Distribution Estimation

Transformation of Markov Random Fields for Marginal Distribution Estimation Transformation of Markov Random Fields for Marginal Distribution Estimation Masaki Saito Takayuki Okatani Tohoku University, Japan {msaito, okatani}@vision.is.tohoku.ac.jp Abstract This paper presents

More information

Learning and Predicting On-road Pedestrian Behavior Around Vehicles

Learning and Predicting On-road Pedestrian Behavior Around Vehicles Learning and Predicting On-road Pedestrian Behavior Around Vehicles Nachiket Deo and Mohan M. Trivedi Laboratory for Intelligent and Safe Automobiles University of California San Diego La Jolla, USA {ndeo,

More information

Multimodal context analysis and prediction

Multimodal context analysis and prediction Multimodal context analysis and prediction Valeria Tomaselli (valeria.tomaselli@st.com) Sebastiano Battiato Giovanni Maria Farinella Tiziana Rotondo (PhD student) Outline 2 Context analysis vs prediction

More information

The Multibody Trifocal Tensor: Motion Segmentation from 3 Perspective Views

The Multibody Trifocal Tensor: Motion Segmentation from 3 Perspective Views The Multibody Trifocal Tensor: Motion Segmentation from 3 Perspective Views Richard Hartley 1,2 and RenéVidal 2,3 1 Dept. of Systems Engineering 3 Center for Imaging Science Australian National University

More information

Face detection and recognition. Detection Recognition Sally

Face detection and recognition. Detection Recognition Sally Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification

More information

Clustering with k-means and Gaussian mixture distributions

Clustering with k-means and Gaussian mixture distributions Clustering with k-means and Gaussian mixture distributions Machine Learning and Category Representation 2012-2013 Jakob Verbeek, ovember 23, 2012 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.12.13

More information

DRAFT. M118 Exam Jam Concise. Contents. Chapter 2: Set Theory 2. Chapter 3: Combinatorics 3. Chapter 4: Probability 4. Chapter 5: Statistics 6

DRAFT. M118 Exam Jam Concise. Contents. Chapter 2: Set Theory 2. Chapter 3: Combinatorics 3. Chapter 4: Probability 4. Chapter 5: Statistics 6 Contents Chapter 2: Set Theory 2 Chapter 3: Combinatorics 3 Chapter 4: Probability 4 Chapter 5: Statistics 6 Chapter 6: Linear Equations and Matrix Algebra 8 Chapter 7: Linear Programming: Graphical Solutions

More information

M118 Exam Jam. Contents. Chapter 2: Set Theory 2. Chapter 3: Combinatorics 5. Chapter 4: Probability 7. Chapter 5: Statistics 12

M118 Exam Jam. Contents. Chapter 2: Set Theory 2. Chapter 3: Combinatorics 5. Chapter 4: Probability 7. Chapter 5: Statistics 12 Contents Chapter 2: Set Theory 2 Chapter 3: Combinatorics 5 Chapter 4: Probability 7 Chapter 5: Statistics 12 Chapter 6: Linear Equations and Matrix Algebra 17 Chapter 7: Linear Programming: Graphical

More information

Robust Motion Segmentation by Spectral Clustering

Robust Motion Segmentation by Spectral Clustering Robust Motion Segmentation by Spectral Clustering Hongbin Wang and Phil F. Culverhouse Centre for Robotics Intelligent Systems University of Plymouth Plymouth, PL4 8AA, UK {hongbin.wang, P.Culverhouse}@plymouth.ac.uk

More information

CS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015

CS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015 CS 3710: Visual Recognition Describing Images with Features Adriana Kovashka Department of Computer Science January 8, 2015 Plan for Today Presentation assignments + schedule changes Image filtering Feature

More information

Chapter 1 Introduction

Chapter 1 Introduction 1 Chapter 1 Introduction Figure 1.1: Westlake Plaza A warm sunny day on a downtown street and plaza, pedestrians pass on the sidewalks, people sit on benches and steps, enjoying a cup of coffee, shoppers

More information

Chaotic Invariants for Human Action Recognition

Chaotic Invariants for Human Action Recognition Chaotic Invariants for Human Action Recognition Saad Ali Computer Vision Lab University of Central Florida sali@cs.ucf.edu Arslan Basharat Computer Vision Lab University of Central Florida arslan@cs.ucf.edu

More information

Chapter 14 Combining Models

Chapter 14 Combining Models Chapter 14 Combining Models T-61.62 Special Course II: Pattern Recognition and Machine Learning Spring 27 Laboratory of Computer and Information Science TKK April 3th 27 Outline Independent Mixing Coefficients

More information

A Novel Activity Detection Method

A Novel Activity Detection Method A Novel Activity Detection Method Gismy George P.G. Student, Department of ECE, Ilahia College of,muvattupuzha, Kerala, India ABSTRACT: This paper presents an approach for activity state recognition of

More information

Math 493 Final Exam December 01

Math 493 Final Exam December 01 Math 493 Final Exam December 01 NAME: ID NUMBER: Return your blue book to my office or the Math Department office by Noon on Tuesday 11 th. On all parts after the first show enough work in your exam booklet

More information

Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning

Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning Behjat Siddiquie Department of Computer Science University of Maryland, College Park behjat@cs.umd.edu Abhinav

More information

Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales

Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales Tsun-Yi Yang,2 Yen-Yu Lin Yung-Yu Chuang 2 Academia Sinica, Taiwan 2 National Taiwan University, Taiwan {shamangary,yylin}@citi.sinica.edu.tw

More information

Chapter 2: Set Theory 2. Chapter 3: Combinatorics 3. Chapter 4: Probability 4. Chapter 5: Statistics 5

Chapter 2: Set Theory 2. Chapter 3: Combinatorics 3. Chapter 4: Probability 4. Chapter 5: Statistics 5 M118 Exam Jam Concise s Contents Chapter 2: Set Theory 2 Chapter 3: Combinatorics 3 Chapter 4: Probability 4 Chapter 5: Statistics 5 Chapter 6: Linear Equations and Matrix Algebra 7 Chapter 7: Linear Programming:

More information

Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities

Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford University {bangpeng,feifeili}@cs.stanford.edu

More information

Intelligent Systems:

Intelligent Systems: Intelligent Systems: Undirected Graphical models (Factor Graphs) (2 lectures) Carsten Rother 15/01/2015 Intelligent Systems: Probabilistic Inference in DGM and UGM Roadmap for next two lectures Definition

More information

Second-Order Polynomial Models for Background Subtraction

Second-Order Polynomial Models for Background Subtraction Second-Order Polynomial Models for Background Subtraction Alessandro Lanza, Federico Tombari, Luigi Di Stefano DEIS, University of Bologna Bologna, Italy {alessandro.lanza, federico.tombari, luigi.distefano

More information

Hidden CRFs for Human Activity Classification from RGBD Data

Hidden CRFs for Human Activity Classification from RGBD Data H Hidden s for Human from RGBD Data Avi Singh & Ankit Goyal IIT-Kanpur CS679: Machine Learning for Computer Vision April 13, 215 Overview H 1 2 3 H 4 5 6 7 Statement H Input An RGBD video with a human

More information

Inquiry Based Instruction Unit. Virginia Kromhout

Inquiry Based Instruction Unit. Virginia Kromhout Inquiry Based Instruction Unit Virginia Kromhout Unit Title: _Exploring the moon Grade level: _2 grade nd Subject Area: _Science Topic: The Universe Key Words: Moon, lunar surface Designed By: Virginia

More information

Feature detectors and descriptors. Fei-Fei Li

Feature detectors and descriptors. Fei-Fei Li Feature detectors and descriptors Fei-Fei Li Feature Detection e.g. DoG detected points (~300) coordinates, neighbourhoods Feature Description e.g. SIFT local descriptors (invariant) vectors database of

More information

Covariance-Based PCA for Multi-Size Data

Covariance-Based PCA for Multi-Size Data Covariance-Based PCA for Multi-Size Data Menghua Zhai, Feiyu Shi, Drew Duncan, and Nathan Jacobs Department of Computer Science, University of Kentucky, USA {mzh234, fsh224, drew, jacobs}@cs.uky.edu Abstract

More information

Detector Ensemble. Abstract. 1. Introduction

Detector Ensemble. Abstract. 1. Introduction Detector Ensemble Shengyang Dai Ming Yang Ying Wu Aggelos Katsaggelos EECS Department, Northwestern University, Evanston, IL 60208, U.S.A. {sda690,mya671,yingwu,aggk}@ece.northwestern.edu Abstract Component-based

More information

BugID: Rapid-Throughput Insect Population Counting Using Computer Vision

BugID: Rapid-Throughput Insect Population Counting Using Computer Vision BugID: Rapid-Throughput Insect Population Counting Using Computer Vision Students: N. Larios, H. Deng, W. Zhang, M. Sarpola, J. Yuen, S. Ruiz Correa Faculty: R. Paasch, A. Moldenke, D. A. Lytle, E. Mortensen,

More information

Abnormal Activity Detection and Tracking Namrata Vaswani Dept. of Electrical and Computer Engineering Iowa State University

Abnormal Activity Detection and Tracking Namrata Vaswani Dept. of Electrical and Computer Engineering Iowa State University Abnormal Activity Detection and Tracking Namrata Vaswani Dept. of Electrical and Computer Engineering Iowa State University Abnormal Activity Detection and Tracking 1 The Problem Goal: To track activities

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 2016 MODULE 1 : Probability distributions Time allowed: Three hours Candidates should answer FIVE questions. All questions carry equal marks.

More information

Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning

Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning B. Michael Kelm Interdisciplinary Center for Scientific Computing University of Heidelberg michael.kelm@iwr.uni-heidelberg.de

More information

Statistical and Learning Techniques in Computer Vision Lecture 2: Maximum Likelihood and Bayesian Estimation Jens Rittscher and Chuck Stewart

Statistical and Learning Techniques in Computer Vision Lecture 2: Maximum Likelihood and Bayesian Estimation Jens Rittscher and Chuck Stewart Statistical and Learning Techniques in Computer Vision Lecture 2: Maximum Likelihood and Bayesian Estimation Jens Rittscher and Chuck Stewart 1 Motivation and Problem In Lecture 1 we briefly saw how histograms

More information

Descriptive Statistics Class Practice [133 marks]

Descriptive Statistics Class Practice [133 marks] Descriptive Statistics Class Practice [133 marks] The weekly wages (in dollars) of 80 employees are displayed in the cumulative frequency curve below. 1a. (i) (ii) Write down the median weekly wage. Find

More information

Outline: Ensemble Learning. Ensemble Learning. The Wisdom of Crowds. The Wisdom of Crowds - Really? Crowd wiser than any individual

Outline: Ensemble Learning. Ensemble Learning. The Wisdom of Crowds. The Wisdom of Crowds - Really? Crowd wiser than any individual Outline: Ensemble Learning We will describe and investigate algorithms to Ensemble Learning Lecture 10, DD2431 Machine Learning A. Maki, J. Sullivan October 2014 train weak classifiers/regressors and how

More information

Science in the Kitchen

Science in the Kitchen Program Support Notes by: Margaret Bishop B.Ed, Dip T Produced by: VEA Pty Ltd Commissioning Editor: Sandra Frerichs B.Ed, M.Ed. Executive Producer: Simon Garner B.Ed, Dip Management Davis Film and Video

More information

Finding Group Interactions in Social Clutter

Finding Group Interactions in Social Clutter Finding Group Interactions in Social Clutter Ruonan Li Harvard University ruonanli@seas.harvard.edu Parker Porfilio Brown University parker porfilio@brown.edu Todd Zickler Harvard University zickler@seas.harvard.edu

More information

Unit 5 Lesson 1 Images from Space. Copyright Houghton Mifflin Harcourt Publishing Company

Unit 5 Lesson 1 Images from Space. Copyright Houghton Mifflin Harcourt Publishing Company Florida Benchmarks SC.8.N.4.2 Explain how political, social, and economic concerns can affect science, and vice versa. SC.8.E.5.10 Assess how technology is essential to science for such purposes as access

More information

Robustifying the Flock of Trackers

Robustifying the Flock of Trackers 16 th Computer Vision Winter Workshop Andreas Wendel, Sabine Sternig, Martin Godec (eds.) Mitterberg, Austria, February 2-4, 2011 Robustifying the Flock of Trackers Jiří Matas and Tomáš Vojíř The Center

More information

Gait Recognition using Dynamic Affine Invariants

Gait Recognition using Dynamic Affine Invariants Gait Recognition using Dynamic Affine Invariants Alessandro Bissacco Payam Saisan UCLA CSD TR040014 Abstract We present a method for recognizing classes of human gaits from video sequences. We propose

More information

Feature detectors and descriptors. Fei-Fei Li

Feature detectors and descriptors. Fei-Fei Li Feature detectors and descriptors Fei-Fei Li Feature Detection e.g. DoG detected points (~300) coordinates, neighbourhoods Feature Description e.g. SIFT local descriptors (invariant) vectors database of

More information

Fast Human Detection from Videos Using Covariance Features

Fast Human Detection from Videos Using Covariance Features THIS PAPER APPEARED IN THE ECCV VISUAL SURVEILLANCE WORKSHOP (ECCV-VS), MARSEILLE, OCTOBER 2008 Fast Human Detection from Videos Using Covariance Features Jian Yao Jean-Marc Odobez IDIAP Research Institute

More information

The Success of Deep Generative Models

The Success of Deep Generative Models The Success of Deep Generative Models Jakub Tomczak AMLAB, University of Amsterdam CERN, 2018 What is AI about? What is AI about? Decision making: What is AI about? Decision making: new data High probability

More information

Lecture 12. Local Feature Detection. Matching with Invariant Features. Why extract features? Why extract features? Why extract features?

Lecture 12. Local Feature Detection. Matching with Invariant Features. Why extract features? Why extract features? Why extract features? Lecture 1 Why extract eatures? Motivation: panorama stitching We have two images how do we combine them? Local Feature Detection Guest lecturer: Alex Berg Reading: Harris and Stephens David Lowe IJCV We

More information

Urban land use information retrieval based on scene classification of Google Street View images

Urban land use information retrieval based on scene classification of Google Street View images Urban land use information retrieval based on scene classification of Google Street View images Xiaojiang Li 1, Chuanrong Zhang 1 1 Department of Geography, University of Connecticut, Storrs Email: {xiaojiang.li;chuanrong.zhang}@uconn.edu

More information

CS540 Machine learning L9 Bayesian statistics

CS540 Machine learning L9 Bayesian statistics CS540 Machine learning L9 Bayesian statistics 1 Last time Naïve Bayes Beta-Bernoulli 2 Outline Bayesian concept learning Beta-Bernoulli model (review) Dirichlet-multinomial model Credible intervals 3 Bayesian

More information

Single-Image-Based Rain and Snow Removal Using Multi-guided Filter

Single-Image-Based Rain and Snow Removal Using Multi-guided Filter Single-Image-Based Rain and Snow Removal Using Multi-guided Filter Xianhui Zheng 1, Yinghao Liao 1,,WeiGuo 2, Xueyang Fu 2, and Xinghao Ding 2 1 Department of Electronic Engineering, Xiamen University,

More information

Self-Tuning Semantic Image Segmentation

Self-Tuning Semantic Image Segmentation Self-Tuning Semantic Image Segmentation Sergey Milyaev 1,2, Olga Barinova 2 1 Voronezh State University sergey.milyaev@gmail.com 2 Lomonosov Moscow State University obarinova@graphics.cs.msu.su Abstract.

More information

Making Minimal Problems Fast

Making Minimal Problems Fast Making Minimal Problems Fast T o m a s P a j d l a presents work of M. Bujnak, Z. Kukelova & T. Pajdla Czech Technical University Prague Center for Machine Perception M o t i v a t i o n 3 D R E C O N

More information

Another look at Bayesian. inference

Another look at Bayesian. inference Another look at Bayesian A general scenario: - Query variables: X inference - Evidence (observed) variables and their values: E = e - Unobserved variables: Y Inference problem: answer questions about the

More information

A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis

A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis Saad Ali Computer Vision Lab University of Central Florida sali@cs.ucf.edu Mubarak Shah Computer Vision Lab University

More information

Class 26: review for final exam 18.05, Spring 2014

Class 26: review for final exam 18.05, Spring 2014 Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event

More information

Conditional Probability (cont...) 10/06/2005

Conditional Probability (cont...) 10/06/2005 Conditional Probability (cont...) 10/06/2005 Independent Events Two events E and F are independent if both E and F have positive probability and if P (E F ) = P (E), and P (F E) = P (F ). 1 Theorem. If

More information

Kai Yu NEC Laboratories America, Cupertino, California, USA

Kai Yu NEC Laboratories America, Cupertino, California, USA Kai Yu NEC Laboratories America, Cupertino, California, USA Joint work with Jinjun Wang, Fengjun Lv, Wei Xu, Yihong Gong Xi Zhou, Jianchao Yang, Thomas Huang, Tong Zhang Chen Wu NEC Laboratories America

More information

CSC420: Intro to Image Understanding Introduction

CSC420: Intro to Image Understanding Introduction CSC420: Intro to Image Understanding Introduction Sanja Fidler January 3, 2017 Sanja Fidler Intro to Image Understanding 1/65 The Team Instructor: Sanja Fidler (fidler@cs.toronto.edu) O ce: DH3094 O ce

More information

Given a feature in I 1, how to find the best match in I 2?

Given a feature in I 1, how to find the best match in I 2? Feature Matching 1 Feature matching Given a feature in I 1, how to find the best match in I 2? 1. Define distance function that compares two descriptors 2. Test all the features in I 2, find the one with

More information

Local Features (contd.)

Local Features (contd.) Motivation Local Features (contd.) Readings: Mikolajczyk and Schmid; F&P Ch 10 Feature points are used also or: Image alignment (homography, undamental matrix) 3D reconstruction Motion tracking Object

More information

Blobs & Scale Invariance

Blobs & Scale Invariance Blobs & Scale Invariance Prof. Didier Stricker Doz. Gabriele Bleser Computer Vision: Object and People Tracking With slides from Bebis, S. Lazebnik & S. Seitz, D. Lowe, A. Efros 1 Apertizer: some videos

More information

Large-Margin Thresholded Ensembles for Ordinal Regression

Large-Margin Thresholded Ensembles for Ordinal Regression Large-Margin Thresholded Ensembles for Ordinal Regression Hsuan-Tien Lin and Ling Li Learning Systems Group, California Institute of Technology, U.S.A. Conf. on Algorithmic Learning Theory, October 9,

More information

Multimedia analysis and retrieval

Multimedia analysis and retrieval Multimedia analysis and retrieval from the masses, and for the masses Lexing Xie IBM T J Watson Research Center xlx@us.ibm.com ICIP-MIR workshop, Oct 12, 2008 2008 IBM Research multimedia information analysis

More information

Statistical Filters for Crowd Image Analysis

Statistical Filters for Crowd Image Analysis Statistical Filters for Crowd Image Analysis Ákos Utasi, Ákos Kiss and Tamás Szirányi Distributed Events Analysis Research Group, Computer and Automation Research Institute H-1111 Budapest, Kende utca

More information