Nearest Neighbor Searching Under Uncertainty. Wuzhou Zhang Supervised by Pankaj K. Agarwal Department of Computer Science Duke University

Size: px
Start display at page:

Download "Nearest Neighbor Searching Under Uncertainty. Wuzhou Zhang Supervised by Pankaj K. Agarwal Department of Computer Science Duke University"

Transcription

1 Nearest Neighbor Searching Under Uncertainty Wuzhou Zhang Supervised by Pankaj K. Agarwal Department of Computer Science Duke University

2 Nearest Neighbor Searching (NNS) S: a set of n points in R. q: any query point R. Find the closest point p S to q. S q p Applications Pattern Recognition, Data Compression Statistical Classification, Clustering Databases, Information Retrieval Computer Vision, etc.

3 Nearest Neighbor Searching Under Uncertainty S = {P 1,P,,P n }, a set of uncertain points in R. P i, represented by a probability density function (pdf). Discrete pdf 0. P i = {p i,1,p i,,,p i,k } R Continuous pdf Gaussian Distribution

4 Nearest Neighbor In Expectation q: query point Expected distance from q to P i : Ed(q, P i )= k w i,j d(q, p i,j ). j=1 Problem: find its expected nearest neighbor P, Ed(q, P )=min P S Ed(q, P).

5 Bisector In Case Of Gaussian For Gaussian distribution, bisector is a line! x µ1 f(x, y) = 1 πσ 1 σ 1 ρ e Hard to get explicit formula! 1 (1 ρ ) σ 1 ρ x µ1 σ 1 y µ y µ σ + σ = + + f (x, y) = f(x, y) qx x +(qy y) dxdy f (x, y) qx x +(qy y) dxdy πσ 1 σ 1 x µ ρ 1 e 1 (1 ρ ) σ x µ 1 y µ + y µ 1 σ 1 σ σ 1 ρ Figure:

6 Squared Distance Function If we use (q x x) +(q y y) instead of (q x x) +(q y y) bisector is simple and beautiful! (µ 1 µ 1 )q x + (µ µ )q y + µ 1 + µ µ 1 µ + σ 1 + σ σ 1 σ =0 In case of discrete pdf, P i = {p i,1,p i,,,p i,k } R bisector is also a line! In both cases, compute the Voronoi diagram, solve it optimally! However, not a metric!

7 Sampling Continuous Distributions Sometimes working on continuous distributions is hard. For example, suppose a uniform distribution is given by 1 (b a)(d c) for (x, y) [a, b] [c, d] f(x, y) = 0 otherwise In L 1 metric, given ε r, # of points to be sampled: k 1 ε r. Generalization in R d, k ( 1 ε r ) d. Lower bounds on other metrics and distributions are also possible. Let s focus on discrete pdf then.

8 Expected Nearest Neighbor In L1 Metric (Manhattan metric) P i = {p i,1,p i,,,p i,k } R k Ed(q, P i )= w i,j d(q, p i,j )= j=1 k j=1 Naive algorithm: O(nk) space, O(nk) query time. w i,j x q x pi,j + y q y pi,j x q + x pi,j + y q y pi,j x q x pi,j + y q y pi,j p i,j f i (x q,y q ) is linear! x q x pi,j y q + y pi,j x q + x pi,j y q + y pi,j # of regions for P i : O(k ). total : O(nk )

9 Expected Nearest Neighbor In L1 Metric ( cont. ) f i (x q,y q ) x q y q Source: Range Searching on Uncertain Data [P.K.Agarwal et al. 009]

10 Geometric Reduction U = x 1 <x < <x u,u nk, the set of xpi,j. V = y 1 <y < <y v,v nk, the set of ypi,j. R = {r 1,r,,r t },t= O(nk ) the set of rectangles in the xy-projections of f i (x q,y q ), P i. ϕ m, the plane that contains the rectangular piece of f i (x q,y q ), whose projection is r m R. Associate P i with r m and ϕ m. Given q =(x q,y q ), among all r m R that contain q, we wish to report the one for which ϕ m lies lowest.

11 Building Block: Half- Space Intersection and Convex Hulls Upper hulls correspond to lower envelopes, an example in D Source: page 5 53, Computational Geometry: Algorithms and Applications, 3rd Edition[Mark de Berg et al. ]

12 Segment- tree Based Data Structures for Expected- NN In L1 Metric U = x 1 <x < <x u,u nk a family I of O(nk) canonical intervals.

13 Segment- tree Based Data Structures for Expected- NN In L1 Metric ( cont. ) Each rectangle r m R can be partitioned into a set C[r m ] of O(log (nk)) canonical rectangles. For each C I I, Φ C = {ϕ m C C[r m ],r m R} C Φ C = O nk log (nk) For each non-empty Φ C, we compute the lower envelope LE C, store its xy-projection S C ( planar map ), and associate ϕ m with corresponding face f S C. C S C = O nk log (nk). Expected preprocessing time: O nk log 3 (nk).

14 Segment- tree Based Data Structures for Expected- NN In L1 Metric ( cont. ) Given q =(x q,y q ), first find I xq I, I yq I, O log(nk) canonical intervals each. For each C in I xq I y q, do the point location query on S C. Query time: O log 3 (nk) Size of data structure Preprocessing time Query time O nk log (nk) O O nk log 3 log 3 (nk) (nk) Summary of the result take O k as a constant O n log n n log 3 n O log 3 n

15 Approximate L Metric For any polygon P that contains the origin, define d P : R R R + as d P (a, b) =min{λ b λp + a} It s a metric when P is centrally symmetric! m =/ δ, P the regular m-gon of circumscribing radius 1, centered at the origin. d P (a, b) (1 + δ)d(a, b). d P (a, b) = min 1 i m d i (a, b), sinced i (a, b) isfiniteiff b C i + a

16 Approximate L Metric ( cont. ) More complex!

17 Future Work Approximate the expected NN in L metric Work harder in the near future! Study the complexity of expected Voronoi diagram Study the probability case

18 Thanks! Main References: [1] Pankaj K. Agarwal, Siu-Wing Cheng, Yufei Tao, Ke Yi: Indexing uncertain data. PODS 009: [] Pankaj K. Agarwal, Lars Arge, Jeff Erickson: Indexing Moving Points. J. Comput. Syst. Sci. 66(1): (003)

Nearest-Neighbor Searching Under Uncertainty

Nearest-Neighbor Searching Under Uncertainty Nearest-Neighbor Searching Under Uncertainty Wuzhou Zhang Joint work with Pankaj K. Agarwal, Alon Efrat, and Swaminathan Sankararaman. To appear in PODS 2012. Nearest-Neighbor Searching S: a set of n points

More information

The τ-skyline for Uncertain Data

The τ-skyline for Uncertain Data CCCG 2014, Halifax, Nova Scotia, August 11 13, 2014 The τ-skyline for Uncertain Data Haitao Wang Wuzhou Zhang Abstract In this paper, we introduce the notion of τ-skyline as an alternative representation

More information

Nearest-Neighbor Searching Under Uncertainty

Nearest-Neighbor Searching Under Uncertainty Nearest-Neighbor Searching Under Uncertainty Pankaj K. Agarwal Department of Computer Science Duke University pankaj@cs.duke.edu Alon Efrat Department of Computer Science The University of Arizona alon@cs.arizona.edu

More information

Geometric Computing over Uncertain Data

Geometric Computing over Uncertain Data Geometric Computing over Uncertain Data by Wuzhou Zhang Department of Computer Science Duke University Date: Approved: Pankaj K. Agarwal, Supervisor Sayan Mukherjee Kamesh Munagala Jun Yang Dissertation

More information

Geometric Computing over Uncertain Data

Geometric Computing over Uncertain Data Geometric Computing over Uncertain Data by Wuzhou Zhang Department of Computer Science Duke University Date: Approved: Pankaj K. Agarwal, Supervisor Sayan Mukherjee Kamesh Munagala Jun Yang Dissertation

More information

k th -order Voronoi Diagrams

k th -order Voronoi Diagrams k th -order Voronoi Diagrams References: D.-T. Lee, On k-nearest neighbor Voronoi Diagrams in the plane, IEEE Transactions on Computers, Vol. 31, No. 6, pp. 478 487, 1982. B. Chazelle and H. Edelsbrunner,

More information

Optimal Data-Dependent Hashing for Approximate Near Neighbors

Optimal Data-Dependent Hashing for Approximate Near Neighbors Optimal Data-Dependent Hashing for Approximate Near Neighbors Alexandr Andoni 1 Ilya Razenshteyn 2 1 Simons Institute 2 MIT, CSAIL April 20, 2015 1 / 30 Nearest Neighbor Search (NNS) Let P be an n-point

More information

Approximate Voronoi Diagrams

Approximate Voronoi Diagrams CS468, Mon. Oct. 30 th, 2006 Approximate Voronoi Diagrams Presentation by Maks Ovsjanikov S. Har-Peled s notes, Chapters 6 and 7 1-1 Outline Preliminaries Problem Statement ANN using PLEB } Bounds and

More information

On the Most Likely Voronoi Diagram and Nearest Neighbor Searching?

On the Most Likely Voronoi Diagram and Nearest Neighbor Searching? On the Most Likely Voronoi Diagram and Nearest Neighbor Searching? Subhash Suri and Kevin Verbeek Department of Computer Science, University of California, Santa Barbara, USA. Abstract. We consider the

More information

Nearest Neighbor Search with Keywords

Nearest Neighbor Search with Keywords Nearest Neighbor Search with Keywords Yufei Tao KAIST June 3, 2013 In recent years, many search engines have started to support queries that combine keyword search with geography-related predicates (e.g.,

More information

Lecture 23: Hausdorff and Fréchet distance

Lecture 23: Hausdorff and Fréchet distance CPS296.2 Geometric Optimization 05 April, 2007 Lecture 23: Hausdorff and Fréchet distance Lecturer: Pankaj K. Agarwal Scribe: Nihshanka Debroy 23.1 Introduction Shape matching is an important area of research

More information

Nearest Neighbor Search with Keywords in Spatial Databases

Nearest Neighbor Search with Keywords in Spatial Databases 776 Nearest Neighbor Search with Keywords in Spatial Databases 1 Sphurti S. Sao, 2 Dr. Rahila Sheikh 1 M. Tech Student IV Sem, Dept of CSE, RCERT Chandrapur, MH, India 2 Head of Department, Dept of CSE,

More information

Notion of Distance. Metric Distance Binary Vector Distances Tangent Distance

Notion of Distance. Metric Distance Binary Vector Distances Tangent Distance Notion of Distance Metric Distance Binary Vector Distances Tangent Distance Distance Measures Many pattern recognition/data mining techniques are based on similarity measures between objects e.g., nearest-neighbor

More information

Efficient geometric algorithms for preference top-k queries, stochastic line arrangements, and proximity problems

Efficient geometric algorithms for preference top-k queries, stochastic line arrangements, and proximity problems Efficient geometric algorithms for preference top-k queries, stochastic line arrangements, and proximity problems A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA

More information

Semi Supervised Distance Metric Learning

Semi Supervised Distance Metric Learning Semi Supervised Distance Metric Learning wliu@ee.columbia.edu Outline Background Related Work Learning Framework Collaborative Image Retrieval Future Research Background Euclidean distance d( x, x ) =

More information

Tailored Bregman Ball Trees for Effective Nearest Neighbors

Tailored Bregman Ball Trees for Effective Nearest Neighbors Tailored Bregman Ball Trees for Effective Nearest Neighbors Frank Nielsen 1 Paolo Piro 2 Michel Barlaud 2 1 Ecole Polytechnique, LIX, Palaiseau, France 2 CNRS / University of Nice-Sophia Antipolis, Sophia

More information

Integration. Tuesday, December 3, 13

Integration. Tuesday, December 3, 13 4 Integration 4.3 Riemann Sums and Definite Integrals Objectives n Understand the definition of a Riemann sum. n Evaluate a definite integral using properties of definite integrals. 3 Riemann Sums 4 Riemann

More information

Geometry of Similarity Search

Geometry of Similarity Search Geometry of Similarity Search Alex Andoni (Columbia University) Find pairs of similar images how should we measure similarity? Naïvely: about n 2 comparisons Can we do better? 2 Measuring similarity 000000

More information

Approximating the Minimum Closest Pair Distance and Nearest Neighbor Distances of Linearly Moving Points

Approximating the Minimum Closest Pair Distance and Nearest Neighbor Distances of Linearly Moving Points Approximating the Minimum Closest Pair Distance and Nearest Neighbor Distances of Linearly Moving Points Timothy M. Chan Zahed Rahmati Abstract Given a set of n moving points in R d, where each point moves

More information

Computing a Minimum-Dilation Spanning Tree is NP-hard

Computing a Minimum-Dilation Spanning Tree is NP-hard Computing a Minimum-Dilation Spanning Tree is NP-hard Otfried Cheong 1 Herman Haverkort 2 Mira Lee 1 October 0, 2008 Abstract In a geometric network G = (S, E), the graph distance between two vertices

More information

Proximity problems in high dimensions

Proximity problems in high dimensions Proximity problems in high dimensions Ioannis Psarros National & Kapodistrian University of Athens March 31, 2017 Ioannis Psarros Proximity problems in high dimensions March 31, 2017 1 / 43 Problem definition

More information

C.M. Liu Perceptual Signal Processing Lab College of Computer Science National Chiao-Tung University

C.M. Liu Perceptual Signal Processing Lab College of Computer Science National Chiao-Tung University Quantization C.M. Liu Perceptual Signal Processing Lab College of Computer Science National Chiao-Tung University http://www.csie.nctu.edu.tw/~cmliu/courses/compression/ Office: EC538 (03)5731877 cmliu@cs.nctu.edu.tw

More information

The 2-Center Problem in Three Dimensions

The 2-Center Problem in Three Dimensions The 2-Center Problem in Three Dimensions Pankaj K. Agarwal Rinat Ben Avraham Micha Sharir Abstract Let P be a set of n points in R 3. The 2-center problem for P is to find two congruent balls of minimum

More information

Machine Learning. CUNY Graduate Center, Spring Lectures 11-12: Unsupervised Learning 1. Professor Liang Huang.

Machine Learning. CUNY Graduate Center, Spring Lectures 11-12: Unsupervised Learning 1. Professor Liang Huang. Machine Learning CUNY Graduate Center, Spring 2013 Lectures 11-12: Unsupervised Learning 1 (Clustering: k-means, EM, mixture models) Professor Liang Huang huang@cs.qc.cuny.edu http://acl.cs.qc.edu/~lhuang/teaching/machine-learning

More information

Algorithms for Calculating Statistical Properties on Moving Points

Algorithms for Calculating Statistical Properties on Moving Points Algorithms for Calculating Statistical Properties on Moving Points Dissertation Proposal Sorelle Friedler Committee: David Mount (Chair), William Gasarch Samir Khuller, Amitabh Varshney January 14, 2009

More information

Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann

Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya anki, Bojana Dimcheva, Donald Kossmann Systems Group ETH Zurich Moving Objects Intersection Finding Position at a future

More information

Algorithms for Nearest Neighbors

Algorithms for Nearest Neighbors Algorithms for Nearest Neighbors Background and Two Challenges Yury Lifshits Steklov Institute of Mathematics at St.Petersburg http://logic.pdmi.ras.ru/~yura McGill University, July 2007 1 / 29 Outline

More information

Lecture 14 October 16, 2014

Lecture 14 October 16, 2014 CS 224: Advanced Algorithms Fall 2014 Prof. Jelani Nelson Lecture 14 October 16, 2014 Scribe: Jao-ke Chin-Lee 1 Overview In the last lecture we covered learning topic models with guest lecturer Rong Ge.

More information

arxiv:cs/ v1 [cs.cg] 7 Feb 2006

arxiv:cs/ v1 [cs.cg] 7 Feb 2006 Approximate Weighted Farthest Neighbors and Minimum Dilation Stars John Augustine, David Eppstein, and Kevin A. Wortman Computer Science Department University of California, Irvine Irvine, CA 92697, USA

More information

Fly Cheaply: On the Minimum Fuel Consumption Problem

Fly Cheaply: On the Minimum Fuel Consumption Problem Journal of Algorithms 41, 330 337 (2001) doi:10.1006/jagm.2001.1189, available online at http://www.idealibrary.com on Fly Cheaply: On the Minimum Fuel Consumption Problem Timothy M. Chan Department of

More information

Lecture 9: Random sampling, ɛ-approximation and ɛ-nets

Lecture 9: Random sampling, ɛ-approximation and ɛ-nets CPS296.2 Geometric Optimization February 13, 2007 Lecture 9: Random sampling, -approximation and -nets Lecturer: Pankaj K. Agarwal Scribe: Shashidhara K. Ganjugunte In the next few lectures we will be

More information

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 13 Indexes for Multimedia Data 13 Indexes for Multimedia

More information

Geometric Problems in Moderate Dimensions

Geometric Problems in Moderate Dimensions Geometric Problems in Moderate Dimensions Timothy Chan UIUC Basic Problems in Comp. Geometry Orthogonal range search preprocess n points in R d s.t. we can detect, or count, or report points inside a query

More information

Compressing Kinetic Data From Sensor Networks. Sorelle A. Friedler (Swat 04) Joint work with David Mount University of Maryland, College Park

Compressing Kinetic Data From Sensor Networks. Sorelle A. Friedler (Swat 04) Joint work with David Mount University of Maryland, College Park Compressing Kinetic Data From Sensor Networks Sorelle A. Friedler (Swat 04) Joint work with David Mount University of Maryland, College Park Motivation Motivation Computer Science Graphics: Image and video

More information

Linear Classification: Linear Programming

Linear Classification: Linear Programming Linear Classification: Linear Programming Yufei Tao Department of Computer Science and Engineering Chinese University of Hong Kong 1 / 21 Y Tao Linear Classification: Linear Programming Recall the definition

More information

A Kernel on Persistence Diagrams for Machine Learning

A Kernel on Persistence Diagrams for Machine Learning A Kernel on Persistence Diagrams for Machine Learning Jan Reininghaus 1 Stefan Huber 1 Roland Kwitt 2 Ulrich Bauer 1 1 Institute of Science and Technology Austria 2 FB Computer Science Universität Salzburg,

More information

Multimedia Databases 1/29/ Indexes for Multimedia Data Indexes for Multimedia Data Indexes for Multimedia Data

Multimedia Databases 1/29/ Indexes for Multimedia Data Indexes for Multimedia Data Indexes for Multimedia Data 1/29/2010 13 Indexes for Multimedia Data 13 Indexes for Multimedia Data 13.1 R-Trees Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

More information

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Cheng Soon Ong & Christian Walder. Canberra February June 2018 Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 218 Outlines Overview Introduction Linear Algebra Probability Linear Regression 1

More information

Probabilistic Nearest-Neighbor Query on Uncertain Objects

Probabilistic Nearest-Neighbor Query on Uncertain Objects Probabilistic Nearest-Neighbor Query on Uncertain Objects Hans-Peter Kriegel, Peter Kunath, Matthias Renz University of Munich, Germany {kriegel, kunath, renz}@dbs.ifi.lmu.de Abstract. Nearest-neighbor

More information

Computational Geometry

Computational Geometry Voronoi diagrams Voronoi diagram Given some trees, seen from above, which region will they occupy? Voronoi diagram Given some trees, seen from above, which region will they occupy? Voronoi diagram Given

More information

Facility location and the geometric minimum-diameter spanning tree

Facility location and the geometric minimum-diameter spanning tree Chapter 6 Facility location and the geometric minimum-diameter spanning tree Previously published as: Joachim Gudmundsson, Herman J. Haverkort, Sang- Min Park, Chan-Su Shin and Alexander Wolff: Facility

More information

High Dimensional Geometry, Curse of Dimensionality, Dimension Reduction

High Dimensional Geometry, Curse of Dimensionality, Dimension Reduction Chapter 11 High Dimensional Geometry, Curse of Dimensionality, Dimension Reduction High-dimensional vectors are ubiquitous in applications (gene expression data, set of movies watched by Netflix customer,

More information

Euclidean Shortest Path Algorithm for Spherical Obstacles

Euclidean Shortest Path Algorithm for Spherical Obstacles Euclidean Shortest Path Algorithm for Spherical Obstacles Fajie Li 1, Gisela Klette 2, and Reinhard Klette 3 1 College of Computer Science and Technology, Huaqiao University Xiamen, Fujian, China 2 School

More information

Privacy-Preserving Data Mining

Privacy-Preserving Data Mining CS 380S Privacy-Preserving Data Mining Vitaly Shmatikov slide 1 Reading Assignment Evfimievski, Gehrke, Srikant. Limiting Privacy Breaches in Privacy-Preserving Data Mining (PODS 2003). Blum, Dwork, McSherry,

More information

Making Nearest Neighbors Easier. Restrictions on Input Algorithms for Nearest Neighbor Search: Lecture 4. Outline. Chapter XI

Making Nearest Neighbors Easier. Restrictions on Input Algorithms for Nearest Neighbor Search: Lecture 4. Outline. Chapter XI Restrictions on Input Algorithms for Nearest Neighbor Search: Lecture 4 Yury Lifshits http://yury.name Steklov Institute of Mathematics at St.Petersburg California Institute of Technology Making Nearest

More information

Geometric Approximation via Coresets

Geometric Approximation via Coresets Geometric Approximation via Coresets Pankaj K. Agarwal Sariel Har-Peled Kasturi R. Varadarajan October 22, 2004 Abstract The paradigm of coresets has recently emerged as a powerful tool for efficiently

More information

OWELL WEEKLY JOURNAL

OWELL WEEKLY JOURNAL Y \»< - } Y Y Y & #»»» q ] q»»»>) & - - - } ) x ( - { Y» & ( x - (» & )< - Y X - & Q Q» 3 - x Q Y 6 \Y > Y Y X 3 3-9 33 x - - / - -»- --

More information

Geometric View of Machine Learning Nearest Neighbor Classification. Slides adapted from Prof. Carpuat

Geometric View of Machine Learning Nearest Neighbor Classification. Slides adapted from Prof. Carpuat Geometric View of Machine Learning Nearest Neighbor Classification Slides adapted from Prof. Carpuat What we know so far Decision Trees What is a decision tree, and how to induce it from data Fundamental

More information

Machine Learning. Nonparametric Methods. Space of ML Problems. Todo. Histograms. Instance-Based Learning (aka non-parametric methods)

Machine Learning. Nonparametric Methods. Space of ML Problems. Todo. Histograms. Instance-Based Learning (aka non-parametric methods) Machine Learning InstanceBased Learning (aka nonparametric methods) Supervised Learning Unsupervised Learning Reinforcement Learning Parametric Non parametric CSE 446 Machine Learning Daniel Weld March

More information

Carnegie Mellon Univ. Dept. of Computer Science Database Applications. SAMs - Detailed outline. Spatial Access Methods - problem

Carnegie Mellon Univ. Dept. of Computer Science Database Applications. SAMs - Detailed outline. Spatial Access Methods - problem Carnegie Mellon Univ. Dept. of Computer Science 15-415 - Database Applications Lecture #26: Spatial Databases (R&G ch. 28) SAMs - Detailed outline spatial access methods problem dfn R-trees Faloutsos 2

More information

Contour Trees of Uncertain Terrains

Contour Trees of Uncertain Terrains Contour Trees of Uncertain Terrains Pankaj K. Agarwal Duke University Sayan Mukherjee Duke University Wuzhou Zhang Duke University ABSTRACT We study contour trees of terrains, which encode the topological

More information

Kaggle.

Kaggle. Administrivia Mini-project 2 due April 7, in class implement multi-class reductions, naive bayes, kernel perceptron, multi-class logistic regression and two layer neural networks training set: Project

More information

Distance Metric Learning

Distance Metric Learning Distance Metric Learning Technical University of Munich Department of Informatics Computer Vision Group November 11, 2016 M.Sc. John Chiotellis: Distance Metric Learning 1 / 36 Outline Computer Vision

More information

A Simple Example Binary Hypothesis Testing Optimal Receiver Frontend M-ary Signal Sets Message Sequences. possible signals has been transmitted.

A Simple Example Binary Hypothesis Testing Optimal Receiver Frontend M-ary Signal Sets Message Sequences. possible signals has been transmitted. Introduction I We have focused on the problem of deciding which of two possible signals has been transmitted. I Binary Signal Sets I We will generalize the design of optimum (MPE) receivers to signal sets

More information

Day 3: Classification, logistic regression

Day 3: Classification, logistic regression Day 3: Classification, logistic regression Introduction to Machine Learning Summer School June 18, 2018 - June 29, 2018, Chicago Instructor: Suriya Gunasekar, TTI Chicago 20 June 2018 Topics so far Supervised

More information

Linear Classification: Linear Programming

Linear Classification: Linear Programming Yufei Tao Department of Computer Science and Engineering Chinese University of Hong Kong Recall the definition of linear classification. Definition 1. Let R d denote the d-dimensional space where the domain

More information

Multi-Dimensional Online Tracking

Multi-Dimensional Online Tracking Multi-Dimensional Online Tracking Ke Yi and Qin Zhang Hong Kong University of Science & Technology SODA 2009 January 4-6, 2009 1-1 A natural problem Bob: tracker f(t) g(t) Alice: observer (t, g(t)) t 2-1

More information

Metric-based classifiers. Nuno Vasconcelos UCSD

Metric-based classifiers. Nuno Vasconcelos UCSD Metric-based classifiers Nuno Vasconcelos UCSD Statistical learning goal: given a function f. y f and a collection of eample data-points, learn what the function f. is. this is called training. two major

More information

(Approximate) Uncertain Skylines

(Approximate) Uncertain Skylines (Approximate) Uncertain Skylines Peyman Afshani Faculty of Computer Science Dalhousie University Kasper Green Larsen MADALGO & Department of Computer Science Aarhus Univesity Pankaj K. Agarwal Department

More information

Geometric Computations on Indecisive and Uncertain Points

Geometric Computations on Indecisive and Uncertain Points Geometric Computations on Indecisive and Uncertain Points Allan Jørgensen Maarten Löffler Jeff M. Phillips Abstract We study computing geometric problems on uncertain points. An uncertain point is a point

More information

Semantics of Ranking Queries for Probabilistic Data and Expected Ranks

Semantics of Ranking Queries for Probabilistic Data and Expected Ranks Semantics of Ranking Queries for Probabilistic Data and Expected Ranks Graham Cormode AT&T Labs Feifei Li FSU Ke Yi HKUST 1-1 Uncertain, uncertain, uncertain... (Probabilistic, probabilistic, probabilistic...)

More information

Convex Hulls under Uncertainty

Convex Hulls under Uncertainty Convex Hulls under Uncertainty Pankaj K. Agarwal Sariel Har-Peled Subhash Suri Hakan Yıldız Wuzhou Zhang March 3, 2015 Abstract We study the convex-hull problem in a probabilistic setting, motivated by

More information

Gaussian Mixture Distance for Information Retrieval

Gaussian Mixture Distance for Information Retrieval Gaussian Mixture Distance for Information Retrieval X.Q. Li and I. King fxqli, ingg@cse.cuh.edu.h Department of omputer Science & Engineering The hinese University of Hong Kong Shatin, New Territories,

More information

Clustering. CSL465/603 - Fall 2016 Narayanan C Krishnan

Clustering. CSL465/603 - Fall 2016 Narayanan C Krishnan Clustering CSL465/603 - Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Supervised vs Unsupervised Learning Supervised learning Given x ", y " "%& ', learn a function f: X Y Categorical output classification

More information

(Approximate) Uncertain Skylines

(Approximate) Uncertain Skylines (Approximate) Uncertain Skylines Peyman Afshani Faculty of Computer Science Dalhousie University Kasper Green Larsen MADALGO & Department of Computer Science Aarhus Univesity Pankaj K. Agarwal Department

More information

Union of Random Minkowski Sums and Network Vulnerability Analysis

Union of Random Minkowski Sums and Network Vulnerability Analysis Union of Random Minkowski Sums and Network Vulnerability Analysis Pankaj K. Agarwal Sariel Har-Peled Haim Kaplan Micha Sharir June 29, 2014 Abstract Let C = {C 1,..., C n } be a set of n pairwise-disjoint

More information

Computing a Minimum-Dilation Spanning Tree is NP-hard

Computing a Minimum-Dilation Spanning Tree is NP-hard Computing a Minimum-Dilation Spanning Tree is NP-hard Otfried Cheong Herman Haverkort Mira Lee Dept. of Computer Science, Korea Advanced Institute of Science & Technology, Daejeon, South Korea. Email:

More information

Space-Time Tradeoffs for Approximate Nearest Neighbor Searching

Space-Time Tradeoffs for Approximate Nearest Neighbor Searching 1 Space-Time Tradeoffs for Approximate Nearest Neighbor Searching SUNIL ARYA Hong Kong University of Science and Technology, Kowloon, Hong Kong, China THEOCHARIS MALAMATOS University of Peloponnese, Tripoli,

More information

Fast algorithms for collision and proximity problems involving moving geometric objects

Fast algorithms for collision and proximity problems involving moving geometric objects Fast algorithms for collision and proximity problems involving moving geometric objects Prosenjit Gupta Ravi Janardan Michiel Smid August 22, 1995 Abstract Consider a set of geometric objects, such as

More information

Lecture 8: Interest Point Detection. Saad J Bedros

Lecture 8: Interest Point Detection. Saad J Bedros #1 Lecture 8: Interest Point Detection Saad J Bedros sbedros@umn.edu Last Lecture : Edge Detection Preprocessing of image is desired to eliminate or at least minimize noise effects There is always tradeoff

More information

On Regular Vertices of the Union of Planar Convex Objects

On Regular Vertices of the Union of Planar Convex Objects On Regular Vertices of the Union of Planar Convex Objects Esther Ezra János Pach Micha Sharir Abstract Let C be a collection of n compact convex sets in the plane, such that the boundaries of any pair

More information

l 2 b 2 A ( φ) θ/2 b 1 j p k

l 2 b 2 A ( φ) θ/2 b 1 j p k Computing the Angularity Tolerance Mark de Berg y Henk Meijer z Mark Overmars y Gordon Wilfong x July 30, 1996 Abstract In computational metrology one needs to compute whether an object satises specications

More information

Divide-and-Conquer. Reading: CLRS Sections 2.3, 4.1, 4.2, 4.3, 28.2, CSE 6331 Algorithms Steve Lai

Divide-and-Conquer. Reading: CLRS Sections 2.3, 4.1, 4.2, 4.3, 28.2, CSE 6331 Algorithms Steve Lai Divide-and-Conquer Reading: CLRS Sections 2.3, 4.1, 4.2, 4.3, 28.2, 33.4. CSE 6331 Algorithms Steve Lai Divide and Conquer Given an instance x of a prolem, the method works as follows: divide-and-conquer

More information

High-Dimensional Indexing by Distributed Aggregation

High-Dimensional Indexing by Distributed Aggregation High-Dimensional Indexing by Distributed Aggregation Yufei Tao ITEE University of Queensland In this lecture, we will learn a new approach for indexing high-dimensional points. The approach borrows ideas

More information

Spatial Database. Ahmad Alhilal, Dimitris Tsaras

Spatial Database. Ahmad Alhilal, Dimitris Tsaras Spatial Database Ahmad Alhilal, Dimitris Tsaras Content What is Spatial DB Modeling Spatial DB Spatial Queries The R-tree Range Query NN Query Aggregation Query RNN Query NN Queries with Validity Information

More information

University of Florida CISE department Gator Engineering. Clustering Part 1

University of Florida CISE department Gator Engineering. Clustering Part 1 Clustering Part 1 Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville What is Cluster Analysis? Finding groups of objects such that the objects

More information

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 4 Spatial Point Patterns Definition Set of point locations with recorded events" within study

More information

Improved Submatrix Maximum Queries in Monge Matrices

Improved Submatrix Maximum Queries in Monge Matrices Improved Submatrix Maximum Queries in Monge Matrices Pawe l Gawrychowski 1, Shay Mozes 2, and Oren Weimann 3 1 MPII, gawry@mpi-inf.mpg.de 2 IDC Herzliya, smozes@idc.ac.il 3 University of Haifa, oren@cs.haifa.ac.il

More information

Example 1: Finding angle measures: I ll do one: We ll do one together: You try one: ML and MN are tangent to circle O. Find the value of x

Example 1: Finding angle measures: I ll do one: We ll do one together: You try one: ML and MN are tangent to circle O. Find the value of x Ch 1: Circles 1 1 Tangent Lines 1 Chords and Arcs 1 3 Inscribed Angles 1 4 Angle Measures and Segment Lengths 1 5 Circles in the coordinate plane 1 1 Tangent Lines Focused Learning Target: I will be able

More information

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 5 Topic Overview 1) Introduction/Unvariate Statistics 2) Bootstrapping/Monte Carlo Simulation/Kernel

More information

Manifold Coarse Graining for Online Semi-supervised Learning

Manifold Coarse Graining for Online Semi-supervised Learning for Online Semi-supervised Learning Mehrdad Farajtabar, Amirreza Shaban, Hamid R. Rabiee, Mohammad H. Rohban Digital Media Lab, Department of Computer Engineering, Sharif University of Technology, Tehran,

More information

Improved Pointer Machine and I/O Lower Bounds for Simplex Range Reporting and Related Problems

Improved Pointer Machine and I/O Lower Bounds for Simplex Range Reporting and Related Problems Improved Pointer Machine and I/O Lower Bounds for Simplex Range Reporting and Related Problems Peyman Afshani MADALGO Aarhus University peyman@cs.au.dk Abstract We investigate one of the fundamental areas

More information

Optimal compression of approximate Euclidean distances

Optimal compression of approximate Euclidean distances Optimal compression of approximate Euclidean distances Noga Alon 1 Bo az Klartag 2 Abstract Let X be a set of n points of norm at most 1 in the Euclidean space R k, and suppose ε > 0. An ε-distance sketch

More information

Principles of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata

Principles of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata Principles of Pattern Recognition C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata e-mail: murthy@isical.ac.in Pattern Recognition Measurement Space > Feature Space >Decision

More information

CATEGORIZATION GENERATED BY PROTOTYPES AN AXIOMATIC APPROACH

CATEGORIZATION GENERATED BY PROTOTYPES AN AXIOMATIC APPROACH CATEGORIZATION GENERATED BY PROTOTYPES AN AXIOMATIC APPROACH Yaron Azrieli and Ehud Lehrer Tel-Aviv University CATEGORIZATION GENERATED BY PROTOTYPES AN AXIOMATIC APPROACH p. 1/3 Outline Categorization

More information

Application of the Cross-Entropy Method to Clustering and Vector Quantization

Application of the Cross-Entropy Method to Clustering and Vector Quantization Application of the Cross-Entropy Method to Clustering and Vector Quantization Dirk P. Kroese and Reuven Y. Rubinstein and Thomas Taimre Faculty of Industrial Engineering and Management, Technion, Haifa,

More information

Decidability of consistency of function and derivative information for a triangle and a convex quadrilateral

Decidability of consistency of function and derivative information for a triangle and a convex quadrilateral Decidability of consistency of function and derivative information for a triangle and a convex quadrilateral Abbas Edalat Department of Computing Imperial College London Abstract Given a triangle in the

More information

Geometric Facility Location Problems on Uncertain Data

Geometric Facility Location Problems on Uncertain Data Utah State University DigitalCommons@USU All Graduate Theses and Dissertations Graduate Studies 8-2017 Geometric Facility Location Problems on Uncertain Data Jingru Zhang Utah State University Follow this

More information

On Solving Coverage Problems in a Wireless Sensor Network Using Voronoi Diagrams

On Solving Coverage Problems in a Wireless Sensor Network Using Voronoi Diagrams On Solving Coverage Problems in a Wireless Sensor Network Using Voronoi Diagrams Anthony Man Cho So 1 and Yinyu Ye 2 1 Department of Computer Science, Stanford University, Stanford, CA 94305, USA. manchoso@cs.stanford.edu

More information

Interpolation via Barycentric Coordinates

Interpolation via Barycentric Coordinates Interpolation via Barycentric Coordinates Pierre Alliez Inria Some material from D. Anisimov Outline Barycenter Convexity Barycentric coordinates For Simplices For point sets Inverse distance (Shepard)

More information

Implicit Differentiation

Implicit Differentiation Implicit Differentiation Sometimes we have a curve in the plane defined by a function Fxy= (, ) 0 (1) that involves both x and y, possibly in complicated ways. Examples. We show below the graphs of the

More information

Slide a window along the input arc sequence S. Least-squares estimate. σ 2. σ Estimate 1. Statistically test the difference between θ 1 and θ 2

Slide a window along the input arc sequence S. Least-squares estimate. σ 2. σ Estimate 1. Statistically test the difference between θ 1 and θ 2 Corner Detection 2D Image Features Corners are important two dimensional features. Two dimensional image features are interesting local structures. They include junctions of dierent types Slide 3 They

More information

Computational Genomics

Computational Genomics Computational Genomics http://www.cs.cmu.edu/~02710 Introduction to probability, statistics and algorithms (brief) intro to probability Basic notations Random variable - referring to an element / event

More information

Separating Bichromatic Point Sets by L-shapes

Separating Bichromatic Point Sets by L-shapes Separating Bichromatic Point Sets by L-shapes Farnaz Sheikhi Ali Mohades Mark de Berg Mansoor Davoodi August 28, 2013 Abstract Given a set R of red points and a set B of blue points in the plane, we study

More information

BMO Round 2 Problem 3 Generalisation and Bounds

BMO Round 2 Problem 3 Generalisation and Bounds BMO 2007 2008 Round 2 Problem 3 Generalisation and Bounds Joseph Myers February 2008 1 Introduction Problem 3 (by Paul Jefferys) is: 3. Adrian has drawn a circle in the xy-plane whose radius is a positive

More information

Succinct Data Structures for Approximating Convex Functions with Applications

Succinct Data Structures for Approximating Convex Functions with Applications Succinct Data Structures for Approximating Convex Functions with Applications Prosenjit Bose, 1 Luc Devroye and Pat Morin 1 1 School of Computer Science, Carleton University, Ottawa, Canada, K1S 5B6, {jit,morin}@cs.carleton.ca

More information

Local Self-Tuning in Nonparametric Regression. Samory Kpotufe ORFE, Princeton University

Local Self-Tuning in Nonparametric Regression. Samory Kpotufe ORFE, Princeton University Local Self-Tuning in Nonparametric Regression Samory Kpotufe ORFE, Princeton University Local Regression Data: {(X i, Y i )} n i=1, Y = f(x) + noise f nonparametric F, i.e. dim(f) =. Learn: f n (x) = avg

More information

Discrete Geometry. Problem 1. Austin Mohr. April 26, 2012

Discrete Geometry. Problem 1. Austin Mohr. April 26, 2012 Discrete Geometry Austin Mohr April 26, 2012 Problem 1 Theorem 1 (Linear Programming Duality). Suppose x, y, b, c R n and A R n n, Ax b, x 0, A T y c, and y 0. If x maximizes c T x and y minimizes b T

More information

GI01/M055: Supervised Learning

GI01/M055: Supervised Learning GI01/M055: Supervised Learning 1. Introduction to Supervised Learning October 5, 2009 John Shawe-Taylor 1 Course information 1. When: Mondays, 14:00 17:00 Where: Room 1.20, Engineering Building, Malet

More information

Lecture 15: Random Projections

Lecture 15: Random Projections Lecture 15: Random Projections Introduction to Learning and Analysis of Big Data Kontorovich and Sabato (BGU) Lecture 15 1 / 11 Review of PCA Unsupervised learning technique Performs dimensionality reduction

More information