K-Nearest Neighbor Temporal Aggregate Queries

Size: px
Start display at page:

Download "K-Nearest Neighbor Temporal Aggregate Queries"

Transcription

1 Experiments and Conclusion K-Nearest Neighbor Temporal Aggregate Queries Yu Sun Jianzhong Qi Yu Zheng Rui Zhang Department of Computing and Information Systems University of Melbourne Microsoft Research, Beijing March 26 th 2015

2 Outline Experiments and Conclusion 1 Motivation and Related Work Motivating Examples Previous Work 2 Definition of knnta Structure and Usage of TAR-tree 3 The Proposed Strategy Analysis of Different Strategies 4 Experiments and Conclusion

3 Examples in Our Daily Life Experiments and Conclusion Motivating Examples Previous Work Find some walking-distance attractions Find a nearby club gathering lots of people now Find a good restaurant not far away and has few customers now

4 Experiments and Conclusion Motivating Examples Previous Work Answering Such Questions Has Wide Applications Foursquare or Facebook: places nearby Flickr or Instagram: photos taken nearby having many Likes Urban computing

5 Experiments and Conclusion Motivating Examples Previous Work No Existing Queries Can Effectively Answer Such Questions Ranking locations on Spatial distance Temporal aggregate on visits or likes Range aggregate does not work

6 Experiments and Conclusion Motivating Examples Previous Work No Existing Algorithms or Indexes Can Efficiently Support Such Rankings Characteristics of such applications Visits or likes arrive continuously Interested periods range from hours to years Explore results of different preferences

7 Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree A Weighted Sum of the Spatial Distance and Temporal Aggregate The ranking function f(p) = αd(p, q)+(1 α)(1 g(p,i q )) (knnta): returns the k locations with the minimum ranking scores

8 A Query Example Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree c l a g b j d q t 0 t 1 t 2 e a f i e f h... k l Query: q, [t 0, now], α = 0.3, k = 1 f(e) = (1 0.3) (1 2 ) = f(f) = (1 0.3) ( ) = 0.058

9 Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree A Straightforward Approach May Encounter Very High Cost Number of locations in Foursquare is 60 million Number of records for each location is 525, 600 Much more for applications like Instagram or Twitter

10 A Brief Reminder of R-tree Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree R 6 R 2 c g l R 1 a b d j f R 3 h e R 7 R 4 i R 5 k R 6 R 7 R 1 R 2 R 3 R 4 R 5 c g l a b d h j f e i k

11 Basic Structure of TAR-tree l a d j R 3 b c R 1 g R 2 R 5 f e h R 6 i R 4 R 7 k Temporal Indexes (2,3,2) R 6 R 7 (3,5,4) R 1 R 2 R 3 R 4 R 5 f c g b a e d h k i l j (3,5,4) (2,2,2) (2,3,1) (1,*,1)

12 Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree knnta Query Processing using TAR-tree Best-First Search: R 6 (0.000) R 7 (0.427) R 1 (0.058) R 2 (0.352) R 3 (0.408) R 7 (0.427) f (0.058) R 2 (0.352) R 3 (0.408) R 7 (0.427) (2,3,2) (3,5,4) R 6 R 7 (2,2,2) (3,5,4) R 1 R 2 R 3 (2,3,*) R 4 R 5 f c g b a e d h k i l j

13 Maintenance of TAR-tree Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree Insert Visits or Likes Insert location Re-Insert Note split R 6 R 7 R 1 R 2 R 3 R 4 R 5 f c g b a e d h k i l j

14 Maintenance of TAR-tree Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree Insert Visits or Likes Insert location Re-Insert Note split R 6 R 7 R 1 R 2 R 3 R 4 R 5 f c g b a e d h k i l j

15 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies The Importance of Entry Grouping Strategy R 6 R 2 R 7 Root, R 6, R 7, R 2 Four node accesses

16 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies The Importance of Entry Grouping Strategy R 6 R 2 R 7 R 3 R 6 Root, R 6, R 7, R 2 Four node accesses Root, R 6, R 3 Three node accesses

17 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Properly Integrating the Spatial Distance and Temporal Aggregate Straightforward one: Use the spatial information Straightforward two: Use the aggregate distribution (1, 0, 1) with (1, 1, 0) (1, 0, 1) not with (10, 8, 9) Propose: 3D MBR in R*-tree two spatial dimensions third is the aggregate dimension Coordinate of the aggregate dimension m λ p = 1 m v i i=1 Example: (1, 0, 1) 0.67 and (10, 2, 9, 8) 7.25

18 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Power-law Distribution of the Aggregate Data Roughly 80% of the visits are at 20% of the locations Pr(X x) Pr(X x) x (a) NYC Pr(X x) Pr(X x) x (b) LA x (c) GW x (d) GS

19 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Estimation of the Query Search Region and Number of Node Accesses aggregate dimension l j c g g a b d e h k i f 0.00 Conic shape search region

20 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Estimation of the Query Search Region and Number of Node Accesses aggregate dimension l j c g g a b d e h k i f 0.00 aggregate dimension nodes search region Conic shape search region Power-law like node extents

21 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Bad Behavior of the Two Straightforward Strategies b a d e l j h k c g i f Spatial grouping

22 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Bad Behavior of the Two Straightforward Strategies l j b a d h e k a d b l j h k e c g i c g i f f Spatial grouping Aggregate grouping

23 Experiment Set Up Experiments and Conclusion Experiments Conclusion Table: Data Set Name Time Locations Check-ins NYC 05/ / , ,784 LA 02/ / , ,924 GW 02/ /2010 1,280,969 6,442,803 GS 01/ / ,968 1,385,223 Temporal index: Multi-version B-tree Desktop with 3.40GHz CPU and 16GB RAM Results are averaged over 1, 000 queries By default k = 10 and α = 0.3.

24 Validation of The Analysis Experiments and Conclusion Experiments Conclusion f(p k ) f(p k ) measured estimated k measured estimated α 0 node accesses node accesses measured estimated k measured estimated α 0

25 Experiments and Conclusion Experiments Conclusion Performance of the TAR-tree CPU time (ms) CPU time (ms) baseline IND-agg IND-spa TAR-tree 1 20% 40% 60% 80% 100% time 10 baseline IND-agg IND-spa TAR-tree α CPU time (ms) CPU time (ms) baseline IND-agg IND-spa TAR-tree k baseline IND-agg IND-spa TAR-tree epoch length (day)

26 Conclusion Experiments and Conclusion Experiments Conclusion The knnta query can provide highly customized location retrieval and has wide applications. The TAR-tree index efficiently processes the knnta query.

27 Experiments and Conclusion Experiments Conclusion Questions?

Exploiting Geographic Dependencies for Real Estate Appraisal

Exploiting Geographic Dependencies for Real Estate Appraisal Exploiting Geographic Dependencies for Real Estate Appraisal Yanjie Fu Joint work with Hui Xiong, Yu Zheng, Yong Ge, Zhihua Zhou, Zijun Yao Rutgers, the State University of New Jersey Microsoft Research

More information

Nearest Neighbor Search with Keywords

Nearest Neighbor Search with Keywords Nearest Neighbor Search with Keywords Yufei Tao KAIST June 3, 2013 In recent years, many search engines have started to support queries that combine keyword search with geography-related predicates (e.g.,

More information

Skylines. Yufei Tao. ITEE University of Queensland. INFS4205/7205, Uni of Queensland

Skylines. Yufei Tao. ITEE University of Queensland. INFS4205/7205, Uni of Queensland Yufei Tao ITEE University of Queensland Today we will discuss problems closely related to the topic of multi-criteria optimization, where one aims to identify objects that strike a good balance often optimal

More information

Behavioral Simulations in MapReduce

Behavioral Simulations in MapReduce Behavioral Simulations in MapReduce Guozhang Wang, Marcos Vaz Salles, Benjamin Sowell, Xun Wang, Tuan Cao, Alan Demers, Johannes Gehrke, Walker White Cornell University 1 What are Behavioral Simulations?

More information

Nearest Neighbor Search with Keywords in Spatial Databases

Nearest Neighbor Search with Keywords in Spatial Databases 776 Nearest Neighbor Search with Keywords in Spatial Databases 1 Sphurti S. Sao, 2 Dr. Rahila Sheikh 1 M. Tech Student IV Sem, Dept of CSE, RCERT Chandrapur, MH, India 2 Head of Department, Dept of CSE,

More information

A Survey on Spatial-Keyword Search

A Survey on Spatial-Keyword Search A Survey on Spatial-Keyword Search (COMP 6311C Advanced Data Management) Nikolaos Armenatzoglou 06/03/2012 Outline Problem Definition and Motivation Query Types Query Processing Techniques Indices Algorithms

More information

Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time

Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time Experiment presentation for CS3710:Visual Recognition Presenter: Zitao Liu University of Pittsburgh ztliu@cs.pitt.edu

More information

QuickScorer: a fast algorithm to rank documents with additive ensembles of regression trees

QuickScorer: a fast algorithm to rank documents with additive ensembles of regression trees QuickScorer: a fast algorithm to rank documents with additive ensembles of regression trees Claudio Lucchese, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto HPC Lab, ISTI-CNR, Pisa, Italy & Tiscali

More information

Diagnosing New York City s Noises with Ubiquitous Data

Diagnosing New York City s Noises with Ubiquitous Data Diagnosing New York City s Noises with Ubiquitous Data Dr. Yu Zheng yuzheng@microsoft.com Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Background Many cities suffer

More information

An Efficient Sampling Method for Characterizing Points of Interests on Maps

An Efficient Sampling Method for Characterizing Points of Interests on Maps An Efficient Sampling Method for Characterizing Points of Interests on Maps Team 1 Qiuyi Hong, Caidan Liu, Zhenhua Li, Jiaqi Liu, Shaowei Gong Outline Background and formulated problem Challenges Our methods

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu March 16, 2016 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision

More information

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends Point-of-Interest Recommendations: Learning Potential Check-ins from Friends Huayu Li, Yong Ge +, Richang Hong, Hengshu Zhu University of North Carolina at Charlotte + University of Arizona Hefei University

More information

Personalized Social Recommendations Accurate or Private

Personalized Social Recommendations Accurate or Private Personalized Social Recommendations Accurate or Private Presented by: Lurye Jenny Paper by: Ashwin Machanavajjhala, Aleksandra Korolova, Atish Das Sarma Outline Introduction Motivation The model General

More information

Collaborative Nowcasting for Contextual Recommendation

Collaborative Nowcasting for Contextual Recommendation Collaborative for Contextual Recommendation Yu Sun 1, Nicholas Jing Yuan 2, Xing Xie 3, Kieran McDonald 4, Rui Zhang 5 University of Melbourne { 1 sun.y, 5 rui.zhang}@unimelb.edu.au Microsoft Research

More information

Carnegie Mellon Univ. Dept. of Computer Science Database Applications. SAMs - Detailed outline. Spatial Access Methods - problem

Carnegie Mellon Univ. Dept. of Computer Science Database Applications. SAMs - Detailed outline. Spatial Access Methods - problem Carnegie Mellon Univ. Dept. of Computer Science 15-415 - Database Applications Lecture #26: Spatial Databases (R&G ch. 28) SAMs - Detailed outline spatial access methods problem dfn R-trees Faloutsos 2

More information

Understanding Wealth in New York City From the Activity of Local Businesses

Understanding Wealth in New York City From the Activity of Local Businesses Understanding Wealth in New York City From the Activity of Local Businesses Vincent S. Chen Department of Computer Science Stanford University vschen@stanford.edu Dan X. Yu Department of Computer Science

More information

Social and Technological Network Analysis: Spatial Networks, Mobility and Applications

Social and Technological Network Analysis: Spatial Networks, Mobility and Applications Social and Technological Network Analysis: Spatial Networks, Mobility and Applications Anastasios Noulas Computer Laboratory, University of Cambridge February 2015 Today s Outline 1. Introduction to spatial

More information

TASM: Top-k Approximate Subtree Matching

TASM: Top-k Approximate Subtree Matching TASM: Top-k Approximate Subtree Matching Nikolaus Augsten 1 Denilson Barbosa 2 Michael Böhlen 3 Themis Palpanas 4 1 Free University of Bozen-Bolzano, Italy augsten@inf.unibz.it 2 University of Alberta,

More information

Query Processing in Spatial Network Databases

Query Processing in Spatial Network Databases Temporal and Spatial Data Management Fall 0 Query Processing in Spatial Network Databases SL06 Spatial network databases Shortest Path Incremental Euclidean Restriction Incremental Network Expansion Spatial

More information

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018 Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google

More information

Progressive & Algorithms & Systems

Progressive & Algorithms & Systems University of California Merced Lawrence Berkeley National Laboratory Progressive Computation for Data Exploration Progressive Computation Online Aggregation (OLA) in DB Query Result Estimate Result ε

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu November 16, 2015 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision

More information

Facebook Friends! and Matrix Functions

Facebook Friends! and Matrix Functions Facebook Friends! and Matrix Functions! Graduate Research Day Joint with David F. Gleich, (Purdue), supported by" NSF CAREER 1149756-CCF Kyle Kloster! Purdue University! Network Analysis Use linear algebra

More information

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties Prof. James She james.she@ust.hk 1 Last lecture 2 Selected works from Tutorial

More information

Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann

Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya anki, Bojana Dimcheva, Donald Kossmann Systems Group ETH Zurich Moving Objects Intersection Finding Position at a future

More information

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data Yingzi Wang 12, Nicholas Jing Yuan 2, Defu Lian 3, Linli Xu 1 Xing Xie 2, Enhong Chen 1, Yong Rui 2 1 University of Science

More information

Discovering Spatial and Temporal Links among RDF Data Panayiotis Smeros and Manolis Koubarakis

Discovering Spatial and Temporal Links among RDF Data Panayiotis Smeros and Manolis Koubarakis Discovering Spatial and Temporal Links among RDF Data Panayiotis Smeros and Manolis Koubarakis WWW2016 Workshop: Linked Data on the Web (LDOW2016) April 12, 2016 - Montréal, Canada Outline Introduction

More information

Not All Apps Are Created Equal:

Not All Apps Are Created Equal: Not All Apps Are Created Equal: Analysis of Spatiotemporal Heterogeneity in Nationwide Mobile Service Usage Cristina Marquez and Marco Gramaglia (Universidad Carlos III de Madrid); Marco Fiore (CNR-IEIIT);

More information

Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates

Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates X I A O YA N G W A N G 1, Y I N G Z H A N G 2, W E N J I E Z H A N G 1, X U E M I N L I N 1, M U H A M M A D A A M I R C H

More information

Nearest Neighbor Search with Keywords: Compression

Nearest Neighbor Search with Keywords: Compression Nearest Neighbor Search with Keywords: Compression Yufei Tao KAIST June 3, 2013 In this lecture, we will continue our discussion on: Problem (Nearest Neighbor Search with Keywords) Let P be a set of points

More information

Web Structure Mining Nodes, Links and Influence

Web Structure Mining Nodes, Links and Influence Web Structure Mining Nodes, Links and Influence 1 Outline 1. Importance of nodes 1. Centrality 2. Prestige 3. Page Rank 4. Hubs and Authority 5. Metrics comparison 2. Link analysis 3. Influence model 1.

More information

Model of Stars 5 Oct

Model of Stars 5 Oct Outline Test 1 Model of Stars 5 Oct Missouri Club on Fri Questions on parallax Hot plate model of a star Thermal radiation Hertzsprung Russell diagram Missouri Club for Hwk 4 Test 1: Average 15 (60%) Test

More information

Outline for today. Information Retrieval. Cosine similarity between query and document. tf-idf weighting

Outline for today. Information Retrieval. Cosine similarity between query and document. tf-idf weighting Outline for today Information Retrieval Efficient Scoring and Ranking Recap on ranked retrieval Jörg Tiedemann jorg.tiedemann@lingfil.uu.se Department of Linguistics and Philology Uppsala University Efficient

More information

Collaborative Filtering. Radek Pelánek

Collaborative Filtering. Radek Pelánek Collaborative Filtering Radek Pelánek 2017 Notes on Lecture the most technical lecture of the course includes some scary looking math, but typically with intuitive interpretation use of standard machine

More information

Where to Find My Next Passenger?

Where to Find My Next Passenger? Where to Find My Next Passenger? Jing Yuan 1 Yu Zheng 2 Liuhang Zhang 1 Guangzhong Sun 1 1 University of Science and Technology of China 2 Microsoft Research Asia September 19, 2011 Jing Yuan et al. (USTC,MSRA)

More information

Frequency-hiding Dependency-preserving Encryption for Outsourced Databases

Frequency-hiding Dependency-preserving Encryption for Outsourced Databases Frequency-hiding Dependency-preserving Encryption for Outsourced Databases ICDE 17 Boxiang Dong 1 Wendy Wang 2 1 Montclair State University Montclair, NJ 2 Stevens Institute of Technology Hoboken, NJ April

More information

ECEN 689 Special Topics in Data Science for Communications Networks

ECEN 689 Special Topics in Data Science for Communications Networks ECEN 689 Special Topics in Data Science for Communications Networks Nick Duffield Department of Electrical & Computer Engineering Texas A&M University Lecture 5 Optimizing Fixed Size Samples Sampling as

More information

Adapting an Existing Activity Based Modeling Structure for the New York Region

Adapting an Existing Activity Based Modeling Structure for the New York Region Adapting an Existing Activity Based Modeling Structure for the New York Region presented to 2018 TRB Innovations in Travel Modeling Conference Attendees presented by Rachel Copperman Jason Lemp with Thomas

More information

Spatial Keyword Search by using Point Regional QuadTree Algorithm

Spatial Keyword Search by using Point Regional QuadTree Algorithm Spatial Keyword Search by using Point Regional QuadTree Algorithm Ms Harshitha A.R 1, Smt. Christy Persya A 2 PG Scholar, Department of ISE (SSE), BNMIT, Bangalore 560085, Karnataka, India 1 Associate

More information

Slides based on those in:

Slides based on those in: Spyros Kontogiannis & Christos Zaroliagis Slides based on those in: http://www.mmds.org High dim. data Graph data Infinite data Machine learning Apps Locality sensitive hashing PageRank, SimRank Filtering

More information

Latent Geographic Feature Extraction from Social Media

Latent Geographic Feature Extraction from Social Media Latent Geographic Feature Extraction from Social Media Christian Sengstock* Michael Gertz Database Systems Research Group Heidelberg University, Germany November 8, 2012 Social Media is a huge and increasing

More information

Neural networks. Chapter 20. Chapter 20 1

Neural networks. Chapter 20. Chapter 20 1 Neural networks Chapter 20 Chapter 20 1 Outline Brains Neural networks Perceptrons Multilayer networks Applications of neural networks Chapter 20 2 Brains 10 11 neurons of > 20 types, 10 14 synapses, 1ms

More information

Exploring Urban Areas of Interest. Yingjie Hu and Sathya Prasad

Exploring Urban Areas of Interest. Yingjie Hu and Sathya Prasad Exploring Urban Areas of Interest Yingjie Hu and Sathya Prasad What is Urban Areas of Interest (AOIs)? What is Urban Areas of Interest (AOIs)? Urban AOIs exist in people s minds and defined by people s

More information

Outline. Computer Science 331. Cost of Binary Search Tree Operations. Bounds on Height: Worst- and Average-Case

Outline. Computer Science 331. Cost of Binary Search Tree Operations. Bounds on Height: Worst- and Average-Case Outline Computer Science Average Case Analysis: Binary Search Trees Mike Jacobson Department of Computer Science University of Calgary Lecture #7 Motivation and Objective Definition 4 Mike Jacobson (University

More information

Spatial Database. Ahmad Alhilal, Dimitris Tsaras

Spatial Database. Ahmad Alhilal, Dimitris Tsaras Spatial Database Ahmad Alhilal, Dimitris Tsaras Content What is Spatial DB Modeling Spatial DB Spatial Queries The R-tree Range Query NN Query Aggregation Query RNN Query NN Queries with Validity Information

More information

ALMA Cycle 6 Information and Cycle 4 and Cycle 5 Statistics

ALMA Cycle 6 Information and Cycle 4 and Cycle 5 Statistics ALMA Cycle 6 Information and Cycle 4 and Cycle 5 Statistics Yu-Nung Su (ASIAA) on behalf of Taiwan ARC node ALMA Cycle 6 Users Workshop March 26, 2018 Photo Credit: Wei-Hao Wang Cycle 6 Timeline (23:00

More information

Unveiling the misteries of our Galaxy with Intersystems Caché

Unveiling the misteries of our Galaxy with Intersystems Caché Unveiling the misteries of our Galaxy with Intersystems Caché Dr. Jordi Portell i de Mora on behalf of the Gaia team at the Institute for Space Studies of Catalonia (IEEC UB) and the Gaia Data Processing

More information

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine CS 277: Data Mining Mining Web Link Structure Class Presentations In-class, Tuesday and Thursday next week 2-person teams: 6 minutes, up to 6 slides, 3 minutes/slides each person 1-person teams 4 minutes,

More information

past balancing schemes require maintenance of balance info at all times, are aggresive use work of searches to pay for work of rebalancing

past balancing schemes require maintenance of balance info at all times, are aggresive use work of searches to pay for work of rebalancing 1 Splay Trees Sleator and Tarjan, Self Adjusting Binary Search Trees JACM 32(3) 1985 The claim planning ahead. But matches previous idea of being lazy, letting potential build up, using it to pay for expensive

More information

Data Mining Recitation Notes Week 3

Data Mining Recitation Notes Week 3 Data Mining Recitation Notes Week 3 Jack Rae January 28, 2013 1 Information Retrieval Given a set of documents, pull the (k) most similar document(s) to a given query. 1.1 Setup Say we have D documents

More information

Finding Top-k Preferable Products

Finding Top-k Preferable Products JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO., JANUARY 7 Finding Top-k Preferable Products Yu Peng, Raymond Chi-Wing Wong and Qian Wan Abstract The importance of dominance and skyline analysis has been

More information

Large-Scale Behavioral Targeting

Large-Scale Behavioral Targeting Large-Scale Behavioral Targeting Ye Chen, Dmitry Pavlov, John Canny ebay, Yandex, UC Berkeley (This work was conducted at Yahoo! Labs.) June 30, 2009 Chen et al. (KDD 09) Large-Scale Behavioral Targeting

More information

Activity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness

Activity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness Activity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness Lian Huang, Qingquan Li, Yang Yue State Key Laboratory of Information Engineering in Survey, Mapping and Remote

More information

Topic 4 Randomized algorithms

Topic 4 Randomized algorithms CSE 103: Probability and statistics Winter 010 Topic 4 Randomized algorithms 4.1 Finding percentiles 4.1.1 The mean as a summary statistic Suppose UCSD tracks this year s graduating class in computer science

More information

ArcGIS Enterprise: Out-of-the-Box Spatial Analysis. Vicki Cove Hilary Curtis

ArcGIS Enterprise: Out-of-the-Box Spatial Analysis. Vicki Cove Hilary Curtis ArcGIS Enterprise: Out-of-the-Box Spatial Analysis Vicki Cove Hilary Curtis Agenda What is spatial analysis? Spatial analysis with ArcGIS Enterprise Analysis demos: - Sunflower proximity to bees - Tourists

More information

Predictive Nearest Neighbor Queries Over Uncertain Spatial-Temporal Data

Predictive Nearest Neighbor Queries Over Uncertain Spatial-Temporal Data Predictive Nearest Neighbor Queries Over Uncertain Spatial-Temporal Data Jinghua Zhu, Xue Wang, and Yingshu Li Department of Computer Science, Georgia State University, Atlanta GA, USA, jhzhu.ellen@gmail.com

More information

RaRE: Social Rank Regulated Large-scale Network Embedding

RaRE: Social Rank Regulated Large-scale Network Embedding RaRE: Social Rank Regulated Large-scale Network Embedding Authors: Yupeng Gu 1, Yizhou Sun 1, Yanen Li 2, Yang Yang 3 04/26/2018 The Web Conference, 2018 1 University of California, Los Angeles 2 Snapchat

More information

Tailored Bregman Ball Trees for Effective Nearest Neighbors

Tailored Bregman Ball Trees for Effective Nearest Neighbors Tailored Bregman Ball Trees for Effective Nearest Neighbors Frank Nielsen 1 Paolo Piro 2 Michel Barlaud 2 1 Ecole Polytechnique, LIX, Palaiseau, France 2 CNRS / University of Nice-Sophia Antipolis, Sophia

More information

Holdout and Cross-Validation Methods Overfitting Avoidance

Holdout and Cross-Validation Methods Overfitting Avoidance Holdout and Cross-Validation Methods Overfitting Avoidance Decision Trees Reduce error pruning Cost-complexity pruning Neural Networks Early stopping Adjusting Regularizers via Cross-Validation Nearest

More information

Algorithms for Calculating Statistical Properties on Moving Points

Algorithms for Calculating Statistical Properties on Moving Points Algorithms for Calculating Statistical Properties on Moving Points Dissertation Proposal Sorelle Friedler Committee: David Mount (Chair), William Gasarch Samir Khuller, Amitabh Varshney January 14, 2009

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Lecture #9: Link Analysis Seoul National University 1 In This Lecture Motivation for link analysis Pagerank: an important graph ranking algorithm Flow and random walk formulation

More information

Time-aware Point-of-interest Recommendation

Time-aware Point-of-interest Recommendation Time-aware Point-of-interest Recommendation Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, and Nadia Magnenat-Thalmann School of Computer Engineering Nanyang Technological University Presented by ShenglinZHAO

More information

Multi-Dimensional Top-k Dominating Queries

Multi-Dimensional Top-k Dominating Queries Noname manuscript No. (will be inserted by the editor) Multi-Dimensional Top-k Dominating Queries Man Lung Yiu Nikos Mamoulis Abstract The top-k dominating query returns k data objects which dominate the

More information

Recommendation Systems

Recommendation Systems Recommendation Systems Pawan Goyal CSE, IITKGP October 21, 2014 Pawan Goyal (IIT Kharagpur) Recommendation Systems October 21, 2014 1 / 52 Recommendation System? Pawan Goyal (IIT Kharagpur) Recommendation

More information

DYNAMIC TRIP ATTRACTION ESTIMATION WITH LOCATION BASED SOCIAL NETWORK DATA BALANCING BETWEEN TIME OF DAY VARIATIONS AND ZONAL DIFFERENCES

DYNAMIC TRIP ATTRACTION ESTIMATION WITH LOCATION BASED SOCIAL NETWORK DATA BALANCING BETWEEN TIME OF DAY VARIATIONS AND ZONAL DIFFERENCES DYNAMIC TRIP ATTRACTION ESTIMATION WITH LOCATION BASED SOCIAL NETWORK DATA BALANCING BETWEEN TIME OF DAY VARIATIONS AND ZONAL DIFFERENCES Nicholas W. Hu a, Peter J. Jin b, * a Department of Civil and Environmental

More information

P-LAG: Location-aware Group Recommendation for Passive Users

P-LAG: Location-aware Group Recommendation for Passive Users P-LAG: Location-aware Group Recommendation for Passive Users Yuqiu Qian, Ziyu Lu, Nikos Mamoulis, and David W. Cheung The University of Hong Kong {yqqian, zylu, dcheung}@cs.hku.hk University of Ioannina

More information

Vehicle Routing with Traffic Congestion and Drivers Driving and Working Rules

Vehicle Routing with Traffic Congestion and Drivers Driving and Working Rules Vehicle Routing with Traffic Congestion and Drivers Driving and Working Rules A.L. Kok, E.W. Hans, J.M.J. Schutten, W.H.M. Zijm Operational Methods for Production and Logistics, University of Twente, P.O.

More information

Socio-Spatial Group Queries for Impromptu Activity Planning

Socio-Spatial Group Queries for Impromptu Activity Planning Socio-Spatial Group Queries for Impromptu Activity Planning Chih-Ya Shen, De-Nian Yang, Senior Member, IEEE, Liang-Hao Huang, Wang-Chien Lee, Member, IEEE, and Ming-Syan Chen, Fellow, IEEE arxiv:55.68v

More information

Social Studies Grade 2 - Building a Society

Social Studies Grade 2 - Building a Society Social Studies Grade 2 - Building a Society Description The second grade curriculum provides students with a broad view of the political units around them, specifically their town, state, and country.

More information

Attentive Betweenness Centrality (ABC): Considering Options and Bandwidth when Measuring Criticality

Attentive Betweenness Centrality (ABC): Considering Options and Bandwidth when Measuring Criticality Attentive Betweenness Centrality (ABC): Considering Options and Bandwidth when Measuring Criticality Sibel Adalı, Xiaohui Lu, M. Magdon-Ismail September 5, 0 Who is the Most Critical? Would you use the

More information

Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran

Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran January 2018 Niloofar HAJI MIRZA AGHASI Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran

More information

Learning a Degree-Augmented Distance Metric from a Network. Bert Huang, U of Maryland Blake Shaw, Foursquare Tony Jebara, Columbia U

Learning a Degree-Augmented Distance Metric from a Network. Bert Huang, U of Maryland Blake Shaw, Foursquare Tony Jebara, Columbia U Learning a Degree-Augmented Distance Metric from a Network Bert Huang, U of Maryland Blake Shaw, Foursquare Tony Jebara, Columbia U Beyond Mahalanobis: Supervised Large-Scale Learning of Similarity NIPS

More information

Powers functions. Composition and Inverse of functions.

Powers functions. Composition and Inverse of functions. Chapter 4 Powers functions. Composition and Inverse of functions. 4.1 Power functions We have already encountered some examples of power functions in the previous chapters, in the context of polynomial

More information

Mining Users Behaviors and Environments for Semantic Place Prediction

Mining Users Behaviors and Environments for Semantic Place Prediction Mining Users Behaviors and Environments for Semantic Place Prediction Chi-Min Huang Institute of Computer Science and Information Engineering National Cheng Kung University No.1, University Road, Tainan

More information

Inferring Friendship from Check-in Data of Location-Based Social Networks

Inferring Friendship from Check-in Data of Location-Based Social Networks Inferring Friendship from Check-in Data of Location-Based Social Networks Ran Cheng, Jun Pang, Yang Zhang Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg

More information

Advanced Implementations of Tables: Balanced Search Trees and Hashing

Advanced Implementations of Tables: Balanced Search Trees and Hashing Advanced Implementations of Tables: Balanced Search Trees and Hashing Balanced Search Trees Binary search tree operations such as insert, delete, retrieve, etc. depend on the length of the path to the

More information

SPATIAL INDEXING. Vaibhav Bajpai

SPATIAL INDEXING. Vaibhav Bajpai SPATIAL INDEXING Vaibhav Bajpai Contents Overview Problem with B+ Trees in Spatial Domain Requirements from a Spatial Indexing Structure Approaches SQL/MM Standard Current Issues Overview What is a Spatial

More information

Exploring the Impact of Ambient Population Measures on Crime Hotspots

Exploring the Impact of Ambient Population Measures on Crime Hotspots Exploring the Impact of Ambient Population Measures on Crime Hotspots Nick Malleson School of Geography, University of Leeds http://nickmalleson.co.uk/ N.S.Malleson@leeds.ac.uk Martin Andresen Institute

More information

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank

More information

SocViz: Visualization of Facebook Data

SocViz: Visualization of Facebook Data SocViz: Visualization of Facebook Data Abhinav S Bhatele Department of Computer Science University of Illinois at Urbana Champaign Urbana, IL 61801 USA bhatele2@uiuc.edu Kyratso Karahalios Department of

More information

Cambridge International Examinations Cambridge Ordinary Level

Cambridge International Examinations Cambridge Ordinary Level Cambridge International Examinations Cambridge Ordinary Level *7891959410* GEOGRAPHY 2217/22 Paper 2 October/November 2015 Candidates answer on the Question Paper. Additional Materials: Calculator Ruler

More information

Composite Quantization for Approximate Nearest Neighbor Search

Composite Quantization for Approximate Nearest Neighbor Search Composite Quantization for Approximate Nearest Neighbor Search Jingdong Wang Lead Researcher Microsoft Research http://research.microsoft.com/~jingdw ICML 104, joint work with my interns Ting Zhang from

More information

RESEARCH ON THE DISTRIBUTED PARALLEL SPATIAL INDEXING SCHEMA BASED ON R-TREE

RESEARCH ON THE DISTRIBUTED PARALLEL SPATIAL INDEXING SCHEMA BASED ON R-TREE RESEARCH ON THE DISTRIBUTED PARALLEL SPATIAL INDEXING SCHEMA BASED ON R-TREE Yuan-chun Zhao a, b, Cheng-ming Li b a. Shandong University of Science and Technology, Qingdao 266510 b. Chinese Academy of

More information

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts

Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Kathrin Bujna 1 and Martin Wistuba 2 1 Paderborn University 2 IBM Research Ireland Abstract.

More information

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank

More information

Blog Community Discovery and Evolution

Blog Community Discovery and Evolution Blog Community Discovery and Evolution Mutual Awareness, Interactions and Community Stories Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura and Belle Tseng What do people feel about Hurricane Katrina?

More information

The direction-constrained k nearest neighbor query

The direction-constrained k nearest neighbor query Geoinformatica (2016) 20:471 502 DOI 10.1007/s10707-016-0245-2 The direction-constrained k nearest neighbor query Dealing with spatio-directional objects Min-Joong Lee 2 Dong-Wan Choi 3 SangYeon Kim 2

More information

I N T H E W O O D S FEBRUARY HALF TERM ACTIVITY GUIDE ( F O R T H E E N C H A N T E D ) C A L L A M B E R

I N T H E W O O D S FEBRUARY HALF TERM ACTIVITY GUIDE ( F O R T H E E N C H A N T E D ) C A L L A M B E R I N T H E W O O D S ( F O R T H E E N C H A N T E D ) FEBRUARY HALF TERM ACTIVITY GUIDE C A L L A M B E R 0 8 9 7 9 0 4 3 6 ACTIVITY GUIDE I N T H E W O O D S KIDS FOR ALL SEASONS The kids' club, on the

More information

Note that numerically, with white corresponding to 0 and black to 1, the rule can be written:

Note that numerically, with white corresponding to 0 and black to 1, the rule can be written: Cellular automata We discuss cellular automata as a simple application of MATLAB programming and as an accessible scientific topic of recent interest. You can find a lot of information on the internet.

More information

Multi-Approximate-Keyword Routing Query

Multi-Approximate-Keyword Routing Query Bin Yao 1, Mingwang Tang 2, Feifei Li 2 1 Department of Computer Science and Engineering Shanghai Jiao Tong University, P. R. China 2 School of Computing University of Utah, USA Outline 1 Introduction

More information

LABELING RESIDENTIAL COMMUNITY CHARACTERISTICS FROM COLLECTIVE ACTIVITY PATTERNS USING TAXI TRIP DATA

LABELING RESIDENTIAL COMMUNITY CHARACTERISTICS FROM COLLECTIVE ACTIVITY PATTERNS USING TAXI TRIP DATA LABELING RESIDENTIAL COMMUNITY CHARACTERISTICS FROM COLLECTIVE ACTIVITY PATTERNS USING TAXI TRIP DATA Yang Zhou 1, 3, *, Zhixiang Fang 2 1 Wuhan Land Use and Urban Spatial Planning Research Center, 55Sanyang

More information

arxiv: v2 [cs.db] 5 May 2011

arxiv: v2 [cs.db] 5 May 2011 Inverse Queries For Multidimensional Spaces Thomas Bernecker 1, Tobias Emrich 1, Hans-Peter Kriegel 1, Nikos Mamoulis 2, Matthias Renz 1, Shiming Zhang 2, and Andreas Züfle 1 arxiv:113.172v2 [cs.db] May

More information

Your World is not Red or Green. Good Practice in Data Display and Dashboard Design

Your World is not Red or Green. Good Practice in Data Display and Dashboard Design Your World is not Red or Green Good Practice in Data Display and Dashboard Design References Tufte, E. R. (2). The visual display of quantitative information (2nd Ed.). Cheshire, CT: Graphics Press. Few,

More information

Decision Trees. None Some Full > No Yes. No Yes. No Yes. No Yes. No Yes. No Yes. No Yes. Patrons? WaitEstimate? Hungry? Alternate?

Decision Trees. None Some Full > No Yes. No Yes. No Yes. No Yes. No Yes. No Yes. No Yes. Patrons? WaitEstimate? Hungry? Alternate? Decision rees Decision trees is one of the simplest methods for supervised learning. It can be applied to both regression & classification. Example: A decision tree for deciding whether to wait for a place

More information

Impression Store: Compressive Sensing-based Storage for. Big Data Analytics

Impression Store: Compressive Sensing-based Storage for. Big Data Analytics Impression Store: Compressive Sensing-based Storage for Big Data Analytics Jiaxing Zhang, Ying Yan, Liang Jeff Chen, Minjie Wang, Thomas Moscibroda & Zheng Zhang Microsoft Research The Curse of O(N) in

More information

The Flexible Socio Spatial Group Queries

The Flexible Socio Spatial Group Queries The Flexible Socio Spatial Group Queries ishwamittra Ghosh UT, angladesh bishwamittra.ghosh@gmail.com Sajid Hasan pon UT, angladesh sajidhasanapon@gmail.com Mohammed unus li UT, angladesh eunus@cse.buet.ac.bd

More information

Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II

Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II Noah Snavely Cornell James Hays CMU MIT (Fall 2009) Brown (Spring 2010 ) Complexity of the Visual World

More information

Estimating Large Scale Population Movement ML Dublin Meetup

Estimating Large Scale Population Movement ML Dublin Meetup Deutsche Bank COO Chief Data Office Estimating Large Scale Population Movement ML Dublin Meetup John Doyle PhD Assistant Vice President CDO Research & Development Science & Innovation john.doyle@db.com

More information

Review of Some Fast Algorithms for Electromagnetic Scattering

Review of Some Fast Algorithms for Electromagnetic Scattering Review of Some Fast Algorithms for Electromagnetic Scattering Weng Cho Chew Center for Computational Electromagnetics and Electromagnetic Laboratory University of Illinois at Urbana-Champaign CSCAMM Lecture

More information

Caesar s Taxi Prediction Services

Caesar s Taxi Prediction Services 1 Caesar s Taxi Prediction Services Predicting NYC Taxi Fares, Trip Distance, and Activity Paul Jolly, Boxiao Pan, Varun Nambiar Abstract In this paper, we propose three models each predicting either taxi

More information