K-Nearest Neighbor Temporal Aggregate Queries
|
|
- Meagan Wade
- 6 years ago
- Views:
Transcription
1 Experiments and Conclusion K-Nearest Neighbor Temporal Aggregate Queries Yu Sun Jianzhong Qi Yu Zheng Rui Zhang Department of Computing and Information Systems University of Melbourne Microsoft Research, Beijing March 26 th 2015
2 Outline Experiments and Conclusion 1 Motivation and Related Work Motivating Examples Previous Work 2 Definition of knnta Structure and Usage of TAR-tree 3 The Proposed Strategy Analysis of Different Strategies 4 Experiments and Conclusion
3 Examples in Our Daily Life Experiments and Conclusion Motivating Examples Previous Work Find some walking-distance attractions Find a nearby club gathering lots of people now Find a good restaurant not far away and has few customers now
4 Experiments and Conclusion Motivating Examples Previous Work Answering Such Questions Has Wide Applications Foursquare or Facebook: places nearby Flickr or Instagram: photos taken nearby having many Likes Urban computing
5 Experiments and Conclusion Motivating Examples Previous Work No Existing Queries Can Effectively Answer Such Questions Ranking locations on Spatial distance Temporal aggregate on visits or likes Range aggregate does not work
6 Experiments and Conclusion Motivating Examples Previous Work No Existing Algorithms or Indexes Can Efficiently Support Such Rankings Characteristics of such applications Visits or likes arrive continuously Interested periods range from hours to years Explore results of different preferences
7 Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree A Weighted Sum of the Spatial Distance and Temporal Aggregate The ranking function f(p) = αd(p, q)+(1 α)(1 g(p,i q )) (knnta): returns the k locations with the minimum ranking scores
8 A Query Example Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree c l a g b j d q t 0 t 1 t 2 e a f i e f h... k l Query: q, [t 0, now], α = 0.3, k = 1 f(e) = (1 0.3) (1 2 ) = f(f) = (1 0.3) ( ) = 0.058
9 Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree A Straightforward Approach May Encounter Very High Cost Number of locations in Foursquare is 60 million Number of records for each location is 525, 600 Much more for applications like Instagram or Twitter
10 A Brief Reminder of R-tree Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree R 6 R 2 c g l R 1 a b d j f R 3 h e R 7 R 4 i R 5 k R 6 R 7 R 1 R 2 R 3 R 4 R 5 c g l a b d h j f e i k
11 Basic Structure of TAR-tree l a d j R 3 b c R 1 g R 2 R 5 f e h R 6 i R 4 R 7 k Temporal Indexes (2,3,2) R 6 R 7 (3,5,4) R 1 R 2 R 3 R 4 R 5 f c g b a e d h k i l j (3,5,4) (2,2,2) (2,3,1) (1,*,1)
12 Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree knnta Query Processing using TAR-tree Best-First Search: R 6 (0.000) R 7 (0.427) R 1 (0.058) R 2 (0.352) R 3 (0.408) R 7 (0.427) f (0.058) R 2 (0.352) R 3 (0.408) R 7 (0.427) (2,3,2) (3,5,4) R 6 R 7 (2,2,2) (3,5,4) R 1 R 2 R 3 (2,3,*) R 4 R 5 f c g b a e d h k i l j
13 Maintenance of TAR-tree Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree Insert Visits or Likes Insert location Re-Insert Note split R 6 R 7 R 1 R 2 R 3 R 4 R 5 f c g b a e d h k i l j
14 Maintenance of TAR-tree Experiments and Conclusion Definition of knnta Structure and Usage of TAR-tree Insert Visits or Likes Insert location Re-Insert Note split R 6 R 7 R 1 R 2 R 3 R 4 R 5 f c g b a e d h k i l j
15 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies The Importance of Entry Grouping Strategy R 6 R 2 R 7 Root, R 6, R 7, R 2 Four node accesses
16 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies The Importance of Entry Grouping Strategy R 6 R 2 R 7 R 3 R 6 Root, R 6, R 7, R 2 Four node accesses Root, R 6, R 3 Three node accesses
17 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Properly Integrating the Spatial Distance and Temporal Aggregate Straightforward one: Use the spatial information Straightforward two: Use the aggregate distribution (1, 0, 1) with (1, 1, 0) (1, 0, 1) not with (10, 8, 9) Propose: 3D MBR in R*-tree two spatial dimensions third is the aggregate dimension Coordinate of the aggregate dimension m λ p = 1 m v i i=1 Example: (1, 0, 1) 0.67 and (10, 2, 9, 8) 7.25
18 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Power-law Distribution of the Aggregate Data Roughly 80% of the visits are at 20% of the locations Pr(X x) Pr(X x) x (a) NYC Pr(X x) Pr(X x) x (b) LA x (c) GW x (d) GS
19 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Estimation of the Query Search Region and Number of Node Accesses aggregate dimension l j c g g a b d e h k i f 0.00 Conic shape search region
20 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Estimation of the Query Search Region and Number of Node Accesses aggregate dimension l j c g g a b d e h k i f 0.00 aggregate dimension nodes search region Conic shape search region Power-law like node extents
21 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Bad Behavior of the Two Straightforward Strategies b a d e l j h k c g i f Spatial grouping
22 Experiments and Conclusion The Proposed Strategy Analysis of Different Strategies Bad Behavior of the Two Straightforward Strategies l j b a d h e k a d b l j h k e c g i c g i f f Spatial grouping Aggregate grouping
23 Experiment Set Up Experiments and Conclusion Experiments Conclusion Table: Data Set Name Time Locations Check-ins NYC 05/ / , ,784 LA 02/ / , ,924 GW 02/ /2010 1,280,969 6,442,803 GS 01/ / ,968 1,385,223 Temporal index: Multi-version B-tree Desktop with 3.40GHz CPU and 16GB RAM Results are averaged over 1, 000 queries By default k = 10 and α = 0.3.
24 Validation of The Analysis Experiments and Conclusion Experiments Conclusion f(p k ) f(p k ) measured estimated k measured estimated α 0 node accesses node accesses measured estimated k measured estimated α 0
25 Experiments and Conclusion Experiments Conclusion Performance of the TAR-tree CPU time (ms) CPU time (ms) baseline IND-agg IND-spa TAR-tree 1 20% 40% 60% 80% 100% time 10 baseline IND-agg IND-spa TAR-tree α CPU time (ms) CPU time (ms) baseline IND-agg IND-spa TAR-tree k baseline IND-agg IND-spa TAR-tree epoch length (day)
26 Conclusion Experiments and Conclusion Experiments Conclusion The knnta query can provide highly customized location retrieval and has wide applications. The TAR-tree index efficiently processes the knnta query.
27 Experiments and Conclusion Experiments Conclusion Questions?
Exploiting Geographic Dependencies for Real Estate Appraisal
Exploiting Geographic Dependencies for Real Estate Appraisal Yanjie Fu Joint work with Hui Xiong, Yu Zheng, Yong Ge, Zhihua Zhou, Zijun Yao Rutgers, the State University of New Jersey Microsoft Research
More informationNearest Neighbor Search with Keywords
Nearest Neighbor Search with Keywords Yufei Tao KAIST June 3, 2013 In recent years, many search engines have started to support queries that combine keyword search with geography-related predicates (e.g.,
More informationSkylines. Yufei Tao. ITEE University of Queensland. INFS4205/7205, Uni of Queensland
Yufei Tao ITEE University of Queensland Today we will discuss problems closely related to the topic of multi-criteria optimization, where one aims to identify objects that strike a good balance often optimal
More informationBehavioral Simulations in MapReduce
Behavioral Simulations in MapReduce Guozhang Wang, Marcos Vaz Salles, Benjamin Sowell, Xun Wang, Tuan Cao, Alan Demers, Johannes Gehrke, Walker White Cornell University 1 What are Behavioral Simulations?
More informationNearest Neighbor Search with Keywords in Spatial Databases
776 Nearest Neighbor Search with Keywords in Spatial Databases 1 Sphurti S. Sao, 2 Dr. Rahila Sheikh 1 M. Tech Student IV Sem, Dept of CSE, RCERT Chandrapur, MH, India 2 Head of Department, Dept of CSE,
More informationA Survey on Spatial-Keyword Search
A Survey on Spatial-Keyword Search (COMP 6311C Advanced Data Management) Nikolaos Armenatzoglou 06/03/2012 Outline Problem Definition and Motivation Query Types Query Processing Techniques Indices Algorithms
More informationStyle-aware Mid-level Representation for Discovering Visual Connections in Space and Time
Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time Experiment presentation for CS3710:Visual Recognition Presenter: Zitao Liu University of Pittsburgh ztliu@cs.pitt.edu
More informationQuickScorer: a fast algorithm to rank documents with additive ensembles of regression trees
QuickScorer: a fast algorithm to rank documents with additive ensembles of regression trees Claudio Lucchese, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto HPC Lab, ISTI-CNR, Pisa, Italy & Tiscali
More informationDiagnosing New York City s Noises with Ubiquitous Data
Diagnosing New York City s Noises with Ubiquitous Data Dr. Yu Zheng yuzheng@microsoft.com Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Background Many cities suffer
More informationAn Efficient Sampling Method for Characterizing Points of Interests on Maps
An Efficient Sampling Method for Characterizing Points of Interests on Maps Team 1 Qiuyi Hong, Caidan Liu, Zhenhua Li, Jiaqi Liu, Shaowei Gong Outline Background and formulated problem Challenges Our methods
More informationCS6220: DATA MINING TECHNIQUES
CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu March 16, 2016 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision
More informationPoint-of-Interest Recommendations: Learning Potential Check-ins from Friends
Point-of-Interest Recommendations: Learning Potential Check-ins from Friends Huayu Li, Yong Ge +, Richang Hong, Hengshu Zhu University of North Carolina at Charlotte + University of Arizona Hefei University
More informationPersonalized Social Recommendations Accurate or Private
Personalized Social Recommendations Accurate or Private Presented by: Lurye Jenny Paper by: Ashwin Machanavajjhala, Aleksandra Korolova, Atish Das Sarma Outline Introduction Motivation The model General
More informationCollaborative Nowcasting for Contextual Recommendation
Collaborative for Contextual Recommendation Yu Sun 1, Nicholas Jing Yuan 2, Xing Xie 3, Kieran McDonald 4, Rui Zhang 5 University of Melbourne { 1 sun.y, 5 rui.zhang}@unimelb.edu.au Microsoft Research
More informationCarnegie Mellon Univ. Dept. of Computer Science Database Applications. SAMs - Detailed outline. Spatial Access Methods - problem
Carnegie Mellon Univ. Dept. of Computer Science 15-415 - Database Applications Lecture #26: Spatial Databases (R&G ch. 28) SAMs - Detailed outline spatial access methods problem dfn R-trees Faloutsos 2
More informationUnderstanding Wealth in New York City From the Activity of Local Businesses
Understanding Wealth in New York City From the Activity of Local Businesses Vincent S. Chen Department of Computer Science Stanford University vschen@stanford.edu Dan X. Yu Department of Computer Science
More informationSocial and Technological Network Analysis: Spatial Networks, Mobility and Applications
Social and Technological Network Analysis: Spatial Networks, Mobility and Applications Anastasios Noulas Computer Laboratory, University of Cambridge February 2015 Today s Outline 1. Introduction to spatial
More informationTASM: Top-k Approximate Subtree Matching
TASM: Top-k Approximate Subtree Matching Nikolaus Augsten 1 Denilson Barbosa 2 Michael Böhlen 3 Themis Palpanas 4 1 Free University of Bozen-Bolzano, Italy augsten@inf.unibz.it 2 University of Alberta,
More informationQuery Processing in Spatial Network Databases
Temporal and Spatial Data Management Fall 0 Query Processing in Spatial Network Databases SL06 Spatial network databases Shortest Path Incremental Euclidean Restriction Incremental Network Expansion Spatial
More informationLab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018
Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google
More informationProgressive & Algorithms & Systems
University of California Merced Lawrence Berkeley National Laboratory Progressive Computation for Data Exploration Progressive Computation Online Aggregation (OLA) in DB Query Result Estimate Result ε
More informationCS6220: DATA MINING TECHNIQUES
CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu November 16, 2015 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision
More informationFacebook Friends! and Matrix Functions
Facebook Friends! and Matrix Functions! Graduate Research Day Joint with David F. Gleich, (Purdue), supported by" NSF CAREER 1149756-CCF Kyle Kloster! Purdue University! Network Analysis Use linear algebra
More informationELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties
ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties Prof. James She james.she@ust.hk 1 Last lecture 2 Selected works from Tutorial
More informationTowards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann
Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya anki, Bojana Dimcheva, Donald Kossmann Systems Group ETH Zurich Moving Objects Intersection Finding Position at a future
More informationRegularity and Conformity: Location Prediction Using Heterogeneous Mobility Data
Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data Yingzi Wang 12, Nicholas Jing Yuan 2, Defu Lian 3, Linli Xu 1 Xing Xie 2, Enhong Chen 1, Yong Rui 2 1 University of Science
More informationDiscovering Spatial and Temporal Links among RDF Data Panayiotis Smeros and Manolis Koubarakis
Discovering Spatial and Temporal Links among RDF Data Panayiotis Smeros and Manolis Koubarakis WWW2016 Workshop: Linked Data on the Web (LDOW2016) April 12, 2016 - Montréal, Canada Outline Introduction
More informationNot All Apps Are Created Equal:
Not All Apps Are Created Equal: Analysis of Spatiotemporal Heterogeneity in Nationwide Mobile Service Usage Cristina Marquez and Marco Gramaglia (Universidad Carlos III de Madrid); Marco Fiore (CNR-IEIIT);
More informationOptimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates
Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates X I A O YA N G W A N G 1, Y I N G Z H A N G 2, W E N J I E Z H A N G 1, X U E M I N L I N 1, M U H A M M A D A A M I R C H
More informationNearest Neighbor Search with Keywords: Compression
Nearest Neighbor Search with Keywords: Compression Yufei Tao KAIST June 3, 2013 In this lecture, we will continue our discussion on: Problem (Nearest Neighbor Search with Keywords) Let P be a set of points
More informationWeb Structure Mining Nodes, Links and Influence
Web Structure Mining Nodes, Links and Influence 1 Outline 1. Importance of nodes 1. Centrality 2. Prestige 3. Page Rank 4. Hubs and Authority 5. Metrics comparison 2. Link analysis 3. Influence model 1.
More informationModel of Stars 5 Oct
Outline Test 1 Model of Stars 5 Oct Missouri Club on Fri Questions on parallax Hot plate model of a star Thermal radiation Hertzsprung Russell diagram Missouri Club for Hwk 4 Test 1: Average 15 (60%) Test
More informationOutline for today. Information Retrieval. Cosine similarity between query and document. tf-idf weighting
Outline for today Information Retrieval Efficient Scoring and Ranking Recap on ranked retrieval Jörg Tiedemann jorg.tiedemann@lingfil.uu.se Department of Linguistics and Philology Uppsala University Efficient
More informationCollaborative Filtering. Radek Pelánek
Collaborative Filtering Radek Pelánek 2017 Notes on Lecture the most technical lecture of the course includes some scary looking math, but typically with intuitive interpretation use of standard machine
More informationWhere to Find My Next Passenger?
Where to Find My Next Passenger? Jing Yuan 1 Yu Zheng 2 Liuhang Zhang 1 Guangzhong Sun 1 1 University of Science and Technology of China 2 Microsoft Research Asia September 19, 2011 Jing Yuan et al. (USTC,MSRA)
More informationFrequency-hiding Dependency-preserving Encryption for Outsourced Databases
Frequency-hiding Dependency-preserving Encryption for Outsourced Databases ICDE 17 Boxiang Dong 1 Wendy Wang 2 1 Montclair State University Montclair, NJ 2 Stevens Institute of Technology Hoboken, NJ April
More informationECEN 689 Special Topics in Data Science for Communications Networks
ECEN 689 Special Topics in Data Science for Communications Networks Nick Duffield Department of Electrical & Computer Engineering Texas A&M University Lecture 5 Optimizing Fixed Size Samples Sampling as
More informationAdapting an Existing Activity Based Modeling Structure for the New York Region
Adapting an Existing Activity Based Modeling Structure for the New York Region presented to 2018 TRB Innovations in Travel Modeling Conference Attendees presented by Rachel Copperman Jason Lemp with Thomas
More informationSpatial Keyword Search by using Point Regional QuadTree Algorithm
Spatial Keyword Search by using Point Regional QuadTree Algorithm Ms Harshitha A.R 1, Smt. Christy Persya A 2 PG Scholar, Department of ISE (SSE), BNMIT, Bangalore 560085, Karnataka, India 1 Associate
More informationSlides based on those in:
Spyros Kontogiannis & Christos Zaroliagis Slides based on those in: http://www.mmds.org High dim. data Graph data Infinite data Machine learning Apps Locality sensitive hashing PageRank, SimRank Filtering
More informationLatent Geographic Feature Extraction from Social Media
Latent Geographic Feature Extraction from Social Media Christian Sengstock* Michael Gertz Database Systems Research Group Heidelberg University, Germany November 8, 2012 Social Media is a huge and increasing
More informationNeural networks. Chapter 20. Chapter 20 1
Neural networks Chapter 20 Chapter 20 1 Outline Brains Neural networks Perceptrons Multilayer networks Applications of neural networks Chapter 20 2 Brains 10 11 neurons of > 20 types, 10 14 synapses, 1ms
More informationExploring Urban Areas of Interest. Yingjie Hu and Sathya Prasad
Exploring Urban Areas of Interest Yingjie Hu and Sathya Prasad What is Urban Areas of Interest (AOIs)? What is Urban Areas of Interest (AOIs)? Urban AOIs exist in people s minds and defined by people s
More informationOutline. Computer Science 331. Cost of Binary Search Tree Operations. Bounds on Height: Worst- and Average-Case
Outline Computer Science Average Case Analysis: Binary Search Trees Mike Jacobson Department of Computer Science University of Calgary Lecture #7 Motivation and Objective Definition 4 Mike Jacobson (University
More informationSpatial Database. Ahmad Alhilal, Dimitris Tsaras
Spatial Database Ahmad Alhilal, Dimitris Tsaras Content What is Spatial DB Modeling Spatial DB Spatial Queries The R-tree Range Query NN Query Aggregation Query RNN Query NN Queries with Validity Information
More informationALMA Cycle 6 Information and Cycle 4 and Cycle 5 Statistics
ALMA Cycle 6 Information and Cycle 4 and Cycle 5 Statistics Yu-Nung Su (ASIAA) on behalf of Taiwan ARC node ALMA Cycle 6 Users Workshop March 26, 2018 Photo Credit: Wei-Hao Wang Cycle 6 Timeline (23:00
More informationUnveiling the misteries of our Galaxy with Intersystems Caché
Unveiling the misteries of our Galaxy with Intersystems Caché Dr. Jordi Portell i de Mora on behalf of the Gaia team at the Institute for Space Studies of Catalonia (IEEC UB) and the Gaia Data Processing
More informationCS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine
CS 277: Data Mining Mining Web Link Structure Class Presentations In-class, Tuesday and Thursday next week 2-person teams: 6 minutes, up to 6 slides, 3 minutes/slides each person 1-person teams 4 minutes,
More informationpast balancing schemes require maintenance of balance info at all times, are aggresive use work of searches to pay for work of rebalancing
1 Splay Trees Sleator and Tarjan, Self Adjusting Binary Search Trees JACM 32(3) 1985 The claim planning ahead. But matches previous idea of being lazy, letting potential build up, using it to pay for expensive
More informationData Mining Recitation Notes Week 3
Data Mining Recitation Notes Week 3 Jack Rae January 28, 2013 1 Information Retrieval Given a set of documents, pull the (k) most similar document(s) to a given query. 1.1 Setup Say we have D documents
More informationFinding Top-k Preferable Products
JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO., JANUARY 7 Finding Top-k Preferable Products Yu Peng, Raymond Chi-Wing Wong and Qian Wan Abstract The importance of dominance and skyline analysis has been
More informationLarge-Scale Behavioral Targeting
Large-Scale Behavioral Targeting Ye Chen, Dmitry Pavlov, John Canny ebay, Yandex, UC Berkeley (This work was conducted at Yahoo! Labs.) June 30, 2009 Chen et al. (KDD 09) Large-Scale Behavioral Targeting
More informationActivity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness
Activity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness Lian Huang, Qingquan Li, Yang Yue State Key Laboratory of Information Engineering in Survey, Mapping and Remote
More informationTopic 4 Randomized algorithms
CSE 103: Probability and statistics Winter 010 Topic 4 Randomized algorithms 4.1 Finding percentiles 4.1.1 The mean as a summary statistic Suppose UCSD tracks this year s graduating class in computer science
More informationArcGIS Enterprise: Out-of-the-Box Spatial Analysis. Vicki Cove Hilary Curtis
ArcGIS Enterprise: Out-of-the-Box Spatial Analysis Vicki Cove Hilary Curtis Agenda What is spatial analysis? Spatial analysis with ArcGIS Enterprise Analysis demos: - Sunflower proximity to bees - Tourists
More informationPredictive Nearest Neighbor Queries Over Uncertain Spatial-Temporal Data
Predictive Nearest Neighbor Queries Over Uncertain Spatial-Temporal Data Jinghua Zhu, Xue Wang, and Yingshu Li Department of Computer Science, Georgia State University, Atlanta GA, USA, jhzhu.ellen@gmail.com
More informationRaRE: Social Rank Regulated Large-scale Network Embedding
RaRE: Social Rank Regulated Large-scale Network Embedding Authors: Yupeng Gu 1, Yizhou Sun 1, Yanen Li 2, Yang Yang 3 04/26/2018 The Web Conference, 2018 1 University of California, Los Angeles 2 Snapchat
More informationTailored Bregman Ball Trees for Effective Nearest Neighbors
Tailored Bregman Ball Trees for Effective Nearest Neighbors Frank Nielsen 1 Paolo Piro 2 Michel Barlaud 2 1 Ecole Polytechnique, LIX, Palaiseau, France 2 CNRS / University of Nice-Sophia Antipolis, Sophia
More informationHoldout and Cross-Validation Methods Overfitting Avoidance
Holdout and Cross-Validation Methods Overfitting Avoidance Decision Trees Reduce error pruning Cost-complexity pruning Neural Networks Early stopping Adjusting Regularizers via Cross-Validation Nearest
More informationAlgorithms for Calculating Statistical Properties on Moving Points
Algorithms for Calculating Statistical Properties on Moving Points Dissertation Proposal Sorelle Friedler Committee: David Mount (Chair), William Gasarch Samir Khuller, Amitabh Varshney January 14, 2009
More informationIntroduction to Data Mining
Introduction to Data Mining Lecture #9: Link Analysis Seoul National University 1 In This Lecture Motivation for link analysis Pagerank: an important graph ranking algorithm Flow and random walk formulation
More informationTime-aware Point-of-interest Recommendation
Time-aware Point-of-interest Recommendation Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, and Nadia Magnenat-Thalmann School of Computer Engineering Nanyang Technological University Presented by ShenglinZHAO
More informationMulti-Dimensional Top-k Dominating Queries
Noname manuscript No. (will be inserted by the editor) Multi-Dimensional Top-k Dominating Queries Man Lung Yiu Nikos Mamoulis Abstract The top-k dominating query returns k data objects which dominate the
More informationRecommendation Systems
Recommendation Systems Pawan Goyal CSE, IITKGP October 21, 2014 Pawan Goyal (IIT Kharagpur) Recommendation Systems October 21, 2014 1 / 52 Recommendation System? Pawan Goyal (IIT Kharagpur) Recommendation
More informationDYNAMIC TRIP ATTRACTION ESTIMATION WITH LOCATION BASED SOCIAL NETWORK DATA BALANCING BETWEEN TIME OF DAY VARIATIONS AND ZONAL DIFFERENCES
DYNAMIC TRIP ATTRACTION ESTIMATION WITH LOCATION BASED SOCIAL NETWORK DATA BALANCING BETWEEN TIME OF DAY VARIATIONS AND ZONAL DIFFERENCES Nicholas W. Hu a, Peter J. Jin b, * a Department of Civil and Environmental
More informationP-LAG: Location-aware Group Recommendation for Passive Users
P-LAG: Location-aware Group Recommendation for Passive Users Yuqiu Qian, Ziyu Lu, Nikos Mamoulis, and David W. Cheung The University of Hong Kong {yqqian, zylu, dcheung}@cs.hku.hk University of Ioannina
More informationVehicle Routing with Traffic Congestion and Drivers Driving and Working Rules
Vehicle Routing with Traffic Congestion and Drivers Driving and Working Rules A.L. Kok, E.W. Hans, J.M.J. Schutten, W.H.M. Zijm Operational Methods for Production and Logistics, University of Twente, P.O.
More informationSocio-Spatial Group Queries for Impromptu Activity Planning
Socio-Spatial Group Queries for Impromptu Activity Planning Chih-Ya Shen, De-Nian Yang, Senior Member, IEEE, Liang-Hao Huang, Wang-Chien Lee, Member, IEEE, and Ming-Syan Chen, Fellow, IEEE arxiv:55.68v
More informationSocial Studies Grade 2 - Building a Society
Social Studies Grade 2 - Building a Society Description The second grade curriculum provides students with a broad view of the political units around them, specifically their town, state, and country.
More informationAttentive Betweenness Centrality (ABC): Considering Options and Bandwidth when Measuring Criticality
Attentive Betweenness Centrality (ABC): Considering Options and Bandwidth when Measuring Criticality Sibel Adalı, Xiaohui Lu, M. Magdon-Ismail September 5, 0 Who is the Most Critical? Would you use the
More informationSpatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran
Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran January 2018 Niloofar HAJI MIRZA AGHASI Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran
More informationLearning a Degree-Augmented Distance Metric from a Network. Bert Huang, U of Maryland Blake Shaw, Foursquare Tony Jebara, Columbia U
Learning a Degree-Augmented Distance Metric from a Network Bert Huang, U of Maryland Blake Shaw, Foursquare Tony Jebara, Columbia U Beyond Mahalanobis: Supervised Large-Scale Learning of Similarity NIPS
More informationPowers functions. Composition and Inverse of functions.
Chapter 4 Powers functions. Composition and Inverse of functions. 4.1 Power functions We have already encountered some examples of power functions in the previous chapters, in the context of polynomial
More informationMining Users Behaviors and Environments for Semantic Place Prediction
Mining Users Behaviors and Environments for Semantic Place Prediction Chi-Min Huang Institute of Computer Science and Information Engineering National Cheng Kung University No.1, University Road, Tainan
More informationInferring Friendship from Check-in Data of Location-Based Social Networks
Inferring Friendship from Check-in Data of Location-Based Social Networks Ran Cheng, Jun Pang, Yang Zhang Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg
More informationAdvanced Implementations of Tables: Balanced Search Trees and Hashing
Advanced Implementations of Tables: Balanced Search Trees and Hashing Balanced Search Trees Binary search tree operations such as insert, delete, retrieve, etc. depend on the length of the path to the
More informationSPATIAL INDEXING. Vaibhav Bajpai
SPATIAL INDEXING Vaibhav Bajpai Contents Overview Problem with B+ Trees in Spatial Domain Requirements from a Spatial Indexing Structure Approaches SQL/MM Standard Current Issues Overview What is a Spatial
More informationExploring the Impact of Ambient Population Measures on Crime Hotspots
Exploring the Impact of Ambient Population Measures on Crime Hotspots Nick Malleson School of Geography, University of Leeds http://nickmalleson.co.uk/ N.S.Malleson@leeds.ac.uk Martin Andresen Institute
More informationThanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides
Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank
More informationSocViz: Visualization of Facebook Data
SocViz: Visualization of Facebook Data Abhinav S Bhatele Department of Computer Science University of Illinois at Urbana Champaign Urbana, IL 61801 USA bhatele2@uiuc.edu Kyratso Karahalios Department of
More informationCambridge International Examinations Cambridge Ordinary Level
Cambridge International Examinations Cambridge Ordinary Level *7891959410* GEOGRAPHY 2217/22 Paper 2 October/November 2015 Candidates answer on the Question Paper. Additional Materials: Calculator Ruler
More informationComposite Quantization for Approximate Nearest Neighbor Search
Composite Quantization for Approximate Nearest Neighbor Search Jingdong Wang Lead Researcher Microsoft Research http://research.microsoft.com/~jingdw ICML 104, joint work with my interns Ting Zhang from
More informationRESEARCH ON THE DISTRIBUTED PARALLEL SPATIAL INDEXING SCHEMA BASED ON R-TREE
RESEARCH ON THE DISTRIBUTED PARALLEL SPATIAL INDEXING SCHEMA BASED ON R-TREE Yuan-chun Zhao a, b, Cheng-ming Li b a. Shandong University of Science and Technology, Qingdao 266510 b. Chinese Academy of
More informationMulti-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts
Multi-Plant Photovoltaic Energy Forecasting Challenge with Regression Tree Ensembles and Hourly Average Forecasts Kathrin Bujna 1 and Martin Wistuba 2 1 Paderborn University 2 IBM Research Ireland Abstract.
More informationThanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides
Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank
More informationBlog Community Discovery and Evolution
Blog Community Discovery and Evolution Mutual Awareness, Interactions and Community Stories Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura and Belle Tseng What do people feel about Hurricane Katrina?
More informationThe direction-constrained k nearest neighbor query
Geoinformatica (2016) 20:471 502 DOI 10.1007/s10707-016-0245-2 The direction-constrained k nearest neighbor query Dealing with spatio-directional objects Min-Joong Lee 2 Dong-Wan Choi 3 SangYeon Kim 2
More informationI N T H E W O O D S FEBRUARY HALF TERM ACTIVITY GUIDE ( F O R T H E E N C H A N T E D ) C A L L A M B E R
I N T H E W O O D S ( F O R T H E E N C H A N T E D ) FEBRUARY HALF TERM ACTIVITY GUIDE C A L L A M B E R 0 8 9 7 9 0 4 3 6 ACTIVITY GUIDE I N T H E W O O D S KIDS FOR ALL SEASONS The kids' club, on the
More informationNote that numerically, with white corresponding to 0 and black to 1, the rule can be written:
Cellular automata We discuss cellular automata as a simple application of MATLAB programming and as an accessible scientific topic of recent interest. You can find a lot of information on the internet.
More informationMulti-Approximate-Keyword Routing Query
Bin Yao 1, Mingwang Tang 2, Feifei Li 2 1 Department of Computer Science and Engineering Shanghai Jiao Tong University, P. R. China 2 School of Computing University of Utah, USA Outline 1 Introduction
More informationLABELING RESIDENTIAL COMMUNITY CHARACTERISTICS FROM COLLECTIVE ACTIVITY PATTERNS USING TAXI TRIP DATA
LABELING RESIDENTIAL COMMUNITY CHARACTERISTICS FROM COLLECTIVE ACTIVITY PATTERNS USING TAXI TRIP DATA Yang Zhou 1, 3, *, Zhixiang Fang 2 1 Wuhan Land Use and Urban Spatial Planning Research Center, 55Sanyang
More informationarxiv: v2 [cs.db] 5 May 2011
Inverse Queries For Multidimensional Spaces Thomas Bernecker 1, Tobias Emrich 1, Hans-Peter Kriegel 1, Nikos Mamoulis 2, Matthias Renz 1, Shiming Zhang 2, and Andreas Züfle 1 arxiv:113.172v2 [cs.db] May
More informationYour World is not Red or Green. Good Practice in Data Display and Dashboard Design
Your World is not Red or Green Good Practice in Data Display and Dashboard Design References Tufte, E. R. (2). The visual display of quantitative information (2nd Ed.). Cheshire, CT: Graphics Press. Few,
More informationDecision Trees. None Some Full > No Yes. No Yes. No Yes. No Yes. No Yes. No Yes. No Yes. Patrons? WaitEstimate? Hungry? Alternate?
Decision rees Decision trees is one of the simplest methods for supervised learning. It can be applied to both regression & classification. Example: A decision tree for deciding whether to wait for a place
More informationImpression Store: Compressive Sensing-based Storage for. Big Data Analytics
Impression Store: Compressive Sensing-based Storage for Big Data Analytics Jiaxing Zhang, Ying Yan, Liang Jeff Chen, Minjie Wang, Thomas Moscibroda & Zheng Zhang Microsoft Research The Curse of O(N) in
More informationThe Flexible Socio Spatial Group Queries
The Flexible Socio Spatial Group Queries ishwamittra Ghosh UT, angladesh bishwamittra.ghosh@gmail.com Sajid Hasan pon UT, angladesh sajidhasanapon@gmail.com Mohammed unus li UT, angladesh eunus@cse.buet.ac.bd
More informationPhoto Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II
Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II Noah Snavely Cornell James Hays CMU MIT (Fall 2009) Brown (Spring 2010 ) Complexity of the Visual World
More informationEstimating Large Scale Population Movement ML Dublin Meetup
Deutsche Bank COO Chief Data Office Estimating Large Scale Population Movement ML Dublin Meetup John Doyle PhD Assistant Vice President CDO Research & Development Science & Innovation john.doyle@db.com
More informationReview of Some Fast Algorithms for Electromagnetic Scattering
Review of Some Fast Algorithms for Electromagnetic Scattering Weng Cho Chew Center for Computational Electromagnetics and Electromagnetic Laboratory University of Illinois at Urbana-Champaign CSCAMM Lecture
More informationCaesar s Taxi Prediction Services
1 Caesar s Taxi Prediction Services Predicting NYC Taxi Fares, Trip Distance, and Activity Paul Jolly, Boxiao Pan, Varun Nambiar Abstract In this paper, we propose three models each predicting either taxi
More information