SPATIAL INDEXING. Vaibhav Bajpai

Similar documents
Carnegie Mellon Univ. Dept. of Computer Science Database Applications. SAMs - Detailed outline. Spatial Access Methods - problem

SPATIAL DATA MINING. Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM

EEOS 381 -Spatial Databases and GIS Applications

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

Spatial analysis in XML/GML/SVG based WebGIS

Problem: Data base too big to fit memory Disk reads are slow. Example: 1,000,000 records on disk Binary search might take 20 disk reads

Multimedia Databases 1/29/ Indexes for Multimedia Data Indexes for Multimedia Data Indexes for Multimedia Data

Multimedia Databases. Previous Lecture. 4.1 Multiresolution Analysis. 4 Shape-based Features. 4.1 Multiresolution Analysis

Multimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig

Multimedia Databases. 4 Shape-based Features. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis

Geodatabase An Introduction

The File Geodatabase API. Dave Sousa, Lance Shipman

Introduction to GIS I

5 Spatial Access Methods

5 Spatial Access Methods

Using the File Geodatabase API. Lance Shipman David Sousa

Working with the Geodatabase

12 Review and Outlook

Overview key concepts and terms (based on the textbook Chang 2006 and the practical manual)

Innovation. The Push and Pull at ESRI. September Kevin Daugherty Cadastral/Land Records Industry Solutions Manager

The File Geodatabase API. Craig Gillgrass Lance Shipman

Geodatabase Essentials Part One - Intro to the Geodatabase. Jonathan Murphy Colin Zwicker

INTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEM By Reshma H. Patil

Composite Quantization for Approximate Nearest Neighbor Search

Esri Overview for Mentor Protégé Program:

Introduction-Overview. Why use a GIS? What can a GIS do? Spatial (coordinate) data model Relational (tabular) data model

Spatial Database. Ahmad Alhilal, Dimitris Tsaras

Nearest Neighbor Search with Keywords

Environment (Parallelizing Query Optimization)

GIS CONCEPTS ARCGIS METHODS AND. 2 nd Edition, July David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University

A. Boluwade 1 and A Ferdinand. Ste Anne de Bellevue, QC, H9X 3V9, Canada. ABSTRACT

Georelational Vector Data Model

Geometric Algorithms in GIS

GIS Data Structure: Raster vs. Vector RS & GIS XXIII

Oracle Spatial: Essentials

GEOGRAPHY 350/550 Final Exam Fall 2005 NAME:

GIS Lecture 4: Data. GIS Tutorial, Third Edition GIS 1

ISU GIS CENTER S ARCSDE USER'S GUIDE AND DATA CATALOG

Nearest Neighbor Search with Keywords in Spatial Databases

Popular Mechanics, 1954

Karsten Vennemann, Seattle. QGIS Workshop CUGOS Spring Fling 2015

Applied Cartography and Introduction to GIS GEOG 2017 EL. Lecture-2 Chapters 3 and 4

Nearest Neighbor Search with Keywords: Compression

Geographers Perspectives on the World

Geometric Algorithms in GIS

Geog 469 GIS Workshop. Data Analysis

AN EFFICIENT INDEXING STRUCTURE FOR TRAJECTORIES IN GEOGRAPHICAL INFORMATION SYSTEMS SOWJANYA SUNKARA

GEO-DATA INPUT AND CONVERSION. Christos G. Karydas,, Dr. Lab of Remote Sensing and GIS Director: Prof. N. Silleos

Cross Comparison of Spatially Enabled Databases: PostGIS, SQL Server and JASPA

NR402 GIS Applications in Natural Resources

What s New. August 2013

Tailored Bregman Ball Trees for Effective Nearest Neighbors

Welcome to NR502 GIS Applications in Natural Resources. You can take this course for 1 or 2 credits. There is also an option for 3 credits.

SuperGIS Server 3.3 Functionality Matrix

Introduction to GIS. Dr. M.S. Ganesh Prasad

Advanced Implementations of Tables: Balanced Search Trees and Hashing

43400 Serdang Selangor, Malaysia Serdang Selangor, Malaysia 4

Probabilistic Cardinal Direction Queries On Spatio-Temporal Data

For the IT professional Information management solutions IBM Spatial Solutions

Geography 38/42:376 GIS II. Topic 1: Spatial Data Representation and an Introduction to Geodatabases. The Nature of Geographic Data

Distribution-specific analysis of nearest neighbor search and classification

Census Mapping with ArcGIS

Introduction to Geographic Information Systems

October 2011 ArcGIS 10 for Server Functionality Matrix

Basics of GIS. by Basudeb Bhatta. Computer Aided Design Centre Department of Computer Science and Engineering Jadavpur University

Raster Data Model. Examples of raster data Remotely sensed imagery (BV, DN) DEM (elevation) DRG (color) Raster Database

Representation of Geographic Data

Paths Toward CAD and GIS Interoperability

ESRI Object Models and Data Capture 9/18/ /18/2014 M. Helper GEO327G/386G, UT Austin 2. ESRI Arc/Info ArcView ArcGIS

Free and Open Source Software for Cadastre and Land Registration : A Hidden Treasure? Gertrude Pieper Espada. Overview

SEARCHING THROUGH SPATIAL RELATIONSHIPS USING THE 2DR-TREE

GIS CONCEPTS ARCGIS METHODS AND. 3 rd Edition, July David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University

Relational operations

Exam 1. March 12th, CS525 - Midterm Exam Solutions

CS1802 Week 11: Algorithms, Sums, Series, Induction

Indexing Objects Moving on Fixed Networks

STORAGE, MANIPULATION, AND VISUALIZATION OF LIDAR DATA

GeoPackage, das Shapefile der Zukunft

Geodatabase 101 Why, What, & How

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall

Existing Open Source Tools and Possibilities for Cadastre Systems

Lower and Upper Bound Theory

INTRODUCTION TO GIS. Dr. Ori Gudes

On multi-scale display of geometric objects

Pavel Zezula, Giuseppe Amato,

GIS for ChEs Introduction to Geographic Information Systems

Factorized Relational Databases Olteanu and Závodný, University of Oxford

Efficient Spatial Data Structure for Multiversion Management of Engineering Drawings

GEO 463-Geographic Information Systems Applications. Lecture 1

<Insert Picture Here> Oracle Spatial 11g. Dr. Siva Ravada

Algorithms for Picture Analysis. Lecture 07: Metrics. Axioms of a Metric

Geographic Information Systems (GIS) in Environmental Studies ENVS Winter 2003 Session III

How GIS can support the Production

FNRM 3131 Introduction to GIS in Natural Resource Management

Swiss PGDay 2016, 24. Juni 2016, HSR Rapperswil POSTGIS. Überblick, Tips und Tricks. Stefan Keller

Administering your Enterprise Geodatabase using Python. Jill Penney

Dictionary: an abstract data type

DS L19: B Trees

Least-Cost Transportation Corridor Analysis Using Raster Data.

12 Review and Outlook

Transcription:

SPATIAL INDEXING Vaibhav Bajpai

Contents Overview Problem with B+ Trees in Spatial Domain Requirements from a Spatial Indexing Structure Approaches SQL/MM Standard Current Issues

Overview What is a Spatial Data? Who requires Spatial Data? What are types of Spatial Data? What are types of Spatial Data Queries? Spatial Databases and their Applications

Overview What is Spatial Data? Information representing geometry - location - topology 2d - Planar Surface 3D - Volumetric 4D - Spatio-Temporal

Overview Who requires Spatial Data? Local Governments: City Planning Traffic Management Accident Investigation State Provincial Governments:Highway Planning Natural Resource Management National Governments: Defense Border Control Extractive Industries: Mineral Water Location Farming: Plot Allocation

Overview What are types of Spatial Data? Point Data Points in m-dimensional space. No Space No Area No Volume. Raster Point Data: Each Pixel with added measured value Region Data Objects having a spatial extent. Location: A fixed point [Centroid] Boundary: A geometric approximation [Vector Data] constructed using Polygons

Overview What are types of Spatial Data Queries? Point Queries Spatial-Range Queries Nearest-Neighbor Queries Spatial-Join Queries *expensive* Similarity Queries

Overview Spatial Databases normal databases understand numeric character types. need to make them understand geometry and features *SimpleFeatures* specification by OGC Point LineString Polygon Multipoint Multipolygon

Problem with B+ Trees in Spatial Domain Approaches to Adapt to Spatial Domain - Composite B+ Trees Multiple B+ Trees

Problem with B+ Trees in Spatial Domain Composite B+ Trees It linearizes 2D space [-] Spatial Proximity is lost. Close in nature => Close on Disk

Problem with B+ Trees in Spatial Domain Multiple B+ Trees [-] Selects too much un-needed data [-] Index space grows with increasing dimensions

Requirements from a Spatial Indexing Structure Should be scalable to n-dimensions Should have symmetric behavior in all Dimensions Should support Insert Delete gracefully Should support non-point data

Approaches GRIDs Quad Trees Z-Curves X-Trees GiST R-Trees *popular*

!!!!!! Approaches!!!!!! GRIDs 2D surface divided into contiguous cells each cell is assigned unique ID and used for indexing *Equal Angle GRIDs* *Equal Area GRIDs* based on *Space Driven Indexing Method*

Approaches Quad Trees Region-Based Quad Trees Point-Based Quad Trees

Approaches Quad Trees: Oct Trees

Approaches Quad Trees: kd Trees Always Binary Always split along a point

Approaches Z-Curves A space-filling curve Good locality-preserving behavior Z-value is interleaved the binary representation of coord values UB-Tree: B+Tree with records stored in Z-Order

Approaches X-Trees Improves upon the R* Tree Provides a *overlap free* split and *supernode* mechanism

Approaches GiST Generalized Search Tree is both a data structure and an API to build upon. R-Trees and X-Trees are built upon using GiST. PostgreSQL has an off-the-shelf GiST implementation.

Approaches R-Trees leaf: <n-dim box, RID> non-leaf: <n-dim box, Ptr to Child>

Approaches R-Trees Search

Approaches R-Trees Insert Node Splitting Heuristic: min [area(l1) + area(l2)]

Approaches R-Trees Delete

Approaches R-Trees Properties The tree is balanced! Inexact Match is a good thing! Match bounding boxes and exact match later. Good on Average-Case Poor in Worst-Case Use Priority R-Tree

Approaches R-Trees Optimizations Store boxes as *Approximate Regions* compact boxes fast interval checking use *Convex Polygons* reduces overlap reduces complexity

Approaches R-Trees Variants R* Tree minimize *box perimeter* than *box area* *forced re-inserts* R+ Tree insert objects in multiple leaves reduces overlap search now takes place a single path to leaves.

SQL/MM defines how to store, retrieve and process spatial data using SQL defines functions to convert, compare, and process this data. only supports up to 2D data commercial implementations IBM DB2 Spatial Extender IBM Informix Dynamic Server Oracle9 Spatial Product

SQL/MM

Current Issues Sequential Scan > R-Tree when Dimension > 12 Nearest Neighbor Queries are meaningful only at Low Contrast need to empirically test the database