Multiscale Spatio-Temporal Data Aggregation and Mapping for Urban Data Exploration

Similar documents
Point data in mashups: moving away from pushpins in maps

A BASE SYSTEM FOR MICRO TRAFFIC SIMULATION USING THE GEOGRAPHICAL INFORMATION DATABASE

Geographic Analysis of Linguistically Encoded Movement Patterns A Contextualized Perspective

GIS Visualization: A Library s Pursuit Towards Creative and Innovative Research

Joint MISTRAL/CESI lunch workshop 15 th November 2017

How to make R, PostGIS and QGis cooperate for statistical modelling duties: a case study on hedonic regressions

Animating Maps: Visual Analytics meets Geoweb 2.0

DataShine Automated Thematic Mapping of 2011 Census Quick Statistics

How to make R, PostGIS and QGis cooperate for statistical modelling duties: a case study on hedonic regressions

ArcGIS Online Routing and Network Analysis. Deelesh Mandloi Matt Crowder

Changes in the Spatial Distribution of Mobile Source Emissions due to the Interactions between Land-use and Regional Transportation Systems

Sample assessment task. Task details. Content description. Year level 7

Crossing Boundaries Building Tools to Combine Public-Use (and Misaligned) Data

Esri Production Mapping: Map Automation & Advanced Cartography MADHURA PHATERPEKAR JOE SHEFFIELD

Principles of IR. Hacettepe University Department of Information Management DOK 324: Principles of IR

Time Series Analysis with SAR & Optical Satellite Data

GENERALIZATION IN THE NEW GENERATION OF GIS. Dan Lee ESRI, Inc. 380 New York Street Redlands, CA USA Fax:

Dynamic Maps and Historical Context

DP Project Development Pvt. Ltd.

Finding geodata that otherwise would have been forgotten GeoXchange a portal for free geodata

Geometric Algorithms in GIS

From Research Objects to Research Networks: Combining Spatial and Semantic Search

4D information management system for road maintenance using GIS

Changes in Esri GIS, practical ways to be ready for the future

The Scope and Growth of Spatial Analysis in the Social Sciences

Conceptual study of Web-based PPGIS for Designing Built Environment: Identifying Housing Location Preferences in Littleborough

Portal for ArcGIS: An Introduction. Catherine Hynes and Derek Law

Corporate. Information. Railway Infrastructure Administrator. Year indracompany.com

Visualizing Big Data on Maps: Emerging Tools and Techniques. Ilir Bejleri, Sanjay Ranka

Exploring Human Mobility with Multi-Source Data at Extremely Large Metropolitan Scales. ACM MobiCom 2014, Maui, HI

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

ArcGIS is Advancing. Both Contributing and Integrating many new Innovations. IoT. Smart Mapping. Smart Devices Advanced Analytics

Transit Time Shed Analyzing Accessibility to Employment and Services

The Challenge of Geospatial Big Data Analysis

Basics of GIS reviewed

Collection and Analyses of Crowd Travel Behaviour Data by using Smartphones

Personal Field Data Collection by UM-FieldGIS (Integration of Google Map API to Mobile GIS)

Overview. Everywhere. Over everything.

One platform for desktop, web and mobile

Geographic Data Science - Lecture II

FIRE DEPARMENT SANTA CLARA COUNTY

Oregon Department of Transportation. Geographic Information Systems Strategic Plan

Introduction to Coastal GIS

How to make R, PostGIS and QGis cooperate for statistical modelling duties

2013 AND 2025 THE FUTURE OF GIS

Development of a server to manage a customised local version of OpenStreetMap in Ireland

Spatial Information Retrieval

an accessible interface to marine environmental data Russell Moffitt

Design of a Map-Based Transit Itinerary Planner

ADAPTABLE DASHBOARD FOR VISUALIZATION OF ORIGIN-DESTINATION DATA PATTERNS

Introduction to ArcGIS Maps for Office. Greg Ponto Scott Ball

Visualizing Energy Usage and Consumption of the World

TOWARDS THE DEVELOPMENT OF A MONITORING SYSTEM FOR PLANNING POLICY Residential Land Uses Case study of Brisbane, Melbourne, Chicago and London

Smart Card clustering to extract typical temporal passenger habits in Transit network. Two case studies: Rennes in France and Gatineau in Canada

GIS4Graph: a tool for analyzing (geo)graphs applied to study efficiency in a street network

ADAPTABLE DASHBOARD FOR VISUALIZATION OF ORIGIN-DESTINATION DATA PATTERNS

DYNAMIC PORTRAYAL AND DIRECT LOCATION METHOD FOR

VISUALIZATION OF SPATIO-TEMPORAL PATTERNS IN PUBLIC TRANSPORT DATA

ArcGIS GeoAnalytics Server: An Introduction. Sarah Ambrose and Ravi Narayanan

An introduction to ArcGIS Maps for Office. Scott Ball & Mike Flanagan

A HIS-GIS for 200 Years of Belgian Territorial Structures ( )

Department of Computer Science Federal University of São Carlos São Carlos SP, Brazil

MapOSMatic, free city maps for everyone!

Tutorial: Urban Trajectory Visualization. Case Studies. Ye Zhao

Global Grids and the Future of Geospatial Computing. Kevin M. Sahr Department of Computer Science Southern Oregon University

Information System as a Tool for Marine Spatial Planning The SmartSea Vision and a Prototype

THE 3D SIMULATION INFORMATION SYSTEM FOR ASSESSING THE FLOODING LOST IN KEELUNG RIVER BASIN

ELF products in the ArcGIS platform

Spatial Analysis with Web GIS. Rachel Weeden

Web georeference of historical maps

3D-geodata infrastructure in the city of coburg - Origin process und vision. INGeoForum e.v. Open Geospatial Consortium

Navigating to Success: Finding Your Way Through the Challenges of Map Digitization

FUNDAMENTALS OF GEOINFORMATICS PART-II (CLASS: FYBSc SEM- II)

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

Canadian Historical GIS Partnership Development: Taking Steps for Historical Mapping in Canada

ArcGIS Enterprise: What s New. Philip Heede Shannon Kalisky Melanie Summers Sam Williamson

LUCAS Technical reference document U1 LUCAS Survey data user guide. (Land Use / Cover Area Frame Survey)

Joint-accessibility Design (JAD) Thomas Straatemeier

Spatial Data Infrastructure Concepts and Components. Douglas Nebert U.S. Federal Geographic Data Committee Secretariat

Leveraging Web GIS: An Introduction to the ArcGIS portal

USAGE OF OLD MAPS AND THEIR DISTRIBUTION WITHIN THE FRAME OF WEB MAP SERVICES

Implementing Visual Analytics Methods for Massive Collections of Movement Data

ARCHAVE : A Virtual Reality Interface for Archaeological 3D GIS. Master s Thesis Proposal by Daniel Acevedo Feliz

Mastering Geospatial Complexity in Planning Processes Leonard Daws Geospatial Information Analyst September 2015 OZRI 2015

The SyMoGIH project and Geo-Larhra: A method and a collaborative platform for a digital historical atlas

Institute of Statistical and Geographical Information of Jalisco State Subnational Statistical and Geographical System India.

Smart Data Collection and Real-time Digital Cartography

Write a report (6-7 pages, double space) on some examples of Internet Applications. You can choose only ONE of the following application areas:

GIS for Crime Analysis. Building Better Analysis Capabilities with the ArcGIS Platform

MapOSMatic: city maps for the masses

GEOGRAPHIC INFORMATION SYSTEMS Session 8

You are Building Your Organization s Geographic Knowledge

Part 1: Fundamentals

Encapsulating Urban Traffic Rhythms into Road Networks

Brazil Paper for the. Second Preparatory Meeting of the Proposed United Nations Committee of Experts on Global Geographic Information Management

MEU A cartographic-based web-platform for urban energy management and planning

SOFTWARE ARCHITECTURE DESIGN OF GIS WEB SERVICE AGGREGATION BASED ON SERVICE GROUP

Using MAGIC to Access Spatial Imagery: Putting ER Mapper Image Web Server, ArcIMS and MrSID to work in your Library

Climate Risk Visualization for Adaptation Planning and Emergency Response

A Land Use Transport Model for Greater London:

Transcription:

Multiscale Spatio-Temporal Data Aggregation and Mapping for Urban Data Exploration Etienne Co me1 and Anaı s Remy2 1- Universite Paris-Est, COSYS, GRETTIA, IFSTTAR, F-77447 Marne-la-Valle e, France 2- SNCF, Innovation & Recherche, 40 avenue des terroirs de France, 75611 Paris, France Abstract. Maps seem the most intuitive way to visualize massive urban data but they also raise some well-known graphical problems (such as visual clutter, etc.). This paper focuses on processing massive spatiotemporal data in order to ease multi-scale exploration. To this end, we describe a preprocessing tool that enables the automatic creation of a multi-resolution grid from a high resolution grid of spatio-temporal data in a format compatible with webmapping applications (vector tiles). The use of this tool is exemplified through a prototype that offers the possibility to navigate into a massive itinerary request dataset collected in the Ile-deFrance region. 1 Interactive Maps and Massive Urban Data In this work, we focus on processing massive mobility data in order to provide multi-scale information while tacking into account temporal dynamics. To address these issues, we developed a code called pypixgrid for vector tile generation from grid data. Tile systems are now commonly used for geographical multi-scale data management. We propose here a generic tool that automatically generates vector tiles from any massive grid data and allows: A precise aggregation control process from a zoom level to another; The management of temporal aspects additionally to geographic features. In a second step, we developed an interactive visualization prototype, called living mobility generators based on itinerary requests data and directly connected to the previously generated vector tiles. This case study is a first illustration of pypixgrid code usage. Our goal is to visually explore mobility dynamics and their links with territorial features. A key issue is to provide a simple interface allowing an intuitively navigation through spatial and temporal features. This work is a proof of concept developed in collaboration between SNCF Innovation & Research department and IFSTTAR. Data processing and visualization have been implemented on a set of historical data and on different French areas at urban and regional levels. The authors thank SNCF Transilien and Keolis Lyon for supporting this research project. 429

2 Spatio-Temporal Multi-Scale Data Aggregation for Interactive Exploration One of the main challenge in providing a fluid navigation in massive urban datasets come from the size of the data and their spatio-temporal nature. A classical way to solve this question is to perform some sort of aggregation (binning) in order to reduce the size. This technique may also be used to avoid visual clutter and even enhanced using a data-dependent binning [1]. Such lossy compression techniques could solve the problem of data weight but open the question of defining an appropriate bin size. Depending on the chosen bin size, it is well known (in statistics and geography [2]) that different aspects of the underlying data could be observed (or hidden). A natural extension of this concept is therefore to propose several scale of analysis to the viewer by computing several aggregations with different bin sizes. This approach has another advantage: if the aggregation scale is proportional to the View scale (using coarse binning for large scale views and finer binning for zoomed views) the size of the data needed to draw the map can in fact be controlled. At fine scales where the binning is small (and where therefore the data weight more) only a small part of the whole dataset is visible to the user and we can provide only the relevant part of the dataset to the user. This technique is commonly used in web-mapping where network bandwidth limits the amount of data that can be rapidly transfered to the viewer. Building on this well known technique for spatial data we build an automatic multi-scale aggregation tool for statistical data aggregation that we call pypixgrid. Taking the finest grid as input, this tool builds a serie of grids by iteratively aggregating the previous grid by a specified factor using a user defined aggregating function. With such an approach the series of grids has typically the following form: (30m 30m) (60m 60m)... (240m 240m) (480m 480m) The first grid uses cells of 30m by 30m, the second s are four time bigger and so on. Eventually, each of these grids is sliced into vector tiles using a format commonly used in web-mapping applications in order to enable a fast access to the portion of the data needed to display the current view to the user (i.e. which depends on the current position and zoom level). The definition of the relationship between the grid pyramid and the tiles pyramid (which defines the available zoom levels) is also parameterizable using a simple scheme. This solution enables the exploration of massive spatial datasets and can be extended to include temporal information and this was one of the motivations for this work. In the same spirit as for space, a nested binning scheme can be used for time. This will increase the complexity of the interaction with the user during exploration since two kinds of zoom must be handled in this case (spatial and temporal). In this first prototype, we limited ourselves to a unique aggregation period for the time dimension of the data to avoid this issue. To conclude, 430

this dataprocessing tool can handle a spatial grid over which several statistical variables are observed. These variables may vary with respect to time or not. The tool produces web-map tiles ready to be visualized with web-mapping tools enabling an easy exploration of spatio-temporal data at different scales. 3 Case Study: Itinerary Request Data Trip planners are used to find multimodal itineraries in response to door to door travel demands. This service is available through various supports: website, smartphone applications, etc. These tools are commonly used on many French regions. For example, there are about 80 thousand itinerary requests per hour in the Ile-de-France region (Paris and its suburbs) through SNCF Transilien tools. So, why not examine these massive digital activities in order to discover new insights on mobility demand on territories? For technical reasons, trip planning tools generate automatically log files with full itinerary requests made by users and calculated responses. Figure 1 presents the data collected on the user query and the service response in these logs. Although this information can contain bias (demand expressed by only a part of transit users) [3, 4, 5], this kind of data are interesting because they provide a new source about transit information needs and door to door mobility demand on territories. Fig. 1: Itinerary request and response data collection. It s noteworthy that theses log files record each server call and response and don t necessarily reflect the users actual behaviors. For example, if a user requests the same itinerary three times, we count three itineraries although there is only one demand. It is one of the actual limits of the data and it can cause some bias. However, the data also provide very detailed information with precise locations of origin and destination that can be an address, a station or a point of interest. We also know the moment when the user requests an itinerary and 431

when he wants to start or finish his travel. The response gives back information about transportation supply through several indicators like walking time, vehicle time and waiting time [6]. The scope of the study is on both the Ile-de-France region and Lyon urban agglomeration in France. Both territories are characterized by strong population densification and mass transit issues. We collected all itineraries requests made by users and paths calculated in response during few months in 2015. These dataset like other automated data collection systems became quickly large, about 1 hundred millions of itineraries requests. We also collected others territorial data such as employment, facilities and special events (football matches, concerts, etc.) thanks to INSEE s open data and some available API. Then, we defined a synthetic contextual indicator based on these data: Each contextual feature has a relative weight by geographical zone (with the sum of the weights equal to one). By combining these contextual data with mobility demand (through itinerary requests) we want to visually establish a link between them in order to improve our understanding of mobility generators. From an applicative perspective, the high temporal resolution of such a dataset is of main interest compared to traditional surveys which can t be used to answer questions regarding demand variability or the impact of specific temporal events since they only provide an average vision of the mobility demand. An interactive visual exploration of such a sizable dataset can be a first step towards a better understanding of such questions, and is therefore of great interest. However providing a fluid environment for spatio-temporal data exploration is not a trivial task, building on existing technologies we propose a proof of concept for this dataset based on multi-scale data aggregation that could be used for other types of spatio-temporal data. With respect to the implementation of this tool it s a python module which works with a postgis database commonly used to store spatio-temporal data. The aggregation process is defined in a simple text file and the outputs of the program fulfill the vector tile specifications1. 4 Interface Design New technologies and possibilities of data visualization are now very impressive in order to make tangible some complex phenomena from large data sets. This is a growing research area that requires various skills in data processing, IT development, and also generative design. In this case, we want to visually highlight mobility dynamics from itinerary requests data in order to: Identify attractors / sprayers of mobility during the day; Visualize links with contextual environment features (employment, housing, facilities, cultural and social events). 1 https://www.mapbox.com/vector-tiles/specification/ 432

Fig. 2: First sketch of the prototype. As a first step, we present a sketch of data visualization made by a designer [7] in Figure 2. The middle area is a map showing origins and destinations of planned trips during the day, and different color layers represent contextual data like employment, housing, facilities, cultural and social events. Several users interactions are available: Territories module: To select the geographic scope of study; Attractors / Sprayers module: To select only origins or destinations of planned trips; Contextual module: To filter one or more color layers (employment, housing, facilities, events, etc.). In addition, the user can explore the whole dataset by choosing the map zoom level (by scroll interaction) and the time step thanks to an hourly timeline with playback and control capability. The main difficulty in the prototype was to develop a graphical encoding to combine simultaneously different contextual data. To solve this visualization issue, some inspirations were derived from printing process. The final solution represents the weight of the different context variables for each of the grid cells together with the total volume of itinerary requests with origin (or destination). A screen-shot of the final live prototype with real data is given in Figure 3. 5 Conclusion In this paper we have described a pre-processing tool for spatio-temporal datasets that enables multi-scale data aggregation and produces vector tiles for an easy integration with web-mapping libraries. One application of such an approach on itinerary request data is also described in this paper but the general approach is easily expendable to another massive spatio-temporal datasets with several scales of interest. With respect to our current and further works, we are working on automatic tools to ease the definition of the graphical encoding used to display such multi-scale spatio-temporal dataset. 433

Fig. 3: Final prototype on a large scale view. References [1] A. Chua and A. Vande Moere. Binsq: visualizing geographic dot density patterns with gridded maps. Cartography and Geographic Information Science, pages 1 20, 2016. [2] S. Openshaw and S. Openshaw. The modifiable areal unit problem. Geo Abstracts University of East Anglia, 1984. [3] M. Tre panier, R. Chapleau, and B. Allard. Can trip planner log files analysis help in transit service planning? Journal of Public Transportation, 8:79 103, 2005. [4] P. Colpaert, A. Chua, R. Verborgh, E. Mannens, R. Van de Walle, and A. Vande Moere. What public transit api logs tell us about travel flows. In Proceedings of the 25th International Conference Companion on World Wide Web., WWW 16 Companion, pages 873 878, Republic and Canton of Geneva, Switzerland, 2016. International World Wide Web Conferences Steering Committee. [5] J. Jones, C. Cloquet, A. Adam, A. Decuyper, and Isabelle Thomas. Belgium through the lens of rail travel requests: Does geography still matter? ISPRS International journal of geo-information, 5(11):216 238, 2016. [6] A. Remy, M. Chandesris, S. Mastalerz, A. Hyenne, and E. Bousquie. Multimodal travel demand based on itinerary requests. In Proceedings of the World Conference on Transport Research, 2016. [7] C. Gue rin, M. Chandesris, and A. Re my. Territoire vivant. Sciences du Design, 3:30 33, 2016. 434