arxiv: v1 [cs.oh] 16 Nov 2017

Similar documents
GIS4Graph: a tool for analyzing (geo)graphs applied to study efficiency in a street network

Instituto de Pesquisas Meteorológicas - IPMet Universidade Estadual Paulista - Unesp

Combining Geographic and Network Analysis: The GoMore Rideshare Network. Kate Lyndegaard

MEDIUM-SCALE GRAVITY WAVES OBTAINED FROM AIRGLOW ALL-KSY OBSERVATIONS OVER CACHOEIRA PAULISTA

Multi agent Evacuation Simulation Data Model for Disaster Management Context

QGIS FLO-2D Integration

INTRODUCTION TO GIS. Dr. Ori Gudes

Design and implementation of a new meteorology geographic information system

Techniques for Science Teachers: Using GIS in Science Classrooms.

INTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEM By Reshma H. Patil

Introduction to GIS I

V1.0. Session: Labelled Maps Verification, entering names into a GIS and Google Maps/Earth. Pier-Giorgio Zaccheddu

Full Toblerian Spaces: Recovering Tobler s Views on the Representations for Fluxes in Geographical Analysis

Overview of Geospatial Open Source Software which is Robust, Feature Rich and Standards Compliant

Natural Disaster Monitoring and Alert System

COMPLETELY CONTRACTIVE MAPS BETWEEN C -ALGEBRAS

Among various open-source GIS programs, QGIS can be the best suitable option which can be used across partners for reasons outlined below.

These modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS.

GIS CONCEPTS ARCGIS METHODS AND. 2 nd Edition, July David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University

Georelational Vector Data Model

Lecture 9: Geocoding & Network Analysis

GEOGRAPHIC INFORMATION SYSTEMS

Geometric Algorithms in GIS

Geography 38/42:376 GIS II. Topic 1: Spatial Data Representation and an Introduction to Geodatabases. The Nature of Geographic Data

QGIS COURSE - ROUTE ANALYSIS, FINDING THE SHORT PATH IN QGIS ONLINE TRAINING. GIS Training. Course. .com

Móstoles, Spain. Keywords: complex networks, dual graph, line graph, line digraph.

Discovery and Access of Geospatial Resources using the Geoportal Extension. Marten Hogeweg Geoportal Extension Product Manager

GIS = Geographic Information Systems;

Comparative analysis of transport communication networks and q-type statistics

Beyond points: How to turn SMW into a complete Geographic Information System

All About Spatial Data. Find it, Manage it, Use it

Learning ArcGIS: Introduction to ArcCatalog 10.1

Welcome to NR502 GIS Applications in Natural Resources. You can take this course for 1 or 2 credits. There is also an option for 3 credits.

Write a report (6-7 pages, double space) on some examples of Internet Applications. You can choose only ONE of the following application areas:

Stochastic cellular automata model for wildland fire spread dynamics

A Technique for Importing Shapefile to Mobile Device in a Distributed System Environment.

ANALYSIS OF THE MICROCLIMATE OF SÃO CRISTÓVÃO AND ITS INFLUENCE IN HEATING SYSTEMS, VENTILATING AND AIR CONDITIONING (HVAC)

PostGIS Cookbook. open source I community experience distilled

Development of a server to manage a customised local version of OpenStreetMap in Ireland

Migration Clusters in Brazil: an Analysis of Areas of Origin and Destination Ernesto Friedrich Amaral

GIS-based Smart Campus System using 3D Modeling

Overview key concepts and terms (based on the textbook Chang 2006 and the practical manual)

An Open Source Tool for the Visualization, Analysis and Reporting of Regional and Statewide Transit Networks

CPSC 695. Future of GIS. Marina L. Gavrilova

GEOGRAPHY 350/550 Final Exam Fall 2005 NAME:

Introduction-Overview. Why use a GIS? What can a GIS do? Spatial (coordinate) data model Relational (tabular) data model

Innovation. The Push and Pull at ESRI. September Kevin Daugherty Cadastral/Land Records Industry Solutions Manager

CENSUS MAPPING WITH GIS IN NAMIBIA. BY Mrs. Ottilie Mwazi Central Bureau of Statistics Tel: October 2007

17th International Conference on Harmonisation within Atmospheric Dispersion Modelling for Regulatory Purposes 9-12 May 2016, Budapest, Hungary

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

4CitySemantics. GIS-Semantic Tool for Urban Intervention Areas

A Natural Disaster Monitoring and Warning System SISMADEN

GIS for ChEs Introduction to Geographic Information Systems

STOCHASTIC CELLULAR AUTOMATA MODEL FOR WILDLAND FIRE SPREAD DYNAMICS

BRAZILIAN'REMOTE SmSING COMKmUTY: HISTORICAL, GEOGRAPHICAL AND ECONOMICAL ASPECTS

What is GIS and How Can It Help Me?

GIS Visualization: A Library s Pursuit Towards Creative and Innovative Research

Welcome! Power BI User Group (PUG) Copenhagen

Valdosta State University Strategic Research & Analysis

GIS CONCEPTS ARCGIS METHODS AND. 3 rd Edition, July David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University

Encapsulating Urban Traffic Rhythms into Road Networks

GIS Based Transit Information System for Metropolitan Cities in India

GEOMAGNETICALLY INDUCED CURRENTS IN A BRAZILIAN POWER NETWORK OVER THE SOLAR CYCLES 23 AND 24

NR402 GIS Applications in Natural Resources

Overview. Everywhere. Over everything.

One platform for desktop, web and mobile

Application of WebGIS and VGI for Community Based Resources Inventory. Jihn-Fa Jan Department of Land Economics National Chengchi University

What is 511? Need for 511 Services. Development & Deployment of Regional Road and Weather Information Supporting 511 Traveler Services

Road & Railway Network Density Dataset at 1 km over the Belt and Road and Surround Region

International Journal of Remote Sensing, in press, 2006.

Open Source Technologies and Remotely Sensed Data in Protecting Elephants. Rosemary Alles Dr. Justine Blanford Penn State World Campus July 2015

Modeling evacuation plan problems

Spatial Web Technology for Urban Green Society (A Case of Tsukuba City)

Free Open Source Software: FOSS Based GIS for Spatial Retrievals of Appropriate Locations for Ocean Energy Utilizing Electric Power Generation Plants

Research Article The Impact on Geographic Location Accuracy due to Different Satellite Orbit Ephemerides

Applied Cartography and Introduction to GIS GEOG 2017 EL. Lecture-2 Chapters 3 and 4

A Prototype of a Web Mapping System Architecture for the Arctic Region

MapOSMatic: city maps for the masses

Numerical evaluation of the upper critical dimension of percolation in scale-free networks

Geometric Algorithms in GIS

Experiences and Directions in National Portals"

INCT2012 Complex Networks, Long-Range Interactions and Nonextensive Statistics

A NOTE ON WELL-POSED NULL AND FIXED POINT PROBLEMS

Networks as a tool for Complex systems

a system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware,

A National Scale Infrastructure Database and Modelling Environment for the UK

Searching for less perturbed elliptical orbits around Europa

StreamStats: Delivering Streamflow Information to the Public. By Kernell Ries

Department of Computer Science Federal University of São Carlos São Carlos SP, Brazil

GIS Data Structure: Raster vs. Vector RS & GIS XXIII

Introduction to IsoMAP Isoscapes Modeling, Analysis, and Prediction

ISO INTERNATIONAL STANDARD. Geographic information Metadata Part 2: Extensions for imagery and gridded data

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 12 Jun 2004

Development and operation of GIS exercise materials for undergraduate students

An Implementation of Mobile Sensing for Large-Scale Urban Monitoring

DESIGNING AND APPLICATION OF WEB-BASED GEOGRAPHICAL INFORMATION SYSTEM FOR VISUAL ASSESSMENT OF LAND LEVELS

12 Review and Outlook

Introduction to GIS. Phil Guertin School of Natural Resources and the Environment GeoSpatial Technologies

Development of Univ. of San Agustin Geographic Information System (USAGIS)

Basics of GIS. by Basudeb Bhatta. Computer Aided Design Centre Department of Computer Science and Engineering Jadavpur University

Transcription:

(geo)graphs - Complex Networks as a shapefile of nodes and a shapefile of edges for different applications Leonardo B L Santos 1,2,*, Aurelienne A S Jorge 3, Marcio Rossato 4, Jéssica D Santos 2, Onofre A Candido 1, Wilson Seron 5, and Charles N de Santana 6,7,8 arxiv:1711.05879v1 [cs.oh] 16 Nov 2017 1 Centro Nacional de Monitoramento e Alertas de Desastres Naturais (Cemaden), São José dos Campos, SP, Brazil 2 Programa de Pós Graduação em Computação Aplicada do Instituto Nacional de Pesquisas Espaciais (INPE), São José dos Campos, SP, Brasil 3 Instituto Nacional de Pesquisas Espaciais (INPE), Cachoeira Paulista, SP, Brasil 4 Via Vale Sistemas, Cachoeira Paulista, SP, Brasil 5 Universidade Federal de São Paulo, São José dos Campos, SP, Brasil 6 Instituto Nacional de Ciência e Tecnologia em Estudos Interdisciplinares e Transdisciplinares em Ecologia e Evolução (INCT-INTREE), Universidade Federal da Bahia, Salvador, BA, Brasil 7 Programa de Pós Graduação em Modelagem em Ciências da Terra e do Ambiente, Universidade Estadual de Feira de Santana, Feira, BA, Brasil 8 Departamento de Computação Científica, DataSCOUT - Data Science and Scientific Computing, Salvador, BA, Brasil * leonardo.santos@cemaden.gov.br ABSTRACT Spatial dependency and spatial embedding are basic physical properties of many phenomena modeled by networks. The most indicated computational environment to deal with spatial information is to use Georeferenced Information System (GIS) and Geographical Database Management Systems (GDBMS). Several models have been proposed in this direction, however there is a gap in the literature in generic frameworks for working with Complex Networks in GIS/GDBMS environments. Here we introduce the concept of (geo)graphs: graphs in which the nodes have a known geographical location and the edges have spatial dependence. We present case studies and two open source softwares (GIS4GRAPH and GeoCNet) that indicate how to retrieve networks from GIS data and how to represent networks over GIS data by using (geo)graphs. Introduction The advances in high computing processing and data storage in last decades together with the development of modern statistical physics theory facilitated the arise of Complex Networks approaches to model large scale systems in which it is possible to find relationships among its elements 1 4. This concept has been applied to interdisciplinary studies ranging from Ecology 5, 6, Evolution 7 9, Urban Mobility 10, Epidemiology 11, Computer Sciences 12, Climatic Sciences 13, 14 among others. Spatial dependency and spatial embedding are basic physical properties of many phenomena modeled by networks. The classical approach to deal with spatial information is to use Georeferenced Information System (GIS) and Geographical Database Management Systems (GDBMS) that easily combine different layers of georeferenced data into information retrieval. Several models have been proposed to describe spatial effects in networks 15, 16. However, there is a gap in the literature about generic frameworks for working with Complex Networks in GIS/GDBMS environments. Here we introduce the concept of (geo)graphs: graphs in which the nodes have a known geographical location and the edges have spatial dependence - and (geo)graphs are geographical objects compatible with GIS/GDBMS environments. We also present two open source computational frameworks to facilitate the integration of the (geo)graphs in GIS/GDBMS environments: GIS4GRAPH (G4G), a Web tool written in Python and JavaScript that allows the conversion of a given shapefile to a (geo)graph, for which the only requirements to use it is to have Internet access and a browser; and Geographical Complex Networks (GeoCNet) a desktop application written in Python and C that allows the conversion of a graph to a (geo)graph shapefile. Both, G4G and GeoCNet, have been developed by some of the authors of this paper, and both source codes are free and can be accessed, respectively, in the repositories https://github.com/aurelienne/gis4graph and https://github.com/jessicadominguess/geocnet.

The G4G tool From GIS to graphs - the G4G sofware In terms of geospatial data, GIS4Graph is able to deal with both shapefiles and OpenStreetMap (OSM) files as input. When dealing with a shapefile, it must be a set of linestrings representing the network to be analyzed. Such data are then inserted into a database with geographic support - using PostgreSQL as the Database Management System and PostGIS as its spatial extension. The connections identification between network segments is efficiently performed by an indexed spatial query based on a function that verifies intersections between geometric features. When it comes to an OSM file representing a street network, a PostgreSQL extension named pgrouting is employed. In both cases, the result is a connection list between nodes. The first step before any calculation is to build a graph based on the analyzed network. It can be done by adding a node for each geographic feature and edges based on the connection list. The igraph library 17 is used for both graph building and some default metrics calculation. Vertex degree, clustering coefficient, shortest paths and betweenness are some examples of metrics calculated by igraph and incorporated to G4G. G4G interface is built upon OpenLayers, a JavaScript framework to work with interactive maps and geographic elements, allowing applying visualization styles to features individually. The resultant dataset is displayed visually on a map or as a graph. From graphs to GIS - the GeoCNet software In order to convert spatial networks to GIS environment we propose the following workflow: 1. To create a shapefile for the nodes using any GIS software. A point type shapefile for the nodes must be created. The shapefile must have a mandatory column of type integer named id, representing the id s of the nodes. All the characteristics of the polygons/points will be associated to their respective points as attributes, including the geographic locations of the nodes. The resulting nodes shapefile will be used as an input for the GeoCNet software. 2. To create an adjacency matrix (0s and 1s) representing the connections between these nodes. The matrix will be used as an input for the GeoCNet software as well. 3. Then, a line type shapefile representing the edges of the network is given as an output of GeoCNet. The point-typeshapefile and the line-type-shapefile will have topological attributes of nodes and edges respectively. Some Applications Transportation Here we present how (geo)graphs can be used to identify potential main streets used by most of the transports routes in the city of Lorena/SP. The geodata representing the street network of the city of Lorena/SP were acquired through a request on OpenStreetMap Extended API by specifying the bounding box of the city. It delivers an XML response wrapped in an OSM element that includes basically the description of the ways (polylines that represent linear features such as roads) and their relationships (OSM, 2017). More precisely, each line segment between crossroads is a way, and the relationships between ways are indicated by osm source and osm target fields. For the proposed analysis, it is needed to represent every avenue or street as a single node. By using the G4G, we can extract the network from the GIS data and then represent the betweenness centrality index for the streets network on a map. A case study for the city of Lorena/SP is shown in Figure 1. The streets for which the betweenness centrality index are higher are the potential main streets used by most of the transport routes in the city. Mobility - flow We studied the flow of people between each pair of areas (traffic zones) in a city on a typical day using an Origin-Destination survey for the metropolitan region of Rio de Janeiro/RJ 18. By using the GeoCNet, we can represent urban mobility networks on a map. A case study for the metropolitan region of Rio de Janeiro/Brazil is shown in Figure 2. Each node represents a traffic zone. Each pair of nodes is connected if the flow of people between the traffic zones represented by them is greater than a given threshold value. Nodes colors change according to the number of connections of the nodes. And edges colors change according to the total number of people going from one to the other node. In the figure it is shown that many people move between distant traffic zones (red edges between nodes distant from each other) especially if the destination/origin is a very connected traffic zone (a red node). Weather radar - correlation Weather radar is one of the most important equipment for monitoring extreme meteorological events. For this brief case study, a dataset of meteorological time series were obtained by a weather radar (temporal resolution of 10 minutes) 1. 1 sigma.cptec.inpe.br/radar/ 2/5

Figure 1. Lorena city s street network on map visualization. A filter was applied to show only the streets with the highest betweenness values with colors employed according to the legend on the bottom right corner. By using the GeoCNet, we can study rainfall temporal correlation networks. A case study for the mountainous region of Rio de Janeiro/Brazil, near the city Nova Friburgo/Brazil, is shown in Figure 3. Each node represents a grid point of a weather radar (as an interpolated field of rainfall). Each pair of nodes is connected if the Pearson correlation between the rainfall time series associated to each node is greater than a given threshold value. In the Figure, red dots represent the nodes and the blue lines represent the edges. So it is possible to visualize a community-like structure in the network, guided by the mountains of the region (see the satellite image). Conclusions and Perspectives In this paper it was introduced the concept of (geo)graphs: graphs in which the nodes have a known geographical location and the edges have spatial dependence. We also presented two open source softwares GIS4GRAPH (G4G) and GeoCNet, two computational frameworks to facilitate the integration of the (geo)graphs in GIS softwares. Datasets about transportation (streets - OSM), urban mobility (flow of people) and meteorological (time series correlation) networks were used as brief case studies. Using (geo)graphs, as a concept, and G4G and GeoCNet, as tools, you are able to insert and manipulate graphs into GIS, the most appropriate computational environment to handle geographical data. Then, you are able to easily representing graphs and their properties on maps, composing with different layers and allowing several spatial analysis. Among the perspectives of this research are the improvement of the web interface for the tool, and more detailed case studies, producing thematic maps for the nodes, edges and properties (topological measurements). References 1. Watts, D. J. & Strogatz, S. H. Collective dynamics of small-world networks. nature 393, 440 (1998). 2. Barabási, A.-L. & Albert, R. Emergence of scaling in random networks. science 286, 509 512 (1999). 3. Strogatz, S. H. Exploring complex networks. nature 410, 268 (2001). 4. Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Rev. modern physics 74, 47 (2002). 5. Sole, R. V. & Montoya, M. Complexity and fragility in ecological networks. Proc. Royal Soc. Lond. B: Biol. Sci. 268, 2039 2045 (2001). 3/5

Figure 2. Urban mobility network for the metropolitan region of Rio de Janeiro/Brazil. The color of each node is related to its topological degree (number of connections), and the color of each edge is related to its weight index (total number of people going from one to the other node of each pair of nodes). The threshold of flow to connect each pair of nodes was taken in a thousand people. 6. Pascual, M. & Dunne, J. A. Ecological networks: linking structure to dynamics in food webs (Oxford University Press, 2006). 7. Wagner, A. & Fell, D. A. The small world inside large metabolic networks. Proc. Royal Soc. Lond. B: Biol. Sci. 268, 1803 1810 (2001). 8. Andrade, R. F. et al. Detecting network communities: An application to phylogenetic analysis. PLoS computational biology 7, e1001131 (2011). 9. Wagner, A. Genotype networks shed light on evolutionary constraints. Trends Ecol. & Evol. 26, 577 584 (2011). 10. Barbosa-Filho, H. et al. Human mobility: Models and applications. arxiv:1710.00004 [physics.soc-ph] (2017). 11. Pastor-Satorras, R. & Vespignani, A. Epidemic spreading in scale-free networks. Phys. review letters 86, 3200 (2001). 12. Barabási, A.-L., Albert, R. & Jeong, H. Scale-free characteristics of random networks: the topology of the world-wide web. Phys. A: statistical mechanics its applications 281, 69 77 (2000). 13. Donges, J. F., Zou, Y., Marwan, N. & Kurths, J. Complex networks in climate dynamics. The Eur. Phys. Journal-Special Top. 174, 157 179 (2009). 14. de Santana, C. et al. Graph theory defining non-local dependency of rainfall in northeast brazil. Ecol. Complex. 6, 272 277 (2009). 4/5

Figure 3. Correlation network for the rainfall time series in the mountains region of Rio de Janeiro/Brazil. The satellite image of the region is also represented. 15. Hayashi, Y. A review of recent studies of geographical scale-free networks. Inf. Media Technol. 1, 1136 1145 (2006). 16. Barthe lemy, M. Spatial networks. Phys. Reports 499, 1 101 (2011). 17. Csardi, G. & Nepusz, T. The igraph software package for complex network research. InterJournal Complex Systems, 1695 (2006). URL http://igraph.org. 18. Santos, L. B. L. et al. Exploring the spatial structure of rio de janeiro s intra-urban mobility complex network. to be submitted (2018). Acknowledgements Research was partially supported by grant 454267/2014-2 of the Brazilian National Council for Scientific and Technological Development (CNPq) and by the grant 2015/50122-0 Sa o Paulo Research Foundation (FAPESP) and DFG-IRTG 1740/2. C.N.S was partially supported by a PDJ Postdoctoral grant 23038.000776/2017-54 by the Coordination for the Improvement of Higher Education Personnel (CAPES). The authors sincerely thank Dr. Marcos Quiles, Dr. Elbert Macau and Dr. Tristan Pryer for their kind comments during this study. Author contributions statement L.B.L. conceived the concept of (geo)graphs, A.A.S.J., M.R. and J.D.S implemented the main computational tools, A.A.S.J., M.R., J.D.S, O.A.C. and W.S. conducted the experiments, L.B.L. and C.N.S. analyzed the results. All author reviewed the manuscript. 5/5