An adapted intensity estimator for linear networks with an application to modelling anti-social behaviour in an urban environment

Similar documents
A Spatio-Temporal Point Process Model for Firemen Demand in Twente

Points. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

School on Modelling Tools and Capacity Building in Climate and Public Health April Point Event Analysis

Stochastic Modeling of Tropical Cyclone Track Data

Estimating functions for inhomogeneous spatial point processes with incomplete covariate data

A point process approach to analyze spatial transformation effects on threshold exceedances

Spatial point processes

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland

Second-order pseudo-stationary random fields and point processes on graphs and their edges

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina

Getting started with spatstat

Test of Complete Spatial Randomness on Networks

A route map to calibrate spatial interaction models from GPS movement data

A Modified DBSCAN Clustering Method to Estimate Retail Centre Extent

Second-Order Analysis of Spatial Point Processes

Geographical General Regression Neural Network (GGRNN) Tool For Geographically Weighted Regression Analysis

A homogeneity test for spatial point patterns

Comparison of spatial methods for measuring road accident hotspots : a case study of London

The K-Function Method on a Network and its Computational Implementation

Spatial and Temporal Geovisualisation and Data Mining of Road Traffic Accidents in Christchurch, New Zealand

The Building Blocks of the City: Points, Lines and Polygons

STATISTICAL DATA ANALYSIS AND MODELING OF SKIN DISEASES

Article: Disclaimer:

M. Saraiva* 1 and J. Barros 1. * Keywords: Agent-Based Models, Urban Flows, Accessibility, Centrality.

Book of Extended Abstracts

Differential Geometry III, Solutions 6 (Week 6)

ESTIMATING FUNCTIONS FOR INHOMOGENEOUS COX PROCESSES

CS 534: Computer Vision Segmentation III Statistical Nonparametric Methods for Segmentation

Jean-Michel Billiot, Jean-François Coeurjolly and Rémy Drouilhet

Monsuru Adepeju 1 and Andy Evans 2. School of Geography, University of Leeds, LS21 1HB 1

Multitype point process analysis of spines on the dendrite network of a neuron

Updating on the Kernel Density Estimation for Compositional Data

Aalborg Universitet. Publication date: Document Version Early version, also known as pre-print. Link to publication from Aalborg University

Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran

An introduction to spatial point processes

CoDa-dendrogram: A new exploratory tool. 2 Dept. Informàtica i Matemàtica Aplicada, Universitat de Girona, Spain;

The Dirichlet distribution with respect to the Aitchison measure on the simplex - a first approach

Texas A&M University

Structured network regression for spatial point patterns

CHARACTERIZATION OF SOLAR ACCESS IN MEDITERRANEAN CITIES: ORIENTED SKY FACTOR

Extending clustered point process-based rainfall models to a non-stationary climate

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Nonparametric Principal Components Regression

THE DEVELOPMENT OF ROAD ACCIDENT DATABASE MANAGEMENT SYSTEM FOR ROAD SAFETY ANALYSES AND IMPROVEMENT

Properties of spatial Cox process models

The role of topological outliers in the spatial analysis of georeferenced social media data

DM-Group Meeting. Subhodip Biswas 10/16/2014

ZIPF S LAW FOR CITIES: AN EMPIRICAL EXAMINATION 1

transportation research in policy making for addressing mobility problems, infrastructure and functionality issues in urban areas. This study explored

Project Proposal. Student safety at Humboldt State is something that is highly considered by faculty,

A 3D GEOVISUALIZATION APPROACH TO CRIME MAPPING

On the Distribution Dynamics of Human Development: Evidence from the Metropolitan Regions of Bolivia

Spatial clustering of traffic accidents using distances along the network

Theory, Concepts and Terminology

European regional convergence revisited The role of intangible assets

A COMPARATIVE STUDY OF THE APPLICATION OF THE STANDARD KERNEL DENSITY ESTIMATION AND NETWORK KERNEL DENSITY ESTIMATION IN CRASH HOTSPOT IDENTIFICATION

Algorithms for Picture Analysis. Lecture 07: Metrics. Axioms of a Metric

Spatial Variation in Local Road Pedestrian and Bicycle Crashes

Urban GIS for Health Metrics

Analysing The Temporal Pattern Of Crime Hotspots In South Yorkshire, United Kingdom

Exploring the Impact of Ambient Population Measures on Crime Hotspots

Statistical tests for comparisons of spatial and spatio-temporal point patterns

RESEARCH REPORT. Log Gaussian Cox processes on the sphere. Francisco Cuevas-Pacheco and Jesper Møller. No.

Nonparametric Methods

arxiv: v1 [math.st] 21 Dec 2018

Nonparametric Density Estimation. October 1, 2018

Diffusion/Inference geometries of data features, situational awareness and visualization. Ronald R Coifman Mathematics Yale University

A simple agricultural land use model implemented using Geographical Vector Agents (GVA)

Road Network Analysis as a Means of Socio-spatial Investigation the CoMStaR 1 Project

Chapter 6. Fundamentals of GIS-Based Data Analysis for Decision Support. Table 6.1. Spatial Data Transformations by Geospatial Data Types

Summary statistics for inhomogeneous spatio-temporal marked point patterns

EXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS. Food Machinery and Equipment, Tianjin , China

The Use of Statistical Point Processes in Geoinformation Analysis

A new metric of crime hotspots for Operational Policing

"Transport statistics" MEETING OF THE WORKING GROUP ON RAIL TRANSPORT STATISTICS. Luxembourg, 25 and 26 June Bech Building.

Spatial cluster point processes related to Poisson-Voronoi tessellations Møller, Jesper; Rasmussen, Jakob Gulddahl

A methodology on natural occurring lines segmentation and generalization

Stochastic Modelling Unit 1: Markov chain models

Urban settlements delimitation using a gridded spatial support

Everything is related to everything else, but near things are more related than distant things.

Tree Crown Extraction using Marked Point Processes

Performance assessment of quantum clustering in non-spherical data distributions

USE OF R ENVIRONMENT FOR A FINDING PARAMETERS NONLINEAR REGRESSION MODELS

The Joys of Geocoding (from a Spatial Statistician s Perspective)

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

HIERARCHICAL IMAGE OBJECT-BASED STRUCTURAL ANALYSIS TOWARD URBAN LAND USE CLASSIFICATION USING HIGH-RESOLUTION IMAGERY AND AIRBORNE LIDAR DATA

On Model Fitting Procedures for Inhomogeneous Neyman-Scott Processes

Cluster Analysis using SaTScan

GIS4Graph: a tool for analyzing (geo)graphs applied to study efficiency in a street network

Robustness and vulnerability assessment of water networks by use of centrality metrics

Spatial Models: A Quick Overview Astrostatistics Summer School, 2017

ON THE ESTIMATION OF DISTANCE DISTRIBUTION FUNCTIONS FOR POINT PROCESSES AND RANDOM SETS

Tail dependence in bivariate skew-normal and skew-t distributions

GIST 4302/5302: Spatial Analysis and Modeling Point Pattern Analysis

ANALYSIS OF URBAN PLANNING IN ISA TOWN USING GEOGRAPHIC INFORMATION SYSTEMS TECHNIQUES

Geometric Algorithms in GIS

RESEARCH REPORT. A Studentized Permutation Test for the Comparison of Spatial Point Patterns. Ute Hahn. No.

Place Syntax Tool (PST)

Kernel Density Estimation (KDE) vs. Hot-Spot Analysis - Detecting Criminal Hot Spots in the City of San Francisco

Lecture 2: Poisson point processes: properties and statistical inference

Transcription:

An adapted intensity estimator for linear networks with an application to modelling anti-social behaviour in an urban environment M. M. Moradi 1,2,, F. J. Rodríguez-Cortés 2 and J. Mateu 2 1 Institute of New Imaging Technologies (INIT), University Jaume I, Castellón, Spain; moradi@uji.es 2 Department of Mathematics, University Jaume I, Castellón, Spain; cortesf@uji.es, mateu@uji.es Corresponding author Abstract. We consider an inhomogeneous point process on a linear network, and extend Diggle s nonparametric kernel-based edge-corrected intensity estimator in this context. We compare the accuracy and performance of Diggle s method with respect to the equal-split discontinuous kernel estimator. We show the improvement on the estimation of second-order summary statistics, in particular on the inhomogeneous geometrically corrected network K- and pair correlation functions. Finally, we analyse a spatial point pattern of calls related to anti-social behaviour in the streets of a Spanish town. Keywords. Anti-social behaviour; Intensity; Linear network; Second-order characteristics; Shortest path distance. 1 Point processes on a linear network In the spatial statistics context, there are numerous real problems such as the location of traffic accidents in a geographical area, or geo-coded locations of anti-social behaviour in the streets of cities that need to restrict the support of the process over networks to face a more realistic scenario. The spatial events can be classified into two classes with respect to the linear networks: events that occur directly on a linear network, and events that occur alongside a linear network rather than directly on it (see [5]). We consider the events which directly occur on a linear network as a point pattern, that is, a set of locations, irregularly distributed within a linear network and assumed to have been generated by some form of stochastic mechanism. In order to propose statistical methods for analysing point patterns on linear networks, it is necessary to define some fundamental concepts in this new support. A line segment in the plane with endpoints u and v can be written in parametric form as [u,v] = {tu + (1 t)v : 0 t 1} with u,v R 2. Following [2], a linear network L can be defined as the union of a finite collection of line segments embedded in a plane, the endpoints of the segments are called nodes, and the degree of a node is the number of segments that share the same node. Denote by L the total sum of the lengths of line segments L. The distance between two points u and v in the network L is computed by the shortest-path distance d L (u,v) which is the minimum of the lengths of all paths linking u and v. The disc of radius r > 0 centered at the point u L is given by b L (u,r) = {v : v L : d L (u,v) r} and its relative boundary b L (u,r) is the set of points lying exactly r units away from u. The circumference m(u,r) is the number of points in the relative boundary b L (u,r) (for more details see [2]).

We consider a point process X on a linear network L with no overlapping points as a random countable subset of R 2. In practice, we observe n events {u i } n i=1 of X within a bounded linear network L R2, and let N(L) denote the number of events falling in L. We assume that X has intensity function λ( ) and pair correlation function g( ) (see [2]). Then, for any Borel measurable real function h on L [ ] E h(u i ) u i X = h(u)λ(u)d 1 u, (1) L where d 1 u denotes one-dimensional integration over the line segment. Generally, λ(u) is interpreted as the expected number of events per unit length of linear network in a neighbourhood of u, and under the assumption of homogeneity it can be estimated by the quantity n/ L. 2 Adapted Diggle s method for the intensity on a linear network In the inhomogeneous case, several kernel smoothing methods for intensity estimation on linear networks have been proposed in the literature (see [5]). These methods are straightforward extensions of the planar case to the linear networks, but without the consideration of important facts such as edge-effects and the degree of the nodes. However, the equal-split kernel function proposed by [4] tries to remedy these problems by dividing the weight assigned by the kernel by a quantity that depends on the degree of the nodes which are equally distributed over the segments that share the node. Nevertheless, this estimation does not seem to provide quite good results when it is used in the estimation of second-order summary statistics. This method has been implemented in the R package spatstat [2]. We extend the non-parametric kernel-based edge-corrected intensity estimator of [3] to the linear network support following the mathematical arguments in [1]. We thus define the estimator of the intensity function as ˆλ (D) ε (u) = n i=1 κ ε (d L (u,v i )), u L, (2) κ(t)m(v i,t)dt 0 where κ is a one-dimensional kernel function with smoothing parameter ε. Note that the edge-correction in the denominator of (2) involves the circumference function, and therefore we are also considering the degree of the nodes, while removing the overestimation at the nodes and the edges. We highlight that this adaptation on linear networks of the classical techniques for the intensity estimation are appropriate, and the circumference function is an necessary tool that provides information of the structure of the network on every point. 3 Simulation study We conduct a simulation experiment to compare the accuracy and performance of adapted Diggle s method with respect to the equal-split discontinuous kernel estimator under the assumption of inhomogeneous Poisson processes. We set a simple linear network composed by 19 vertices, 26 lines with total length 39.38 units, within a window [0,5] [0,5] units 2, as it is shown in Figure (1). We consider the intensity function λ(x,y) = 0.2e 0.4(x+y) and use this to simulate 1000 point patterns with expected number of points E[N(L)] = 95. For each one of the patterns we estimate the intensity based on both techniques METMA VIII Workshop 2

M. M. Moradi et al. An adapted intensity estimator for linear networks (adapted Diggle s method and equal-split discontinuous kernel) and compare them through a mean error (ME) approach. Figures (1b) and (1c) show the intensity estimation based on the adapted Diggle s method using the Epanechnikov kernel, and on the equal-split discontinuous method using a Gaussian kernel, respectively, with bandwidth ε = 1.25 units for both cases. (a) (b) (c) Figure 1: Simulation and smoothing kernel density estimates of an inhomogeneous Poisson process on the simple network: (1a) Realisation of an inhomogeneous pattern on a simple network. (1b) Linethickness display for adapted Diggle s estimator. (1c) Line-thickness display for the equal-split discontinuous method. From a visual comparison in Figure (1a) we note that the adapted Diggle s method improves the estimation particularly at those segments with a low number of points because it does not overestimate the intensity at these segments, while the equal-split discontinuous method shows a constant high value along the segments regardless the distribution of the points. Furthermore, adapted Diggle s estimator proved to be robust showing small variations against smooth variations in the bandwidths. Figure (2) shows that the ME for the adapted Diggle s estimator is around 0, whereas that for the equal-split discontinuous method is around -2.5, indicating that the former estimator shows a smaller error than the latter with respect to the theoretical intensity. Figure 2: Mean error (ME) for 1000 simulations of an inhomogeneous Poisson point process on a simple linear network. METMA VIII Workshop 3

4 Anti-social behaviour analysis We analyse the spatial point pattern of calls related to anti-social behaviour in the streets of Castellón, Spain in January 2013. The city of Castellón is divided into 108 census tracks with an overall surface of 108.659 km 2. The linear network associated with this city contains 450 nodes and 2242 line segments, with a total length of 1611.688 km, and it is shown in Figure 3. The dataset is formed by 189 locations of calls related to anti-social behaviour with a circumradius of 161.168 km. 0 0.1 0.2 0.3 0.4 Figure 3: Estimated intensity for anti-social behaviour on the streets of Castellón, Spain in January 2013 with ε = 1.333 km. The pattern in Figure 3 looks quite regular and does not exhibit any apparent clustering. However, the streets leading to the city center show higher intensity than those in the boundary which may be related to night-life and discotheques. Second-order summary statistics on the linear network will be further used to support these findings. Acknowledgments. The research of M.M. Moradi is supported by funding from the European Union through the GEO-C project (H2020-MSCA-ITN-2014, Grant Agreement Number 642332, http://www.geoc.eu/). F.J. Rodríguez-Cortés and J. Mateu are supported by Grant MTM2013-43917-P from the Spanish Ministry of Science and Education, and Grant P1-1B2015-40. References [1] Ang, Q. W., Baddeley, A., and Nair, G. (2012). Geometrically corrected second order analysis of events on a linear network, with applications to ecology and criminology. Scandinavian Journal of Statistics, 39 591 617. [2] Baddeley, A., Rubak, E., and Turner, R. (2015). Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press, London. [3] Diggle, P. J. (1985). A kernel method for smoothing point process data. Journal of the Royal Statistical Society. Series C (Applied Statistics), 34 138 147. [4] Okabe, A., Satoh, T., and Sugihara, K. (2009). A kernel density estimation method for networks, its computational method and a GIS-based tool. International Journal of Geographical Information Science, 23 7 32. METMA VIII Workshop 4

[5] Okabe, A., and Sugihara, K. (2012). Spatial analysis along networks: statistical and computational methods. John Wiley & Sons, London. METMA VIII Workshop 5