A new metric of crime hotspots for Operational Policing

Similar documents
Monsuru Adepeju 1 and Andy Evans 2. School of Geography, University of Leeds, LS21 1HB 1

Detecting Emerging Space-Time Crime Patterns by Prospective STSS

Clustering Analysis of London Police Foot Patrol Behaviour from Raw Trajectories

Profiling Burglary in London using Geodemographics

An Assessment of Crime Forecasting Models

A 3D GEOVISUALIZATION APPROACH TO CRIME MAPPING

Profiling Burglary in London using Geodemographics. Chris Gale 1, Alex Singleton 2, Paul Longley 1

Examining the extent to which hotspot analysis can support spatial predictions of crime

Exploring the Impact of Ambient Population Measures on Crime Hotspots

Comparison of spatial methods for measuring road accident hotspots : a case study of London

The Building Blocks of the City: Points, Lines and Polygons

Developing an Analysis Process for Crime Hot Spot Identification and Comparison

The geography of domestic energy consumption

Understanding the spatial pattern of urban crime: a developing country s perspective

The CrimeStat Program: Characteristics, Use, and Audience

- Geography is important when studying crime because when a crime happens it always has a location, a time and a reason.

Crime Analysis. GIS Solutions for Intelligence-Led Policing

Modeling Temporal-Spatial Correlations for Crime Prediction

Urban Geography. Unit 7 - Settlement and Urbanization

Linking Neighbourhood Data in the Analysis of Crime

The value of retrospective spatial trends of crime for predictive crime mapping

A route map to calibrate spatial interaction models from GPS movement data

Butterflies, Bees & Burglars

Estimating Large Scale Population Movement ML Dublin Meetup

Coastal regions: People living along the coastline and integration of NUTS 2010 and latest population grid

Spatial Statistical Analysis of burglary Crime in Chennai City Promoters Apartments: A CASE STUDY

Introducing GIS analysis

Space-time modelling of sexually transmitted infections in London with focus on Chlamydia trachomatis

BIG IDEAS. Area of Learning: SOCIAL STUDIES Urban Studies Grade 12. Learning Standards. Curricular Competencies

1. Richard Milton 2. Steven Gray 3. Oliver O Brien Centre for Advanced Spatial Analysis (UCL)

Exploring Digital Welfare data using GeoTools and Grids

GIS for Crime Analysis. Building Better Analysis Capabilities with the ArcGIS Platform

Patterns in Offender Distance Decay and the Geographic Profiling Problem.

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints

Spatial Patterns in the 2011 London Riots

Optimization of Criminal HotSpots Based on Underlying Crime Controlling Factors Using Geospatial Discriminative Pattern

Uncovering the Digital Divide and the Physical Divide in Senegal Using Mobile Phone Data

EICCARS WORKBOOK FOR INITIATING COMMUNITY PROFILES WITH STATISTICAL AND OTHER SECONDARY DATA

A Geo-Statistical Approach for Crime hot spot Prediction

Malicious Hoax Calls And Suspicious Fires: An Examination Of Their Spatial And Temporal Dynamics

Dr Spencer Department of Security and Crime Science, University College London

DM-Group Meeting. Subhodip Biswas 10/16/2014

Preprint.

Spatial Pattern of Burglary in South Yorkshire Using Geographic Information System (GIS)

Introduction: Name Redacted November 17, 2017 GSP 370 Research Proposal and Literature Review

Module 3 Indicator Land Consumption Rate to Population Growth Rate

Spatio-Temporal Analytics of Network Data

Area-Specific Crime Prediction Models

Implications for the Sharing Economy

Spatial Analysis of Crime Report Datasets

Open Data Sources for Domain Specific Geodemographics

Analysing The Temporal Pattern Of Crime Hotspots In South Yorkshire, United Kingdom

Poisson-Based Regression Analysis of Aggregate Crime Rates

Curriculum Vitae. Ran Wei

Comparison of different crime prediction models in Bogotá

Kernel Density Estimation (KDE) vs. Hot-Spot Analysis - Detecting Criminal Hot Spots in the City of San Francisco

What Explains the Effectiveness of Hot Spots Policing And What New Strategies Should we Employ?

UCL Institute for Environmental Design and Engineering

Protocol between London MAPPA Strategic Management Board (SMB) and (Name of Borough) Local Safeguarding Children Board

COSMIC: COmplexity in Spatial dynamic

INTRODUCTION. In March 1998, the tender for project CT.98.EP.04 was awarded to the Department of Medicines Management, Keele University, UK.

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

Space-Time Kernels. Dr. Jiaqiu Wang, Dr. Tao Cheng James Haworth University College London

Public spaces revisted: a study of the relationship between patterns of stationary activity and visual fields

Spatial Process VS. Non-spatial Process. Landscape Process

Empirical Discriminative Tensor Analysis for Crime Forecasting

Tracey Farrigan Research Geographer USDA-Economic Research Service

URBAN CHANGE DETECTION OF LAHORE (PAKISTAN) USING A TIME SERIES OF SATELLITE IMAGES SINCE 1972

Techniques for Science Teachers: Using GIS in Science Classrooms.

Nature of Spatial Data. Outline. Spatial Is Special

Spatial-Temporal Analytics with Students Data to recommend optimum regions to stay

Exploring the Geography of Communities in Social Networks

Major Crime Map Help Documentation

Crime Forecasting Using Data Mining Techniques

Inclusion of Non-Street Addresses in Cancer Cluster Analysis

ROBBERY VULNERABILITY

Master Thesis. submitted within the UNIGIS M.Sc. programme. Interfaculty Department of Geoinformatics - Z_GIS. University of Salzburg

Assessing the impact of seasonal population fluctuation on regional flood risk management

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns

Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran

The Mathematics of Criminal Behavior: From Agent-Based Models to PDEs. Martin B. Short GA Tech School of Math

The Mathematics of Geographic Profiling

Spatial metrics for Greek cities using land cover information from the Urban Atlas

Using GIS to Evaluate Rural Emergency Medical Services (EMS)

Self-exciting point process modeling of crime

Environmental Design using Fractals in Computer Graphics Eunjoo LEE, Yoshitaka IDA, Sungho WOO and Tsuyoshi SASADA

A Modified DBSCAN Clustering Method to Estimate Retail Centre Extent

ENV208/ENV508 Applied GIS. Week 1: What is GIS?

Linking Industry and Occupation Clusters in Regional Economic Development

INTERNATIONAL JOURNAL OF GEOMATICS AND GEOSCIENCES Volume 7, No 1, 2016

LAB EXERCISE #2 Quantifying Patch Mosaics

User Guide: RADON POTENTIAL DATASET - England and Wales

Neighborhood Locations and Amenities

Non-parametric bootstrap and small area estimation to mitigate bias in crowdsourced data Simulation study and application to perceived safety

Modelling Spatial Behaviour in Music Festivals Using Mobile Generated Data and Machine Learning

GIS and the Built Environment

Compact guides GISCO. Geographic information system of the Commission

Rural Pennsylvania: Where Is It Anyway? A Compendium of the Definitions of Rural and Rationale for Their Use

Space-time clusters of crime in Stockholm, Sweden

Estimating the long-term health impact of air pollution using spatial ecological studies. Duncan Lee

Transcription:

A new metric of crime hotspots for Operational Policing Monsuru Adepeju *1, Tao Cheng 1, John Shawe-Taylor 2, Kate Bowers 3 1 SpaceTimeLab for Big Data Analytics, Department of Civil, Environmental and Geomatic Engineering, University College London 2 Department of Security and Crime Science, University College London 3 Department of Computer Science, University College London November 07, 2014 Summary This study examines the existing metrics used in evaluating the effectiveness of area-based crime hotspots for operational policing. We identified some of the limitations of the metric (i.e. Area-to- Perimeter (AP) ratio) used for measuring compactness of hotspots and then proposed a new improved metric called Clumpiness Index (CI). The case study of London Metropolitan police crime dataset features the prediction of 3 different crime types using two different crime predictive methods. The effectiveness of the hotspots was then measured using both AP ratio and CI. The comparison of the results clearly shows that CI is a better metric for measuring the effectiveness of crime hotspots for operational policing. KEYWORDS: Effective hotspots, Area-to-perimeter (AP) ratio, Hit rate, Clumpiness Index (CI), Operational Policing. 1. Introduction As police resources are becoming increasingly limited due partly to budget constraints, there is a growing interest in strategies that can enhance optimisation of their resources towards achieving the desired crime prevention goals. An effective policing strategy is one that offers the police high crime prevention potential with a small amount of police deployment (Weisburd, 2008). Over the last decade, attempts to increase police effectiveness have resulted in operational policing being informed by predictive analysis of crime. The predictive methods are used to identify locations of high future crime risk. These locations are referred to as crime hotspots. The types of hotspots include pointbased, network-based and area-based hotspots. Several studies have suggested that police can be more effective in intervening in crime by focussing on small geographical units with high crime rates (hotpots) rather than actual people (offenders) committing the crimes (Telep & Weisburd, 2014). As a result, most predictive methods of crime have been aimed at identifying area-based hotspots. To estimate how effective the detected hotspots are for operational policing, two metrics have been used, proposed by Bowers et al. (2004). These metrics are Hit rate (HR) and Area-to-Perimeter (AP) ratio. The HR measures the proportion of future crime accurately captured by the purported hotspots while AP ratio measures compactness (easiness of covering) the hotspots. Bowers et al. (2004) recommended that the two measures should be used together for meaningful evaluation. This is because hotspots with high HR may not necessarily be easily coverable based on their geometric shape. Thus, hotspots with moderate HR and a high AP ratio are preferred to ensure effective policing. However, certain limitations can be identified with the AP ratio which renders it less appealing for evaluating hotspots for effective policing. The AP ratio is used to measure the geometric complexities (compactness) of hotpots. The * monsuru.adepeju.11@ucl.ac.uk tao.cheng@ucl.ac.uk j.shawe-taylor@ucl.ac.uk kate.bowers@ucl.ac.uk

assumption is that regular-shaped hotspots (e.g. squares) can be covered quicker and more easily than irregularly-shaped hotspots, if we ignore the underlying network structure. The AP ratio has limitations. They are: a) Holding the shape of a hotspot constant, the AP ratio varies with the spatial scale, (ranging from zero to infinity). This makes it difficult to compare similar hotspots across different study areas. b) The level of disaggregation or dispersion of the hotspots (grid units) cannot be inferred from the value of the AP ratio. Therefore, the AP ratio cannot give us an idea of randomly distributed hotspots as a baseline for comparison. c) The AP ratio is relatively insensitive to differences in the structure of hotspots. Thus, although hotspots may possess very different shapes, they may have identical area and perimeters. Therefore, the goal of this paper is to propose a new metric for measuring effectiveness of crime hotspots for operational policing. Specifically, we are proposing a new metric called Clumpiness Index as an alternative to AP ratio given the limitations of AP ratio listed above. 2. Existing Metrics Hit Rate and AP Ratio 2.1 Hit Rate: the proportion of new crimes captured by the defined hotspot. Evaluated at a certain area coverage (e.g. 20% area coverage) Hit Rate = ( m k=1(number of crimes) n i=1(number of crimes) ) 100 (1) Where i = number of ranked grids; k = number of percentile of ranked grids e.g. 20th; n = total number of grids. 2.2 Area-to-perimeter (AP) ratio: a measure of how compact an identified cluster (hotspot) is. The more compact a hotspot is, the easier and quicker it will be to cover operationally. Higher AP ratio corresponds to better compactness (Figure 1). Figure 1 Area-to-Perimeter (AP) Ratio. The hotspot on the right pane is more compact and therefore has higher AP ratio and may be seen as more efficient in operational policing terms. 3. A new metric - Clumpiness Index (CI) AP ratio = 6/14 = 0.43 AP ratio = 6/10 = 0.6 Provided a modest hit rate, the actual effectiveness of a predictive solution is measured in terms the geometric complexity (compactness) and distribution of the hotspots across a geographical area. We propose a new metric called Clumpiness Index (CI) which measures the compactness and distribution of hotspots and is robust to scaling issues of AP ratio. The Clumpiness Index (CI) was originally proposed by Turner (1989) as Contagion Index for measuring the overall clumpiness of categorical patches on a landscape. CI is able to measure effectively both patch type interspersion (i.e. the intermixing of units of different patch types) as well as patch dispersion (i.e. the spatial distribution of a patch type) at the landscape level. CI is computed by first summarising the adjacency of all cells in an adjacency matrix, which shows the frequency with which different pairs of patch types (including adjacencies between the same patch type) appear side-by-side on the map. CI is defined as follows:

G i = ( g ii m k=1 g ik ) ; CI = G i P i 1 P i for G i P i G i P i 1 P i for G i < P i ; P i 0.5 P i G i { P i for G i < P i ; P i < 0.5 (2) g ii = number of like adjacencies (joins) between pixels of patch type (class) i g ik = number of adjacencies (joins) between pixels of patch types (classes) i and k P i = proportion of the landscape occupied by patch type (class) i. The goal of CI is to determine the maximum value of g ii for any P i The CI takes values between -1 (when the class is maximally disaggregated) to 1 (when the class is maximally aggregated corresponding to a checkerboard arrangement). 4. Case Study 4.1 Camden Borough of London Camden Borough is one of the 12 inner boroughs of London City with an estimated 224,962 inhabitants as of 2011. The population density is estimated as approximately 10,000 people per square kilometre (2011 Census, Office of National Statistics). The Borough contains a mixture of commercial and residential areas with the busiest parts being the Camden Market and Covent Garden and Holborn. The borough recorded a crime rate of 145 crimes per 1,000 people in 2010/11, the national average being 75 crimes per 1,000 people (Source: Metropolitan Police Service website, 2014). 4.2 Effectiveness of hotspot for operational policing Three different crime types are used in this analysis. They are shoplifting, violence-against-persons and burglary (in-dwellings) crimes. The data points are aggregated to a grid system of 250m by 250m and have temporal resolution of 1 day. The time period predicted is between 28/09/2011 and 6/01/2012, predicting 2 days ahead using the crime risk surface produced on each day. To check the predictions against the real dataset, we overlay future crime on the predictive surface generated. For example, validating the prediction on day t n means overlaying crime data from day t n+1 to day t n+2 on the predictive surface generated on day t n. By so doing, the proportion of crimes that are captured by the ranked top 20% of the grids squares (hotspots) is evaluated (assuming that police only has resources to cover just 20% of the Camden). Two hotspots predictive methods are used, namely (i) Prospective Hotspot (Bowers et al. 2004) and (ii) Kernel Density Estimation (KDE) method. Figure 2 represents the average of percentage hit rate over the prediction period. Also included in Figure 2 is baseline prediction which is generated by way of picking grid squares with equal probability (random) until 20% coverage is attained. This is represented with the green line. The general performance of these predictive methods in terms of hit rates follows the spatial concentration of different crime types with highly spatially concentrated crime type showing highest hit rates. For example, shoplifting crime is highly concentrated in a few regions near commercial areas and therefore shows the highest hit rates whereas prediction of burglary crimes is lowest as residential properties are dispersed across the entire borough.

% Crime Predicted Shoplifting Violence-Against-Persons Burglary-in-dwellings % Area Flagged % Area Flagged % Area Flagged Figure 2 Average % hit rates over the prediction period In measuring hotspot compactness, we adapted CI to crime hotspots by classifying grid squares constituting the predictive surface into two types, namely (1) Hot Spot the top 20% ranked grids and (2) Cold Spot the remaining 80%. Figure 3 shows examples of predictive surfaces generated by Prospective Hotspot method and random grid selection to illustrate how AP ratio and CI vary with different hotspot configurations. In each example, we consider two spatial scalings: the original data (unchanged), and data scaled such that each grid square has unit length. The AP ratio is observed to change at different scales of measurement of the same surface, making it difficult to compare. However, CI remains the same at any given scale. This is because CI is based on the adjacencies as well as the proportion of the hotspot grids across total surface. Therefore, CI is able to provide a sense of dispersion from a complete disaggregation (CI = -1) of the hotspot units.

Predictive Predictive Surface Surface 1 1 (Random (Random Predictive gridding) gridding) Surface 1 (Random gridding) Predictive At a At At Surface unit a a unit grid 1 grid: size: size: (Random AP ratio AP gridding) ratio = 0.19 At AP a ratio unit = 0.19 = grid 0.19 size: CI = CI 0 AP = 0 CI = ratio -0.45 = 0.19 At actual At a unit grid size: At CI actual = 0grid grid measurement: AP ratio AP ratio = 0.19 AP At ratio actual = 68.02 68.02 grid measurement: CI = CI At = actual 0 grid measurement: CI 0 AP AP ratio ratio = = 68.02 68.02 At CI actual = 0-0.45 grid measurement: AP ratio = 68.02 CI = 0 Predictive surface 2 2 (Prospective Predictive surface Hotspot) 2 (Prospective Hotspot) At At a unit unit grid grid size: size: Predictive At surface 2 AP AP a unit (Prospective ratio ratio grid: Hotspot) 0.37 = 0.37 AP At CI CI = ratio a unit = 0.42 0.42 = grid 0.37 size: AP ratio 0.37 CI At = 0.32 a unit size: At CI At = actual 0.42grid grid measurement: AP ratio 0.37 At AP AP ratio ratio = = 95.31 95.31 At CI actual 0.42 grid measurement: CI CI = = 0.42 0.42 AP ratio = 95.31 CI At actual 0.42 grid measurement: CI = 0.32 AP ratio = 95.31 CI = 0.42 0 0 800 800 1,600 1,600 2,400 2,400 3,200 3,200 Meters Meters 0 800 1,600 2,400 3,200 0 800 1,600 2,400 3,200 Meters Meters Predictive Predictive surface surface 3 3 (Prospective (Prospective Hotspot) Hotspot) Predictive Predictive surface surface 3 (Prospective At a At unit a unit grid Hotspot) grid size: size: AP ratio AP ratio = 0.66 = 0.66 CI = CI At 0.88 At = a 0.88 a unit unit grid grid: size: AP AP ratio ratio = 0.66 = 0.66 At actual At CI actual = 0.88 grid grid measurement: AP ratio CI AP ratio = 164.58 0.54 = 164.58 CI = CI At 0.88 = actual 0.88 grid measurement: AP At ratio actual = 164.58 grid measurement: CI AP = 0.88 ratio = 164.58 CI = 0.54 Legend Legend Legend 3 3Hot Hot Spot Spot 3 3 Hot Hot Spot Spot Cold Cold Spot Spot Cold Cold Spot Spot AP AP = = Area-to-Perimeter CI CI AP = Clumpiness Index Index AP = Area-to-Perimeter CI CI = = Clumpiness Index 5. Discussion and Conclusion Figure 3 Evaluating hotspot compactness with AP ratio and CI This study examines the use of existing metrics for measuring the effectiveness of predictive hotspots for operational policing. We highlighted some of the limitations of AP ratio, a metric that is specifically designed to evaluate effectiveness of predictive hotspots. We then proposed a new metric called Clumpiness Index CI which is able to eliminate the limitations of AP ratio. This study first established that the two predictive methods used (i.e. Prospective Hotspot and KDE) are able to predict crimes well above the baseline predictions (random), and their performances are observed to vary according to the spatial concentration of different crime types. The CI was then used to provide more interpretable assessment of hotspot compactness which is found to be very robust to change in scales of spatial units of analysis. In addition, the CI calculation provided a baseline of comparison i.e. either of maximally aggregated (CI = -1) or maximally disaggregated (CI = 1) hotspot configuration, giving a sense how easy the hotspots can be covered by the police. As with AP ratio however, the CI being a single-valued metric requires visualisation of the purported hotspot to make perfect meaning out of it. 6. Acknowledgements This research is part of the CPC (Crime, Policing and Citizenship) project supported by UK EPSRC (EP/J004197/1), in collaboration with the London Metropolitan Police. 7. Biography Monsuru Adepeju is currently 2 nd year PhD student at the SpaceTimeLab for Big Data Analytics, at University College London. His PhD focuses on predictive modelling of space-time hotspot of crime and his research interests include validation of crime predictive models for predictive policing and development of usable predictive tools for operational environment.

Tao Cheng is a Professor in GeoInformatics, and Director of SpaceTimeLab for Big Data Analytics (http://www.ucl.ac.uk/spacetimelab), at University College London. Her research interests span network complexity, Geocomputation, integrated spatio-temporal analytics and big data mining (modelling, prediction, clustering, visualisation and simulation), with applications in transport, crime, health, social media, and environmental monitoring. John S Shawe-Taylor is a professor at University College London (UK) where he is co-director of the Centre for Computational Statistics and Machine Learning (CSML). His main research area is Statistical Learning Theory, but his contributions range from Neural Networks, to Machine Learning, to Graph Theory. He has coordinated a number of European wide projects investigating the theory and practice of Machine Learning, including the NeuroCOLT projects. Kate Bowers is a Professor in Crime Science at the UCL Department of Security and Crime Science. Kate has worked in the field of crime science for almost 20 years, with research interests focusing on the use of quantitative methods in crime analysis and crime prevention. References Bowers K J, Johnson S D and Pease K (2004). Prospective hot-spotting the future of crime mapping? British Journal of Criminology, 44, 641-658. Telep C W and Weisburd D (2014). Hot Spots and Place-Based Policing. In Encyclopedia of Criminology and Criminal Justice, Springer New York, pp. 2352-2363. Turner M G (1989). Landscape ecology: the effect of pattern on process. Annual Review of Ecology and Systems, 20: 171 197 Weisburd D (2008). Place-based policing, Ideas in American policing. Police Foundation, Washington, DC. http://www.policefoundation.org/content/place-based-policing. Accessed on 7/11/2014