Evaluating Travel Impedance Agreement among Online Road Network Data Providers

Similar documents
VGI and formal data. EEO / AGI (Scotland) seminar. David Fairbairn Newcastle University School of Civil Engineering & Geosciences

Collecting a Ground Truth Dataset for OpenStreetMap

ASSESSING THE QUALITY OF OPEN SPATIAL DATA FOR MOBILE LOCATION-BASED SERVICES RESEARCH AND APPLICATIONS

Exploratory Completeness Analysis of Mapillary for Selected Cities in Germany and Austria

Assessing OSM Road Positional Quality With Authoritative Data

INTRODUCTION. 13 th AGILE International Conference on Geographic Information Science 2010 Page 1 of 15 Guimarães, Portugal

Automated Assessment and Improvement of OpenStreetMap Data

An open source approach for the intrinsic assessment of the temporal accuracy, up-todateness and lineage of OpenStreetMap

Getting Started with Community Maps

Sensitivity of estimates of travel distance and travel time to street network data quality

Application of WebGIS and VGI for Community Based Resources Inventory. Jihn-Fa Jan Department of Land Economics National Chengchi University

Road Network Impedance Factor Modelling Based on Slope and Curvature of the Road

Development of a server to manage a customised local version of OpenStreetMap in Ireland

Internet GIS Sites. 2 OakMapper webgis Application

Is OSM Good Enough for Vehicle Routing? A Study Comparing Street Networks in Vienna

Quality Evaluations on Canadian OpenStreetMap Data

V1.0. Session: Labelled Maps Verification, entering names into a GIS and Google Maps/Earth. Pier-Giorgio Zaccheddu

Quality Assessment of Volunteered Geographic Information: An Investigation into the Ottawa-Gatineau OpenStreetMap Database

BROOKINGS May

Basic Map Skills for the Outdoors

Provision of Web-Based Childcare Support Maps by Local Governments in Japan

Road & Railway Network Density Dataset at 1 km over the Belt and Road and Surround Region

GIS ANALYSIS METHODOLOGY

Humans as Sensors: Citizen Science Data to Assess Climate Change

NR402 GIS Applications in Natural Resources

DATA 301 Introduction to Data Analytics Geographic Information Systems

CHAPTER 22 GEOGRAPHIC INFORMATION SYSTEMS

All About Spatial Data. Find it, Manage it, Use it

Cross-Linkage Between Mapillary Street Level Photos and OSM Edits

Base Maps: Creating, Using & Participating

Salisbury University: Eric Flint, John O Brien, & Alex Nohe

file://q:\report1\greenatlasfinalreportindex.html

Washington Master Address Services: Project Overview Ben Vaught, OCIO David Wright, DOR Craig Erickson, DOH Tom Kimpel, OFM

presents challenges related to utility infrastructure planning. Many of these challenges

GIS CONCEPTS Part I. GIS ON THE WEB Part II

SCAUG Community Maps Building a Living Atlas of the World

Development of an automated matching algorithm to assess the quality of the OpenStreetMap road network

These modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS.

Key Points Sharing fosters participation and collaboration Metadata has a big role in sharing Sharing is not always easy

GIScience: Current Technology. Michael F. Goodchild University of California Santa Barbara

Understanding Community Mapping as a Socio-Technical Work Domain

What is 511? Need for 511 Services. Development & Deployment of Regional Road and Weather Information Supporting 511 Traveler Services

Welcome! Power BI User Group (PUG) Copenhagen

Finding Common Ground Through GIS

Texas A&M University

Multi agent Evacuation Simulation Data Model for Disaster Management Context

GeoSUR SRTM 30-m / TPS

ArcGIS for Applied Economists Session 2

Subwatersheds File Geodatabase Feature Class

What are we like? Population characteristics from UK censuses. Justin Hayes & Richard Wiseman UK Data Service Census Support

Working with ArcGIS Online

International Journal of Computer Sciences and Engineering Open Access. Design and Development of tool for assessing OpenStreetMap Completeness

Transit Time Shed Analyzing Accessibility to Employment and Services

ASSESSMENT OF THE HOMOGENEITY OF VOLUNTEERED GEOGRAPHIC INFORMATION IN SOUTH AFRICA L. Siebritz a, G. Sithole b, S. Zlatanova c

SPANISH GOOD (AND NO SO GOOD) PRACTICES IMPLEMENTING INSPIRE

You are Building Your Organization s Geographic Knowledge

Google Maps and Beyond

Marine Transportation and Ocean Use

Among various open-source GIS programs, QGIS can be the best suitable option which can be used across partners for reasons outlined below.

ArcGIS Online Routing and Network Analysis. Deelesh Mandloi Matt Crowder

INSTITUTE OF POLICY AND PLANNING SCIENCES. Discussion Paper Series

Zielstra and Hochmair page 1 of 17

Geographical Bias on Social Media and Geo-Local Contents System with Mobile Devices

GED 554 IT & GIS. Lecture 6 Exercise 5. May 10, 2013

Assessment of Logical Consistency in OpenStreetMap Based on the Spatial Similarity Concept

The One and Many Maps: Participatory and Temporal Diversities in OpenStreetMap

Census Transportation Planning Products (CTPP)

Application of GIS in Public Transportation Case-study: Almada, Portugal

DataShine Automated Thematic Mapping of 2011 Census Quick Statistics

The World Bank and the Open Geospatial Web. Chris Holmes

The Platform Generation. Derek Law and Ebony Wicks

Integrating Origin and Destination (OD) Study into GIS in Support of LIRR Services and Network Improvements

CyberGIS: What Still Needs to Be Done? Michael F. Goodchild University of California Santa Barbara

ArcGIS API for Python for Data Scientists. Andrew Chapkowski Alberto Nieto

Mobile GIS Application for Khartoum Public Transportation Network

Using GIS to Determine Goodness of Fit for Functional Classification. Eric Foster NWMSU MoDOT

Updating the Urban Boundary and Functional Classification of New Jersey Roadways using 2010 Census data

Tools to Assess Local Health Needs. Richard Leadbeater, Esri NACo 2011 Healthy Counties Forum December 1, 2011

Quality analysis of the Parisian OSM toponyms evolution

Evaluating e-government : implementing GIS services in Municipality

Improving Geographical Data Finder Using Tokenize Approach from GIS Map API

BROADBAND DEMAND AGGREGATION: PLANNING BROADBAND IN RURAL NORTHERN CALIFORNIA

Welcome to NR502 GIS Applications in Natural Resources. You can take this course for 1 or 2 credits. There is also an option for 3 credits.

INDOT Office of Traffic Safety

Smart Citizens. Maria Antonia Brovelli Politecnico di Milano, Italy

Spatial Data Infrastructure Concepts and Components. Douglas Nebert U.S. Federal Geographic Data Committee Secretariat

Dynamic Maps and Historical Context

ISPRS Hanover Workshop, Crowdsourced Mapping: Letting Amateurs into the Temple?

LIRR Routes, New York NY, January 2017

GeoPostcodes. Luxembourg

Technical Memorandum #2 Future Conditions

Healthsites.io: The Global Healthsites Mapping Project

DATA SOURCES AND INPUT IN GIS. By Prof. A. Balasubramanian Centre for Advanced Studies in Earth Science, University of Mysore, Mysore

How do Free and Open Geodata and Open Standards fit together?

Comparative analysis of online mapping sites on a case study of Sofia city center

Discovery and Access of Geospatial Resources using the Geoportal Extension. Marten Hogeweg Geoportal Extension Product Manager

Tax Jurisdiction Sourcing Data Bases

Quality assessment of professional and VGI geo-data in The Netherlands

Transcription:

Evaluating Travel Impedance Agreement among Online Road Network Data Providers Derek Marsh Eric Delmelle Coline Dony Department of Geography and Earth Sciences University of North Carolina at Charlotte

GoogleMaps 205 miles Rand McNally 205.4miles

Yahoo 205.93 miles Open Mapquest 205.38 miles

Online geographic data providers 1 Web services such as: Google Maps, Bing Maps, MapQuest Provide unprecedented access to spatial data and analytical tools geocoding addresses identifying points of interest determining travel directions Simple network analysis without the need for a GIS network dataset No data preparation necessary Available to GIS and non-gis users alike

Online geographic data providers 1 For sizeable use, generally require a paid license Directions service requests are limited otherwise Google Maps 2,500/day Bing Maps 10,000/90-days MapQuest 5,000/day An alternative is using openly sourced, public domain volunteered geographic information (VGI) MapQuestOpen unlimited

Volunteered Geographic Information 1 the widespread engagement of large numbers of private citizens, often with little in the way of formal qualifications, in the creation of geographic information (Goodchild 2007) One of the most successful examples of VGI, OpenStreetMap (OSM), offers a free, editable map of the world with no restrictions governing use for spatial analysis

VGI data quality 1 Despite VGI s potential, the question remains: What is the quality of this data? Because participants potentially lack any formal training in geographic data collection, central coordination is weak to non-existent, and adherence to a particular data structure is not required, no assumptions can be made about the overall quality of uploaded data (Goodchild & Li 2012)

Literature VGI data quality - Comparative assessments 2 Girres & Touya (2010) In comparison to the French National Mapping Agency, point positional displacement was on average 6.65 meters Haklay (2010) In comparison to the Ordnance Survey of Great Britain, greater than 81% overlap among major roads and an average of 6 meters point displacement of the OSM dataset within study sites across London Ciepłuch et al. (2010) In comparison to Google Maps and Bing Maps, accuracy is inconsistent among all three providers

VGI data quality - Indicator assessments 2 if one individual contributes an error, others can be expected to edit and correct the error, and the success of this mechanism rises in proportion to the number who look at the contribution (Linus law) (Goodchild & Li 2012) Haklay et al. (2010) Positional accuracy improved with an increase in the number of contributors up to a threshold (n>13) at which improvement stabilized Keßler & Groot (2010) Without a reference dataset, the volume of user contribution to an area or object in OSM is positively correlated to trustworthiness of the dataset

Research objectives and 3 questions I. Evaluating the Uncertainty of Travel Impedance Estimates What is the degree of uncertainty in travel impedance estimates among online road network data providers? Do routes calculated using VGI data present significantly different travel impedance estimates in comparison to commercial online spatial datasets? II. VGI User Contribution Applying Linus s Law at the Network Object Level Correlation between number of contributors and level of agreement?

Methodology 4 Identify O-D Pairs Travel Estimation: d ij,k, t ij,k Origins Destinations Network Snapping Tertiary Roads O-D Pairs Lat/Long Points Batch Routing Network Provider API Google Maps ArcGIS Online JavaScript Object Notation (JSON) Travel Time & Distance Estimates OpenStreetMap Linus s Law Disagreement Assessment: Distance weighted contributor average Route Contributor Average ݓ ) C a = ݐ ) ݓ * c Store Contributor Information Network Metadata API OpenStreetMap Identify route road segments Network Metadata API MapQuest Open OpenStreetMap 1. Difference (Δd, Δt) among online providers 2. Percent Difference 3. Correlation (r)

Case study area 4 North Carolina offers several clear urban locations, a diverse road network, and a range of topographical environments to assess road network uncertainty.

Methodology 4 Origins Destinations Identify O-D Pairs Network Snapping Tertiary Roads O-D Pairs Lat/Long Points Remove limited access roads from network dataset. Origins and destinations selected from tertiary roads Modified dataset segmented at nodes; begin nodes serve as candidate origin and destination points Specific implementation study area dependent; discussed further in results Select n*2 number of randomly distributed of candidate points used to form n number of origindestination (OD) pairs Store OD pairs in text file as latitude, longitude and unique identifier

Example of North Carolina - (total = 100,000 OD pairs): 14,300 pairs are selected in each of seven distance intervals: 0-50 kilometers (km), 50-100 km, 100-150 km, 150-200 km, 200-250 km, 250-500 km and 500-1000 km. It was necessary to increase the range of the category intervals for the longer distances to accomplish an equally stratified sample. Results OD selection 6 Road network, State of North Carolina Exclude interstate highways Identify begin and nodes of all resulting road segments Exclude begin nodes in the proximity of highways (incorrect snapping) (*) 300 pairs of vertices were selected at random for each county (stratified random sampling of vertices) P =306,788 Q =47,059,285,078 P =30,000 P c = 300(*) Q =449,985,000

Results OD selection 6 Ex) North Carolina All pairs of OD points Spider map of OD pairs originating or terminating in Ashe County

Methodology Online data providers (k): Reference Datasets: Google Maps (TeleAtlas) ArcGIS Online (NavTeq) VGI Dataset: OpenStreetMap Technical Issues: Google Directions API limited to 2,500 requests per day ArcGIS Online requires license OpenStreetMap directions algorithm provided by MapQuest Open Assuming no significant difference due to heuristic or routing algorithm Travel estimations do not account for traffic or other realtime data Precision limited to 1/10 th mile Batch Routing Network Provider API Google Maps ArcGIS Online OpenStreetMap JavaScript Object Notation (JSON) 4 Travel Estimation: d ij,k, t ij,k In Python: For each OD pair, a URL string is formed that includes the network provider web address, OD coordinates, and routing specifications. A new URL is created for each provider, k. Results returned in JavaScript Object Notation (JSON), an easily read data format that uses key-value pairs. Travel Time & Distance Estimates

Methodology Travel Impedance Estimates d ij : travel distance t ij : travel time Batch Routing Network Provider API Google Maps ArcGIS Online OpenStreetMap JavaScript Object Notation (JSON) 4 Travel Estimation: d ij,k, t ij,k Travel Time & Distance Estimates ODIndex originlat originlng destinationlat destinationlng GoogleMile GoogleMin ArcGISMile ArcGISMin OSMMile OSMMin 1 35.2458-80.8045 35.2261-80.9443 10.4 18 11.1 17.4 10.4 17.0 2 35.0783-80.8225 35.0758-80.8946 5.5 13 5.5 13.0 5.5 12.1 3 35.2163-80.7872 35.1013-80.8255 10.3 21 10.2 19.7 10.1 16.5 4 35.2355-80.7941 35.2922-80.9497 11.5 20 12.1 20.2 12.1 20.2 5 35.2606-80.8525 35.0418-80.8542 18.9 24 18.8 24.7 18.9 22.1 6 35.2185-80.7703 35.4468-80.8836 21.4 25 21.8 27.1 21.9 25.9 7 35.2212-80.8299 35.1972-80.7583 5.1 8 4.8 8.0 4.9 7.2 8 35.3304-80.7344 35.2503-80.9238 14.7 17 14.7 18.5 14.8 19.0 9 35.0441-80.7744 35.3107-80.7203 26.9 28 26.7 30.0 27.0 30.3 10 35.4195-80.8763 35.1178-80.9753 27.7 32 27.6 34.5 27.7 35.8

Results 6 Low uncertainty in estimated travel distance ArcGIS Online overestimates

Results 6 Correlation Coefficients NC Outlier(s) Google Maps includes ferries in the routing calculation

Results 6

What about contributors?

Methodology 4 = Segment distance ݓ (selected) = Total ݓ = ݐ segment distance c ݓ = Number of segment contributors Distance weighted contributor average Route Contributor Average ݓ ) C a = ݐ ݓ ) C a = ݐ ) ݓ * c * c ݓ ) Fewer contributors are required to validate shorter road segments, Linus s Law Store Contributor Information Network Metadata API OpenStreetMap Identify route road segments Network Metadata API MapQuest Open OpenStreetMap but a higher proportion of contributors is needed to verify the accuracy of a longer route A sample of road segments is used from the total route; thus, the user average is proportional to the length of known road segments

Results Linus s Law 6 North Carolina OD pairs Level of uncertainty decreases as number of contributors increases Initial increase in uncertainty corresponds to greatest sample of contributor averages (overall average = 3.27) Large number of outliers

Results at different distances 6 0-25mi 25-75mi 75-250mi >250mi

Discussion and conclusion 7 Correlation coefficients and percent difference both resulted in relatively high agreement. 1. Uncertainty was extremely low at long travel distances 2. Shorter, county wide distances showed greater uncertainty among all providers 3. The VGI dataset OSM was as reliable as the two commercial providers in estimating travel distance 4. OpenStreetMap may be a viable dataset for routing and navigation purposes within the selected study areas

Discussion and conclusion 7 VGI User Contribution Applying Linus s Law at the Network Object Level 1. Disagreement decreases with increasing number of contributors 2. Relationship not uniform across different route lengths.

Future research opportunities 7 Approach could be expanded to new areas of the OSM dataset (e.g. other regions and countries) Urban travel Rural travel Analyze overlap among individual routes to explain where and why travel impedance uncertainty occurred Is the trend of the Linus s Law valid in other states and other countries?

Thank you Derek Marsh Eric Delmelle Coline Dony Department of Geography and Earth Sciences University of North Carolina at Charlotte

Results Correlation Coefficients Mecklenburg County 6 Greater uncertainty across all providers Correlation still high Same pattern of under/overestimation Greater uncertainty at 20-35 miles

Results Percent Difference Mecklenburg County 6 Trend in correlation plots are corroborated by percent difference ArcGIS Online produces greater uncertainty around 15 miles OSM has greater uncertainty at 30 miles