Improved Maritime Statistics with Big Data

Similar documents
Seatrack Web Developments

ANNEX 23 RESOLUTION MSC.231(82) ADOPTION OF AMENDMENTS TO THE EXISTING MANDATORY SHIP REPORTING SYSTEM IN THE GULF OF FINLAND

An Attempt to Predict Manoeuvring Indices Using AIS Data for Automatic OD Data Acquisition

USING MASSIVE AMOUNTS OF MARINE AIS DATA TO EVALUATE MARINE TRAFFIC

Paul Bridge Meteorologist Vaisala/UKMO Work Groups/Committees: WMO/TRB/AMS

The production and use of a hydrographic flow-direction network of surface waters. Rickard HALLENGREN, Håkan OLSSON and Erik SISELL, Sweden

NATIONAL REPORT OF ESTONIA

Winter weather and municipal winter road maintenance

D6.6 Crowd-sourcing of ice information

The prediction of passenger flow under transport disturbance using accumulated passenger data

IHO STAKEHOLDERS FORUM. Hydrographic data and its role in MSDI. Thursday 27 September Jens Peter Hartmann KMS

Ufs. No Notices to Mariners, SWEDEN Swedish Maritime Administration.

Land Navigation Table of Contents

23 rd BSHC Meeting Agenda item B August 2018 National Report of Sweden Aalborg, DENMARK

C o a s t a l p o l l u t i o n

GEOGRAPHIC INFORMATION SYSTEMS Session 8

National Report of Finland

Using systematic diffusive sampling campaigns and geostatistics to map air pollution in Portugal

Sea ice charts and SAR for sea ice classification. Patrick Eriksson Juha Karvonen Jouni Vainio

Marine Transportation and Ocean Use

INTERNATIONAL CONVENTION FOR THE CONTROL AND MANAGEMENT OF SHIPS' BALLAST WATER AND SEDIMENTS, 2004

Towards coherent maritime spatial planning in the Baltic Sea Region, transnational and project perspective Talis Linkaits Head of VASAB Secretariat

Appropriation Directions for 2007

The challenge of linking or integrating data on Buildings

Advancing Real Time Observations and Coastal Modeling Forecasts-

Development and Use of the Swedish Road Weather Information System

Quality and Coverage of Data Sources

The HELCOM Baltic Sea Action Plan and Marine Spatial Planning

Planning Road Networks in New Cities Using GIS: The Case of New Sohag, Egypt

Travel Time Calculation With GIS in Rail Station Location Optimization

North Sea Ballast Water Exchange Area

Education in Maritime Spatial Planning European Maritime Days May 22, 2012 Gothenburg

1.1 What is Site Fingerprinting?

INSPIRE in Sweden.

Geospatial Data Solutions: Site and Corridor Siting Projects. Rachel Turney-Work

Weather and ice information as a tool for arctic marine and offshore services

Baltic Sea Region cooperation in Maritime Spatial Planning - HELCOM/VASAB

Your web browser (Safari 7) is out of date. For more security, comfort and the best experience on this site: Update your browser Ignore

Compact guides GISCO. Geographic information system of the Commission

Better estimation of Flood Wave Propagation Time in Meandering Reaches by using 2D-modelling

Managing bathymetric data in a hydrographic survey company

Monmouth County Adam Nassr (Partner: Brian Berkowitz, Ocean County)

Accessibility to urban areas of different sizes

INSPIRE a backbone in spatially enabling the digital maritime society in the Baltic Sea region

A route map to calibrate spatial interaction models from GPS movement data

IMO ROUTEING OF SHIPS, SHIP REPORTING AND RELATED MATTERS. Amendments to the existing mandatory ship reporting system In the Gulf of Finland

Adding value to Copernicus services with member states reference data

Safety evaluations of level crossings

COMPETITION BETWEEN CONTAINER PORTS IN THE NORTHERN ADRIATIC

Maritime Spatial Planning: Transboundary Cooperation in the Celtic Seas Looking Ahead

5/3/2018 Susceptibility analysis of landslide due to earthquake due in Gorkha (25th April 2015) Animesh Bahadur Pradhan GIS IN WATER RESOURCES CE 547

Development of a tool for proximity applications

Arctic Hydrographic Adequacy an Update

AERODROME METEOROLOGICAL OBSERVATION AND FORECAST STUDY GROUP (AMOFSG)

Stochastic prediction of train delays with dynamic Bayesian networks. Author(s): Kecman, Pavle; Corman, Francesco; Peterson, Anders; Joborn, Martin

1. Baltic SCOPE Towards coherence and cross-border solutions in Baltic Maritime Spatial Plans

Progress of UN-GGIM: Europe Working Group A on Core Data

Darren Wright Maritime Services Program Manager Center for Operational Oceanographic Products and Services (CO-OPS)

Statistical-geospatial integration - The example of Sweden. Marie Haldorson Director, Statistics Sweden

GIS Workshop Data Collection Techniques

EA SEA-WAY Project. 7 th Coordination Meeting. WP5 Development of sustainable passenger transport models for the Adriatic basin and capacity building

EO Information Services. Assessing Vulnerability in the metropolitan area of Rio de Janeiro (Floods & Landslides) Project

ENV208/ENV508 Applied GIS. Week 1: What is GIS?

Swedish Meteorological and Hydrological Institute

DATA SOURCES AND INPUT IN GIS. By Prof. A. Balasubramanian Centre for Advanced Studies in Earth Science, University of Mysore, Mysore

Visitor Flows Model for Queensland a new approach

Environmental Risk from Ship traffic along the Norwegian Coast Odd Willy Brude, Det Norske Veritas, Veritasveien 1, 1322 Høvik, Norway

What are the five components of a GIS? A typically GIS consists of five elements: - Hardware, Software, Data, People and Procedures (Work Flows)

City and SUMP of Ravenna

GIS Support for a Traffic. Delaware

maritime spatial planning in the baltic sea

Landslide Mapping and Hazard Analysis for a Natural Gas Pipeline Project

CENTRAL ASIAN REPUBLICS AND THEIR NEW TRANSPORT DEMANDS

Continental Divide National Scenic Trail GIS Program

Digitalization in Shipping

"Transport statistics" MEETING OF THE WORKING GROUP ON RAIL TRANSPORT STATISTICS. Luxembourg, 25 and 26 June Bech Building.

Digital Cartography in the Royal Library - the National Library of Sweden

Application of GIS in Public Transportation Case-study: Almada, Portugal

Digital Mariner s Routeing Guide An Exploration of the Standardization and Online Delivery of Marine Information

Commercialisation. Lessons learned from Dutch weather market

Progress of UN-GGIM: Europe Working Group A on Core Data

Contribution of Norwegian partners (Aanderaa Data Instruments and NIVA) to Safeport project ( ). Final report

WINTER MAINTENANCE DECISIONS SUPPORT SYSTEMS IN POLAND Paweł Sobiesiak

New Artificial Intelligence Technology Improving Fuel Efficiency and Reducing CO 2 Emissions of Ships through Use of Operational Big Data

DRAFT PROGRAM Registration of participants, welcome coffee, exhibition tour

Spanish national plan for land observation: new collaborative production system in Europe

Interactive GIS in Veterinary Epidemiology Technology & Application in a Veterinary Diagnostic Lab

Give 4 advantages of using ICT in the collection of data. Give. Give 4 disadvantages in the use of ICT in the collection of data

Marine Spatial Planning in the Baltic Sea Region

Rail Baltica Growth Corridor Driver of Change

Trip Distribution Model for Flood Disaster Evacuation Operation

The Added Value of Geospatial Data in a Statistical Office. Pedro Diaz Munoz Director Sectoral and Regional Statistics EUROSTAT European Commission

Country Report on SDI Activities in Singapore *

CadasterENV Sweden Time series in support of a multi-purpose land cover mapping system at national scale

If you are looking for a book Finland Map in pdf format, then you have come on to loyal website. We furnish complete option of this ebook in doc,

Urban Form and Travel Behavior:

Use of big data, crowdsourcing and GIS in assessment of weather-related impact

The National Policy Strategy for Infrastructure and Spatial Planning CODE24 CONFERENCE. Emiel Reiding

National Polish services using regional products

WINTER MAINTENANCE MANAGEMENT IN ESTONIA. Kuno Männik Director of Tartu Road Office Estonian Road Administration

Transcription:

Improved Maritime Statistics with Big Data Abboud Ado 1, Annica Isaksson de Groote 2, Ingegerd Jansson 3, Marcus Justesen 4, Jerker Moström 5, Fredrik Söderbaum 6 1 Transport Analysis, Stockholm, Sweden; Abboud.Ado@Trafa.se 2 Statistics Sweden, Stockholm; Annica.Isaksson@scb.se 3 Statistics Sweden, Stockholm; Ingegerd.Jansson@scb.se 4 Statistics Sweden, Stockholm; Marcus.Justesen@scb.se 5 Statistics Sweden, Stockholm; Jerker.Mostrom@scb.se 6 Transport Analysis, Östersund, Sweden; Fredrik.Soderbaum@Trafa.se Abstract In recent years, Big Data as a potential data source for official statistics has attracted much interest. In a joint project, Statistics Sweden and Transport Analysis (the government agency responsible for official transportation statistics) have explored the potential of a type of geographical Big Data, AIS data, for improving maritime transport statistics. In this paper we focus on one important goal of the project : to evaluate the usefulness of AIS data for improving the quality of the distances between Swedish ports. Two different methods are tried and described in this paper. Our study suggests that AIS data have great potential to improve maritime statistics, but further work is needed. Keywords: AIS, position data, distance matrix, ports. 1. Introduction In recent years, Big Data as a potential data source for official statistics has attracted much interest. Big Data can basically be described as huge volumes of data generated at short intervals, often in unstructured form, which due to their complexity and volume require innovative methods and technological solutions to extract useful information. Our work is based on a special kind of Big Data with a geographical component: AIS data. AIS (Automatic Identification System) is a system that makes it possible for ships to identify and track other ships movements. The system also provides detailed information about the ships, including their identity, size, position, course, speed, type of cargo, and destination. The system relies on digital information transmitted by the ships at short intervals and received by other vessels via their own AIS equipment. The information is also received on land through a network of AIS 1

base stations. The use of AIS data is an application of Big Data undergoing strong development, and there are several national and international examples of research studies and pilot projects on the topic. So far, however, systematic implementation of AIS data in statistical production is rare. One exception is the Swedish Meteorological and Hydrological Institute which uses AIS data to calculate emissions from shipping (SMHI 2013; SMED 2012). This paper is based on a pilot study in 2015, jointly conducted by Statistics Sweden and the Swedish government agency Transport analysis. The aim of the pilot was to investigate the potential of AIS data to improve maritime statistics. Swedish official statistics on maritime transports are based on survey data from Swedish ports. Data are collected quarterly and the port offices are asked to provide information on ships arriving at the ports. A variable of interest to most users of maritime statistics is transport performance: the amount of transported goods (or number of passengers) multiplied by the travel distance. In order to calculate transport performance, information about the distances between ports is needed. At present, flat-rate tables on the distances between ports are used in the calculations. The flatrates are derived from a Port Distance Calculation Tool developed by Eurostat (Eurostat 2009). The Eurostat tool is however not ideal for this use. Above all, it is mainly developed to provide distances between ports in different European countries. The calculated distances cannot be divided between international and national waters. Also, between ports where traffic is known to occur, information on the distances is sometimes missing. In this work, we have tested the use of AIS data in order to simplify the data collection from the ports, and to improve the quality of the calculated distances between ports. 2. Selection of data The pilot study is based on a test period of two weeks in 2014: 7-20 September. Passenger traffic was excluded from the analysis. Each day generated about one million data points within the study area. 2

The geographical area used for the selected data was the entire Baltic Sea and Skagerrak and Kattegat. The purpose of this relatively wide geographical area was to capture all possible routes between Swedish ports. 3. Preparatory processing steps: In order to create a high-definition matrix using AIS data, we prepared the data in four 1. A data set of port areas was created, containing: Swedish ports areas, a dummy area representing foreign ports, and water (outside the ports). 2. AIS data points were categorized to fall into one of the three areas defined in Step 1. 3. For each transport that occurred between ports (port areas), a line connecting the AIS data points was created based on the available information on port, time stamp and Maritime Mobile Service Identity (MMSI). 4. Each line was updated with information about origin and destination of the vessel. Step 1-4 created a population of routes which was used both to verify existing model-based distance calculations and for developing a new distance matrix. From the population of routes, it is possible to select all transport operations between Swedish ports, all transport operations to or from a specific port, etc., see Figure 1. 3

Figure 1 Left: map of all traffic between Swedish ports (in purple). Right: map of all traffic to Norrköping (in purple).grey lines represent all traffic in the area Using AIS data directly to calculate travel distance and transport performance for a certain period of time provides very accurate results as AIS data depicts the actual routes of all vessels in the study area. On the other hand, a direct use of AIS data is very time consuming as it involves repeated processing of large quantities of data. Another option is to use a model approach. A model is easier to handle and gives an accurate result, however based on ideal distances between ports rather than actual measured routes. The model needs to be based on AIS data, but instead of recalculating the actual measured routes every time, the calculations can be made once and then fed into the model. The basis for a distance matrix is the created population of routes between each pair of ports. It is thus important that the created routes correspond to the common transport routes from each port to all other ports. Hence, for a good performance of the model the population of routes needs to contain enough data to cover all possible routes, ideally with a large number of observations. 4

4. Development and modelling Geographic information can be processed in two formats: vectors or raster. Typically, the vector format is used to create a network: the lines are connected by nodes and transport operations take place along the lines. We have however chosen to use raster. A raster in GIS can be summarized as a surface composed of cells with a specific definition, such as 1 x 1 meter, which contains a value for each cell. The value in the cell is chosen to represent any information necessary for modelling the network; for example, to exclude certain cells from a transport route. We have several reasons to believe that raster is more suitable than vectors for our particular purpose: Maritime transport operations take place on a surface, they are not limited to the roads the way road traffic is. Raster can contain a lot of data, because it is only a value in a cell. Raster is faster to process. Raster is flexible and easy to experiment with. New data can be added to a raster. In order to define routes on a raster surface we have calculated the minimum cost to move from an origin to a destination, where the cost is equal to friction times distance. Friction can be anything, such as a road gradient, time, speed, or wind. Several different factors can be combined and weighted to achieve the desired friction. It is the friction that determines the route. We use the following steps to create a raster-based model: 1. All lines are converted to a raster with a one kilometer resolution. 2. For each raster cell, line density is calculated. Line density determines the waterway; if the line density is high, the traffic is high and a waterway is identified. 5

3. We use the waterway combined with destination (the port we want to calculate the route to) to define the cost to travel through each cell. 4. A distance raster is created from the cell to the port (source) of interest. Each cell in the raster is assigned a value equivalent to the least accumulated cost of being transported from the cell to the source. 5. Finally, the route is calculated as the path with the least accumulated cost from specified port or ports to the port of interest based on the distance raster. Note that the traffic density is not the main factor in the weighting. It only acts as a complement to the destination which is the most important factor. With a computation of routes based only on density and distance, a calculated route can differ significantly from the actual route. This is illustrated in Figure 2. Adjustments are needed to handle this, such as destination that also can be added as a friction. Figure 2 Route from the north of Sweden to Norrkoping calculated using only line density. Another aspect is the position of the original port when routes to a destination port are calculated. Depending on the choice of traffic; only from Swedish ports, or all traffic, the results of the route calculations differ. Figure 3 illustrates these differences. Note how the 6

route from northern Gotland changes because of the transport operation coming from Lithuania. Figure 3 Calculated routes from 70 ports to Norrköping. In the left image, only transport operations from Swedish ports to Norrköping have higher weights. In the right image, all transport operations to Norrköping (even from foreign ports) have higher weights. 5. Conclusions We have developed preliminary methods for identifying routes between Swedish ports, and between Swedish and foreign ports. The same methods are likely to be useful also for processing other types of mass-generated position data (such as data from other mobile devices). The raster-based model for calculating distances has shown promising results. The model needs however further development. In summary, our study suggests that AIS data have great potential to improve maritime statistics, but further work is needed. 6. References Eurostat (2009), Methodology for Maritime Network description, Unpublished report. 7

SMED (2012), Uppdatering av typfartyg för svensk inrikes sjöfart, SMED rapport Nr 135. SMHI (2013), A dynamic model for shipping emissions. Adaptation of Airviro and application in the Baltic Sea, METEOROLOGY, 153. 8