Crowdsourcing Semantics for Big Data in Geoscience Applications

Similar documents
From Research Objects to Research Networks: Combining Spatial and Semantic Search

Ontology Summit Framing the Conversation: Ontologies within Semantic Interoperability Ecosystems

VGIscience Summer School Interpretation, Visualisation and Social Computing of Volunteered Geographic Information (VGI)

Towards Geographic Information Observatories

Automated Assessment and Improvement of OpenStreetMap Data

An Ontology-based Framework for Modeling Movement on a Smart Campus

GeoLink: Semantics and Linked Data for the Geosciences. The Web is the API.

St. Kitts and Nevis Heritage and Culture

Quality Evaluations on Canadian OpenStreetMap Data

A conceptual model for quality assessment of VGI for the purpose of flood management

Dynamic Ontology Service for Historical Persons and Places Based on Crowdsourcing

SPECCHIO for Australia: taking spectroscopy data from the sensor to discovery for the Australian remote sensing community

UC Berkeley International Conference on GIScience Short Paper Proceedings

A General Framework for Conflation

ArcGIS Enterprise: What s New. Philip Heede Shannon Kalisky Melanie Summers Sam Williamson

Spatial Data Science. Soumya K Ghosh

Weather Technology in the Cockpit (WTIC) Program Applying Cloud Technology and Crowd Sourcing to Enhance Cockpit Weather

USE AND OPTIMISATION OF PAID CROWDSOURCING FOR THE COLLECTION OF GEODATA

USE AND OPTIMISATION OF PAID CROWDSOURCING FOR THE COLLECTION OF GEODATA

Generalisation and Multiple Representation of Location-Based Social Media Data

Crowdsourcing, Citizen Science & INSPIRE

Trust as a Proxy Measure for the Quality of Volunteered Geographic Information in the Case of OpenStreetMap

Geographic Knowledge Discovery Using Ground-Level Images and Videos

INSPIRE - A Legal framework for environmental and land administration data in Europe

Visualizing Uncertainty In Environmental Work-flows And Sensor Streams

LUIZ FERNANDO F. G. DE ASSIS, TÉSSIO NOVACK, KARINE R. FERREIRA, LUBIA VINHAS AND ALEXANDER ZIPF

Collecting a Ground Truth Dataset for OpenStreetMap

Leveraging Semantic Web Techniques to Gain Situational Awareness

GIS Visualization: A Library s Pursuit Towards Creative and Innovative Research

Faculty Environmental Sciences» Department Geosciences» Chair for Geoinformationsystems Developing Scientific GDI DGEO Workshop

Ontologies and Spatial Decision Support

INTRODUCTION. 13 th AGILE International Conference on Geographic Information Science 2010 Page 1 of 15 Guimarães, Portugal

Geographic Data Science - Lecture II

Computational Biology, University of Maryland, College Park, MD, USA

Surveying, Mapping and Remote Sensing (LIESMARS), Wuhan University, China

UC Berkeley International Conference on GIScience Short Paper Proceedings

Understanding Community Mapping as a Socio-Technical Work Domain

Geo Business Gis In The Digital Organization

Convergence Classes and Spaces of Partial Functions

The Global Statistical Geospatial Framework and the Global Fundamental Geospatial Themes

Assessing pervasive user-generated content to describe tourist dynamics

Your web browser (Safari 7) is out of date. For more security, comfort and the best experience on this site: Update your browser Ignore

Citation for published version (APA): Andogah, G. (2010). Geographically constrained information retrieval Groningen: s.n.

Risk Perception, Warning Systems and Evacuation Plans for Volcanic Hazards

A Framework for Implementing Volunteered Geographic Information Systems

Spatializing Research Hypotheses a long- term research vision for

How do Free and Open Geodata and Open Standards fit together?

Semantics, ontologies and escience for the Geosciences

Test and Evaluation of an Electronic Database Selection Expert System

A Map Through Time Virtual Historic Cities

From the Venice Lagoon Atlas towards a collaborative federated system

ArcGIS is Advancing. Both Contributing and Integrating many new Innovations. IoT. Smart Mapping. Smart Devices Advanced Analytics

Text mining and natural language analysis. Jefrey Lijffijt

The Architecture of the Georgia Basin Digital Library: Using geoscientific knowledge in sustainable development

The GeoLink Modular Oceanography Ontology

Can Vector Space Bases Model Context?

Manual of Digital Earth

Understanding Interlinked Data

Non-parametric bootstrap and small area estimation to mitigate bias in crowdsourced data Simulation study and application to perceived safety

A Framework of Detecting Burst Events from Micro-blogging Streams

XXIII CONGRESS OF ISPRS RESOLUTIONS

Twenty Years of Progress: GIScience in Michael F. Goodchild University of California Santa Barbara

3D Urban Information Models in making a smart city the i-scope project case study

Best Practices in Geospatial Metadata

The INSPIRE Community Geoportal

DISTRIBUTED GEOCOMPUTATIONS AND WEB COLLABORATION

Geospatial Semantics. Yingjie Hu. Geospatial Semantics

Ontology Summit 2016: SI Track: SI in the GeoScience Session 1: How is SI Viewed in the GeoSciences"

About peer review missions related to data management

Maps as Research Tools Within a Virtual Research Environment.

Place this Photo on a Map: A Study of Explicit Disclosure of Location Information

NOKIS - Information Infrastructure for the North and Baltic Sea

Reducing Consumer Uncertainty

GIS at UCAR. The evolution of NCAR s GIS Initiative. Olga Wilhelmi ESIG-NCAR Unidata Workshop 24 June, 2003

Colin Bray, OSi CEO. Collaboration to develop a data platform for geospatial and statistical information in Ireland

International Conference on Dublin Core and Metadata Application 2004

Key Points Sharing fosters participation and collaboration Metadata has a big role in sharing Sharing is not always easy

VGI and formal data. EEO / AGI (Scotland) seminar. David Fairbairn Newcastle University School of Civil Engineering & Geosciences

Applying the Semantic Web to Computational Chemistry

A.C.R.E and. C3S Data Rescue Capacity Building Workshops. December 4-8, 2017 Auckland, New Zealand. Session 3: Rescue of Large Format and Analog Data

WHITE PAPER ON QUANTUM COMPUTING AND QUANTUM COMMUNICATION

Roadmap to interoperability of geoinformation

Basic Dublin Core Semantics

Navigating to Success: Finding Your Way Through the Challenges of Map Digitization

AS/NZS ISO :2015

BUILDING AN ACCURATE GIS

reviewed paper EU-Project: Cross-border Spatial Information System with High Added Value (CROSS-SIS) Stefan SANDMANN

Reimaging GIS: Geographic Information Society. Clint Brown Linda Beale Mark Harrower Esri

CURRICULUM VITAE Roman V. Krems

Experiences and Directions in National Portals"

A set theoretic view of the ISA hierarchy

Building streaming GIScience from context, theory, and intelligence

ZFL, Center of Remote Sensing of Land Surfaces, University of Bonn, Germany. Geographical Institute, University of Bonn, Germany

Intelligent GIS: Automatic generation of qualitative spatial information

Using spatial-temporal signatures to infer human activities from personal trajectories on location-enabled mobile devices

How to shape future met-services: a seamless perspective

Zero Hours of System Training

Abstract. Interoperable Framework for Mobile Dynamic Surveying based on open source components

Charter for the. Information Transfer and Services Architecture Focus Group

Can Volunteered Geographic Information be a participant in eenvironment and Spatial Data Infrastructures?

Transcription:

Wright State University CORE Scholar Computer Science and Engineering Faculty Publications Computer Science and Engineering 2013 Crowdsourcing Semantics for Big Data in Geoscience Applications Thomas Narock Pascal Hitzler pascal.hitzler@wright.edu Follow this and additional works at: http://corescholar.libraries.wright.edu/cse Part of the Computer Sciences Commons, and the Engineering Commons Repository Citation Narock, T., & Hitzler, P. (2013). Crowdsourcing Semantics for Big Data in Geoscience Applications. Semantics for Big Data: Papers from the AAAI Symposium. http://corescholar.libraries.wright.edu/cse/211 This Presentation is brought to you for free and open access by Wright State University s CORE Scholar. It has been accepted for inclusion in Computer Science and Engineering Faculty Publications by an authorized administrator of CORE Scholar. For more information, please contact corescholar@www.libraries.wright.edu.

Tom Narock 1 and Pascal Hitzler 2 1. University of Maryland, Baltimore County 2. Wright State University

Within the geosciences, scholars now routinely share data, publications, and code Generating and maintaining links between these items is more than just a concern for the library sciences This is beginning to have direct impact on data discovery and the way in which research is being conducted

Given known dataset can we find: Related Papers Related Presentations Related Software Tools Given a presentation can we find: Data used Software used Given spatio-temporal constraints can we find: Relevant Data Relevant Publications

However, this shift to Web-native systems has expanded information by orders of magnitude (Priem, 2013) The scale of this information overwhelms attempts at manual curation and has entered the realm of Big Data Relationships need to be identified computationally

Accuracy can be improved by applying human computation Within the last few years crowdsourcing has emerged as a viable platform for aggregating community knowledge (Alonso and Baeza-Yates, 2011) Bernstein et al. (2012a) has referred to this combined network of human and computer as the global brain

There are literally hundreds of examples of the global brain at work Bernstein (2012b) has extended this notion to the Semantic Web. Yet, we believe our application provides new insights and approaches

7 years of Geoscience conference data Earth science repository metadata 36 years of NSF funded project data Research Library Holdings This information is available or becoming available as Linked Open Data

7 years of Geoscience conference data 36 years of NSF funded project data Identify Implicit Links NLP Earth science repository metadata Research Library Holdings

7 years of Geoscience conference data 36 years of NSF funded project data Identify Implicit Links NLP Coreference Res Earth science repository metadata Research Library Holdings

7 years of Geoscience conference data 36 years of NSF funded project data Identify Implicit Links NLP Coreference Res Rules Earth science repository metadata Research Library Holdings

The accuracy of the identified relationships is paramount posing challenges for uptake and adoption Accuracy in our sense refers to the validity of a semantic statement in the real world (Hitzler and van Harmelen, 2010)

Successful volunteer crowdsourcing is difficult to replicate and efforts turning to financial compensation (Alonso and Baeza-Yates, 2011) Amazon Mechanical Turk (AMT) classic example where micro-payments are given for completing a task In the geosciences the crowd is comprised of research scientists

Malone et al. (2009) have observed a power law distribution in contributions to AMT. This pattern is also repeated in the geosciences. Neis et al. (2012) saw similar distribution of workload in OpenStreetMap A few people are doing the majority of work Many people doing little work

Volunteers do not have professional qualifications in geoscience data collection (Goodchild 2007; Nies et al., 2012) This is also true of AMT and Wikipedia where participants posses general knowledge on a topic, but are not necessarily professional experts on the subject. However, so-called citizen science will not work with crowd of research scientists. Knowledge is compartmentalized.

Build on the growing trend in Altmetrics

1. Collect user identity via Social Media 2. Provide tools to find, validate, and annotate relevant triples 3. Record provenance for user and annotations

Prototyped in Earth science community Participants provided validation and annotations Interest from geoscience community to expand scope of prototype Limited sample size and conclusions

Significant progress in provenance (PROV- O) and annotations (e.g. Annotation Ontology, Ciccarese et al., 2011) Open questions in use of provenance at Big Data scale propagation of trust within large-scale networks (Janowicz, 2009) participation of broader geoscience community

Part of NSF Promote these ideas within ocean sciences Looking at follow up project devoted to crowdsourcing Lots of interest in crowdsourcing at ISWC

Full paper available at: http://userpages.umbc.edu/~tnarock/papers/ aaai_narock_hitzler_camera_ready.pdf Symposium proceedings: http://www.aaai.org/press/reports/symposia/fall/fs-13-04.php

Alonso, O. and R. Baeza-Yates, (2011), Design and Implementation of Relevance Assessments Using Crowdsourcing, in Advances in Information Retrieval, Proceedings of the 33rd European Conference on IR Research, ECIR 2011, Dublin, Ireland, April 18-21, 2011, Edited by Paul Clough, Colum Foley, Cathal Gurrin, Gareth J. F. Jones, Wessel Kraaij, Hyowon Lee, Vanessa Mudoch, Lecture Notes in Computer Science, Volume 6611, 2011, pp 153-164, ISBN: 978-3-642-20160-8 Bernstein, A., Klein, M., and Malone, T. W., (2012a), Programming the Global Brain, Communications of the ACM, Volume 55 Issue 5, May 2012, pp 41-43 Bernstein, A., (2012b), The Global Brain Semantic Web Interleaving Human-Machine Knowledge and Computation, International Semantic Web Conference 2012, Boston, MA Ciccarese, P., Ocana, M., Castro L. J. G., Das S., and Clark, T, (2011), An Open Annotation Ontology for Science on Web 3.0, J Biomed Semantics 2011, 2(Suppl 2):S4 (17 May 2011)

Goodchild, M. F., (2007), Citizens as sensors: the world of volunteered geography, GeoJournal, 69, pp 211-221 Hitzler, P., and van Harmelen, F., (2010), A reasonable Semantic Web. Semantic Web 1 (1-2), pp. 39-44 Janowicz, K., (2009), Trust and Provenance You Can t Have One Without the Other, Technical Report, Institute for Geoinformatics, University of Muenster, Germany; January 2009 Malone, T. W., Laubacher, R., Dellarocas, C., (2009), Harnessing Crowds: Mapping the Genome of Genome of Collective Intelligence, MIT Press, Cambridge Neis, P., Zielstra, D., Zipf, A., (2012), The Street Network Evolution of Crowdsourced Maps: OpenStreetMap in Germany 2007-2011, Future Internet, 4, pp. 1-21 Priem, J., (2013), Beyond the paper, Nature, Volume 495, March 28 2013, pp 437-440.