The Nature and Value of Geographic Information. Michael F. Goodchild University of California Santa Barbara

Similar documents
THE NATURE AND VALUE OF GEOGRAPHIC INFORMATION

GIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Statistical Perspectives on Geographic Information Science. Michael F. Goodchild University of California Santa Barbara

Improving Spatial Data Interoperability

A General Framework for Conflation

[Figure 1 about here]

30 YEARS SINCE Z_GIS FOUNDING: PROVIDING SOLUTIONS FOR SOCIETY AND BUSINESS

WELCOME. To GEOG 350 / 550 Introduction to Geographic Information Science

Representation of Geographic Data

NR402 GIS Applications in Natural Resources

A spatial literacy initiative for undergraduate education at UCSB

Introduction to GIS I

Popular Mechanics, 1954

( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r C h a p t e r 1 7 : I n f o r m a t i o n S c i e n c e P a g e 1

Equational Logic. Chapter Syntax Terms and Term Algebras

Uncertainty in Geographic Information: House of Cards? Michael F. Goodchild University of California Santa Barbara

Welcome to GST 101: Introduction to Geospatial Technology. This course will introduce you to Geographic Information Systems (GIS), cartography,

Uncertainity, Information, and Entropy

Geospatial Semantics. Yingjie Hu. Geospatial Semantics

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall

Space-adjusting Technologies and the Social Ecologies of Place

17.1 Binary Codes Normal numbers we use are in base 10, which are called decimal numbers. Each digit can be 10 possible numbers: 0, 1, 2, 9.

An Introduction to Geographic Information System

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

Spatial Webs. Michael F. Goodchild University of California Santa Barbara

Modeling Discrete Processes Over Multiple Levels Of Detail Using Partial Function Application

CHAPTER 4 ENTROPY AND INFORMATION

Why Complexity is Different

Notes on BAN Logic CSG 399. March 7, 2006

Outline. Geographic Information Analysis & Spatial Data. Spatial Analysis is a Key Term. Lecture #1

GEO-INFORMATION (LAKE DATA) SERVICE BASED ON ONTOLOGY

Year 8 standard elaborations Australian Curriculum: Geography

Key Issue #1. How do geographers describe where things are? 2014 Pearson Education, Inc.

Exam 1. Problem 1: True or false

Global Grids and the Future of Geospatial Computing. Kevin M. Sahr Department of Computer Science Southern Oregon University

The Case for Space in the Social Sciences

Bandwidth: Communicate large complex & highly detailed 3D models through lowbandwidth connection (e.g. VRML over the Internet)

Amobile satellite communication system, like Motorola s

Principles of IR. Hacettepe University Department of Information Management DOK 324: Principles of IR

Welcome to NR502 GIS Applications in Natural Resources. You can take this course for 1 or 2 credits. There is also an option for 3 credits.

Lecture 1: Geospatial Data Models

Accuracy and Uncertainty

Introducing GIS analysis

Entropies & Information Theory

Transiogram: A spatial relationship measure for categorical data

Quality and Coverage of Data Sources

Geographic Information Systems (GIS) in Environmental Studies ENVS Winter 2003 Session III

Number theory (Chapter 4)

Institutional Opportunities and Constraints. Michael F. Goodchild

EPISTEMOLOGY, COMPUTATION AND THE LAWS OF PHYSICS

Lecture 8. Spatial Estimation

SPATIAL DATA MINING. Ms. S. Malathi, Lecturer in Computer Applications, KGiSL - IIM

Basics of GIS reviewed

Lecture 5 Geostatistics

Noisy channel communication

GIScience: Current Technology. Michael F. Goodchild University of California Santa Barbara

Spatial analysis. Spatial descriptive analysis. Spatial inferential analysis:

A GIS helps you answer questions and solve problems by looking at your data in a way that is quickly understood and easily shared.

cis32-ai lecture # 18 mon-3-apr-2006

Lecture 7. Union bound for reducing M-ary to binary hypothesis testing

Entropy as a measure of surprise

Designing Information Devices and Systems I Fall 2017 Official Lecture Notes Note 2

The Future of Geography in an. Society. Michael F. Goodchild University of California Santa Barbara

6196 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 57, NO. 9, SEPTEMBER 2011

SRJC Applied Technology 54A Introduction to GIS

GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis

3.3 Accumulation Sequences

Linear Systems Theory Handout

Sample. Contents SECTION 1: PLACE NAMES 6 SECTION 2: CONNECTING TO PLACES 21 SECTION 3: SPACES: NEAR AND FAR 53

GIST 4302/5302: Spatial Analysis and Modeling

THE 3D SIMULATION INFORMATION SYSTEM FOR ASSESSING THE FLOODING LOST IN KEELUNG RIVER BASIN

Learning ArcGIS: Introduction to ArcCatalog 10.1

Fundamental Spatial Concepts. Michael F. Goodchild University of California Santa Barbara

Chapter 1: Basic Concepts

Geographers Perspectives on the World

Modal Logic. UIT2206: The Importance of Being Formal. Martin Henz. March 19, 2014

Key Words: geospatial ontologies, formal concept analysis, semantic integration, multi-scale, multi-context.

Physics is becoming too difficult for physicists. David Hilbert (mathematician)

STATE GEOGRAPHIC INFORMATION DATABASE

Motion Fundamentals. 1 Postulates. 2 Natural Progression. Thomas Kirk

Interpolating Raster Surfaces

FNRM 3131 Introduction to GIS in Natural Resource Management

Vision 2020: Focus on the Future. Michael F. Goodchild University of California Santa Barbara

VIDEO Intypedia008en LESSON 8: SECRET SHARING PROTOCOL. AUTHOR: Luis Hernández Encinas. Spanish Scientific Research Council in Madrid, Spain

CSCI 2200 Foundations of Computer Science Spring 2018 Quiz 3 (May 2, 2018) SOLUTIONS

Logic for Computer Science - Week 4 Natural Deduction

Convex Hull-Based Metric Refinements for Topological Spatial Relations

CSE468 Information Conflict

Nature of Spatial Data. Outline. Spatial Is Special

Delta School District 1

Modeling Temporal Uncertainty of Events: a. Descriptive Perspective

Quality Assessment and Uncertainty Handling in Uncertainty-Based Spatial Data Mining Framework

MODULE -4 BAYEIAN LEARNING

UNIT I INFORMATION THEORY. I k log 2

Nested Epistemic Logic Programs

Chapter 17. Proof by Contradiction The method

GEOGRAPHY 350/550 Final Exam Fall 2005 NAME:

OBEUS. (Object-Based Environment for Urban Simulation) Shareware Version. Itzhak Benenson 1,2, Slava Birfur 1, Vlad Kharbash 1

Summer Research Projects for First Year Students

Transcription:

The Nature and Value of Geographic Information Michael F. Goodchild University of California Santa Barbara

What is the GI in GIScience?! Is information stuff? if it is, then it must be possible to measure its quantity Q(A+B) = Q(A)+Q(B) a market in GI requires that such means be agreed otherwise all transactions would be unique and no market could exist conventional metrics of quantity are arbitrary, media-dependent, structure-dependent e.g. per sq km, per quadrangle, per megabyte

Shannon-Weaver information theory! Measures the information content of a message by comparison to the number of distinct messages that could exist in a given code e.g. one Roman letter resolves among 26 possibilities but not all possibilities are equally likely in English an E conveys less information than an X! Is code-dependent, media-dependent, structure-dependent is syntactic rather than semantic

The information content of a number! There are 100 2-digit numbers any one 2-digit number resolves among 100 possibilities! Consider the infinite series of digits starting 3.14159... resolves among an infinite number of possibilities but can be sent by sending one letter from the Greek alphabet provided the receiver knows the code the value of information depends on knowledge of codes

Towards a semantic theory of GI! Measuring the meaning conveyed by a message the increment to the receiver s knowledge in ways that are independent of media, syntax, structure! Accommodating the ability of GIS to transform information can easily mutate into other forms how do we know if the content of two data sets is the same?! Why GI?

Atoms of GI! GI is composed of atomic pairs of the form <x,z> compare Berry, Sinton, Plewe where x is a location in space-time of 2 to 4 dimensions using agreed methods for referring to times, and locations on the Earth s surface (latitude/longitude, WGS 84, GMT, ) methods that are shared between sender and receiver of GI (and are frequently universal)

The nature of z! A vector of properties using definitions that are already agreed between sender and receiver some such definitions are universal, e.g. Celsius some are not, e.g. vegetation cover type the value of an atom sent to a receiver who does not share the same definitions will be uncertain, and may be nil

Domains of GI! x! z limited to the Earth s surface and near-surface to the present, near-past, and near-future a rigid Newtonian frame mappable physical, social, environmental properties associated with locations

Continuity of x! Description is impossible because x is continuous and z is infinitely dimensioned we are saved by Tobler s Law all things are related but nearby things are more related than distant things <x+δx,z> = <x,z> for δx<λ an infinite number of pairs is not required hell is a place with no spatial dependence a potentially infinite number of properties exist, but in practice they are strongly correlated and only a finite number are needed for useful description

Consequences of Tobler s Law! The explicit atomic form is never needed atoms are inferred from larger structures using appropriate universal rules and transformations e.g. the boundary of California leads to an infinite number of pairs <x,z> where z is binary databases are built using larger structures as well as atoms

Six field representations! Representing a single property z! Irregularly spaced sample points a finite number of pairs <x,z> plus an interpolator, e.g. inverse-distance weighting, Kriging, splines, proximal/thiessen! Regularly spaced sample points a single tuple <G,O,z 1,z 2,,z n >where G defines georeferencing, O defines ordering

Irregular polygons! Tuples defining each polygon and its field value <x 1,y 1,x 2,y 2,,x m,y m,z>! Polygons do not overlap, and collectively exhaust the space every point x,y lies in exactly one polygon! Loss of detail justified by reference either to some λ (pixel size, MMU), or to knowledge of the properties of the phenomenon (land ownership parcels)

Discrete objects! Points are atomic! Lines and areas as tuples <x 1,y 1,x 2,y 2,,x m,y m,z>

Geographic information systems! Systems that combine GI with expertise to perform transformations and respond to queries! A geographic query a query to which GI provides the answer satisfied by access to one or more atoms e.g., What is the temperature at x? e.g., Where is the temperature equal to T?

Possession of GI! A GIS is said to possess an item of GI if it is capable of responding successfully to a query to which the item is the answer item = one or more atoms independent of format, structure, medium may imply transformations a message has no value if the information it contains is already possessed

Derivative queries and spatial analysis! What is the distance from A to B?! Requires <x 1,A> and <x 2,B>! Requires a rule for determining distance (a metric)! Within the capabilities of a GIS, but beyond those of a human?

Digital Earth! I believe we need a 'Digital Earth'. A multiresolution, three-dimensional representation of the planet, into which we can embed vast quantities of georeferenced data. U.S. Vice President Gore, 1/98! A single (distributed?) repository for all GI a complete description of the planet! A system that contained DE would be able to respond to all queries about Earth

Bit or it?! A DE and someone with access to the Earth would be equally successful at answering queries there is no query that could resolve whether Earth is real or digital two Chinese postmen a DE would contain sufficient information to reconstruct Earth sending a DE is equivalent to transporting the planet Siegfried, The Bit and the Pendulum

Naïve geography! Geocentric perspective: Newtonian frame, scientific measurement! Human-centric perspective: individual differences, perception, uncertainty proliferation of z e.g., multiple definitions of wetland the body of knowledge that people have about the surrounding geographic world (Egenhofer and Mark 1995)

Consistency with geometric principles! All points contained within the boundary of California are in California what if someone believes otherwise?! Santa Barbara is north of Los Angeles between 337.5 and 22.5 degrees potential violation of geometric principles! The rules, transformations on which GIS is based break down information is not necessarily reducible to atomic form queries are not necessarily answerable

Scale and spatial resolution! In practice the ability to locate precisely on the Earth s surface is limited there are not an infinite number of possible locations e.g., ROSE! Tobler s Law enables approximately complete description with a finite number of atoms

Quantity of information! A polygon describing the State of California enables an infinite number of queries of the form Is x in California? does the system possess an infinite amount of information?! Suppose location is knowable to an accuracy λ (a linear measure) there are only 4πR 2 /λ 2 distinct locations on the Earth s surface only that number of distinct queries can be answered

and in addition! If x 1 and x 2 are in California, then αx 1 +(1- α)x 2 is also probably in California and certainly so if California is convex! The system actually possesses the coordinates of a polygon, plus a universal rule the volume of information is bounded by the volume of the polygon definition

A semantic theory of GI! Atomic pairs link understood concepts x is universally understood z is understood by an information community that includes the receiver! The value of an atom of GI is related to the level of understanding on the part of the receiver of the concepts that it links linking a concept that is not understood is of no value

< Mt Everest,8850m>! Of no value to a receiver who does not recognize Mt Everest, the concept of height, or the metric system! Given <x, Mt Everest > the system can deduce <x,8850m> other pairs can be deduced from other prior knowledge! Understanding : the number of prior linkages to a concept the higher the understanding, the greater the value of a new linkage

x 8850

Unresolved issues! Partial resolution of uncertainty incomplete answers to queries what is the relative value of < Mt Everest,8848m±2>? what is the value of increased spatial resolution?! Naïve and inconsistent belief is it possible to build such a GIS?

Key points! GI in atomic form almost never exposed except for point data must be compressed in practice! Pairs linking already-understood concepts value depends on number of linkages and whether tuple is already possessed! Systems as combinations of information and expertise tuples and rules! Independent of media, structure, format