Design and Development of Linked Data from The National Map. E. Lynn Usery and Dalia Varanka.

Similar documents
Geospatial Semantics for Topographic Data

Applied Cartography and Introduction to GIS GEOG 2017 EL. Lecture-2 Chapters 3 and 4

An Introduction to Geographic Information System

GeomRDF : A Geodata Converter with a Fine-Grained Structured Representation of Geometry in the Web

Introduction to GIS I

GIS-based Smart Campus System using 3D Modeling

USING GIS CARTOGRAPHIC MODELING TO ANALYSIS SPATIAL DISTRIBUTION OF LANDSLIDE SENSITIVE AREAS IN YANGMINGSHAN NATIONAL PARK, TAIWAN

COMPLEX TOPOGRAPHIC FEATURE ONTOLOGY PATTERNS

a system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware,

Geography 38/42:376 GIS II. Topic 1: Spatial Data Representation and an Introduction to Geodatabases. The Nature of Geographic Data

4. GIS Implementation of the TxDOT Hydrology Extensions

USGS Hydrography Overview. May 9, 2018

FUNDAMENTALS OF GEOINFORMATICS PART-II (CLASS: FYBSc SEM- II)

Introduction to Geographic Information Systems (GIS): Environmental Science Focus

Key Words: geospatial ontologies, formal concept analysis, semantic integration, multi-scale, multi-context.

Introduction-Overview. Why use a GIS? What can a GIS do? Spatial (coordinate) data model Relational (tabular) data model

Introduction INTRODUCTION TO GIS GIS - GIS GIS 1/12/2015. New York Association of Professional Land Surveyors January 22, 2015

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

Globally Estimating the Population Characteristics of Small Geographic Areas. Tom Fitzwater

Geographic Information Systems (GIS) in Environmental Studies ENVS Winter 2003 Session III

ISU GIS CENTER S ARCSDE USER'S GUIDE AND DATA CATALOG

Designing a Dam for Blockhouse Ranch. Haley Born

Lecture 1: Geospatial Data Models

What s New in Topographic Information - USGS National Map

Taxonomies of Building Objects towards Topographic and Thematic Geo-Ontologies

These modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS.

NR402 GIS Applications in Natural Resources

Welcome to NR502 GIS Applications in Natural Resources. You can take this course for 1 or 2 credits. There is also an option for 3 credits.

A Temporal Hydrologic Database for Rapidly Changing Landscapes

Digital Map of Mexico Platform and MxSIG. March 2017

PROANA A USEFUL SOFTWARE FOR TERRAIN ANALYSIS AND GEOENVIRONMENTAL APPLICATIONS STUDY CASE ON THE GEODYNAMIC EVOLUTION OF ARGOLIS PENINSULA, GREECE.

Spanish national plan for land observation: new collaborative production system in Europe

Georelational Vector Data Model

A General Framework for Conflation

GRAPEVINE LAKE MODELING & WATERSHED CHARACTERISTICS

Lecture 11. Data Standards and Quality & New Developments in GIS

GIS in Weather and Society

TRANSFORMATION THROUGH CLC WITH THE CONTINUOUS RESEARCH TECHNIQUES - GIS (OPEN CODE) AND RS (GEO-WEB SERVICES)

Popular Mechanics, 1954

INTRODUCTION TO GIS. Dr. Ori Gudes

Designing GIS Databases to Support Mapping and Map Production Charlie Frye, ESRI Redlands Aileen Buckley, ESRI Redlands

INTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEM By Reshma H. Patil

)UDQFR54XHQWLQ(DQG'tD]'HOJDGR&

Web Visualization of Geo-Spatial Data using SVG and VRML/X3D

What are the five components of a GIS? A typically GIS consists of five elements: - Hardware, Software, Data, People and Procedures (Work Flows)

The production and use of a hydrographic flow-direction network of surface waters. Rickard HALLENGREN, Håkan OLSSON and Erik SISELL, Sweden

The National Hydrography Dataset in the Pacific Region. U.S. Department of the Interior U.S. Geological Survey

The Architecture of the Georgia Basin Digital Library: Using geoscientific knowledge in sustainable development

On-demand mapping and integration of thematic data

Generalized map production: Italian experiences

SRJC Applied Technology 54A Introduction to GIS

Overview key concepts and terms (based on the textbook Chang 2006 and the practical manual)

POSSIBILITY FOR APPLICATION OF GIS TECHNOLOGIES IN RTB BOR GROUP

Utilization and Provision of Geographical Name Information on the Basic Map of Japan*

What is GIS? Introduction to data. Introduction to data modeling

GeoWEPP Tutorial Appendix

Louisiana Transportation Engineering Conference. Monday, February 12, 2007

Lecture 9: Reference Maps & Aerial Photography

Available online at Analele Stiintifice ale Universitatii Al. I. Cuza din Iasi Seria Geologie 58 (1) (2012) 53 58

Publishing Value Added EO Products as Linked Data

Basics of GIS. by Basudeb Bhatta. Computer Aided Design Centre Department of Computer Science and Engineering Jadavpur University

Terms GIS GPS Vector Data Model Raster Data Model Feature Attribute Table Point Line Polygon Pixel RGB Overlay Function

software, just as word processors or databases are. GIS was originally developed and cartographic capabilities have been augmented by analysis tools.

GIS Changing the World GIS Day November 15, 2017

CS 350 A Computing Perspective on GIS

Land Accounts - The Canadian Experience

Display data in a map-like format so that geographic patterns and interrelationships are visible

Imagery and the Location-enabled Platform in State and Local Government

Chapter 6. Fundamentals of GIS-Based Data Analysis for Decision Support. Table 6.1. Spatial Data Transformations by Geospatial Data Types

Chapter 5. GIS The Global Information System

CARTOGRAPHIC INFORMATION MANAGEMENT IN COLOMBIA REACH A LEVEL OF PERFECTION

Multiscale Representations of Water: Tailoring Generalization Sequences to Specific Physiographic Regimes

FINNISH LINKED DATA PILOTS

Statewide Topographic Mapping Program

Lecture 12. Data Standards and Quality & New Developments in GIS

Volcanic Hazards of Mt Shasta

ISO Canadian Drought Monitor Data Product Specifications

Geo-spatial Analysis for Prediction of River Floods

Hydrology and Floodplain Analysis, Chapter 10

Digitization in a Census

RHOAPS. Real-time Hydrology Ocean Atmosphere Prediction System. Pronunciation: Ropes Motto: More than just THREDDS

Geographic Analysis of Linguistically Encoded Movement Patterns A Contextualized Perspective

Watershed Application of WEPP and Geospatial Interfaces. Dennis C. Flanagan

ENV208/ENV508 Applied GIS. Week 1: What is GIS?

Place Still Matters: Generalizing the NHD by Local Terrain and Climate

Geodatabase An Overview

Lecture 6 - Raster Data Model & GIS File Organization

ISO INTERNATIONAL STANDARD. Geographic information Metadata Part 2: Extensions for imagery and gridded data

Reprint of article that appeared in Hydro INTERNATIONAL November 2006 Volume 10 Issue 9

IMAGE CLASSIFICATION TOOL FOR LAND USE / LAND COVER ANALYSIS: A COMPARATIVE STUDY OF MAXIMUM LIKELIHOOD AND MINIMUM DISTANCE METHOD

5A.10 A GEOSPATIAL DATABASE AND CLIMATOLOGY OF SEVERE WEATHER DATA

GIS for ChEs Introduction to Geographic Information Systems

GEOMATICS. Shaping our world. A company of

GIS = Geographic Information Systems;

YYT-C3002 Application Programming in Engineering GIS I. Anas Altartouri Otaniemi

AUTOMATIC EXTRACTION OF ALUVIAL FANS FROM ASTER L1 SATELLITE DATA AND A DIGITAL ELEVATION MODEL USING OBJECT-ORIENTED IMAGE ANALYSIS

GIS (GEOGRAPHIC INFORMATION SYSTEMS)

Creation of an Internet Based Indiana Water Quality Atlas (IWQA)

GeoQuery Tool to Pose Spatial Queries over the Web

Introduction to GIS. Dr. M.S. Ganesh Prasad

Transcription:

Design and Development of Linked Data from The National Map E. Lynn Usery and Dalia Varanka usery@usgs.gov; dvaranka@usgs.gov Abstract The development of linked data on the World-Wide Web provides the opportunity for the U.S. Geological Survey (USGS) to supply its extensive volumes of geospatial data, information, and knowledge in a machine interpretable form and reach users and applications that heretofore have been unavailable. To pilot a process to take advantage of this opportunity, the USGS has selected data from The National Map for nine research test areas and provided these data in the Semantic Web format of Resource Description Framework (RDF) triples to support machine processing and linked data access. The provision of geospatial data on the linked data of the Web is problematic from several perspectives and the USGS is developing solutions to these problems. Specifically, the handling of coordinates for geospatial data in vector format and the identification of geospatial entities and objects in geospatial raster data and the handling of raster geometry (pixels) in a linked data format have proved difficult. It is the purpose of this paper to discuss the USGS approach to developing linked data for both vector and raster data from The National Map databases. 1.0 Introduction The USGS is a primary supplier of geospatial and environmental datasets that are used extensively in mapping, planning, resource and land management, emergency response, and many other applications. A sampling of these data is presented in Table 1 with descriptions. All data are public domain and available on the Web. Use of these data often require combining one or more of these datasets or combining these data with user-generated data. Since the data exist in many different formats, some of which are proprietary, the integration or conflation of the data for use in a specific application requires significant data processing and manipulation by the user. The Semantic Web offers an alternative approach to data formatting, access, and integration for use in applications (W3C, 2011). By use of the standard triple model of the Resource Description Framework (RDF) of the Semantic Web (W3C, 2004), applications are able to link to other data and to use and share data effectively to answer queries and support specific applications (Heath and Bizer, 2011). Thus, the USGS has begun exploring the potential of the Semantic Web, particularly for geospatial data access, integration, synthesis, and use in applications. It is the purpose of this paper to describe the USGS approach to providing geospatial data on the Semantic Web. The remainder of the paper is organized as follows. Section 2 describes the general approach to conversion of data. Section 3 introduces the ontology for The National Map and section 4 details the conversion process for point, vector, and raster

data. Section 5 describes the access, download, and query of the converted data with conclusions presented in Section 6. 2.0 Data Conversion Approach The USGS approach to using the Semantic Web is to convert specific datasets from The National Map to RDF and make these data available for download and/or direct query in the RDF format. As a pilot project, the USGS selected nine test areas based on specific geographic characteristics, extracted all data for the eight layers of The National Map for these areas, and converted the vector and point data to RDF maintaining the coordinates in the Geography Markup Language (GML) based on the Open Geospatial Consortium (OGC) standard (Portele, 2007). The nine research test areas include six watershed sub-basin areas defined from the National Hydrography Dataset (NHD) that reflect differing combinations of physiography and climate (Figure 1). In addition to the watershed areas, the sites include three urban areas, Atlanta, Georgia; St. Louis, Missouri; and New Haven, Connecticut, included as an urban coastal site. Table 1 Sample Datasets Managed by the U.S. Geological Survey Data Type Vector Raster Satellite images Hazards Minerals Energy Landscapes and coasts Astrogeology Description Nationwide, multi-resolution, multitemporal National Hydrography Dataset medium resolution (1:100,000-scale source), high resolution (1:24,000-scale source), and some local (1:4,800-scale source) National Transportation Dataset from Census National Boundaries Dataset National Structures Dataset Geographic Names Information System Nationwide, some worldwide, multiresolution, multi-temporal National Elevation Dataset National Digital Orthophotos National Land Cover Dataset Global Landcover dataset Lidar Many different remote sensing platforms Earthquakes, volcanoes

Geologic Map Database Geologic data Water Data National Water Information System Floods and High Flow Drought Monthly Streamflow Ground Water Water Quality National Biological Information Infrastructure Vegetation characterization Wildlife Invasive Species Digital Data Series 1.5 million sites in United States for Real-time data Surface water Ground water Water quality Figure 1. Location of USGS research datasets for developing ontology and semantics for The National Map.

Each of these test areas includes the eight standard layers of The National Map, land cover, structures, boundaries, hydrography, geographic names, transportation, elevation, and orthoimagery (Figure 2). Figure 2. Nationwide data layers of The National Map The boundary, hydrography, and transportation layers use vector geometry with point, line, and area objects as the basic data elements. The structures and geographic names layers use point objects as the geometric base of the data elements. The land cover, elevation, and orthoimage layers use raster geometry with pixels or cells as the basic geometric unit. Objects in the raster layers must be defined and referenced over the cell geometry for access and manipulation. Conversion of the sample site datasets to RDF has followed the general approach of defining the subject, predicate, and object of RDF as the feature identifier, feature name or other attribute or relation, and feature instance or object of the relation, respectively. Thus, it is the feature name or other attribute or relation that connects the feature identifier with the actual instance or other object of the feature in the data. A requirement for the conversion is to maintain the ability to generate a graphic for any query result. Thus, the coordinates must be associated with the RDF triple. This association is done through GML and allows access and use by any traditional program that can process GML. A SPARQL Protocol and RDF Query Language (SPARQL) query of the RDF data can retrieve the needed result and the final output can be used to generate a map

from the GML coordinate store as needed. All GML entities and operations used in the data conversion and semantic queries follow the OGC standard for GML (OGC, 2011). 3.0 Ontology for The National Map The conversion and use of USGS geospatial data to the Semantic Web format requires an ontology of the basic entities and associated attributes and relationships. An ontology has been developed using the basic topographic information available from the previous USGS specifications and standards for creating topographic maps (Varanka, 2009). The ontology development combines a top-down approach based on standards and general scientific knowledge and bottom-up approaches from data collections. The vocabulary for topographic features, to be represented as triple subjects and/or objects, was developed from standard feature list sources that were derived from decades of topographic feature data collection (U.S. Geological Survey 2010; U.S. Board on Geographic Names 2010). Features are classified into six taxonomic modules; terrain, surface water, ecological regime, structures, divisions, and events, which reflect topographic science modeling and the thematic layers of The National Map. The classification was guided by geographic knowledge with regard to regional context, morphology as natural or engineered structures, and descriptive attributes, such as shape or texture (fluid vs. frozen), in accordance with empirical experience and scientific concepts. Features whose currency or relevance was unclear, such as drive-in theater or demilitarized zone were eliminated from these lists. Features that have become common since the development of the standards, such as windfarm, were added as new vocabulary without the full development and review of a new standard. The digital files form a vocabulary and consist of feature type classes under the taxonomic module domain. Each class has a definition and the definition source from on-line documentation. The hierarchy is flat (U.S. Geological Survey 2011). The semantic meaning of topography combines natural and built-up (human-constructed) features. Human impacts and experience of landscape change have led to increased understanding of landscape complexity. Complex features involve spatial relations among their basic components and as parts of landscape systems. These relations are functional, such as the relation between an airport runway and control tower; these connections form a part of the complex feature identity. Complex features are particularly common in the largest group of topographic features in the USGS vocabulary, built-up structures. In these cases, the base vocabulary allows relating simple classes into complexes for ontology design patterns (ODP). In addition to feature relations of location, such as coordinate pairs or geometric distances between features, spatial relation terms for the ontology development are generally drawn from a set of Open Geospatial Consortium (OGC) standards and verb/preposition pairs identified from the topographic feature type standards. Spatial relations are a part of many feature type semantics and subsequent definitions; for example, a tributary is a stream that flows to a larger stream. In

such cases, the appropriate spatial relation can be modeled with mereo-topological relations, such as connects or network part and logical concepts, such as the Web Ontology Language (OWL) FunctionalProperty relation (Casti and Varzi 1999; Smith et al., 2009). The logical axioms to be applied to the topographic triples are the W3C standards and functionalities offered by specific software platforms. Spatial relations are often considered to form a separate predicate between feature subjects and objects of triples. The automatic creation of predicate relations results from the conversion of relational to triple data forms. Not all of these are spatial, but most include significant attribute properties. To match the converted data to the ontology classes, spatial prepositions are being researched in which the geospatial codes resemble natural language semantics. Because prepositions reflect geometric cognition, the hypothesis is that linguistic terms reflect geometric data operations. 4.0 Data conversion To make USGS data available to the Semantic Web and the Linked Open Data Community, the USGS converted data for the nine research test areas to RDF and GML. The required conversion processes and structure of the resulting data with access to the original geometry are different based on the original geometry of the geographic data sources. The following discussion is separated into point, vector, and raster to describe the different processes required for conversion. 4.1 Point Data The point datasets for The National Map include geographic names and structures. Whereas structures data in The National Map will eventually be generated using the polygonal boundary for the structure outline, currently available data use a single point at the proximate center of the building or other structure. Thus, at present structures are converted to RDF using a point geometry model. The basic conversion for the point data proceeded as follows. Point data for The National Map are stored in Esri geodatabase or shape file formats (Esri, 2011). These files are used to create GML documents to store the geometric data. The output of the conversion process is written to an N3 document, a shorthand RDF format (Berners-Lee, 2005). Complete description of this process including conversion from geodatabase, personal geodatabase, and shape files to GML and then to N3 is presented in Bulen et al. (2011). Each point feature in the Geographic Names Information System (GNIS) is structured as a name associated with a location. The conversion of this structure to RDF triples uses the simple convention that the feature identifier is connected to the actual instance (coordinate location) by the feature name. The same process is used for structures for which a query example of the resulting RDF is presented in Figure 3. Figure 3 also presents the result in GML including the coordinates for the structure location.

Figure 3. Query text for a structure feature from RDF data of a sample from the National Map. The resulting GML from the query is shown in the bottom of the figure. 4.2 Vector Data The conversion of vector formatted geospatial data for boundaries, hydrography, and transportation for the test sites to the linked data format of the Semantic Web proceeded with the following general approach. The subject, predicate, object format of RDF for the semantic web was constructed from the entities as defined in formats of The National Map. For example, for a stream in the National Hydrography Dataset (NHD) of The National Map, flowline is the primary feature of the stream reach that provides connections of the hydrographic network. The

subject is the feature identifier, in the case of a flowline it is the reach code as defined in NHD (fid:_77127453 in Figure 4). The predicate is the particular property of the flowline being modeled in the triple, its length, for example (http://cegis.usgs.gov/rdf/geometry#length in Figure 4). The objects are many and depend on the predicate. For example, the object of the predicate geometry#length is a literal number; the object of geometry#intersects is another flowline. The object of geometry#gml are the coordinates of the flowline. Figure 4 shows a query and the detailed set of flowline characteristics that are the distinct properties or predicates of the flowline. As shown in the figure, each subject (reach code identifier) has many distinct predicates and objects associated with it to capture the stream characteristics. As with the point data, the geometry of the flowline is represented by coordinate stores in GML. An example of a query is shown in Figure 5 with the resulting GML. Note that in Figure 5, the coordinates in the GML are truncated for space considerations on the printed page. query PREFIX qgis: <http://cegis.usgs.gov/rdf/geometry#> PREFIX fid: <http://cegis.usgs.gov/rdf/nhd/featureid#> PREFIX road: <http://cegis.usgs.gov/rdf/trans/featureid#> PREFIX nhd: <http://cegis.usgs.gov/rdf/nhd#> PREFIX trans: <http://cegis.usgs.gov/rdf/trans#> select distinct?predicate where { fid:_77127453?predicate [] } predicate http://www.w3.org/1999/02/22-rdf-syntax-ns#type http://cegis.usgs.gov/rdf/geometry#area http://cegis.usgs.gov/rdf/geometry#centroid http://cegis.usgs.gov/rdf/geometry#dimension http://cegis.usgs.gov/rdf/geometry#geometrytype http://cegis.usgs.gov/rdf/geometry#gml http://cegis.usgs.gov/rdf/geometry#intersects http://cegis.usgs.gov/rdf/geometry#isempty http://cegis.usgs.gov/rdf/geometry#issimple http://cegis.usgs.gov/rdf/geometry#isvalid

http://cegis.usgs.gov/rdf/geometry#length http://cegis.usgs.gov/rdf/geometry#numgeometries http://cegis.usgs.gov/rdf/geometry#numpoints http://cegis.usgs.gov/rdf/geometry#touches http://cegis.usgs.gov/rdf/geometry#within http://cegis.usgs.gov/rdf/nhd#comid http://cegis.usgs.gov/rdf/nhd#fcode http://cegis.usgs.gov/rdf/nhd#fdate http://cegis.usgs.gov/rdf/nhd#ftype http://cegis.usgs.gov/rdf/nhd#geometryproperty http://cegis.usgs.gov/rdf/nhd#permanentidentifier http://cegis.usgs.gov/rdf/nhd#resolution http://cegis.usgs.gov/rdf/nhd#shapelength http://cegis.usgs.gov/rdf/nhd#duuid http://cegis.usgs.gov/rdf/nhd#metaprocess http://cegis.usgs.gov/rdf/nhd#permanentid http://cegis.usgs.gov/rdf/nhd#flowsto http://cegis.usgs.gov/rdf/nhd#enabled http://cegis.usgs.gov/rdf/nhd#flowdir http://cegis.usgs.gov/rdf/nhd#lengthkm http://cegis.usgs.gov/rdf/nhd#reachcode http://cegis.usgs.gov/rdf/nhd#wbareacomid http://cegis.usgs.gov/rdf/geometry#crosses http://cegis.usgs.gov/rdf/geometry#covers Figure 4. Query for stream flowline from RDF data and the predicates of the flowline for a sample from the National Hydrography Dataset of The National Map.

Figure 5. An example query with the resulting GML. Note that the coordinate string is truncated on line 5 for because of space limitations in printing. 4.3 Raster Data Query and access to raster data on the Semantic Web poses unique problems since ontological objects are not defined in the structure of the data which is a grid of pixel values or digital numbers. Traditional processing of raster data has treated the entire raster grid as a coverage, as in Web Coverage Services, or provided procedures to extract vector objects from the raster matrix. Raster data conversion to RDF requires definition of objects, and although a significant literature exists on image segmentation and object extraction from raster image data (see standard texts on remote sensing and image processing, such as Lillesand et al., 2005), there has been little work on ontology and semantics with raster geometry. In general the approach to this problem is to first develop vector objects from image segmentation then use existing methods for building ontology and semantics for the vector objects. However, Liu et al. (2004) have proposed methods to extend the Geographic Structured Query language (GSQL) to support raster data. By defining specific abstract data types (ADTs), such as Pixel, Raster Region, and RasterCoverage

and formalizing data objects and operations on these ADTs, GSQL has been extended to query raster objects. Quintero et al. (2009) alos provide an approach to raster data semantics. Their approach is in three stages requiring conceptualization, synthesis, and description of objects in the raster data. Neither of these approaches are directly implemented for the Semantic Web and neither uses an RDF structure for the raster objects. The raster data layers in The National Map are land cover, elevation, and orthographic images (see Figure 1). Unlike other approaches that extract the semantic objects from the raster data, the approach is to determine relevant objects and maintain the raster matrix as the geometric basis of the geographic features of interest. This is essential since a user may want to see a source map or image of the features in concert with a query result or with other data. The steps involved in the conversion first require that the features be identified in the raster source and a pixel or set of pixels selected as the basic geometric footprint for the feature (Usery, 1996). This identification results in a single pixel for features that can be treated as point features at the resolution of the raster data. An example is well or spring. A linear set of pixels can be used to represent line types of features, such as roads or rivers, based on size of the feature and resolution of the data. Features that span areas require contiguous groups of pixels or in some cases non-contiguous groups of pixels to be identified (Usery, 1994a; 1994b). The identification step must be followed by an identification of the relations of the specified feature to other neighboring features leading to the development of an ODP as is done with vector data (Varanka, 2010). Once the features and relations, as specified in the ontology, are identified, the feature is matched to an existing ODP. If no pattern in the repository provides an adequate match, a new ODP is developed and added to the repository. The newly defined ODP is linked to the feature as are the geometric pixel patterns and relationships. At this point an RDF structure can be created for the feature. Similar to the representation of point and vector data in RDF above, the conversion of the feature and relations to RDF is performed and the raster geometry, pixel, linear set of pixels, or pixel aggregation is structured in GML, using the GML coverage. To define the gml:grid element, a minimum bounding rectangle (MBR) is used for the feature since at this point GML does not allow storage of pixels in other than a rectangular fashion. Eventually, storage of the exact set of pixels that represent a line of polygon will be stored, but currently to remain within the GML standard, only the MBR is used. 5.0 Access to USGS RDF Data for Research Test Sites of The National Map To provide access to the research test data that the USGS has converted to RDF format, the USGS established a computer server accessible to the public (http://usgs-ybother.srv.mst.edu). On this server users external to the USGS Intranet can access and download the data in the original Esri and image formats (Geodatabase, shapefile, TIFF) of The National Map or in RDF. The USGS has also established a SPARQL Endpoint at http://usgs-ybother.srv.mst.edu:8890/sparql that allows direct query of the data using SPARQL. At present the USGS is conducting a testbed application of these data with the proposed GeoSPARQL standard of OGC (Perry and Herring 2011). To illustrate the use of the SPARQL Endpoint, the USGS implemented the relations standardized by OGC from the 9-intersection model (Egenhofer and Franzosa, 1991). An example relation using the SPARQL Endpoint and the converted data is shown below. The relation is touches and

the query is For a given feature find all other features that touch the given feature. (Figure 6). Placing the query in the geographic space of data from The National Map, it can be phrased about a specific feature: Find all the tributaries of Hunter Creek. The result as a series of URIs is presented in Figure 7 and when the coordinates from GML are placed on a background map, the graphic in Figure 8 is the result. Figure 6. Initial screen accessed on the USGS SPARQL Endpoint with example query using relation touches.

Figure 7. The set of URIs that result from the query in Figure 7. It should be noted that the relationships for this dataset were determined at the time of conversion and implemented to generate coordinates in GML. A true geographic query remains to be implemented with the GeoSPARQL query language for which USGS is a test site with the sample data described in this paper.

Figure 8. Graphical result of the query in Figure 6. Hunter Creek is shown in red and its tributaries are shown in blue. The background image is a standard USGS Digital Raster Graphic for the quadrangle that includes Hunter Creek, Colorado. 6.0 Conclusions The USGS is researching the capabilities of the Semantic Web for supporting query and analysis of geographic data from The National Map. As a part of that research, point and vector data for 9 research test areas have been converted to RDF and made available to the public. A vocabulary of topographic terms has been developed to form the basis for ontology for The National Map.

To support user interaction with the converted data, the USGS provides access for download of the research test data in original formats of the The National Map, RDF formatted data, and a SPARQL Endpoint for direct query of the data. The USGS is participating with these data in testing the evolving GeoSPARQL standard and providing methods for users to semantically interact with the data. Raster data poses unique problems for representation on the Semantic Web since objects representing actual entities in the real world appear as digital number values for pixels in the image. Constructing object representations requires maintaining the pixel geometry while developing the complete set of attributes and relationships that comprise the ontology for the entities. Approaches to date have relied on conversion from raster to vector geometry thus losing the original geometric source of the data. The USGS approach is to maintain the pixel structure of the entity from the raster image and build ontology from ODP. 7.0 References Berners-Lee, T., 2005, Primer: Getting into RDF & Semantic Web using N3: v. 1.61, World Wide Web Consortium, accessed March 16, 2011, at http://www.w3.org/2000/10/swap/primer. Bulen, A., Carter, J., and Varanka, D., 2011. A Program For The Conversion of The National Map Data From Proprietary Format to Resource Description Framework (RDF), USGS Open File Report, U.S. Geological Survey, Reston, VA. Casati, R. and Varzi, A. 1999. Parts and Places, The Structures of Spatial Representation. Cambridge, Mass. The MIT Press. Egenhofer, M.F. and Franzosa, R.D., Point Set Topological Spatial Relations, International Journal of Geographical Information Systems, vol 5, no 2, 161-174, 1991. ESRI, 2011, Geodatabases. ESRI, accessed March 30, 2011, at http://resources.arcgis.com/content/geodatabases/10.0/about. Heath, T. and Bizer, C. (2011) Linked Data: Evolving the Web into a Global Data Space (1st edition). Synthesis Lectures on the Semantic Web: Theory and Technology, 1:1, 1-136. Morgan & Claypool accessed May 31, 2011 at http://linkeddatabook.com/editions/1.0/. Lillesand, T.M., Kiefer, R.W., and Chipman, J., 2007. Remote Sensing and Image Interpretation, Sixth Edition, John Wiley and Sons, New York, Liu, Y. Lin, Y., Qin, S., Zhang, Y., Wu, L., 2005. Research on GSQL Estension Supporting Raster Data, Journal of Image and Graphics, accessed May 31, 2011, at http://en.cnki.com.cn/article_en/cjfdtotal-zgtb20050100j.htm

OGC, 2010, Open Geospatial Consortium, Inc.: accessed March 25, 2010 at URL: http://www.opengeospatial.org/. OGC, 2011. Geography Markup Language (GML) Encoding Standard, Open Geospatial Consortium, Inc., accessed May 31, 2-011 at http://www.opengeospatial.org/standards/gml Perry, M., and Herring, J., eds. 2011. GeoSPARQL A Geographic Query Language for RDF Data. Open Geospatial Consortium Inc., project document reference number OGC 09-157- r1. Portele, C., 2007, OpenGIS Geography Markup Language (GML) Encoding Standard, v. 3.2.1: Open Geospatial Consortium, Inc., OGC 07-036, accessed March 30, 2011, at http://www.opengeospatial.org/standards/gml. Quintero, R., Torres, M., Moreno, M., and Guzman, G. 2009. Towards A Semantic Representation of Raster Spatial Data. GeoSpatial Semantics, Third International Conference, GeoS 2009, Mexico City, Mexico, December 2009 Proceedings. Springer, LNCS 5892 P. 63 82. Smith, M.K., Welty, C. and McGuinness, D.L. 2009. OWL Web Ontology Language. W3C Recommendation 10 February 2004. http://www.w3.org/tr/2004/rec-owl-guide- 20040210/#FunctionalProperty. U.S. Board on Geographic Names, 2010, Geographic Names Information System: accessed March 25, 2010 at URL: http://geonames.usgs.gov/domestic/index.html. Usery, E.L., 1994a. "Implementation Constructs for Raster Features," Proceedings, American Society for Photogrammetry and Remote Sensing Annual Convention, Reno, Nevada, pp. 661-670. Usery, E.L., 1994b. "Display of Geographic Features from Multiple Image and Map Databases," Proceedings, International Society for Photogrammetry and Remote Sensing, Commission IV Symposium on Mapping and Geographic Information Systems, International Archives of Photogrammetry, Volume XXX, Part B4, Athens, G, pp. 1-9. Usery, E.L., 1996. "A Feature-Based Geographic Information System Model," Photogrammetric Engineering and Remote Sensing, v. 62, no. 7, p. 833-838. U.S. Geological Survey, 2010, National Geospatial Program Standards, accessed March 23, 2010 at URL http://nationalmap.gov/gio/standards/. U.S. Geological Survey, 2011, Six Domain Modules for Ontology Development. U.S. Geological Survey. http://cegis.usgs.gov/ontology.html. Varanka, D., 2009. A Topographic Feature Taxonomy for a U.S. National Topographic Mapping Ontology, Proceedings, International Cartographic Conference, Santiago, Chile, CD- ROM publication.

Varanka, D., 2011. Ontology Patterns for Complex Topographic Feature Types, Cartography and Geographic Information Science, v. 38, no. 2, in press. W3C 2004. Resource Description Framework (RDF), accessed May 31, 2011 at http://www.w3.org/rdf/. W3C 2011. Semantic Web, accessed May 31, 2011 at http://www.w3.org/standards/semanticweb/.