Geocoding and Business Intelligence By Dr. Ramon Barquin and Diogenes Torres

Similar documents
Techniques for Science Teachers: Using GIS in Science Classrooms.

An online data and consulting resource of THE UNIVERSITY OF TOLEDO THE JACK FORD URBAN AFFAIRS CENTER

Are You Maximizing The Value Of All Your Data?

GIS = Geographic Information Systems;

TRAITS to put you on the map

Understanding China Census Data with GIS By Shuming Bao and Susan Haynie China Data Center, University of Michigan

Spatial Organization of Data and Data Extraction from Maptitude

2010 Oracle Corporation 1

Big-Geo-Data EHR Infrastructure Development for On-Demand Analytics

CENSUS MAPPING WITH GIS IN NAMIBIA. BY Mrs. Ottilie Mwazi Central Bureau of Statistics Tel: October 2007

BROOKINGS May

GEOGRAPHIC INFORMATION SYSTEMS Session 8

Cartographic and Geospatial Futures

David Lanter PhD GISP. Information Security Risks and Controls of Public Geospatial Datasets July 17, 2014

GIS Monroe Geographic Information System March 14, 2018

GIS Geographical Information Systems. GIS Management

June 19 Huntsville, Alabama 1

Introduction to GIS. Dr. M.S. Ganesh Prasad

Applications: Introduction Task 1: Introduction to ArcCatalog Task 2: Introduction to ArcMap Challenge Question References

Your web browser (Safari 7) is out of date. For more security, comfort and. the best experience on this site: Update your browser Ignore

GIS Spatial Statistics for Public Opinion Survey Response Rates

Tools to Assess Local Health Needs. Richard Leadbeater, Esri NACo 2011 Healthy Counties Forum December 1, 2011

Spatial Statistical Information Services in KOSTAT

Canadian Board of Examiners for Professional Surveyors Core Syllabus Item C 5: GEOSPATIAL INFORMATION SYSTEMS

Climate Risk Visualization for Adaptation Planning and Emergency Response

Utilizing GIS Technology for Rockland County. Rockland County Planning Department Douglas Schuetz & Scott Lounsbury

INTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEM By Reshma H. Patil

ENV208/ENV508 Applied GIS. Week 1: What is GIS?

For the IT professional Information management solutions IBM Spatial Solutions

Crime Analysis. GIS Solutions for Intelligence-Led Policing

Exploit your geodata to enable smart cities and countries

Putting the U.S. Geospatial Services Industry On the Map

Advanced Algorithms for Geographic Information Systems CPSC 695

Gov 2.0. Encourage citizen engagement, deliver transparency, and foster collaboration with ArcGIS.

The econ Planning Suite: CPD Maps and the Con Plan in IDIS for Consortia Grantees Session 1

Map your way to deeper insights

BROADBAND DEMAND AGGREGATION: PLANNING BROADBAND IN RURAL NORTHERN CALIFORNIA

CHANGES IN ETHNIC GEOGRAPHY IN WATERBURY AS A RESULT OF NATURAL DISASTERS AND URBAN RENEWAL

ASSESSMENT. Industry Solutions Harness the Power of GIS for Property Assessment

STATE GEOGRAPHIC INFORMATION DATABASE

If you aren t familiar with Geographical Information Systems (GIS), you. GIS, when combined with a database that stores response information,

Geospatial Services in Special Libraries: A Needs Assessment Perspective

Purpose Study conducted to determine the needs of the health care workforce related to GIS use, incorporation and training.

compass.durhamnc.gov Building Community by Illustrating Community Durham s Neighborhood Compass

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

GIS Capability Maturity Assessment: How is Your Organization Doing?

Introduction to Google Mapping Tools

COMMUNITY SERVICE AREA

Powerful Combination: GIS and Web Services

geographic patterns and processes are captured and represented using computer technologies

a system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware,

What are the five components of a GIS? A typically GIS consists of five elements: - Hardware, Software, Data, People and Procedures (Work Flows)

BOWIE SENIORS COMPUTER CLUB

Local Government. VALOR GIS Specialist South Georgia RDC

The purpose of this report is to recommend a Geographic Information System (GIS) Strategy for the Town of Richmond Hill.

Speakers: Jeff Price, Federal Transit Administration Linda Young, Center for Neighborhood Technology Sofia Becker, Center for Neighborhood Technology

Cadcorp Introductory Paper I

The Underutilization of GIS & How to Cure It. Adam Carnow Esri

Growth of Urban Areas. Urban Hierarchies

John Laznik 273 Delaplane Ave Newark, DE (302)

Policy Paper Alabama Primary Care Service Areas

GIS-T 2010 Building a Successful Geospatial Data Sharing Framework: A Ohio DOT Success Story

Working with ArcGIS Online

Introducing GIS analysis

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity

UN-GGIM: User case studies ISRAEL

Construction in my Neighborhood - Communication with Stakeholder and Public -

IMPLEMENTING GOVERNMENT-WIDE ENTERPRISE GIS; THE FEDERATED MODEL

Second High Level Forum on GGIM Seminar on Regional Cooperation in Geospatial Information Management Doha, Qatar, 7 February 2013

UTILIZING GIS IN NEXT GENERATION Tennessee Information for Public Safety. Presented by State of TN, OIR-GIS Services

Bentley Map Advancing GIS for the World s Infrastructure

Demographic Data in ArcGIS. Harry J. Moore IV

GIS INTEGRATION OF DATA COLLECTED BY MOBILE GPSSIT

Excel Geomatics: Offering Excellence with Unique Consultancy & Services

Application of GIS in Public Transportation Case-study: Almada, Portugal

Washington Master Address Services: Project Overview Ben Vaught, OCIO David Wright, DOR Craig Erickson, DOH Tom Kimpel, OFM

Google Maps and Beyond

Building a National Data Repository

Place Syntax Tool (PST)

COPYRIGHTED MATERIAL. 1 Introduction

The GIS Path Forward Saskatchewan s Geomatics Strategic Plan

Introduction to Geographic Information Systems Dr. Arun K Saraf Department of Earth Sciences Indian Institute of Technology, Roorkee

GIS present situation in Japan

An Open Source Tool for the Visualization, Analysis and Reporting of Regional and Statewide Transit Networks

Rural Pennsylvania: Where Is It Anyway? A Compendium of the Definitions of Rural and Rationale for Their Use

Chapter : 1 : Introduction of GIS

Raising awareness about advantages of standardization of geographical names / Des avantages de la normalisation

Institute of Statistical and Geographical Information of Jalisco State Geospatial Technologies, platforms and services at a subnational level

Borah Peak Earthquake HAZUS Scenario Project Executive Summary Idaho Bureau of Homeland Security Idaho Geological Survey Western States Seismic

Exploring Digital Welfare data using GeoTools and Grids

GEOGRAPHIC INFORMATION SYSTEMS

Flood Hazard Zone Modeling for Regulation Development

Oregon Department of Transportation. Geographic Information Systems Strategic Plan

Excel Geomatics. Rajesh Paul Excel Geomatics Pvt. Ltd., Noida February, 2015 India Geospatial Forum, Hyderabad

Holistic Planning for Urban & Rural Health Care Infrastructure: A Case Study for a District in India

CS 350 A Computing Perspective on GIS

Selection of the Most Suitable Locations for Telecommunication Services in Khartoum

The Use of Geographic Information Systems (GIS) by Local Governments. Giving municipal decision-makers the power to make better decisions

How does ArcGIS Server integrate into an Enterprise Environment? Willy Lynch Mining Industry Specialist ESRI, Denver, Colorado USA

LEO Catchment Profile (LCP) Key Data for Enterprise Strategy

Transcription:

Geocoding and Business Intelligence By Dr. Ramon Barquin and Diogenes Torres

Geocoding and Business Intelligence By Dr. Ramon Barquin and Diogenes Torres Have you ever sensed the frustration of a TV detective who finds a scrap of paper with an incomplete address (i.e., missing city or state) that may just allow him to crack a case or prevent a new crime and pursues a number of false leads until finally getting to the right place and nabbing the bad guy? If so, you are just getting a glimpse of the type of gyrations and desperate maneuvers that real investigators in law enforcement or homeland security agencies go through as they try to extract business intelligence from geospatial data tied to crime prevention or the war on terrorism. The FBI, the Drug Enforcement Administration, the Secret Service, the Department of Homeland Security s Immigration and Customs Enforcement component are all frequently in this boat. But the need to mine business intelligence from geospatial data doesn t stop there. It is difficult to think of a single federal agency that does not at some point require an in-depth utilization of geographic information. The Department of Housing and Urban Development (HUD) often must know exactly where a specific property is located to determine whether it lies in an area that has been designated for special statutory treatment, such as Section 8 rent subsidies, which are defined through city-based fair market rates. Congressional assistance after major emergencies is usually distributed by naming counties as disaster areas, and the IRS will often provide tax breaks for damages arising from that emergency. The Small Business Administration has to determine whether a specific business is headquartered within the boundaries of a geographical area in order to establish whether they qualify for HUB Zone (impoverished location) status and are hence eligible for special treatment. The EPA provides for a special designation of areas where hazardous waste or other types of toxic substance have contaminated the soil or the water supply, entitling residents to certain federal benefits including tax deductions. But how do we know in any of these cases, whether a specific household, person or business, are eligible for the benefit? First, it has to be determined whether a location falls within the designated geography. That may be easy to determine if an address has a zip code that is clearly within, say, a well established core-based statistical area (CBSA), but if there are several hundred or thousands of addresses that need to verified for location and to determine whether they are within a specified area, it can get very dicey. This is where the importance of geocoding becomes clear. Geocoding is the process whereby, knowing an address (street, number, city, state), a set of geographic coordinates are assigned to said address thus marking, within a degree of precision determined by the instrumentation, its specific location. This marker is defined by a pair of numbers that provide its latitude and longitude (lat/long). And given that all geographies are defined by polygons with preset borders, having the lat/long of any address allows us to perform

the geometrical mathematics that let us establish whether that address falls inside or outside the geography of interest. But geocoding is not easily done. Determining the precise lat/longs of buildings, landmarks, houses, plants or power stations requires that someone go there with a transponder or GPS and capture the specific coordinates. Furthermore, what coordinates are we talking about? A building may have several corners that define its base or several other corners that define its rooftop. A power station may include a complex of several buildings, substations, parking spaces and open areas. And, because it is not likely that anyone will ever actually obtain the coordinates for every single structure in the country, there are established approaches to extrapolate the lat/longs of an address that is toward the middle of a street (center line) when all one has are the coordinates for the first and last house in a block. The dominant company dedicated to capturing geographical information, including latitudes and longitudes in the U.S. is Chicago-based NAVTEQ, and one of its principal competitors is Tele Atlas, a Dutch firm. Geographical information systems (GISs) have been around for several decades, but with the advent of services such as Google Maps and Google Earth and the proliferation of mobile computing and location aware devices, the need has intensified to develop platforms for producing geospatial business intelligence. And, as mentioned, governments in particular seem to have an insatiable appetite for these applications. In order to address these needs, agencies are waking to the importance of hosting enterprise-wide geocoding support operations with a variety of functions. A recent request for information (RFI) issued by a federal agency, for example, outlined the following requirements: Ad hoc street address matching to include validation, standardization and geocoding. Batch processing of street addresses to include validation, standardization, and geocoding. Address validation, certification, correction and other interpolative capabilities. Methods of determining and matching the location of landmarks, cities, towns and other commonly used geographic references (i.e., Gazetteer -level functionality). Proximal intelligence or supplemental detail on environs of geocoded results beyond simple latitude and longitude returns (i.e., demographics, political boundaries, census designation, congressional districts, etc.). Ability to load and manage custom taxonomies, data dictionaries and GIS data sets. Reverse geocoding capabilities to translate incoming coordinate data and dimensionalize that geo-location data to a local street address, nearest populated place, etc. Because there is no one single commercial off-the-shelf (COTS) tool that will meet all the classical geocoding requirements, most agencies will require a portfolio of several software tools

available in the market that address some of these requirements. Each has a specific range of coverage such as: geocoding, geographic entity extraction from structured and unstructured data, geographic visualization, Gazetteer capability, etc. In order to obtain a robust full-service geocoding platform, agencies are looking into creating what could be called a Location Expert and Geocoding Service (LEGS). In addition to the tools required, a LEGS will surely necessitate a certain amount of custom application development to extract and store information that is not readily available through these tools. A robust LEGS will consist of several components. The main ones are: Data input and output, Geocode databases, and Geocoding services and architecture. The principal services provided by a LEGS could be subsumed under the term address enrichment services and are: 1. Data quality, address standardization service 2. Geocoding service 3. Geospatial service 4. Mapping service 5. Demographic service The Department of Housing and Urban Development probably pioneered some of this 20 years ago by creating a Geocode Service Center (GSC) that standardizes and enriches close to 100 million addresses per year. While the HUD GSC functionality is somewhat unsophisticated by today s standards for a LEGS, other federal agencies are only now stepping up to the need to provide such enterprise-wide support. Geospatial business intelligence (GBI) is essential for governments in the 21 st century, and in order to do GBI well, there has to be a LEGS or its equivalent within every public sector enterprise. About the Authors Dr. Ramon Barquin is the President of Barquin International, a consulting firm, since 1994. He specializes in developing information systems strategies, particularly data warehousing, customer relationship management, business intelligence and knowledge management, for public

and private sector enterprises. He has consulted for the U.S. Military, many government agencies and international governments and corporations. Dr. Barquin is a member of the E-Gov (Electronic Government) Advisory Board, and chair of its knowledge management conference series; member of the Digital Government Institute Advisory Board; and has been the Program Chair for E-Government and Knowledge Management programs at the Brookings Institution. He was also the co-founder and first president of The Data Warehousing Institute, and president of the Computer Ethics Institute. His PhD is from MIT. Dr. Barquin can be reached at rbarquin@barquin.com. Diogenes Torres is Chief Data Warehousing Architect at Barquin International. A recognized expert in databases, data warehousing architecture and geospatial processing systems, he has taught the data warehousing consultants at Oracle, BearingPoint/KPMG, Deloitte & Touche and PricewaterhouseCoopers. He has been a member of the faculty at The Data Warehousing Institute, where he was a regular from the early conferences. He had long careers with NCR and DEC, and for the last ten years has been working with Barquin International. He designed the modernization of the HUD Geocoding Service Center and has consulted for suppliers of geocoding software. Other engagements in federal business intelligence projects include the design and implementation of HUD s Multifamily Housing data warehouse, architecting the Advanced Query Facility for Census 2000 and serving as lead architect in several Department of Homeland Security data warehousing efforts.