Geocoding of the Statistics Portugal Business Register and its integration with the INSPIRE's Annex III Buildings theme. Barcelona, 26th September 2016. INE/DMSI-GEO» 1
Summary: Business Register (BR): enterprise and sampling frame; Spatial Data Infrastructure (SDI): Buildings Geographic Database (BGE), National Dwellings Register (FNA); Building the BR Geographic Database (BRGD): geoprocessing methodology; Joining BRGD and BGE/FNA: INSPIRE data specifications» 2
BR Coverage: Enterprises are identified by a unique national number. All NACE Rev. 2 sections are included. All institutional sectors according to the European System of Accounts are included. Micro, small and medium-sized enterprises are covered (see note I1)» 3
Note I1 (slide 3): European Commission Recommendation 2003/361/EC. INE, 22/10/2009
BR Coverage (diagram): Legal Units; Enterprise; Enterprise group; Companies; Individuals; Public Administration; Non-Profit Institutions; Local units» 4
BR Architecture (diagram), the management system of population and samples: Updating process; BR online access; BR Data Warehouse; Data Collection System; Quality Control» 5
BR Population Frame of surveys: Each year, a new structural population frame is built. All active enterprises are included. All enterprises that ceased activity in the reference period are included. All institutional sectors are included. The sampling frame is obtained from this structural population frame» 6
BR Updating Process, Administrative Sources: Ministry of Justice (National Registry of Legal Persons); Ministry of Finance (Corporate Tax, Income Tax); SICAE system (a partnership information system on the NACE codes of legal units, in which Statistics Portugal is one of three partners, alongside the Tax Authority and the Institute of Registration and Notary Affairs)» 7
BR Updating Process: Each variable of the BR is linked to different information sources, ranked by degree of importance. For each variable, a cross-check compares the information provided by the source with that contained in the BR. For the economic variables, a growth rate is calculated and an acceptable range is established; values that fall outside this range are subject to further analysis» 8
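The growth-rate check described above could be sketched as follows. This is only an illustration: the function names and the default range (-50% to +100%) are assumptions made for the example, not the thresholds used by Statistics Portugal.

```python
def growth_rate(previous, current):
    """Relative change from the BR value to the source value; None if the BR value is zero."""
    if previous == 0:
        return None
    return (current - previous) / previous


def needs_review(previous, current, lower=-0.5, upper=1.0):
    """Flag a value for further analysis when its growth rate is outside [lower, upper].

    The default bounds are hypothetical placeholders.
    """
    rate = growth_rate(previous, current)
    if rate is None:
        # No meaningful rate can be computed, so leave the decision to an analyst.
        return True
    return not (lower <= rate <= upper)


# Example: turnover recorded in the BR versus turnover reported by a source.
flagged = needs_review(previous=100_000, current=300_000)
```

In practice the acceptable range would presumably differ per variable, which is why the bounds are parameters rather than constants.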
Spatial Data Infrastructure (SDI) components: Human Resources; Standards; Institutional Partnership and Data Sharing; Data and Metadata; Technology (Hardware and Software)» 9
Spatial Data Infrastructure, Technology: Pmapper, Quantum GIS, MapServer» 10
Spatial Data Infrastructure (data model diagram). Geographical features: CAOP, Portugal's Official Administrative Map; BGRI, census blocks (polygons); BSA, road network (lines), identified by road segment code, 1 600 150 road segments; BGE, buildings (points), identified by building code, 3 547 318 buildings from Census 2011 (residential and other: schools, hospitals, business); GRID (Grid_ETRS89_LAEA_1K), 94 265 grid cells; FNA, National Dwellings Register» 11
National Dwellings Register: In 2011, Statistics Portugal constructed a national geographical database of all the georeferenced buildings from the 2011 Census. This geographical dataset has been used to reference census data at point level and to support the creation of a National Dwellings Register (FNA). The FNA is updated with data available from different sources: (1) surveys conducted by Statistics Portugal; (2) administrative sources. Diagram: Census 2011 (construction); update of buildings/dwellings from INE surveys (building licence survey) and administrative sources, in a 1st and 2nd phase over time (t)» 12
Statistical Units, by information system and type of statistical unit: SIGINQ-IAP (BR): Company, Local Unit; SIGINQ-IE (FNA): Building (ED), Dwelling (UA); SIGINQ-AGR (BAA): Parcel» 13
Statistical Units (relationship diagram; N marks the "many" side of a relation). Systems: SIGINQ-AGR (BAA), SIGINQ-IE (FNA), SIGINQ-IAP (BR), GRID. Entities: Farm, Farmer; Building, Fraction, Dwelling (household dwelling, collective dwelling); Legal Unit, Local Unit» 14
Action: The current action is integrated into the Statistics Portugal strategy to improve the efficiency of the statistical process. Grant: Merging statistics and geospatial information in Member States. Goal: Implement a spatially enabled and quality-controlled point-based infrastructure for the production and delivery of BR statistics at all relevant geographic breakdown levels by means of data integration» 15
BR Geoprocessing Methodology: The address is the key element used to match the records, directly or indirectly, with the existing BGE, following a step-by-step approach based on locators capable of sequentially pinpointing the BR records. A different mix of those locators has been used for the cases processed.
MORADA_CP7: Complete address composed of type of road, name, number and 7-digit postal code.
BSA_CP7_DTA, BSA_CP7_ESQ: Used over the BSA in order to overcome discrepancies in the address.
MORADA_LOC_CP4: Uses the locality name and the 4-digit postal code.
CP7: Based on the 7-digit postal code, which is a linear structure used to code each block façade, composed of the CP4 and 3 additional digits.
CP4: Based on the 4-digit postal code, which is a polygonal structure used to code each postal distribution area» 16
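A minimal sketch of such a step-by-step locator cascade is shown below. It assumes that each locator can be modelled as a function returning a coordinate pair or None, and that locators are tried from the most to the least precise; the ordering, the record field names and the toy lookup tables are all hypothetical and only stand in for the real reference data.

```python
# Assumed order, from most to least precise (the slide does not state the exact sequence).
LOCATOR_ORDER = [
    "MORADA_CP7", "BSA_CP7_DTA", "BSA_CP7_ESQ", "MORADA_LOC_CP4", "CP7", "CP4",
]


def geocode(record, locators, order=LOCATOR_ORDER):
    """Try each available locator in turn; return (locator_name, point) for the first hit."""
    for name in order:
        locator = locators.get(name)
        if locator is None:
            continue  # this locator is not part of the mix used for this batch
        point = locator(record)
        if point is not None:
            return name, point
    return None, None  # record stays unmatched and needs manual treatment


# Toy lookup tables (hypothetical values) standing in for BGE/BSA reference data.
address_index = {("RUA A", "10", "1000-001"): (38.72, -9.14)}
cp7_index = {"1000-002": (38.73, -9.15)}

locators = {
    "MORADA_CP7": lambda r: address_index.get((r["street"], r["number"], r["cp7"])),
    "CP7": lambda r: cp7_index.get(r["cp7"]),
}

exact = geocode({"street": "RUA A", "number": "10", "cp7": "1000-001"}, locators)
fallback = geocode({"street": "RUA B", "number": "5", "cp7": "1000-002"}, locators)
```

Returning the name of the locator that matched keeps a record of how precisely each BR record was positioned, which is useful for quality control.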
BR Geo processing Methodology» 17
Implementation of the INSPIRE Framework. Statistics Portugal responsibility, 5 themes: I.3 Geographical Names; I.5 Addresses; III.1 Statistical Units; III.2 Buildings; III.10 Population Distribution, Demography. Participation in 5 Thematic WG» 18
Implementation of the INSPIRE Framework: Harmonization by HALE. The alignment is the mapping between source and target schemas. It defines relations between source and target entities (types or properties). Based on the defined relations, a transformation is derived. Download: version 2.9.4 (2015-11-01), 32- and 64-bit versions for Windows, Mac OS and Linux» 19
HALE Workflow (transformation according to the target schema): 1. Import source and target schemas; 2. Import data; 3. Define mapping rules; 4. Export transformed data; 5. Validate data» 20
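The idea of an alignment, a set of relations between source and target properties from which a transformation is derived, can be illustrated with plain Python. This is not HALE's API or file format; the source attribute names are hypothetical and the target names are a simplified stand-in for an INSPIRE Buildings schema.

```python
def transform(feature, alignment):
    """Apply each mapping rule: read a source property, convert it, write the target property."""
    target = {}
    for source_prop, (target_prop, convert) in alignment.items():
        if source_prop in feature:
            target[target_prop] = convert(feature[source_prop])
    return target


def validate(feature, required):
    """Return the required target properties that are missing from a transformed feature."""
    return [name for name in required if name not in feature]


# Step 3 of the workflow: mapping rules (all names here are illustrative assumptions).
alignment = {
    "COD_EDIFICIO": ("inspireId", str),
    "N_PISOS": ("numberOfFloorsAboveGround", int),
}

# Steps 2, 4 and 5: import one source feature, transform it, validate the result.
source_feature = {"COD_EDIFICIO": 123, "N_PISOS": "3", "OBS": "not mapped"}
transformed = transform(source_feature, alignment)
missing = validate(transformed, ["inspireId", "numberOfFloorsAboveGround"])
```

Source properties without a mapping rule, such as `OBS` here, are simply dropped, which mirrors the fact that only mapped relations contribute to the derived transformation.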
HALE Interface: Schema Explorer allows you to view the structure of the source (left) and the target (right) schema in various ways and to define mappings between the elements of the schemas. Alignment view displays the current alignment per type relation and allows editing or removing mapping cells. Error Log gives you insight into the application's log messages. Properties View displays information on the current selection. Functions View shows the available transformation functions, which can be used to define relations; further information on a selected function is displayed in the Properties view. Report List provides an overview of the last completed processes» 21
Joining BR and BGE / FNA: The BR Geographic Database is joined to the point-based components, BGE Buildings and FNA, aiming to create a unique geocoded national framework of Statistical Units to be used in the National Statistical System» 22
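The join can be pictured as attaching the building point to each geocoded BR record through a shared building code. The sketch below assumes such a key is available on both sides; the field names and sample values are hypothetical.

```python
def join_units(br_records, bge_points):
    """Attach building coordinates to BR records whose building code exists in the BGE."""
    joined, unmatched = [], []
    for rec in br_records:
        point = bge_points.get(rec.get("building_code"))
        if point is None:
            unmatched.append(rec)  # left for further analysis or update
        else:
            joined.append({**rec, "x": point[0], "y": point[1]})
    return joined, unmatched


# Hypothetical sample data: building code -> point coordinates.
bge_points = {"B001": (2650000.0, 1950000.0)}
br_records = [
    {"unit_id": "U1", "building_code": "B001"},
    {"unit_id": "U2", "building_code": "B999"},
    {"unit_id": "U3"},
]

joined, unmatched = join_units(br_records, bge_points)
```

Keeping the unmatched records in a separate list makes it easy to route them back to the geocoding or updating process.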
Joining BR and BGE / FNA: This data is now ready to be used by the Census 2016 Pilot» 23
THANK YOU. Data to be analysed / updated. Department of Methodology and System Information, GeoInformation Unit. ana.msantos@ine.pt» 24