Evolution Strategies for Optimizing Rectangular Cartograms

Similar documents
Lecture 26 Section 8.4. Mon, Oct 13, 2008

Nursing Facilities' Life Safety Standard Survey Results Quarterly Reference Tables

Analyzing Severe Weather Data

Your Galactic Address

Use your text to define the following term. Use the terms to label the figure below. Define the following term.

Sample Statistics 5021 First Midterm Examination with solutions

Evolution Strategies for Optimizing Rectangular Cartograms

What Lies Beneath: A Sub- National Look at Okun s Law for the United States.

Country

Parametric Test. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 1984.

Country

Appendix 5 Summary of State Trademark Registration Provisions (as of July 2016)

Drought Monitoring Capability of the Oklahoma Mesonet. Gary McManus Oklahoma Climatological Survey Oklahoma Mesonet

Multiway Analysis of Bridge Structural Types in the National Bridge Inventory (NBI) A Tensor Decomposition Approach

Statistical Mechanics of Money, Income, and Wealth

Smart Magnets for Smart Product Design: Advanced Topics

SAMPLE AUDIT FORMAT. Pre Audit Notification Letter Draft. Dear Registrant:

Forecasting the 2012 Presidential Election from History and the Polls

Cluster Analysis. Part of the Michigan Prosperity Initiative

2006 Supplemental Tax Information for JennisonDryden and Strategic Partners Funds

Class business PS is due Wed. Lecture 20 (QPM 2016) Multivariate Regression November 14, / 44

Summary of Natural Hazard Statistics for 2008 in the United States

C Further Concepts in Statistics

Annual Performance Report: State Assessment Data

Swine Enteric Coronavirus Disease (SECD) Situation Report June 30, 2016

EXST 7015 Fall 2014 Lab 08: Polynomial Regression

5. Atmospheric Supply of Mercury to the Baltic Sea in 2015

AIR FORCE RESCUE COORDINATION CENTER

Data Visualization (DSC 530/CIS )

Meteorology 110. Lab 1. Geography and Map Skills

Data Visualization (CIS 468)

Module 19: Simple Linear Regression

Some concepts are so simple

Swine Enteric Coronavirus Disease (SECD) Situation Report Sept 17, 2015

Resource efficiency and Geospatial data What EUROSTAT does. What could do.

Final Exam. 1. Definitions: Briefly Define each of the following terms as they relate to the material covered in class.

Kari Lock. Department of Statistics, Harvard University Joint Work with Andrew Gelman (Columbia University)

REGRESSION ANALYSIS BY EXAMPLE

The veto as electoral stunt

A L A BA M A L A W R E V IE W

Overview of numbers submitted for Statistics on Pending Mutual Agreement Procedures (MAPs) under the Arbitration Convention (AC) at the End of 2017

AFRCC AIR FORCE RESCUE COORDINATION CENTER

Problems arising during the implementation of CLC2006

Swine Enteric Coronavirus Disease (SECD) Situation Report Mar 5, 2015

Introduction to Mathematical Statistics and Its Applications Richard J. Larsen Morris L. Marx Fifth Edition

GRIPs. Overview Andrea Ćirlićová Business Area Manager, System Development

Empirical Application of Panel Data Regression

Estimating Dynamic Games of Electoral Competition to Evaluate Term Limits in U.S. Gubernatorial Elections: Online Appendix

Anycast Latency How many sites are enough?

Combinatorics. Problem: How to count without counting.

Analysis of the USDA Annual Report (2015) of Animal Usage by Research Facility. July 4th, 2017

Weekly price report on Pig carcass (Class S, E and R) and Piglet prices in the EU. Carcass Class S % + 0.3% % 98.

Further Concepts in Statistics

Statistics on Pending Mutual Agreement Procedures (MAPs) under the Arbitration Convention at the End of 2015

Test of Convergence in Agricultural Factor Productivity: A Semiparametric Approach

Draft Report. Prepared for: Regional Air Quality Council 1445 Market Street, Suite 260 Denver, Colorado Prepared by:

$3.6 Billion GNMA Servicing Offering

TEPZZ A_T EP A1 (19) (11) EP A1. (12) EUROPEAN PATENT APPLICATION published in accordance with Art.

TEPZZ 89955_A T EP A2 (19) (11) EP A2 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.: G01R 15/20 ( )

Resources. Amusement Park Physics With a NASA Twist EG GRC

Regression Diagnostics

AC/RC Regional Councils of Colonels & Partnerships

Effects of Various Uncertainty Sources on Automatic Generation Control Systems

TEPZZ A T EP A2 (19) (11) EP A2 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.: H02M 7/483 ( )

P a g e 5 1 of R e p o r t P B 4 / 0 9

2/25/2019. Taking the northern and southern hemispheres together, on average the world s population lives 24 degrees from the equator.

Week 3 Linear Regression I

EU JOINT TRANSFER PRICING FORUM

TEPZZ A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.: G01J 5/62 ( ) G01J 3/28 (2006.

"Transport statistics" MEETING OF THE WORKING GROUP ON RAIL TRANSPORT STATISTICS. Luxembourg, 25 and 26 June Bech Building.

P a g e 3 6 of R e p o r t P B 4 / 0 9

Variance estimation on SILC based indicators

Further Concepts in Statistics

Statistics on Pending Mutual Agreement Procedures (MAPs) under the Arbitration Convention at the End of 2014

Time-Series Trends of Mercury Deposition Network Data

Discontinuation of Support for Field Chemistry Measurements in the National Atmospheric Deposition Program National Trends Network (NADP/NTN)

APRIL 1999 THE WALL STREET JOURNAL CLASSROOM EDITION Hurricanes, typhoons, coastal storms Earthquake (San Francisco area) Flooding

T h e C S E T I P r o j e c t

Flexible Top Trading Cycles and Chains Mechanism: Maintaining Diversity in Erasmus Student Exchange

Blueline Tilefish Assessment Approach: Stock structure, data structure, and modeling approach

TEPZZ 95785_A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION

Land Cover and Land Use Diversity Indicators in LUCAS 2009 data

Teachers Curriculum Institute Map Skills Toolkit 411

Software Process Models there are many process model s in th e li t e ra t u re, s om e a r e prescriptions and some are descriptions you need to mode

Topographic Recreational Map Of New Mexico: Detailed Travel Map By GTR Mapping

The Observational Climate Record

Serbia and Montenegro

OH BOY! Story. N a r r a t iv e a n d o bj e c t s th ea t e r Fo r a l l a g e s, fr o m th e a ge of 9

Daily Operations Briefing June 9, 2012 As of 8:30 a.m. EDT

GIS Reference Layers on UWWT Directive Sensitive Areas. Technical Report. Version: 1.0. ETC/Water task:

Directorate F: Social Statistics and information Society UnitF-3: Living conditions and social protection statistics. Working Group SOCIAL PROTECTION

TEPZZ 6_Z6_ A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.:

ANNEXES. to the REPORT FROM THE COMMISSION TO THE EUROPEAN PARLIAMENT AND THE COUNCIL

TEPZZ A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.:

Clear Roads Overview. AASHTO Committee on Maintenance July 23, 2018 Charlotte, North Carolina

LUCAS 2009 (Land Use / Cover Area Frame Survey)

Axel Ssymank. Bundesamt für Naturschutz

I N A C O M P L E X W O R L D

EUROINDICATORS WORKING GROUP THE IMPACT OF THE SEASONAL ADJUSTMENT PROCESS OF BUSINESS TENDENCY SURVEYS ON TURNING POINTS DATING

TEPZZ 6_8_ ZA_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.:

Transcription:

Evolution Strategies for Optimizing Rectangular Cartograms Kevin Buchin 1, Bettina Speckmann 1, and Sander Verdonschot 2 1 TU Eindhoven, 2 Carleton University September 20, 2012 Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 1 / 33

Cartograms Map with administrative boundaries Visualizes a quantitative value per region (e.g. population, GDP) Regions are deformed to make their area proportional to this variable c Copyright 2006 SASI Group (University of Sheffield) and Mark Newman (University of Michigan) Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 2 / 33

Cartograms c 2008 M. E. J. Newman, Department of Physics and Center for the Study of Complex Systems, University of Michigan Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 3 / 33

Types of Cartograms Contiguous area cartogram Non-contiguous area cartogram Dorling / Demers cartogram Rectangular cartogram Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 4 / 33

Contiguous area cartogram Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 5 / 33

Non-contiguous area cartogram Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 6 / 33

Dorling / Demers cartogram c 2002 Ian Bortins and Steve Demers Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 7 / 33

Rectangular Cartogram Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 8 / 33

Rectangular Cartogram Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 9 / 33

Workflow Given a geometric map and a corresponding variable Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 10 / 33

Workflow Convert it to the correct format Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 10 / 33

Workflow Assign a direction to each adjacency Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 10 / 33

Workflow Assign a direction to each adjacency Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 10 / 33

Workflow Build the cartogram [Speckmann, van Kreveld, and Florisson, 2006] Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 10 / 33

Workflow Build the cartogram [Speckmann, van Kreveld, and Florisson, 2006] Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 10 / 33

Workflow Our focus is on the direction of the adjacencies Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 10 / 33

Regular Edge Labelings Colouring of interior edges of the dual graph Red: vertical adjacency, Blue: horizontal adjacency Must satisfy local constraints Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 11 / 33

Regular Edge Labelings Flip: invert colour of all edges inside alternating 4-cycle Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 12 / 33

Regular Edge Labelings Flip: invert colour of all edges inside alternating 4-cycle Flip graph: Take all labelings, add an edge if two are connected by a flip Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 12 / 33

Regular Edge Labelings Flip: invert colour of all edges inside alternating 4-cycle Flip graph: Take all labelings, add an edge if two are connected by a flip This graph is a distributive lattice [Fusy, 2009] Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 12 / 33

Challenges Maps of EU and US have over 100 million labelings The best labeling can vary between data sets A local change in the labeling can cause global changes in the cartogram Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 13 / 33

Evolution Strategy 1 Initialize a random population of labelings Start at the minimum labeling Follow a random path towards the maximal labeling Stop at a random height Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 14 / 33

Evolution Strategy 1 Initialize a random population of labelings Start at the minimum labeling Follow a random path towards the maximal labeling Stop at a random height 2 Copy a few of the best labelings to the next generation Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 14 / 33

Evolution Strategy 1 Initialize a random population of labelings Start at the minimum labeling Follow a random path towards the maximal labeling Stop at a random height 2 Copy a few of the best labelings to the next generation 3 Use selection and mutation to fill the rest of the population Select a random labeling, favouring higher quality With low probability, take a long random walk in the lattice With high probability, take one random step in the lattice Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 14 / 33

Evolution Strategy 1 Initialize a random population of labelings Start at the minimum labeling Follow a random path towards the maximal labeling Stop at a random height 2 Copy a few of the best labelings to the next generation 3 Use selection and mutation to fill the rest of the population Select a random labeling, favouring higher quality With low probability, take a long random walk in the lattice With high probability, take one random step in the lattice 4 Repeat steps 2 and 3 a fixed number of times, keeping track of the best labeling found Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 14 / 33

Quality Criteria Correct adjacencies Relative positions of the regions d d d d Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 15 / 33

Quality Criteria Correct adjacencies Relative positions of the regions Cartographic error of the regions A c A s A s A c : Area in cartogram A s : Area specified 50 (20) Error: 1.5 20 (30) Error: 0.33 Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 16 / 33

Quality Criteria Correct adjacencies Relative positions of the regions Cartographic error of the regions A c A s A s A c : Area in cartogram A s : Area specified Error: 1.5 Error: 0.33 Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 16 / 33

Quality Criteria Correct adjacencies Relative positions of the regions (30 %) Cartographic error of the regions (70 %) A c A s A s A c : Area in cartogram A s : Area specified Error: 1.5 Error: 0.33 Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 17 / 33

Experimental Results Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 18 / 33

United States (63 regions) Made New Mexico adjacent to Utah to resolve 4-state point All US Census data sets with a positive value for each state Generating one cartogram took 8 minutes Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 19 / 33

United States - Population WA OR MT MN WI ID ND IN IL MI OH NY NJ ME VT NH MA CT RI WY SD IA NV UT NE CO KS MO KY WV PA VA DE MD CA OK AR TN NC AZ NM TX LA MS AL GA SC FL Average error: 0.5 % Maximum error: 2.2 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 20 / 33

United States - High School Graduates (%) WA OR ID MT ND MN ME NH VT CA NV UT WY SD NE IA WI MI OH PA MA AZ NM CO KS MO IL KY IN WV MD DE NY NJ CT RI OK AR TN VA NC TX LA MS AL GA SC FL Average error: 0.0 % Maximum error: 0.0 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 21 / 33

United States - Population per square mile ME WI MN MI VT NH ND WA MT ID IA SD OR WY NE NV IL UT COKS CA MO AZ IN OH WV MD PA DE NJ NY MA NM OK KY VA CT RI AR TN NC TX LA MS AL GA SC FL Average error: 2.0 % Maximum error: 11.3 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 22 / 33

United States - Population per square mile Negative correlation with land area Large variation ME VT NH WI MI MN ND WA MT ID IA SD OR WY NE NV IL UT COKS CA MO AZ IN OH WV MD PA DE NJ NY MA NM OK KY VA CT RI AR TN NC TX LA MS AL GA SC FL Average error: 2.0 % Maximum error: 11.3 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 23 / 33

United States - Non-Hispanic White Population (%) Negative correlation with land area Large variation ME WA MT ND MN VT NH MA ID WY SD IA WI MI PA OR NE OH NJ NY CT RI UT CO KS MO IL IN WV MD DE CA NV NM OK AR TN KY VA NC AZ TX LA MS AL GA SC Average error: 0.0 % Maximum error: 0.0 % FL Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 24 / 33

United States - Businesses w/o Paid Employees Negative correlation with land area Large variation WA OR ND ID MT MN WI MI ME VT NH SD WY NE NV UT CO IA KS MO IL IN OH PA NY MA CA OK KY WV AZ NM AR TN VA MD DE NJ CT RI NC TX SC LA MS GA AL FL Average error: 0.7 % Maximum error: 3.1 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 25 / 33

United States - Variation vs Maximum Error Negative correlation with land area Large variation maximum cartographic error (min) 0.40 0.35 0.30 0.25 0.20 0.15 0.10 0.05 0.00 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 coefficient of variation Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 26 / 33

United States - 2008 Presidential Elections ME WA OR ID ND MT MN WI MI DC NH VT CA SD WY NE NV UT CO KS OK AZ NM IA MO AR TN IL IN KY VA NC OH WV MD PA DE NJ NY MA CT RI TX LA MS AL GA SC AK HI FL Average error: 0.0 % Maximum error: 0.0 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 27 / 33

Europe (65 regions) Merged Luxembourg with Belgium, and Moldova with Ukraine All CIA World Fact Book ranked data sets with data for each country Generating one cartogram took 6 minutes Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 28 / 33

Europe - Population IS SE FI NO DK EE IE GB NL BE + LU DE SK PL LV LT BY UA + MD RU FR CZ HU MK RO BG PT ES CH IT AT RS KO SI ME GR BA HR AL MT TR CY Average error: 0.0 % Maximum error: 0.0 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 29 / 33

Europe - Account Balance IS NO IE GB DK SE FI EE LV EE LV LT LT BY RU NL DE PL CZ AT CH SK UA + MD HU RO SI SI HR RS BG BG MK MK BA MEKO AL BA ME ES IT GR TR PT BE BE + LU + LU FR MT MT CY Average error: 4.0 % Maximum error: 19.7 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 30 / 33

Europe - Exports IS IS NO GB NO SE FI IE BE + LU NL DK EE RU SE FI IE LV RU NL GB LT DK PL BY DE UA + MD DE EE LV FR LT BE + LU CZ SK RO BY HU ES PL CZ RS BG CH AT SK UA + MD HR MK HU RS RO CH AT KO SI AL ME SI HR ME KO BA BG AL MK FR BA TR PT ES PT GR IT TR MT GR IT MT CY CY Max error: 0.0 % BBSD: 0.088 Max error: 1.6 % BBSD: 0.078 Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 31 / 33

World (275 regions) All countries with population over 1 million Generating one cartogram took 3.5 hours Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 32 / 33

World - Population Average error: 1.17 % Maximum error: 18.5 % Sander Verdonschot (Carleton University) Optimizing Rectangular Cartograms September 20, 2012 33 / 33