Primal Dual Algorithms for Halfspace Depth

Similar documents
EMEA Rents and Yields MarketView

Directorate C: National Accounts, Prices and Key Indicators Unit C.3: Statistics for administrative purposes

10.2 Fitting a Linear Model to Data

2017 AP European History Summer Work

Vocabulary 2a Favourite characters aunt (n),, best (adj)

APPENDIX IV Data Tables

5 The Accessibility of Cities in European Infrastructure Networks

12 th International Students Conference Modern Analytical Chemistry September Prague, Czech Republic

The EuCheMS Division Chemistry and the Environment EuCheMS/DCE

Flight Guide. Winter 2017/18. Summer November 2017 Edition* *Subject to variation

T HE EUROPEAN METROPOLIS

F M U Total. Total registrants at 31/12/2014. Profession AS 2, ,574 BS 15,044 7, ,498 CH 9,471 3, ,932

From the Zonotope Construction to the Minkowski Addition of Convex Polytopes

41 st IUPAC World Chemistry Congress. Chemistry Protecting Health, Natural Environment and Cultural Heritage

CropCast Europe Weekly Report

AD HOC DRAFTING GROUP ON TRANSNATIONAL ORGANISED CRIME (PC-GR-COT) STATUS OF RATIFICATIONS BY COUNCIL OF EUROPE MEMBER STATES

2017 Source of Foreign Income Earned By Fund

Aerospace part number guide

Publication Date: 15 Jan 2015 Effective Date: 12 Jan 2015 Addendum 6 to the CRI Technical Report (Version: 2014, Update 1)

PLUTO The Transport Response to the National Planning Framework. Dr. Aoife O Grady Department of Transport, Tourism and Sport

Export Destinations and Input Prices. Appendix A

The European height reference system and its realizations

Mathematics. Pre-Leaving Certificate Examination, Paper 2 Higher Level Time: 2 hours, 30 minutes. 300 marks L.20 NAME SCHOOL TEACHER

Modelling structural change using broken sticks

A Convolutional Neural Network-based

PIRLS 2016 INTERNATIONAL RESULTS IN READING

EMEA Office OccupierView

Canadian Imports of Honey

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the WHO European Region

2 European cities. Introduction. Urbanisation. 36 Eurostat regional yearbook 2010 eurostat. The spatial dimension. The topics.

Weighted Voting Games

The 2011 Common Entrance syllabus and Geography ISEB Revision Guide

VIVA RAPID Worldwide Express Rates 2018

USDA Dairy Import License Circular for 2018

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

The SDEWES Index Atlas

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the WHO European Region

WHO EpiData. A monthly summary of the epidemiological data on selected vaccine preventable diseases in the European Region

Populating urban data bases with local data

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

PIRLS 2011 The PIRLS 2011 Safe and Orderly School Scale

Teacher Guide. Our resource has been created for ease of use by both TEACHERS and STUDENTS alike. Sample file NAME: A on Mount Olympus, Greece

The trade dispute between the US and China Who wins? Who loses?

10/27/2015. Content. Well-homogenized national datasets. Difference (national global) BEST (1800) Difference BEST (1911) Difference GHCN & GISS (1911)

The School Geography Curriculum in European Geography Education. Similarities and differences in the United Europe.

USDA Dairy Import License Circular for 2018 Commodity/

the quest for certainty

Part 2. Cost-effective Control of Acidification and Ground-level Ozone

Spatial Planning in Metropolitan Areas: Perspective from SPIMA

The Information Content of Capacity Utilisation Rates for Output Gap Estimates

STATISTICA MULTIVARIATA 2

ORGANISATION FOR ECONOMIC CO-OPERATION AND DEVELOPMENT

Drawing the European map

TIMSS 2011 The TIMSS 2011 Instruction to Engage Students in Learning Scale, Fourth Grade

FM Map Reading and Land Navigation

An Eye in the Sky EUMETSAT. Monitoring Weather, Climate and the Environment

PIRLS 2011 The PIRLS 2011 Teacher Working Conditions Scale

EUMETSAT. A global operational satellite agency at the heart of Europe. Presentation for the Spanish Industry Day Madrid, 15 March 2012

The Evolution and Evaluation of City Brand Models and Rankings

This document is a preview generated by EVS

Multivariate Analysis

USDA Dairy Import License Circular for 2018

PIRLS 2011 The PIRLS 2011 Students Motivated to Read Scale

Spatial Planning in Metropolitan Areas: SPIMA Project

MB of. Cable. Wholesale. FWBA (fixed OAOs. connections of which Full unbundled. OAO owning. Internet. unbundled broadband

PIRLS 2011 The PIRLS 2011 Students Engaged in Reading Lessons Scale

Weekly price report on Pig carcass (Class S, E and R) and Piglet prices in the EU. Carcass Class S % + 0.3% % 98.

Paul Krugman s New Economic Geography: past, present and future. J.-F. Thisse CORE-UCLouvain (Belgium)

Doubts about Discovering New Physics at the Large Hadron Collider (LHC)

PIRLS 2011 The PIRLS 2011 Teacher Career Satisfaction Scale

BSH Description. 2. Features. 3. Applications. 4. Pinning information. N-channel enhancement mode field-effect transistor

TIMSS 2011 The TIMSS 2011 Teacher Career Satisfaction Scale, Fourth Grade

European regulations for winter equipment on trucks and buses Winter 2015/2016

LMI MODELLING 4. CONVEX LMI MODELLING. Didier HENRION. LAAS-CNRS Toulouse, FR Czech Tech Univ Prague, CZ. Universidad de Valladolid, SP March 2009

How Well Are Recessions and Recoveries Forecast? Prakash Loungani, Herman Stekler and Natalia Tamirisa

E-SURFMAR Report. Jean ROLLAND Michel TREMANT Pierre BLOUCH Jon TURTON

A Markov system analysis application on labour market dynamics: The case of Greece

Daylight, Latitude, and Traditional Japanese Clocks

Quick Guide QUICK GUIDE. Activity 1: Determine the Reaction Rate in the Presence or Absence of an Enzyme

Towards indicators of proximity to services in Europe's major cities

Annotated Exam of Statistics 6C - Prof. M. Romanazzi

CO 250 Final Exam Guide

TIMSS 2011 The TIMSS 2011 School Discipline and Safety Scale, Fourth Grade

Economic and Social Council

Welcome to CERN! Dr. Yannis PAPAPHILIPPOU ACCELERATOR AND BEAMS Department. 05 Novembre

TNO (M. Schaap, R. Kranenburg, S. Jonkers, A. Segers, C. Hendriks) METNO (M. Schulz, A. Valdebenito, A. Mortier, M. Pommier, S.Tsyro, H.

2008 Men's 20 European Championship / Qualification

APPLYING BORDA COUNT METHOD FOR DETERMINING THE BEST WEEE MANAGEMENT IN EUROPE. Maria-Loredana POPESCU 1

Cyclone Click to go to the page. Cyclone

Bilateral Labour Agreements, 2004

The European regional Human Development and Human Poverty Indices Human Development Index

40 Years Listening to the Beat of the Earth

Interval-Based Composite Indicators

The Large Hadron Collider, a marvel of technology Lyn Evans. Royal Institution of South Wales St David s day lecture, 16 th March 2017

RISK ASSESSMENT METHODOLOGIES FOR LANDSLIDES

c. {2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97}. Also {x x is a prime less than 100}.

Trends in Human Development Index of European Union

04 June Dim A W V Total. Total Laser Met

Transcription:

for for Halfspace Depth David Bremner Komei Fukuda March 17, 2006

Wir sind Zentrum 75 Hammerfest, Norway 70 65 Oslo, Norway Helsinki, Finland St. Petersburg, Russia 60 Stockholm, Sweden Aberdeen, Scotland Glasgow, Edinburgh, Scotland Scotland Copenhagen, Denmark Moscow, Russia 55 Belfast, Northern Newcastle on Tyne, Ireland England Dublin, Liverpool, Ireland Manchester, Leeds, England England England Hamburg, Germany Bremen, Germany Birmingham, England Amsterdam, Netherlands Berlin, Germany Warsaw, Poland Bristol, London, England England Brussels, Belgium Plymouth, England Frankfurt, Mean Germany Prague, Czech Republic 50 Paris, France Munich, Germany Vienna, Austria Zürich, Switzerland Budapest, Hungary Odessa, Ukraine Lyons, France Milan, Italy Venice, Italy Bordeaux, France Belgrade, Yugoslavia 45 Bucharest, Romania Marseilles, France Sofia, Bulgaria Rome, Italy Barcelona, Spain Madrid, Spain Naples, Italy Ankara, Turkey 40 Lisbon, Portugal Athens, Greece for 35 10 0 10 20 30 40

Estimators of Location centre Given a set of vectors, return a vector which best describes the set. City halfplane depth-1 Frankfurt, Germany 19 Brussels, Belgium 17 Munich, Germany 16 Amsterdam, Netherlands 15 Zürich, Switzerland 13 London, England 13 Prague, Czech Republic 12 for

Estimators of Location centre Given a set of vectors, return a vector which best describes the set. depth measure Rank a set of vectors such that vectors of maximum rank define one or more centre vectors. City halfplane depth-1 Frankfurt, Germany 19 Brussels, Belgium 17 Munich, Germany 16 Amsterdam, Netherlands 15 Zürich, Switzerland 13 London, England 13 Prague, Czech Republic 12 for

Robustness for The breakdown point of an estimator is the fraction of data that must be moved to infinity before the estimator is also moved to infinity. median mean

Robustness for The breakdown point of an estimator is the fraction of data that must be moved to infinity before the estimator is also moved to infinity. The breakdown point of the mean is 1 (i.e. one n error suffices to destroy the estimate). median mean

Robustness for The breakdown point of an estimator is the fraction of data that must be moved to infinity before the estimator is also moved to infinity. The breakdown point of the mean is 1 (i.e. one n error suffices to destroy the estimate). The median in R 1 has breakdown 1/2. median mean

What makes a good depth measure? Affine Invariant i.e. independant of coordinate system for

What makes a good depth measure? Affine Invariant i.e. independant of coordinate system Robustness A high breakdown point. For affine invariant measures in R d, breakdown 1/d for

What makes a good depth measure? Affine Invariant i.e. independant of coordinate system Robustness A high breakdown point. For affine invariant measures in R d, breakdown 1/d Nesting Let D X,k denote the points of R d at depth k with respect to X. We want for k > j = D X,k conv D X,j

The halfspace depth of a point q with respect to S R d is defined as depth S (q) = min {p S a, p a, q } a R d \0 q depth(q) = 1 p depth(p) = 4 for

The halfspace depth of a point q with respect to S R d is defined as depth S (q) = min {p S a, p a, q } a R d \0 1 3 2 4 for

Tukey Median The Tukey Median t(s) is defined as {q S depth S (q) = max p S depth S(p)} depth 1 depth 2 depth 5= centre for

Tukey Median The Tukey Median t(s) is defined as {q S depth S (q) = max p S depth S(p)} depth 1 depth 2 depth 5= centre nested for

Tukey Median The Tukey Median t(s) is defined as {q S depth S (q) = max p S depth S(p)} depth 1 depth 2 depth 5= centre nested The Tukey median has breakdown point at least 1/(d + 1) for points in general position. for

results for NP-complete, Johnson and Preparata 1978 Halfspace depth APX- Amaldi and Kahn, 1998

results for NP-complete, Johnson and Preparata 1978 Densest Open and Closed Hemisphere problem Halfspace depth APX- Amaldi and Kahn, 1998

results for NP-complete, Johnson and Preparata 1978 Densest Open and Closed Hemisphere problem Halfspace depth APX- Amaldi and Kahn, 1998 Maximum Feasible Subsystem A 2-approximation of MFS is possible.

The Dual Arrangement for ˆp 3 = 1 ˆp, x = 1 + ++ + ˆp = (p, 1) R d+1 h(p) = {x R d+1 ˆp i, x = 0} A p = {h(q) q S \ {p}} {x ˆp, x = 1}

The Dual Arrangement for ˆp 3 = 1 ˆp, x = 1 + ++ + A p = {h(q) q S \ {p}} {x ˆp, x = 1} σ(x) = (σ 1... σ n ) where σ i = sign( ˆp, x )

Reverse Search for reverse search requires two problem specific functions.

Reverse Search for reverse search requires two problem specific functions. The adjacency oracle Adj() returns the neighbouring cells Adj Y X C Adj Adj f() f(adj(x, j)) = X

Reverse Search for reverse search requires two problem specific functions. The adjacency oracle Adj() returns the neighbouring cells The local search function f ( ) satisfies C X k f k (X ) = C Adj Y X C Adj Adj f() f(adj(x, j)) = X

Adjacency Oracle X σ(x) is flippable at position j if negating sign j yields a cell. 1 + + + + + + ++ 2 3 ++ for + + ++ 4 + + + Adj(+ + ++, 1) = true Adj(+ + ++, 2) = false

Adjacency Oracle X σ(x) is flippable at position j if negating sign j yields a cell. Adj(X, j) is true iff X is flippable at position j. Solve via LP. 1 + + + + + + ++ 2 3 ++ for + + ++ 4 + + + Adj(+ + ++, 1) = true Adj(+ + ++, 2) = false

Local Search Function Define a canonical interior point i(x ) for each cell. for 1 2 3 i(x) i(c) Y f(x) = Y 4

Local Search Function Define a canonical interior point i(x ) for each cell. Choose an arbitrary cell C. To find a closer cell to C shoot a ray from i(x ) to i(c). for 1 2 3 i(x) i(c) Y f(x) = Y 4

Local Search Function Define a canonical interior point i(x ) for each cell. Choose an arbitrary cell C. To find a closer cell to C shoot a ray from i(x ) to i(c). Requires a single LP for i(c). i(x ) can be computed by flipping test. 1 2 3 for i(x) i(c) Y f(x) = Y 4

Reverse Search Summary for Time O(n LP(n, d) cells ).

Reverse Search Summary for Time O(n LP(n, d) cells ). Space complexity O(nd).

Reverse Search Summary for Time O(n LP(n, d) cells ). Space complexity O(nd). Optimizations include Choosing a shallow start cell Pruning the search.

Reverse Search Summary for Time O(n LP(n, d) cells ). Space complexity O(nd). Optimizations include Choosing a shallow start cell Pruning the search. uses ZRAM for reverse-search framework, cddlib to solve small, dense LPs. Parallelization is no extra implementation effort with ZRAM, and speedup is linear.

Reverse Search Summary for Time O(n LP(n, d) cells ). Space complexity O(nd). Optimizations include Choosing a shallow start cell Pruning the search. uses ZRAM for reverse-search framework, cddlib to solve small, dense LPs. Parallelization is no extra implementation effort with ZRAM, and speedup is linear. Little information until enumeration terminates.

for Update at a every step an upper bound and a lower bound for the depth. Terminate when (if) bounds are equal

for Update at a every step an upper bound and a lower bound for the depth. Terminate when (if) bounds are equal To ensure termination, fall back on enumeration after a fixed time limit.

for Update at a every step an upper bound and a lower bound for the depth. Terminate when (if) bounds are equal To ensure termination, fall back on enumeration after a fixed time limit. Generally, answers improve with time.

Upper Bounds via Random Walks for Use Adj() oracle from enumeration algorithm 1 2 3 4 + + + + ++ + ++ Adj() + + + ++ Adj() 5 + + ++ 6

Upper Bounds via Random Walks for Use Adj() oracle from enumeration algorithm Greedily try to reduce number of + in σ until local minimum reached. 1 2 3 4 + + + + ++ + ++ Adj() + + + ++ Adj() 5 + + ++ 6

Upper Bounds via Random Walks for Use Adj() oracle from enumeration algorithm Greedily try to reduce number of + in σ until local minimum reached. Repeat several times choosing a random starting cell. 1 2 3 4 + + + + ++ + ++ Adj() + + + ++ Adj() 5 + + ++ 6

Minimal Dominating Sets for Definition A Minimal Dominating Set (MDS) for p R d with respect to S R d is R S such that p.

Minimal Dominating Sets for Definition A Minimal Dominating Set (MDS) for p R d with respect to S R d is R S such that p conv R p.

Minimal Dominating Sets for Definition A Minimal Dominating Set (MDS) for p R d with respect to S R d is R S such that p conv R if R R then p / conv R. An MDS might also be called a Charathéodory set. p

Lower bounds via MDS s Proposition Let be the set of all MDS s for p with respect to S. Let T be a minimum transversal (hitting set) of. T = depth(p) for

Lower bounds via MDS s Proposition Let be the set of all MDS s for p with respect to S. Let T be a minimum transversal (hitting set) of. T depth(p) T = depth(p) Each MDS intersects both closed sides of any hyperplane through p. for p

Lower bounds via MDS s Proposition Let be the set of all MDS s for p with respect to S. Let T be a minimum transversal (hitting set) of. T = depth(p) Suppose T < depth(p) if T is separable from p, there is an uncovered MDS. for T p

Lower bounds via MDS s Proposition Let be the set of all MDS s for p with respect to S. Let T be a minimum transversal (hitting set) of. T = depth(p) Suppose T < depth(p) if T is not separable from P, there is a totally covered MDS. T for p

Generating Missed MDSs (cuts) Definition Given a partial traversal T for the MDS s of p w.r.t. S Define S = S \ T. Define the auxiliary polytope Q(p, T ) as λ satisfying: λ S = p λ i = 1 λ i 0 i for Each vertex (basic solution) of Q(p, T ) defines an MDS missed by T.

Generating Missed MDSs (cuts) Definition Given a partial traversal T for the MDS s of p w.r.t. S Define S = S \ T. Define the auxiliary polytope Q(p, T ) as λ satisfying: λ S = p λ i = 1 λ i 0 i for Each vertex (basic solution) of Q(p, T ) defines an MDS missed by T. A single cut can be found by LP

Generating Missed MDSs (cuts) Definition Given a partial traversal T for the MDS s of p w.r.t. S Define S = S \ T. Define the auxiliary polytope Q(p, T ) as λ satisfying: λ S = p λ i = 1 λ i 0 i for Each vertex (basic solution) of Q(p, T ) defines an MDS missed by T. A single cut can be found by LP k cuts can be found via reverse search (or other pivoting method).

Algorithm for 1. Walk in the dual arrangement to find a cell with minimal positive support

Algorithm for 1. Walk in the dual arrangement to find a cell with minimal positive support 2. Find obstructions (i.e. MDS s) to the optimality of this cell via the auxiliary polytope

Algorithm for 1. Walk in the dual arrangement to find a cell with minimal positive support 2. Find obstructions (i.e. MDS s) to the optimality of this cell via the auxiliary polytope 3. If none found, report optimal (we have solved the global minimum transversal problem).

Algorithm for 1. Walk in the dual arrangement to find a cell with minimal positive support 2. Find obstructions (i.e. MDS s) to the optimality of this cell via the auxiliary polytope 3. If none found, report optimal (we have solved the global minimum transversal problem). 4. Otherwise solve the resulting (partial) transversal problem via integer programming.

Algorithm for 1. Walk in the dual arrangement to find a cell with minimal positive support 2. Find obstructions (i.e. MDS s) to the optimality of this cell via the auxiliary polytope 3. If none found, report optimal (we have solved the global minimum transversal problem). 4. Otherwise solve the resulting (partial) transversal problem via integer programming. 5. If bored, switch to enumeration.

Where Purpose cdd http://www.ifor. math.ethz.ch/ fukuda/cdd home Solving Dense LPs COIN lrs SYMPHONY ZRAM http://www. coin-or.org http://cgm.cs. mcgill.ca/ avis/ http://www. branchandcut.org http://www.cs. unb.ca/ bremner/ zram Simplex Solver, cut generation MDS generation. Branch and Cut Parallel reverse search for

time (s) 1400 1200 1000 800 600 400 200 dominating set generation random walk integer program solving for 0 50a 50b 50c 60a 60b 60c 70a 70b 70c 80a 80b 80c 90a 90b 90c 110b 120a 120b 130a points:trial Dimension 10

time (s) 40 35 30 25 20 15 10 5 dominating set generation random walk for 0 3a 3b 3c 4a 4b 4c 5a 5b 5c 6a 6b 6c 7a 7b 7c 8a 8b 8c 9a 9b 9c 10a10b10c11a11b11c dimension:trial 50 points

16 dimension 20 for bounds 14 12 10 8 mean of bounds optimal value 130a 140c 150a 150c 140a 150b 6 4 80b 90a 120a 120c 120b 100a100b 110a 110b110c 100c 130c 130b 140b 2 90c 50c 70b 80c 90b 0 40a40b40c 50a50b 60a60b60c 70a 70c 80a 35 55 75 95 115 135 155 points:trial

35 dimension 5, symmetrized point sets for 30 mean of bounds optimal value 87a bounds 25 20 15 47a 47b 57a 57b 67a 67b 77a 77b 10 27a 27b 37a 37b 5 17a 17b 0 7a 7b 0 20 40 60 80 100 points:trial

Conclusions for Row generation is crucial to solve the IPs.

Conclusions for Row generation is crucial to solve the IPs. Restarting IP solver should be avoided, integrate cut generation.

Conclusions for Row generation is crucial to solve the IPs. Restarting IP solver should be avoided, integrate cut generation. Better upper bounds would nice.