Optimal Approximate Polytope Membership

Similar documents
Approximating min-max k-clustering

Strong Matching of Points with Geometric Shapes

arxiv: v2 [cs.cg] 26 Jun 2017

Research Article An iterative Algorithm for Hemicontractive Mappings in Banach Spaces

Elementary Analysis in Q p

An Introduction To Range Searching

On the Chvatál-Complexity of Knapsack Problems

Linear diophantine equations for discrete tomography

MATH 6210: SOLUTIONS TO PROBLEM SET #3

GIVEN an input sequence x 0,..., x n 1 and the

A Note on Guaranteed Sparse Recovery via l 1 -Minimization

Estimation of the large covariance matrix with two-step monotone missing data

On Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law

The inverse Goldbach problem

Robustness of classifiers to uniform l p and Gaussian noise Supplementary material

Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various Families of R n Norms and Some Open Problems

Positive decomposition of transfer functions with multiple poles

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES

4. Score normalization technical details We now discuss the technical details of the score normalization method.

Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Application

For q 0; 1; : : : ; `? 1, we have m 0; 1; : : : ; q? 1. The set fh j(x) : j 0; 1; ; : : : ; `? 1g forms a basis for the tness functions dened on the i

p-adic Measures and Bernoulli Numbers

#A37 INTEGERS 15 (2015) NOTE ON A RESULT OF CHUNG ON WEIL TYPE SUMS

Multiplicative group law on the folium of Descartes

Opposite-quadrant depth in the plane

where x i is the ith coordinate of x R N. 1. Show that the following upper bound holds for the growth function of H:

An Estimate For Heilbronn s Exponential Sum

Introduction to Banach Spaces

The Graph Accessibility Problem and the Universality of the Collision CRCW Conflict Resolution Rule

Combinatorics of topmost discs of multi-peg Tower of Hanoi problem

Some results of convex programming complexity

Convex Optimization methods for Computing Channel Capacity

The Fekete Szegő theorem with splitting conditions: Part I

STRONG TYPE INEQUALITIES AND AN ALMOST-ORTHOGONALITY PRINCIPLE FOR FAMILIES OF MAXIMAL OPERATORS ALONG DIRECTIONS IN R 2

The non-stochastic multi-armed bandit problem

Asymptotically Optimal Simulation Allocation under Dependent Sampling

BEST CONSTANT IN POINCARÉ INEQUALITIES WITH TRACES: A FREE DISCONTINUITY APPROACH

arxiv:math/ v4 [math.gn] 25 Nov 2006

On the Toppling of a Sand Pile

Space-Time Tradeoffs for Approximate Spherical Range Counting

GOOD MODELS FOR CUBIC SURFACES. 1. Introduction

Radial Basis Function Networks: Algorithms

Elementary theory of L p spaces

CMSC 425: Lecture 4 Geometry and Geometric Programming

Uniform Law on the Unit Sphere of a Banach Space

Topic 7: Using identity types

MATH 2710: NOTES FOR ANALYSIS

On Wald-Type Optimal Stopping for Brownian Motion

#A8 INTEGERS 12 (2012) PARTITION OF AN INTEGER INTO DISTINCT BOUNDED PARTS, IDENTITIES AND BOUNDS

Sums of independent random variables

Kinetic Stable Delaunay Graphs

Efficient algorithms for the smallest enclosing ball problem

Best approximation by linear combinations of characteristic functions of half-spaces

Correspondence Between Fractal-Wavelet. Transforms and Iterated Function Systems. With Grey Level Maps. F. Mendivil and E.R.

Complex Analysis Homework 1

HENSEL S LEMMA KEITH CONRAD

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests

Math 751 Lecture Notes Week 3

Robust hamiltonicity of random directed graphs

ON THE NORM OF AN IDEMPOTENT SCHUR MULTIPLIER ON THE SCHATTEN CLASS

Introduction Consider a set of jobs that are created in an on-line fashion and should be assigned to disks. Each job has a weight which is the frequen

Cryptanalysis of Pseudorandom Generators

Research Article Controllability of Linear Discrete-Time Systems with Both Delayed States and Delayed Inputs

Topic: Lower Bounds on Randomized Algorithms Date: September 22, 2004 Scribe: Srinath Sridhar

Pell's Equation and Fundamental Units Pell's equation was first introduced to me in the number theory class at Caltech that I never comleted. It was r

John Weatherwax. Analysis of Parallel Depth First Search Algorithms

The Knuth-Yao Quadrangle-Inequality Speedup is a Consequence of Total-Monotonicity

A note on the random greedy triangle-packing algorithm

Statics and dynamics: some elementary concepts

Principles of Computed Tomography (CT)

LEIBNIZ SEMINORMS IN PROBABILITY SPACES

19th Bay Area Mathematical Olympiad. Problems and Solutions. February 28, 2017

3 Properties of Dedekind domains

On Character Sums of Binary Quadratic Forms 1 2. Mei-Chu Chang 3. Abstract. We establish character sum bounds of the form.

The rapid growth in the size and scope of datasets in science

CR extensions with a classical Several Complex Variables point of view. August Peter Brådalen Sonne Master s Thesis, Spring 2018

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

Stable Delaunay Graphs

An Analysis of Reliable Classifiers through ROC Isometrics

Analysis of some entrance probabilities for killed birth-death processes

Solution sheet ξi ξ < ξ i+1 0 otherwise ξ ξ i N i,p 1 (ξ) + where 0 0

Recursive Estimation of the Preisach Density function for a Smart Actuator

COMMUNICATION BETWEEN SHAREHOLDERS 1

Intrinsic Approximation on Cantor-like Sets, a Problem of Mahler

ON MINKOWSKI MEASURABILITY

2 Asymptotic density and Dirichlet density

Additive results for the generalized Drazin inverse in a Banach algebra

Approximation of the Euclidean Distance by Chamfer Distances

ON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS

2 Asymptotic density and Dirichlet density

MATH 361: NUMBER THEORY EIGHTH LECTURE

Round-off Errors and Computer Arithmetic - (1.2)

ON POLYNOMIAL SELECTION FOR THE GENERAL NUMBER FIELD SIEVE

RESOLUTIONS OF THREE-ROWED SKEW- AND ALMOST SKEW-SHAPES IN CHARACTERISTIC ZERO

Notes on duality in second order and -order cone otimization E. D. Andersen Λ, C. Roos y, and T. Terlaky z Aril 6, 000 Abstract Recently, the so-calle

GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS

Preconditioning techniques for Newton s method for the incompressible Navier Stokes equations

On a class of Rellich inequalities

1-way quantum finite automata: strengths, weaknesses and generalizations

Sharp gradient estimate and spectral rigidity for p-laplacian

Transcription:

Otimal Aroximate Polytoe Membershi Sunil Arya Deartment of Comuter Science and Engineering The Hong Kong University of Science and Technology Clear Water Bay, Kowloon, Hong Kong arya@cse.ust.hk Abstract David M. Mount Deartment of Comuter Science and Institute for Advanced Comuter Studies University of Maryland College Park, Maryland 074 mount@cs.umd.edu In the olytoe membershi roblem, a convex olytoe K in R d is given, and the objective is to rerocess K into a data structure so that, given a query oint q R d, it is ossible to determine efficiently whether q K. We consider this roblem in an aroximate setting and assume that d is a constant. Given an aroximation arameter ε > 0, the query can be answered either way if the distance from q to K s boundary is at most ε times K s diameter. Previous solutions to the roblem were on the form of a sace-time tradeoff, where logarithmic query time demands O(1/ε d 1 ) storage, whereas storage O(1/ε (d 1)/ ) admits roughly O(1/ε (d 1)/8 ) query time. In this aer, we resent a data structure that achieves logarithmic query time with storage of only O(1/ε (d 1)/ ), which matches the worst-case lower bound on the comlexity of any ε- aroximating olytoe. Our data structure is based on a new technique, a hierarchy of ellisoids defined as aroximations to Macbeath regions. As an alication, we obtain major imrovements to aroximate Euclidean nearest neighbor searching. Notably, the storage needed to answer ε-aroximate nearest neighbor queries for a set of n oints in O(log n ε ) time is reduced to O(n/ε d/ ). This halves the exonent in the ε-deendency of the existing sace bound of roughly O(n/ε d ), which has stood for 15 years (Har- Peled, 001). Research suorted by the Research Grants Council of Hong Kong, China under roject number 61001. Research suorted by NSF grant CCF 1618866. 1 Introduction Guilherme D. da Fonseca Université Clermont Auvergne and LIMOS Clermont-Ferrand, France fonseca@isima.fr Convex olytoes are key structures in many areas of mathematics and comutation. In this aer, we consider a fundamental search roblem related to these objects. Let K denote a convex olytoe in R d, that is, the bounded intersection of n halfsaces. The olytoe membershi roblem is that of rerocessing K so that it is ossible to determine efficiently whether a given query oint q R d lies within K. Throughout, we assume that the dimension d is a fixed constant and that K is full dimensional. It follows from standard results in rojective duality that olytoe membershi is equivalent to answering halfsace emtiness queries for a set of n oints in R d. In dimension d 3, it is ossible to build a data structure of linear size that can answer such queries in logarithmic time [9,30]. In higher dimensions, however, the fastest exact data structures with near-linear sace have a query time of roughly O ( n 1 1/ d/ ) [41], which is unaccetably high for many alications. Polytoe membershi is a secial case of olytoe intersection queries [16, 7, 30]. Recently, Barba and Langerman [16] showed that for any fixed d, it is ossible to rerocess olytoes in R d so that given two such olytoes that have been translated and rotated, it can be determined whether they intersect each other in time that is logarithmic in their total combinatorial comlexity. The rerocessing time and sace are quite high, growing as the combinatorial comlexity of the olytoe (which can be as high as Θ(n d/ )) raised to the ower d/. 70 Coyright by SIAM

The lack of efficient exact solutions has motivated consideration of aroximate solutions. Let ε be a ositive real arameter, and let diam(k) denote K s diameter. Given a query oint q R d, an ε-aroximate olytoe membershi query returns a ositive result if q K, a negative result if the distance from q to its closest oint in K is greater than ε diam(k), and it may return either result otherwise. Polytoe membershi queries, both exact and aroximate, arise in many alication areas, such as linear-rogramming and ray-shooting queries [, 6, 40, 4, 43], nearest neighbor searching and the comutation of extreme oints [3, 8], collision detection [35], and machine learning [1]. Dudley [31] showed that, for any convex body K in R d, it is ossible to construct an ε-aroximating olytoe P with O(1/ε (d 1)/ ) facets. This bound is asymtotically tight in the worst case, even when K is a Euclidean ball. This construction imlies a (trivial) data structure for aroximate olytoe membershi roblem with sace and query time O(1/ε (d 1)/ ). Another simle solution arises from the aroximation roosed by Bentley et al. [17]. A d-dimensional grid with cells of size Θ(ε diam(k)) is created and for every column along the x d -axis, the two extreme x d values where the column intersects K are stored. Given a query oint q, it is easy to determine if q P in constant time (assuming a model of comutation that suorts the floor function). The storage required by the aroach is O(1/ε d 1 ). In [4], the authors resented a simle and ractical data structure for the aroximate olytoe membershi roblem, called SlitReduce. Given a arameter t, sace is subdivided hierarchically using a quadtree until each cell either (1) lies entirely inside K, () entirely outside K, or (3) intersects K s boundary and is locally aroximable by at most t halfsaces. In the latter case, the leaf node associated with such a cell stores such a set of hyerlanes. To answer a query, the quadtree is descended until arriving at the leaf node whose cell contains the query oint. If this node is not labeled as inside or outside, the query is answered by testing whether the query oint lies within all the halfsaces stored in the leaf node. In [4] it is shown that the quadtree height is O(log 1 ε ), and therefore the overall query time is O(log 1 ε + t). A more refined analysis is resented in [6], showing that the minimum storage of O(1/ε (d 1)/ ) is attained for query time t = Θ ( (log 1 ε )/ε(d 1)/8). Furthermore, a sace-time trade-off is resented that involves a iecewise linear function. Obtaining a tight analysis remains an oen roblem. A lower-bound roof shows that the storage requirement increases when the query time t dros down to roughly O(1/ε (d 1)/18 ) [4]. Furthermore, the data structure rovides no imrovement over the storage in [17] when the query time is olylogarithmic. While the SlitReduce data structure is both simle and ractical, the question of whether it is ossible to achieve query time O(log 1 ε ) with minimum storage O(1/ε (d 1)/ ) has remained oen. In this aer, we give an affirmative answer to this question. We abandon the quadtree-based aroach of [4] and [6] in favor of a data structure involving a hierarchy of ellisoids. These ellisoids are selected through a samling rocess that is insired by a classical structure from the theory of convexity, called Macbeath regions [39]. main result. Here is our Theorem 1.1. Given a convex olytoe K in R d and an aroximation arameter 0 < ε 1, there is a data structure that can answer ε-aroximate olytoe membershi queries with ( Query time: O log 1 ) ε ( and Sace: O 1 ε (d 1)/ ). Our focus is on the existence of this data structure. Prerocessing will be discussed in future work, but assuming that K is reresented as the intersection of h halfsaces, the construction described in Section 3.1 can be imlemented in time O(n + oly(1/ε)), with olynomial exonents deending on d. The rincial contribution of this aer is to show that through the use of a more shae-sensitive aroach, it is ossible to achieve dramatic imrovements over the sace requirements of the data structure. As evidence of the imortance of this result, we show that it can be alied to roduce significant imrovements in the efficiency of aroximate nearestneighbor searching in Euclidean sace. Aroximate nearest neighbor searching in saces of fixed dimension has been widely studied. Data structures with O(n) storage and query times no better than O(log n + 1/ε d 1 ) have been roosed by several authors [13, 18, 5, 3]. In subsequent aers, it was shown that query times could be reduced at the exense of greater storage [4, 8, 37, 44]. Har-Peled introduced the AVD (aroximate Voronoi diagram) data structure and showed that O(log n ε ) query time could be achieved using Õ(n/ε d ) sace [37]. (The notation Õ( ) conceals logarithmic factors.) Sace-time trade-offs were established for the AVD in a series of aers [3, 8, 9, 11]. At one end of the sectrum, it was shown that with O(n) storage, queries could be answered in time O(log n + 1/ε (d 1)/ ). At the other end, queries could be answered in time O(log n ε ) 71 Coyright by SIAM

with sace Õ(n/εd ). In [4], the authors resented a reduction from Euclidean aroximate nearest neighbor searching to olytoe membershi. They established significant imrovements to the best trade-offs throughout the middle of the sectrum, but the extremes were essentially unchanged [4, 6]. While the AVD is simle and ractical, in [11] lower bounds were resented that imly that significant imrovements at the extreme ends of the sectrum are not ossible in this model. Through the use of our new data structure for olytoe membershi, we achieve the following imroved trade-off. Theorem 1.. Given a set X of n oints in R d, an aroximation arameter 0 < ε 1, and m such that log 1 ε m 1/(εd/ log 1 ε ), there is a data structure that can answer Euclidean ε-aroximate nearest neighbor queries with ( ) 1 Query time: O log n + and m ε d/ Sace: O(nm). By setting m to its uer limit it is ossible to achieve logarithmic query time while roughly halving the exonent in the ε-deendency of the revious best bound, as exressed in the following corollary. Corollary 1.1. Given a set X of n oints in R d and an aroximation arameter 0 < ε 1, there is a data structure that can answer Euclidean ε-aroximate nearest neighbor queries with ( Query time: O log n ) ( n ) and Sace: O. ε ε d/ The rest of the aer is organized as follows. In the next section we resent definitions and reliminary results. In Section 3 we resent the data structure and analyze its erformance. Section 4 discusses the alication to aroximate nearest-neighbor searching. Geometric Preliminaries Throughout, we assume that K is resented as the intersection of halfsaces. Note however that our results are largely insensitive to the exact reresentation or the combinatorial comlexity of K. (The excetions are our remarks on the construction of the data structure and choice of hyerlane witnesses to non-membershi). For this reason, we will often refer to K simly as a convex body. It will be convenient to define the aroximation error in absolute terms. Given a query oint q R d, an absolute ε-aroximate olytoe membershi query returns a ositive result if q K, a negative result if the distance from q to its closest oint in K is greater than ε, and it may return either result otherwise. We may assume throughout that d 4, since olytoe membershi queries (which may be alied to the Dudley aroximation) can be answered exactly in logarithmic time for d 3 [30]..1 Canonical Position and Ray Shooting. Let K denote the boundary of K. Let O denote the origin of R d, and for x R d and r 0, let B r (x) denote the Euclidean ball of radius r centered at x. Given a arameter 0 < γ 1, we say that a convex body K is γ-fat if there exist concentric Euclidean balls B and B, such that B K B, and radius(b)/radius(b ) γ. We say that K is fat if it is γ-fat for a constant γ (ossibly deending on d, but not on ε). Let B 0 denote a ball of radius r 0 = 1/ centered at the origin. For 0 < γ 1, let γb 0 denote the concentric ball of radius γr 0 = γ/. We say that a convex body K is in γ-canonical form if its boundary is nested between γb 0 and B 0 (see Figure 1(a)). A body in γ-canonical form is γ-fat, and diam(k) [γ, 1]. We will refer to oint O as the center of K. The next lemma shows that, u to constant factors, the roblem of answering relative ε-aroximate olytoe membershi queries can be reduced to the roblem of answering absolute (ε/d)-aroximate queries with resect to a convex body in (1/d)-canonical form. The roof follows from a combination of John s Theorem [38] and Lemma 3.1 of Agarwal et al. [1]. (Also, see Lemma.1 of the arxiv version of [7].) Lemma.1. Let K R d be a convex body. There exists a non-singular affine transformation T such that T (K) is in (1/d)-canonical form. Further, if q is a oint at distance greater than ε diam(k) from K, then T (q) is at distance greater than ε/d from T (K). In light of this result, we may assume henceforth that K is resented in γ-canonical form, for any constant γ (deending on dimension), and that ε has been aroriately scaled. (This scaling will affect the constant factors hidden in our asymtotic bounds.) Henceforth, we focus on the roblem of answering absolute ε- aroximate olytoe membershi queries with resect to K. Our query algorithm solves a slightly more general roblem, which will be exloited later in Section 4. Given a convex body in γ-canonical form and any oint q R d \ {O}, consider the (infinite) ray with origin at O and assing through q, which we denote as Oq. An ε-aroximate ray shooting query returns a oint that lies on this ray and is not internal to K but lies within 7 Coyright by SIAM

h Downloaded 08/5/18 to 18.8.17.130. Redistribution subject to SIAM license or coyright; see htt://www.siam.org/journals/ojsa.h K γ O (a) 1 K O (b) Figure 1: (a) γ-canonical form, (b) ε-aroximate ray-shooting query, (c) witness. distance ε of K 1 (see Figure 1(b)). Given the answer to such a ray-shooting query, we can answer aroximate membershi queries for a query oint q by alying the query to the ray Oq and testing whether q lies on the ortion of the ray between O and. If so, then (by convexity and the fact that O is interior to K) q lies within distance ε of K. If not, q does not lie within K. In Section 3 we will show the following. Lemma.. Given an arbitrary constant γ, a convex olytoe K in R d that is in γ-canonical form, and an aroximation arameter 0 < ε 1, there is a data structure that can answer ε-aroximate ray-shooting queries in O(log 1 ε ) time and O(1/ε(d 1)/ ) sace. Theorem 1.1 follows directly from Lemmas.1 and.. Our ray-shooting algorithm satisfies the additional roerty that, when K is given as the intersection of halfsaces, the reorted oint lies on the bounding hyerlane h of one of these halfsaces (see Figure 1(c)). The query returns not only but h as well. As such, if q is reorted to lie outside of K, then h serves as a witness to q s non-membershi. This fact will be exloited in Section 4.. Cas and Macbeath Regions. Much of the material in this section has been resented in [7]. We include it here for the sake of comleteness. Given a convex body K, a ca C is defined to be the nonemty intersection of the convex body K with a 1 In light of Lemma.1, aroximate ray-shooting queries also can be defined for an arbitrary convex body. The ray s origin is chosen to be the center of the John ellisoid and the distance to the oint is relative to K s diameter. In general the ray s central oint may be located at any oint in K s interior with the roerty that K s boundary is sandwiched between two uniformly scaled coies of an ellisoid, both centered at this oint. As in Lemma.1, the value of ε needs to be adjusted based on the scale factor. q K halfsace (see Figure (a)). Let h denote the hyerlane bounding this halfsace. We define the base of C to be h K. The aex of C is any oint in the ca such that the suorting hyerlane of K at this oint is arallel to h. The width of C, denoted width(c), is the distance between h and this suorting hyerlane. Given any ca C of width w and a real ρ 0, we define its ρ-exansion, denoted C ρ, to be the ca of K cut by a hyerlane arallel to and at distance ρw from this suorting hyerlane. (Note that C ρ = K, if ρw exceeds the width of K along the defining direction.) An easy consequence of convexity is that, for ρ 1, C ρ is a subset of the region obtained by scaling C by a factor of ρ about its aex. This imlies the following lemma. Lemma.3. Let K R d be a convex body and ρ 1. For any ca C of K, vol(c ρ ) ρ d vol(c). Given a oint x K and real arameter λ 0, the Macbeath region M λ (x) (also called an M-region) is defined as: O (c) M λ (x) = x + λ((k x) (x K)). It is easy to see that M 1 (x) is the intersection of K and the reflection of K around x (see Figure (b)), and so M 1 (x) is centrally symmetric about x. M λ (x) is a scaled coy of M 1 (x) by the factor λ about x. We refer to x as the center of M λ (x) and to λ as its scaling factor. As a convenience, we define M(x) = M 1 (x) and M (x) = M 1/5 (x). We refer to the latter as the shrunken Macbeath region. Macbeath regions have found numerous uses in the theory of convex sets and the geometry of numbers (see Bárány [15] for an excellent survey). They have also been alied to a growing number of results in the field of comutational geometry, articularly to construct lower bounds for range searching [10, 14, 19] q 73 Coyright by SIAM

aex C h C width w w (a) base K x M(x) M (x) Figure : (a) Ca concets and (b) Macbeath regions. and to bound the comlexity of an ε-aroximating olytoe [5, 7]. Given any oint x K, we define a minimal ca C(x) to be the ca with minimum volume that contains x. Clearly, the base of the minimal ca must ass through x. In fact, a standard variational argument imlies x is the centroid of the base (otherwise, we could decrease the ca volume by an infinitesimal rotation of the base about x [36]). If the minimal ca is not unique, the notation C(x) will refer to any one of these cas fixed arbitrarily. Define v(x) = vol(c(x)) and width(x) = width(c(x)). It will be convenient to use C ρ (x) to refer to the ρ-exansion of C(x), that is, C ρ (x) = (C(x)) ρ. We now resent two lemmas that encasulate key roerties of Macbeath regions, which will be useful in the develoment of our data structure. The first lemma shows that if two shrunken Macbeath regions have a nonemty intersection, then a constant factor exansion of one contains the other [19, 36]. Since the statement we need is slightly different from that roved in earlier aers, we give a roof in the aendix. Lemma.4. Let K be a convex body, and let λ 1/5 be any real. If x, y K such that M λ (x) M λ (y), then M λ (y) M 4λ (x). The next lemma shows that the minimal ca associated with a oint is contained within a suitable constant factor exansion of the associated Macbeath region. It is a straightforward adatation of a lemma roved by Ewald, Larman and Rogers [36] (see roof of Lemma 4 in [36]). Lemma.5. Let K R d be a convex body in γ- canonical form, and let 0 = 1 (γ /(4d)) d be a constant. If x is a oint in K that lies within distance 0 of K, then C(x) M 3d (x). The following lemma is an immediate consequence of the definition of Macbeath region. (b) x K Lemma.6. Let K be a convex body and λ > 0. If x is a oint in a ca C of K, then M λ (x) K C 1+λ. Furthermore, if λ 1, then M λ (x) C 1+λ. The next lemma is useful in situations when we know that a shrunken Macbeath region artially overlas a ca of K. It allows us to conclude that a constant factor exansion of the ca will fully contain the Macbeath region. The roof aears in [7]. Lemma.7. Let K be a convex body. Let C be a ca of K and x be a oint in K such that C M (x). Then M (x) C..3 Relating Distances and Widths. In this section we resent a number of geometric results demonstrating the relationshi between three notions of the distance from a oint lying within a convex body to body s boundary. Throughout, let K be a convex body in γ-canonical form where γ is a constant and let x K. Recall that width(x) is the width of x s minimum ca. Define δ(x) to be the minimum distance from x to any oint on K. For the sake of ray-shooting queries, we define a ray-based notion of distance as well. Given x K, consider the intersection oint of K and the ray emanating from O and assing through x. Define x s ray-distance, denoted ray(x), to be x (see Figure 3). K M(x) O δ(x) x ray(x) C(x) width(x) Figure 3: Relating δ(x), width(x), and ray(x). First we relate ray(x) and δ(x). The lower bound 74 Coyright by SIAM

on ray(x) is trivial and the uer bound follows by a straightforward adatation of Lemma 4. of [7]. Lemma.8. Let K be a convex body in γ-canonical form. For any oint x K, δ(x) ray(x) δ(x)/γ. Next, let us relate width(x) and δ(x). Clearly, width(x) δ(x). In Lemma.10, we show that close to the boundary, width(x) cannot exceed δ(x) by more than a constant factor. Its roof is based standard roerties of Macbeath regions and the following lemma. Lemma.9. Let K be a convex body in γ-canonical form. Let C 1 and C be two cas of K such that C 1 C. Then width(c 1 ) width(c )/γ. Proof. We consider two cases deending on whether the origin O is inside C 1 or not. First, if O C 1, then O C. Since K contains the ball B γ/ (O), it follows that width(c ) γ/. Since K is contained within the ball B 1/ (O), we have width(c 1 ) 1. Thus, width(c 1 ) width(c )/γ. Otherwise, we have O / C 1. Consider the segment joining O to t, where t is the aex of C 1. Let x denote the oint of intersection of this segment with the base of C 1. Clearly, width(c 1 ) ray(x). By Lemma.8, ray(x) δ(x)/γ. Thus, width(c 1 ) δ(x)/γ. Also, since x C, we have δ(x) width(c ). Thus, width(c 1 ) width(c )/γ, comleting the roof. Lemma.10. Let K R d be a convex body in γ-canonical form, and let 0 be the constant of Lemma.5. If x is a oint in K such that δ(x) 0, then width(x) (/γ)(3d + 1)δ(x). Proof. Let t denote the oint on K that is closest to x. Consider the suorting hyerlane of K at t that is orthogonal to segment xt. Consider the halfsace bounded by this hyerlane which does not contain K in its interior. Translate this halfsace such that the bounding hyerlane asses through x. Let C denote the ca formed by intersecting this halfsace with K. Note that the width of ca C is δ(x). By Lemma.6, M 3d (x) K C 3d+1. Since δ(x) 0, it follows from Lemma.5 that C(x) M 3d (x). By definition, C(x) K, so we have C(x) M 3d (x) K C 3d+1. By Lemma.9, it follows that width(x) = width(c(x)) γ width(c3d+1 ) = (3d + 1)δ(x), γ as desired. The following lemma, illustrated in Figure 4, will be useful to analyze the ray shooting erformed by our data structure. h t h y t y C r O width(c) 0 Figure 4: Statement of Lemma.11. Lemma.11. Let K be a convex body in γ-canonical form, and let 0 be the constant of Lemma.5. Let C be a ca of width at most 0 defined by a hyerlane h, and let y be any oint in C. Let t be C s aex, and let h t be the hyerlane arallel to h that asses through t. Letting y denote the intersection of line Oy and h t, we have yy width(c)/γ. Proof. Given that y C, yy is maximized when y lies on C h, and so let us assume this. Since K is in γ-canonical form, it is nested between two balls of radii r = γ/ and R = 1/ centered at O. Let r denote the erendicular distance from O to h t. Clearly, h t is a suorting hyerlane of K, and so r r. By definition of 0 and since γ 1, we have 0 γ/4 = r/. Let R = Oy. Since y K, R R. Letting w = width(c), by similar triangles we have R /(r w) = yy /w. Therefore, yy = as desired. R r w w R r w 0 R r/ w = width(c), γ R r (r/) w Finally, we establish a monotonicity relationshi between δ(x) and ray(x) that holds close to the boundary. For any δ > 0, define the δ-erosion of K, denoted K(δ), to be the closed convex body formed by removing from K all oints lying within distance δ of K. We can define K(δ) equivalently as follows. Let H denote the set of suorting halfsaces of K, so that K = H H H. Letting H(δ) denote the set of halfsaces obtained by translating each halfsace of H towards O by δ, we have K(δ) = H H(δ) H. Recalling that Bγ/ (O) K, the next lemma follows from elementary geometry. 75 Coyright by SIAM

Lemma.1. Let K be a convex body in γ-canonical form. The following hold: (a) if δ < γ/, then O K(δ). (b) Consider any ray emanating from O. Let x and y denote the oints of intersection of this ray with the boundaries of K(γ/) and K, resectively. As oint moves along this ray from x to y, δ() decreases strictly monotonically..4 Further Proerties of Macbeath Regions. Finally, we identify some useful novel roerties of Macbeath regions. The first lemma is a useful utility. Lemma.14 shows that all the oints in a shrunken Macbeath region have similar distances from the boundary of K, and Lemma.15 shows that the minimal cas associated with these oints have similar volumes. Lemma.13. Let K be a convex body. If x K and x M (x), then x M 1/4 (x ). Proof. Recalling that M (x) = M 1/5 (x), it follows that there exist oints 1, K such that x = x + 1 5 ( 1 x) and x = x + 1 5 (x ). After simle algebraic maniulations, the first equation is equivalent to (.1) x = x + 1 4 (x 1 ). Letting 3 = 3 + 1 3 x, the second equation is equivalent to (.) x = x + 1 ( 4 3 + 1 ) 3 x x = x + 1 4 ( 3 x ). As 3 is a convex combination of and x, we have 3 K. Eq. (.1) shows that x x + (1/4)(x K), and Eq. (.) shows that x x + (1/4)(K x ). Thus x M 1/4 (x ). Lemma.14. Let K be a convex body. If x K and x M (x), then 4δ(x)/5 δ(x ) 4δ(x)/3. Proof. To rove the lower bound on δ(x ), let z denote the oint of K that is closest to x, and let h be a suorting hyerlane assing through z (see Figure 5). Let l denote the (erendicular) distance from x to h, and let h be the translate of h by distance 4l/5 towards x. Because M(x) lies entirely within the halfsace bounded by h that contains the origin, it follows that M (x) lies entirely within the corresonding halfsace bounded by h. This imlies that δ(x ) 4l/5. Clearly, δ(x) l, and hence δ(x ) 4l/5 4δ(x)/5. 4l/5 M (x) Figure 5: Proof of Lemma.14. To rove the uer bound on δ(x ) observe that, by Lemma.13, x M 1/4 (x ). A symmetrical argument to the above shows that δ(x) 3δ(x )/4, as desired. Recall that C(x) is the ca of minimum volume that contains x and v(x) = vol(c(x)). Lemma.15. Let K R d be a convex body. If x K and x M (x), then d v(x) v(x ) v(x)/ d. Proof. By Lemma.6, M (x) C 6/5 (x). Therefore, x C 6/5 (x), imlying that the minimum volume ca containing x has volume at most vol(c 6/5 (x)). By Lemma.3, vol(c 6/5 (x)) (6/5) d vol(c(x)). Thus v(x ) vol(c 6/5 (x)) x l x z h h ( ) d 6 v(x) d v(x), 5 which roves the first inequality. To rove the second inequality observe that, by Lemma.13, x M 1/4 (x ). Arguing as in the roof of the first inequality, we obtain v(x) d v(x ), as desired. 3 The Data Structure Recall that we are given a convex olytoe K R d in γ- canonical form, where γ is a constant, and our objective is to construct a data structure that can answer ε- aroximate ray-shooting queries. Our aroach is to comute a series of nested rings within K, each of which surrounds the origin. Each ring is the union of a collection of aroriately scaled Macbeath regions such that any ray shot from the origin hits at least one Macbeath region from each ring (see Figure 6). The rings extend outwards towards the boundary of K. To simlify query rocessing, we will relace each Macbeath region with a containing ellisoid whose volume is larger by at most a constant factor. With each successive level these Macbeath ellisoids define successively better aroximations to K, with the last ring forming an ε-aroximation to K. These rings naturally define a layered DAG structure whose nodes corresond to Macbeath ellisoids. A Macbeath ellisoid at level i is the child of a Macbeath 76 Coyright by SIAM

Figure 6: Illustration of two levels of the data structure. ellisoid at level i 1 if there is a ray from the origin that intersects both of them. (It will in fact hit the ellisoid at level i 1 before the one at level i.) We will show that each ellisoid has a constant number of children, and that the overall deth of this DAG is O(log 1 ε ). To define the structure more formally, let 0 be the constant of Lemma.5, and for i 0 define i = 0 / i. The levels are indexed from 0 to l, where l is the smallest integer such that l γ ε/(8(3d + 1)). Since γ is a constant, l = O(log 1 ε ). Recall that K(δ) denotes the body that results by eroding K by distance δ, and let λ 0 = 1/(0 d) be a constant. By Lemma.1(a), K( 0 ) contains the origin O and K( i ) K( i+1 ). The nodes at level i of our data structure corresond to a maximal set of disjoint Macbeath regions M λ0 (x) whose centers x lie on the boundary of the eroded body K( i ). For any node u, let x u denote the center of the associated Macbeath region M λ0 (x u ). Define the associated Macbeath ellisoid, denoted E(x u ), to be the circumscribing John ellisoid of M 4λ0 (x u ). (Since M λ0 (x u ) is centrally symmetric about x u, E(x u ) will be centered about this oint.) We will show that the union of the Macbeath ellisoids at level i cover K( i ), imlying that any ray emanating from the origin must intersect at least one ellisoid of each level. As mentioned above, given nodes u and v from levels i and i + 1, resectively, v is a child of u if there exists a ray emanating from the origin that intersects both E(x u ) and E(x v ). We can root the DAG by creating a secial node whose children are all the nodes of level zero. In order to roduce a witness for aroximate ray-shooting queries, we associate each leaf node with a constant number of suorting hyerlanes of K that locally aroximate the boundary of K near the leaf s Macbeath ellisoid. (This will be discussed in detail in Section 3.1). Given a ray Oq, the query algorithm descends the DAG by starting at the root and visiting any node at level zero that intersects the ray. Letting u denote the current node, we next visit any child of u whose associated ellisoid intersects the ray. (Such a child must exist.) Uon reaching the leaf level we intersect Oq with all of its associated suorting hyerlanes and return the intersection oint that is closest to O as the answer to the query (along with the identity of the hyerlane containing ). In the subsections below, we resent a formal analysis of the structure and its roerties. In Section 3.1 we sketch its construction. In Section 3. we show that each node has O(1) children. In Section 3.3, we show that the total storage required is O(1/ε (d 1)/ ). Finally, in Section 3.4 we show that the query algorithm is correct and has query time O(log 1 ε ). 3.1 Construction. Since our focus is on the existential roerties of the data structure, we will discuss its construction only at a high level. We are given the convex body K and aroximation arameter ε. Due to the aroximate nature of the queries, most of the stes can be imlemented aroximately subject to a suitable adjustment of the constant factors. The construction begins by converting K into canonical form as described in Lemma.1. Next, for 0 i l, the eroded bodies K( i ) are comuted. Recalling the constant λ 0 earlier, for each body K( i ) we greedily comute a maximal set of oints X i on its boundary such that the Macbeath regions M λ0 (x) for x X i are airwise disjoint. For each oint x X i, we construct the associated Macbeath region M 4λ0 (x) and the associated Macbeath ellisoid E(x). We also create a node for this oint at level i of the DAG. Finally, for each air of nodes at consecutive levels of the DAG, we determine whether there exists a ray emanating from the origin that intersects both of their associated Macbeath ellisoids. If so, we create a arent-child link between them. We create a secial root node, which we connect to all the nodes of level zero. This defines the layered DAG structure. Next, let us consider the assignment of suorting hyerlanes to the leaves of the data structure. Let u be a leaf node, and let E(x u ) denote the associated Macbeath ellisoid with center oint x u (see Figure 7). Let C(x u ) denote the corresonding minimum volume ca. Let t be the aex of this ca, and let h t denote the hyerlane (which is a suorting hyerlane of K) assing through t and arallel to the base of the ca. In Lemma 3.7, we will show that h t can serve as the desired witness, but in some alications it is desirable that the witness be chosen from K s bounding hyerlanes. By Carathéodory s theorem [34], there is a set of at most d of K s bounding halfsaces whose intersection defines an unbounded simlex that contains K, and this simlex is contained within the halfsace bounded by h t containing K (shaded in blue in Figure 7). The leaf 77 Coyright by SIAM

node u stores this set of hyerlanes, which we denote by H u. t C(x u ) x u H u E(x u ) Figure 7: A leaf node in the data structure. 3. Bounding the Out-degree. In this section we show that each node has O(1) children. Intuitively, this involves showing that the set of rays emanating from the origin that ass through a Macbeath ellisoid for a oint on the boundary of K( i ) can intersect at most a constant number of Macbeath ellisoids for oints on the boundary K( i+1 ). This is because the oints x defining the nodes of each level have disjoint Macbeath regions M λ0 (x), which ermits us to emloy a acking argument. For any oint x K, recall that v(x) denotes the volume of the minimal ca C(x). Our first lemma considers how v(x) changes as the oint x moves towards the boundary of K along a ray emanating from O. The lemma shows that if the distance to the boundary, δ(x), decreases by at most a constant factor, then v(x) decreases by no more than some constant factor. Lemma 3.1. Let K R d be a convex body in γ- canonical form. Let y be a oint on the ray Ox, such that ray(y) ray(x). If δ(y) δ(x)/α for any α 1, then v(y) (γ/α) d v(x). Proof. If C(y) contains O then, by convexity, it would follow that x C(y). This would imly that v(y) = vol(c(y)) v(x), which would rove the lemma. We may assume therefore that C(y) does not contain O. w x w y y O x t K C(y) C (x) Figure 8: Proof of Lemma 3.1. Let h y denote the hyerlane assing through the base of C(y), and let t denote the aex of C(y). h t h t h y h x M (x) x y x O y M (y) Figure 9: Proof of Lemma 3.. C 4 (x) Let h t and h x denote the hyerlanes arallel to h y assing through t and x, resectively. Note that h t is a suorting hyerlane of K. Let C (x) denote the (not necessarily minimal) ca with aex t, whose base lies on h x. Let w y and w x denote the widths of the cas C(y) and C (x), resectively. Clearly, C (x) is a (w x /w y )-exansion of the ca C(y), and so by Lemma.3, vol(c (x)) (w x /w y ) d vol(c(y)). Thus (3.3) v(x) vol(c (x)) ( wx w y ) d v(y). Next we show that w y is not much smaller than w x. Let and denote the oints of intersection of the ray Ox with K and h t, resectively. Using elementary geometry and the facts that ray(y) δ(y) and ray(x) δ(x)/γ (Lemma.8), we obtain w x = x w y y ray(x) ray(y) = ray(x) + ray(y) + δ(x)/γ δ(y) α γ. Substituting this bound in Equation 3.3, we obtain v(x) (α/γ) d v(y), which comletes the roof. The following lemma relates the Macbeath regions associated with a node and any of its children. Lemma 3.. Let K R d be a convex body in γ- canonical form for some constant γ, and let 0 be the constant of Lemma.5. Let x be a oint within distance at most 0 of the boundary of K. Consider the generalized cone formed by rays emanating from the center O of K and intersecting M (x). Consider any Macbeath region M (y) that overlas this cone where δ(y) = δ(x)/. Then (a) M (y) C 4 (x), and (b) There exists a constant c (deending on d and γ) such that vol(m(y)) v(x)/c. Proof. We claim that M (y) overlas C (x). By Lemma.7, this will imly that M (y) C 4 (x) and 78 Coyright by SIAM

so will establish (a). To see the claim, consider any ray that emanates from O and intersects both M (x) and M (y). Let x and y be any two oints on this ray that are contained in M (x) and M (y), resectively (see Figure 9). Alying Lemma.14 to oints x and x, we obtain δ(x ) 4δ(x)/5. Alying the same lemma to oints y and y, we obtain δ(y ) 4δ(y)/3. Recalling that δ(y) = δ(x)/ and utting this all together, we obtain δ(y ) 4 3 δ(y) = 4 3 δ(x) 4 3 1 5 4 δ(x ) < δ(x ). Alying Lemma.14 to oints x and x, we have δ(x ) 4δ(x)/3 4 0 /3. Substituting the value of 0, it is easy to verify that δ(x ) < γ/. Since δ(y ) < δ(x ), we can now aly Lemma.1(b) to conclude that ray(y ) < ray(x ). In other words, y occurs after x along the ray emanating from O. Also, by Lemma.6, we have M (x) C 6/5 (x) C (x). Therefore, x C (x), and so y C (x). Thus, we have shown that M (y) intersects C (x), which roves (a). Next we rove (b). Alying Lemma.14 to oints y and y, we obtain δ(y ) 4δ(y)/5. Recalling that δ(x ) 4δ(x)/3, we have δ(y ) 4 5 δ(y) = 4 5 δ(x) 4 5 1 3 4 δ(x ) 1 4 δ(x ). Alying Lemma 3.1 to x and y, we obtain v(y ) (γ/4) d v(x ). Alying Lemma.15 to x and x, we have v(x ) v(x)/ d. Analogously, we have v(y) v(y )/ d. Also, since δ(y) = δ(x)/ 0 / 0, the recondition of Lemma.5 is satisfied for oint y. Alying Lemma.5, it follows that C(y) M 3d (y). Thus vol(m(y)) vol(c(y)) (3d) d = v(y) (3d) d. Putting it all together, we obtain vol(m(y)) v(y) (3d) d 1 (3d) d 1 d v(y ) 1 (3d) d 1 ( γ ) d d v(x ) 4 1 (3d) d 1 ( γ ) d d 1 4 d v(x) ( γ ) d v(x). 48d This yields vol(m(y)) v(x)/c for any constant c (48d/γ) d, which roves (b). The revious lemma imlies the following. Lemma 3.3. Let K R d be a convex body, and let 0 be the constant of Lemma.5. Also, let λ 1/5 be any constant. Let x K such that δ(x) 0. Consider the generalized cone formed by rays emanating from the center O of K and intersecting M (x). Let Y denote any set of oints y such that δ(y) = δ(x)/ and the set of Macbeath regions M λ (y) are disjoint. Let Y Y denote the set of oints y such that M (y) overlas the aforementioned cone. Then Y = O(1). Proof. Let y denote any oint of Y. Alying Lemma 3., it follows that (a) M (y) C 4 (x), and (b) vol(m(y)) v(x)/c, for a suitable constant c. Since λ 1/5, it follows that M λ (y) is contained in C 4 (x). By Lemma.3, the volume of C 4 (x) is at most 4 d v(x) = O(v(x)) and the volume of M λ (y) is λ d vol(m(y)) λ d v(x)/c = Ω(v(x)). Since the Macbeath regions M λ (y) for y Y are disjoint, by a straightforward acking argument, it follows that Y = O(1). We are now ready to show that the number of children of any non-root node u in our data structure is O(1). (We will analyze the number of children of the root node later. See the remarks following Lemma 3.5.) Consider any node u at level i 0. Recall that E(x u ) denotes the associated Macbeath ellisoid, which encloses M 4λ0 (x u ). The children of u are those nodes v at level i + 1 whose ellisoid E(x v ) intersects the generalized cone formed by rays emanating from the origin that intersect E(x u ). The child condition is exressed in terms of Macbeath ellisoids (for the sake of efficient query rocessing), but the above lemma is stated in terms of Macbeath regions. Since x u K( i ), we have δ(x u ) = i 0. Macbeath regions are centrally symmetric, and the constant in John s Theorem [38] is d for centrally symmetric bodies. Recalling that λ 0 = 1/(0 d) we have (3.4) M 4λ0 (x u ) E(x u ) M 4λ0 d (x u ) = M (x u ). Thus, the generalized cone of rays that intersect M (x u ) includes all the rays used to define the children of x u. The oints x v that form level i + 1 of the structure lie on K( i+1 ) and thus satisfy δ(x v ) = δ(x u )/. Since by our construction they have disjoint Macbeath regions M λ0 (x v ), they constitute a set Y as described in the reconditions of Lemma 3.3. Each child v of u corresonds to a oint x v such that the ellisoid E(x v ) intersects the generalized cone. Reasoning as we did above for x u, we have E(x v ) M (x v ). Therefore, the oints x v associated with the children of u constitute a 79 Coyright by SIAM

subset of the set Y given in the lemma. Therefore, the number of children of x u is O(1), as desired. 3.3 Storage Sace. In this section, we show that the total number of nodes in the data structure is O(1/ε (d 1)/ ). Since each node has O(1) children, it will follow that the total storage is also O(1/ε (d 1)/ ). Recall the constants 0 and λ 0 = 1/(0 d) defined earlier. The number of nodes at level i is bounded above by the cardinality of a maximal set of disjoint Macbeath regions M λ0 (x), such that the centers x lie on the boundary of K( i ), where i = 0 / i. Our analysis will make use of the following lemma, which is a straightforward adatation of Lemma 3., which is roved in the arxiv version of [7]. Lemma 3.4. Let K R d be a convex body in γ- canonical form. Let 0 < λ 1/5 be any fixed constant and let γ/1 be a real arameter. Let C be a set of cas, whose widths lie between and, such that the Macbeath regions M λ (x) centered at the centroids x of the bases of these cas are disjoint. Then C = O(1/ (d 1)/ ). We aly this to bound the number of Macbeath regions that define the nodes of each layer. Lemma 3.5. Let K R d be a convex body in γ- canonical form for some constant γ. Let 0 be the constant of Lemma.5 and 0 < λ 1/5 be any fixed constant. Let 0 be a real arameter. Let M be a set of disjoint Macbeath regions, each of which has scaling factor λ and whose centers lie on the boundary of K( ). Then M = O(1/ (d 1)/ ). Proof. Let X denote the set of center oints of M. By Lemma.10, for any x X, width(x) is between and (/γ)(3d + 1). We can artition X (and by extension M) into O(1) grous such that the oints in any grou have same width to within a factor of two. Let X denote one of these grous, and let its associated widths be between w and w. Since 0, we have w (/γ)(3d + 1) 0. Under our assumtion that d 3, it is easy to verify that the latter quantity does not exceed γ/1. Thus, the set of cas C(x) for the oints of this grou satisfy the recondition of Lemma 3.4. Alying this lemma yields X = O(1/w (d 1)/ ). Summing over all the grous, it follows that the total size of X (and hence the number of regions in M) is O(1/ (d 1)/ ). By Lemma 3.5, the number of nodes at level i is O(1/ (d 1)/ i ) = O(( i / 0 ) (d 1)/ ). Recall that 0 deends only on d and γ, and both d and γ are constants. It follows that the number of nodes at level zero is O(1). (This bounds the out-degree of the root node, as alluded to in Section 3..) Also, observe that the number of nodes grows geometrically with each level. Therefore, the total number of nodes is dominated by the number of leaves. The leaves are located at level l, where l is Ω(ε). Therefore, the number of leaves, and hence the total number of nodes, is O(1/ε (d 1)/ ). 3.4 Query Processing. Finally, let us resent the query algorithm for answering ε-aroximate ray-shooting queries. Let Oq denote the query ray. As mentioned earlier, the query algorithm descends the layered DAG structure, visiting a node u at each level such that the associated Macbeath ellisoid E(x u ) intersects the query ray, until arriving at the leaf level. In order to show that this is well defined, it is necessary to demonstrate that such a node exists at each level of the data structure. Since all the eroded bodies K( i ) contain the origin, it suffices to show that the union of the Macbeath ellisoids associated with the nodes of level i cover the boundary of K( i ). This is established by the following lemma. Lemma 3.6. For any 0, let X denote a maximal set of oints lying on the boundary of the eroded body K( ) such that the associated Macbeath regions M λ0 (x) are airwise disjoint. Then the collection of Macbeath ellisoids {E(x) x X} covers K( ). Proof. Consider any oint x K( ). Because X is maximal, there must exist x X such that M λ0 (x) has a nonemty intersection with M λ0 (x ). By Lemma.4, M λ0 (x ) M 4λ0 (x). Recalling that M 4λ0 (x) E(x), it follows that x E(x). Since i 0 for each level i of the data structure, it follows from the above lemma that the query rocedure will succeed in finding a suitable child for each node visited until it reaches the leaf level. Since each node has a constant number of children, it takes O(l) = O(log 1 ε ) time to erform this descent. Recall from Section 3.1 that each leaf node u stores a set H u of at most d suorting hyerlanes of K whose intersection defines an unbounded simlex that contains K (see Figure 10(a)). The query algorithm comutes the intersection of the query ray with each of these hyerlanes and returns the closest intersection oint to the origin. The following lemma establishes the correctness of the query rocessing. Lemma 3.7. Given a query ray Oq, the oint returned by the query rocedure is a valid answer to the ε-aroximate ray-shooting query, and it lies on a suorting hyerlane of K. 80 Coyright by SIAM

H u Downloaded 08/5/18 to 18.8.17.130. Redistribution subject to SIAM license or coyright; see htt://www.siam.org/journals/ojsa.h t x u (a) Proof. Observe that lies at the intersection of the query ray and a suorting hyerlane of K. Clearly, is not internal to K, so all that remains is to show that lies within distance ε of K. Recall that a leaf node u satisfies δ(x u ) γ ε/(8(3d + 1)), and therefore by Lemma.10, width(x u ) γε/4. Since the search rocedure arrived at node u, the ray Oq intersects E(x u ). By Eq. (3.4) and Lemma.6, E(x u ) M (x u ) C 6/5 (x u ) C (x u ). Let t denote the aex of C (x u ), and let h t denote the hyerlane assing through t that is arallel to the base of this ca (see Figure 10(b)). By construction, the intersection of the halfsaces H u associated with u lies within the halfsace bounded by h t that contains K. Let y be any oint in E(x u ) Oq, and let y denote the intersection of the ray Oq and h t. By Lemma.11, y y width(c (x u ))/γ = 4 width(x u )/γ ε. Therefore y lies within distance ε of K, imlying that does as well. Summarizing the results of this section, we have shown that, given a convex olytoe K in γ-canonical form, where γ is a constant, and given ε > 0, there exists a data structure that uses O(1/ε (d 1)/ ) sace and answers ε-aroximate ray-shooting queries in time O(log 1 ε ). This establishes Lemma., and Theorem 1.1 follows immediately. The following lemma justifies our assertion that these bounds are asymtotically otimal. Lemma 3.8. For all sufficiently small ε > 0, any data structure for answering ε-aroximate olytoe membershi queries in R d requires Ω(1/ε (d 1)/ ) bits of storage, and if the data structure oerates in the decision tree model, the query time is Ω(log 1 ε ) in the worst case. Proof. Consider a Euclidean ball of unit diameter in R d, and let be any oint on the boundary of this ball. For any 0 < ε < 1, it follows from a simle K εγ/4 y C (x u ) Figure 10: Query rocessing for a leaf node. y t x u (b) alication of the Pythagorean Theorem that a ca of width ε whose aex is at has diameter at most c ε, for some constant c deending only on d. By a simle acking argument there exists a set P of oints of size Ω((1/ ε) d 1 ) = Ω(1/ε (d 1)/ ) on the boundary of the ball such that the ε-width cas centered at these oints are airwise disjoint. For any two distinct subsets P and P of P, consider a oint that lies in one subset, say P, but not in the other. It is easy to see that for the query oint, the answer to the ε-aroximate membershi query at q is yes for P and no for P. Therefore, the two data structures for these subsets must differ. It follows that there are P distinct data structures needed to reresent the various subsets of P. By an information-theoretic argument, such a data structure requires Ω(1/ε (d 1)/ ) bits in the worst case. Assuming that queries are answered in the decision-tree model, such a structure requires deth Ω(log 1 ε ). 4 Aroximate Nearest-Neighbor Searching In this section we resent a reduction from aroximate Euclidean nearest-neighbor searching to aroximate olytoe membershi, or more accurately, to aroximate ray-shooting. The reduction is based on the aroximate Voronoi diagram (AVD) construction from [11]. The AVD for an n-element oint set X emloys a height-balanced variant of a quadtree, a balanced box decomosition (BBD) tree [1] to be recise. Each leaf cell Q of the tree stores a set R X of reresentative oints, which have the roerty that for any query oint q Q, at least one of these reresentatives is an ε-nearest neighbor of q. We will emloy a version of this data structure where the total number of reresentatives over all the nodes is O(n log 1 ε ). In the data structure of [11] a query is answered by locating the leaf cell that contains the query oint in O(log n) time, and then selecting the nearest reresentative from this cell to the query (by simle brute force). Later in [4] it was shown that queries can be K h t 81 Coyright by SIAM

answered more efficiently by relacing the brute-force search with an aroach based on using aroximate olytoe membershi queries. (This will be discussed below.) These membershi queries were alied within the context of a binary search in order to simulate aroximate ray shooting. In light of Lemma., we can forgo the binary search, which saves a factor of O(log 1 ε ) in the query time. The aroximate ray shooting queries used in [4] were of a different nature than those resented here. First, the rays are vertical (arallel to one of the coordinate axes). Second, the hyerlanes near the ortion of the olytoe s boundary where the ray might hit are not too sharly sloed with resect to the query ray. (More formally, for any ε-aroximating convex olytoe P of K, the angle between the vertical ray and the normal vector of the hyerlane of P hit by this ray is bounded away from π/ by a constant.) We refer to this as vertical sloe-restricted aroximate ray shooting. The first art of the following result is roved in [4], and the sloe-restricted variant follows directly by eliminating the binary search. Lemma 4.1. Let 0 < ε 1/ be a real arameter and X be a set of n oints in R d. Given a data structure for aroximate olytoe membershi in d-dimensional sace with query time at most t d (ε) and storage s d (ε), it is ossible to rerocess X into an ε-aroximate nearest neighbor data structure with Query time: O ( ) log n + t d+1 (ε) log 1 ε and ( Sace: O n log 1 ε + n s ) d+1(ε). t d+1 (ε) If vertical sloe-restricted aroximate ray shooting queries are suorted, then the query time is O(log n + t d+1 (ε)). Observe that the sace bound varies inversely with the query-time bound. While the query time resented here is O(log 1/ε), we can artificially generate higher query times by emloying brute-force search. Indeed, this lemma exloits the fact that when brute-force search is used on subsets of size at most t d+1 (ε), the data structure need only be constructed for subsets of size at least t d+1 (ε), of which there are at most O(n/t d+1 (ε)). 4.1 Lifting Transformation. In order to adat Lemma 4.1 to our context, we will need to understand a bit more about how it works. It is based on a wellknown transformation that mas a oint in R d to R d+1 by rojecting it vertically onto a araboloid. More formally, we can embed a oint = (x 1,..., x d ) in R d into R d+1 by adding an additional (d+1)st coordinate whose value is zero. Let us visualize the (d + 1)st coordinate axis as being directed vertically uwards. Let Ψ denote the araboloid x d+1 = d i=1 x i. Given a oint Rd, the lifting transformation rojects R d vertically to a oint lying on Ψ. Define h() to be the hyerlane tangent to Ψ at, that is, h() = { (x 1,..., x d+1 ) x d+1 = d i x i }. i=1 For any q R d, let q denote the oint of intersection between h() and a vertical ray shot uwards from q. Letting q denote the Euclidean distance between oints and q, it is easily verified that q q = q. (See [4] for details.) Given a finite oint set R in R d, let E(R) denote the uer enveloe of the hyerlanes h() for each R (shaded in Figure 11(a)). A vertical line through any oint q R d intersects a facet of E(R). (If the line intersects the boundary between multile facets, we select one facet arbitrarily.) It follows directly that the nearest neighbor in R of any query oint q is the oint R whose associated hyerlane h() is hit by the vertical line segment assing through q. That is, nearest neighbor queries in R d can be reduced to vertical rayshooting queries against E(R) in R d+1 [, 33]. While this alies to exact nearest neighbors, it is shown in [4] that the ε-aroximate closest reresentative in R can be determined by simulating vertical ray shooting against a suitable aroximation to E(R). In articular, after a normalizing transformation, it can be assumed that the cell Q is centered at the origin, and both Q and the oints of R all lie within some constant distance of the origin. The choice of this constant is arbitrary (deending ossibly on d but not on ε), and it only affects the constant factors in the query time. We will assume henceforth that this constant is chosen to be 1/. E(R) is unbounded, and it will be necessary to define a bounded olytoe that contains the relevant ortion of E(R). Because the distance between any oint q R d and its closest reresentative in R is at most one, it follows that for the sake of answering nearest neighbor queries, the relevant ortion of E(R) lies within the region bounded by two horizontal hyerlanes 1 x d+1 +1. For reasons that will be aarent later, it will be convenient to define this bounded region to be a frustum. Let f be the d- dimensional hyercube on the hyerlane x d+1 = 1 satisfying 1/ x i 1/, for 1 i d, and let f + be the d-dimensional hyercube on the hyerlane x d+1 = +1 satisfying 5/6 x i 5/6, for 1 i d (see Figure 11(b)). Let F denote the frustum defined 8 Coyright by SIAM