Geometric Inference on Kernel Density Estimates

Size: px
Start display at page:

Download "Geometric Inference on Kernel Density Estimates"

Transcription

1 1 Geometric Inference on Kernel Density Estimates Jeff M. Phillis University of Utah Bei Wang University of Utah Yan Zheng University of Utah 3 Abstract We show that geometric inference of a oint cloud can be calculated by examining its kernel density estimate. This intermediate ste results in the inference being statically robust to noise and allows for large comutational gains and scalability (e.g. on 100 million oints). In articular, by first creating a coreset for the kernel density estimate, the data reresenting the final geometric and toological structure has size deending only on the error tolerance, not on the size of the original oint set or the comlexity of the structure. To achieve this result, we study how to relace distance to a measure, as studied by Chazal, Cohen- Steiner, and Mérigot, with the kernel distance. The kernel distance is monotonic with the kernel density estimate (sublevel sets of the kernel distance are suerlevel sets of the kernel density estimate), thus allowing us to examine the kernel density estimate in this manner. We show it has several comutational and stability advantages. Moreover, we rovide an algorithm to estimate its toology using weighted Vietoris-Ris comlexes.

2 Introduction Geometric inference [9] is the task of recovering the toological and geometric features of an object using only a finite samled oint cloud. Secifically the goal is for the inference from the oint cloud to reserve homeomorhism [, 7], homotoy tye [17, 18, 3, 50, 1], or homology [17, 50, 1] of the underlying object. The field has rogressively considered inference on more and more general objects, including smooth manifolds [50, 51, 0] and comact subsets of Euclidean sace [1]. While the underlying object varies, the aroaches have generally relied on combinatorial structures on the oint clouds, such as α-comlexes [30] and Vietoris-Ris comlexes [1]. The α-comlex is tyically associated with the offsets of oint clouds (i.e. as union of balls of an aroriate radius centered at the oint samle). In this aer we ask: what geometric inference guarantees can be made from kernel density estimates? These well-studied statistical objects [57, 54, 5, 6] create a smooth intermediate reresentation that is robust to noise and satial variation, and thanks to recent comutational advances [43, 5, 65, 64] are easily constructed very efficiently at enormous scales with formal aroximation guarantees. We consider the following aroach for rocessing a oint set drawn from an unknown object. 1. Create a kernel density estimate of the oint set, this aroximates the true underlying measure.. Aroximate the ersistence diagram of the sublevel sets filtration of kernel distance using weighted Vietoris-Ris comlexes. 3. Examine the shae and toological roerties of the suerlevel sets of the kernel density estimates. To analyze this rocess, we take a few detours from what might be suggested. First, instead of taking the classical view of an α-comlex as a union of balls, we exlore a newer interretation as the sublevel sets of a distance function to the object of study. These distance function based techniques [9] have been used with success in estimating the toology and geometry of the underlying object with certain data models [47, 19, 1, 11, 13, 48, 14]. However, when noise is resent, the reconstruction task is much more difficult as a single stray oint can drastically alter the underlying toology, therefore decreasing the stability of the distance function. Recently Chazal, Cohen-Steiner, and Merigot [15] have introduced distance to a measure, which is a distance-like function that is robust to erturbations and noise on the data, and can be aroximated efficiently [39, 5]. We build on the technology develoed in this line of work in order to understand and construct suerlevel sets of kernel density estimates. Second, instead of linearly varying the threshold of suerlevel sets on the kernel density estimate, we consider the sublevel sets of the kernel distance, which is roughly the squareroot of a constant minus the kernel density estimate. In addition to allowing us to analyze the kernel density estimates, this kernel distance [36, 58, 43, 53], offers many robustness guarantees and comutational advantages that are on ar and occasionally advantageous comared to those of the distance to a measure. This is not the first work exloring geometric inference and toological roerties of suerlevel sets of kernel density estimates. An recent manuscrit [4] offers a statistical ersective on this roblem, and exlores, among other things, how the samle size of the inut data set affects the robustness of the toological roerties such as ersistence diagrams. By examining this roblem through the kernel distance, our bounds on the same questions are considerably smaller and require far fewer arameters Background on Kernels, Kernel Density Estimates, and Kernel Distance A kernel is a non-negative similarity measure K : R d R d R + ; more similar oints have higher value. For any fixed R d, a kernel K(, ) can be normalized to be a robability distribution; that is x R K(, x)dx = 1. For the uroses of this article we focus on the Gaussian kernel defined as d 1

3 K(, x) = σ ex( x /σ ). 1 A kernel density estimate [57, 54, 5, 6] is a way to estimate a continuous distribution function over R d for a finite oint set P R d. Secifically, KDE P (x) = 1 K(, x). P The kernel distance [59, 41, 36, 61, 9, 8, 37, 43, 53] (also called current distance or maximum mean discreancy) is a metric [49, 58] between two oint sets P, Q (as long as the kernel used is characteristic [58], a slight restriction of being ositive definite [3, 4, 4, 63], this includes the Gaussian and Lalace kernels). Define a similarity between the two oint sets as κ(p, Q) = 1 1 K(, q). P Q P P q Q Then the kernel distance between two oint sets is defined as D K (P, Q) = κ(p, P ) + κ(q, Q) κ(p, Q). When oint set Q is a single oint x, then κ(p, x) = KDE P (x). Kernel density estimates can aly to any measure µ (on R d ) as KDE µ (x) = R K(, x)µ()d. The d similarity between two measures is κ(µ, ν) = R q R K(, q)µ()ν(q)ddq, and then the kernel dis- d d tance between two measures µ and ν is still a metric, defined as D K (µ, ν) = κ(µ, µ) + κ(ν, ν) κ(µ, ν). When the measure ν is a unit Dirac mass at x (ν(q) = 1 when q = x and ν(q) = 0 otherwise), then κ(µ, x) = KDE µ (x). Given a finite oint set P R d, we can work with emirical measure µ P defined as µ P = 1 P P δ, where δ is the Dirac measure on, and D K (µ P, µ Q ) = D K (P, Q). If K is ositive definite, it is said to have the reroducing roerty [4, 3, 4, 63]. This imlies that K(, x) is an inner roduct in some reroducing kernel Hilbert sace (RKHS) H K. Secifically, there is a lifting ma φ : R d H K so that K(, x) = φ(), φ(x) HK, and moreover the entire set P can be reresented as Φ(P ) = P φ(), which is a single element of H K and has a norm Φ(P ) HK = κ(p, P ). A single oint x R d also has a norm φ(x) HK = K(x, x) in this sace and we can also write κ(p, x) = KDE P (x) = Φ(P ) φ(x) H K. 1. Geometric Inference and Distance to a Measure: A Review Given an unknown object (e.g. a comact set) S R d and a finite oint cloud P R d that comes from 81 S under some rocess, geometric inference aims to recover toological and geometric roerties of S from 8 P. The offset-based (and more generally, the distance function-based) aroach for geometric inference 83 reconstructs a geometric and toological aroximation of S by offsets from P [11, 1, 13, 14, 15, 17, 18, 84 0]. 85 Given a comact set S R d, we can define a distance function f S to S; a common examle is f S (x) = inf y S x y. The offsets of S are the sublevel sets of f S, denoted (S) r = f 1 S ([0, r]). Now an arox- 87 imation of S by another comact set P R d (e.g. a finite oint cloud) can be quantified by the Hausdorff 88 distance d H (S, P ) := f S f P = inf x R d f S (x) f P (x) of their distance functions. The intuition 89 behind the inference of toology is that if d H (S, P ) is small, thus f S and f P are close, and subsequently, S, (S) r and (P ) r 90 carry the same toology for an aroriate scale r. In other words, to comare the toology of offsets (S) r and (P ) r 91, we require Hausdorff stability with resect to their distance functions f S and f P. 9 An examle of an offset-based toological inference result is formally stated as follows (as a articular ver- 93 sion of the reconstruction Theorem 4.6 in [1]), where the reach of a comact set S, reach(s), is defined as 94 the minimum distance between S and its medial axis. 1 K(, x) is normalized so that K(x, x) = 1 for σ = 1. The choice of coefficient σ is not the standard normalization, but it is erfectly valid as it scales everything by a constant. It has the roerty that σ K(, x) x / for x small.

4 Theorem 1.1 (Reconstruction from f P [1, 15]). Let S, P R d be comact sets such that reach(s) > R and ε := d H (S, P ) R/7. Then S and (P ) r are homotoy equivalent (and even isotoic) if 4ε r R 3ε. Here R ensures that the toological roerties of S and (S) r are the same, and the ε arameter ensures (S) r and (P ) r are close. Tyically ε is tied to the density with which a oint cloud P is samled from S. In addition to the Hausdorff stability roerty stated above, as exlained in [15], there are a few more secific roerties of f S that enable it to be useful for geometric inference [1, 14, 47, 48]: (F1) f S is 1-Lischitz: For all x, y R d, f S (x) f S (y) x y. (F) fs is 1-semiconcave: The ma x Rd (f S (x)) x is concave. As described in [15], here (F1) ensures that f S is differentiable almost everywhere and the medial axis of S has zero d-volume; and (F) is a crucial technical tool, e.g. in roving the existence of the flow of the gradient of the distance function for toological inference [1]. Distance to a measure. Given a robability measure µ on R d and a arameter m 0 > 0 smaller than the total mass of µ, the distance to a measure d ccm : R n R + [15] is defined for any oint x R d as ( 1 m0 1/ d ccm (x) = (δ µ,m (x)) dm), where δ µ,m (x) = inf { r > 0 : µ( m B r (x)) m }, 0 m=0 and where B r (x) is a ball of radius r centered at x and B r (x) is its closure. It has been shown in [15] that d ccm is a distance-like function that has the following roerties: (D1) The function x d ccm (x) is 1-Lischitz. (D) The function x (d ccm (x)) x is concave. (D3) [Stability] For robability measures µ and ν on R d and m 0 > 0, then d ccm d ccm ν,m 0 1 m0 W (µ, ν). 114 ( 1/ Here W is the Wasserstein distance W (µ, ν) = inf π Π(µ,ν) R d R x y dπ(x, y)) between two d 115 measures, where dπ(x, y) measures the amount of mass transferred from location x to location y and π 116 Π(µ, ν) is a transference lan [6]. Proerty (D3) imlies that in addition to d ccm being suitable for known 117 reconstruction theorems, it is also stable with resect to the Wasserstein distance on µ. 118 Furthermore, for the reconstruction theorem [1] (Theorem 1.1) to hold, the authors require a distance- 119 like version ([15], Proosition 4.) of the Isotoy Lemma ([38], Proosition 1.8), which adds an extra 10 condition that d ccm has to be roer, in the sense that, for every sequence { i } in R d that escaes to 11 infinity, {d ccm (x)} escaes to infinity in R + ([46], Proosition.17). 1 On a oint set P, one can algorithmically construct a toological estimation of d ccm µ P,m 0, e.g. aroximate 13 the ersistence diagram [3] of the sublevel sets filtration of d ccm µ P,m 0. In the case of d ccm µ P,m 0, the sublevel sets 14 can be aroximated by union of balls, and their toology can be estimated using tools from comutational 15 toology such as weighted alha-shaes [39] and weighted Ris comlexes [6] Our Results Here we show how to erform geometric inference on a kernel density estimate of a oint set P. We accomlish this by showing that a similar set of roerties hold for the kernel distance with resect to a measure µ, d K µ : R d R + (in lace of distance to a measure d ccm ), defined as d K µ (x) = D K (µ, x) = κ(µ, µ) + κ(x, x) κ(µ, x). This treats x as a robability measure reresented by a Dirac mass at x. Secifically, the following roerties of d K µ allow it to inherit the reconstruction roerties of d ccm. 3

5 (K1) d K µ is 1-Lischitz. (K) (d K µ ) is 1-semiconcave: the ma x (d K µ (x)) x is concave. (K3) d K µ is roer. (K4) [Stability] If µ and ν are two measures on R d, then d K µ d K ν D K (µ, ν). In addition, we rovide constructive toological estimation results associated with d K µ. We aroximate the ersistence diagram of the sublevel sets filtration of d K µ using weighted Ris comlexes, following ower distance machinery introduced in [6]. We also describe further advantages of the kernel distance. (i) It has a small coreset reresentation, which allows for sarse reresentation and efficient, scalable comutation. In articular, an ε-kernel samle [43, 5, 65] Q of µ is a finite oint set whose size only deends on ε > 0 and such that max x R d KDE µ (x) KDE µq (x) = max x R d κ(µ, x) κ(µ Q, x) ε. (ii) It is Lischitz with resect to the outlier arameter σ when the inut x is fixed. (iii) Its inference is easily interretable through the suerlevel sets of a kernel density estimate. (iv) As σ tends to for any two robability measures µ, ν, the kernel distance is bounded by the Wasserstein distance: lim σ D K (µ, ν) W (µ, ν) Proerties of the Kernel Distance Recall we use the σ -normalized Gaussian kernel K σ (, x) = σ ex( x /σ ). For ease of exosition, unless otherwise noted, we will assume σ is fixed and write K instead of K σ. Yet, the choice of σ is imortant. It is a smoothing arameter and controls the variance of the kernel; it determines the amount of noise or outliers tolerated by d K µ. We will address its significance in Section 4. and AendixA.1 in more detail Semiconcave Proerty for d K µ Lemma.1 (K). (d K µ ) is 1-semiconcave: the ma x (d K µ (x)) x is concave. Proof. This follows from the second derivative of T (x) = (d K µ (x)) x in any direction always being non-ositive. We can rewrite T (x) = κ(µ, µ) + κ(x, x) κ(µ, x) x = κ(µ, µ) + κ(x, x) (K(, x) + x )µ()d. R d Note that both κ(µ, µ) and κ(x, x) are absolute constants, so we can ignore them in the second derivative. Furthermore, by setting t(, x) = K(, x) x, the second derivative of T (x) is non-ositive if the second derivative of t(, x) is non-ositive for all, x R d. First note that the second derivative of x is a constant in every direction. The second derivative of K(, x) is symmetric about, so we can consider the second derivative along any vector u = x, d ( ) ) u t(, x) = du σ 1 ex ( u σ. This exression reaches its maximum value at u = x = 0 where it is exactly 0; this follows since ex( u /σ ) is always ositive and u /σ is always non-negative. We also note in Aendix B.3 that semiconcavity follows trivially in the RKHS H K. 4

6 Lischitz roerty for d K µ. We generalize a (folklore [15]) relation between semiconcave and Lischitz functions, and (re)rove it in Aendix B for comleteness. Then, using Lemma.1, roerty (K1) follows. Lemma.. Function g(x) such that (g(x)) is l-semiconcave, for l 1, then g(x) must be l-lischitz. Lemma.3 (K1). d K µ is 1-Lischitz on its inut Proerness of d K µ For the Isotoy Lemma to hold, we show that d K µ is roer, when its range is restricted to be less than c µ := κ(µ, µ) + κ(x, x). Here, the value of c µ deends on µ not on x since κ(x, x) = K(x, x) = σ. Lemma.4 (K3). d K µ is roer. Proof. We begin by first reviewing several equivalent definitions of a roer ma. Definition (i): A continuous ma f : X Y between two toological saces is roer if and only if the inverse image of every comact subset in Y is comact in X ([45], age 84; [46], age 45). Suose both X and Y are toological manifolds, an alternative characterization of roerness exists in terms of divergent sequences, that is, for Definition (ii): a continuous ma f : X Y between two toological manifolds is roer if and only if for every sequence { i } in X that escaes to infinity, {f( i )} escaes to infinity in Y ([46], Proosition.17). Here, for a toological sace X, a sequence { i } in X escae to infinity if for every comact set G X, there are at most finitely many values of i for which i G ([46], age 46). Based on the later definition, for examle, d ccm is roer in the sense that it tends to infinity as x goes to infinity. To rove that d K µ is roer, we rove the following two claims: (a) A continuous function f : R d [0, c) (where c is a constant) is roer, if for all sequence {x i } in R d that escaes to infinity, the sequence {f(x i )} tends to c; (b) Let f := d K µ and one needs to show that for all sequence {x i } that escaes to infinity, the sequence {f(x i )} tends to c µ ; or equivalently, κ(µ, x i ) tends to 0. We rove claim (a) by roving its contraositive. If a continuous function f : R d [0, c) is not roer, then there exists a sequence {x i } in R d that escaes to infinity, such that the sequence {f(x i )} does not tend to c. Suose f is not roer, this imlies that there exists a constant b < c such that f 1 [0, b] is not comact (based on roerness definition (i)) and therefore either not closed or unbounded. We first show that A := f 1 [0, b] is closed. We make use of the following theorem ([44], age 88, Theorem 10 ): A maing f of a toological sace X into a toological sace Y is continuous if and only if the re-image f 1 (F ) of every closed set F Y is closed in X. Since f is continuous, it imlies that the re-image of every closed set [a, b] R is closed in R d. Therefore, A is closed, therefore it must be unbounded. Since every unbounded sequence contains a monotone subsequence that has either + or as a limit, therefore A contains a subsequence S := {x i } that tends to an infinite limit. In addition, as elements in S escaes to infinity, {f(x i )} tends to b and does not tend to c. Therefore (a) holds by contraosition. To rove claim (b), we need to show that for all sequence {x i } that escaes to infinity, κ(µ, x i ) tends to 0. For each x i, define a radius r i = x i 0 / and define a ball B i that is centered at the origine 0 and has radius r i. As x i goes to infinity, r i increases until for any fixed arbitrary ε > 0, we have B i µ()d 1 ε/σ and thus R d \B i µ()d ε/σ. Furthermore, let i = arg min B x i, so x i i = r i. Thus also as x i goes to infinity, r i increases until for any ε > 0 we have K( i, x i ) ε/. We now decomose κ(µ, x i ) = B i K(, x i )µ()d + q R d \B i K(q, x i )µ(q)dq. Thus for any ε > 0, as x i goes to infinity, the first term is at most ε/ since all K(, x i ) K( i, x i ) ε/ and the second term is at most ε/ since K(q, x) σ and q R d \B i µ(q)dq ε/σ. Since these results hold for all ε, as x i goes to infinity and ε goes to 0, κ(µ, x i ) goes to 0. Combine (a) with (b) and the fact that d K µ is a continuous (in fact, Lischitz) function, we obtained the roerness result. 5

7 By the definition of roerness, Lemma.4 imlies that it is a closed ma and its levelset at any value a [0, c µ ) is comact. This also means that the sublevel set of d K µ (for ranges [0, a) [0, c µ )) is comact. Since the levelset (sublevel set) of d K µ corresonds to the levelset (suerlevel set) of KDE µ, we have the following corollary. Corollary.1. The suerlevel sets of KDE µ for all ranges whose lower bound a > 0 are comact. The result in [31] imlies that the above statement is true for µ being a uniform measure over a finite oint set. The above corollary is a more general statement Reconstruction and Toological Estimation using Kernel Distance 3.1 Homotoy Equivalent Reconstruction using d K µ We have shown that the kernel distance function d K µ is a distance-like function in the sense defined in [15]: it is a non-negative function which is 1-Lischitz and roer (with restriction on the range), whose square is 1-semiconcave. Therefore the reconstruction theory for a distance-like function [15] (which is an extension of results for comact sets [1]) holds in the setting of d K µ. Most imortantly we state the following two corollaries for comleteness, whose roofs follow trivially from the roofs of Proosition 4. and Theorem 4.6 in [15]. Before their formal statement, we need some notation adated from [15] to make these statements recise. Let φ : R d R + be a distance-like function. A oint x R d is an α-critical oint if φ (x + h) φ (x) + α h φ(x) + h with α [0, 1], h R d. Let (φ) r = {x R d φ(x) r} denote the sublevel set of φ, and let (φ) [r 1,r ] = {x R d r 1 φ(x) r } denote all oints at levels in the range [r 1, r ]. For α [0, 1], the α-reach of φ is the maximum r such that (φ) r has no α-critical oint, denoted as reach α (φ). When α = 1, reach 1 coincides with reach introduced in [35]. 5 6 Theorem 3.1 (Isotoy lemma on d K µ ). Let r 1 < r be two ositive numbers such that d K µ oints in (d K µ ) [r 1,r ]. Then all the sublevel sets (d K µ ) r are isotoic for r [r 1, r ]. has no critical Theorem 3. (Reconstruction on d K µ ). Let d K µ and d K ν be two kernel distance functions such that d K µ d K ν ε. Suose reach α (d K µ ) R for some α > 0. Then r [4ε/α, R 3ε], and η (0, R), the sublevel sets (d K µ ) η and (d K ν ) r are homotoy equivalent for ε R/(5 + 4/α ) Power Distance using d K µ A ower distance using d K µ for a measure µ is defined with a oint set P R d and a metric d(, ) on R d, f P (µ, x) = min P ( d(, x) + d K µ () ). We consider the instance fp K that uses kernel distance d(, x) := D K(, x). Many details are left to Aendix D. Given a measure µ, let + = arg max q R d κ(µ, q), and let P + R d be a oint set that contains +. 1 We can show (Theorem D. and Theorem D.1) that d K µ (x) fp K + (µ, x) 14d K µ (x). However, constructing + exactly seems quite difficult. Now consider an emirical measure µ P defined by a oint set P. We show (in Theorem D.5 in Aendix D.3) how to construct a oint ˆ + (that aroximates + ) such that D K (P, ˆ + ) (1 + δ)d K (P, + ) for any δ > 0. The median concentration Λ P is a radius such that no oint P has more than half of the oints 6

8 of P within Λ P. The runtime is olynomial in n and 1/δ assuming the sread of P is bounded, and for the choice of σ that σ/λ P and d are absolute constants. Then we consider ˆP + = P {ˆ + }, where ˆ + is found with δ = 1/ in the above construction. Then we can rovide the following multilicative bound, roven in Theorem D.3. The lower bound holds indeendent of the choice of P as shown in Theorem D.1. Theorem 3.3. For any oint set P R d and oint x R d, with emirical measure µ P defined by P, then 1 d K µ P (x) f KˆP+ (µ P, x) 8d K µ P (x) Constructive Toological Estimation using d K µ 47 In order to erform constructive toological estimation using the kernel distance d K µ, one needs to be able to 48 comute quantities related to its sublevel sets, in articular, to comute the ersistence diagram of the sub- 49 level sets filtration of d K µ. Now we describe such tools needed for the kernel distance based on machinery 50 recently develoed by Buchet et al. [6], which shows how to aroximate the ersistent homology of dis- 51 tance to a measure for any metric sace via a ower distance construction. Then using similar constructions, 5 we can use the weighted Ris filtration to aroximate the ersistence diagram of the kernel distance. 53 To state our results, first we require some technical notions and assume basic knowledge on ersistent 54 homology (see [3, 33] for a readable background). Given a metric sace X with the distance d X (, ), a 55 set P X and a function w : P R, the (general) ower distance f associated with (P, w) is defined as f(x) = 56 min P (d X (, x) + w() ). Now given the set (P, w) and its corresonding ower distance 57 f, one could use the weighted Ris filtration to aroximate the ersistence diagram of w. Consider the 58 sublevel set of f, f 1 ((, α]). It is the union of balls centered at oints P with radius r (α) = 59 α w() for each. The weighted Cech comlex C α (P, w) for arameter α is the union of simlices s such that 60 s B(, r (α)) 0. The weighted Ris comlex R α (P, w) for arameter α is the maximal 61 comlex whose 1-skeleton is the same as C α (P, w). The corresonding weighted Ris filtration is denoted 6 as {R α (P, w)}. 63 Setting w := d K µ P and given a oint set ˆP+ described in Section 3., we consider the weighted Ris 64 filtration {R α ( ˆP +, d K µ )} based on the kernel version of the ower distance, f KˆP+. We can now state the 65 following corollary of Theorem 3.3 (roofs detailed in Aendix D.4). Here, the ersistence diagrams are 66 viewed on a logarithmic scale, that is, we change coordinates of oints following the maing (x, y) 67 (ln x, ln y). d ln B denotes the corresonding bottleneck distance between ersistence diagrams Corollary 3.1. The weighted Ris filtration {R α ( ˆP +, d K µ P )} can be used to aroximate the ersistence diagram of d K µ P such that d ln B (Dgm(dK µ P ), Dgm({R α ( ˆP +, d K µ P )})) ln(16). Proof. To rove that two ersistence diagrams are close, one could rove that their filtration are interleaved [10], that is, two filtrations {U α } and {V α } are ε-interleaved if for any α, U α V α+ε U α+ε. First, Lemmas D.1 and D.13 rove that the ersistence diagrams Dgm(d K µ P ) and Dgm({R α ( ˆP +, d K µ P )})) are well-defined. Second, the results of Theorem 3.3 imlies an 8 multilicative interleaving. Therefore for any α R, (d K µ P ) 1 ((, α]) (f KˆP+ ) 1 ((, α) (d K µ P ) 1 ((, 8 α]). On a logarithmic scale (by taking the natural log of both sides), such interleaving becomes addictive, ln d K µ P ln f KˆP+ ln d K µ P + ln 8. 7

9 76 Theorem 4 of [16] imlies d ln B(Dgm(d K µ P ), Dgm(f KˆP+ )) ln(8) In addition, by the Persistent Nerve Lemma ([1], Theorem 6 of [55], an extension of the Nerve Theorem [40]), the sublevel sets filtration of d K µ, which corresond to unions of balls of increasing radius, has the same ersistent homology as the nerve filtration of these balls (which, by definition, is the Cech filtration). Finally, there exists a multilicative interleaving between weighted Ris and Cech comlexes (Proosition 31 of [16]), C α R α C α. We then obtain the following bounds on ersistence diagrams, d ln B(Dgm(f K P + ), Dgm({R α (P +, d K µ P )})) ln. 8 We use triangle inequality to obtain the final result: d ln B(Dgm(d K µ P ), Dgm({R α (P +, d K µ P )})) ln(16) Based on Corollary 3.1, we have an algorithm that aroximates the ersistent homology of the sublevel set filtration of d K µ by constructing the weighted Ris filtration corresonding to the kernel-based ower distance and comuting its ersistent homology. For memory efficient comutation, sarse (weighted) Ris filtrations could be adated by considering simlices on subsamles at each scale [56, 16], although some restrictions on the sace aly Stability Proerties for d K µ Lemma 4.1 (K4). For two measures µ and ν on R d we have d K µ d K ν D K (µ, ν). Proof. Since D K (, ) is a metric, then by triangle inequality, for any x R d we have D K (µ, x) D K (µ, ν) + D K (ν, x) and D K (ν, x) D K (ν, µ) + D K (µ, x). Therefore for any x R d we have D K (µ, x) D K (ν, x) D K (µ, ν), roving the claim. Note that both the Wasserstein and kernel distance are integral robability metrics [49, 58], so both (D3) and (K4) are interesting, but not easily comarable. In the next subsection we attemt to reconcile this Comaring D K to W Lemma 4.. There is no Lischitz constant γ such that for any two robability measures µ and ν we have W (µ, ν) γd K (µ, ν). 98 Proof. Consider two measures µ and ν which are identical, excet some ositive fractional amount τ of mass 99 from µ is moved to a distance n in ν from the origin. The Wasserstein distance requires a transortation lan that moves this τ mass to some art of µ with cost τ Ω(n) in W (µ, ν). On the other hand, D K (µ, ν) = κ(µ, µ) + κ(ν, ν) κ(µ, ν) σ + σ 0 = σ is bounded We conjecture for any two robability measures µ and ν that D K (µ, ν) W (µ, ν). This would show that d K µ is at least as stable as d ccm since a bound on W (µ, ν) would also bound D K (µ, ν), but not vice versa. Here we only show that this is true for a secialized case and as σ. To simlify notation, all integrals are assumed to be over the full domain R d. Lemma 4.3. Consider two robability measures µ and ν on R d where ν is reresented by a Dirac mass at a oint x R d. Then d K µ (x) = D K (µ, ν) W (µ, ν) for any σ > 0, where the equality only holds when µ is also a Dirac mass at x. 8

10 Proof. Since both W (µ, ν) and D K (µ, ν) are metrics and hence non-negative, we can instead consider their squared versions: (W (µ, ν)) = x µ()d and (D K (µ, ν)) = K(x, x) + = σ (1 + q q K(, q)µ()µ(q)dqd K(, x)µ()d ) q ex ( σ µ()µ(q)dqd Now use the bound 1 t e t 1 for t 0 to aroximate (D K (µ, ν)) σ (1 + = q (1)µ(q)dqµ()d (1 x µ()d = (W (µ, ν)). ) ) x ex ( σ µ()d. ) ) x σ µ()d The inequality becomes an equality only when x = 0 for all P, and since they are both metrics, this is the only location where they are both In Aendix C we show that even if ν is not a unit Dirac mass this at x, then this inequality hold in the limit 31 as σ goes to infinity. The technical work towards the next theorem is making recise how σ K(, x) x /, how on measures D K (µ, ν) acts like the difference between the means µ = 313 µ()d and 314 ν, and how the difference becomes smaller as σ goes to infinity Theorem 4.1. For any two robability measures µ and ν defined on R d µ ν W (µ, ν). Thus lim σ D K (µ, ν) W (µ, ν). 4. Stability with Resect to σ. lim D K(µ, ν) = µ ν and σ We now exlore the Lischitz roerties of d K µ with resect to the noise arameter σ. We argue that any distance function that is robust to noise needs some sort of arameter to address how many outliers to ignore or how far away a oint should be to be considered an outlier. For instance, this arameter in d ccm is m 0 which controls the amount of measure µ to be used in the distance. Here we show that d K µ has a articularly nice roerty, that it is Lischitz with resect to this choice of σ for any fixed x. Many details are deferred to Aendix B. The larger σ the more effect outliers have, and the smaller σ the less the data is smoothed and thus the closer the noise needs to be to the underlying object to effect the inference. The effect of changing σ is well-studied henomenon [6, 57, 31] and there is a suite of acceted heuristics for choosing the best value [60], although it is not clear they are intended for roblems in geometric inference. Lemma 4.4. Let h(σ, z) = ex( z /σ d 38 ). We can bound h(σ, z) 1, dσ h(σ, z) (/e)/σ and d 39 h(σ, z) (18/e 3 )/σ over any choice of z > 0. dσ Theorem 4.. For any x, d K µ (x) is l-lischitz with resect to σ, for l = 18/e 3 + 8/e + < 6. Proof. (Sketch) Define an indicator function 1 x () = 1 iff = x, and otherwise 0. It is now useful to define a function f x (σ) as ( ) q (µ()µ(q) f x (σ) = ex + 1x σ ()1 x (q) µ()1 x (q) ) dqd. q 9

11 333 So d K µ (x) = σ f x (σ) and we can also define F (σ) = (d K µ (x)) l σ = σ f x (σ) lσ Now to rove d K µ (x) is l-lischitz, we can show that (d K µ ) is l-semi-concave with resect to σ, and aly Lemma.. This boils down to showing the second derivative of F (σ) is always non-ositive. d d F (σ) = σ dσ dσ f x(σ) + 4σ d dσ f x(σ) + f x (σ) l. First we note that q (µ()µ(q) + µ()1 x(q) 1 x ()1 x (q))dqd since both µ and 1 x integrate to 1. Combining this with Lemma 4.4 using h(σ, q ), we can now bound each term as follows. d ( ) ( 18 1 F (σ) σ dσ e 3 σ 1 + 4σ e σ ) + ( 1 ) ( 18 e ) e + = 0. Lischitz in m 0 for d ccm. A Lischitz roerty is not known for d ccm with resect to m 0. Consider a µ P for oint set P R d where τ fraction of the oints P are close to x R d, and the rest far from x. When m 0 is above τ some mass is needed from the far art of P, causing a raid change in value with varying m 0. Thus the Lischitz roerty with resect to m 0 is related to P = max, P. 5 Algorithmic and Aroximation Observations Kernel coresets. The kernel distance is robust under random samles [43]. Secifically, if Q is a oint set randomly chosen from R d roortional to µ of size O((1/ε )(d + log(1/δ)) then d K µ d K Q ε with robability at least 1 δ. Thus if ε R/(5 + 4/α ), then with Theorem 3. any sublevel set (d K µ ) η is homotoy equivalent to (d K Q )r for r [4ε/α, R 3ε] and η (0, R). Furthermore, it is ossible to construct subsets Q with the same roerties but with even smaller size of Q = O(((1/ε) log(1/εδ)) d/(d+) ) [5]; in articular in R the required size is Q = O((1/ε) log(1/εδ)). The above three size bounds also ensure that KDE µ KDE Q ε, so Q is an ε-kernel samle (see Lemma B.). Exloiting the above constructions, recent work [65] builds a data structure to allow for efficient aroximate evaluations of KDE P where P = 100,000,000. This oens the door for reconstruction-based geometric inference which are robust to noise, on enormous datasets. Stability of ersistence diagrams. Furthermore, the stability results on ersistence diagrams [3] hold for kernel density estimates of µ and Q (where Q is a coreset of µ with the same size bounds as above); that is, d B (Dgm(KDE µ ), Dgm(KDE Q )) ε, where d B is the bottleneck distance between ersistence diagrams. Similar roerties are observed in [4], but with weaker and more comlicated bounds on Q. Geometric inference via suerlevel sets of kernel density estimates. A convenient roerty of d K µ ( ) is that it is monotonic with KDE µ ( ). That is, as d K µ (x) gets smaller, KDE µ (x) gets larger. Thus instead of doing geometric inference using sublevel sets of the kernel distance function d K µ, we can just examine the suerlevel sets of KDE µ. This rovides a very clean and natural interretation of the reconstruction roblem through the well-studied lens of kernel density estimates. Note that the difference between sublevel sets of d K µ and suerlevel sets of KDE µ which reresent the same region of R d do not have a linear relationshi. Recall d K µ (x) = (c µ ) KDE µ (x) where (c µ ) = σ + κ(µ, µ) is a constant that deends only on µ. 10

12 Acknowledgement The authors thank Don Sheehy, Frédéric Chazal and the rest of the Geometrica grou at INRIA-Saclay for enlightening discussions on geometric and toological reconstruction. We also thank Don Sheehy for ersonal communications regarding the ower distance constructions References [1] P. K. Agarwal, S. Har-Peled, H. Kalan, and M. Sharir. Union of random minkowski sums and network vulnerability analysis. In Proceedings of 9th Symosium on Comutational Geometry, 013. [] N. Amenta and M. Bern. Surface reconstruction by voronoi filtering. Discrete and Comutational Geometry, (4): , [3] N. Aronszajn. Theory of reroducing kernels. Transactions of the American Mathematical Society, 68: , [4] S. Balakrishnan, B. T. Fasy, F. Lecci, A. Rinaldo, A. Singh, and L. Wasserman. Statistical inference for ersistent homology. Technical reort, ArXiv: , March 013. [5] M. Buchet, F. Chazal, S. Y. Oudot, and D. R. Sheehy. Efficient and robust toological data analysis on metric saces. Technical reort, ArXiv: , 013. [6] M. Buchet, F. Chazal, S. Y. Oudot, and D. R. Sheehy. Efficient and robust toological data analysis on metric saces. arxiv: , 013. [7] G. Carlsson, G. Singh, and A. Zomorodian. Comuting multidimensional ersistence. Algorithms and Comutation: Lecture Notes in Comuter Science, 5878: , 009. [8] G. Carlsson and A. Zomorodian. The theory of multidimensional ersistence. Proc. 3rd Ann. Sym. Comutational Geometry, ages , 007. [9] F. Chazal and D. Cohen-Steiner. Geometric inference. Tessellations in the Sciences, 01. [10] F. Chazal, D. Cohen-Steiner, M. Glisse, L. J. Guibas, and S. Y. Oudot. Proximity of ersistence modules and their diagrams. In Proceedings 5th Annual Symosium on Comutational Geometry, ages 37 46, 009. [11] F. Chazal, D. Cohen-Steiner, and A. Lieutier. Normal cone aroximation and offset shae isotoy. Comutational Geometry: Theory and Alications, 4: , 009. [1] F. Chazal, D. Cohen-Steiner, and A. Lieutier. A samling theory for comact sets in Euclidean sace. Discrete Comutational Geometry, 41(3): , 009. [13] F. Chazal, D. Cohen-Steiner, A. Lieutier, and B. Thibert. Stability of curvature measures. Comuter Grahics Forum (Proceedings Symosium on Geometry Processing), ages , 009. [14] F. Chazal, D. Cohen-Steiner, and Q. Mérigot. Boundary measures for geometric inference. Foundations of Comutational Mathematics, 10():1 40, 010. [15] F. Chazal, D. Cohen-Steiner, and Q. Mérigot. Geometric inference for robability measures. Foundations of Comutational Mathematics, 11(6): ,

13 [16] F. Chazal, V. de Silva, M. Glisse, and S. Oudot. The structure and stability of ersistence modules. arxiv: , 013. [17] F. Chazal and A. Lieutier. Weak feature size and ersistent homology: comuting homology of solids in rn from noisy data samles. Proceedings 1st Annual Symosium on Comutational Geometry, ages 55 6, 005. [18] F. Chazal and A. Lieutier. Toology guaranteeing manifold reconstruction using distance function to noisy data. Proceedings nd Annual Symosium on Comutational Geometry, ages , 006. [19] F. Chazal and A. Lieutier. Stability and comutation of toological invariants of solids in R n. Discrete Comutational Geometry, 37(4): , 007. [0] F. Chazal and A. Lieutier. Smooth manifold reconstruction from noisy and non-uniform aroximation with guarantees. Comutational Geometry: Theory and Alications, 40(): , 008. [1] F. Chazal and S. Oudot. Towards ersistence-based reconstruction in euclidean saces. Proceedings 4th Annual Symosium on Comutational Geometry, ages 3 41, 008. [] S. Cohen. Finding color and shae atterns in images. Technical reort, Stanford University: CS-TR , [3] D. Cohen-Steiner, H. Edelsbrunner, and J. Harer. Stability of ersistence diagrams. Discrete and Comutational Geometry, 37:103 10, 007. [4] H. Daumé. From zero to reroducing kernel hilbert saces in twelve ages or less. htt://ub. hal3.name/daume04rkhs.s, 004. [5] L. Devroye and L. Györfi. Nonarametric Density Estimation: The L 1 View. Wiley, [6] L. Devroye and G. Lugosi. Combinatorial Methods in Density Estimation. Sringer-Verlag, 001. [7] T. K. Dey and S. Goswami. Provable surface reconstruction from noisy samles. Comutational Geometry, 35(1):14 141, 006. [8] S. Durrleman, X. Pennec, A. Trouvé, and N. Ayache. Measuring brain variability via sulcal lines registration: A diffeomorhic aroach. In 10th International Conference on Medical Image Comuting and Comuter Assisted Intervention, 007. [9] S. Durrleman, X. Pennec, A. Trouvé, and N. Ayache. Sarse aroximation of currents for statistics on curves and surfaces. In 11th International Conference on Medical Image Comuting and Comuter Assisted Intervention, 008. [30] H. Edelsbrunner. The union of balls and its dual shae. Proceedings 9th Annual Symosium on Comutational Geometry, ages 18 31, [31] H. Edelsbrunner, B. T. Fasy, and G. Rote. Add isotroic Gaussian kernels at own risk: More and more resiliant modes in higher dimensions. Proceedings 8th Annual Symosium on Comutational Geometry, ages , 01. [3] H. Edelsbrunner and J. Harer. Persistent homology - a survey. Contemorary Mathematics, 453:57 8,

14 [33] H. Edelsbrunner and J. Harer. Comutational Toology: An Introduction. American Mathematical Society, Providence, RI, USA, 010. [34] H. Edelsbrunner, D. Letscher, and A. J. Zomorodian. Toological ersistence and simlification. Discrete and Comutational Geometry, 8: , 00. [35] H. Federer. Curvature measures. Transactions of the American Mathematical Society, 93: , [36] J. Glaunès. Transort ar difféomorhismes de oints, de mesures et de courants our la comaraison de formes et l anatomie numérique. PhD thesis, Université Paris 13, 005. [37] J. Glaunès and S. Joshi. Temlate estimation form unlabeled oint set data and surfaces for comutational anatomy. In International Worksho on Mathematical Foundations of Comutational Anatomy, 006. [38] K. Grove. Critical oint theory for distance functions. Proceedings of Symosia in Pure Mathematics, 54: , [39] L. Guibas, Q. Mérigot, and D. Morozov. Witnessed k-distance. Proceedings 7th Annual Symosium on Comutational Geometry, ages 57 64, 011. [40] A. Hatcher. Algebraic Toology. Cambridge University Press, 00. [41] M. Hein and O. Bousquet. Hilbertian metrics and ositive definite kernels on robability measures. In Proceedings 10th International Worksho on Artificial Intelligence and Statistics, 005. [4] M. I. Jordan. Reroducing kernel hilbert saces. htt:// jordan/ courses/81b-sring04/lectures/rkhs.s. [43] S. Joshi, R. V. Kommaraju, J. M. Phillis, and S. Venkatasubramanian. Comaring distributions and shaes using the kernel distance. Proceedings 7th Annual Symosium on Comutational Geometry, 011. [44] A. N. Kolmogorov, S. V. Fomin, and R. A. Silverman. Introductory Real Analysis. Dover Publications, [45] J. M. Lee. Introduction to toological manifolds. Sringer-Verlag, 000. [46] J. M. Lee. Introduction to smooth manifolds. Sringer, 003. [47] A. Lieutier. Any oen bounded subset of R n has the same homotoy tye as its medial axis. Comuter- Aided Design, 36: , 004. [48] Q. Mérigot, M. Ovsjanikov, and L. Guibas. Robust Voronoi-based curvature and feature estimation. Proceedings SIAM/ACM Joint Conference on Geometric and Physical Modeling, ages 1 1, 009. [49] A. Müller. Integral robability metrics and their generating classes of functions. Advances in Alied Probability, 9():49 443, [50] P. Niyogi, S. Smale, and S. Weinberger. Finding the homology of submanifolds with high confidence from random samles. Discrete and Comutational Geometry, 39(1): , 008. [51] P. Niyogi, S. Smale, and S. Weinberger. A toological view of unsuervised learning from noisy data. Manuscrit,

15 [5] J. M. Phillis. es-samles for kernels. Proceedings 4th Annual ACM-SIAM Symosium on Discrete Algorithms (to aear), 013. [53] J. M. Phillis and S. Venkatasubramanian. A gentle introduction to the kernel distance. arxiv: , March 011. [54] D. W. Scott. Multivariate Density Estimation: Theory, Practice, and Visualization. Wiley, 199. [55] D. R. Sheehy. A multicover nerve for geometric inference. Canadian Conference in Comutational Geometry, 01. [56] D. R. Sheehy. Linear-size aroximations to the Vietoris-Ris filtration. Discrete & Comutational Geometry, 49: , 013. [57] B. W. Silverman. Density Estimation for Statistics and Data Analysis. Chaman & Hall/CRC, [58] B. K. Srierumbudur, A. Gretton, K. Fukumizu, B. Schölkof, and G. R. G. Lanckriet. Hilbert sace embeddings and metrics on robability measures. Journal of Machine Learning Research, 11: , 010. [59] C. Suquet. Distances Euclidiennes sur les mesures signées et alication à des théorèmes de Berry- Esséen. Bulletin Belgium Mathetics Society, : , [60] B. A. Turlach. Bandwidth selection in kernel density estimation: A review. Discussion aer 9317, Istitut de Statistique, UCL, Louvain-la-Neuve, Belgium. [61] M. Vaillant and J. Glaunès. Surface matching via currents. In Proceedings Information Processing in Medical Imaging, volume 19, ages 381 9, 005. [6] C. Villani. Toics in Otimal Transortation. American Mathematical Society, 003. [63] G. Wahba. Suort vector machines, reroducing kernel Hilbert saces, and randomization GACV. In Advances in Kernel Methods Suort Vector Learning, ages Bernhard Schölkof and Alezander J. Smola and Christoher J. C. Burges and Rosanna Soentiet, [64] C. Yang, R. Duraiswami, N. Gumerov, and L. Davis. Imroved fast Gauss transform and efficient kernel density estimation. In Proceedings 9th International Conference on Comuter Vision, ages , 003. [65] Y. Zheng, J. Jestes, J. M. Phillis, and F. Li. Quality and efficiency in kernel density estimates for large data. In Proceedings ACM Conference on the Management of Data (SIGMOD), A Exeriments We consider measures µ P defined by a oint set P R. To exerimentally visualize the structures of the suerlevel sets of kernel density estimates, or equivalently sublevel sets of the kernel distance, we do the simlest thing and just evaluate d K µ P at every grid oint on a sufficiently dense grid. 14

16 Figure 1: Sublevel sets for the kernel distance while varying the isolevel γ, for fixed σ (left) and for fixed isolevel γ but variable σ (right), with Gaussian kernel. The variable values of σ and γ are chosen to make the lots similar. 504 Grid aroximation. Due to the 1-Lischitz roerty of the kernel distance, well chosen grid oints 505 have several nice roerties. We consider the functions u to some resolution arameter ε > 0, consistent 506 with the arameter used to create a coreset aroximation Q. Now secifically, consider an axis-aligned grid G ε,d with edge length ε/ 507 d so no oint x R d is further than ε from some grid oint g G ε,d. 508 Since K(x, y) ε when x y σ ln(σ /ε) = δ ε,σ, we only need to consider grid oints g G ε,d 509 which are within δ ε,σ of some oint P (or q Q, of coreset Q of P ) [43, 65]. This is at most ( 510 d/ε) d (δ ε,σ ) d = O((σ log(ε/d)/ε) d ) grid oints total for d a fixed constant. Furthermore, due to the Lischitz roerty of d K P, when considering a secific level set at r 51 a oint x such that d K P (x) r ε is no further than ε from some g G such that dk P (g) r, and 513 every ball B ε (x) centered at some oint x Rd of radius ε so that all y B ε (x) has dk P (y) r has 514 some reresentative oint g G ε,d such that g B ε (x), and hence d K P (g) r. Thus dee regions and satially thick features are reserved, however thin assageways or layers that are near the threshold r, even if they do not corresond to a critical oint, may erroneously become disconnected, causing hantom comonents or other toological features Varying arameter r or σ. We demonstrate the geometric inference on a synthetic dataset in [0, 1] where 900 oints are chosen near a circle centered at (0.5, 0.5) with radius 0.5 or along a line segment from (0, 0) to (1, 1). Each oint has Gaussian noise added with standard deviation The remaining 1100 oints are chosen uniformly from [0, 1]. We use a Gaussian kernel with σ = Figure 1 shows (left) various sublevel sets γ Γ for the kernel distance at a fixed σ = 0.05 and (right) various suerlevel sets for a fixed γ = , but various values of σ Σ, where Γ = [ , , , , ] and Σ = [0.0485, , 0.049, , 0.05] This choice of Γ and Σ were made to highlight how similar the isolevels can be. Non-uniform densities. Another advantage of techniques based on some sort of distance to a measure (as oosed to offsets from the oints) is that they more naturally handle inut data sets where different 15

17 (a) (b) Figure : Sublevel sets of kernel distance (a), and distance to a measure (b) on a data set with two circles with different scales and densities. These lots fix a level set γ and vary (a) σ for d K µ and (b) m 0 for d ccm. The arameters are chosen to rovide similar aearing images. Figure 3: Alternate kernel density estimates for the same dataset as Figure 1. From left to right, they use the Lalace, triangle, Eanechnikov, and the ball kernel regions have different densities of oints. In Figure we consider a data set with 000 oints in [0, 1] generated from circles, where 00 oints are chosen near a circle centered at (0.08, 0.08) with radius 0.04 and 1600 are chosen near a circle centered at (0.9, 0.9) with radius Each oint in the to circle has Gaussian noise added with standard deviation 0.01 and the bottom circle has Gaussian noise added with standard deviation The remaining 00 oints are chosen uniformly from [0, 1]. Figure (a) shows the sublevel sets of d K µ with a fixed γ = and vary σ Σ, where Σ = [ , 0.014, , , 0.015] Each circle is recovered at some level. Using distance to a measure (see Figure (b)), we tried to tune the arameters so it looked similar to Figure (a). Secifically we fixed γ = and used values m 0 M 0, where M 0 = [0.05, 0.04, 0.1, 0., 0.5]. 16

18 Alternative kernels. for instance We can choose kernels other than the Gaussian kernel in the kernel density estimate, the Lalace kernel K(, x) = ex( x y /σ), the triangle kernel K(, x) = max{0, 1 x y /σ}, the Eanechnikov kernel K(, x) = max{0, 1 x y /σ }, or the ball kernel (K(, x) = {1 if x σ; o.w. 0}. Figure 3 chooses arameters to make them comarable to the Figure 1(left). Of these only the Lalace kernel is characteristic [58] making the corresonding version of the kernel distance a metric. Investigating which of the above reconstruction theorems hold when using the Lalace or other kernels is an interesting question for future work. Additionally, normal vector information and even k-forms can be used in the definition of a kernel [36, 61, 9, 8, 37, 43]; this variant is known as the current distance. In some cases it retains its metric roerties and has been shown to be very useful for shae alignment in conjunction with medical imaging A.1 Discussion: Noise and Scale Much of geometric and toological reconstruction grew out of the desire to understand shaes at various scales. A common mechanism is offset based; e.g., α-shaes [30] reresent the scale of a shae with the α arameter controlling the offsets of a oint cloud. There are two arameters with the kernel distance: r controls the offset through the sublevel set of the function, and σ controls the noise. We argue that any function which is robust to noise must have a arameter that controls the noise (e.g. σ for d K µ and m 0 for d ccm ). Here σ clearly defines some sense of scale in the setting of density estimation [57] and has a geometrical interretation, while m 0 reresents a fraction of the measure and is hard to interret geometrically, as illustrated by the lack of a Lischitz roerty for d ccm with resect to m 0. There are several exeriments above, in Aendix A, from which several insights can be drawn. One observation is that even though there are two arameters r and σ that control the scale, the interesting values tyically have r very close to σ. Thus, we recommend to first set σ to control the scale at which the data is studied, and then exlore the effect of varying r for values near σ. Moreover, not much structure seems to be missed by not exloring the sace of both arameters; Figure 1 shows that fixing one (of r and σ) and varying the other can rovide very similar suerlevel sets. However, it is ossible instead to fix r and exlore the ersistent toological features in the data [34, 3] (those less effected by smoothing) by varying σ. On the other hand, it remains a challenge roblem to study two arameter ersistent homology [8, 7] under the setting of kernel distance (or kernel density estimate) B Lischitz Proerty of d K µ In this section we formalize several Lischitz roerties about the kernel distance. That is as some arameter changes, how quickly can d K µ change B.1 Relating Semiconcavity to Lischitz Proerties We rove a generalization of a folklore result relating l-semiconcavity of functions and l-lischitz functions. Lemma.. Function g(x) such that (g(x)) is l-semiconcave, for l 1, then g(x) must be l-lischitz. 17

Geometric Inference on Kernel Density Estimates

Geometric Inference on Kernel Density Estimates Geometric Inference on Kernel Density Estimates Bei Wang 1 Jeff M. Phillips 2 Yan Zheng 2 1 Scientific Computing and Imaging Institute, University of Utah 2 School of Computing, University of Utah Feb

More information

Some Topics in Computational Topology. Yusu Wang. AMS Short Course 2014

Some Topics in Computational Topology. Yusu Wang. AMS Short Course 2014 Some Topics in Computational Topology Yusu Wang AMS Short Course 2014 Introduction Much recent developments in computational topology Both in theory and in their applications E.g, the theory of persistence

More information

Some Topics in Computational Topology

Some Topics in Computational Topology Some Topics in Computational Topology Yusu Wang Ohio State University AMS Short Course 2014 Introduction Much recent developments in computational topology Both in theory and in their applications E.g,

More information

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES OHAD GILADI AND ASSAF NAOR Abstract. It is shown that if (, ) is a Banach sace with Rademacher tye 1 then for every n N there exists

More information

EXPLORING PERSISTENT LOCAL HOMOLOGY IN TOPOLOGICAL DATA ANALYSIS

EXPLORING PERSISTENT LOCAL HOMOLOGY IN TOPOLOGICAL DATA ANALYSIS EXPLORING PERSISTENT LOCAL HOMOLOGY IN TOPOLOGICAL DATA ANALYSIS Brittany Terese Fasy Montana State University Comuter Science Deartment Bozeman, MT Bei Wang University of Utah Scientific Comuting and

More information

Elementary Analysis in Q p

Elementary Analysis in Q p Elementary Analysis in Q Hannah Hutter, May Szedlák, Phili Wirth November 17, 2011 This reort follows very closely the book of Svetlana Katok 1. 1 Sequences and Series In this section we will see some

More information

Approximating min-max k-clustering

Approximating min-max k-clustering Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost

More information

Elementary theory of L p spaces

Elementary theory of L p spaces CHAPTER 3 Elementary theory of L saces 3.1 Convexity. Jensen, Hölder, Minkowski inequality. We begin with two definitions. A set A R d is said to be convex if, for any x 0, x 1 2 A x = x 0 + (x 1 x 0 )

More information

Convex Optimization methods for Computing Channel Capacity

Convex Optimization methods for Computing Channel Capacity Convex Otimization methods for Comuting Channel Caacity Abhishek Sinha Laboratory for Information and Decision Systems (LIDS), MIT sinhaa@mit.edu May 15, 2014 We consider a classical comutational roblem

More information

arxiv:math/ v4 [math.gn] 25 Nov 2006

arxiv:math/ v4 [math.gn] 25 Nov 2006 arxiv:math/0607751v4 [math.gn] 25 Nov 2006 On the uniqueness of the coincidence index on orientable differentiable manifolds P. Christoher Staecker October 12, 2006 Abstract The fixed oint index of toological

More information

Estimation of the large covariance matrix with two-step monotone missing data

Estimation of the large covariance matrix with two-step monotone missing data Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo

More information

Inequalities for the L 1 Deviation of the Empirical Distribution

Inequalities for the L 1 Deviation of the Empirical Distribution Inequalities for the L 1 Deviation of the Emirical Distribution Tsachy Weissman, Erik Ordentlich, Gadiel Seroussi, Sergio Verdu, Marcelo J. Weinberger June 13, 2003 Abstract We derive bounds on the robability

More information

p-adic Measures and Bernoulli Numbers

p-adic Measures and Bernoulli Numbers -Adic Measures and Bernoulli Numbers Adam Bowers Introduction The constants B k in the Taylor series exansion t e t = t k B k k! k=0 are known as the Bernoulli numbers. The first few are,, 6, 0, 30, 0,

More information

Robustness of classifiers to uniform l p and Gaussian noise Supplementary material

Robustness of classifiers to uniform l p and Gaussian noise Supplementary material Robustness of classifiers to uniform l and Gaussian noise Sulementary material Jean-Yves Franceschi Ecole Normale Suérieure de Lyon LIP UMR 5668 Omar Fawzi Ecole Normale Suérieure de Lyon LIP UMR 5668

More information

MATH 2710: NOTES FOR ANALYSIS

MATH 2710: NOTES FOR ANALYSIS MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite

More information

GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS

GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS PALLAVI DANI 1. Introduction Let Γ be a finitely generated grou and let S be a finite set of generators for Γ. This determines a word metric on

More information

SYMPLECTIC STRUCTURES: AT THE INTERFACE OF ANALYSIS, GEOMETRY, AND TOPOLOGY

SYMPLECTIC STRUCTURES: AT THE INTERFACE OF ANALYSIS, GEOMETRY, AND TOPOLOGY SYMPLECTIC STRUCTURES: AT THE INTERFACE OF ANALYSIS, GEOMETRY, AND TOPOLOGY FEDERICA PASQUOTTO 1. Descrition of the roosed research 1.1. Introduction. Symlectic structures made their first aearance in

More information

Geometric Inference for Probability distributions

Geometric Inference for Probability distributions Geometric Inference for Probability distributions F. Chazal 1 D. Cohen-Steiner 2 Q. Mérigot 2 1 Geometrica, INRIA Saclay, 2 Geometrica, INRIA Sophia-Antipolis 2009 June 1 Motivation What is the (relevant)

More information

General Linear Model Introduction, Classes of Linear models and Estimation

General Linear Model Introduction, Classes of Linear models and Estimation Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)

More information

CMSC 425: Lecture 4 Geometry and Geometric Programming

CMSC 425: Lecture 4 Geometry and Geometric Programming CMSC 425: Lecture 4 Geometry and Geometric Programming Geometry for Game Programming and Grahics: For the next few lectures, we will discuss some of the basic elements of geometry. There are many areas

More information

Sharp gradient estimate and spectral rigidity for p-laplacian

Sharp gradient estimate and spectral rigidity for p-laplacian Shar gradient estimate and sectral rigidity for -Lalacian Chiung-Jue Anna Sung and Jiaing Wang To aear in ath. Research Letters. Abstract We derive a shar gradient estimate for ositive eigenfunctions of

More information

Local Homology Transfer and Stratification Learning

Local Homology Transfer and Stratification Learning Local Homology Transfer and Stratification Learning Paul Bendich Bei Wang Sayan Mukherjee Abstract The objective of this aer is to show that oint cloud data can under certain circumstances be clustered

More information

Analysis of some entrance probabilities for killed birth-death processes

Analysis of some entrance probabilities for killed birth-death processes Analysis of some entrance robabilities for killed birth-death rocesses Master s Thesis O.J.G. van der Velde Suervisor: Dr. F.M. Sieksma July 5, 207 Mathematical Institute, Leiden University Contents Introduction

More information

Uniform Law on the Unit Sphere of a Banach Space

Uniform Law on the Unit Sphere of a Banach Space Uniform Law on the Unit Shere of a Banach Sace by Bernard Beauzamy Société de Calcul Mathématique SA Faubourg Saint Honoré 75008 Paris France Setember 008 Abstract We investigate the construction of a

More information

Introduction to Banach Spaces

Introduction to Banach Spaces CHAPTER 8 Introduction to Banach Saces 1. Uniform and Absolute Convergence As a rearation we begin by reviewing some familiar roerties of Cauchy sequences and uniform limits in the setting of metric saces.

More information

4. Score normalization technical details We now discuss the technical details of the score normalization method.

4. Score normalization technical details We now discuss the technical details of the score normalization method. SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules

More information

arxiv: v1 [physics.data-an] 26 Oct 2012

arxiv: v1 [physics.data-an] 26 Oct 2012 Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch

More information

COMMUNICATION BETWEEN SHAREHOLDERS 1

COMMUNICATION BETWEEN SHAREHOLDERS 1 COMMUNICATION BTWN SHARHOLDRS 1 A B. O A : A D Lemma B.1. U to µ Z r 2 σ2 Z + σ2 X 2r ω 2 an additive constant that does not deend on a or θ, the agents ayoffs can be written as: 2r rθa ω2 + θ µ Y rcov

More information

Network Configuration Control Via Connectivity Graph Processes

Network Configuration Control Via Connectivity Graph Processes Network Configuration Control Via Connectivity Grah Processes Abubakr Muhammad Deartment of Electrical and Systems Engineering University of Pennsylvania Philadelhia, PA 90 abubakr@seas.uenn.edu Magnus

More information

HENSEL S LEMMA KEITH CONRAD

HENSEL S LEMMA KEITH CONRAD HENSEL S LEMMA KEITH CONRAD 1. Introduction In the -adic integers, congruences are aroximations: for a and b in Z, a b mod n is the same as a b 1/ n. Turning information modulo one ower of into similar

More information

Radial Basis Function Networks: Algorithms

Radial Basis Function Networks: Algorithms Radial Basis Function Networks: Algorithms Introduction to Neural Networks : Lecture 13 John A. Bullinaria, 2004 1. The RBF Maing 2. The RBF Network Architecture 3. Comutational Power of RBF Networks 4.

More information

ON MINKOWSKI MEASURABILITY

ON MINKOWSKI MEASURABILITY ON MINKOWSKI MEASURABILITY F. MENDIVIL AND J. C. SAUNDERS DEPARTMENT OF MATHEMATICS AND STATISTICS ACADIA UNIVERSITY WOLFVILLE, NS CANADA B4P 2R6 Abstract. Two athological roerties of Minkowski content

More information

On the Toppling of a Sand Pile

On the Toppling of a Sand Pile Discrete Mathematics and Theoretical Comuter Science Proceedings AA (DM-CCG), 2001, 275 286 On the Toling of a Sand Pile Jean-Christohe Novelli 1 and Dominique Rossin 2 1 CNRS, LIFL, Bâtiment M3, Université

More information

On Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law

On Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law On Isoerimetric Functions of Probability Measures Having Log-Concave Densities with Resect to the Standard Normal Law Sergey G. Bobkov Abstract Isoerimetric inequalities are discussed for one-dimensional

More information

Sums of independent random variables

Sums of independent random variables 3 Sums of indeendent random variables This lecture collects a number of estimates for sums of indeendent random variables with values in a Banach sace E. We concentrate on sums of the form N γ nx n, where

More information

SECTION 5: FIBRATIONS AND HOMOTOPY FIBERS

SECTION 5: FIBRATIONS AND HOMOTOPY FIBERS SECTION 5: FIBRATIONS AND HOMOTOPY FIBERS In this section we will introduce two imortant classes of mas of saces, namely the Hurewicz fibrations and the more general Serre fibrations, which are both obtained

More information

Cryptanalysis of Pseudorandom Generators

Cryptanalysis of Pseudorandom Generators CSE 206A: Lattice Algorithms and Alications Fall 2017 Crytanalysis of Pseudorandom Generators Instructor: Daniele Micciancio UCSD CSE As a motivating alication for the study of lattice in crytograhy we

More information

DIFFERENTIAL GEOMETRY. LECTURES 9-10,

DIFFERENTIAL GEOMETRY. LECTURES 9-10, DIFFERENTIAL GEOMETRY. LECTURES 9-10, 23-26.06.08 Let us rovide some more details to the definintion of the de Rham differential. Let V, W be two vector bundles and assume we want to define an oerator

More information

#A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS

#A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS #A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS Ramy F. Taki ElDin Physics and Engineering Mathematics Deartment, Faculty of Engineering, Ain Shams University, Cairo, Egyt

More information

A New Perspective on Learning Linear Separators with Large L q L p Margins

A New Perspective on Learning Linear Separators with Large L q L p Margins A New Persective on Learning Linear Searators with Large L q L Margins Maria-Florina Balcan Georgia Institute of Technology Christoher Berlind Georgia Institute of Technology Abstract We give theoretical

More information

GOOD MODELS FOR CUBIC SURFACES. 1. Introduction

GOOD MODELS FOR CUBIC SURFACES. 1. Introduction GOOD MODELS FOR CUBIC SURFACES ANDREAS-STEPHAN ELSENHANS Abstract. This article describes an algorithm for finding a model of a hyersurface with small coefficients. It is shown that the aroach works in

More information

New Information Measures for the Generalized Normal Distribution

New Information Measures for the Generalized Normal Distribution Information,, 3-7; doi:.339/info3 OPEN ACCESS information ISSN 75-7 www.mdi.com/journal/information Article New Information Measures for the Generalized Normal Distribution Christos P. Kitsos * and Thomas

More information

Best approximation by linear combinations of characteristic functions of half-spaces

Best approximation by linear combinations of characteristic functions of half-spaces Best aroximation by linear combinations of characteristic functions of half-saces Paul C. Kainen Deartment of Mathematics Georgetown University Washington, D.C. 20057-1233, USA Věra Kůrková Institute of

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

The non-stochastic multi-armed bandit problem

The non-stochastic multi-armed bandit problem Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at

More information

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split A Bound on the Error of Cross Validation Using the Aroximation and Estimation Rates, with Consequences for the Training-Test Slit Michael Kearns AT&T Bell Laboratories Murray Hill, NJ 7974 mkearns@research.att.com

More information

CHAPTER 2: SMOOTH MAPS. 1. Introduction In this chapter we introduce smooth maps between manifolds, and some important

CHAPTER 2: SMOOTH MAPS. 1. Introduction In this chapter we introduce smooth maps between manifolds, and some important CHAPTER 2: SMOOTH MAPS DAVID GLICKENSTEIN 1. Introduction In this chater we introduce smooth mas between manifolds, and some imortant concets. De nition 1. A function f : M! R k is a smooth function if

More information

Extremal Polynomials with Varying Measures

Extremal Polynomials with Varying Measures International Mathematical Forum, 2, 2007, no. 39, 1927-1934 Extremal Polynomials with Varying Measures Rabah Khaldi Deartment of Mathematics, Annaba University B.P. 12, 23000 Annaba, Algeria rkhadi@yahoo.fr

More information

Some results of convex programming complexity

Some results of convex programming complexity 2012c12 $ Ê Æ Æ 116ò 14Ï Dec., 2012 Oerations Research Transactions Vol.16 No.4 Some results of convex rogramming comlexity LOU Ye 1,2 GAO Yuetian 1 Abstract Recently a number of aers were written that

More information

Feedback-error control

Feedback-error control Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller

More information

Multiplicity of weak solutions for a class of nonuniformly elliptic equations of p-laplacian type

Multiplicity of weak solutions for a class of nonuniformly elliptic equations of p-laplacian type Nonlinear Analysis 7 29 536 546 www.elsevier.com/locate/na Multilicity of weak solutions for a class of nonuniformly ellitic equations of -Lalacian tye Hoang Quoc Toan, Quô c-anh Ngô Deartment of Mathematics,

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell October 25, 2009 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)

More information

Mollifiers and its applications in L p (Ω) space

Mollifiers and its applications in L p (Ω) space Mollifiers and its alications in L () sace MA Shiqi Deartment of Mathematics, Hong Kong Batist University November 19, 2016 Abstract This note gives definition of mollifier and mollification. We illustrate

More information

Ž. Ž. Ž. 2 QUADRATIC AND INVERSE REGRESSIONS FOR WISHART DISTRIBUTIONS 1

Ž. Ž. Ž. 2 QUADRATIC AND INVERSE REGRESSIONS FOR WISHART DISTRIBUTIONS 1 The Annals of Statistics 1998, Vol. 6, No., 573595 QUADRATIC AND INVERSE REGRESSIONS FOR WISHART DISTRIBUTIONS 1 BY GERARD LETAC AND HELENE ` MASSAM Universite Paul Sabatier and York University If U and

More information

THE 2D CASE OF THE BOURGAIN-DEMETER-GUTH ARGUMENT

THE 2D CASE OF THE BOURGAIN-DEMETER-GUTH ARGUMENT THE 2D CASE OF THE BOURGAIN-DEMETER-GUTH ARGUMENT ZANE LI Let e(z) := e 2πiz and for g : [0, ] C and J [0, ], define the extension oerator E J g(x) := g(t)e(tx + t 2 x 2 ) dt. J For a ositive weight ν

More information

Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various Families of R n Norms and Some Open Problems

Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various Families of R n Norms and Some Open Problems Int. J. Oen Problems Comt. Math., Vol. 3, No. 2, June 2010 ISSN 1998-6262; Coyright c ICSRS Publication, 2010 www.i-csrs.org Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various

More information

arxiv: v1 [stat.ml] 10 Mar 2016

arxiv: v1 [stat.ml] 10 Mar 2016 Global and Local Uncertainty Princiles for Signals on Grahs Nathanael Perraudin, Benjamin Ricaud, David I Shuman, and Pierre Vandergheynst March, 206 arxiv:603.03030v [stat.ml] 0 Mar 206 Abstract Uncertainty

More information

Applications to stochastic PDE

Applications to stochastic PDE 15 Alications to stochastic PE In this final lecture we resent some alications of the theory develoed in this course to stochastic artial differential equations. We concentrate on two secific examles:

More information

A Note on Guaranteed Sparse Recovery via l 1 -Minimization

A Note on Guaranteed Sparse Recovery via l 1 -Minimization A Note on Guaranteed Sarse Recovery via l -Minimization Simon Foucart, Université Pierre et Marie Curie Abstract It is roved that every s-sarse vector x C N can be recovered from the measurement vector

More information

Positivity, local smoothing and Harnack inequalities for very fast diffusion equations

Positivity, local smoothing and Harnack inequalities for very fast diffusion equations Positivity, local smoothing and Harnack inequalities for very fast diffusion equations Dedicated to Luis Caffarelli for his ucoming 60 th birthday Matteo Bonforte a, b and Juan Luis Vázquez a, c Abstract

More information

MATH 6210: SOLUTIONS TO PROBLEM SET #3

MATH 6210: SOLUTIONS TO PROBLEM SET #3 MATH 6210: SOLUTIONS TO PROBLEM SET #3 Rudin, Chater 4, Problem #3. The sace L (T) is searable since the trigonometric olynomials with comlex coefficients whose real and imaginary arts are rational form

More information

GENERALIZED NORMS INEQUALITIES FOR ABSOLUTE VALUE OPERATORS

GENERALIZED NORMS INEQUALITIES FOR ABSOLUTE VALUE OPERATORS International Journal of Analysis Alications ISSN 9-8639 Volume 5, Number (04), -9 htt://www.etamaths.com GENERALIZED NORMS INEQUALITIES FOR ABSOLUTE VALUE OPERATORS ILYAS ALI, HU YANG, ABDUL SHAKOOR Abstract.

More information

Asymptotically Optimal Simulation Allocation under Dependent Sampling

Asymptotically Optimal Simulation Allocation under Dependent Sampling Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee

More information

Principal Components Analysis and Unsupervised Hebbian Learning

Principal Components Analysis and Unsupervised Hebbian Learning Princial Comonents Analysis and Unsuervised Hebbian Learning Robert Jacobs Deartment of Brain & Cognitive Sciences University of Rochester Rochester, NY 1467, USA August 8, 008 Reference: Much of the material

More information

An Analysis of Reliable Classifiers through ROC Isometrics

An Analysis of Reliable Classifiers through ROC Isometrics An Analysis of Reliable Classifiers through ROC Isometrics Stijn Vanderlooy s.vanderlooy@cs.unimaas.nl Ida G. Srinkhuizen-Kuyer kuyer@cs.unimaas.nl Evgueni N. Smirnov smirnov@cs.unimaas.nl MICC-IKAT, Universiteit

More information

Statics and dynamics: some elementary concepts

Statics and dynamics: some elementary concepts 1 Statics and dynamics: some elementary concets Dynamics is the study of the movement through time of variables such as heartbeat, temerature, secies oulation, voltage, roduction, emloyment, rices and

More information

Uniformly best wavenumber approximations by spatial central difference operators: An initial investigation

Uniformly best wavenumber approximations by spatial central difference operators: An initial investigation Uniformly best wavenumber aroximations by satial central difference oerators: An initial investigation Vitor Linders and Jan Nordström Abstract A characterisation theorem for best uniform wavenumber aroximations

More information

LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)]

LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)] LECTURE 7 NOTES 1. Convergence of random variables. Before delving into the large samle roerties of the MLE, we review some concets from large samle theory. 1. Convergence in robability: x n x if, for

More information

Interleaving Distance between Merge Trees

Interleaving Distance between Merge Trees Noname manuscript No. (will be inserted by the editor) Interleaving Distance between Merge Trees Dmitriy Morozov Kenes Beketayev Gunther H. Weber the date of receipt and acceptance should be inserted later

More information

Topic 7: Using identity types

Topic 7: Using identity types Toic 7: Using identity tyes June 10, 2014 Now we would like to learn how to use identity tyes and how to do some actual mathematics with them. By now we have essentially introduced all inference rules

More information

Quantitative estimates of propagation of chaos for stochastic systems with W 1, kernels

Quantitative estimates of propagation of chaos for stochastic systems with W 1, kernels oname manuscrit o. will be inserted by the editor) Quantitative estimates of roagation of chaos for stochastic systems with W, kernels Pierre-Emmanuel Jabin Zhenfu Wang Received: date / Acceted: date Abstract

More information

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales Lecture 6 Classification of states We have shown that all states of an irreducible countable state Markov chain must of the same tye. This gives rise to the following classification. Definition. [Classification

More information

1 Probability Spaces and Random Variables

1 Probability Spaces and Random Variables 1 Probability Saces and Random Variables 1.1 Probability saces Ω: samle sace consisting of elementary events (or samle oints). F : the set of events P: robability 1.2 Kolmogorov s axioms Definition 1.2.1

More information

ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM

ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM JOHN BINDER Abstract. In this aer, we rove Dirichlet s theorem that, given any air h, k with h, k) =, there are infinitely many rime numbers congruent to

More information

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning TNN-2009-P-1186.R2 1 Uncorrelated Multilinear Princial Comonent Analysis for Unsuervised Multilinear Subsace Learning Haiing Lu, K. N. Plataniotis and A. N. Venetsanooulos The Edward S. Rogers Sr. Deartment

More information

RANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES

RANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES RANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES AARON ZWIEBACH Abstract. In this aer we will analyze research that has been recently done in the field of discrete

More information

PETER J. GRABNER AND ARNOLD KNOPFMACHER

PETER J. GRABNER AND ARNOLD KNOPFMACHER ARITHMETIC AND METRIC PROPERTIES OF -ADIC ENGEL SERIES EXPANSIONS PETER J. GRABNER AND ARNOLD KNOPFMACHER Abstract. We derive a characterization of rational numbers in terms of their unique -adic Engel

More information

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO) Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment

More information

On the capacity of the general trapdoor channel with feedback

On the capacity of the general trapdoor channel with feedback On the caacity of the general tradoor channel with feedback Jui Wu and Achilleas Anastasooulos Electrical Engineering and Comuter Science Deartment University of Michigan Ann Arbor, MI, 48109-1 email:

More information

The inverse Goldbach problem

The inverse Goldbach problem 1 The inverse Goldbach roblem by Christian Elsholtz Submission Setember 7, 2000 (this version includes galley corrections). Aeared in Mathematika 2001. Abstract We imrove the uer and lower bounds of the

More information

Positive decomposition of transfer functions with multiple poles

Positive decomposition of transfer functions with multiple poles Positive decomosition of transfer functions with multile oles Béla Nagy 1, Máté Matolcsi 2, and Márta Szilvási 1 Deartment of Analysis, Technical University of Budaest (BME), H-1111, Budaest, Egry J. u.

More information

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules. Introduction: The is widely used in industry to monitor the number of fraction nonconforming units. A nonconforming unit is

More information

Distributed Rule-Based Inference in the Presence of Redundant Information

Distributed Rule-Based Inference in the Presence of Redundant Information istribution Statement : roved for ublic release; distribution is unlimited. istributed Rule-ased Inference in the Presence of Redundant Information June 8, 004 William J. Farrell III Lockheed Martin dvanced

More information

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deriving ndicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deutsch Centre for Comutational Geostatistics Deartment of Civil &

More information

Chapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population

Chapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population Chater 7 and s Selecting a Samle Point Estimation Introduction to s of Proerties of Point Estimators Other Methods Introduction An element is the entity on which data are collected. A oulation is a collection

More information

Proof: We follow thearoach develoed in [4]. We adot a useful but non-intuitive notion of time; a bin with z balls at time t receives its next ball at

Proof: We follow thearoach develoed in [4]. We adot a useful but non-intuitive notion of time; a bin with z balls at time t receives its next ball at A Scaling Result for Exlosive Processes M. Mitzenmacher Λ J. Sencer We consider the following balls and bins model, as described in [, 4]. Balls are sequentially thrown into bins so that the robability

More information

MAZUR S CONSTRUCTION OF THE KUBOTA LEPOLDT p-adic L-FUNCTION. n s = p. (1 p s ) 1.

MAZUR S CONSTRUCTION OF THE KUBOTA LEPOLDT p-adic L-FUNCTION. n s = p. (1 p s ) 1. MAZUR S CONSTRUCTION OF THE KUBOTA LEPOLDT -ADIC L-FUNCTION XEVI GUITART Abstract. We give an overview of Mazur s construction of the Kubota Leooldt -adic L- function as the -adic Mellin transform of a

More information

Solved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points.

Solved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points. Solved Problems Solved Problems P Solve the three simle classification roblems shown in Figure P by drawing a decision boundary Find weight and bias values that result in single-neuron ercetrons with the

More information

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers

More information

Intrinsic Approximation on Cantor-like Sets, a Problem of Mahler

Intrinsic Approximation on Cantor-like Sets, a Problem of Mahler Intrinsic Aroximation on Cantor-like Sets, a Problem of Mahler Ryan Broderick, Lior Fishman, Asaf Reich and Barak Weiss July 200 Abstract In 984, Kurt Mahler osed the following fundamental question: How

More information

A CONCRETE EXAMPLE OF PRIME BEHAVIOR IN QUADRATIC FIELDS. 1. Abstract

A CONCRETE EXAMPLE OF PRIME BEHAVIOR IN QUADRATIC FIELDS. 1. Abstract A CONCRETE EXAMPLE OF PRIME BEHAVIOR IN QUADRATIC FIELDS CASEY BRUCK 1. Abstract The goal of this aer is to rovide a concise way for undergraduate mathematics students to learn about how rime numbers behave

More information

Some Unitary Space Time Codes From Sphere Packing Theory With Optimal Diversity Product of Code Size

Some Unitary Space Time Codes From Sphere Packing Theory With Optimal Diversity Product of Code Size IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 5, NO., DECEMBER 4 336 Some Unitary Sace Time Codes From Shere Packing Theory With Otimal Diversity Product of Code Size Haiquan Wang, Genyuan Wang, and Xiang-Gen

More information

Estimation of Separable Representations in Psychophysical Experiments

Estimation of Separable Representations in Psychophysical Experiments Estimation of Searable Reresentations in Psychohysical Exeriments Michele Bernasconi (mbernasconi@eco.uninsubria.it) Christine Choirat (cchoirat@eco.uninsubria.it) Raffaello Seri (rseri@eco.uninsubria.it)

More information

Lecture 10: Hypercontractivity

Lecture 10: Hypercontractivity CS 880: Advanced Comlexity Theory /15/008 Lecture 10: Hyercontractivity Instructor: Dieter van Melkebeek Scribe: Baris Aydinlioglu This is a technical lecture throughout which we rove the hyercontractivity

More information

7.2 Inference for comparing means of two populations where the samples are independent

7.2 Inference for comparing means of two populations where the samples are independent Objectives 7.2 Inference for comaring means of two oulations where the samles are indeendent Two-samle t significance test (we give three examles) Two-samle t confidence interval htt://onlinestatbook.com/2/tests_of_means/difference_means.ht

More information

Geometric Inference for Probability distributions

Geometric Inference for Probability distributions Geometric Inference for Probability distributions F. Chazal 1 D. Cohen-Steiner 2 Q. Mérigot 2 1 Geometrica, INRIA Saclay, 2 Geometrica, INRIA Sophia-Antipolis 2009 July 8 F. Chazal, D. Cohen-Steiner, Q.

More information

Brownian Motion and Random Prime Factorization

Brownian Motion and Random Prime Factorization Brownian Motion and Random Prime Factorization Kendrick Tang June 4, 202 Contents Introduction 2 2 Brownian Motion 2 2. Develoing Brownian Motion.................... 2 2.. Measure Saces and Borel Sigma-Algebras.........

More information

ON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS

ON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS #A13 INTEGERS 14 (014) ON THE LEAST SIGNIFICANT ADIC DIGITS OF CERTAIN LUCAS NUMBERS Tamás Lengyel Deartment of Mathematics, Occidental College, Los Angeles, California lengyel@oxy.edu Received: 6/13/13,

More information

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management

More information