Renyi divergence and asymptotic theory of minimal K-point random graphs Alfred Hero Dept. of EECS, The University of Michigan, Summer 1999
|
|
- Jacob Sutton
- 5 years ago
- Views:
Transcription
1 Renyi divergence and asymptotic theory of minimal K-point random graphs Alfred Hero Dept. of EECS, The University of Michigan, Summer 999
2 Outline Rényi Entropy and Rényi Divergence ffl Euclidean k-minimal graphs ffl Asymptotics for tightly coverable graphs ffl Asymptotics for greedy k-minimal graphs ffl Influence function for greedy k-minimal graph ffl Applications ffl 2
3 ffl Rényi Entropy of order ν ffl Rényi Divergence of order ν. Rényi Entropy and Rényi Divergence ffl X ο f (x) a d-dimensional random vector. ν (f ) = H ν ln Z f ν (x)dx () ν (f;f o ) = I ν ln Z f (x) f o (x) ffl f o a dominating Lebesgue density ν f o (x)dx (2) 3
4 2 Examples: ffl Hellinger distance squared pf I (f;f o ) = ln Z (x)fo (x)dx 2 ffl Kullback-Liebler divergence Z lim I ν(f;f o ) = ν! o (x)ln f o(x) f (x) dx: f 4
5 2. k-minimal graphs vertices are subset of X n = fx i g n i= : n points in IRd ffl AgraphGofdegreel consists of vertices and edges ffl edges are denoted fe ij g ffl for any i: cardfe ij g j» l Weight (with power exponent fl) ofg L G (X n ) = X kek fl e2g 5
6 e2m(x n ) Examples: n-point Minimal Spanning Tree (MST) Let M(X n ) denote the possible sets of edges in the class of acyclic graphs spanning X n (spanning trees). The Euclidean Power Weighted MST achieves min n ) M(X L MST (X n ) = X kek fl : 6
7 256 random samples MST Figure. A data set and the MST 7
8 L TSP n ) = min (X n ) T(X e2t(x n ) kek fl : n-point Traveling Salesman Problem (TSP) Let T(X n ) be sets of edges in the class of graphs of degree 2 spanning X n. The minimal power-weighted TSP tour achieves X 8
9 L(m[(F Q i ) q i ]) + C m d fl L(m[(F Q i ) q i ]) C 2 m d fl 2.. Quasi-additive Euclidean Functionals is a continuous subadditive functional if it satises L Null L(f) =,wheref Condition: is the null set. Subadditivity: There exists a C constant with the following property: For any uniform resolution Q =m-partition m md» m X ) L(F i= Superadditivity: There exists a constant C 2 with the following property: md m X ) L(F i= 9
10 F and G of [; ] d Continuity: There exists a constant C 3 such that for all nite subsets jl(f [ G) L(F )j» C 3 (card(g)) (d fl)=d Denition A continuous subadditive functional L is said to be a quasi-additive functional when there exists a continuous superadditive functional L Λ which satises L(F ) + L Λ (F ) and the approximation property je[l(u ;:::;U n )] E[L Λ (U ;:::;U n )]j» C 4 n (d fl )=d (3) where U ;:::;U n are i.i.d. uniform random vectors in [; ] d.
11 lim n)=n (d fl)=d Z = L;fl L(X n! Asymptotics: the BHH Theorem and entropy 2.2. estimation Theorem [Redmond&Yukich:96] Let L be a quasi-additive Euclidean functional with power-exponent fl, and let X n = fx ;:::;x n g be an i.i.d. sample drawn from a distribution on [; ] d with an absolutely continuous component having (Lebesgue) density f (x). Then (4) f (x) (d fl)=d dx; (a:s:) Or, letting ν = (d fl)=d lim L(X n)=n ν = L;fl exp (( ν)h ν (f )) ; (a:s:) n!
12 uniform 2 d distribution (n=) triangular 2 d distribution (n=) y.5 y.5.5 MST x.5 MST x y.5 y.5.5 Mean MST length x as function of n 25 Mean MST length n, uniform 2 d distribution on [,].5 Mean MST length x as function of n 25 MST length n, triangular 2 d distribution on [,] Figure 2. 2D Triangular vs. Uniform sample study for MST. 2
13 MST length 5 2*Log(Ln/sqrt(n)) N, o=unif, *=triang number of points, o=unif, *=triang Figure 3. MST and log MST weights as function of number of samples for 2D uniform vs. triangular study. 3
14 ffl g(x): a reference density on IR d 2.3. I-Divergence and Quasi-additive functions ffl Assume f g, i.e. for all x such that g(x) = we have f (x) =. Make measure dx! g(x)dx transformation [; ] on. Then for Y d ffl n = transformed data [Hero&Michel:HOS99] lim L(Y n)=n ν = L;fl exp (( ν)i ν (f;g)) ; (a:s:) n! 4
15 x = [x ;:::;x d ] T! y = [y ;:::;y d ] T y d = G(x d jx d ;:::;x ) Proof. Make transformation of variables y = G(x ) (5) 2 = G(x 2 jx ) y.. where G(x k jx k ;:::;x ) = R x k g(~xk jx k ;:::;x )d~x k 2. Induced density h(y), of the vector y, takes the form: = f (G (y );:::;G (y d jy d ;:::;y )) h(y) (y );:::;G (y d jy d ;:::;y )) g(g (6) where G is inverse CDF and x k = G (y k jx k ;:::;x ). 5
16 ν g(x)dx = I(f;g) 3. Then we know (a:s:) ν (Y n )! ^H ν ln Z h ν (y)dy 4. By Jacobian formula: dy = dx = g(x)dx and dy dx ν ln Z ν (y)dy = h ν ln Z f (x) g(x) 6
17 3. Outlier Sensitivity of minimal n-point graphs f Assume is a mixture density of the form = ( ffl)f + fflf o ; (7) f where f o is a known outlier density ffl f is an unknown target density ffl ffl ffl 2 [; ] is unknown mixture parameter 7
18 5 samples from f density Add 5 samples of uniform noise x2.5 x2.5.5 x.5 x x2.5 x2.5.5 x.5 x Figure 4. st row: 2D torus density with and without the addition of uniform outliers. 2nd row: corresponding MST s. 8
19 3.. Minimal k-point Euclidean Graphs Fix k,» k» n. Let T n;k = T(x i ;:::;x ik ) be a minimal graph connecting k distinct vertices x i ;:::;x ik. The power weighted k-minimal graph T Λ n;k = TΛ (x i Λ ;:::;x i Λ k ) is the overall minimum weight k-point graph Λ n;k = L Λ (X n;k ) = min L ;:::;i k i min T n;k X kek fl e2t n;k 9
20 k MST (k=99): outlier rejection (k=98): 2 outlier rejection x2.5 x2.5.5 k MST (k=62): 38 outlier rejection.5 (k=25): 75 outlier rejection x2.5 x2.5.5 x.5 x Figure 5. k-mst for 2D torus density with and without the addition of uniform outliers. 2
21 4. Extended BHH Thm for k-minimal Graphs Fix ff 2 [; ] and assume that the k-minimal graph is tightly coverable. If k = bffnc,as n! we have (Hero&Michel:IT99) f ν (xjx 2 A)dx (a:s:) L(X Λ n;k )=(bffnc)ν! L;fl min (A) ff A:P Z or, alternatively, with ν (fjx 2 A) = H ν ln Z f ν (xjx 2 A)dx L(X Λ n;k )=(bffnc)ν! L;fl exp ν) min ( H ν(fjx 2 A) (A) ff A:P (a:s:) 2
22 n D m bffnc ) bffnc card(u n Denition 2 (Tightly Coverable Graphs) Let Q m, m = ; 2;:::,bea sequence of uniform partitions of [; ] d of resolution =m. LetG be an algorithm which constructs a graph with with k = bffnc vertices U n;k ρ U n, an i.i.d. uniform sample over [; ] d. Dene covers U n;k. The algorithm G is said to generate tightly coverable subgraphs if for any ffl > there exists an M such that for all m > M D m k = fc2ff(q m ):U n;k 2Cg C the minimum volume set in ff(a m ) which sup lim n!» ffl; (a:s:) 22
23 5. Greedy Partition Algorithm Greedy approximation to k-minimal graph (Ravi&etal:94) ) specify a uniform partition Q m of [; ] d having m d cells Q i of resolution =m; ) nd the smallest subset B m k = [ iq i of partition elements containing at least k points 2) out of X n B m k select k points X n;k which minimize L(X n;k ). Properties: Greedy algorithm is polynomial time unlike exponential time exact ffl k-minimal algorithm. ffl Greedy algorithm yields tightly coverable graphs by construction 23
24 Figure 6. A sample of 75 points from the mixture density f(x) = :25f (x)+:75fo(x) where fo is a uniform density over [; ] 2 and f is a bivariate Gaussian density with mean (=2; =2) and diagonal covariance diag(:). A smallest B subset k is the union of the two cross hatched cells shown for the case of m = 5 and k = 7. m 24
25 Figure 7. Another smallest B subset k containing at least k = 7 points for the mixture sample shown in Fig 6. m 25
26 Minimal =m-cover of Probability at least ff If for any C 2 ff(q m ) satisfying P (C) ff the set A 2 ff(q m ) satises P (C) P (A) ff; then A is called a minimal resolution-=m set of probability at least ff. The class of all such sets is denoted A m ff. ) all sets in A m ff have identical coverage probabilities pa m ff ff. 26
27 L(X G m Theorem 2 Let X n be an i.i.d. sample from a distribution having Lebesgue density f (x). Fixff 2 [; ], fl 2 (;d). Letf (d fl)=d be of bounded variation over [; ] d and denote by v(a) its total variation over a subset A ρ [; ] d.letl be a quasi-additive functional with power exponent fl as in Theorem. Then, Z sup lim n! n;bffnc )=nν L;fl min (A) ff A:P ν (xja)dx < f; (a:s:); f where md 2m X d L;fl = f v(q m ff ) + C 3(pA m ff ff)(d fl)=d i= = O(m fl d ); and pa m ff is the coverage probability of minimizing set A = Am ff. 27
28 Main idea behind proof: for large n can index over X n;k ρ X n by indexing over A ρ Borel in [; ] d. min L(X E[ n;bffnc )] ß n;bffnc X inf (A) ff E[L(X n A)]; A:P 28
29 Proof of Theorem uses following lemmas: Lemma For given ff 2 [; ] and a set of n i.i.d. points n [x ;:::;x n ] let T B m n be the minimal cover of bffnc points with X = produced by the greedy subset selection algorithm. Then resolution-=m P inf lim fx n : B m n 2 A m ff g = : n! 29
30 d md Lemma 2 For ν 2 [; ] let f ν be of bounded variation over [; ] d and denote by v(a) its total variation over any subset A 2 [; ] d. Dene the resolution =m block density approximation ~ f (x) = P m d where i = m d R Q i f (x)dx. Then for any A 2 ff(q m ) i= ii Qi (x)» Z A [ ~ f ν (x) f ν (x)]dx» m X v(q i A): i= 3
31 Lemma 3 Assume f is of bounded total variation v(q i ) in each partition cell Q i 2 Q m.leta be any set in the class A m ff. Then for any quasi-additive functional L n (B m bffnc ) def = L(X n B m bffnc ) sup lim n! L n(b m bffnc )=n(d fl)=d L;fl ZA f (d fl)=d (x)dx v(q m ff ); (a:s): md d X L;fl 2m < i= 3
32 =n (d fl)=d Lemma X 4 Let n;bffnc bffnc be any points B selected from. Then, bffnc for any quasi-additive m functional def (B m bffnc ) n L = L n (X n B m bffnc ) sup lim n! L n (B m bffnc ) L(X n;bffnc) < C 3 (pa m ff ff)(d fl)=d ; (a:s:) where pa m ff = P (Am ff ) is the coverage probability of sets Am ff in A m ff. 32
33 ffl + C 3 )v : (2 Interpretations of Theorem 2: ffl Bound f is tight: lim ff! f = and theorem reduces to BHH. Since x2q(q) f (d fl)=d (x)» v([; ] d ) and ffl sup d P m m ff ), )» v([; ]d v(q i= f can be upper bounded by f» 2 L;fl m d + C 3 m fl dλ v([; ] d ): Thus if an upper bound v on the total variation of f is available and the tolerance ffl is given G m n;bffnc )=(bffnc)ν L;fl expf ( ν)r ν )j < ffl jl(x we obtain a selection rule for required partition =m resolution =m» ffl f decreases to zero as a function of resolution =m at overall rate 33
34 O(m fl d ). Thus convergence rate in =m is fastest for small fl. ffl Conjecture: as MST (U n )] LMST ;fln (d fl)=d j = O max(;n (d fl )=d ) je[l [Redmond&Yukich:96], rate of convergence in the limsup of Theorem 2 is at O(=n best ) and this rate can be attained only when =d» d. fl k-minimal graph entropy estimator will have fastest convergence ffl when the Rényi order ν parameter is in the =d» ν < range. Theorem extends easily to I-divergence limit by measure ffl transformation. 34
35 ffl f (x) is 2D separable triangle density on [; ] 2 ffl f (x) is 2D uniform density on [; ] 2 6. Application : MST Discrimination ffl f (x) = ( ffl)f (x) + fflf (x): mixture density 35
36 ROC K, N=256, ε=.,.3,.5,.7,.9 ; ref=unif PD PFA Figure 8. ROC curves for the Rényi information divergence test for detecting triangle-uniform mixture density f = ( ffl)f + fflf (H ) against triangular hypothesis f = f (H ). Curves are increasing in ffl 2 f:; :3; :5; :7; :9g. 36
37 7. Application II: Nonuniform Outlier Rejection 256 random samples MST Figure 9. Left: A scatterplot of a 256 point sample from triangleuniform mixture density with ffl = :. Labels o and * mark those realizations from the uniform and triangular densities, respectively. Right: superimposed is the k-mst implemented directly on the scatterplot X n with k =
38 N=256, k/n=.9, f=.*triang+.9*unif N=256, k/n=.9, f=.*triang+.9*unif Original Coordinates Transformed Coordinates Figure. Left: A sample from triangle-uniform mixture density with ffl = :9 in the transformed domain Y n. Labels o and * mark those realizations from the uniform and triangular densities, respectively. Right: transformed coordinates. 38
39 N=256, k/n=.9, f o =unif, f =triang N=256, k/n=.9, f o =unif, f =triang Clustering in InvTriangle Transformed Data Domain Clustering in Original Data Domain Figure. Left: the k-mst implemented on the transformed scatterplot Y n with k = 23. Right: same k-mst displayed in the original data domain. 39
Estimation of Rényi Information Divergence via Pruned Minimal Spanning Trees 1
Estimation of Rényi Information Divergence via Pruned Minimal Spanning Trees Alfred Hero Dept. of EECS, The university of Michigan, Ann Arbor, MI 489-, USA Email: hero@eecs.umich.edu Olivier J.J. Michel
More informationEntropic Graphs. Alfred O. Hero. Dept. EECS, Dept Biomed. Eng., Dept. Statistics University of Michigan - Ann Arbor
Entropic Graphs Alfred O. Hero Dept. EECS, Dept Biomed. Eng., Dept. Statistics University of Michigan - Ann Arbor hero@eecs.umich.edu http://www.eecs.umich.edu/ hero Collaborators: H. Heemuchwala, J. Costa,
More informationTO APPEAR - IEEE TRANS ON IT 2 Figure List 1. A sample of 75 points from the mixture density f(x) =0:25f 1 (x)+0:75f o (x) where f o is a uniform dens
TO APPEAR - IEEE TRANS ON IT 1 Asymptotic theory of greedy approximations to minimal K-point random graphs Alfred O. Hero and Olivier J.J. Michel Abstract Let X n = fx 1 ;:::;x n g, be an i.i.d. sample
More information1. Entropy Estimation Let fx i g n i=1 be a point cloud in IRd, d 1. Isaac Newton Institute: k-mst 2 fx i g n i=1 are i.i.d.random vectors with unknow
Robust entropy estimation via pruned minimal spanning trees 1. Entropy estimation Hero Alfred of Michigan - Ann Arbor University http://www.eecs.umich.edu/ hero/hero.html 2. Minimal spanning trees (MST)
More informationEntropic-graphs: Applications
Entropic-graphs: Applications Alfred O. Hero Dept. EECS, Dept Biomed. Eng., Dept. Statistics University of Michigan - Ann Arbor hero@eecs.umich.edu http://www.eecs.umich.edu/ hero Collaborators: Huzefa
More informationEntropic Graphs. Alfred O. Hero. Dept. EECS, Dept Biomed. Eng., Dept. Statistics University of Michigan - Ann Arbor
Entropic Graphs Alfred O Hero Dept EECS, Dept Biomed Eng, Dept Statistics University of Michigan - Ann Arbor hero@eecsumichedu http://wwweecsumichedu/ hero Collaborators: A Almal, J Costa, H Neemuchwala,
More informationuniform 2 d distribution (n=100) triangular 2 d distribution (n=100) MST x
Robust Entropy Estimation Strategies Based on Edge Weighted Random Graphs (with corrections) Alfred O. Hero a and Olivier Michel b a Dept. EECS, University of Michigan, Ann Arbor, MI, 489-222 U.S.A. b
More informationGeometric entropy minimization (GEM) for anomaly detection and localization
Geometric entropy minimization (GEM) for anomaly detection and localization Alfred O Hero, III University of Michigan Ann Arbor, MI 89- hero@umich.edu Abstract We introduce a novel adaptive non-parametric
More informationChapter 6. Integration. 1. Integrals of Nonnegative Functions. a j µ(e j ) (ca j )µ(e j ) = c X. and ψ =
Chapter 6. Integration 1. Integrals of Nonnegative Functions Let (, S, µ) be a measure space. We denote by L + the set of all measurable functions from to [0, ]. Let φ be a simple function in L +. Suppose
More informationErgodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R.
Ergodic Theorems Samy Tindel Purdue University Probability Theory 2 - MA 539 Taken from Probability: Theory and examples by R. Durrett Samy T. Ergodic theorems Probability Theory 1 / 92 Outline 1 Definitions
More informationEntrance Exam, Real Analysis September 1, 2017 Solve exactly 6 out of the 8 problems
September, 27 Solve exactly 6 out of the 8 problems. Prove by denition (in ɛ δ language) that f(x) = + x 2 is uniformly continuous in (, ). Is f(x) uniformly continuous in (, )? Prove your conclusion.
More informationMathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio ( )
Mathematical Methods for Neurosciences. ENS - Master MVA Paris 6 - Master Maths-Bio (2014-2015) Etienne Tanré - Olivier Faugeras INRIA - Team Tosca October 22nd, 2014 E. Tanré (INRIA - Team Tosca) Mathematical
More informationMATHS 730 FC Lecture Notes March 5, Introduction
1 INTRODUCTION MATHS 730 FC Lecture Notes March 5, 2014 1 Introduction Definition. If A, B are sets and there exists a bijection A B, they have the same cardinality, which we write as A, #A. If there exists
More informationLecture 5 Channel Coding over Continuous Channels
Lecture 5 Channel Coding over Continuous Channels I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw November 14, 2014 1 / 34 I-Hsiang Wang NIT Lecture 5 From
More informationScheduling and Optimization Course (MPRI)
MPRI Scheduling and optimization: lecture p. /6 Scheduling and Optimization Course (MPRI) Leo Liberti LIX, École Polytechnique, France MPRI Scheduling and optimization: lecture p. /6 Teachers Christoph
More informationScalable robust hypothesis tests using graphical models
Scalable robust hypothesis tests using graphical models Umamahesh Srinivas ipal Group Meeting October 22, 2010 Binary hypothesis testing problem Random vector x = (x 1,...,x n ) R n generated from either
More informationChapter 5. Measurable Functions
Chapter 5. Measurable Functions 1. Measurable Functions Let X be a nonempty set, and let S be a σ-algebra of subsets of X. Then (X, S) is a measurable space. A subset E of X is said to be measurable if
More information2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?
MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due 9/5). Prove that every countable set A is measurable and µ(a) = 0. 2 (Bonus). Let A consist of points (x, y) such that either x or y is
More information3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?
MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due ). Show that the open disk x 2 + y 2 < 1 is a countable union of planar elementary sets. Show that the closed disk x 2 + y 2 1 is a countable
More informationIntroduction to Real Analysis Alternative Chapter 1
Christopher Heil Introduction to Real Analysis Alternative Chapter 1 A Primer on Norms and Banach Spaces Last Updated: March 10, 2018 c 2018 by Christopher Heil Chapter 1 A Primer on Norms and Banach Spaces
More informationApplications of Information Geometry to Hypothesis Testing and Signal Detection
CMCAA 2016 Applications of Information Geometry to Hypothesis Testing and Signal Detection Yongqiang Cheng National University of Defense Technology July 2016 Outline 1. Principles of Information Geometry
More informationMath 421, Homework #9 Solutions
Math 41, Homework #9 Solutions (1) (a) A set E R n is said to be path connected if for any pair of points x E and y E there exists a continuous function γ : [0, 1] R n satisfying γ(0) = x, γ(1) = y, and
More informationMarkov processes Course note 2. Martingale problems, recurrence properties of discrete time chains.
Institute for Applied Mathematics WS17/18 Massimiliano Gubinelli Markov processes Course note 2. Martingale problems, recurrence properties of discrete time chains. [version 1, 2017.11.1] We introduce
More informationABHELSINKI UNIVERSITY OF TECHNOLOGY
Approximation Algorithms Seminar 1 Set Cover, Steiner Tree and TSP Siert Wieringa siert.wieringa@tkk.fi Approximation Algorithms Seminar 1 1/27 Contents Approximation algorithms for: Set Cover Steiner
More informationReal Analysis Chapter 3 Solutions Jonathan Conder. ν(f n ) = lim
. Suppose ( n ) n is an increasing sequence in M. For each n N define F n : n \ n (with 0 : ). Clearly ν( n n ) ν( nf n ) ν(f n ) lim n If ( n ) n is a decreasing sequence in M and ν( )
More informationEC 521 MATHEMATICAL METHODS FOR ECONOMICS. Lecture 1: Preliminaries
EC 521 MATHEMATICAL METHODS FOR ECONOMICS Lecture 1: Preliminaries Murat YILMAZ Boğaziçi University In this lecture we provide some basic facts from both Linear Algebra and Real Analysis, which are going
More informationKernel Density Estimation
EECS 598: Statistical Learning Theory, Winter 2014 Topic 19 Kernel Density Estimation Lecturer: Clayton Scott Scribe: Yun Wei, Yanzhen Deng Disclaimer: These notes have not been subjected to the usual
More informationMathematical Methods for Physics and Engineering
Mathematical Methods for Physics and Engineering Lecture notes for PDEs Sergei V. Shabanov Department of Mathematics, University of Florida, Gainesville, FL 32611 USA CHAPTER 1 The integration theory
More informationThe best expert versus the smartest algorithm
Theoretical Computer Science 34 004 361 380 www.elsevier.com/locate/tcs The best expert versus the smartest algorithm Peter Chen a, Guoli Ding b; a Department of Computer Science, Louisiana State University,
More informationREVIEW OF ESSENTIAL MATH 346 TOPICS
REVIEW OF ESSENTIAL MATH 346 TOPICS 1. AXIOMATIC STRUCTURE OF R Doğan Çömez The real number system is a complete ordered field, i.e., it is a set R which is endowed with addition and multiplication operations
More informationUNSUPERVISED CLUSTERING WITH MST: APPLICATION TO ASTEROID DATA Mai 2005 O. Michel, P.Bendjoya,
UNSUPERVISED CLUSTERING WITH MST: APPLICATION TO ASTEROID DATA Mai 25 O. Michel, P.Bendjoya, Classification...a certain knowhow (a,e,i)computation Proposed metrics(zappala et al., 99) d(i, j) = where mm
More informationChapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1)
Chapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1) Detection problems can usually be casted as binary or M-ary hypothesis testing problems. Applications: This chapter: Simple hypothesis
More informationconditional cdf, conditional pdf, total probability theorem?
6 Multiple Random Variables 6.0 INTRODUCTION scalar vs. random variable cdf, pdf transformation of a random variable conditional cdf, conditional pdf, total probability theorem expectation of a random
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationMeasure-theoretic probability
Measure-theoretic probability Koltay L. VEGTMAM144B November 28, 2012 (VEGTMAM144B) Measure-theoretic probability November 28, 2012 1 / 27 The probability space De nition The (Ω, A, P) measure space is
More informationProbability and Measure
Chapter 4 Probability and Measure 4.1 Introduction In this chapter we will examine probability theory from the measure theoretic perspective. The realisation that measure theory is the foundation of probability
More informationarxiv: v5 [math.pr] 5 Nov 2015
Shortest Path through Random Points Sung Jin Hwang Steven B. Damelin Alfred O. Hero III arxiv:1202.0045v5 [math.pr] 5 Nov 2015 University of Michigan Mathematical Reviews Abstract Let M, g 1 ) be a complete
More informationThe University of Texas at Austin Department of Electrical and Computer Engineering. EE381V: Large Scale Learning Spring 2013.
The University of Texas at Austin Department of Electrical and Computer Engineering EE381V: Large Scale Learning Spring 2013 Assignment 1 Caramanis/Sanghavi Due: Thursday, Feb. 7, 2013. (Problems 1 and
More informationInternational Competition in Mathematics for Universtiy Students in Plovdiv, Bulgaria 1994
International Competition in Mathematics for Universtiy Students in Plovdiv, Bulgaria 1994 1 PROBLEMS AND SOLUTIONS First day July 29, 1994 Problem 1. 13 points a Let A be a n n, n 2, symmetric, invertible
More informationIn class midterm Exam - Answer key
Fall 2013 In class midterm Exam - Answer key ARE211 Problem 1 (20 points). Metrics: Let B be the set of all sequences x = (x 1,x 2,...). Define d(x,y) = sup{ x i y i : i = 1,2,...}. a) Prove that d is
More informationMATH MEASURE THEORY AND FOURIER ANALYSIS. Contents
MATH 3969 - MEASURE THEORY AND FOURIER ANALYSIS ANDREW TULLOCH Contents 1. Measure Theory 2 1.1. Properties of Measures 3 1.2. Constructing σ-algebras and measures 3 1.3. Properties of the Lebesgue measure
More informationh(x) lim H(x) = lim Since h is nondecreasing then h(x) 0 for all x, and if h is discontinuous at a point x then H(x) > 0. Denote
Real Variables, Fall 4 Problem set 4 Solution suggestions Exercise. Let f be of bounded variation on [a, b]. Show that for each c (a, b), lim x c f(x) and lim x c f(x) exist. Prove that a monotone function
More informationDistance-Divergence Inequalities
Distance-Divergence Inequalities Katalin Marton Alfréd Rényi Institute of Mathematics of the Hungarian Academy of Sciences Motivation To find a simple proof of the Blowing-up Lemma, proved by Ahlswede,
More informationA List of Problems in Real Analysis
A List of Problems in Real Analysis W.Yessen & T.Ma December 3, 218 This document was first created by Will Yessen, who was a graduate student at UCI. Timmy Ma, who was also a graduate student at UCI,
More informationLebesgue Integration on R n
Lebesgue Integration on R n The treatment here is based loosely on that of Jones, Lebesgue Integration on Euclidean Space We give an overview from the perspective of a user of the theory Riemann integration
More informationNew concepts: Span of a vector set, matrix column space (range) Linearly dependent set of vectors Matrix null space
Lesson 6: Linear independence, matrix column space and null space New concepts: Span of a vector set, matrix column space (range) Linearly dependent set of vectors Matrix null space Two linear systems:
More informationConditional distributions. Conditional expectation and conditional variance with respect to a variable.
Conditional distributions Conditional expectation and conditional variance with respect to a variable Probability Theory and Stochastic Processes, summer semester 07/08 80408 Conditional distributions
More informationNational Sun Yat-Sen University CSE Course: Information Theory. Maximum Entropy and Spectral Estimation
Maximum Entropy and Spectral Estimation 1 Introduction What is the distribution of velocities in the gas at a given temperature? It is the Maxwell-Boltzmann distribution. The maximum entropy distribution
More informationMGMT 69000: Topics in High-dimensional Data Analysis Falll 2016
MGMT 69000: Topics in High-dimensional Data Analysis Falll 2016 Lecture 14: Information Theoretic Methods Lecturer: Jiaming Xu Scribe: Hilda Ibriga, Adarsh Barik, December 02, 2016 Outline f-divergence
More informationFunctional Analysis Winter 2018/2019
Functional Analysis Winter 2018/2019 Peer Christian Kunstmann Karlsruher Institut für Technologie (KIT) Institut für Analysis Englerstr. 2, 76131 Karlsruhe e-mail: peer.kunstmann@kit.edu These lecture
More informationAdvanced Analysis Qualifying Examination Department of Mathematics and Statistics University of Massachusetts. Tuesday, January 16th, 2018
NAME: Advanced Analysis Qualifying Examination Department of Mathematics and Statistics University of Massachusetts Tuesday, January 16th, 2018 Instructions 1. This exam consists of eight (8) problems
More informationRandom Variables and Their Distributions
Chapter 3 Random Variables and Their Distributions A random variable (r.v.) is a function that assigns one and only one numerical value to each simple event in an experiment. We will denote r.vs by capital
More informationSection 27. The Central Limit Theorem. Po-Ning Chen, Professor. Institute of Communications Engineering. National Chiao Tung University
Section 27 The Central Limit Theorem Po-Ning Chen, Professor Institute of Communications Engineering National Chiao Tung University Hsin Chu, Taiwan 3000, R.O.C. Identically distributed summands 27- Central
More information3 Measurable Functions
3 Measurable Functions Notation A pair (X, F) where F is a σ-field of subsets of X is a measurable space. If µ is a measure on F then (X, F, µ) is a measure space. If µ(x) < then (X, F, µ) is a probability
More informationLinear Regression and Its Applications
Linear Regression and Its Applications Predrag Radivojac October 13, 2014 Given a data set D = {(x i, y i )} n the objective is to learn the relationship between features and the target. We usually start
More informationGENERALITY OF HENSTOCK-KURZWEIL TYPE INTEGRAL ON A COMPACT ZERO-DIMENSIONAL METRIC SPACE
Ø Ñ Å Ø Ñ Ø Ð ÈÙ Ð Ø ÓÒ DOI: 10.2478/v10127-011-0027-z Tatra Mt. Math. Publ. 49 (2011), 81 88 GENERALITY OF HENSTOCK-KURZWEIL TYPE INTEGRAL ON A COMPACT ZERO-DIMENSIONAL METRIC SPACE Francesco Tulone ABSTRACT.
More informationCourse 212: Academic Year Section 1: Metric Spaces
Course 212: Academic Year 1991-2 Section 1: Metric Spaces D. R. Wilkins Contents 1 Metric Spaces 3 1.1 Distance Functions and Metric Spaces............. 3 1.2 Convergence and Continuity in Metric Spaces.........
More informationMATH 202B - Problem Set 5
MATH 202B - Problem Set 5 Walid Krichene (23265217) March 6, 2013 (5.1) Show that there exists a continuous function F : [0, 1] R which is monotonic on no interval of positive length. proof We know there
More informationLecture 21. Hypothesis Testing II
Lecture 21. Hypothesis Testing II December 7, 2011 In the previous lecture, we dened a few key concepts of hypothesis testing and introduced the framework for parametric hypothesis testing. In the parametric
More information12 CHAPTER 1. PRELIMINARIES Lemma 1.3 (Cauchy-Schwarz inequality) Let (; ) be an inner product in < n. Then for all x; y 2 < n we have j(x; y)j (x; x)
1.4. INNER PRODUCTS,VECTOR NORMS, AND MATRIX NORMS 11 The estimate ^ is unbiased, but E(^ 2 ) = n?1 n 2 and is thus biased. An unbiased estimate is ^ 2 = 1 (x i? ^) 2 : n? 1 In x?? we show that the linear
More informationMaxima and Minima. (a, b) of R if
Maxima and Minima Definition Let R be any region on the xy-plane, a function f (x, y) attains its absolute or global, maximum value M on R at the point (a, b) of R if (i) f (x, y) M for all points (x,
More informationMATH5011 Real Analysis I. Exercise 1 Suggested Solution
MATH5011 Real Analysis I Exercise 1 Suggested Solution Notations in the notes are used. (1) Show that every open set in R can be written as a countable union of mutually disjoint open intervals. Hint:
More informationLebesgue s Differentiation Theorem via Maximal Functions
Lebesgue s Differentiation Theorem via Maximal Functions Parth Soneji LMU München Hütteseminar, December 2013 Parth Soneji Lebesgue s Differentiation Theorem via Maximal Functions 1/12 Philosophy behind
More informationRandom Geometric Graphs
Random Geometric Graphs Mathew D. Penrose University of Bath, UK SAMBa Symposium (Talk 2) University of Bath November 2014 1 MODELS of RANDOM GRAPHS Erdos-Renyi G(n, p): start with the complete n-graph,
More informationEcon Lecture 3. Outline. 1. Metric Spaces and Normed Spaces 2. Convergence of Sequences in Metric Spaces 3. Sequences in R and R n
Econ 204 2011 Lecture 3 Outline 1. Metric Spaces and Normed Spaces 2. Convergence of Sequences in Metric Spaces 3. Sequences in R and R n 1 Metric Spaces and Metrics Generalize distance and length notions
More informationStatistical inference on Lévy processes
Alberto Coca Cabrero University of Cambridge - CCA Supervisors: Dr. Richard Nickl and Professor L.C.G.Rogers Funded by Fundación Mutua Madrileña and EPSRC MASDOC/CCA student workshop 2013 26th March Outline
More informationINDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS
INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS STEVEN P. LALLEY AND ANDREW NOBEL Abstract. It is shown that there are no consistent decision rules for the hypothesis testing problem
More informationLearning Intrinsic Dimension and Entropy of High-Dimensional Shape Spaces
1 Learning Intrinsic Dimension and Entropy of High-Dimensional Shape Spaces Jose A. Costa, Alfred O. Hero, III Abstract Given a finite set of random samples from a smooth Riemannian manifold embedded in
More information1 Joint and marginal distributions
DECEMBER 7, 204 LECTURE 2 JOINT (BIVARIATE) DISTRIBUTIONS, MARGINAL DISTRIBUTIONS, INDEPENDENCE So far we have considered one random variable at a time. However, in economics we are typically interested
More informationProblem set 1, Real Analysis I, Spring, 2015.
Problem set 1, Real Analysis I, Spring, 015. (1) Let f n : D R be a sequence of functions with domain D R n. Recall that f n f uniformly if and only if for all ɛ > 0, there is an N = N(ɛ) so that if n
More informationGeometric Inference for Probability distributions
Geometric Inference for Probability distributions F. Chazal 1 D. Cohen-Steiner 2 Q. Mérigot 2 1 Geometrica, INRIA Saclay, 2 Geometrica, INRIA Sophia-Antipolis 2009 July 8 F. Chazal, D. Cohen-Steiner, Q.
More informationIntegration on Measure Spaces
Chapter 3 Integration on Measure Spaces In this chapter we introduce the general notion of a measure on a space X, define the class of measurable functions, and define the integral, first on a class of
More informationChapter 4. Measure Theory. 1. Measure Spaces
Chapter 4. Measure Theory 1. Measure Spaces Let X be a nonempty set. A collection S of subsets of X is said to be an algebra on X if S has the following properties: 1. X S; 2. if A S, then A c S; 3. if
More informationOptimal Estimation of a Nonsmooth Functional
Optimal Estimation of a Nonsmooth Functional T. Tony Cai Department of Statistics The Wharton School University of Pennsylvania http://stat.wharton.upenn.edu/ tcai Joint work with Mark Low 1 Question Suppose
More informationPerformance Limits on the Classification of Kronecker-structured Models
Performance Limits on the Classification of Kronecker-structured Models Ishan Jindal and Matthew Nokleby Electrical and Computer Engineering Wayne State University Motivation: Subspace models (Union of)
More informationCoarse Geometry. Phanuel Mariano. Fall S.i.g.m.a. Seminar. Why Coarse Geometry? Coarse Invariants A Coarse Equivalence to R 1
Coarse Geometry 1 1 University of Connecticut Fall 2014 - S.i.g.m.a. Seminar Outline 1 Motivation 2 3 The partition space P ([a, b]). Preliminaries Main Result 4 Outline Basic Problem 1 Motivation 2 3
More informationACO Comprehensive Exam October 14 and 15, 2013
1. Computability, Complexity and Algorithms (a) Let G be the complete graph on n vertices, and let c : V (G) V (G) [0, ) be a symmetric cost function. Consider the following closest point heuristic for
More informationProblem Set 5: Solutions Math 201A: Fall 2016
Problem Set 5: s Math 21A: Fall 216 Problem 1. Define f : [1, ) [1, ) by f(x) = x + 1/x. Show that f(x) f(y) < x y for all x, y [1, ) with x y, but f has no fixed point. Why doesn t this example contradict
More informationACO Comprehensive Exam October 18 and 19, Analysis of Algorithms
Consider the following two graph problems: 1. Analysis of Algorithms Graph coloring: Given a graph G = (V,E) and an integer c 0, a c-coloring is a function f : V {1,,...,c} such that f(u) f(v) for all
More informationEntropic graphs for registration
Entropic graphs for registration Huzefa Neemuchwala and Alfred Hero Abstract In many applications, fusion of images acquired via two or more sensors requires image alignment to an identical pose, a process
More informationExercises with solutions (Set D)
Exercises with solutions Set D. A fair die is rolled at the same time as a fair coin is tossed. Let A be the number on the upper surface of the die and let B describe the outcome of the coin toss, where
More informationNordhaus-Gaddum Theorems for k-decompositions
Nordhaus-Gaddum Theorems for k-decompositions Western Michigan University October 12, 2011 A Motivating Problem Consider the following problem. An international round-robin sports tournament is held between
More informationApplications of Entropic Spanning Graphs
Applications of Entropic Spanning Graphs Alfred Hero y,bingma #, Olivier Michel Λ, and John Gorman + Dept of EECS y, The University of Michigan, Ann Arbor, MI 489-222, USA M-Vision Inc., Belleville MI,
More informationMonte-Carlo MMD-MA, Université Paris-Dauphine. Xiaolu Tan
Monte-Carlo MMD-MA, Université Paris-Dauphine Xiaolu Tan tan@ceremade.dauphine.fr Septembre 2015 Contents 1 Introduction 1 1.1 The principle.................................. 1 1.2 The error analysis
More informationIntroduction to Probability Theory
Introduction to Probability Theory Ping Yu Department of Economics University of Hong Kong Ping Yu (HKU) Probability 1 / 39 Foundations 1 Foundations 2 Random Variables 3 Expectation 4 Multivariate Random
More informationNon-parametric Inference and Resampling
Non-parametric Inference and Resampling Exercises by David Wozabal (Last update 3. Juni 2013) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend
More informationX. Hu, R. Shonkwiler, and M.C. Spruill. School of Mathematics. Georgia Institute of Technology. Atlanta, GA 30332
Approximate Speedup by Independent Identical Processing. Hu, R. Shonkwiler, and M.C. Spruill School of Mathematics Georgia Institute of Technology Atlanta, GA 30332 Running head: Parallel iip Methods Mail
More informationChapter 1: Probability Theory Lecture 1: Measure space, measurable function, and integration
Chapter 1: Probability Theory Lecture 1: Measure space, measurable function, and integration Random experiment: uncertainty in outcomes Ω: sample space: a set containing all possible outcomes Definition
More informationMeasure and Integration: Solutions of CW2
Measure and Integration: s of CW2 Fall 206 [G. Holzegel] December 9, 206 Problem of Sheet 5 a) Left (f n ) and (g n ) be sequences of integrable functions with f n (x) f (x) and g n (x) g (x) for almost
More informationConcentration of Measures by Bounded Size Bias Couplings
Concentration of Measures by Bounded Size Bias Couplings Subhankar Ghosh, Larry Goldstein University of Southern California [arxiv:0906.3886] January 10 th, 2013 Concentration of Measure Distributional
More information2-Distance Problems. Combinatorics, 2016 Fall, USTC Week 16, Dec 20&22. Theorem 1. (Frankl-Wilson, 1981) If F is an L-intersecting family in 2 [n],
Combinatorics, 206 Fall, USTC Week 6, Dec 20&22 2-Distance Problems Theorem (Frankl-Wilson, 98 If F is an L-intersecting family in 2 [n], then F L k=0 ( n k Proof Let F = {A, A 2,, A m } where A A 2 A
More informationChapter 1: Probability Theory Lecture 1: Measure space and measurable function
Chapter 1: Probability Theory Lecture 1: Measure space and measurable function Random experiment: uncertainty in outcomes Ω: sample space: a set containing all possible outcomes Definition 1.1 A collection
More informationMeasure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond
Measure Theory on Topological Spaces Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond May 22, 2011 Contents 1 Introduction 2 1.1 The Riemann Integral........................................ 2 1.2 Measurable..............................................
More informationS6880 #7. Generate Non-uniform Random Number #1
S6880 #7 Generate Non-uniform Random Number #1 Outline 1 Inversion Method Inversion Method Examples Application to Discrete Distributions Using Inversion Method 2 Composition Method Composition Method
More informationPROOF OF ZADOR-GERSHO THEOREM
ZADOR-GERSHO THEOREM FOR VARIABLE-RATE VQ For a stationary source and large R, the least distortion of k-dim'l VQ with nth-order entropy coding and rate R or less is δ(k,n,r) m k * σ 2 η kn 2-2R = Z(k,n,R)
More informationFeature selection. c Victor Kitov August Summer school on Machine Learning in High Energy Physics in partnership with
Feature selection c Victor Kitov v.v.kitov@yandex.ru Summer school on Machine Learning in High Energy Physics in partnership with August 2015 1/38 Feature selection Feature selection is a process of selecting
More informationMath 61CM - Solutions to homework 6
Math 61CM - Solutions to homework 6 Cédric De Groote November 5 th, 2018 Problem 1: (i) Give an example of a metric space X such that not all Cauchy sequences in X are convergent. (ii) Let X be a metric
More informationWeek 2 Spring Lecture 3. The Canonical normal means estimation problem (cont.).! (X) = X+ 1 X X, + (X) = X+ 1
Week 2 Spring 2009 Lecture 3. The Canonical normal means estimation problem (cont.). Shrink toward a common mean. Theorem. Let X N ; 2 I n. Let 0 < C 2 (n 3) (hence n 4). De ne!! (X) = X+ 1 Then C 2 X
More informationAnalysis Qualifying Exam
Analysis Qualifying Exam Spring 2017 Problem 1: Let f be differentiable on R. Suppose that there exists M > 0 such that f(k) M for each integer k, and f (x) M for all x R. Show that f is bounded, i.e.,
More informationSOLUTIONS OF SELECTED PROBLEMS
SOLUTIONS OF SELECTED PROBLEMS Problem 36, p. 63 If µ(e n < and χ En f in L, then f is a.e. equal to a characteristic function of a measurable set. Solution: By Corollary.3, there esists a subsequence
More information