Robustness of classifiers to uniform l p and Gaussian noise Supplementary material
|
|
- Barrie Woods
- 5 years ago
- Views:
Transcription
1 Robustness of classifiers to uniform l and Gaussian noise Sulementary material Jean-Yves Franceschi Ecole Normale Suérieure de Lyon LIP UMR 5668 Omar Fawzi Ecole Normale Suérieure de Lyon LIP UMR 5668 Alhussein Fawzi UCLA Vision Lab In these aendices we rove the theoretical results stated in the main article A Preliminary Results In this section we exlicitly comute r x for a linear classifier as described in the main article Lemma For all [ ] the l -distance from any oint x to the decision hyerlane H defined by f z 0 is: if : if : if : r x r x r x f x w ; f x w ; f x w Overall for all [ ] the l -distance from any oint x to the decision hyerlane H : f z 0 is: Proof We distinguish between the three cases r x f x w Suose The distance from x to H is equal to the minimum radius α of a ball ie for l a hyercube centered at x that intersects H This intersection with minimum radius necessarily contains a vertex of the hyercube To determine which one it suffices to determine which vector x + αε with ε d first intersects H when α increases starting at 0 Such an intersection arises when w T x + αε + b 0 so α fx and since α must be non-negative: w T ε r x min f x f x fx w T ε0 w T ε w because ε d simly choose ε i sign f x w i Now at DeeMind
2 Suose In this case the roof is symmetric to the one for with ε d having exactly one non-zero coordinate Suose The distance from x to H is equal to the minimum radius α of an l ball B centered at x that intersects H This ball is described by the following equation where z is the variable: z i x i α i For such a minimum radius the lane described by f z 0 is tangent to B at some oint x + n Let us assume without loss of generality that every coordinate of n is non-negative We also know that this hyerlane is described by the following equations where z is the variable: x+n i z i x i α T z x + n 0 i i n i z i x i n i 0 n i z i x i α beacuse n belongs to the boundary of B The last equation thus describes the same hyerlane as w T z b Therefore there exists λ R \ 0 such that i n i λw i Then since x + n B : and since x + n H: we have λ α fx Finally: i n i x i + n i x i λ w i n i α i w i x i + n i + b f x + w T n 0 i α α i n i f x w i λw i α f x w B Robustness of Linear Classifiers to l Noise B Main Theorem Theorem Let [ ] Let [ ] be such that + Then there exist universal constants C c c > 0 such that for all ε < c c : where ζ ε C ε and ζ ε c c ε Theorem is roved by the following lemmas ζ εd / w r εx / w w r ζ εd x w
3 Lemma There exists a universal constant C > 0 such that where ζ ε C ε r ε x / w r ζ εd x w Proof Let us first exress conveniently P v B g x g x + αv where v B means that v is chosen uniformly at random in B : P v B g x g x + αv P v B f x f x + αv 0 P v B sign f x w T x + b sign f x αw T v P v B w r P v B w x P r v B w x r x sign f x αw T v w T v w T v 3 where Eq is given by Lemma and Eq and 3 follow from v B v B Markov s inequality gives from Eq 3: P v B g x g x + αv E v B [ d i w iv i ] w r x In [Barthe et al 005 Theorem 7] it is roved that there is a constant C 0 > 0 such that: w i v i C0 w E v B i d Therefore: P v B g x g x + αv C 0 d w w r x So if < ε d C0 w constant C C0 w r x then P v B g x g x + αv < ε Thus there is a universal > 0 such that: r ε x / w r ζ εd x w Lemma 3 There exist universal constants c c > 0 such that for all ε < c c : where ζ ε c c ε r ε x / w r ζ εd x w 3
4 Proof We first transform the exression of P v B g x g x + αv: P v B g x g x + αv P v B w r x P r v B w x P v B Var w T v w w T v w T v r x w T v Var w T v Paley-Zygmund s inequality states that if X is a random variable with finite variance and t [0 ] then: P X > te [X] t E [X] E [X ] w Note that E T v v B because E Varw T v v B w T v 0 So by using Paley-Zygmund s inequality v with X wt Varw T v and t r w x w Varw T v when VarwT v r x : So if > P v B g x g x + αv Varw T v εe[w T v 4 ] w Varw T v r x E v B [ wt v 4 Varw T v ] w r x then P v B g x g x + αv > ε According to [Barthe et al 005 Theorem 7] there is a universal constant c 0 > 0 such that: for Var w T v : Var w T v c0 d w ; for E [ w T v 4 ] : [ w E T v ] 4 4 4C0 w So there are universal constants c c 0 c 5C 4 0 > 0 such that: d r ε x / w r ζ εd x w B Alternative Lower Bound Actually the lower bound of Theorem may be imroved for most -norms by the following result Lemma 4 There exists a universal constant C > 0 such that where ζ ε C log 3 min ε r ε x / w r ζ εd x w 4
5 Proof Let min We have: where t w P v B g x g x + αv P v B w r x P v B e θt ex θw T v r x for any θ > 0 Markov s inequality gives: P v B g x g x + αv e θt E v B [ ex θw T v ] e θt e θt k0 k0 k! E v B [ θw T v k ] since w T v is symmetric In [Barthe et al 005 Theorem 7] it is roved that: w T v k! E v B [ θw T v k ] if k d and : if k d and > : if k > d and : E v B E v B E v B i k w i v i C 0 k d i k w i v i C 0 k d i w k ; w k ; k w i v i C 0 w k C 0 k d w k ; if k > d and > : E v B k w i v i C 0 d d i w k C 0 k d w k where C 0 is a universal constant the same as in the roof of Lemma So overall: E v B k w i v i C 0 k k w d i Thus: P v B g x g x + αv e θt k0 θ C 0 k k! d w k 5
6 We can bound the following ower series using Stirling-like bounds [Robbins 955] in 4 and 5: k0 k k k! xk + π k k k k k+ k e x k + k k e x k π + π k + e π k + e π k ex ex k e k x k k e x k k! k e x k k! 4 5 Therefore: By choosing θ t P v B g x g x + αv 3e θt ex θ e C 0 w d e C 0k d w : P v B g x g x + αv 3 ex t d e C0 w r x 3 ex d w e C0 w So if < C d w ln 3 w r ε x then P v B g x g x + αv < ε where C > 0 e C0 is a universal constant and: r ε x / w r ζ εd x w B3 Tyical Value of the Multilicative Factor Proosition For any ] if w is a random direction uniformly distributed over the unit l -shere then as d : d / w w d Γ as π Moreover for d w w d ln d as g Proof w can be written as g where g g g d are iid with normal distribution µ 0 σ The law of large numbers gives that for : d i g i as E g Γ 6 + π
7 Thus: and for ]: g d Γ + as π d g d g Γ as π because g d as For we use a result roved in [Galambos 987 Examle 44] directly imlying that Using the revious comutations for we find: g ln d as d w w d ln d as C Robustness of Linear Classifiers to Gaussian Noise C Main Theorem Theorem For ε < 3 ζ ε ln ε and ζ ε ζ ε w r Σεx Σw r x Theorem is roved by the following lemmas Lemma 5 For ζ ε ln ε 3ε : ζ ε w Σw r Σε x r x ζ ε w Σw Proof As in the roof of Lemma : P v N 0Σ g x + αv g x P v N 0Σ w r x w T v Since v N 0 Σ follows a multivariate normal distribution with a ositive definite covariance matrix Σ if Σ is the symmetric square root of Σ then v Σv with v N 0 I d So: P v N 0Σ g x + αv g x P v N 0Id w r x w T Σv P v N 0Id w r x T Σw v If v N 0 I d then T Σw v N 0 α Σw Therefore: P v N 0Σ g x + αv g x ex w r x α Σw 7
8 So if < w ln ε r Σw x then P v N 0Σ g x + αv g x < ε Thus r Σε x r x ζ ε w Σw Lemma 6 For ε < 3 and ζ ε 3ε r Σε x r x ζ ε w Σw Proof P v N 0Σ g x + αv g x P v N 0Id w r x T Σw v P v N 0I d w r x α T Σw v P w v N 0I d α r x Σw Σw T v Σw Note that E v N 0Id Σw T v Σw Var v N0I d Σw Σw T v So by using Paley-Zygmund s inequality when w Σw r x : P v N 0Σ g x + αv g x So if > 3ε w α w r Σw x Σw Σw 4 Σw 4 α w r Σw x 3 α w r Σw x Σw 4 Σw 4 + Σw r x then P v N 0Σ g x + αv g x > ε Therefore r Σε x r x ζ ε w Σw C Tyical Value of the Multilicative Factor Proosition Let Σ be a d d ositive semidefinite matrix with Tr Σ If w is a random direction uniformly distributed over the unit l -shere then for t π 8 d: w P d Σw t where t 5 t ex t + ex 8d t 8d Tr Σ + ex 00 Tr Σ 8
9 Proof Suose that w is a random direction uniformly distributed over the unit l shere g Then w can be written as g where g g g d are iid with normal distribution µ 0 σ By using this reresentation in the orthogonal basis in which Σ is diagonal we get g Σg d i g i d i λ ig i where Σ Diag λ i in the reviously mentioned orthogonal basis Let us focus on the concentration of d i λ ig i We have: λ i g i λ i g i i i λ i One of Bernstein-tye inequalities [Bernstein 97] can be alied: P λ i g i E gi t Var g i E gi i for t β Tr Σ where β π 8 is a constant ie for t β : P Σg t t ex Tr Σ i i λ i g i E gi i λ 4 i e t g has a chi-squared distribution so using a simle concentration inequality for the chi-squared distribution : P d g t ex dt Overall for t βd and t 5 t: g P d Σg t g P d Σg Σg t g P d d Σg Σg t d g + Σg P t Σg d P d g t + P Σg d t d + P Σg 0 so using the revious inequalities: g P d Σg t ex t t + ex 8d 8d Tr Σ + ex 00 Tr Σ Because Γ k+ π E g i k E gi k π 4 k! for all k > Using the fact that g is a sum of indeendent sub-exonential random variables see htts://wwwstatberkeley edu/~mjwain/stat0b/cha_tailbounds_jan_05df Examle 5 for instance 9
10 D Robustness of LAF Classifiers to l and Gaussian Noise Theorem 3 Let [ ] Let [ ] be such that + Let ε 0 ζ ε ζ ε be as in Theorem Then for all ε < ε 0 the following holds Assume f is a classifier that is γ η-laf at oint x and x be such that r x x x Then: γζ εd / fx fx r εx r x and rovided r ε x r x + γζ εd / fx fx η + γζ εd / fx fx r x η lim Proof Let f and f + be functions such that the searating hyerlanes of resectively H γ x x and H + γ x x are described by equations resectively f z 0 and f + z 0 By definition we know that r f x γ r x and r f + x + γ r x From the definition of LAF classifiers since for all η γ +γ η lim z Hγ x x B x η f z f x > 0 we have r ε f x r ε x; indeed if x + αv with α γ +γ η lim is not misclassified by f then it is not misclassified by f Therefore by alying Lemma to f we get: γ ζ ε d / fx fx r εx r x Since as long as η η lim z H γ + x x B x η f z f x < 0 we can aly a symmetric reasoning for f + and get: r ε x r + γζ εd / fx x fx Theorem 4 Let Σ be a d d ositive semidefinite matrix with TrΣ Let ε 0 ζ ε ζ ε as in Theorem Then for all ε < ε 0 the following holds Assume f is a classifier that is γ η-laf at oint x and x be such that r x x x Then: and rovided γζ ε r Σε x r x + γζ η + γ + 8 Tr Σ ln 4 ζ ε fx r Σεx Σ fx r x 3ε fx Σ fx 3ε fx r Σ fx x η lim Proof This roof can be directly adated from the roof of Theorem 3 The difference in the Gaussian case is that v is no longer samled from the unit ball and its norm is not limited anymore However its norm can be bounded with high robability and this enables to adat the bounds of Theorem 3 to the Gaussian case Indeed using a Bernstein inequality as in the roof of Proosition we have: t P Σv t ex 8 Tr Σ ε for t ψε 8 Tr Σ ln 4 ε 0
11 Let us focus on the uer bound for this roof; the lower bound follows by a similar reasoning From the definition of LAF classifiers since for all for all η η lim z H γ + x x B x η f z f x < 0 we have r Σε x r Σ 3ε f + x; indeed if x + αv with α η lim +ψε is misclassified by f + then it is misclassified by f if αv η lim Therefore by alying Lemma 6 to f + : r Σε x ε fx r x + γζ Σ fx E Generalization to Multi-class Classifiers We resent in this section a generalization of Theorem to multi-class linear classifiers and discuss about the generalization of the other results to the multi-class case A classifier f is said to be linear if for all k L there are vector w k b k such that f k x w T k x + b ε k In this setting Theorem can be generalized by relacing ζ ε by ζ L in the lower bound Theorem 5 Let [ ] Let [ ] be such that + Let ε 0 ζ ε ζ ε be the constants as defined in Theorem Let k g x the label attributed to x by f j be a class such that x + r x lies on the decision boundary between classes k and j ie the class of the adversarial ertubation of x and j w argmin k w l l w k w l Then for all ε < ε 0 : ε ζ d / w k w j r εx L w k w j r ζ εd / w k w j x w k w j Proof We first define for the sake of the demonstration for any class l the adversarial erturbation in the binary case where only classes k and l are considered: r x l argmin r st f k x + r < f j x + r r It is then ossible to exress conveniently P v B g x g x + αv: P v B g x g x + αv P v B l k f k x < f l x + αv P v B l k w l w k T v f k x f l x P v B l k w l w k T v r x l w l w k Let us first rove the inequality on the uer bound as in Lemma 3 w j w k T P v B g x g x + αv P v B v r x j w j w k w j w k T P v B v r x w j w k by definition of j Then using the same reasoning as in Lemma 3 leads to r ε x r x ζ εd / w k w j w k w j Let us then rove the inequality on the lower bounds as in Lemma We use the union bound to derive the inequality: P v B g x g x + αv w l w k T P v B v r x l w l w k l k w l w k T P v B v r x w l w k l k
12 because r x r x l for all l Moreover for < ζ the reasoning of Lemma for each l k: P v B g x g x + αv l k ε L d w k w j w k w j r x by following ε L ε Therefore: r ε x ε r ζ d / w k w j x L w k w j The roof of this theorem uses the union bound to obtain the lower bound exlaining that ζ ε in the binary case becomes ζ ε L in the multi-class setting However this inequality reresents a worst case in the majoration used in the roof and we observed in our exeriments that using the coefficient ζ ε instead of ζ ε L gives a roer lower bound on rεx r x Notice that it is ossible to generalize other results that we roved in the binary case Lemma 4 Theorems and 3 to the multi-class roblem with a similar transormation of the inequalities relacing ζ ε by ζ ε L and using similar definitions of j and j References [Barthe et al 005] Barthe F Guédon O Mendelson S and Naor A 005 A robabilistic aroach to the geometry of the l n -ball The Annals of Probability 33: [Bernstein 97] Bernstein S 97 Theory of Probability [Galambos 987] Galambos J 987 The Asymtotic Theory of Extreme Order Statistics R E Krieger second edition [Robbins 955] Robbins H 955 A remark on stirling s formula The American Mathematical Monthly 6:6 9
Improved Bounds on Bell Numbers and on Moments of Sums of Random Variables
Imroved Bounds on Bell Numbers and on Moments of Sums of Random Variables Daniel Berend Tamir Tassa Abstract We rovide bounds for moments of sums of sequences of indeendent random variables. Concentrating
More informationTHE 2D CASE OF THE BOURGAIN-DEMETER-GUTH ARGUMENT
THE 2D CASE OF THE BOURGAIN-DEMETER-GUTH ARGUMENT ZANE LI Let e(z) := e 2πiz and for g : [0, ] C and J [0, ], define the extension oerator E J g(x) := g(t)e(tx + t 2 x 2 ) dt. J For a ositive weight ν
More informationElementary Analysis in Q p
Elementary Analysis in Q Hannah Hutter, May Szedlák, Phili Wirth November 17, 2011 This reort follows very closely the book of Svetlana Katok 1. 1 Sequences and Series In this section we will see some
More informationLecture 6. 2 Recurrence/transience, harmonic functions and martingales
Lecture 6 Classification of states We have shown that all states of an irreducible countable state Markov chain must of the same tye. This gives rise to the following classification. Definition. [Classification
More informationGENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS
GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS PALLAVI DANI 1. Introduction Let Γ be a finitely generated grou and let S be a finite set of generators for Γ. This determines a word metric on
More information#A37 INTEGERS 15 (2015) NOTE ON A RESULT OF CHUNG ON WEIL TYPE SUMS
#A37 INTEGERS 15 (2015) NOTE ON A RESULT OF CHUNG ON WEIL TYPE SUMS Norbert Hegyvári ELTE TTK, Eötvös University, Institute of Mathematics, Budaest, Hungary hegyvari@elte.hu François Hennecart Université
More informationUniform Law on the Unit Sphere of a Banach Space
Uniform Law on the Unit Shere of a Banach Sace by Bernard Beauzamy Société de Calcul Mathématique SA Faubourg Saint Honoré 75008 Paris France Setember 008 Abstract We investigate the construction of a
More informationSums of independent random variables
3 Sums of indeendent random variables This lecture collects a number of estimates for sums of indeendent random variables with values in a Banach sace E. We concentrate on sums of the form N γ nx n, where
More informationarxiv: v1 [physics.data-an] 26 Oct 2012
Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch
More informationHENSEL S LEMMA KEITH CONRAD
HENSEL S LEMMA KEITH CONRAD 1. Introduction In the -adic integers, congruences are aroximations: for a and b in Z, a b mod n is the same as a b 1/ n. Turning information modulo one ower of into similar
More informationFinite Mixture EFA in Mplus
Finite Mixture EFA in Mlus November 16, 2007 In this document we describe the Mixture EFA model estimated in Mlus. Four tyes of deendent variables are ossible in this model: normally distributed, ordered
More informationJournal of Mathematical Analysis and Applications
J. Math. Anal. Al. 44 (3) 3 38 Contents lists available at SciVerse ScienceDirect Journal of Mathematical Analysis and Alications journal homeage: www.elsevier.com/locate/jmaa Maximal surface area of a
More informationStochastic integration II: the Itô integral
13 Stochastic integration II: the Itô integral We have seen in Lecture 6 how to integrate functions Φ : (, ) L (H, E) with resect to an H-cylindrical Brownian motion W H. In this lecture we address the
More informationECON 4130 Supplementary Exercises 1-4
HG Set. 0 ECON 430 Sulementary Exercises - 4 Exercise Quantiles (ercentiles). Let X be a continuous random variable (rv.) with df f( x ) and cdf F( x ). For 0< < we define -th quantile (or 00-th ercentile),
More informationCERIAS Tech Report The period of the Bell numbers modulo a prime by Peter Montgomery, Sangil Nahm, Samuel Wagstaff Jr Center for Education
CERIAS Tech Reort 2010-01 The eriod of the Bell numbers modulo a rime by Peter Montgomery, Sangil Nahm, Samuel Wagstaff Jr Center for Education and Research Information Assurance and Security Purdue University,
More informationSolvability and Number of Roots of Bi-Quadratic Equations over p adic Fields
Malaysian Journal of Mathematical Sciences 10(S February: 15-35 (016 Secial Issue: The 3 rd International Conference on Mathematical Alications in Engineering 014 (ICMAE 14 MALAYSIAN JOURNAL OF MATHEMATICAL
More informationwhere x i is the ith coordinate of x R N. 1. Show that the following upper bound holds for the growth function of H:
Mehryar Mohri Foundations of Machine Learning Courant Institute of Mathematical Sciences Homework assignment 2 October 25, 2017 Due: November 08, 2017 A. Growth function Growth function of stum functions.
More information2 Asymptotic density and Dirichlet density
8.785: Analytic Number Theory, MIT, sring 2007 (K.S. Kedlaya) Primes in arithmetic rogressions In this unit, we first rove Dirichlet s theorem on rimes in arithmetic rogressions. We then rove the rime
More informationEstimation of the large covariance matrix with two-step monotone missing data
Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo
More informationInformation collection on a graph
Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements
More information4. Score normalization technical details We now discuss the technical details of the score normalization method.
SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules
More informationCHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules
CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules. Introduction: The is widely used in industry to monitor the number of fraction nonconforming units. A nonconforming unit is
More information1 Extremum Estimators
FINC 9311-21 Financial Econometrics Handout Jialin Yu 1 Extremum Estimators Let θ 0 be a vector of k 1 unknown arameters. Extremum estimators: estimators obtained by maximizing or minimizing some objective
More informationThe non-stochastic multi-armed bandit problem
Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at
More informationŽ. Ž. Ž. 2 QUADRATIC AND INVERSE REGRESSIONS FOR WISHART DISTRIBUTIONS 1
The Annals of Statistics 1998, Vol. 6, No., 573595 QUADRATIC AND INVERSE REGRESSIONS FOR WISHART DISTRIBUTIONS 1 BY GERARD LETAC AND HELENE ` MASSAM Universite Paul Sabatier and York University If U and
More informationApproximating min-max k-clustering
Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost
More informationIMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES
IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES OHAD GILADI AND ASSAF NAOR Abstract. It is shown that if (, ) is a Banach sace with Rademacher tye 1 then for every n N there exists
More informationAnalysis of some entrance probabilities for killed birth-death processes
Analysis of some entrance robabilities for killed birth-death rocesses Master s Thesis O.J.G. van der Velde Suervisor: Dr. F.M. Sieksma July 5, 207 Mathematical Institute, Leiden University Contents Introduction
More informationVarious Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various Families of R n Norms and Some Open Problems
Int. J. Oen Problems Comt. Math., Vol. 3, No. 2, June 2010 ISSN 1998-6262; Coyright c ICSRS Publication, 2010 www.i-csrs.org Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various
More informationElements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley
Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the
More informationON THE SET a x + b g x (mod p) 1 Introduction
PORTUGALIAE MATHEMATICA Vol 59 Fasc 00 Nova Série ON THE SET a x + b g x (mod ) Cristian Cobeli, Marian Vâjâitu and Alexandru Zaharescu Abstract: Given nonzero integers a, b we rove an asymtotic result
More informationThe Hasse Minkowski Theorem Lee Dicker University of Minnesota, REU Summer 2001
The Hasse Minkowski Theorem Lee Dicker University of Minnesota, REU Summer 2001 The Hasse-Minkowski Theorem rovides a characterization of the rational quadratic forms. What follows is a roof of the Hasse-Minkowski
More informationECE 534 Information Theory - Midterm 2
ECE 534 Information Theory - Midterm Nov.4, 009. 3:30-4:45 in LH03. You will be given the full class time: 75 minutes. Use it wisely! Many of the roblems have short answers; try to find shortcuts. You
More information2 Asymptotic density and Dirichlet density
8.785: Analytic Number Theory, MIT, sring 2007 (K.S. Kedlaya) Primes in arithmetic rogressions In this unit, we first rove Dirichlet s theorem on rimes in arithmetic rogressions. We then rove the rime
More informationOn a class of Rellich inequalities
On a class of Rellich inequalities G. Barbatis A. Tertikas Dedicated to Professor E.B. Davies on the occasion of his 60th birthday Abstract We rove Rellich and imroved Rellich inequalities that involve
More informationAsymptotically Optimal Simulation Allocation under Dependent Sampling
Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee
More informationMollifiers and its applications in L p (Ω) space
Mollifiers and its alications in L () sace MA Shiqi Deartment of Mathematics, Hong Kong Batist University November 19, 2016 Abstract This note gives definition of mollifier and mollification. We illustrate
More informationNumerical Linear Algebra
Numerical Linear Algebra Numerous alications in statistics, articularly in the fitting of linear models. Notation and conventions: Elements of a matrix A are denoted by a ij, where i indexes the rows and
More informationLecture: Condorcet s Theorem
Social Networs and Social Choice Lecture Date: August 3, 00 Lecture: Condorcet s Theorem Lecturer: Elchanan Mossel Scribes: J. Neeman, N. Truong, and S. Troxler Condorcet s theorem, the most basic jury
More informationLecture 3 Consistency of Extremum Estimators 1
Lecture 3 Consistency of Extremum Estimators 1 This lecture shows how one can obtain consistency of extremum estimators. It also shows how one can find the robability limit of extremum estimators in cases
More informationSystem Reliability Estimation and Confidence Regions from Subsystem and Full System Tests
009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract
More informationInformation collection on a graph
Information collection on a grah Ilya O. Ryzhov Warren Powell October 25, 2009 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements
More informationStrong Matching of Points with Geometric Shapes
Strong Matching of Points with Geometric Shaes Ahmad Biniaz Anil Maheshwari Michiel Smid School of Comuter Science, Carleton University, Ottawa, Canada December 9, 05 In memory of Ferran Hurtado. Abstract
More informationA Note on Guaranteed Sparse Recovery via l 1 -Minimization
A Note on Guaranteed Sarse Recovery via l -Minimization Simon Foucart, Université Pierre et Marie Curie Abstract It is roved that every s-sarse vector x C N can be recovered from the measurement vector
More informationMarcinkiewicz-Zygmund Type Law of Large Numbers for Double Arrays of Random Elements in Banach Spaces
ISSN 995-0802, Lobachevskii Journal of Mathematics, 2009, Vol. 30, No. 4,. 337 346. c Pleiades Publishing, Ltd., 2009. Marcinkiewicz-Zygmund Tye Law of Large Numbers for Double Arrays of Random Elements
More informationChapter 7: Special Distributions
This chater first resents some imortant distributions, and then develos the largesamle distribution theory which is crucial in estimation and statistical inference Discrete distributions The Bernoulli
More informationLecture 4: Law of Large Number and Central Limit Theorem
ECE 645: Estimation Theory Sring 2015 Instructor: Prof. Stanley H. Chan Lecture 4: Law of Large Number and Central Limit Theorem (LaTeX reared by Jing Li) March 31, 2015 This lecture note is based on ECE
More informationPETER J. GRABNER AND ARNOLD KNOPFMACHER
ARITHMETIC AND METRIC PROPERTIES OF -ADIC ENGEL SERIES EXPANSIONS PETER J. GRABNER AND ARNOLD KNOPFMACHER Abstract. We derive a characterization of rational numbers in terms of their unique -adic Engel
More informationE-companion to A risk- and ambiguity-averse extension of the max-min newsvendor order formula
e-comanion to Han Du and Zuluaga: Etension of Scarf s ma-min order formula ec E-comanion to A risk- and ambiguity-averse etension of the ma-min newsvendor order formula Qiaoming Han School of Mathematics
More informationA Social Welfare Optimal Sequential Allocation Procedure
A Social Welfare Otimal Sequential Allocation Procedure Thomas Kalinowsi Universität Rostoc, Germany Nina Narodytsa and Toby Walsh NICTA and UNSW, Australia May 2, 201 Abstract We consider a simle sequential
More informationp-adic Measures and Bernoulli Numbers
-Adic Measures and Bernoulli Numbers Adam Bowers Introduction The constants B k in the Taylor series exansion t e t = t k B k k! k=0 are known as the Bernoulli numbers. The first few are,, 6, 0, 30, 0,
More informationState Estimation with ARMarkov Models
Deartment of Mechanical and Aerosace Engineering Technical Reort No. 3046, October 1998. Princeton University, Princeton, NJ. State Estimation with ARMarkov Models Ryoung K. Lim 1 Columbia University,
More informationk- price auctions and Combination-auctions
k- rice auctions and Combination-auctions Martin Mihelich Yan Shu Walnut Algorithms March 6, 219 arxiv:181.3494v3 [q-fin.mf] 5 Mar 219 Abstract We rovide for the first time an exact analytical solution
More informationTowards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK
Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)
More informationAn Estimate For Heilbronn s Exponential Sum
An Estimate For Heilbronn s Exonential Sum D.R. Heath-Brown Magdalen College, Oxford For Heini Halberstam, on his retirement Let be a rime, and set e(x) = ex(2πix). Heilbronn s exonential sum is defined
More informationImproved Capacity Bounds for the Binary Energy Harvesting Channel
Imroved Caacity Bounds for the Binary Energy Harvesting Channel Kaya Tutuncuoglu 1, Omur Ozel 2, Aylin Yener 1, and Sennur Ulukus 2 1 Deartment of Electrical Engineering, The Pennsylvania State University,
More informationSECTION 12: HOMOTOPY EXTENSION AND LIFTING PROPERTY
SECTION 12: HOMOTOPY EXTENSION AND LIFTING PROPERTY In the revious section, we exloited the interlay between (relative) CW comlexes and fibrations to construct the Postnikov and Whitehead towers aroximating
More informationInteractive Hypothesis Testing Against Independence
013 IEEE International Symosium on Information Theory Interactive Hyothesis Testing Against Indeendence Yu Xiang and Young-Han Kim Deartment of Electrical and Comuter Engineering University of California,
More information1 1 c (a) 1 (b) 1 Figure 1: (a) First ath followed by salesman in the stris method. (b) Alternative ath. 4. D = distance travelled closing the loo. Th
18.415/6.854 Advanced Algorithms ovember 7, 1996 Euclidean TSP (art I) Lecturer: Michel X. Goemans MIT These notes are based on scribe notes by Marios Paaefthymiou and Mike Klugerman. 1 Euclidean TSP Consider
More informationarxiv: v4 [math.nt] 11 Oct 2017
POPULAR DIFFERENCES AND GENERALIZED SIDON SETS WENQIANG XU arxiv:1706.05969v4 [math.nt] 11 Oct 2017 Abstract. For a subset A [N], we define the reresentation function r A A(d := #{(a,a A A : d = a a }
More informationMarch 4, :21 WSPC/INSTRUCTION FILE FLSpaper2011
International Journal of Number Theory c World Scientific Publishing Comany SOLVING n(n + d) (n + (k 1)d ) = by 2 WITH P (b) Ck M. Filaseta Deartment of Mathematics, University of South Carolina, Columbia,
More informationMath 4400/6400 Homework #8 solutions. 1. Let P be an odd integer (not necessarily prime). Show that modulo 2,
MATH 4400 roblems. Math 4400/6400 Homework # solutions 1. Let P be an odd integer not necessarily rime. Show that modulo, { P 1 0 if P 1, 7 mod, 1 if P 3, mod. Proof. Suose that P 1 mod. Then we can write
More informationMultiplicity of weak solutions for a class of nonuniformly elliptic equations of p-laplacian type
Nonlinear Analysis 7 29 536 546 www.elsevier.com/locate/na Multilicity of weak solutions for a class of nonuniformly ellitic equations of -Lalacian tye Hoang Quoc Toan, Quô c-anh Ngô Deartment of Mathematics,
More informationA Special Case Solution to the Perspective 3-Point Problem William J. Wolfe California State University Channel Islands
A Secial Case Solution to the Persective -Point Problem William J. Wolfe California State University Channel Islands william.wolfe@csuci.edu Abstract In this aer we address a secial case of the ersective
More informationOn Z p -norms of random vectors
On Z -norms of random vectors Rafa l Lata la Abstract To any n-dimensional random vector X we may associate its L -centroid body Z X and the corresonding norm. We formulate a conjecture concerning the
More informationPositive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Application
BULGARIA ACADEMY OF SCIECES CYBEREICS AD IFORMAIO ECHOLOGIES Volume 9 o 3 Sofia 009 Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Alication Svetoslav Savov Institute of Information
More informationConsistency and asymptotic normality
Consistency an asymtotic normality Class notes for Econ 842 Robert e Jong March 2006 1 Stochastic convergence The asymtotic theory of minimization estimators relies on various theorems from mathematical
More informationCOMMUNICATION BETWEEN SHAREHOLDERS 1
COMMUNICATION BTWN SHARHOLDRS 1 A B. O A : A D Lemma B.1. U to µ Z r 2 σ2 Z + σ2 X 2r ω 2 an additive constant that does not deend on a or θ, the agents ayoffs can be written as: 2r rθa ω2 + θ µ Y rcov
More informationThe spectrum of kernel random matrices
The sectrum of kernel random matrices Noureddine El Karoui Deartment of Statistics, University of California, Berkeley Abstract We lace ourselves in the setting of high-dimensional statistical inference,
More informationSUMS OF TWO SQUARES PAIR CORRELATION & DISTRIBUTION IN SHORT INTERVALS
SUMS OF TWO SQUARES PAIR CORRELATION & DISTRIBUTION IN SHORT INTERVALS YOTAM SMILANSKY Abstract. In this work we show that based on a conjecture for the air correlation of integers reresentable as sums
More information1 Riesz Potential and Enbeddings Theorems
Riesz Potential and Enbeddings Theorems Given 0 < < and a function u L loc R, the Riesz otential of u is defined by u y I u x := R x y dy, x R We begin by finding an exonent such that I u L R c u L R for
More informationElementary theory of L p spaces
CHAPTER 3 Elementary theory of L saces 3.1 Convexity. Jensen, Hölder, Minkowski inequality. We begin with two definitions. A set A R d is said to be convex if, for any x 0, x 1 2 A x = x 0 + (x 1 x 0 )
More informationApplications to stochastic PDE
15 Alications to stochastic PE In this final lecture we resent some alications of the theory develoed in this course to stochastic artial differential equations. We concentrate on two secific examles:
More informationarxiv: v2 [math.ac] 5 Jan 2018
Random Monomial Ideals Jesús A. De Loera, Sonja Petrović, Lily Silverstein, Desina Stasi, Dane Wilburne arxiv:70.070v [math.ac] Jan 8 Abstract: Insired by the study of random grahs and simlicial comlexes,
More informationCommutators on l. D. Dosev and W. B. Johnson
Submitted exclusively to the London Mathematical Society doi:10.1112/0000/000000 Commutators on l D. Dosev and W. B. Johnson Abstract The oerators on l which are commutators are those not of the form λi
More informationGeneral Linear Model Introduction, Classes of Linear models and Estimation
Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)
More informationOn the stability and integration of Hamilton-Poisson systems on so(3)
Rendiconti di Matematica, Serie VII Volume 37, Roma (016), 1 4 On the stability and integration of Hamilton-Poisson systems on so(3) R. M. ADAMS R. BIGGS W. HOLDERBAUM C. C. REMSING Abstract: We consider
More information1 Probability Spaces and Random Variables
1 Probability Saces and Random Variables 1.1 Probability saces Ω: samle sace consisting of elementary events (or samle oints). F : the set of events P: robability 1.2 Kolmogorov s axioms Definition 1.2.1
More informationLEIBNIZ SEMINORMS IN PROBABILITY SPACES
LEIBNIZ SEMINORMS IN PROBABILITY SPACES ÁDÁM BESENYEI AND ZOLTÁN LÉKA Abstract. In this aer we study the (strong) Leibniz roerty of centered moments of bounded random variables. We shall answer a question
More informationBEST CONSTANT IN POINCARÉ INEQUALITIES WITH TRACES: A FREE DISCONTINUITY APPROACH
BEST CONSTANT IN POINCARÉ INEQUALITIES WITH TRACES: A FREE DISCONTINUITY APPROACH DORIN BUCUR, ALESSANDRO GIACOMINI, AND PAOLA TREBESCHI Abstract For Ω R N oen bounded and with a Lischitz boundary, and
More informationSTRONG TYPE INEQUALITIES AND AN ALMOST-ORTHOGONALITY PRINCIPLE FOR FAMILIES OF MAXIMAL OPERATORS ALONG DIRECTIONS IN R 2
STRONG TYPE INEQUALITIES AND AN ALMOST-ORTHOGONALITY PRINCIPLE FOR FAMILIES OF MAXIMAL OPERATORS ALONG DIRECTIONS IN R 2 ANGELES ALFONSECA Abstract In this aer we rove an almost-orthogonality rincile for
More informationNew Information Measures for the Generalized Normal Distribution
Information,, 3-7; doi:.339/info3 OPEN ACCESS information ISSN 75-7 www.mdi.com/journal/information Article New Information Measures for the Generalized Normal Distribution Christos P. Kitsos * and Thomas
More informationADAMS INEQUALITY WITH THE EXACT GROWTH CONDITION IN R 4
ADAMS INEQUALITY WITH THE EXACT GROWTH CONDITION IN R 4 NADER MASMOUDI AND FEDERICA SANI Contents. Introduction.. Trudinger-Moser inequality.. Adams inequality 3. Main Results 4 3. Preliminaries 6 3..
More informationElliptic Curves Spring 2015 Problem Set #1 Due: 02/13/2015
18.783 Ellitic Curves Sring 2015 Problem Set #1 Due: 02/13/2015 Descrition These roblems are related to the material covered in Lectures 1-2. Some of them require the use of Sage, and you will need to
More informationAP Calculus Testbank (Chapter 10) (Mr. Surowski)
AP Calculus Testbank (Chater 1) (Mr. Surowski) Part I. Multile-Choice Questions 1. The grah in the xy-lane reresented by x = 3 sin t and y = cost is (A) a circle (B) an ellise (C) a hyerbola (D) a arabola
More informationThe Longest Run of Heads
The Longest Run of Heads Review by Amarioarei Alexandru This aer is a review of older and recent results concerning the distribution of the longest head run in a coin tossing sequence, roblem that arise
More informationOnline Learning of Noisy Data with Kernels
Online Learning of Noisy Data with Kernels Nicolò Cesa-Bianchi Università degli Studi di Milano cesa-bianchi@dsiunimiit Shai Shalev Shwartz The Hebrew University shais@cshujiacil Ohad Shamir The Hebrew
More informationOn information plus noise kernel random matrices
On information lus noise kernel random matrices Noureddine El Karoui Deartment of Statistics, UC Berkeley First version: July 009 This version: February 5, 00 Abstract Kernel random matrices have attracted
More informationNOTES. Hyperplane Sections of the n-dimensional Cube
NOTES Edited by Sergei Tabachnikov Hyerlane Sections of the n-dimensional Cube Rolfdieter Frank and Harald Riede Abstract. We deduce an elementary formula for the volume of arbitrary hyerlane sections
More informationDepartment of Mathematics
Deartment of Mathematics Ma 3/03 KC Border Introduction to Probability and Statistics Winter 209 Sulement : Series fun, or some sums Comuting the mean and variance of discrete distributions often involves
More informationON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS
#A13 INTEGERS 14 (014) ON THE LEAST SIGNIFICANT ADIC DIGITS OF CERTAIN LUCAS NUMBERS Tamás Lengyel Deartment of Mathematics, Occidental College, Los Angeles, California lengyel@oxy.edu Received: 6/13/13,
More informationMODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL
Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management
More informationNew weighing matrices and orthogonal designs constructed using two sequences with zero autocorrelation function - a review
University of Wollongong Research Online Faculty of Informatics - Paers (Archive) Faculty of Engineering and Information Sciences 1999 New weighing matrices and orthogonal designs constructed using two
More informationOn-Line Appendix. Matching on the Estimated Propensity Score (Abadie and Imbens, 2015)
On-Line Aendix Matching on the Estimated Proensity Score Abadie and Imbens, 205 Alberto Abadie and Guido W. Imbens Current version: August 0, 205 The first art of this aendix contains additional roofs.
More informationA New Perspective on Learning Linear Separators with Large L q L p Margins
A New Persective on Learning Linear Searators with Large L q L Margins Maria-Florina Balcan Georgia Institute of Technology Christoher Berlind Georgia Institute of Technology Abstract We give theoretical
More informationAVERAGES OF EULER PRODUCTS, DISTRIBUTION OF SINGULAR SERIES AND THE UBIQUITY OF POISSON DISTRIBUTION
AVERAGES OF EULER PRODUCTS, DISTRIBUTION OF SINGULAR SERIES AND THE UBIQUITY OF POISSON DISTRIBUTION EMMANUEL KOWALSKI Abstract. We discuss in some detail the general roblem of comuting averages of convergent
More informationANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM
ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM JOHN BINDER Abstract. In this aer, we rove Dirichlet s theorem that, given any air h, k with h, k) =, there are infinitely many rime numbers congruent to
More informationEötvös Loránd University Faculty of Informatics. Distribution of additive arithmetical functions
Eötvös Loránd University Faculty of Informatics Distribution of additive arithmetical functions Theses of Ph.D. Dissertation by László Germán Suervisor Prof. Dr. Imre Kátai member of the Hungarian Academy
More informationCoding Along Hermite Polynomials for Gaussian Noise Channels
Coding Along Hermite olynomials for Gaussian Noise Channels Emmanuel A. Abbe IG, EFL Lausanne, 1015 CH Email: emmanuel.abbe@efl.ch Lizhong Zheng LIDS, MIT Cambridge, MA 0139 Email: lizhong@mit.edu Abstract
More informationCHAPTER 2: SMOOTH MAPS. 1. Introduction In this chapter we introduce smooth maps between manifolds, and some important
CHAPTER 2: SMOOTH MAPS DAVID GLICKENSTEIN 1. Introduction In this chater we introduce smooth mas between manifolds, and some imortant concets. De nition 1. A function f : M! R k is a smooth function if
More information