arxiv: v2 [math.pr] 8 Feb 2016

Size: px
Start display at page:

Download "arxiv: v2 [math.pr] 8 Feb 2016"

Transcription

1 Noname manuscript No will be inserted by the editor Bounds on Tail Probabilities in Exponential families Peter Harremoës arxiv:600579v [mathpr] 8 Feb 06 Received: date / Accepted: date Abstract In this paper we present various new inequalities for tail proabilities for distributions that are elements of the most improtant exponential families These families include the Poisson distributions, the Gamma distributions, the binomial distributions, the negative binomial distributions and the inverse Gaussian distributions All these exponential families have simple variance functions and the variance functions play an important role in the exposition All the inequalities presented in this paper are formulated in terms of the signed log-likelihood The inequalities are of a qualitative nature in that they can be formulated either in terms of stochastic domination or in terms of an intersection property that states that a certain discrete distribution is very close to a certain continuous distribution Keywords Tail probability exponential family signed Log-likelihood variance function inequalities Mathematics Subject Classification E5 6E7 60F0 Introduction Let X,, X n be iid random variables such that the moment generating function β E[expβX ] is finite in a neighborhood of the origin For a fixed value of µ one is interested in approximating the tail distribution: Pr n i X i µ If µ is close to the mean of X one would usually approximate the tail probability by the tail probability of a Gaussian random variable If µ is far from the mean of X the tail probability can be estimated using large deviation theory According to the Sanov theorem the probability that the deviation from the mean is as large as µ is of the Peter Harremoës Copenhagen Business College Copenhagen, Denmark Tel: harremoes@ieeeorg

2 Peter Harremoës order exp D where D is a constant or to be more precise ln Pr n i X i µ D n for n Bahadur and Rao [, ] improved the estimate of this large deviation probability, and in [5] such Gaussian tail approximations were extended to situations where one normally uses large deviation techniques The distribution of the signed log-likelihood is close to a standard Gaussian for a variety of distributions An asymptotic result for large sample sizes this is not new [0], but in this paper we are interested in inequalities that hold for any sample size Some inequalities of this type can be found in [? 9, 6,? ], but here we attempt to give a more systematic presentation including a number of new or improved inequalities In this paper we let τ denote the circle constant π and φ will denote the standard Gaussian density exp x τ / We let Φ denote the distribution function of the standard Gaussian Φ t ˆ t φ x dx The rest of the paper is organized as follows In Section we define the signed log-likelihood of exponential families and look at some of the fundamental properties of the signed log-likelihood Next we study inequalities for the signed log-likelihood for certain exponential families associated with continuous waiting times We start with the inverse Gaussian in Section 3 that is particularly simple Then we study the exponential distributions Section 4 and more general Gamma distributions Section 5 Next we turn our attention to discrete waiting times First we obtain some new inequalities for the geometric distributions Section 6 and then we generalize the results to negative binomial distributions Section 7 The negative binomial distributions are waiting times in Bernoulli processes, so in Section 8 our inequalities between negative binomial distributions and Gamma distributions are translated into inequalities between binomial distributions and Poisson distributions Combined with our domination inequalities for Gamma distributions we obtain an intersection inequality between binomial distributions and the Standard Gaussian distribution In this paper the focus is on intersection inequalities and stochastic domination inequalities, but in the discussion we mention some related inequalities of other types and how they may be improved The signed log-likelihood for exponential families Consider the -dimensional exponential family P β where dp β exp β x x dp 0 Z β and Z denotes the moment generating function given by Zβ exp β x dp 0 x Let P µ denote the element in the exponential family with mean value µ, and let ˆβ µ

3 Bounds on Tail Probabilities 3 denote the corresponding maximum likelihood estimate of β Let µ 0 denote the mean value of P 0 Then ˆ D P µ dp µ P 0 ln x dp µ x dp 0 The variance function of an exponential family is defined so that V µ is the variance of P µ The variance functions uniquely characterizes the corresponding exponential families and most important exponential families have very simple variance functions If we know the varince function the divergence can be calculated according to the following formula Lemma In an exponential family P µ parametrized by mean value µand with variance function V information divergence can be calculated according to the formula ˆ µ D P µ P µ µ µ µ V µ dµ Proof The divergence is given by ˆ D P µ P µ dp µ ln dp µ ˆ ln The derivative with respect to β is expβx Zβ expβ x Zβ x dp µ x dp µ x ˆ β β x ln Z β + ln Z β dp µ x β β µ ln Z β + ln Z β β D P µ P µ µ µ Therefore the derivative with respect to µ is Together with the trivial identity D P µ P µ µ µ µ dµ dβ µ µ V µ the results follows D P µ P µ ˆ µ µ µ µ V µ dµ Definition From [3] Let X be a random variable with distribution P 0 Then the signed log-likelihood G X of X is the random variable given by { [D P x P 0 ] /, for x < µ 0 ; G x + [D P x P 0 ] /, for x µ 0

4 4 Peter Harremoës We will need the following general lemma Lemma If the variance function is increasing then is a decreasing function of x Proof We have G x x µ 0 d G x x µ 0 D x Gx G x dx x µ 0 x µ 0 x µ 0 µ 0 x V µ dµ D x µ 0 G x x µ 0 x µ 0 V µ dµ D x µ 0 G x We have to prove that numerator is positive for x < µ 0 and negative for x > µ 0 The numerator can be calculated as ˆ x ˆ x ˆ x µ 0 µ 0 V µ dµ D x µ x 0 µ 0 V µ dµ µ µ 0 µ 0 V µ dµ ˆ x x µ0 V µ µ µ 0 dµ V µ If x > µ 0 then µ ˆ 0 x x + µ 0 µ µ 0 V µ dµ ˆ x x + µ 0 µ µ 0 V µ dµ ˆ x+µ 0 x + µ 0 µ µ 0 V µ ˆ x+µ 0 x + µ 0 µ µ 0 V x+µ 0 ˆ x ˆ x x + µ 0 µ dµ + dµ x+µ 0 V µ 0 ˆ x dµ + x + µ 0 µ µ 0 V x+µ 0 dµ 0 The inequality for x < µ 0 is proved in the same way x+µ 0 0 x + µ 0 µ V x+µ 0 dµ 3 Inequalities for inverse Gaussian The inverse Gaussian distribution and it is used to model waiting times for a Wiener process Brownian motion with drift An inverse Gaussian distribution has density f w [ ] / λ λ w µ τw 3 exp µ w

5 Bounds on Tail Probabilities 5 Fig The signed log-likelihood of an inverse Gaussian distribution with mean value and shape parameter with mean value parameter µ and shape parameter λ The variance function is V µ µ 3 /λ The divergence of an inverse Gaussian distribution with mean µ from an inverse Gaussian distribution with mean µ is ˆ µ Hence the signed log-likelihood is We observe that µ µ µ µ 3 /λ dµ λ µ µ µ µ G µ,λ w λ / w µ w / µ G µ,λ w [ λ µ ] / w µ [ w µ [ ] / λ G, µ ] / w µ Note that the saddle-point approximation [4] is exact for the family of inverse Gaussian distributions, ie φ G w f w [V w] / Lemma 3 The probability density of the random variable G µ,λ W is where g denotes the function G, φ z + g z [ ] µ / λ

6 6 Peter Harremoës Proof The density of G µ,λ W is Now we use that Hence f G µ,λ z G µ,λ z G µ,λ φg µ,λg µ,λ z [V G µ,λ z] / G µ,λ G µ,λ z φ z [ ] V G µ,λ z / G µ,λ G µ,λ z G µ,λ w w / µλ / / w / µλ / w µ wµ λ / µ + w w 3 / µ [ [V w] / G w 3 µ,λ w λ Therefore the density of G µ,λ W is f z ]/ µ + w µ φ z µ G µ,λ z + µ By isolating x in the equation G µ,λ x z we get G µ,λ z µ G, λ / µ + w w 3 / µ [ µ ] / z λ Hence which proves the theorem f z G, φ z z [ µ λ ] / +, Lemma 4 From [6] Let X and X denote random variables with density functions f and f If f x f x for x x 0 and f x f x for x x 0, then X is stochastically dominated by X In particular if f x f x is increasing then X is stochastically dominated by X Proof Assume that f x f x for x x 0 and that f x f x for x x 0 For t x 0 we have P X t ˆ t ˆ t f x dx f x dx P X t

7 Bounds on Tail Probabilities 7 Fig The density of the signed log-likelihood of an inverse Gaussian distribution blue with mean value and shape parameter comparead with the density of a standard Gaussian distribution red Fig 3 Plot of the quantiles of a standard Gaussian vs the same quantiles of the signed log-likelihood of the invers Gaussian with µ and λ Similarly it is proved that P X t P X t for t x 0 but this implies that P X > t P X > t If f x f x is increasing then there exist a number x 0 such that f x f x for x x 0 and that f x f x for x x 0 Theorem If W is inverse Gauiisan distributed IG µ, λ then the signed loglikelihood G W µ,λ W is stochastically dominated by the standard Gaussian distribution, ie the inequality holds for any w ]0, [ Φ G W µ,λ w Pr W w Proof We have to prove that if W has an inverse Gaussian distribution then G W is stochastically dominated by the standard Gaussian According to Lemma 4 we can prove stochastic dominance by proving that φ z /f z is increasing Now φ z g z [ ] µ / f z λ + which is increasing because the function g is increasing

8 8 Peter Harremoës If Wald random variables are added they become more and more Gaussian and so do their signed log-likelihood The next theorem states that the convergence of the signed log-likelihood towards the standard Gaussian is monotone in stochastic domination Theorem Assume that W and W have inverse Gaussian distributions let G W and G W denote their signed log-likelihood Then G W is stochantically dominated by G W if and only if µ λ > µ λ Proof We have to prove that the densities satisfy φ z g z [ µ λ ] / + < φ z g z [ µ λ ] / + for z > 0 and the reverse inequality for z < 0 The inequality is equivalent to ] / ] / > g g z [ µ λ For z > 0 this follows because the function g is increasing The reversed inequality is proved in the same way z [ µ λ 4 Exponential distributions Although the tail probabilities of the exponential distribution are easy to calculate the inequalities related to the signed log-likelihood of the exponential distribution are non-trivial and will be useful later The exponential distribution Exp has density The distribution function is Pr X x f x exp x ˆ x 0 exp t dt exp x The mean of the exponential distribution Exp is and the variance is so the variance function is V µ µ The divergence can be calculated as ˆ D Exp Exp µ µ dµ ln From this we see that [ x G Exp x ± ln x ] / x γ

9 Bounds on Tail Probabilities 9 Fig 4 The signed log-likelihood γ x of an exponential distribution where γ denotes the function γ x { [ x ln x] /, when x ; + [ x ln x] /, when x > Note that the saddle-point approximation is exact for the family of exponential distributions, ie f x τ / φ G x e [V x] / Lemma 5 The density of the signed log-likelihood of an exponential random variable is given by τ / e zφ z γ z Proof Let X be a Exp distributed random variable The density of the signed loglikelihood is f G z G G z τ / e τ / e φgg z [V G z] / G G z The variance function is V x x so the density is τ / e φ z [V G z] / G G z φ z G z G G z

10 0 Peter Harremoës Fig 5 Plot of the quantiles of a standard Gaussian vs the quantiles of the signed log-likelihood of an exponential distribution From G D follows that G G D so that G x Hence the density of G X can be written as τ / e φ z G z G z GG z dd dx G x x G x τ / e τ / e zφ z G z zφ z γ z, which proves the lemma Theorem 3 The signed log-likelihood of an exponentially distributed random variable is stochastically dominated by the standard Gaussian Proof The quotient between the density of a standard Gaussian and the density of G X is e τ γ z / z We have to prove that this quotient is increasing The function γ is increasing so it is sufficient to prove that t γt is increasing or equivalently that γ t t is decreasing This follows from Lemma because the variance function is increasing

11 Bounds on Tail Probabilities 5 Gamma distributions The sum of k exponentially distributed random variables is Gamma distributed Γ k, where k is called the shape parameter and is the scale parameter It has density f x k Γ k xk exp x and this formula is used to define the Gamma distribution when k is not an integer The mean of the Gamma distribution Γ k, is k and the variance is k so the variance function is V µ µ /k The divergence can be calculated as ˆ k µ k D Γ k, Γ k, k µ /k dµ k ln Further we have that G Γ k, x k / x γ k Note that the saddle-point approximation is exact for the family of Gamma distributions, ie f x kk exp k Γ k kk τ / exp k Γ k k / exp k x k ln x k x φ G Γ k, x [V x] / Proposition If F denotes the distribution function of the distribution Γ k, with mean µ k then d dµ F t equals minus the density of the distribution Γ k +, Proof We have F t ˆ t 0 ˆ t/ 0 Γ k xk exp k x dx Γ k yk exp y dx Hence d d F t Γ k k t exp t t k k+ Γ k + xk exp d dµ F t k+ Γ k + xk exp x x

12 Peter Harremoës Fig 6 The quantiles of a standard Gaussian vs gamma distributions for k blue, k 5 black, and k 0 green The red line corresponds to a perfect match with a standard Gaussian The dependence on on shape and scaling is determined from the equation From this we see that D Γ k, x x Γ k, k k k ln x k x k k ln x k [ x G k, x ± k k ln x ] / k [ ±k / x k ln x ] / k k / x γ k which proves the proposition The following lemma is proved in the same way as Lemma 5 Lemma 6 The density of the signed log-likelihood of a Gamma random variable is given by k k τ / z exp k k / φ z Γ k k / γ z Theorem 4 From [6] The signed log-likelihood of a Gamma distributed random variable is stochastically dominated by the standard Gaussian, ie k / Pr X x Φ G Γ x Proof This is proved in the same way as the corresponding result for exponential distributions

13 Bounds on Tail Probabilities 3 Theorem 5 Let X and X denote Gamma disstributed random variables with shape parameters k and k and scale parameters and The the signed loglikelihood of X is dominated by the signed log-likelihood of X if and only if k k Proof We have to prove that z k / γ φ z z k / z k / φ z < γ z k / for z > 0 and the reverse inequality for z < 0 The inequality is equivalent to γ z k / z k / This follows because the function γ < γ t t z k / z k / is increasing and because both sides have the same limit as z tends to zero from the right 6 Geometric distributions Compounding a Poisson distribution P o λ with rate parameter λ distributed according to an exponential distribution Exp leads a geometric distribution that we will denote Geo We note that this is an unusual way of parametrizing the geometric distributions, but it will be usuful for some of our calculations Since λ is both the mean and the variance of P o λ the mean of Geo is and the variance is V µ µ + µ The point probabilities of a negative binomial distribution can be written as Pr M m ˆ 0 ˆ The distribution function can be calculated as λ m exp λ m! exp λ dλ t m exp t exp t dt 0 m! m + m+ m j Pr M m + j+ j0 m+ +

14 4 Peter Harremoës Fig 7 Plot the quantiles of the signed log-likelihood of Exp 35 vs the quantiles of the signed log-likelihood of Geo 35 The divergence is given by D Geo Geo ˆ µ µ + µ dµ ln + ln + + Hence the signed log-likelihood of the geometric distribution with mean is given by [ g x ± x ln x ] / x + x + ln + Theorem 6 Assume that the random variable M has a geometric distribution Geo and let the random variable X be exponentially distributed Exp If Pr X x Pr M < m then G Geo m / G Exp x G Geo m Proof First we note that G Exp x γ x / and Pr X x Pr X / x / Therefore we introduce the variable y x / and the random variable Y X / that is exponentially distributed Exp We will prove that Pr Y y Pr M < m implies g m / γ y g m The two inequalities are proved separately First we prove that Pr Y y Pr M < m implies that g m / γ y Equivalently we have to prove that γ y g m / γ y g m / γ y + g m /

15 Bounds on Tail Probabilities 5 is positive The probability Pr M < m is a decreasing function of Therefore the probability Pr Y y is a decreasing function of, but the destribution of Y does not depend on so y must be a decreasing function of Therefore the denominator γ y + g m / is a decreasing function of and it equals zero when m / The numerator also equals zero when m / so it is sufficient to prove that the numerator is a decreasing function of Therefore we have to prove the inequality γ y g m 0 or, equivalently, that g m γ y One also have to prove that Pr Y y Pr M < m implies that γ y g m and it is sufficient to prove that γ y g m We have γ y dy d For the geometric distribution we have g m Therefore we have to prove that m + / + dy d d γ y dy y m ln m m + m + m + + dy d m ln m y + Therefore Equation can be solved as m exp y + + y m ln The derivative is dy d m + m + m +

16 6 Peter Harremoës Finally we have to prove that m + / + m + m + / m + + / ln / ln ln m ln + m + ln + + The right ineqality is trivial The left inequality is equivalent to + ln + 0 / m + We have for The derivative is negative which proves the theorem + ln + 0 / / + +, Corollary Assume that the random variable M has a geometric distribution Geo and let the random variable X be exponential distributed Exp If G Exp x G Geo m then Pr M < m Pr X x Pr M m If we plot quantiles of an exponential distribution against the corresponding quantiles of the signed log-likelihood of a Geometric distribution we get a stair case function, ie a sequence of horisontal lines The inequality means that the left endpoint of any step is to the left of the line y x Actually the line y x intersects each step and we say that the plot has an intersection property as illustrated in Figure 7 Proof Since implies Pr X x Pr M < m G Exp x G Geo m and both Pr X x and G Exp x are increasing functions of x we have that G Exp x G Geo m implies that Pr X x Pr M < m Since Pr X x Pr M < m

17 Bounds on Tail Probabilities 7 implies G Geo m / G Exp x we have that G Geo m / G Exp x implies that Pr X x Pr M < m Hence G Geo m + / G Exp x implies that Pr X x Pr M < m + Pr M m Since G Geo m G Geo m + / we also have that G Geo m G Exp x implies that Pr X x Pr M m 7 Inequalities for negative binomial distributions Compounding a Poisson distribution P o λ with rate parameter λ distributed according to a Gamma distribution Γ k, leads a negative binomial distribution The link to waiting times in Bernoulli processes will be explored in Sectoin 8 In this section we will parametrize the negative binomial distribution as neg k, where k and are the prameters of the corresponding Gamma distribution We note that this is an unusual way of parametrizing the negative binomial distribution, but it will be usuful for some of our calculations Since λ is both the mean and the variance of P o λ we can calculate the mean of neg k, as µ k and the variance as V µ µ + µ k The point probaiblities of a negative binomial distribution can be written in several ways Pr M m ˆ 0 ˆ 0 λ m m! exp λ k Γ k λk exp λ dλ t m m! Γ m + k m!γ k exp t Γ k tk exp t dt m + m+k We need an explicite formula for the divergence that is given by ˆ k µ k D neg k, neg k, dµ k µ + µ k k ln + ln + + The log-likelihood is given by where g is given by Equation We will need the following lemma G negk, x k / g x k Lemma 7 A Poisson random variable K with distribution P o λ satisfies d Pr K k Pr K k dλ

18 8 Peter Harremoës Proof If X is a Gamma distributed Γ k +, then Pr K k Pr K < k + Pr X λ Hence d Pr K k dλ k Γ k + λk+ exp λ λk exp λ, k! which proves the lemma Lemma 8 If the distribution of M k is neg k, then the partial derivative of the point probability with respect to the mean value parameter equals where M k+ is neg k +, Proof We have d dµ Pr M k m dµ d d dµ Pr M k m Pr M k+ m k d d ˆ 0 ˆ 0 ˆ 0 m P o t; j Γ k tk exp t dt j0 t P o t; m Γ k tk exp t dt P o t; m Γ k + tk exp t dt The last integral equals Pr M k+ m, which proves the lemma The following theorem generalizes Corollary from k to arbitrary positive values of k We cannot use the same proof technique because we do not have an explicite formula for the quantile function for the Gamma distributions except in the case when k Lemma 4 cannot be used because we want to compare a discrete distribution with a continuous function Instead the proof combines a proof method developed by Zubkov and Serov [] with the ideas and results developed in the previous sections Theorem 7 Assume that the random variable M has a negative binomial distribution neg k, and let the random variable X be Gamma distributed Γ k, If G Γ k, x G negk, m then Pr M < m Pr X x Pr M m 3

19 Bounds on Tail Probabilities 9 Proof Below we only give the proof of the upper bound in Inequality 3 The lower bound is proved the in the same way First we note that G Γ k, x G Γ k, x / and Pr X x Pr X / x / Therefore we introduce the variable y x / and the random variable Y X / that is Gamma distributed Γ k, Introduce the difference and note that δ µ 0 Pr M m Pr Y y δ 0 lim µ 0 δ µ We note that there exists at least one value of µ 0 such that δ µ 0 0 It is sufficient to prove that δ is first increasing and then decreasing in [0, [ According to Lemma 8 the derivative of Pr M m with respect to µ 0 is Γ m + k + m Pr M m µ 0 m!γ k + + m+k+ m + k Γ m + k m k + m!γ k + m+k m + k Pr M m µ 0 + k ˆ + Pr M m + where µ 0 /k is the scale parameter and where and ˆ m /k is the maximum likelihood estimate of the scale parameter Let ˆPr denote the probability of M calculated with respect to this maximum likelihood estimate ˆ Then we have m + k Pr M m + exp D ˆPr M m The condition can be written as which implies G Γ k, x G negk, m k / y γ k / g ˆ k y ˆ γ g k Differentiation with respect to gives k dy y k d ˆ + so that dy d k y ˆ +

20 0 Peter Harremoës Therefore dy Pr Y y f y d Combining these results we get kk τ / exp k Γ k k / kk τ / exp k Γ k k / kk τ / exp k Γ k k / δ m + k + ˆPr M m exp D kk τ / exp k Γ k k / exp D y k y exp D y k ˆ + exp D γ g ˆ exp D γ g ˆ kk τ / exp k Γ k ˆ γ g ˆ Remark that the first factor is positive and that exp D + Γ k k / ˆ k k τ / exp k k + ˆPr M m ˆ + ˆ + ˆ + Γ k m + k k k τ / exp k ˆPr M m is a positive number that does not depend on Therefore it is sufficient to prove that is a decreasing function of, or, equivalently, to prove that ˆ γ g ˆ γ g ˆ ˆ is an increasing function of The partial derivative with respect to is ˆ γ g ˆ + γ g ˆ γ g ˆ ˆ ˆ + + γ g ˆ γ g ˆ γ g ˆ ˆ ˆ+γ g ˆ ˆ ˆγ g ˆ We have to prove that γ g ˆ γ g ˆ ˆ ˆ + γ g ˆ

21 Bounds on Tail Probabilities If ˆ the inequality is equivalent to γ g ˆ ˆ γ g ˆ ˆ + If ˆ < the inequality is equivalent to γ g ˆ ˆ γ g ˆ ˆ + The equation s s t can be solved with respect to x, which gives the solutions s + t ± +4t] / [t For ˆ we get [ ] / ˆ + 4 ˆ For ˆ < we get γ g ˆ + γ g ˆ + ˆ ˆ+ + ˆ+ ˆ+ [ + ˆ ˆ + ˆ + + 4ˆ ˆ + ˆ ˆ+ ] / [ ˆ + 4 ˆ ˆ+ ˆ+ [ + ˆ ˆ + ˆ + + 4ˆ ˆ + Since γ is increasing we have to prove that [ ˆ + ˆ + + 4ˆ g ˆ γ ˆ + ˆ + ] / ] / ] / or, equivalently, that [ ˆ + ˆ + + 4ˆ g ˆ γ ˆ + ˆ + { { } g ˆ γ + g ˆ + γ + ] / [ / } ˆ + ˆ+ +4ˆ] ˆ ˆ+ [ / ˆ + ˆ+ +4ˆ] ˆ ˆ+

22 Peter Harremoës is positive Both the denominator and the numerator are zero when ˆ Therefore it is sufficient to prove that both the denominator and the numerator are decreasing functions of First we prove that the denominator is decreasing The first term is obviously decreasing The second term is composed of γ, which is increasing, and y + y ± +4y] / [y which is increasing or decreasing depending on the sign of ±, and the function ˆ which is decreasing when ˆ and increasing when ˆ ˆ+ Therefore the composed function is a decreasing function of The numerator can be written as [ / { ˆ ln ˆ ˆ + ln ˆ } ˆ + ˆ+ +4ˆ] ˆ + ˆ+ [ / + ˆ + ˆ+ +4ˆ] ln + ˆ ˆ+ We calculate the derivative with respect to, which can be written as 4 +ˆ+4 + ˆ + ˆ + [ /, ˆ + + 4ˆ] + + ˆ + + 4ˆ which is obviously less than or equal to zero If we want to give lower bounds and upper bounds to the tail probabilities of a negative binomial distribution the following reformulation of Theorem 7 is useful Corollary Assume that the random variable M has a negative binomial distribution neg k, and let the random variable X be Gamma distributed Γ k, If G Γ k, x G negk, m Then Pr X x m Pr M m Pr X x m+ 5 where x m and x m+ are determined by G Γ k, x m G negk, m, G Γ k, x m+ G negk, m + 8 Inequalities for binomial distributions and Poisson distributions We will prove that intersections results for binomial distributions and Poisson distributions follows from the corresponding intersection result for negative binomial distributions and Gamma distributions We note that the point probabilities of a negative binomial distribution can be written as Γ m + k m!γ k m m k m+k + m! pk p m

23 Bounds on Tail Probabilities 3 Fig 8 Plot the quantiles of the signed log-likelihood of a standard Gaussian vs the quantiles of the sigend log-likelihood of bin 7, / where p + and where m denotes the raising factorial Let nb p, k denote a negativ binomial distribution with succes probability p Then nb p, k is the distribution of the number of failures before the k th success in a Bernoulli process with success probability p Our inequality for the negative binomial distribution can be translated into an inequality for the binomial distribution Assume that K is binomial bin n, p and M is negative binomial nb p, k Then Pr K k Pr M + k n In terms of p the divergence can be written as D nb p, k nb p, k k p ln p + p ln p p p p We have so D bin n, p bin n, p p ln p p + p ln p p k k D nb n, k k nb p, k n n ln n D bin k ln k n n p + p n, k bin n, p n If G bin is the signed log-likelihood of bin n, p and G nb is the signed log-likelihood of nb p, k then G binn,p k G nbp,k n k If K is Poisson distributed with mean λ and X is Gamma distributed with shape parameter k and scale parameter, ie the distribution of the waiting time until k observations from an Poisson process with intensity Then Next we note that Pr K k Pr X λ D P o k P o λ D Γ k, λ Γ k, k

24 4 Peter Harremoës Fig 9 Plot of quantiles of a standard Gaussian vs the log-ligelihood of the Poisson distribution P o 35 If G P oλ is the signed log-likelihood for P o λ and G Γ k, is the signed log-likelihood for Γ k, then G P oλ k G Γ k, λ Theorem 8 Assume that K is binomially distributed bin n, p and let G binn,p denote the signed log-likelihood function of the exponential family based on bin n, p Assume that L is a Poisson random variable with distribution P o λ and let G P oλ denote the signed log-likelihood function of the exponential family based on P o λ If G binn,p k G P oλ k Then Pr K < k Pr L < k Pr K k 6 Proof Let M denote a negative binomial random variable with distribution nb p, k and let X denote a Gamma random variable with distribution Γ k, where the parameter equals p such that the distributions nb p, k and Γ k, have the same mean value Now G nbp,k n k G binn,p k and G Γ k, λ G P oλ k Then G nbp,k n k G Γ k, λ The left part of the Inequality 6 is proved as follows Pr K < k Pr K k Pr M + k n Pr X λ Pr L k Pr L < k The right part of the inequality is proved in the same way Note that Theorem 7 cannot be proved from Theorem 8 because the number parameter for a binomial distribution has to be an integer while the number parameter of a negative binomial distribution may assume any positive value Now, our inequalities for negative binomial distributions can be translated into inequalities for binomial distributions Now we can prove the an intersection inequalities for the binomial family as stated in the following theorem that was recently proved by Serov and Zubkov []

25 Bounds on Tail Probabilities 5 Corollary 3 Assume that K is binomially distributed bin n, p and let G binn,p denote the signed log-likelihood function of the exponential family based on bin n, p Then Pr K < k Φ G binn,p k Pr K k 7 Similarly, assume that L is Poisson distributed P o λ and let G P oλ denote the signed log-likelihood function of the exponential family based on P o λ Then Pr L < k Φ G P oλ k Pr L k 8 Proof First we prove the left part of Inequality 8 Let X denote a Gamma distributed Γ k, and let Z denote a standard Gaussian Then G P oλ k G Γ k, λ and Pr L < k Pr L k Pr X λ Pr X λ Pr Z G Γ k, λ Pr Z G P oλ k Φ G P oλ k The left part of Inequality 7 is obtained by combining the left part of Inequality 8 with the left part of Inequality 6 Proof The right part of Inequality 7 is obtained follows from the left part of Inequality 7 by replacing p by p and replacing k by n k Proof Since a Poisson distribution is a limit of binomial distributions the right part of Inequality 8 follows from the right part of Inequality 8 The intersection property for Poisson distributions was proved in [6] where the inequality for binomial distributions was also conjectured

26 6 Peter Harremoës 9 Summary The main theorems in this paper are domination theorems and intersection theorems The first type of inequalities states that the signed log-likelihood of one distribution is dominated by the signed loglikelihood of another distribution, ie the distribution function of the first distribution is larger than the distribution function of the second distribution signed ll dom by signed ll Condition Theorem Inverse Gaussian Gaussian IG µ, λ IG µ, λ µ λ > µ λ Gamma Gaussian 4 Γ k, Γ k, k k 5 Table Stochastic domination results Note that the exponential distributions are special cases of Gamma distributions The second type of result are intersection results, ie the distribution function of the log-likelihood of a discrete distribution is a staircase function where each step is intersected by the distribution function of the log-likelihood of a continuous distribution Table Intersection results Discrete distribution Continuous distribution Theorem Geometric Exponential Negative binomial Gamma 7 Binomial Gaussian 3 Poisson Gaussian 3 0 Discussion In this paper we have presented inequalities of two types The inequalities for inverse Gaussian distributions, exponential distributions and other Gamma distributions are about stochastic domination The inequalities for Poisson distributions, binomial distributions, geometric distributions and other negative binomial distributions are about intersection These inequalities can be combined in order to get inequalities of other types For instance a negative binomial random variable M with distribution neg k, satisfies Φ G negk, m Pr M m, where G nbp,k denotes the signed log-likelihood of the negative binomial distribution Contrary to the similar inequality for the binomial distribution the inquality Pr M < m Φ G negk, m does in general not hold as illustrated in Figure 0 We have both lower bounds and upper bounds on the Poission distributions The upper bound for the Poisson distribution corresponds to the lower bound for the

27 Bounds on Tail Probabilities 7 Fig 0 Plot of the quantiles of a standard Gaussian vs the quantiles of the signed loglikelihood of the negative binomial distribution neg, 35 blue and of the signed loglikelihood of the Gamma distribution Γ, 35 green Gamma distribution presented in Theorem 4, but the lower for the Poisson distribution translated into a new upper bound for the distribution function of the Gamma distribution Numerical calculations also indicates that in Inequality 8 the right hand inequality can be improved to Φ G P oλ k + Pr L k This inequality is much tighter than the inequality in 8 Similarly, J Reiczigel, L Rejtő and G Tusnády conjectured that both the lower bound and the upper bound in Inequality 7 can be significantly improved when for p / [9], and their conjecture has been a major motivation for initializing this research For the most important distributions like the binomial distributions, the Poisson distributions, the negative binomial distributions, the inverse Gaussian distributions and the Gamma distributions we can formulate sharp inequalities that hold for any sample size All these distributions have variance functions that are polynomials of order and 3 Natural exponential families with polynomial variance functions of order at most 3 have been classified [8, 7] and there is a chance that one can formulate and prove sharp inequalities for each of these exponential families Although there may exist very nice results for the rest of the exponential families with simple variance functions the rest of these exponential families have much fewer applications than the exponential families that have been the subject of the present paper In the present paper inequalities have been developed for specific exponential families, but one may hope that some more general inequalities may be developed where bounds on the tails are derived directly from the properties of the variace function Acknowledgements The author want to thank Gabor Tusnády, who inspired me to look at this type of inequalities The author also want the thank László Györfi, Joram Gat, Janás Komlos, and A Zubkov for useful correspondence or discussions Finally I want to express my gratityde to Narayana Prasad Santhanam who invitet me to a two month visit at the Electrical Engineering Department, University of Hawai i This paper was completed during my visit

28 8 Peter Harremoës References Bahadur RR 960 Some approximations to the binomial distribution function Annals of mathematical statistics 3:43 54 Bahadur RR, Rao RR 960 On deviation of the sample mean Annals of Mathematical Statistics 3: Barndorff-Nielsen OE 990 A note on the standardized signed log likelihood ratio Scandinavian Journal of Statistics 7:57 60, URL stable/ Daniels HE 954 Saddlepoint approximations in statistics Ann Math Statist 54: Györfi L, Harremoës P, Tusnády G 0 Gaussian approximation of large deviation probabilities, unpublished 6 Harremoës P, Tusnády G 0 Information divergence is more χ -distributed than the χ -statistic In: International Symposium on Information Theory ISIT 0, IEEE, Cambridge, Massachusetts, USA, pp , URL org/abs/05 7 Letac G, Mora M 990 Natural real exponential families with cubic variance functions Ann Stat 8: 37 8 Morris C 98 Natural exponential families with quadratic variance functions Ann Statist 0: Reiczigel J, Rejtő L, Tusnády G 0 A sharpning of Tusnády s inequality, arxiv 0367v 0 Zhang T 008 Limiting distribution of the G statistics Statistics and Probability Letters 78:656 66, URL statprobletters008pdf Zubkov AM, Serov AA 03 A complete proof of universal inequalities for the distribution function of the binomial law Theory Probab Appl 573: , DOI 037/S X , URL S X http://arxivorg/abs/073838

Mutual information of Contingency Tables and Related Inequalities

Mutual information of Contingency Tables and Related Inequalities Mutual information of Contingency Tables and Related Inequalities Peter Harremoës Copenhagen Business College Copenhagen, Denmark Email: harremoes@ieeeorg arxiv:4020092v [mathst] Feb 204 Abstract For testing

More information

A Criterion for the Compound Poisson Distribution to be Maximum Entropy

A Criterion for the Compound Poisson Distribution to be Maximum Entropy A Criterion for the Compound Poisson Distribution to be Maximum Entropy Oliver Johnson Department of Mathematics University of Bristol University Walk Bristol, BS8 1TW, UK. Email: O.Johnson@bristol.ac.uk

More information

Continuous Distributions

Continuous Distributions Continuous Distributions 1.8-1.9: Continuous Random Variables 1.10.1: Uniform Distribution (Continuous) 1.10.4-5 Exponential and Gamma Distributions: Distance between crossovers Prof. Tesler Math 283 Fall

More information

Literature on Bregman divergences

Literature on Bregman divergences Literature on Bregman divergences Lecture series at Univ. Hawai i at Mānoa Peter Harremoës February 26, 2016 Information divergence was introduced by Kullback and Leibler [25] and later Kullback started

More information

Physics 403 Probability Distributions II: More Properties of PDFs and PMFs

Physics 403 Probability Distributions II: More Properties of PDFs and PMFs Physics 403 Probability Distributions II: More Properties of PDFs and PMFs Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Last Time: Common Probability Distributions

More information

On a conjecture concerning the sum of independent Rademacher random variables

On a conjecture concerning the sum of independent Rademacher random variables On a conjecture concerning the sum of independent Rademacher rom variables Martien C.A. van Zuijlen arxiv:111.4988v1 [math.pr] 1 Dec 011 IMAPP, MATHEMATICS RADBOUD UNIVERSITY NIJMEGEN Heyendaalseweg 135

More information

MATH4210 Financial Mathematics ( ) Tutorial 7

MATH4210 Financial Mathematics ( ) Tutorial 7 MATH40 Financial Mathematics (05-06) Tutorial 7 Review of some basic Probability: The triple (Ω, F, P) is called a probability space, where Ω denotes the sample space and F is the set of event (σ algebra

More information

Continuous Random Variables and Continuous Distributions

Continuous Random Variables and Continuous Distributions Continuous Random Variables and Continuous Distributions Continuous Random Variables and Continuous Distributions Expectation & Variance of Continuous Random Variables ( 5.2) The Uniform Random Variable

More information

CHINO VALLEY UNIFIED SCHOOL DISTRICT INSTRUCTIONAL GUIDE ALGEBRA II

CHINO VALLEY UNIFIED SCHOOL DISTRICT INSTRUCTIONAL GUIDE ALGEBRA II CHINO VALLEY UNIFIED SCHOOL DISTRICT INSTRUCTIONAL GUIDE ALGEBRA II Course Number 5116 Department Mathematics Qualification Guidelines Successful completion of both semesters of Algebra 1 or Algebra 1

More information

1 + lim. n n+1. f(x) = x + 1, x 1. and we check that f is increasing, instead. Using the quotient rule, we easily find that. 1 (x + 1) 1 x (x + 1) 2 =

1 + lim. n n+1. f(x) = x + 1, x 1. and we check that f is increasing, instead. Using the quotient rule, we easily find that. 1 (x + 1) 1 x (x + 1) 2 = Chapter 5 Sequences and series 5. Sequences Definition 5. (Sequence). A sequence is a function which is defined on the set N of natural numbers. Since such a function is uniquely determined by its values

More information

Approximating the Conway-Maxwell-Poisson normalizing constant

Approximating the Conway-Maxwell-Poisson normalizing constant Filomat 30:4 016, 953 960 DOI 10.98/FIL1604953S Published by Faculty of Sciences and Mathematics, University of Niš, Serbia Available at: http://www.pmf.ni.ac.rs/filomat Approximating the Conway-Maxwell-Poisson

More information

Exam C Solutions Spring 2005

Exam C Solutions Spring 2005 Exam C Solutions Spring 005 Question # The CDF is F( x) = 4 ( + x) Observation (x) F(x) compare to: Maximum difference 0. 0.58 0, 0. 0.58 0.7 0.880 0., 0.4 0.680 0.9 0.93 0.4, 0.6 0.53. 0.949 0.6, 0.8

More information

MOMENT CONVERGENCE RATES OF LIL FOR NEGATIVELY ASSOCIATED SEQUENCES

MOMENT CONVERGENCE RATES OF LIL FOR NEGATIVELY ASSOCIATED SEQUENCES J. Korean Math. Soc. 47 1, No., pp. 63 75 DOI 1.4134/JKMS.1.47..63 MOMENT CONVERGENCE RATES OF LIL FOR NEGATIVELY ASSOCIATED SEQUENCES Ke-Ang Fu Li-Hua Hu Abstract. Let X n ; n 1 be a strictly stationary

More information

Universal examples. Chapter The Bernoulli process

Universal examples. Chapter The Bernoulli process Chapter 1 Universal examples 1.1 The Bernoulli process First description: Bernoulli random variables Y i for i = 1, 2, 3,... independent with P [Y i = 1] = p and P [Y i = ] = 1 p. Second description: Binomial

More information

Discrete Distributions Chapter 6

Discrete Distributions Chapter 6 Discrete Distributions Chapter 6 Negative Binomial Distribution section 6.3 Consider k r, r +,... independent Bernoulli trials with probability of success in one trial being p. Let the random variable

More information

Independence of some multiple Poisson stochastic integrals with variable-sign kernels

Independence of some multiple Poisson stochastic integrals with variable-sign kernels Independence of some multiple Poisson stochastic integrals with variable-sign kernels Nicolas Privault Division of Mathematical Sciences School of Physical and Mathematical Sciences Nanyang Technological

More information

Strong log-concavity is preserved by convolution

Strong log-concavity is preserved by convolution Strong log-concavity is preserved by convolution Jon A. Wellner Abstract. We review and formulate results concerning strong-log-concavity in both discrete and continuous settings. Although four different

More information

Kernel families of probability measures. Saskatoon, October 21, 2011

Kernel families of probability measures. Saskatoon, October 21, 2011 Kernel families of probability measures Saskatoon, October 21, 2011 Abstract The talk will compare two families of probability measures: exponential, and Cauchy-Stjelties families. The exponential families

More information

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1998 Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ Lawrence D. Brown University

More information

Jump-type Levy Processes

Jump-type Levy Processes Jump-type Levy Processes Ernst Eberlein Handbook of Financial Time Series Outline Table of contents Probabilistic Structure of Levy Processes Levy process Levy-Ito decomposition Jump part Probabilistic

More information

Asymptotics for posterior hazards

Asymptotics for posterior hazards Asymptotics for posterior hazards Pierpaolo De Blasi University of Turin 10th August 2007, BNR Workshop, Isaac Newton Intitute, Cambridge, UK Joint work with Giovanni Peccati (Université Paris VI) and

More information

ELEMENTS OF PROBABILITY THEORY

ELEMENTS OF PROBABILITY THEORY ELEMENTS OF PROBABILITY THEORY Elements of Probability Theory A collection of subsets of a set Ω is called a σ algebra if it contains Ω and is closed under the operations of taking complements and countable

More information

Moments of the maximum of the Gaussian random walk

Moments of the maximum of the Gaussian random walk Moments of the maximum of the Gaussian random walk A.J.E.M. Janssen (Philips Research) Johan S.H. van Leeuwaarden (Eurandom) Model definition Consider the partial sums S n = X 1 +... + X n with X 1, X,...

More information

Brief Review of Probability

Brief Review of Probability Maura Department of Economics and Finance Università Tor Vergata Outline 1 Distribution Functions Quantiles and Modes of a Distribution 2 Example 3 Example 4 Distributions Outline Distribution Functions

More information

16.4. Power Series. Introduction. Prerequisites. Learning Outcomes

16.4. Power Series. Introduction. Prerequisites. Learning Outcomes Power Series 6.4 Introduction In this Section we consider power series. These are examples of infinite series where each term contains a variable, x, raised to a positive integer power. We use the ratio

More information

On random sequence spacing distributions

On random sequence spacing distributions On random sequence spacing distributions C.T.J. Dodson School of Mathematics, Manchester University Manchester M6 1QD, UK ctdodson@manchester.ac.uk Abstract The random model for allocation of an amino

More information

Page Max. Possible Points Total 100

Page Max. Possible Points Total 100 Math 3215 Exam 2 Summer 2014 Instructor: Sal Barone Name: GT username: 1. No books or notes are allowed. 2. You may use ONLY NON-GRAPHING and NON-PROGRAMABLE scientific calculators. All other electronic

More information

Random variables. DS GA 1002 Probability and Statistics for Data Science.

Random variables. DS GA 1002 Probability and Statistics for Data Science. Random variables DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Motivation Random variables model numerical quantities

More information

arxiv: v1 [math.oc] 18 Jul 2011

arxiv: v1 [math.oc] 18 Jul 2011 arxiv:1107.3493v1 [math.oc] 18 Jul 011 Tchebycheff systems and extremal problems for generalized moments: a brief survey Iosif Pinelis Department of Mathematical Sciences Michigan Technological University

More information

Department of Mathematics

Department of Mathematics Department of Mathematics Ma 3/103 KC Border Introduction to Probability and Statistics Winter 2017 Lecture 8: Expectation in Action Relevant textboo passages: Pitman [6]: Chapters 3 and 5; Section 6.4

More information

Moment Properties of Distributions Used in Stochastic Financial Models

Moment Properties of Distributions Used in Stochastic Financial Models Moment Properties of Distributions Used in Stochastic Financial Models Jordan Stoyanov Newcastle University (UK) & University of Ljubljana (Slovenia) e-mail: stoyanovj@gmail.com ETH Zürich, Department

More information

Asymptotic behavior for sums of non-identically distributed random variables

Asymptotic behavior for sums of non-identically distributed random variables Appl. Math. J. Chinese Univ. 2019, 34(1: 45-54 Asymptotic behavior for sums of non-identically distributed random variables YU Chang-jun 1 CHENG Dong-ya 2,3 Abstract. For any given positive integer m,

More information

Distributed Estimation, Information Loss and Exponential Families. Qiang Liu Department of Computer Science Dartmouth College

Distributed Estimation, Information Loss and Exponential Families. Qiang Liu Department of Computer Science Dartmouth College Distributed Estimation, Information Loss and Exponential Families Qiang Liu Department of Computer Science Dartmouth College Statistical Learning / Estimation Learning generative models from data Topic

More information

1 Review of Probability

1 Review of Probability 1 Review of Probability Random variables are denoted by X, Y, Z, etc. The cumulative distribution function (c.d.f.) of a random variable X is denoted by F (x) = P (X x), < x

More information

04. Random Variables: Concepts

04. Random Variables: Concepts University of Rhode Island DigitalCommons@URI Nonequilibrium Statistical Physics Physics Course Materials 215 4. Random Variables: Concepts Gerhard Müller University of Rhode Island, gmuller@uri.edu Creative

More information

Math Bootcamp 2012 Miscellaneous

Math Bootcamp 2012 Miscellaneous Math Bootcamp 202 Miscellaneous Factorial, combination and permutation The factorial of a positive integer n denoted by n!, is the product of all positive integers less than or equal to n. Define 0! =.

More information

Continuous RVs. 1. Suppose a random variable X has the following probability density function: π, zero otherwise. f ( x ) = sin x, 0 < x < 2

Continuous RVs. 1. Suppose a random variable X has the following probability density function: π, zero otherwise. f ( x ) = sin x, 0 < x < 2 STAT 4 Exam I Continuous RVs Fall 7 Practice. Suppose a random variable X has the following probability density function: f ( x ) = sin x, < x < π, zero otherwise. a) Find P ( X < 4 π ). b) Find µ = E

More information

Lecture 4. f X T, (x t, ) = f X,T (x, t ) f T (t )

Lecture 4. f X T, (x t, ) = f X,T (x, t ) f T (t ) LECURE NOES 21 Lecture 4 7. Sufficient statistics Consider the usual statistical setup: the data is X and the paramter is. o gain information about the parameter we study various functions of the data

More information

Let (Ω, F) be a measureable space. A filtration in discrete time is a sequence of. F s F t

Let (Ω, F) be a measureable space. A filtration in discrete time is a sequence of. F s F t 2.2 Filtrations Let (Ω, F) be a measureable space. A filtration in discrete time is a sequence of σ algebras {F t } such that F t F and F t F t+1 for all t = 0, 1,.... In continuous time, the second condition

More information

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes Alea 4, 117 129 (2008) Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes Anton Schick and Wolfgang Wefelmeyer Anton Schick, Department of Mathematical Sciences,

More information

1. Stochastic Processes and filtrations

1. Stochastic Processes and filtrations 1. Stochastic Processes and 1. Stoch. pr., A stochastic process (X t ) t T is a collection of random variables on (Ω, F) with values in a measurable space (S, S), i.e., for all t, In our case X t : Ω S

More information

The random variable 1

The random variable 1 The random variable 1 Contents 1. Definition 2. Distribution and density function 3. Specific random variables 4. Functions of one random variable 5. Mean and variance 2 The random variable A random variable

More information

STAT/MATH 395 A - PROBABILITY II UW Winter Quarter Moment functions. x r p X (x) (1) E[X r ] = x r f X (x) dx (2) (x E[X]) r p X (x) (3)

STAT/MATH 395 A - PROBABILITY II UW Winter Quarter Moment functions. x r p X (x) (1) E[X r ] = x r f X (x) dx (2) (x E[X]) r p X (x) (3) STAT/MATH 395 A - PROBABILITY II UW Winter Quarter 07 Néhémy Lim Moment functions Moments of a random variable Definition.. Let X be a rrv on probability space (Ω, A, P). For a given r N, E[X r ], if it

More information

MAT1300 Final Review. Pieter Hofstra. December 4, 2009

MAT1300 Final Review. Pieter Hofstra. December 4, 2009 December 4, 2009 Sections from the book to study (8th Edition) Chapter 0: 0.1: Real line and Order 0.2: Absolute Value and Distance 0.3: Exponents and Radicals 0.4: Factoring Polynomials (you may omit

More information

Octavio Arizmendi, Ole E. Barndorff-Nielsen and Víctor Pérez-Abreu

Octavio Arizmendi, Ole E. Barndorff-Nielsen and Víctor Pérez-Abreu ON FREE AND CLASSICAL TYPE G DISTRIBUTIONS Octavio Arizmendi, Ole E. Barndorff-Nielsen and Víctor Pérez-Abreu Comunicación del CIMAT No I-9-4/3-4-29 (PE /CIMAT) On Free and Classical Type G Distributions

More information

On the Entropy of Sums of Bernoulli Random Variables via the Chen-Stein Method

On the Entropy of Sums of Bernoulli Random Variables via the Chen-Stein Method On the Entropy of Sums of Bernoulli Random Variables via the Chen-Stein Method Igal Sason Department of Electrical Engineering Technion - Israel Institute of Technology Haifa 32000, Israel ETH, Zurich,

More information

College Algebra To learn more about all our offerings Visit Knewton.com

College Algebra To learn more about all our offerings Visit Knewton.com College Algebra 978-1-63545-097-2 To learn more about all our offerings Visit Knewton.com Source Author(s) (Text or Video) Title(s) Link (where applicable) OpenStax Text Jay Abramson, Arizona State University

More information

ULTRASPHERICAL TYPE GENERATING FUNCTIONS FOR ORTHOGONAL POLYNOMIALS

ULTRASPHERICAL TYPE GENERATING FUNCTIONS FOR ORTHOGONAL POLYNOMIALS ULTRASPHERICAL TYPE GENERATING FUNCTIONS FOR ORTHOGONAL POLYNOMIALS arxiv:083666v [mathpr] 8 Jan 009 Abstract We characterize, up to a conjecture, probability distributions of finite all order moments

More information

Stat410 Probability and Statistics II (F16)

Stat410 Probability and Statistics II (F16) Stat4 Probability and Statistics II (F6 Exponential, Poisson and Gamma Suppose on average every /λ hours, a Stochastic train arrives at the Random station. Further we assume the waiting time between two

More information

Recurrence of Simple Random Walk on Z 2 is Dynamically Sensitive

Recurrence of Simple Random Walk on Z 2 is Dynamically Sensitive arxiv:math/5365v [math.pr] 3 Mar 25 Recurrence of Simple Random Walk on Z 2 is Dynamically Sensitive Christopher Hoffman August 27, 28 Abstract Benjamini, Häggström, Peres and Steif [2] introduced the

More information

Continuous Distributions

Continuous Distributions A normal distribution and other density functions involving exponential forms play the most important role in probability and statistics. They are related in a certain way, as summarized in a diagram later

More information

A note on profile likelihood for exponential tilt mixture models

A note on profile likelihood for exponential tilt mixture models Biometrika (2009), 96, 1,pp. 229 236 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn059 Advance Access publication 22 January 2009 A note on profile likelihood for exponential

More information

Invariance Properties of the Negative Binomial Lévy Process and Stochastic Self-similarity

Invariance Properties of the Negative Binomial Lévy Process and Stochastic Self-similarity International Mathematical Forum, 2, 2007, no. 30, 1457-1468 Invariance Properties of the Negative Binomial Lévy Process and Stochastic Self-similarity Tomasz J. Kozubowski Department of Mathematics &

More information

Cauchy-Stieltjes kernel families

Cauchy-Stieltjes kernel families Cauchy-Stieltjes kernel families Wlodek Bryc Fields Institute, July 23, 2013 NEF versus CSK families The talk will switch between two examples of kernel families K(µ) = {P θ (dx) : θ Θ} NEF versus CSK

More information

Power Series. Part 2 Differentiation & Integration; Multiplication of Power Series. J. Gonzalez-Zugasti, University of Massachusetts - Lowell

Power Series. Part 2 Differentiation & Integration; Multiplication of Power Series. J. Gonzalez-Zugasti, University of Massachusetts - Lowell Power Series Part 2 Differentiation & Integration; Multiplication of Power Series 1 Theorem 1 If a n x n converges absolutely for x < R, then a n f x n converges absolutely for any continuous function

More information

Introduction to Probability and Statistics (Continued)

Introduction to Probability and Statistics (Continued) Introduction to Probability and Statistics (Continued) Prof. icholas Zabaras Center for Informatics and Computational Science https://cics.nd.edu/ University of otre Dame otre Dame, Indiana, USA Email:

More information

ASYMPTOTIC EQUIVALENCE OF DENSITY ESTIMATION AND GAUSSIAN WHITE NOISE. By Michael Nussbaum Weierstrass Institute, Berlin

ASYMPTOTIC EQUIVALENCE OF DENSITY ESTIMATION AND GAUSSIAN WHITE NOISE. By Michael Nussbaum Weierstrass Institute, Berlin The Annals of Statistics 1996, Vol. 4, No. 6, 399 430 ASYMPTOTIC EQUIVALENCE OF DENSITY ESTIMATION AND GAUSSIAN WHITE NOISE By Michael Nussbaum Weierstrass Institute, Berlin Signal recovery in Gaussian

More information

Algebra and Trigonometry

Algebra and Trigonometry Algebra and Trigonometry 978-1-63545-098-9 To learn more about all our offerings Visit Knewtonalta.com Source Author(s) (Text or Video) Title(s) Link (where applicable) OpenStax Jay Abramson, Arizona State

More information

Sub-Gaussian estimators under heavy tails

Sub-Gaussian estimators under heavy tails Sub-Gaussian estimators under heavy tails Roberto Imbuzeiro Oliveira XIX Escola Brasileira de Probabilidade Maresias, August 6th 2015 Joint with Luc Devroye (McGill) Matthieu Lerasle (CNRS/Nice) Gábor

More information

Alg Review/Eq & Ineq (50 topics, due on 01/19/2016)

Alg Review/Eq & Ineq (50 topics, due on 01/19/2016) Course Name: MAC 1140 Spring 16 Course Code: XQWHD-P4TU6 ALEKS Course: PreCalculus Instructor: Van De Car Course Dates: Begin: 01/11/2016 End: 05/01/2016 Course Content: 307 topics Textbook: Coburn: Precalculus,

More information

Received: 20 December 2011; in revised form: 4 February 2012 / Accepted: 7 February 2012 / Published: 2 March 2012

Received: 20 December 2011; in revised form: 4 February 2012 / Accepted: 7 February 2012 / Published: 2 March 2012 Entropy 2012, 14, 480-490; doi:10.3390/e14030480 Article OPEN ACCESS entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Interval Entropy and Informative Distance Fakhroddin Misagh 1, * and Gholamhossein

More information

Primer on statistics:

Primer on statistics: Primer on statistics: MLE, Confidence Intervals, and Hypothesis Testing ryan.reece@gmail.com http://rreece.github.io/ Insight Data Science - AI Fellows Workshop Feb 16, 018 Outline 1. Maximum likelihood

More information

Sharp threshold functions for random intersection graphs via a coupling method.

Sharp threshold functions for random intersection graphs via a coupling method. Sharp threshold functions for random intersection graphs via a coupling method. Katarzyna Rybarczyk Faculty of Mathematics and Computer Science, Adam Mickiewicz University, 60 769 Poznań, Poland kryba@amu.edu.pl

More information

ON COMPOUND POISSON POPULATION MODELS

ON COMPOUND POISSON POPULATION MODELS ON COMPOUND POISSON POPULATION MODELS Martin Möhle, University of Tübingen (joint work with Thierry Huillet, Université de Cergy-Pontoise) Workshop on Probability, Population Genetics and Evolution Centre

More information

Solutions to the Spring 2015 CAS Exam ST

Solutions to the Spring 2015 CAS Exam ST Solutions to the Spring 2015 CAS Exam ST (updated to include the CAS Final Answer Key of July 15) There were 25 questions in total, of equal value, on this 2.5 hour exam. There was a 10 minute reading

More information

Elementary Probability. Exam Number 38119

Elementary Probability. Exam Number 38119 Elementary Probability Exam Number 38119 2 1. Introduction Consider any experiment whose result is unknown, for example throwing a coin, the daily number of customers in a supermarket or the duration of

More information

The Brownian graph is not round

The Brownian graph is not round The Brownian graph is not round Tuomas Sahlsten The Open University, Milton Keynes, 16.4.2013 joint work with Jonathan Fraser and Tuomas Orponen Fourier analysis and Hausdorff dimension Fourier analysis

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

STATISTICS SYLLABUS UNIT I

STATISTICS SYLLABUS UNIT I STATISTICS SYLLABUS UNIT I (Probability Theory) Definition Classical and axiomatic approaches.laws of total and compound probability, conditional probability, Bayes Theorem. Random variable and its distribution

More information

4 Sums of Independent Random Variables

4 Sums of Independent Random Variables 4 Sums of Independent Random Variables Standing Assumptions: Assume throughout this section that (,F,P) is a fixed probability space and that X 1, X 2, X 3,... are independent real-valued random variables

More information

Stochastic differential equations in neuroscience

Stochastic differential equations in neuroscience Stochastic differential equations in neuroscience Nils Berglund MAPMO, Orléans (CNRS, UMR 6628) http://www.univ-orleans.fr/mapmo/membres/berglund/ Barbara Gentz, Universität Bielefeld Damien Landon, MAPMO-Orléans

More information

A COUNTEREXAMPLE TO AN ENDPOINT BILINEAR STRICHARTZ INEQUALITY TERENCE TAO. t L x (R R2 ) f L 2 x (R2 )

A COUNTEREXAMPLE TO AN ENDPOINT BILINEAR STRICHARTZ INEQUALITY TERENCE TAO. t L x (R R2 ) f L 2 x (R2 ) Electronic Journal of Differential Equations, Vol. 2006(2006), No. 5, pp. 6. ISSN: 072-669. URL: http://ejde.math.txstate.edu or http://ejde.math.unt.edu ftp ejde.math.txstate.edu (login: ftp) A COUNTEREXAMPLE

More information

Probability for Statistics and Machine Learning

Probability for Statistics and Machine Learning ~Springer Anirban DasGupta Probability for Statistics and Machine Learning Fundamentals and Advanced Topics Contents Suggested Courses with Diffe~ent Themes........................... xix 1 Review of Univariate

More information

A remark on the maximum eigenvalue for circulant matrices

A remark on the maximum eigenvalue for circulant matrices IMS Collections High Dimensional Probability V: The Luminy Volume Vol 5 (009 79 84 c Institute of Mathematical Statistics, 009 DOI: 04/09-IMSCOLL5 A remark on the imum eigenvalue for circulant matrices

More information

1 Review of di erential calculus

1 Review of di erential calculus Review of di erential calculus This chapter presents the main elements of di erential calculus needed in probability theory. Often, students taking a course on probability theory have problems with concepts

More information

Lecture 1: August 28

Lecture 1: August 28 36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 1: August 28 Our broad goal for the first few lectures is to try to understand the behaviour of sums of independent random

More information

Poisson Variables. Robert DeSerio

Poisson Variables. Robert DeSerio Poisson Variables Robert DeSerio Poisson processes A Poisson process is one in which events are randomly distributed in time, space or some other variable with the number of events in any non-overlapping

More information

CS Lecture 19. Exponential Families & Expectation Propagation

CS Lecture 19. Exponential Families & Expectation Propagation CS 6347 Lecture 19 Exponential Families & Expectation Propagation Discrete State Spaces We have been focusing on the case of MRFs over discrete state spaces Probability distributions over discrete spaces

More information

Central limit theorems for ergodic continuous-time Markov chains with applications to single birth processes

Central limit theorems for ergodic continuous-time Markov chains with applications to single birth processes Front. Math. China 215, 1(4): 933 947 DOI 1.17/s11464-15-488-5 Central limit theorems for ergodic continuous-time Markov chains with applications to single birth processes Yuanyuan LIU 1, Yuhui ZHANG 2

More information

Physics 6720 Introduction to Statistics April 4, 2017

Physics 6720 Introduction to Statistics April 4, 2017 Physics 6720 Introduction to Statistics April 4, 2017 1 Statistics of Counting Often an experiment yields a result that can be classified according to a set of discrete events, giving rise to an integer

More information

n! (k 1)!(n k)! = F (X) U(0, 1). (x, y) = n(n 1) ( F (y) F (x) ) n 2

n! (k 1)!(n k)! = F (X) U(0, 1). (x, y) = n(n 1) ( F (y) F (x) ) n 2 Order statistics Ex. 4.1 (*. Let independent variables X 1,..., X n have U(0, 1 distribution. Show that for every x (0, 1, we have P ( X (1 < x 1 and P ( X (n > x 1 as n. Ex. 4.2 (**. By using induction

More information

Systems Driven by Alpha-Stable Noises

Systems Driven by Alpha-Stable Noises Engineering Mechanics:A Force for the 21 st Century Proceedings of the 12 th Engineering Mechanics Conference La Jolla, California, May 17-20, 1998 H. Murakami and J. E. Luco (Editors) @ASCE, Reston, VA,

More information

arxiv:cond-mat/ v2 [cond-mat.stat-mech] 23 Feb 1998

arxiv:cond-mat/ v2 [cond-mat.stat-mech] 23 Feb 1998 Multiplicative processes and power laws arxiv:cond-mat/978231v2 [cond-mat.stat-mech] 23 Feb 1998 Didier Sornette 1,2 1 Laboratoire de Physique de la Matière Condensée, CNRS UMR6632 Université des Sciences,

More information

Power series and Taylor series

Power series and Taylor series Power series and Taylor series D. DeTurck University of Pennsylvania March 29, 2018 D. DeTurck Math 104 002 2018A: Series 1 / 42 Series First... a review of what we have done so far: 1 We examined series

More information

Chapter 3 Common Families of Distributions

Chapter 3 Common Families of Distributions Lecture 9 on BST 631: Statistical Theory I Kui Zhang, 9/3/8 and 9/5/8 Review for the previous lecture Definition: Several commonly used discrete distributions, including discrete uniform, hypergeometric,

More information

Statistics and data analyses

Statistics and data analyses Statistics and data analyses Designing experiments Measuring time Instrumental quality Precision Standard deviation depends on Number of measurements Detection quality Systematics and methology σ tot =

More information

Some Contra-Arguments for the Use of Stable Distributions in Financial Modeling

Some Contra-Arguments for the Use of Stable Distributions in Financial Modeling arxiv:1602.00256v1 [stat.ap] 31 Jan 2016 Some Contra-Arguments for the Use of Stable Distributions in Financial Modeling Lev B Klebanov, Gregory Temnov, Ashot V. Kakosyan Abstract In the present paper,

More information

The moment-generating function of the log-normal distribution using the star probability measure

The moment-generating function of the log-normal distribution using the star probability measure Noname manuscript No. (will be inserted by the editor) The moment-generating function of the log-normal distribution using the star probability measure Yuri Heymann Received: date / Accepted: date Abstract

More information

STAT 7032 Probability Spring Wlodek Bryc

STAT 7032 Probability Spring Wlodek Bryc STAT 7032 Probability Spring 2018 Wlodek Bryc Created: Friday, Jan 2, 2014 Revised for Spring 2018 Printed: January 9, 2018 File: Grad-Prob-2018.TEX Department of Mathematical Sciences, University of Cincinnati,

More information

Recap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks

Recap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks Recap Probability, stochastic processes, Markov chains ELEC-C7210 Modeling and analysis of communication networks 1 Recap: Probability theory important distributions Discrete distributions Geometric distribution

More information

MOMENTS AND CUMULANTS OF THE SUBORDINATED LEVY PROCESSES

MOMENTS AND CUMULANTS OF THE SUBORDINATED LEVY PROCESSES MOMENTS AND CUMULANTS OF THE SUBORDINATED LEVY PROCESSES PENKA MAYSTER ISET de Rades UNIVERSITY OF TUNIS 1 The formula of Faa di Bruno represents the n-th derivative of the composition of two functions

More information

On the quantiles of the Brownian motion and their hitting times.

On the quantiles of the Brownian motion and their hitting times. On the quantiles of the Brownian motion and their hitting times. Angelos Dassios London School of Economics May 23 Abstract The distribution of the α-quantile of a Brownian motion on an interval [, t]

More information

An inverse of Sanov s theorem

An inverse of Sanov s theorem An inverse of Sanov s theorem Ayalvadi Ganesh and Neil O Connell BRIMS, Hewlett-Packard Labs, Bristol Abstract Let X k be a sequence of iid random variables taking values in a finite set, and consider

More information

EXPLICIT MULTIVARIATE BOUNDS OF CHEBYSHEV TYPE

EXPLICIT MULTIVARIATE BOUNDS OF CHEBYSHEV TYPE Annales Univ. Sci. Budapest., Sect. Comp. 42 2014) 109 125 EXPLICIT MULTIVARIATE BOUNDS OF CHEBYSHEV TYPE Villő Csiszár Budapest, Hungary) Tamás Fegyverneki Budapest, Hungary) Tamás F. Móri Budapest, Hungary)

More information

Sample Spaces, Random Variables

Sample Spaces, Random Variables Sample Spaces, Random Variables Moulinath Banerjee University of Michigan August 3, 22 Probabilities In talking about probabilities, the fundamental object is Ω, the sample space. (elements) in Ω are denoted

More information

arxiv: v2 [math.st] 20 Feb 2013

arxiv: v2 [math.st] 20 Feb 2013 Exact bounds on the closeness between the Student and standard normal distributions arxiv:1101.3328v2 [math.st] 20 Feb 2013 Contents Iosif Pinelis Department of Mathematical Sciences Michigan Technological

More information

Week 9 The Central Limit Theorem and Estimation Concepts

Week 9 The Central Limit Theorem and Estimation Concepts Week 9 and Estimation Concepts Week 9 and Estimation Concepts Week 9 Objectives 1 The Law of Large Numbers and the concept of consistency of averages are introduced. The condition of existence of the population

More information

Zeros of lacunary random polynomials

Zeros of lacunary random polynomials Zeros of lacunary random polynomials Igor E. Pritsker Dedicated to Norm Levenberg on his 60th birthday Abstract We study the asymptotic distribution of zeros for the lacunary random polynomials. It is

More information

Algebraic Information Geometry for Learning Machines with Singularities

Algebraic Information Geometry for Learning Machines with Singularities Algebraic Information Geometry for Learning Machines with Singularities Sumio Watanabe Precision and Intelligence Laboratory Tokyo Institute of Technology 4259 Nagatsuta, Midori-ku, Yokohama, 226-8503

More information

PAS04 - Important discrete and continuous distributions

PAS04 - Important discrete and continuous distributions PAS04 - Important discrete and continuous distributions Jan Březina Technical University of Liberec 30. října 2014 Bernoulli trials Experiment with two possible outcomes: yes/no questions throwing coin

More information