A large-deviation principle for Dirichlet posteriors

Size: px
Start display at page:

Download "A large-deviation principle for Dirichlet posteriors"

Transcription

1 Bernoulli 6(6), 2000, 02±034 A large-deviation principle for Dirichlet posteriors AYALVADI J. GANESH and NEIL O'CONNELL 2 Microsoft Research, Guildhall Street, Cambridge CB2 3NH, UK. ajg@microsoft.com 2 BRIMS, Hewlett-Packard Laboratories, Filton Road, Bristol BS2 6QZ, UK. noc@hplb.hpl.hp.com Let X k be a sequence of independent and identically distributed random variables taking values in a compact metric space, and consider the problem of estimating the law of X in a Bayesian framework. A conjugate family of priors for nonparametric Bayesian inference is the Dirichlet process priors popularized by Ferguson. We prove that if the prior distribution is Dirichlet, then the sequence of posterior distributions satis es a large-deviation principle, and give an explicit expression for the rate function. As an application, we obtain an asymptotic formula for the predictive probability of ruin in the classical gambler's ruin problem. Keywords: asymptotics; Bayesian nonparametrics; Dirichlet process; large deviations. Introduction Let X be a Hausdorff topological space with Borel ó -algebra B, and let ì n be a sequence of probability measures on (X, B ). A rate function is a non-negative lower semicontinuous function on X. We say that the sequence ì n satis es the large-deviation principle (LDP) with rate function I, if for all B 2 B, inf I(x) < lim inf x2b n n log ì n(b) < lim sup n n log ì n(b) < inf x2b Let be a complete, separable metric space (Polish space) and denote by M () the space of probability measures on. Consider a sequence of independent random variables X k taking values in, with common law ì 2 M (). Denote by L n the empirical measure corresponding to the rst n observations: L n ˆ X n ä X n k : kˆ We denote the law of L n by L (L n ). For í 2 M () de ne its Kullback±Leibler distance or relative entropy (relative to ì) by I(x): 350±7265 # 2000 ISI/BS

2 022 A.J. Ganesh and N. O'Connell 8 < dí dí log dì, í ì, H(íjì) ˆ dì dì :, otherwise: The statement of Sanov's theorem is that the sequence L (L n ) satis es the LDP in M () equipped with the ô-topology (see Dembo and Zeitouni 993, Theorem 6.2.0), with rate function H(:jì). As a corollary, the LDP also holds in the weak topology on M (), which is weaker than the ô-topology. In an earlier paper (Ganesh and O'Connell 999), we proved an inverse of this result, which arises naturally in a Bayesian setting, for nite sets. The underlying distribution (of the X k ) is unknown, and has a prior distribution ð 2 M (M ()). The posterior distribution, given the rst n observations, is a function of the empirical measure L n and is denoted ð n (L n ). We showed that, on the set fl n! ìg, for any xed ì in the support of the prior, the sequence ð n (L n ) satis es the LDP in M () with rate function given by H( ìj:) on the support of the prior (otherwise it is in nite). Note that the roles played by the arguments of the relative entropy function are interchanged compared to Sanov's theorem. We pointed out that the extension of the result to more general would require additional assumptions about the prior. To see that this is a delicate issue, note that, since H( ìjì) ˆ 0, the LDP implies consistency of the posterior distribution in the topology generated by Kullback±Leibler neighbourhoods; in particular, it implies weak consistency. But it was shown by Freedman (963) that Bayes estimates can be inconsistent even for countable ; even if the true distribution is in the weak support of the prior, it does not follow that the posterior mass of each weak neighbourhood tends to (in fact, it can tend to zero!). There has recently been renewed interest in the consistency of nonparametric Bayes methods, prompted by their increasing popularity in applied work. A notable early result in this eld is due to Schwartz (965), who showed that if the prior assigns positive probability to every Kullback±Leibler neighbourhood of the true distribution, then the posterior is weakly consistent. If, in addition, the relevant space of probability distributions satis es a `metric entropy' condition, then Barron et al. (999) show that the posterior concentrates on neighbourhoods de ned by the Hellinger metric; these are ner than weak neighbourhoods. (The Hellinger distance between two densities f and g with respect to a reference measure ì is de ned by p p ( f g ) 2 dì.) Recent research on the consistency of Bayes methods is reviewed by Ghosal et al. (999) and Wasserman (998). Rates of convergence of the posterior have been investigated by Ghosal et al. (998) and Shen and Wasserman (998), but there is relatively little work on more re ned asymptotics. In this paper, we prove an LDP for the special (but nevertheless useful) case of Dirichlet process priors on a compact metric space. The problem of extending our results to an arbitrary Polish space remains open. An LDP with a similar avour for a sequence of Dirichlet processes has been derived by Lynch and Sethuraman (987); we compare our result with theirs following the statement of Theorem. The techniques we use in this paper are very different from those of Lynch and Sethuraman, who obtain their results as a consequence of an LDP they derive for processes

3 A large-deviation principle for Dirichlet posteriors 023 with stationary, independent increments. We believe that our methods are of independent interest, and also that they can be generalized to a wider class of prior distributions. The LDP for Dirichlet posteriors derived here has applications to queue and risk management that are discussed in Ganesh et al. (998). Some questions of interest in this context are posed in terms of the ruin probability in the classical gambler's ruin problem. In Section 3, we use the LDP for the posterior distributions to obtain an asymptotic formula for the predictive probability of ruin. 2. The large-deviation principle Let be a compact metric space with Borel ó -algebra F. Let M () denote the space of probability measures on (, F ), and B (M ()) the Borel ó -algebra induced by the weak topology on M (). In this case, it is not possible to establish an LDP for Bayes posteriors corresponding to arbitrary prior distributions, for reasons discussed above. Therefore, we shall work with a speci c family of priors, namely Dirichlet process priors; see Ferguson (973) for a detailed discussion of their properties. The n-dimensional Dirichlet distribution with parameter a ˆ (a,..., a n ), denoted D(a), is de ned to be the joint distribution of (Z =Z,..., Z n =Z), where Z i, i ˆ,..., n, are mutually independent, Z i has the gamma distribution with shape parameter a i and scale parameter, and Z ˆ Z... Z n. Denote by M () the space of nite non-negative measures on (, F ). The Dirichlet process with parameter á 2 M (), denoted D (á), is a probability distribution on M (). A random probability measure, ì, on is said to have law D (á) if, for every nite measurable partition (A,..., A n ) of, the vector ( ì(a ),..., ì(a n )) has the n- dimensional Dirichlet distribution D(á(A ),..., á(a n )). The distribution of ( ì(b ),..., ì(b n )) for arbitrary measurable B,..., B n follows in an obvious way from the distributions for partitions. Let ð be a Dirichlet process prior, D (á), on the space M (). Then, conditional on observing ù,..., ù n, the posterior distribution is also a Dirichlet process, but with parameter á P n iˆ ä ù i, where ä x denotes Dirac measure at x (see Ferguson 973; 974). In other words, the Dirichlet processes D (á), á 2 M (), are a conjugate family of priors. This property greatly facilitates computation of posterior distributions and is very useful in analytical work. We now prove an LDP for the sequence of distributions fd (á P n iˆ ä ù i ), n ˆ, 2,...g. Theorem. Let á be a nite non-negative measure on (, B ()), with support. Let ì be a probability measure on (, B ()), and let fx n g be an -valued sequence such that X n ä xi! ì weakly, n iˆ where ä xi denotes Dirac measure at x i. Then the sequence of probability measures

4 024 A.J. Ganesh and N. O'Connell D (á P n iˆ ä x i ) satis es an LDP in M () equipped with its weak topology, with rate function I(:) given by I(í) ˆ H( ìjí), where H( ìjí) denotes the relative entropy of ì with respect to í. Corollary. If X i, i 2 N, are independent and identically distributed with common law ì, then the sequence of empirical distributions (=n) P n iˆ ä X i converges weakly to ì with probability one. Hence, the sequence of random probability measures D (á P n iˆ ä X i ) almost surely satis es an LDP (on M () equipped with its weak topology) with rate function I(:) ˆ H( ìj:). Remark. There is no loss of generality in the assumption that the support of the prior, á, is. Indeed, if the prior were supported on some smaller set, then since the posterior assigns no mass outside, we can con ne ourselves to the closed set, which is also a compact metric space. Remark 2. Lynch and Sethuraman (987) prove an LDP for the sequence of Dirichlet distributions D (nì) on [0, ]. Their result is equivalent to our theorem, for ˆ [0, ], if D (nì) and D (á nì n ) are exponentially equivalent whenever ì n converges weakly to ì; however, establishing exponential equivalence does not appear to be trivial. We now sketch the main ideas behind the proof before proceeding with a formal derivation. Let ì n be a random element of M () with distribution D (á P n iˆ ä x i )as above. For bounded measurable functions f :! R, we de ne Ë n ( f ) ˆ log E exp f dì n : () We show in Lemma below that, for nite measurable partitions (A,..., A k )of, the vector ( ì n (A ),..., ì n (A k )) satis es the LDP in R k. We then use Varadhan's integral lemma (Dembo and Zeitouni 993, Theorem 4.3.) to infer the existence of the limit Ë( f ) ˆ lim Ë(nf ), n for simple functions f ˆ Pk iˆ c i Ai ; here Ai denotes the indicator of A i. This is extended in Lemma 3 to all bounded continuous functions on, using the continuity of Ë(:). By Theorem in Dembo and Zeitouni (993), the existence of the limiting logarithmic moment generating function, Ë, implies the large-deviation upper bound for the sequence fì n g, for all compact subsets of M (). Since was assumed to be compact, M () is compact in the weak topology and so the upper bound holds for all closed sets. The rate function for this upper bound is the convex conjugate of Ë, which we identify to be H( ìj:). We use the LDP for ( ì n (A ),..., ì n (A k )) and the contraction principle (Dembo and Zeitouni 993, Theorem 4.2.) to obtain an LDP for P k iˆ c iì n (A i ), for arbitrary constants c i. Thus, we obtain a large-deviation lower bound for sets of the form n!

5 A large-deviation principle for Dirichlet posteriors 025 U(ö k, x, ä) ˆ í 2 M (): ö k dí 2 (x ä, x ä), with rate function H k ( ìj:) given in Lemma 2. Here x 2 R and ä. 0 are arbitrary, and ö k is any simple function ö k ˆ Pk iˆ c i Ai, so that ö k dí ˆ Pc i í(a i ). We extend the lower bound to sets of the form U(ö, x, ä), where ö is any bounded continuous function on, by using increasingly ne partitions of to approximate ö by simple functions. The rate function for this lower bound is the limit of H k ( ìj:) as the partitions indexed by k get ner, which is shown in Lemma 2 to be H( ìj:). Since the sets U(ö, x, ä) constitute a base for the weak topology on M (), this establishes the large-deviation lower bound for all open sets. The proof of Theorem uses the following lemmas, whose proofs are in the Appendix. Lemma. Let (A,..., A k ) be a measurable partition of and suppose that the interior of A i is non-empty for each i ˆ,..., k. Let f be bounded and measurable with respect to ó (A,..., A k ), the ó algebra generated by the sets A,..., A k. Then Ë( f ) :ˆ lim n! n Ë n(nf ) (2) exists and is nite, and is given by Ë( f ) ˆ sup í2m () f dí H( ìjí) : (3) Lemma 2. Let A k ˆ (A k,..., Ak n k ), k 2 N, be a sequence of partitions of such that the corresponding ó-algebras, ó (A k ), increase to B (), the Borel ó-algebra on. Then, for all í 2 M (), we have H( ìjí) ˆ sup k where H k ( ìjí) :ˆ Pn k iˆ ì(ak i )log[ì(ak i )=í(ak i )]. H k ( ìjí) ˆ lim k! H k ( ìjí), This result is well known; see Georgii (988), for example. Lemma 3. For all bounded, continuous functions f :! R, the limit in (2) exists and is nite. The map Ë : C b ()! R is convex and continuous, and we have Ë( f ) ˆ sup í2m () f dí H( ìjí) : Here, C b () denotes the space of bounded continuous functions from to R, equipped with the supremum norm, k f k ˆ sup x2 j f (x)j. Proof of Theorem. We have from Lemma 3 that Ë is the convex conjugate of H( ìj:). But H( ìj:) is convex, and lower semicontinuous in the weak topology (see Dupuis and Ellis 997, Lemma.4.3, and recall that M () is Polish since is a Polish space). Hence, H( ìj:) and Ë(:) are convex duals of each other. The large-deviations upper bound for

6 026 A.J. Ganesh and N. O'Connell compact subsets of M () now follows from Dembo and Zeitouni (993, Theorem 4.5.3). But was assumed to be compact, hence M () is compact in the weak topology, so the upper bound holds for all closed sets in M (). We now turn to the proof of the largedeviations lower bound. The weak topology on M () is generated by the sets U ö,x,ä ˆ í 2 M (): ö dí x, ä, ö 2 C b (), x 2 R, ä. 0: Given such a set and å. 0, we can nd a sequence of measurable partitions A k ˆ (A k,..., Ak n k )of, and a sequence of simple functions ö k measurable with respect to ó (A k ), with the following properties: the ó -algebras ó (A k ) increase to B (), the Borel ó algebra on ; for all k and all i 2f,..., n k g, Ai k is a ì-continuity set with non-empty interior; for some K. 0 and all k. K, kö k ök, å. We shall assume that å, ä=3. We now have P( ì n 2 U ö,x,ä ) > P ö k dì n x, ä å, 8k. K: (4) Let ö k ˆ Pn k iˆ ck i A k i. Then ö k dì n ˆ Xn k ci k ì n(ai k ): iˆ It is shown in the proof of Lemma (see equation (3)) that the sequence ( ì n (A k ),..., ì n(a k n k )) n>0 satis es an LDP with rate function I k given by 8 >< P nk jˆ I k (y,..., y nk ) ˆ ì(a j) log ì(a j), if y 2 R n k y and P n k iˆ y i ˆ, j >:, otherwise: It follows from the contraction principle (Dembo and Zeitouni 993, Theorem 4.2.) that P nk iˆ ck i ì n(a k i ) satis es an LDP with rate function J k given by ( ) J k (x) ˆ inf I k (y): Xn k ci k y i ˆ x iˆ ˆ inf H k ( ìjí): í 2 M (), ö k dí ˆ x : In particular, we obtain the large-deviations lower bound,

7 A large-deviation principle for Dirichlet posteriors 027 lim inf n! n log P ö k dì n x, ä å > inffj k (y): jy xj, ä åg ˆ inf H k ( ìjí): í 2 M (), ö k dí x, ä å : (6) Now, kö ö k k, å for all k. K, so we have, for all í 2 M (), that ö dí x, ä 2å ) ö k dí x, ä å, 8k. K: It now follows from (4) and (6) that, for all k. K, lim inf n! n log P( ì n 2 U ö,x,ä ) > inf H k ( ìjí): ö dí x, ä 2å : Hence, we have from Lemma 2 that lim inf n! n log P( ì n 2 U ö,x,ä ) > inf H( ìjí): ö dí x, ä 2å : Since å. 0 was arbitrary, we can let å decrease to zero, to obtain lim inf n! n log P( ì n 2 U ö,x,ä ) > inf H( ìjí): ö dí x, ä, which is the desired large-deviations lower bound for the set U ö,x,ä, with rate function H( ìj:). We have thus established the large-deviations lower bound for a base of the weak topology on M (), and hence for all open sets in this topology. Combined with the upper bound above, this completes the proof of the theorem. h We have established an LDP for the sequence of Dirichlet posterior distributions in the weak topology on M (), with rate function I(í) ˆ H( ìjí). The rate function differs from that in Sanov's theorem in that its argument, í, enters as the second rather than the rst argument in the relative entropy function. (Sanov's theorem says that the empirical distribution of a sequence of independent and identically distributed -valued random variables with common law ì satis es an LDP with rate function J(í) ˆ H(íjì).) Intuitively, this is because in Sanov's theorem we are asking how likely we are to observe í, given that the true distribution is ì, whereas in this paper we are asking how likely it is that the true distribution is í, given that we observe ì. We believe that our result holds for a wider class of priors, of the form described below. Let P be the set of all nite measurable partitions of. ForP 2 P, let ó (P) denote the ó -algebra generated by P. The restriction of a measure í 2 M () to the ó-algebra ó (P) is denoted í P. In other words, í P ˆ E[íjó (P)]. For a prior ð 2 M (M ()) we denote by ð P the corresponding element in M (M (, ó (P))), thus the restriction of ð to the Borel ó-algebra B (M (, ó (P))). We x a subset P 9 of P and say that a prior measure (5)

8 028 A.J. Ganesh and N. O'Connell ð 2 M (M ()) is exchangeable with respect to nite projections in P 9 if, for every P 2 P 9, wehave [ð n ( ì n )] P ˆ ð n P ( ì n,p): Here ð n ( ì n ) denotes the posterior distribution on M (, B ()) corresponding to the prior ð and the empirical distribution ì n ; [ð n ( ì n )] P its restriction to ó (P); and ð n P ( ì n,p) the posterior distribution on M (, ó (P)) corresponding to the prior ð P and the empirical distribution restricted to ó (P). The essential property of the Dirichlet process that we have used in the proof of Theorem is its exchangeability with respect to P 9, where P 9 is the collection of nite partitions consisting of sets with non-empty interiors. This collection is large enough to generate the Borel ó-algebra on. We believe that our methods can be generalized to priors which are exchangeable with respect to nite projections in P 9, for some P 9 which generates the Borel ó-algebra on, although there do seem to be some technical dif culties which we hope to address in future work. An example of a class of priors which are exchangeable with respect to nite projections are the PoÂlya tree distributions studied by Mauldin et al. (992) and Lavine (992), which generalize the Dirichlet process. 3. Application to the gambler's ruin problem Suppose now that is a compact subset of R. As before, fx k g is a sequence of independent, identically distributed random variables with common law ì 2 M (), and we are interested in level-crossing probabilities for the random walk S n ˆ X... X n.forq. 0, denote by R(Q, ì) the probability that the walk ever exceeds the level Q. If a gambler has initial capital Q, and loses amount X k on the k th bet, then R(Q, ì) is the probability of ultimate ruin. If the underlying distribution ì is unknown, the gambler may wish to assess this probability based on experience: this leads to a predictive probability of ruin, given by the formula P n (Q, ì n ) ˆ R(Q, ë)ð n (dë), where, as before, ì n is the empirical distribution of the rst n observations and ð n ð n ( ì n ) is the posterior distribution corresponding to some prior, ð, and the empirical distribution, ì n. A standard re nement of Wald's approximation yields Ce ä( ì)q < R(Q, ì) < e ä( ì)q, for some C. 0, where ä( ì) ˆ sup è > 0: e èx ì(dx) < : Thus,

9 A large-deviation principle for Dirichlet posteriors 029 C e ä(ë)q ð n (dë) < P n (Q, ì n ) < e ä(ë)q ð n (dë): M () M () Now, if ð is the Dirichlet process D (á), parametrized by an arbitrary nite positive measure á whose support is all of, then the sequence ð n obeys an LDP by Theorem, and we can apply Varadhan's lemma (see, for example, Dembo and Zeitouni 993, Theorem 4.3.) to obtain the asymptotic formula, for q. 0, lim n! n log P n(qn, ì n ) ˆ inffh( ìjí) ä(í)q : í 2 supp ðg, on the set ì n! ì. Here, we are using the easy ( is compact) fact that ä : M ()! R is continuous. This formula can be simpli ed in special cases. Its implications for risk and network management are discussed in Ganesh et al. (998). 4. Conclusion In this paper, we establish a large-deviation principle for the sequence of Bayesian posteriors induced by a Dirichlet prior on a compact metric space. Can the result be extended to an arbitrary Polish space? Our approach yields the large-deviation lower bound for arbitrary open subsets of this space, and the upper bound for compact subsets. In other words, we can prove a weak LDP on a Polish space. This could be strengthened to a full LDP if the sequence of Dirichlet posteriors were exponentially tight. However, exponential tightness of this sequence would imply the goodness of the rate function H( ìj:), which we know not to be true in general. For example, take ˆ R, ì ˆ ä 0, the unit mass at 0, and í n ˆ ( 2 )ä 0 ( 2 )ä n. Then H( ìjí n ) ˆ log 2 for all n, but the sequence í n is not tight. This implies that H( ìj:) does not have compact level sets, that is, it is not a good rate function. Hence, our method cannot be easily extended to arbitrary Polish spaces. Finally, while we have worked with Dirichlet process priors, we believe that our approach can be extended to priors with the appropriate exchangeability properties, as discussed at the end of Section 2. However, there do appear to be some technical dif culties with this approach, which we hope to address in future work. Appendix: Proofs Proof of Lemma. Let (A,..., A k ) be a partition of such that the interior of A i is nonempty and that A i is a ì-continuity set for every i ˆ,..., k. Let f be bounded and measurable with respect to the ó-algebra generated by the partition. Then we can write f ˆ Xk c i Ai, (7) for some constants c i, where Ai denotes the indicator of A i. Then, by (), iˆ

10 030 A.J. Ganesh and N. O'Connell " # Ë n ( f ) ˆ log E exp Xk c i ì n (A i ) : (8) By the assumption that each A i has non-empty interior and that the support of á is, we have á n (A j ) :ˆ á(a j ) Xn ä xi (A j ). 0, 8n 2 N, j ˆ,..., k: (9) iˆ We have from the de nition of the Dirichlet distribution that! Z n Z k n ( ì n (A ),..., ì n (A k )) P k iˆ Z,..., i P k n iˆ Z, i n where the Z i n are independent gamma random variables, with Z n j G (á n(a j ), ), and á n is de ned in (9). Here G (á, ) denotes the gamma distribution with shape parameter á and scale parameter. It is straightforward to evaluate the cumulant generating functions of the Z n j.wehave ë j n (è) :ˆ log E[exp(èZ n j )] ˆ á n(a j )log( è), if è,,, otherwise: Since P n iˆ ä x i (A j )=n! ì(a j ) by assumption, we obtain ë j (è) :ˆ lim n! n ë j n (è) ˆ ì(a j)log( è), if è,,, otherwise: Hence, by the GaÈrtner±Ellis theorem (see Dembo and Zeitouni 993, Theorem 2.3.6), the sequence of random variables Z n j =n satis es an LDP in R with rate function ë j which is the convex dual of ë j, that is, 8 ë < j (x) ˆ sup[èx ë j (è)] ˆ x ì(a j) ì(a j )log ì(a j), if x. 0, x (0) è2r :, otherwise: If ì(a j ) ˆ 0, then the assumption of steepness of ë j is not satis ed, so the GaÈrtner±Ellis theorem does not apply. However, it is not hard to verify directly in this case that Z n j =n does indeed satisfy an LDP with the above rate function. Since fz n j, j ˆ,..., kg are independent, fz n j =n, j ˆ,..., kg jointly satisfy an LDP in R k with rate function ë (x) ˆ Pk jˆ ë j (x j), where x ˆ (x,..., x k ) and ë j is given by (0). De ne Y n j ˆ Z n j =P k iˆ Z i n. Since P k iˆ Z i n is strictly positive with probability, the maps (Z n,..., Z k n )! (Y n,..., Y k n ) are almost surely continuous for every n. It follows from the contraction principle (Dembo and Zeitouni 993, Theorem 4.2.) that fy n j, j ˆ,..., kg jointly satisfy an LDP with rate function I given by iˆ

11 A large-deviation principle for Dirichlet posteriors 03 8 < I(y,..., y k ) ˆ inf : X k jˆ ë j (z j): y j ˆ zj P k iˆ z i 9 =, j ˆ,..., k ; : () If y j, 0 for some j, then any z included in the in mum in () must have z i, 0 for some i and so, by (0) I(y) ˆ. Next, if y j ˆ 0 for all j or if P n iˆ y i 6ˆ, then there is no z 2 R k such that y j ˆ z j = P k iˆ z i for all j. Hence I(y), being the in mum of an empty set, is again. In the following, we shall con ne attention to y 2 R k such that y > 0 and P k iˆ y i ˆ. If z 2 R k is such that y j ˆ z j = P k iˆ z i for all j ˆ,..., k, then we can write z ˆ ây for some â. y. Now () gives X k I(y,..., y k ) ˆ inf ë j (ây j): (2) â.0 jˆ Setting the derivative of the sum on the right with respect to â equal to zero yields 0 ˆ Xk y j ì(a j) ˆ â â : jˆ To obtain the last equality, we have used the fact that P k jˆ y j ˆ by assumption, while P k jˆ ì(a j) ˆ asì is a probability distribution and A,..., A k partition. Since each ë j is convex, the above implies that the in mum in (2) is achieved at â ˆ, and I(y) ˆ Xk jˆ ˆ Xk jˆ ë j (y j) ˆ Xk jˆ ì(a j )log ì(a j) y j : y j ì(a j ) ì(a j )log ì(a j) y j The second equality above comes from (0) and the third follows from the fact that ì and y are both probability distributions, and hence sum to. It follows from the preceding discussion that the sequence of random vectors ( ì n (A ),..., ì n (A k )) satis es an LDP in R k with rate function 8 X >< k I(y) ˆ jˆ >:, otherwise: ì(a j )log ì(a j), if y 2 R k Xk and y i ˆ, y j iˆ Observe from (7) that j f dì n j < maxiˆ k jc ij as ì n is a probability distribution. Hence, we have from Varadhan's lemma (Dembo and Zeitouni 993, Theorem 4.3.) and the LDP for ( ì n (A ),..., ì n (A k )) that Ë( f ) :ˆ lim n! n Ë n(nf ) ˆ sup y2r k " # Xk c i y i I(y) : iˆ (3)

12 032 A.J. Ganesh and N. O'Connell Using (3), we can rewrite the above as " # X k Ë( f ) ˆ sup c i í(a i ) Xk ì(a i )log ì(a i) í2m () í(a iˆ iˆ i ) ˆ sup í2m () f dí H k ( ìjí), (4) where H k ( ìjí) is de ned in Lemma 2. We now show that we can replace H k ( ìjí) in the supremum above by H( ìjí). Let í 2 M () be arbitrary. If ì(a i ). 0 and í(a i ) ˆ 0 for some A i, then ì 6 í and H( ìjí) and H k ( ìjí) are both in nite. Hence, such í can be excluded from consideration of the supremum above, and we shall suppose without loss of generality that ì(a i ) ˆ 0 whenever í(a i ) ˆ 0. We now de ne ë 2 M () as follows. Set ë í on A i if ì(a i ) ˆ 0; if ì(a i ). 0, take ë to be absolutely continuous with respect to ì on A i, with Radon± Nikodym derivative dë dì í(a i) ì(a i ). 0: Then ì is absolutely continuous with respect to ë and we have Also, H( ìjë) ˆ ˆ f dí ˆ Xk dì log dì dë ˆ X i:ì(a i ). 0 iˆ X i:ì(a i ). 0 A i dì log dì dë ì(a i ) log ì(a i) í(a i ) ˆ H k( ìjí): (5) c i í(a i ) ˆ Xk iˆ c i ë(a i ) ˆ f dë: (6) Since í 2 M () was arbitrary, we obtain from (5) and (6) that sup f dí H k ( ìjí) < sup f d ë H( ìjë) : í2m () ë2m () The reverse inequality holds as well because H k ( ìjí) < H( ìjí) for all í 2 M () by Lemma 2 (or by the convexity of x 7! x log x on [0, )). Hence, equality holds above and the claim of the lemma follows from (4). h Proof of Lemma 3. Let å. 0 be given, and let f :! R be bounded and continuous. We can nd k. 0 and a simple function g ˆ Pk iˆ c i Ai such that k f gk, å. Since f is continuous, we can in fact choose the A i to be ì-continuity sets with non-empty interiors. Now, by () and the fact that each ì n is a probability distribution,

13 A large-deviation principle for Dirichlet posteriors 033 Ë n (nf ) ˆ log E exp nf dì n < log E exp ng dì n nå ˆ nå Ë n (ng), so that, by (2), lim sup n! n Ë n(nf ) < Ë(g) å: Likewise, lim inf n! Ë n (nf )=n > Ë(g) å. Since å. 0 is arbitrary, it follows that Ë( f ) :ˆ lim n! n Ë n(nf ) exists and is nite for all bounded, continuous f :! R. The arguments above also show that Ë : C b ()! R is continuous, with jë( f ) Ë(g)j < k f gk. For f 2 L (), de ne H ( f ) ˆ sup í2m () f dí H( ìjí), (7) that is, H is the convex conjugate of H( ìj:). Now j f díj < k f k for all í 2 M (), while H( ìj:) is non-negative, with H( ìjì) ˆ 0. Thus, jh ( f )j < k f k. Since H is a convex function with domain L (), which is bounded on the open neighbourhood f f : k f k, g, we have by Rockafellar (974, Theorem 8) that H is continuous on the interior of its domain, which is all of L (). By Lemma, H and Ë agree on functions of the form f ˆ Pk iˆ c i Ai, where the A i partition and each A i is a ì-continuity set with non-empty interior. Since such functions are dense in C b (), Ë was shown to be continuous on C b () and H to be continuous on L () C b (), it follows that Ë ˆ H on all of C b () and, consequently, that Ë is convex. h Acknowledgements The research of this paper was carried out in part while the rst named author was at BRIMS. We would like to thank the referees for their comments, which have helped to improve the presentation of the paper. References Barron, A., Schervish, M.J. and Wasserman, L. (999) The consistency of posterior distributions in nonparametric problems. Ann. Statist., 27, 536±56. Dembo, A. and Zeitouni, O. (993) Large Deviations Techniques and Applications. Boston and London: Jones and Bartlett.

14 034 A.J. Ganesh and N. O'Connell Dupuis, P. and Ellis, R.S. (997) A Weak Convergence Approach to the Theory of Large Deviations. New York: Wiley. Ferguson, T.S. (973) A Bayesian analysis of some non-parametric problems. Ann. Statist.,, 209± 230. Ferguson, T.S. (974) Prior distributions on spaces of probability measures. Ann. Statist., 2, 65±629. Freedman, D. (963) On the asymptotic behavior of Bayes estimates in the discrete case. Ann. Math. Statist., 34, 386±403. Ganesh, A.J. and O'Connell, N. (999) An inverse of Sanov's theorem. Statist. Probab. Lett., 42, 20± 206. Ganesh, A., Green, P., O'Connell, N. and Pitts, S. (998) Bayesian network management, Queueing Systems, 28, 267±282. Georgii, H.-O. (988) Gibbs Measures and Phase Transitions. Berlin: De Gruyter. Ghosal, S., Ghosh, J.K. and van der Vaart, A.W. (998) Convergence rates of posterior distributions. Preprint. Ghosal, S., Ghosh, J.K. and Ramamoorthi, R.V. (999) Consistency issues in Bayesian nonparametrics. In S. Ghosh (ed.), Asymptotics, Nonparametrics and Time Series. New York: Marcel Dekker. Lavine, M. (992) Some aspects of PoÂlya tree distributions for statistical modelling. Ann. Statist., 20, 222±235. Lynch, J. and Sethuraman, J. (987) Large deviations for processes with independent increments. Ann. Probab., 5, 60±627. Mauldin, R.D., Sudderth, W.D. and Williams, S.C. (992) PoÂlya trees and random distributions. Ann. Statist., 20, 203±22. Rockafellar, R.T. (974) Conjugate Duality and Optimization. Philadelphia: Society for Industrial and Applied Mathematics. Schwartz, L. (965) On Bayes procedures. Z. Wahrscheinlichkeitstheorie. Verw. Geb., 4, 0±26. Shen, X. and Wasserman, L. (998) Rates of convergence of posterior distributions. Technical report, Dept. of Statistics, Carnegie-Mellon University. Wasserman, L. (998) Asymptotic properties of nonparametric Bayesian procedures. Technical report, Dept. of Statistics, Carnegie-Mellon University. Received November 998 and revised November 999

An inverse of Sanov s theorem

An inverse of Sanov s theorem An inverse of Sanov s theorem Ayalvadi Ganesh and Neil O Connell BRIMS, Hewlett-Packard Labs, Bristol Abstract Let X k be a sequence of iid random variables taking values in a finite set, and consider

More information

Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon Measures

Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon Measures 2 1 Borel Regular Measures We now state and prove an important regularity property of Borel regular outer measures: Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon

More information

Consistent semiparametric Bayesian inference about a location parameter

Consistent semiparametric Bayesian inference about a location parameter Journal of Statistical Planning and Inference 77 (1999) 181 193 Consistent semiparametric Bayesian inference about a location parameter Subhashis Ghosal a, Jayanta K. Ghosh b; c; 1 d; ; 2, R.V. Ramamoorthi

More information

2a Large deviation principle (LDP) b Contraction principle c Change of measure... 10

2a Large deviation principle (LDP) b Contraction principle c Change of measure... 10 Tel Aviv University, 2007 Large deviations 5 2 Basic notions 2a Large deviation principle LDP......... 5 2b Contraction principle................ 9 2c Change of measure................. 10 The formalism

More information

A Note On Large Deviation Theory and Beyond

A Note On Large Deviation Theory and Beyond A Note On Large Deviation Theory and Beyond Jin Feng In this set of notes, we will develop and explain a whole mathematical theory which can be highly summarized through one simple observation ( ) lim

More information

Introduction and Preliminaries

Introduction and Preliminaries Chapter 1 Introduction and Preliminaries This chapter serves two purposes. The first purpose is to prepare the readers for the more systematic development in later chapters of methods of real analysis

More information

General Theory of Large Deviations

General Theory of Large Deviations Chapter 30 General Theory of Large Deviations A family of random variables follows the large deviations principle if the probability of the variables falling into bad sets, representing large deviations

More information

Spread, estimators and nuisance parameters

Spread, estimators and nuisance parameters Bernoulli 3(3), 1997, 323±328 Spread, estimators and nuisance parameters EDWIN. VAN DEN HEUVEL and CHIS A.J. KLAASSEN Department of Mathematics and Institute for Business and Industrial Statistics, University

More information

Priors for the frequentist, consistency beyond Schwartz

Priors for the frequentist, consistency beyond Schwartz Victoria University, Wellington, New Zealand, 11 January 2016 Priors for the frequentist, consistency beyond Schwartz Bas Kleijn, KdV Institute for Mathematics Part I Introduction Bayesian and Frequentist

More information

Foundations of Nonparametric Bayesian Methods

Foundations of Nonparametric Bayesian Methods 1 / 27 Foundations of Nonparametric Bayesian Methods Part II: Models on the Simplex Peter Orbanz http://mlg.eng.cam.ac.uk/porbanz/npb-tutorial.html 2 / 27 Tutorial Overview Part I: Basics Part II: Models

More information

Large Deviations for Weakly Dependent Sequences: The Gärtner-Ellis Theorem

Large Deviations for Weakly Dependent Sequences: The Gärtner-Ellis Theorem Chapter 34 Large Deviations for Weakly Dependent Sequences: The Gärtner-Ellis Theorem This chapter proves the Gärtner-Ellis theorem, establishing an LDP for not-too-dependent processes taking values in

More information

Measure-theoretic probability

Measure-theoretic probability Measure-theoretic probability Koltay L. VEGTMAM144B November 28, 2012 (VEGTMAM144B) Measure-theoretic probability November 28, 2012 1 / 27 The probability space De nition The (Ω, A, P) measure space is

More information

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3 Index Page 1 Topology 2 1.1 Definition of a topology 2 1.2 Basis (Base) of a topology 2 1.3 The subspace topology & the product topology on X Y 3 1.4 Basic topology concepts: limit points, closed sets,

More information

LECTURE 12 UNIT ROOT, WEAK CONVERGENCE, FUNCTIONAL CLT

LECTURE 12 UNIT ROOT, WEAK CONVERGENCE, FUNCTIONAL CLT MARCH 29, 26 LECTURE 2 UNIT ROOT, WEAK CONVERGENCE, FUNCTIONAL CLT (Davidson (2), Chapter 4; Phillips Lectures on Unit Roots, Cointegration and Nonstationarity; White (999), Chapter 7) Unit root processes

More information

Gärtner-Ellis Theorem and applications.

Gärtner-Ellis Theorem and applications. Gärtner-Ellis Theorem and applications. Elena Kosygina July 25, 208 In this lecture we turn to the non-i.i.d. case and discuss Gärtner-Ellis theorem. As an application, we study Curie-Weiss model with

More information

Economics Bulletin, 2012, Vol. 32 No. 1 pp Introduction. 2. The preliminaries

Economics Bulletin, 2012, Vol. 32 No. 1 pp Introduction. 2. The preliminaries 1. Introduction In this paper we reconsider the problem of axiomatizing scoring rules. Early results on this problem are due to Smith (1973) and Young (1975). They characterized social welfare and social

More information

Metric Spaces and Topology

Metric Spaces and Topology Chapter 2 Metric Spaces and Topology From an engineering perspective, the most important way to construct a topology on a set is to define the topology in terms of a metric on the set. This approach underlies

More information

Near convexity, metric convexity, and convexity

Near convexity, metric convexity, and convexity Near convexity, metric convexity, and convexity Fred Richman Florida Atlantic University Boca Raton, FL 33431 28 February 2005 Abstract It is shown that a subset of a uniformly convex normed space is nearly

More information

Columbia University. Department of Economics Discussion Paper Series. The Knob of the Discord. Massimiliano Amarante Fabio Maccheroni

Columbia University. Department of Economics Discussion Paper Series. The Knob of the Discord. Massimiliano Amarante Fabio Maccheroni Columbia University Department of Economics Discussion Paper Series The Knob of the Discord Massimiliano Amarante Fabio Maccheroni Discussion Paper No.: 0405-14 Department of Economics Columbia University

More information

Large Deviations Techniques and Applications

Large Deviations Techniques and Applications Amir Dembo Ofer Zeitouni Large Deviations Techniques and Applications Second Edition With 29 Figures Springer Contents Preface to the Second Edition Preface to the First Edition vii ix 1 Introduction 1

More information

The Frobenius{Perron Operator

The Frobenius{Perron Operator (p.75) CHPTER 4 The Frobenius{Perron Operator The hero of this book is the Frobenius{Perron operator. With this powerful tool we shall study absolutely continuous invariant measures, their existence and

More information

PROBLEMS. (b) (Polarization Identity) Show that in any inner product space

PROBLEMS. (b) (Polarization Identity) Show that in any inner product space 1 Professor Carl Cowen Math 54600 Fall 09 PROBLEMS 1. (Geometry in Inner Product Spaces) (a) (Parallelogram Law) Show that in any inner product space x + y 2 + x y 2 = 2( x 2 + y 2 ). (b) (Polarization

More information

4 Sums of Independent Random Variables

4 Sums of Independent Random Variables 4 Sums of Independent Random Variables Standing Assumptions: Assume throughout this section that (,F,P) is a fixed probability space and that X 1, X 2, X 3,... are independent real-valued random variables

More information

Weak convergence. Amsterdam, 13 November Leiden University. Limit theorems. Shota Gugushvili. Generalities. Criteria

Weak convergence. Amsterdam, 13 November Leiden University. Limit theorems. Shota Gugushvili. Generalities. Criteria Weak Leiden University Amsterdam, 13 November 2013 Outline 1 2 3 4 5 6 7 Definition Definition Let µ, µ 1, µ 2,... be probability measures on (R, B). It is said that µ n converges weakly to µ, and we then

More information

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory Part V 7 Introduction: What are measures and why measurable sets Lebesgue Integration Theory Definition 7. (Preliminary). A measure on a set is a function :2 [ ] such that. () = 2. If { } = is a finite

More information

Bayesian Consistency for Markov Models

Bayesian Consistency for Markov Models Bayesian Consistency for Markov Models Isadora Antoniano-Villalobos Bocconi University, Milan, Italy. Stephen G. Walker University of Texas at Austin, USA. Abstract We consider sufficient conditions for

More information

If Y and Y 0 satisfy (1-2), then Y = Y 0 a.s.

If Y and Y 0 satisfy (1-2), then Y = Y 0 a.s. 20 6. CONDITIONAL EXPECTATION Having discussed at length the limit theory for sums of independent random variables we will now move on to deal with dependent random variables. An important tool in this

More information

Chapter 4. Measure Theory. 1. Measure Spaces

Chapter 4. Measure Theory. 1. Measure Spaces Chapter 4. Measure Theory 1. Measure Spaces Let X be a nonempty set. A collection S of subsets of X is said to be an algebra on X if S has the following properties: 1. X S; 2. if A S, then A c S; 3. if

More information

2 Interval-valued Probability Measures

2 Interval-valued Probability Measures Interval-Valued Probability Measures K. David Jamison 1, Weldon A. Lodwick 2 1. Watson Wyatt & Company, 950 17th Street,Suite 1400, Denver, CO 80202, U.S.A 2. Department of Mathematics, Campus Box 170,

More information

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time ASYMPTOTIC PROPERTIES OF THE GMLE WITH CASE 2 INTERVAL-CENSORED DATA By Qiqing Yu a;1 Anton Schick a, Linxiong Li b;2 and George Y. C. Wong c;3 a Dept. of Mathematical Sciences, Binghamton University,

More information

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond Measure Theory on Topological Spaces Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond May 22, 2011 Contents 1 Introduction 2 1.1 The Riemann Integral........................................ 2 1.2 Measurable..............................................

More information

A SHORT NOTE ON STRASSEN S THEOREMS

A SHORT NOTE ON STRASSEN S THEOREMS DEPARTMENT OF MATHEMATICS TECHNICAL REPORT A SHORT NOTE ON STRASSEN S THEOREMS DR. MOTOYA MACHIDA OCTOBER 2004 No. 2004-6 TENNESSEE TECHNOLOGICAL UNIVERSITY Cookeville, TN 38505 A Short note on Strassen

More information

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Ann Inst Stat Math (2009) 61:773 787 DOI 10.1007/s10463-008-0172-6 Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Taisuke Otsu Received: 1 June 2007 / Revised:

More information

Max-Planck-Institut fur Mathematik in den Naturwissenschaften Leipzig Uniformly distributed measures in Euclidean spaces by Bernd Kirchheim and David Preiss Preprint-Nr.: 37 1998 Uniformly Distributed

More information

Mean-Variance Utility

Mean-Variance Utility Mean-Variance Utility Yutaka Nakamura University of Tsukuba Graduate School of Systems and Information Engineering Division of Social Systems and Management -- Tennnoudai, Tsukuba, Ibaraki 305-8573, Japan

More information

Week 5 Lectures 13-15

Week 5 Lectures 13-15 Week 5 Lectures 13-15 Lecture 13 Definition 29 Let Y be a subset X. A subset A Y is open in Y if there exists an open set U in X such that A = U Y. It is not difficult to show that the collection of all

More information

Problem Set 2: Solutions Math 201A: Fall 2016

Problem Set 2: Solutions Math 201A: Fall 2016 Problem Set 2: s Math 201A: Fall 2016 Problem 1. (a) Prove that a closed subset of a complete metric space is complete. (b) Prove that a closed subset of a compact metric space is compact. (c) Prove that

More information

arxiv: v1 [math.ds] 31 Jul 2018

arxiv: v1 [math.ds] 31 Jul 2018 arxiv:1807.11801v1 [math.ds] 31 Jul 2018 On the interior of projections of planar self-similar sets YUKI TAKAHASHI Abstract. We consider projections of planar self-similar sets, and show that one can create

More information

SOME NEW EXAMPLES OF INFINITE IMAGE PARTITION REGULAR MATRICES

SOME NEW EXAMPLES OF INFINITE IMAGE PARTITION REGULAR MATRICES #A5 INTEGERS 9 (29) SOME NEW EXAMPLES OF INFINITE IMAGE PARTITION REGULAR MATRIES Neil Hindman Department of Mathematics, Howard University, Washington, D nhindman@aolcom Dona Strauss Department of Pure

More information

The small ball property in Banach spaces (quantitative results)

The small ball property in Banach spaces (quantitative results) The small ball property in Banach spaces (quantitative results) Ehrhard Behrends Abstract A metric space (M, d) is said to have the small ball property (sbp) if for every ε 0 > 0 there exists a sequence

More information

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability... Functional Analysis Franck Sueur 2018-2019 Contents 1 Metric spaces 1 1.1 Definitions........................................ 1 1.2 Completeness...................................... 3 1.3 Compactness......................................

More information

Topics in Mathematical Economics. Atsushi Kajii Kyoto University

Topics in Mathematical Economics. Atsushi Kajii Kyoto University Topics in Mathematical Economics Atsushi Kajii Kyoto University 26 June 2018 2 Contents 1 Preliminary Mathematics 5 1.1 Topology.................................. 5 1.2 Linear Algebra..............................

More information

ITERATED FUNCTION SYSTEMS WITH CONTINUOUS PLACE DEPENDENT PROBABILITIES

ITERATED FUNCTION SYSTEMS WITH CONTINUOUS PLACE DEPENDENT PROBABILITIES UNIVERSITATIS IAGELLONICAE ACTA MATHEMATICA, FASCICULUS XL 2002 ITERATED FUNCTION SYSTEMS WITH CONTINUOUS PLACE DEPENDENT PROBABILITIES by Joanna Jaroszewska Abstract. We study the asymptotic behaviour

More information

Convergence of Banach valued stochastic processes of Pettis and McShane integrable functions

Convergence of Banach valued stochastic processes of Pettis and McShane integrable functions Convergence of Banach valued stochastic processes of Pettis and McShane integrable functions V. Marraffa Department of Mathematics, Via rchirafi 34, 90123 Palermo, Italy bstract It is shown that if (X

More information

FRAGMENTABILITY OF ROTUND BANACH SPACES. Scott Sciffer. lim <Jl(x+A.y)- $(x) A-tO A

FRAGMENTABILITY OF ROTUND BANACH SPACES. Scott Sciffer. lim <Jl(x+A.y)- $(x) A-tO A 222 FRAGMENTABILITY OF ROTUND BANACH SPACES Scott Sciffer Introduction A topological. space X is said to be fragmented by a metric p if for every e > 0 and every subset Y of X there exists a nonempty relatively

More information

Subdifferential representation of convex functions: refinements and applications

Subdifferential representation of convex functions: refinements and applications Subdifferential representation of convex functions: refinements and applications Joël Benoist & Aris Daniilidis Abstract Every lower semicontinuous convex function can be represented through its subdifferential

More information

Extreme points of compact convex sets

Extreme points of compact convex sets Extreme points of compact convex sets In this chapter, we are going to show that compact convex sets are determined by a proper subset, the set of its extreme points. Let us start with the main definition.

More information

LECTURE 15: COMPLETENESS AND CONVEXITY

LECTURE 15: COMPLETENESS AND CONVEXITY LECTURE 15: COMPLETENESS AND CONVEXITY 1. The Hopf-Rinow Theorem Recall that a Riemannian manifold (M, g) is called geodesically complete if the maximal defining interval of any geodesic is R. On the other

More information

Topics in Mathematical Economics. Atsushi Kajii Kyoto University

Topics in Mathematical Economics. Atsushi Kajii Kyoto University Topics in Mathematical Economics Atsushi Kajii Kyoto University 25 November 2018 2 Contents 1 Preliminary Mathematics 5 1.1 Topology.................................. 5 1.2 Linear Algebra..............................

More information

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define Homework, Real Analysis I, Fall, 2010. (1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define ρ(f, g) = 1 0 f(x) g(x) dx. Show that

More information

Packing-Dimension Profiles and Fractional Brownian Motion

Packing-Dimension Profiles and Fractional Brownian Motion Under consideration for publication in Math. Proc. Camb. Phil. Soc. 1 Packing-Dimension Profiles and Fractional Brownian Motion By DAVAR KHOSHNEVISAN Department of Mathematics, 155 S. 1400 E., JWB 233,

More information

Entrance from 0 for increasing semistable Markov processes

Entrance from 0 for increasing semistable Markov processes Bernoulli 8 2), 22, 195±25 Entrance from for increasing semistable Markov processes JEAN BERTOIN 1 and MARIA-EMILIA CABALLERO 2 1 Laboratoire de ProbabiliteÂs et Institut Universitaire de France, UniversiteÂ

More information

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε 1. Continuity of convex functions in normed spaces In this chapter, we consider continuity properties of real-valued convex functions defined on open convex sets in normed spaces. Recall that every infinitedimensional

More information

Stochastic integral. Introduction. Ito integral. References. Appendices Stochastic Calculus I. Geneviève Gauthier.

Stochastic integral. Introduction. Ito integral. References. Appendices Stochastic Calculus I. Geneviève Gauthier. Ito 8-646-8 Calculus I Geneviève Gauthier HEC Montréal Riemann Ito The Ito The theories of stochastic and stochastic di erential equations have initially been developed by Kiyosi Ito around 194 (one of

More information

MATHS 730 FC Lecture Notes March 5, Introduction

MATHS 730 FC Lecture Notes March 5, Introduction 1 INTRODUCTION MATHS 730 FC Lecture Notes March 5, 2014 1 Introduction Definition. If A, B are sets and there exists a bijection A B, they have the same cardinality, which we write as A, #A. If there exists

More information

Chapter 3 Continuous Functions

Chapter 3 Continuous Functions Continuity is a very important concept in analysis. The tool that we shall use to study continuity will be sequences. There are important results concerning the subsets of the real numbers and the continuity

More information

Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables

Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables Jaap Geluk 1 and Qihe Tang 2 1 Department of Mathematics The Petroleum Institute P.O. Box 2533, Abu Dhabi, United Arab

More information

δ xj β n = 1 n Theorem 1.1. The sequence {P n } satisfies a large deviation principle on M(X) with the rate function I(β) given by

δ xj β n = 1 n Theorem 1.1. The sequence {P n } satisfies a large deviation principle on M(X) with the rate function I(β) given by . Sanov s Theorem Here we consider a sequence of i.i.d. random variables with values in some complete separable metric space X with a common distribution α. Then the sample distribution β n = n maps X

More information

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping. Minimization Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping. 1 Minimization A Topological Result. Let S be a topological

More information

Banach-Alaoglu theorems

Banach-Alaoglu theorems Banach-Alaoglu theorems László Erdős Jan 23, 2007 1 Compactness revisited In a topological space a fundamental property is the compactness (Kompaktheit). We recall the definition: Definition 1.1 A subset

More information

An introduction to some aspects of functional analysis

An introduction to some aspects of functional analysis An introduction to some aspects of functional analysis Stephen Semmes Rice University Abstract These informal notes deal with some very basic objects in functional analysis, including norms and seminorms

More information

AN INTRODUCTION TO GEOMETRIC MEASURE THEORY AND AN APPLICATION TO MINIMAL SURFACES ( DRAFT DOCUMENT) Academic Year 2016/17 Francesco Serra Cassano

AN INTRODUCTION TO GEOMETRIC MEASURE THEORY AND AN APPLICATION TO MINIMAL SURFACES ( DRAFT DOCUMENT) Academic Year 2016/17 Francesco Serra Cassano AN INTRODUCTION TO GEOMETRIC MEASURE THEORY AND AN APPLICATION TO MINIMAL SURFACES ( DRAFT DOCUMENT) Academic Year 2016/17 Francesco Serra Cassano Contents I. Recalls and complements of measure theory.

More information

E cient Simulation and Conditional Functional Limit Theorems for Ruinous Heavy-tailed Random Walks

E cient Simulation and Conditional Functional Limit Theorems for Ruinous Heavy-tailed Random Walks E cient Simulation and Conditional Functional Limit Theorems for Ruinous Heavy-tailed Random Walks Jose Blanchet and Jingchen Liu y June 14, 21 Abstract The contribution of this paper is to introduce change

More information

SEPARABILITY OF STOCHASTIC PROCESSES

SEPARABILITY OF STOCHASTIC PROCESSES SEPARABILITY OF STOCHASTIC PROCESSES S. H. COLEMAN1 This paper applies a generalization of the Daniell theory of integration to stochastic processes with values in a compact Hausdorff space. A consistent

More information

On Kusuoka Representation of Law Invariant Risk Measures

On Kusuoka Representation of Law Invariant Risk Measures MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 1, February 213, pp. 142 152 ISSN 364-765X (print) ISSN 1526-5471 (online) http://dx.doi.org/1.1287/moor.112.563 213 INFORMS On Kusuoka Representation of

More information

COMPLETE QTH MOMENT CONVERGENCE OF WEIGHTED SUMS FOR ARRAYS OF ROW-WISE EXTENDED NEGATIVELY DEPENDENT RANDOM VARIABLES

COMPLETE QTH MOMENT CONVERGENCE OF WEIGHTED SUMS FOR ARRAYS OF ROW-WISE EXTENDED NEGATIVELY DEPENDENT RANDOM VARIABLES Hacettepe Journal of Mathematics and Statistics Volume 43 2 204, 245 87 COMPLETE QTH MOMENT CONVERGENCE OF WEIGHTED SUMS FOR ARRAYS OF ROW-WISE EXTENDED NEGATIVELY DEPENDENT RANDOM VARIABLES M. L. Guo

More information

Real Analysis: Homework # 12 Fall Professor: Sinan Gunturk Fall Term 2008

Real Analysis: Homework # 12 Fall Professor: Sinan Gunturk Fall Term 2008 Eduardo Corona eal Analysis: Homework # 2 Fall 2008 Professor: Sinan Gunturk Fall Term 2008 #3 (p.298) Let X be the set of rational numbers and A the algebra of nite unions of intervals of the form (a;

More information

Fragmentability and σ-fragmentability

Fragmentability and σ-fragmentability F U N D A M E N T A MATHEMATICAE 143 (1993) Fragmentability and σ-fragmentability by J. E. J a y n e (London), I. N a m i o k a (Seattle) and C. A. R o g e r s (London) Abstract. Recent work has studied

More information

Serena Doria. Department of Sciences, University G.d Annunzio, Via dei Vestini, 31, Chieti, Italy. Received 7 July 2008; Revised 25 December 2008

Serena Doria. Department of Sciences, University G.d Annunzio, Via dei Vestini, 31, Chieti, Italy. Received 7 July 2008; Revised 25 December 2008 Journal of Uncertain Systems Vol.4, No.1, pp.73-80, 2010 Online at: www.jus.org.uk Different Types of Convergence for Random Variables with Respect to Separately Coherent Upper Conditional Probabilities

More information

MATH 202B - Problem Set 5

MATH 202B - Problem Set 5 MATH 202B - Problem Set 5 Walid Krichene (23265217) March 6, 2013 (5.1) Show that there exists a continuous function F : [0, 1] R which is monotonic on no interval of positive length. proof We know there

More information

Stochastic Processes

Stochastic Processes Introduction and Techniques Lecture 4 in Financial Mathematics UiO-STK4510 Autumn 2015 Teacher: S. Ortiz-Latorre Stochastic Processes 1 Stochastic Processes De nition 1 Let (E; E) be a measurable space

More information

Simple Abelian Topological Groups. Luke Dominic Bush Hipwood. Mathematics Institute

Simple Abelian Topological Groups. Luke Dominic Bush Hipwood. Mathematics Institute M A E NS G I T A T MOLEM UNIVERSITAS WARWICENSIS Simple Abelian Topological Groups by Luke Dominic Bush Hipwood supervised by Dr Dmitriy Rumynin 4th Year Project Submitted to The University of Warwick

More information

Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012

Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012 Instructions: Answer all of the problems. Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012 Definitions (2 points each) 1. State the definition of a metric space. A metric space (X, d) is set

More information

A SHORT INTRODUCTION TO BANACH LATTICES AND

A SHORT INTRODUCTION TO BANACH LATTICES AND CHAPTER A SHORT INTRODUCTION TO BANACH LATTICES AND POSITIVE OPERATORS In tis capter we give a brief introduction to Banac lattices and positive operators. Most results of tis capter can be found, e.g.,

More information

Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces

Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, 2013 1 Metric Spaces Let X be an arbitrary set. A function d : X X R is called a metric if it satisfies the folloing

More information

WEAK CONVERGENCE OF RESOLVENTS OF MAXIMAL MONOTONE OPERATORS AND MOSCO CONVERGENCE

WEAK CONVERGENCE OF RESOLVENTS OF MAXIMAL MONOTONE OPERATORS AND MOSCO CONVERGENCE Fixed Point Theory, Volume 6, No. 1, 2005, 59-69 http://www.math.ubbcluj.ro/ nodeacj/sfptcj.htm WEAK CONVERGENCE OF RESOLVENTS OF MAXIMAL MONOTONE OPERATORS AND MOSCO CONVERGENCE YASUNORI KIMURA Department

More information

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization MATHEMATICS OF OPERATIONS RESEARCH Vol. 29, No. 3, August 2004, pp. 479 491 issn 0364-765X eissn 1526-5471 04 2903 0479 informs doi 10.1287/moor.1040.0103 2004 INFORMS Some Properties of the Augmented

More information

Sample path large deviations of a Gaussian process with stationary increments and regularily varying variance

Sample path large deviations of a Gaussian process with stationary increments and regularily varying variance Sample path large deviations of a Gaussian process with stationary increments and regularily varying variance Tommi Sottinen Department of Mathematics P. O. Box 4 FIN-0004 University of Helsinki Finland

More information

Introduction to Information Entropy Adapted from Papoulis (1991)

Introduction to Information Entropy Adapted from Papoulis (1991) Introduction to Information Entropy Adapted from Papoulis (1991) Federico Lombardo Papoulis, A., Probability, Random Variables and Stochastic Processes, 3rd edition, McGraw ill, 1991. 1 1. INTRODUCTION

More information

Maths 212: Homework Solutions

Maths 212: Homework Solutions Maths 212: Homework Solutions 1. The definition of A ensures that x π for all x A, so π is an upper bound of A. To show it is the least upper bound, suppose x < π and consider two cases. If x < 1, then

More information

A Note on the Convergence of Random Riemann and Riemann-Stieltjes Sums to the Integral

A Note on the Convergence of Random Riemann and Riemann-Stieltjes Sums to the Integral Gen. Math. Notes, Vol. 22, No. 2, June 2014, pp.1-6 ISSN 2219-7184; Copyright c ICSRS Publication, 2014 www.i-csrs.org Available free online at http://www.geman.in A Note on the Convergence of Random Riemann

More information

Brownian Motion and Conditional Probability

Brownian Motion and Conditional Probability Math 561: Theory of Probability (Spring 2018) Week 10 Brownian Motion and Conditional Probability 10.1 Standard Brownian Motion (SBM) Brownian motion is a stochastic process with both practical and theoretical

More information

A Note on the Strong Law of Large Numbers

A Note on the Strong Law of Large Numbers JIRSS (2005) Vol. 4, No. 2, pp 107-111 A Note on the Strong Law of Large Numbers V. Fakoor, H. A. Azarnoosh Department of Statistics, School of Mathematical Sciences, Ferdowsi University of Mashhad, Iran.

More information

Banach Spaces II: Elementary Banach Space Theory

Banach Spaces II: Elementary Banach Space Theory BS II c Gabriel Nagy Banach Spaces II: Elementary Banach Space Theory Notes from the Functional Analysis Course (Fall 07 - Spring 08) In this section we introduce Banach spaces and examine some of their

More information

2. The Concept of Convergence: Ultrafilters and Nets

2. The Concept of Convergence: Ultrafilters and Nets 2. The Concept of Convergence: Ultrafilters and Nets NOTE: AS OF 2008, SOME OF THIS STUFF IS A BIT OUT- DATED AND HAS A FEW TYPOS. I WILL REVISE THIS MATE- RIAL SOMETIME. In this lecture we discuss two

More information

On bounded redundancy of universal codes

On bounded redundancy of universal codes On bounded redundancy of universal codes Łukasz Dębowski Institute of omputer Science, Polish Academy of Sciences ul. Jana Kazimierza 5, 01-248 Warszawa, Poland Abstract onsider stationary ergodic measures

More information

Gaussian Random Fields

Gaussian Random Fields Gaussian Random Fields Mini-Course by Prof. Voijkan Jaksic Vincent Larochelle, Alexandre Tomberg May 9, 009 Review Defnition.. Let, F, P ) be a probability space. Random variables {X,..., X n } are called

More information

Ergodic Theory (Lecture Notes) Joan Andreu Lázaro Camí

Ergodic Theory (Lecture Notes) Joan Andreu Lázaro Camí Ergodic Theory (Lecture Notes) Joan Andreu Lázaro Camí Department of Mathematics Imperial College London January 2010 Contents 1 Introduction 2 2 Measure Theory 5 2.1 Motivation: Positive measures and

More information

ON A HYBRID PROXIMAL POINT ALGORITHM IN BANACH SPACES

ON A HYBRID PROXIMAL POINT ALGORITHM IN BANACH SPACES U.P.B. Sci. Bull., Series A, Vol. 80, Iss. 3, 2018 ISSN 1223-7027 ON A HYBRID PROXIMAL POINT ALGORITHM IN BANACH SPACES Vahid Dadashi 1 In this paper, we introduce a hybrid projection algorithm for a countable

More information

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS Bendikov, A. and Saloff-Coste, L. Osaka J. Math. 4 (5), 677 7 ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS ALEXANDER BENDIKOV and LAURENT SALOFF-COSTE (Received March 4, 4)

More information

The Skorokhod reflection problem for functions with discontinuities (contractive case)

The Skorokhod reflection problem for functions with discontinuities (contractive case) The Skorokhod reflection problem for functions with discontinuities (contractive case) TAKIS KONSTANTOPOULOS Univ. of Texas at Austin Revised March 1999 Abstract Basic properties of the Skorokhod reflection

More information

Invariant measures for iterated function systems

Invariant measures for iterated function systems ANNALES POLONICI MATHEMATICI LXXV.1(2000) Invariant measures for iterated function systems by Tomasz Szarek (Katowice and Rzeszów) Abstract. A new criterion for the existence of an invariant distribution

More information

3 Integration and Expectation

3 Integration and Expectation 3 Integration and Expectation 3.1 Construction of the Lebesgue Integral Let (, F, µ) be a measure space (not necessarily a probability space). Our objective will be to define the Lebesgue integral R fdµ

More information

Miscellanea Kernel density estimation and marginalization consistency

Miscellanea Kernel density estimation and marginalization consistency Biometrika (1991), 78, 2, pp. 421-5 Printed in Great Britain Miscellanea Kernel density estimation and marginalization consistency BY MIKE WEST Institute of Statistics and Decision Sciences, Duke University,

More information

Stability and Sensitivity of the Capacity in Continuous Channels. Malcolm Egan

Stability and Sensitivity of the Capacity in Continuous Channels. Malcolm Egan Stability and Sensitivity of the Capacity in Continuous Channels Malcolm Egan Univ. Lyon, INSA Lyon, INRIA 2019 European School of Information Theory April 18, 2019 1 / 40 Capacity of Additive Noise Models

More information

Random Process Lecture 1. Fundamentals of Probability

Random Process Lecture 1. Fundamentals of Probability Random Process Lecture 1. Fundamentals of Probability Husheng Li Min Kao Department of Electrical Engineering and Computer Science University of Tennessee, Knoxville Spring, 2016 1/43 Outline 2/43 1 Syllabus

More information

Functional Analysis: Assignment Set # 3 Spring 2009 Professor: Fengbo Hang February 25, 2009

Functional Analysis: Assignment Set # 3 Spring 2009 Professor: Fengbo Hang February 25, 2009 duardo Corona Functional Analysis: Assignment Set # 3 Spring 9 Professor: Fengbo Hang February 5, 9 C6. Show that a norm that satis es the parallelogram identity: comes from a scalar product. kx + yk +

More information

A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH

A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH AMIR DEMBO AND EYAL LUBETZKY Abstract. Starting with the large deviation principle (ldp) for the Erdős Rényi binomial random graph G(n,

More information

at time t, in dimension d. The index i varies in a countable set I. We call configuration the family, denoted generically by Φ: U (x i (t) x j (t))

at time t, in dimension d. The index i varies in a countable set I. We call configuration the family, denoted generically by Φ: U (x i (t) x j (t)) Notations In this chapter we investigate infinite systems of interacting particles subject to Newtonian dynamics Each particle is characterized by its position an velocity x i t, v i t R d R d at time

More information

CHAPTER I THE RIESZ REPRESENTATION THEOREM

CHAPTER I THE RIESZ REPRESENTATION THEOREM CHAPTER I THE RIESZ REPRESENTATION THEOREM We begin our study by identifying certain special kinds of linear functionals on certain special vector spaces of functions. We describe these linear functionals

More information