PRESERVATION THEOREMS FOR GLIVENKO-CANTELLI AND UNIFORM GLIVENKO-CANTELLI CLASSES AAD VAN DER VAART AND JON A. WELLNER ABSTRACT We show that the P Gli

Size: px
Start display at page:

Download "PRESERVATION THEOREMS FOR GLIVENKO-CANTELLI AND UNIFORM GLIVENKO-CANTELLI CLASSES AAD VAN DER VAART AND JON A. WELLNER ABSTRACT We show that the P Gli"

Transcription

1 PRESERVATION THEOREMS FOR GLIVENKO-CANTELLI AND UNIFORM GLIVENKO-CANTELLI CLASSES AAD VAN DER VAART AND JON A. WELLNER ABSTRACT We show that the P Glivenko property of classes of functions F ;::: ;F k is preserved by a continuous function ' from R k to R in the sense that the new class of functions x! '(f (x);::: ;fk(x)); fi 2F i; i=;::: ;k is again a Glivenko-Cantelli class of functions if it has an integrable envelope. We also prove an analogous result for preservation of the uniform Glivenko-Cantelli property. Corollaries of the main theorem include two preservation theorems of Dudley (998). We apply the main result to reprove a theorem of Schick and Yu (999) concerning consistency of the NPMLE in a model for mixed case" interval censoring. Finally a version of the consistency result of Schick and Yu (999) is established for a general model for mixed case interval censoring" in which a general sample space Y is partitioned into sets which are members of some VC-class C of subsets of Y.. Glivenko - Cantelli Theorems. Let (X ; A;P) be a probability space, and suppose that F ρ L (P ). For such a class of functions, let F 0;P ff Pf : f 2 Fg. We also let F F (x) sup f2f jf(x)j, the envelope function of F. Ifjfj»F for all f 2F with F measurable, then F is an envelope for F. Suppose that X ;X 2 ;::: are i.i.d. P. Glivenko-Cantelli theorems give conditions under which the empirical measure Pn converges uniformly to P over a class F, either in probability (in which case we say that F is a weak Glivenko-Cantelli class for P ) or almost surely: kpn P k F sup jpnf Pfj! a:s: Λ 0; f2f in this case wesay that wesay that F is a strong Glivenko-Cantelli class for P. Useful sufficient conditions for a class F to be a strong Glivenko-Cantelli class for P are that it has an integrable envelope and either log N [] (ffl; F;L (P)) < for all ffl>0

2 2 Glivenko-Cantelli preservation theorems or log N(ffl; F M ;L (Pn)) = o Λ P (n) for every ffl>0 and 0 <M<: Here F M = ff [F»M] : f 2Fg. In the second case some additional measurability conditions are necessary; see Van der Vaart and Wellner (996), Chapter 2.4, pages In particular, the condition in Theorem 2.4.3, page 23, is shown by Giné and inn (984) to be both necessary and sufficient, under measurability assumptions, for the class F to be a strong Glivenko-Cantelli class (see Talagrand (996) for another proof). Talagrand (987b) gives necessary and sufficient conditions for the Glivenko-Cantelli theorem without any measurability hypotheses. Theorem. (Giné and inn, 984). Suppose that F is L (P ) bounded and nearly linearly supremum measurable for P ; in particular this holds if F is image admissible Suslin. Then the following are equivalent: (a) F is a strong Glivenko-Cantelli class for P. (b) F has an envelope function F 2 L (P ) and the truncated classes F M ffff»mg: f2fgsatisfy n EΛ log N(ffl; F M ;L r (Pn))! 0 for all ffl>0; and for all M 2 (0; ) for some (all) r 2 (0; ] where kfk Lr(P ) kfk P;r fp(jfj r )g r ^. 2. Preservation of the Glivenko - Cantelli Property. Our goal in this section is to present several results concerning the stability of the Glivenko-Cantelli property of one or more classes of functions under composition with functions '. A theorem which motivated our interest is the following result of Dudley (998a). Theorem 2. (Dudley, 998a). Suppose that F is a strong Glivenko-Cantelli class for P with PF <, J is a possibly unbounded interval including the ranges of all f 2F,'is continuous and monotone on J, and for some finite constants c; d, j'(y)j»cjyj+dfor all y 2 J. Then '(F) is also a strong Glivenko-Cantelli class for P. Dudley (998a) proves this via the characterization of Glivenko-Cantelli classes due to Talagrand (987b). Dudley (998b) also uses Talagrand's characterization to prove the following interesting proposition. Proposition. (Dudley, 998b). Suppose that F is a strong Glivenko- Cantelli class for P with PF <, and g is a fixed bounded function (kgk < ). Then the class of functions g F fg f :f 2Fgis a strong Glivenko-Cantelli class for P. Yet another proposition in this same vein is:

3 van der Vaart and Wellner 3 Proposition 2. (Giné and inn, 984). Suppose that F is a uniformly bounded strong Glivenko-Cantelli class for P, and g 2 L (P ) is a fixed function. Then the class of functions g F fg f : f 2 Fg is a strong Glivenko-Cantelli class for P. Given classes F ;::: ;F k of functions f i : X! R and a function ' : R k! R, let '(F ;::: ;F k ) be the class of functions x! '(f (x);::: ;f k (x)) ; where f i 2F i,i=;::: ;k. Theorem 2 and Propositions and 2 are all corollaries of the following theorem. Theorem 3. Suppose that F ;::: ;F k are P Glivenko-Cantelli classes of functions, and that ' : R k! R is continuous. Then H '(F ;::: ;F k )is P Glivenko-Cantelli provided that it has an integrable envelope function. Proof. We first assume that the classes of functions F i are appropriately measurable. Let F ;::: ;F k and H be integrable envelopes for F ;::: ;F k and H respectively, and set F = F _ :::_F k.for M 2 (0; ), define Now H M f'(f) [F»M] : f =(f ;::: ;f k )2F F 2 F k Fg: k(pn P )'(f)k F» (Pn + P )H [F>M] +k(pn P)hk HM : The expectation of the first term on the right converges to 0 as M!. Hence it suffices to show that H M is P Glivenko-Cantelli for every fixed M. Let ffi = ffi(ffl) betheffiof Lemma 2 below for ' :[ M;M] k! R, ffl>0, and k kthe L -norm k k. Then for any (f j ;g j )2F j,j=;::: ;k, implies that It follows that Pnjf j g j j [Fj»M]» ffi ; j =;::: ;k k Pnj'(f ;::: ;f k ) '(g ;::: ;g k )j [F»M]»ffl: N(ffl; H M ;L (Pn))» ky j= N( ffi k ; F j [Fj»M];L (Pn)) : Thus E Λ log N(ffl; H M ;L (Pn)) = o(n) for every ffl>0, M<. This implies that E Λ log N(ffl; (H M ) N ;L (Pn)) = o(n) for (H M ) N the functions hfh» Ng for h 2 H M.Thus H M is strong Glivenko-Cantelli for P by Theorem. This concludes the proof that H = '(F) is weak Glivenko-Cantelli. Because it has an integrable envelope, it is strong Glivenko-Cantelli by, e.g., Lemma of Van der Vaart and Wellner (996). This concludes the proof for appropriately measurable classes F j, j =;::: ;k. We extend the theorem to general Glivenko-Cantelli classes using separable versions as in Talagrand (987a). (Also see van der Vaart and Wellner

4 4 Glivenko-Cantelli preservation theorems (996), pages 5-20 for a discussion.) As shown in the preceding argument, it is not a loss of generality to assume that the classes F i are uniformly bounded. Furthermore, it suffices to show that '(F ;::: ;F k )is weak Glivenko-Cantelli. We first need a lemma. Lemma. Any strong P -Glivenko Cantelli class F is totally bounded in L (P ) if and only if kp k F <. Furthermore for any r 2 (; ), if F has an envelope that is contained in L r (P ), then F is also totally bounded in L r (P ). Proof. A class that is totally bounded is also bounded. Thus for the first statement we only need to prove that a strong Glivenko-Cantelli class F with kp k F < is totally bounded in L (P ). It is well-known that such a class has an integrable envelope. E.g. see Giné and inn (983) or Problem 2.4. of van der Vaart and Wellner (996) to conclude first that P Λ kf Pfk F <. Next the claim follows from the triangle inequality kfk F»kf Pfk F +kpk F.Thus it is no loss of generality to assume that the class F possesses an envelope that is finite everywhere. If F is not totally bounded in L (P ), then there is " > 0 and G ρ F countable such that the packing number D("; G;L (P)) =. This is obvious, since if F is not totally bounded, then D("; F;L (P)) = for some ">0; then take a countable set of " separated points. So, now G has all the measurablity properties, is GC and L (P )-bounded, and moreover, D("; G;L )=. But this is a contradiction because, as shown in Giné and inn (984), first half of page 984, D("; G;L )must be finite for all ">0. This completes the proof of the first part of the lemma. If F has an envelope in L r (P ), then F is totally bounded in L r (P ) if the class F M of functions fff» Mg is totally bounded in L r (P ) for every fixed M. The class F M is P -GC by Theorem 3 and hence this class is totally bounded in L (P ). But then it is also totally bounded in L r (P ), because P jfj r» P jfjm r for any f that is bounded by M and we can construct the ffl-net over F M in L (P ) to consist of functions that are bounded by M. 2 Because a Glivenko-Cantelli class F with kp k F < is totally bounded in L (P )by Lemma, it is separable as a subset of L (P ). A minor generalization of Theorem in van der Vaart and Wellner (996) shows that there exists a bijection f $ ~ f of F onto a class ~ FρL (P) such that ffl f = ~ f P almost surely for every f 2F. ffl there exists a countable subset G ρ ~ F such that for every n there exists a measurable set N n ρ X n with P n (N n ) = 0 such that for all (x ;::: ;x n ) =2 N n and f 2 ~ F there exists fgm g ρ G such that P jg m ~ fj!0 and (g m (x );::: ;g m (x n ))! ( ~ f(x );::: ; ~ f(x n )).

5 van der Vaart and Wellner 5 By an adaptation of a theorem due to Talagrand (987a) (see Theorem in van der Vaart and Wellner (996)) a class F is weak Glivenko- Cantelli if and only if the class ~ F is weak Glivenko-Cantelli and sup Pnjf fj!0 ~ f2f in outer probability. Construct a pointwise separable version" ~ Fi for each of the classes F i. The classes ~ Fi possess enough measurability tomake the preceding argument work; in particular pointwise separable version" in the above sense is sufficient for the nearly linearly supremum measurable hypothesis of Giné and inn (984) for both F ;::: ;F k and '(F ;::: ;F k ). Thus the class '( ~ F ;::: ; ~ Fk ) is Glivenko-Cantelli for P. Now by Lemma 2 there exists for every ffl>0affi>0 such that Pnjf j ~ fj j < ffi k ; j =;::: ;k; implies Pnj'(f ;::: ;f k ) '( f ~ ;::: ; f ~ k )j<ffl: The theorem follows. 2 Lemma 2. Suppose that ' : K! R is continuous and K ρ R k is compact. Then for every ffl > 0 there exists ffi > 0 such that for all n and for all a ;::: ;a n ;b ;::: ;b n 2K ρr k implies n n nx i= nx i= ka i b i k <ffi j'(a i ) '(b i )j <ffl: Here k kcan be any norm on R k ; in particular it can be kxk r = ψ k X i= jx i j r! =r r 2 [; ) orkxk max»i»k jx i j for x =(x ;::: ;x k )2 R k. Proof. Let U n be uniform on f;::: ;ng, and set X n = a Un, Y n = b Un. Then we can write and n nx i= n nx i= ka i b i k = EkX n Y n k j'(a i ) '(b i )j = Ej'(X n ) '(Y n )j : ;

6 6 Glivenko-Cantelli preservation theorems Hence it suffices to show that for every ffl>0 there exists ffi>0 such that for all (X; Y ) random vectors in K ρ R k, EkX Y k <ffi implies Ej'(X) '(Y )j <ffl: Suppose not. Then for some ffl > 0 and for all m = ; 2;::: there exists (X m ;Y m ) such that EkX m Y m k < m ; Ej'(X m) '(Y m )j ffl: But since f(x m ;Y m )g is tight, there exists (X m 0;Y m 0)! d (X; Y ). Then it follows that EkX Y k = lim EkX m 0 m! 0 Y m0k =0 so that X = Y a.s., while on the other hand 0=Ej'(X) '(Y)j= lim Ej'(X m 0 m 0) '(Y m0)j ffl>0:! This contradiction means that the desired implication holds. 2 Another potentially useful preservation theorem is one based on building up Glivenko-Cantelli classes from the restrictions of a class of functions to elements of a partition of the sample space. The following theorem is related to the results of Van der Vaart (996) for Donsker classes. Theorem 4. Suppose that F is a class of functions on (X ; A;P), and fx i g is a partition of X : [ X i= i = X, X i X j = ; for i 6= j. Suppose that F j ff Xj : f 2 Fg is P Glivenko-Cantelli for each j, and F has an integrable envelope function F. Then F is itself P Glivenko-Cantelli. Proof. Since it follows that f = f X E Λ kpn P k F» Xj = X j= j= X j= f Xj ; E Λ kpn P k Fj! 0 by the dominated convergence theorem since each term in the sum converges to zero by the hypothesis that each F j is P Glivenko-Cantelli, and we have E Λ kpn Pk Fj»E Λ Pn(F Xj )+P(F Xj )»2P(F Xj ) where P j= P (F X j )=P(F)<. 2

7 van der Vaart and Wellner 7 3. Preservation of the Uniform Glivenko - Cantelli Property. We say that F is a strong uniform Glivenko-Cantelli class if for all ffl>0 sup PrP Λ sup kpm P k F >ffl!0 as n! P 2P(X ;A) m n where P(X ; A) is the set of all probability measures on (X ; A). For x = (x ;::: ;x n ) 2 X n, n = ;2;:::, and r 2 (0; ), we define on F the pseudo-distances ( ) X n r ^ e x;r (f;g) = n jf(x i ) g(x i )j r ; i= e x; (f;g) = max jf(x i) g(x i )j; f; g 2F:»i»n Let N(ffl; F;e x;r ) denote the ffl covering numberof(f;e x;r ), ffl>0. Then define, for n =;2;:::, ffl>0, and r 2 (0; ], the quantities N n;r (ffl; F) = sup x2x n N(ffl; F;e x;r ) : Theorem 5. (Dudley, Giné, and inn (99)). Suppose that F is a class of uniformly bounded functions such that F is image admissible Suslin. Then the following are equivalent: (a) F is a strong uniform Glivenko-Cantelli class. (b) log N n;r (ffl; F)! 0 for all ffl>0 n for some (all) r 2 (0; ]. For the definition of the image admissible Suslin property see Dudley (984), sections 0.3 and.. The following theorem gives natural sufficient conditions for preservation of the uniform Glivenko-Cantelli theorem. Theorem 6. Suppose that F ;::: ;F k are classes of uniformly bounded functions on (X ; A) such that F ;::: ;F k are image-admissible Suslin and strong uniform Glivenko-Cantelli classes. Suppose that ' : R k! R is continuous. Then H '(F ;::: ;F k ) is a strong uniform Glivenko-Cantelli class. Proof. It follows from Lemma 2 that for any ffl>0 there exists ffi>0 such that for any f j ;g j 2F j,j=;::: ;k, x2x n, e x; (f j ;g j )» ffi ; j =;::: ;k k implies that e x; ('(f ;::: ;f k );'(g ;::: ;g k ))» ffl:

8 8 Glivenko-Cantelli preservation theorems It follows that and hence that N n; (ffl; H)» n log N n;(ffl; H)» ky j= kx j= N n; ( ffi k ; F j) ; n log N n;( ffi k ; F j)! 0 where the convergence follows from part (b) of Theorem 5 with r =.If weshow that H = '(F) is image admissible Suslin, then the conclusion follows from (b) implies (a) in Theorem 5. We give a proof of a slightly stronger statement in the following lemma. 2 Lemma 3. If F ;::: ;F k are image admissible Suslin via (Y i ; S i ;T i ), i = ;::: ;k, and ffi : R k! R is measurable, then '(F ;::: ;F k ) is image admissible Suslin via ( Q k i= Y i; Q k i= S i;t) for T (y ;::: ;y k )='(T y ;::: ;T k y k ): Proof. There exist Polish spaces i and measurable maps g i : i! Y i which are onto. Then g : k! Y Y k defined by g(z ;::: ;z k )=(g (z );::: ;g k (z k )) is onto and measurable for S S k. So ( Q i Y i; Q i S i) is Suslin. It suffices to check that T is onto and ψ defined by the map ψ(x; y ;::: ;y k )='(T y (x);::: ;T k y k (x)) is measurable. Obviously T is onto, and ψ is measurable because each map (x; y i ) 7! T i y i (x) is measurable on X Y i and hence on X Y Y k, and hence (x; y ;::: ;y k )7! (T y (x);::: ;T k y k (x)) is measurable from X Yto R k Nonparametric Maximum Likelihood Estimation: a general result. Now we prove a general result for nonparametric maximum likelihood estimation in a class of densities. The main proposition in this section is related to results of Pfanzagl (988) and Van de Geer (993), (996). Suppose that P is a class of densities with respect to a fixed ff finite measure μ on a measurable space (X ; A). Suppose that X ;::: ;X n are i.i.d. P 0 with density p 0 2P. Let bp n argmax Pn log p:

9 van der Vaart and Wellner 9 For 0 <ff», let ' ff (t) =(t ff )=(t ff + ) for t 0, '(t) = for t<0. Then ' ff is bounded and continuous for each ff 2 (0; ]. For 0 < fi < define h 2 fi(p; q) p fi q fi dμ : Note that h =2 (p; q) h(p; q) is the Hellinger distance between p and q, and by Hölder's inequality, h fi (p; q) 0 with equality if and only if p = q a.e. μ. Proposition 3. Suppose that P is convex. Then h 2 (bp ff=2 n;p 0 )»(Pn P 0 ) ' ff ( bp n ) p 0 In particular, when ff =wehave, with ' ', h 2 (bp n ;p 0 ) h 2 =2 (bp n;p 0 )»(Pn P 0 ) : '( bp n ) : p 0 Corollary. Suppose that f'(p=p 0 ): p2pgis a P 0 Glivenko-Cantelli class. Then for each 0<ff», h ff=2 (bp n ;p 0 )! a:s: 0. Proof of Proposition 3. It follows from convexity of Pand convexity of t!' ff (=t) that () 0» Pn' ff (bp n =p 0 )=(Pn P 0 )' ff (bp n =p 0 )+P 0 ' ff (bp n =p 0 ); see van der Vaart and Wellner (996) page 330, and Pfanzagl (988), pages Now we show that P 0 ' ff (p=p 0 )= p ff p ff 0 (2) p ff +p ff dp 0» p fi 0 p fi dμ 0 for fi = ff=2. Note that this holds if and only if or But his holds if +2 p ff p ff 0 +pffp 0dμ» + p fi 0 p fi dμ 2 With fi = ff=2, this becomes p fi 0 p fi dμ ; p ff p ff 0 + pff p 0dμ : p fi 0 p fi 2 pff p 0 p ff 0 + pff : 2 (pff 0 + p ff ) p ff=2 0 p ff=2 = p p ff 0 pff ; and this holds by the arithmetic mean - geometric mean inequality. Thus (2) holds. Combining (2) with () yields the claim of the proposition. 2

10 0 Glivenko-Cantelli preservation theorems 5. Example: a result of Schick and Yu. Our goal in this section is to give another proof of the consistency result of Schick and Yu (999) for the Non-Parametric Maximum Likelihood Estimator (NPMLE) b Fn for mixed case" interval censored data. Our proof is based on the inequality of the preceding section, and is similar in spirit to results of Van de Geer (993), (996). Suppose that Y is a random variable taking values in R + = [0; ) with distribution function F 2 F = fall df's F on R + g. Unfortunately we are not able to observe Y itself. What we do observe is a vector of times T K =(T K; ;::: ;T K;K ) where K, the number of times, is itself random, and the interval (T K;j ;T K;j ] into which Y falls (with T K;0 0, T K;K+ ). More formally, we assume that K is an integer-valued random variable, and T = ft k;j ;j =;::: ;k;k =;2;:::g, is a triangular array of potential observation times", and that Y and (K; T) are independent. Let X = ( K;T K ;K), with a possible value x = (ffi k ;t k ;k), where k = ( k; ;::: ; k;k ) with k;j = (Tk;j ;T k;j ](Y ), j = ; 2;::: ;k +, and T k is the kth row of the triangular array T. Suppose we observe n i.i.d. copies of X; X ;X 2 ;::: ;X n, where X i = ( (i) ;T (i) ;K (i) ), i = K (i) K (i) ; 2;::: ;n. Here (Y (i) ;T (i) ;K (i) ), i = ; 2;::: are the underlying i.i.d. copies of (Y;T;K). We first note that conditionally on K and T K, the vector K has a multinomial distribution: where ( KjK; T K ) ο Multinomial K+ (; F K ) F K (F (T K; );F(T K;2 ) F (T K; );::: ; F(T K;K )) : Suppose for the moment that the distribution G k of (T K jk = k) has density g k and p k P (K = k). Then a density ofxis given by (3) p F (x) p F (ffi k ;t k ;k)= k+ Y j= where t k;0 0, t k;k+. In general, (4) p F (x) p F (ffi k ;t k ;k) = = (F (t k;j ) F (t k;j )) ffi k;j g k (t k )p k k+ Y j= k+ X j= (F (t k;j ) F (t k;j )) ffi k;j ffi k;j (F (t k;j ) F (t k;j )) is a density of X with respect to the dominating measure ν where ν is determined by the joint distribution of (K; T), and it is this version of the

11 van der Vaart and Wellner density of X with which we will work throughout the rest of the paper. Thus the log-likelihood function for F of X ;::: ;X n is given by where n l n(f jx) = n nx i= = Pnm F m F (X) = X K (i) + j= K+ X j= K+ X j= (i) K;j log F (T (i) (i) ) F (T K (i) ;j K ) (i) ;j K;j log (F (T K;j ) F (T K;j )) K;j log ( F K;j ) and where we have ignored the terms not involving F.We also note that, with P 0 P F0, 0 P 0 m F (X) =P K+ X F 0;K;j log ( F K;j ) A : j= The Nonparametric Maximum Likelihood Estimator (NPMLE) b Fn is the distribution function b Fn (t) which puts all its mass at the observed time points and maximizes the log-likelihood l n (F jx). It can be calculated via the iterative convex minorant algorithm proposed in Groeneboom and Wellner (992) for case 2 interval censored data. By Proposition 3 with ff = and ' ' as before, it follows that h 2 (pf bn ;p F0 )»(Pn P 0 ) '(pf bn =p F0 ) where ' is bounded and continuous from R to R. Now the collection of functions G fp F : F2Fg is easily seen to be a Glivenko-Cantelli class of functions: this can be seen by first applying Theorem 4 to the collections G k, k = ; 2;::: obtained from G by restricting to the sets K = k. Then for fixed k, the collections G k = fp F (ffi;t k ;k): F 2Fgare P 0 Glivenko-Cantelli classes since F is a uniform Glivenko-Cantelli class, and since the functions p F are continuous transformations of the classes of functions x! ffi k;j and x! F (t k;j ) for j =;::: ;k+, and hence G is P Glivenko-Cantelli by Theorem 3. Note that the single function p F0 is trivially P 0 Glivenko-Cantelli since it is uniformly bounded, and the single function (=p F0 ) is also P 0 GC since P 0 (=p F0 ) <. Thus by Proposition 2 with g = (=p F0 ) and F = G = fp F : F 2Fg,itfollows that G 0 fp F =p F0 : F 2Fgis P 0 Glivenko- Cantelli. Finally another application of Theorem 3 shows that the collection H fl (p F =p F0 ): F 2Fg=f'(p F =p F0 ): F 2Fg

12 2 Glivenko-Cantelli preservation theorems is also P 0 -Glivenko-Cantelli. When combined with Proposition 3, this yields the following theorem: Theorem 7. The NPMLE b Fn satisfies h(p b F n ;p F0 )! a:s: 0 : To relate this result to a recent theorem of Schick and Yu (999), it remains only to understand the relationship between their L (μ) and the Hellinger metric h between p F and p F0. Let B denote the collection of Borel sets in R. OnBwe define measures μ and eμ as follows: For B 2B, (5) and (6) μ(b)= eμ(b) = X k= X k= P (K = k) P (K = k) k kx j= kx j= P (T k;j 2 BjK = k) ; P (T k;j 2 BjK = k) : Let d be the L (μ) metric on the class F; thus for F ;F 2 2F, d(f ;F 2 )= jf (t) F 2 (t)jdμ(t) : The measure μ was introduced by Schick and Yu (999); note that μ is a finite measure if E(K) <. Note that d(f ;F 2 ) can also be written in terms of an expectation as: 2 X d(f ;F 2 )=E (K;T ) 4 K fi fi fi 3 (7) (F (T K;j ) F 2 (T K;j ) fi5 : j= As Schick and Yu (999) observed, consistency of the NPMLE b Fn in L (μ) holds under virtually no further hypotheses. Theorem 8. (Schick and Yu). Suppose that E(K) <. Then d( b Fn ;F 0 )! a:s: 0. Proof. We will show that Theorem 8 follows from Theorem 7 and the following Lemma. Lemma 4. ρ 2 Proof. We know that j b Fn F 0 jd~μff 2» h 2 (p b F n ;p F 0 ): h 2 (p b F n ;p F0 )»d TV (p b F n ;p F0 )» p 2h(p b F n ;p F0 )

13 where, with y k;0 =, y k;k+ =, while Note that h 2 (p b F n ;p F0 )= X k= d TV (p b F n ;p F0 )= j= P (K = k) X k= van der Vaart and Wellner 3 k+ X f[ Fn b (y k;j ) Fn b (y k;j )] =2 j= [F 0 (y k;j ) F 0 (y k;j )] =2 g 2 dg k (y) P (K = k) k+ X j[ Fn b (y k;j ) Fn b (y k;j )] j= [F 0 (y k;j ) F 0 (y k;j )]jdg k (y) : k+ X j[ Fn b (y k;j ) Fn b (y k;j )] [F 0 (y k;j ) F 0 (y k;j )]j = k+ X j= j( b Fn F 0 )(y k;j ;y k;j ]j max j Fn b (y k;j ) F 0 (y k;j )j ;»j»k+ so integrating across this inequality with respect to G k (y) yields k+ X j[ Fn b (y k;j ) Fn b (y k;j )] [F 0 (y k;j ) F 0 (y k;j )]j dg k (y) j= max»j»k k kx j= j b Fn (y k;j ) F 0 (y k;j )j dg k;j (y k;j ) j b Fn (y k;j ) F 0 (y k;j )j dg k;j (y k;j ) : By multiplying across by P (K = k) and summing over k, this yields and hence (8) d TV (p b F n ;p F0 ) h 2 (p b F n ;p F0 ) 2 ρ jb Fn F 0 jd~μ; jb Fn F 0 jd~μff 2 : The measure ~μ figuring in Lemma 4 is not the same as the measure μ of Schick and Yu (999) because of the factor =k. Note that this factor means that the measure ~μ is always a finite measure, even if E(K) =. It is clear that ~μ(b)» μ(b) 2

14 4 Glivenko-Cantelli preservation theorems for every Borel set B, and that μ ffiffi ~μ. The following lemma (Lemma 2.2 of Schick and Yu (999)) together with Lemma 4 shows that Theorem 7 implies the result of Schick and Yu once again: Lemma 5. Suppose that μ and ~μ are two finite measures, and that g; g, g 2 ;::: are measurable functions with range in [0; ]. Suppose that μ is absolutely continuous with respect to ~μ. Then R jg n gjd~μ! 0 implies that R jg n gjdμ! 0. Proof. Write jg n gjdμ = jg n gj dμ d~μ d~μ and use the dominated convergence theorem applied to a.e. convergent subsequences Example: generalizing the result of Schick and Yu. Our goal in this section is to give a generalization of the consistency result of Schick and Yu (999). Suppose that Y is a random variable taking values in Y. Suppose that Y has distribution Q on the measurable space (Y; B). Unfortunately we are not able to observe Y itself. What we do observe is a vector of random sets C K = (C K; ;::: ;C K;K ) where K, the number of sets is itself random, and the set fc K;j g K form a partition of Y: j= C K;j C K;j 0 = ; if j 6= j 0 and [ K j= C K;j = Y. More formally, we assume that K is an integer-valued random variable, and C = fc k;j ;j =;::: ;k;k =;2;:::g, is a triangular array of random sets", and that Y and (K; C) are independent. Let X =( K;C K ;K), with a possible value x =(ffi k ;c k ;k), where k =( k; ;::: ; k;k ) with k;j = Ck;j (Y ), j =;2;::: ;k, and C k is the k th row of the triangular array C. Suppose we observe n i.i.d. copies of X; X ;X 2 ;::: ;X n, where X =( (i) i ;C (i) ;K K (i) K (i) ), i =;2;::: ;n. Here (i) (Y (i) ;C (i) ;K (i) ), i =;2;::: are the underlying i.i.d. copies of (Y;C;K). We first note that conditionally on K and C K, the vector K has a multinomial distribution: ( KjK; C K ) ο Multinomial K (;Q K ) where Q K (Q(C K; );Q(C K;2 );::: ;Q(C K;K )) :

15 van der Vaart and Wellner 5 Suppose for the moment that the distribution G k of (C K jk = k) has density g k and p k P (K = k). Then a density ofxis given by (9) In general, (0) p Q (x) p Q (ffi k ;c k ;k)= p Q (x) p Q (ffi k ;c k ;k)= ky j= ky j= Q(c k;j ) ffi k;j g k (c k )p k : Q(c k;j ) ffi k;j = kx j= ffi k;j Q(c k;j ) is a density of X with respect to the dominating measure ν where ν is determined by the joint distribution of (K; C), and it is this version of the density of X with which we will work throughout the rest of the paper. Thus the log-likelihood function for Q of X ;::: ;X n is given by where n l n(qjx) = n m Q (X) = nx K X (i) i= j= KX j= (i) K;j log Q(C(i) K (i) ;j )=P nm Q K;j log Q(C K;j ) and where we have ignored the terms not involving Q. We also note that, with P 0 P Q0, 0 X P 0 m Q (X) =P K Q 0 (C K;j ) log Q(C K;j ) A : j= The Nonparametric Maximum Likelihood Estimator (NPMLE) b Qn is a probability measure b Qn which maximizes the log-likelihood l n (QjX). Here we will bypass the many interesting existence, characterization, and computational issues connected with the NPMLE b Qn, and focus instead on the issue of consistency once we have the NPMLE in hand. By Proposition 3 with ff = it follows that h 2 (p b Q n ;p Q0 )»(Pn P 0 ) '(p b Q n =p Q0 ) where ' is bounded and continuous. Now the collection of functions G fp Q : Q2Qg where Q is the collection of all probability distributions on (Y; B), is easily seen to be a Glivenko-Cantelli class of functions: this can be seen by first applying Theorem 4 to the collections G k, k =;2;::: obtained from G by restricting to the sets K = k. The next step is to show that the collections G k fp Q ( ; ;k): Q2Qg :

16 6 Glivenko-Cantelli preservation theorems are P 0 Glivenko-Cantelli for each k. To this end, suppose that (Y; B) bea measurable space and let C be a universal GC - class of sets in Y (i.e. Q GC for every probability measure Q on (Y; B)). Let (T;T ;G) be a probability space, and let t 7! C t be a map with values in C. Lemma 6. Suppose that the collection fft : y 2 C t g : y 2Ygis G GC. Then the collection of functions t 7! Q(C t ) where Q ranges over all probability measures Q on Y is G GC. Remark: By Assouad (983), Proposition 2.2, page 246, together with Corollary.0, page 24, it follows that fft 2 T : y 2 C t g : y 2Ygis a VC-class of subsets of T if and only if fc t : t 2 T g is a VC-class of subsets of Y.Thus if we assume that all the subsets C = C t arising in the partitions are elements of a VC-class C, then, subject to being suitably measurable, the hypothesis of Lemma 6 is satisfied (and the class in question is even universal Glivenko-Cantelli). Proof. For Q the Dirac measure at y (unit mass at y), the function t 7! Q(C t ) becomes t 7! Ct (y) : By assumption, the set of all such functions, with y ranging over Y, is G GC. Then so is the set of all functions t 7! m mx i= Ct (y i ); y ;::: ;y m 2Y;m2 N: Let Y ;::: ;Y m be an i.i.d. sample from a given Q. Because C is Q GC we have fi sup fi mx fi fy i 2 Cg P(C) fi!0 a.s. Λ : m C2C i= Then there exists a sequence y ;y 2 ;::: in Y such that and consequently sup fi m C2C fi fi supfi t2t m mx fi fy i 2 Cg P(C) fi!0; i= mx fi fy i 2 C t g P(C t ) fi!0: i= It follows that the maps t! Q(C t ) are the (uniform) limits of sequences of functions from a G GC class. Hence they are G GC. 2 Note on measurability: It is assumed implicitly that the sets ft : y 2 C t g are measurable in T for every y 2 Y. The proof shows that the maps t 7! P (C t ) are then also measurable.

17 van der Vaart and Wellner 7 It follows that for fixed k, the collections G k = fp Q (ffi;c k ;k) : Q 2 Qg are P 0 -Glivenko-Cantelli, and since the functions p Q are continuous transformations of the classes of functions x! ffi k;j and x! Q(c k;j ) for j = ;::: ;k, and hence G is P Glivenko-Cantelli by Theorem 3. Note that the single function p Q0 is trivially P 0 Glivenko-Cantelli since it is uniformly bounded. Now another application of Theorem 3 shows that the collection H fl (p Q =p Q0 ): Q2Qg=f'(p Q =p Q0 ): Q2Qg is also P 0 -Glivenko-Cantelli. When combined with Proposition 3, this yields the following theorem. Theorem 9. If all C K;j 2 C, a VC collection of subsets of X, then the NPMLE b Qn satisfies h(p b Q n ;p Q0 )! a:s: 0 : To obtain a statement analogous to the theorem of Schick and Yu (999), let ± denote a sigma algebra for the space C of subsets: we are assuming that P ( K j= [C K;j 2C]) =. Thus (C; ±) is a measurable space. On ± we define the measure μ as follows: () Note that (2) μ(b) = X k= d TV (p b Q n ;p Q0 ) = = P (K = k) X k= kx j= P (K = k) P (C k;j 2 BjK = k) ; B 2 ±: kx j= j b Qn (c k;j ) Q 0 (c k;j )jdg k (c) j b Qn (c) Q 0 (c)j dμ(c) d( b Qn ;Q 0 ): Here is a generalization of the theorem of Schick and Yu (999). Theorem 0. The NPMLE b Qn satisfies d( b Qn ;Q 0 )! a:s: 0. Proof. Theorem 0 follows from Theorem 9, (2) and the observation that we then have d( b Qn ;Q 0 )=d TV (p b Q n ;p Q0 )» p 2h(p b Q n ;p Q0 ): Example. (Mixed case interval censoring in R 2 ). Suppose that Y ο Q 0 takes values in R +2, and the partitions fc K;j g K j= are obtained as the natural rectangles formed from two increasing collections 0» S K;» :::» S K;K and 0» T K2;» :::» T K2;K on the two coordinate axes. Then K = (K + )(K 2 + ), and all the elements of the partitions are 2

18 8 Glivenko-Cantelli preservation theorems elements of the VC-class of rectangles in R 2. It follows that the NPMLE bq n of Q 0 satisfies s;t2r +2 j b Qn (s; t] Q 0 (s; t]jdμ(s; t)! 0 a.s. Λ : Acknowledgments: We owe thanks to Evarist Giné and Joel inn for suggesting the proof given for Lemma. References Assouad, P. (983). Densité et dimension. Annales de l'institut Fourier 33, Dudley, R. M. (984). A Course on Empirical Processes. Ecole d'été de probabilités de St.-Flour, 982. Lecture Notes in Mathematics 097, -42, Springer, Berlin. Dudley, R. M. (998a). Consistency of M-estimators and one-sided bracketing." In Proceedings of the First International Conference on High- Dimensional Probability, E. Eberlein, M. Hahn, and M. Talagrand, editors. Birkhäuser, Basel. Dudley, R. M. (998b). Personal communication. Dudley, R. M., Giné, E., and inn, J. (99). Uniform and universal Glivenko-Cantelli classes. J. Theoretical Probab. 4, Giné, E., and inn, J. (984). Some limit theorems for empirical processes. Ann. Probability 2, Groeneboom, P. and Wellner, J. A. (992). Information Bounds and Nonparametric Maximum Likelihood Estimation. DMV Seminar Band 9, Birkhäuser, Basel. Pfanzagl, J. (988). Consistency of maximum likelihood estimators for certain nonparametric families, in particular: mixtures." J. Statist. Planning and Inference 9, Schick, A. and Yu, Q. (999), Consistency of the GMLE with Mixed Case Interval-Censored Data," Scand. J. Statist., to appear. Talagrand, M. (987a). Measurability problems for empirical measures." Ann. Probability 5, Talagrand, M. (987b). The Glivenko-Cantelli problem." Ann. Probability 5, Talagrand, M. (996). The Glivenko-Cantelli problem, ten years later." J. Theor. Prob. 9,

19 van der Vaart and Wellner 9 Van de Geer, S. (993). Hellinger consistency of certain nonparametric maximum likelihood estimators." Ann. Statist. 2, Van de Geer, S. (996). Rates of convergence for the maximum likelihood estimator in mixture models." J. Nonparametric Statist. 6, Van der Vaart, A. W. (996). New Donsker classes. Ann. Probability 24, Van der Vaart, A. W., and Wellner, J. A. (996). Weak Convergence and Empirical Processes with Applications to Statistics, Springer-Verlag, New York. Department of Mathematics Free University De Boelelaan 08a 08 HV Amsterdam The Netherlands aad@cs.vu.nl Department of Statistics University of Washington P.O. Box Seattle, Washington U.S.A. jaw@stat.washington.edu

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes This is page 5 Printer: Opaque this Aad van der Vaart and Jon A. Wellner ABSTRACT We show that the P Glivenko property

More information

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes Aad van der Vaart and Jon A. Wellner Free University and University of Washington ABSTRACT We show that the P Glivenko

More information

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS SOME CONVERSE LIMIT THEOREMS OR EXCHANGEABLE BOOTSTRAPS Jon A. Wellner University of Washington The bootstrap Glivenko-Cantelli and bootstrap Donsker theorems of Giné and Zinn (990) contain both necessary

More information

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation Statistics 62: L p spaces, metrics on spaces of probabilites, and connections to estimation Moulinath Banerjee December 6, 2006 L p spaces and Hilbert spaces We first formally define L p spaces. Consider

More information

(Y; I[X Y ]), where I[A] is the indicator function of the set A. Examples of the current status data are mentioned in Ayer et al. (1955), Keiding (199

(Y; I[X Y ]), where I[A] is the indicator function of the set A. Examples of the current status data are mentioned in Ayer et al. (1955), Keiding (199 CONSISTENCY OF THE GMLE WITH MIXED CASE INTERVAL-CENSORED DATA By Anton Schick and Qiqing Yu Binghamton University April 1997. Revised December 1997, Revised July 1998 Abstract. In this paper we consider

More information

MATHS 730 FC Lecture Notes March 5, Introduction

MATHS 730 FC Lecture Notes March 5, Introduction 1 INTRODUCTION MATHS 730 FC Lecture Notes March 5, 2014 1 Introduction Definition. If A, B are sets and there exists a bijection A B, they have the same cardinality, which we write as A, #A. If there exists

More information

3 Integration and Expectation

3 Integration and Expectation 3 Integration and Expectation 3.1 Construction of the Lebesgue Integral Let (, F, µ) be a measure space (not necessarily a probability space). Our objective will be to define the Lebesgue integral R fdµ

More information

Empirical Processes: General Weak Convergence Theory

Empirical Processes: General Weak Convergence Theory Empirical Processes: General Weak Convergence Theory Moulinath Banerjee May 18, 2010 1 Extended Weak Convergence The lack of measurability of the empirical process with respect to the sigma-field generated

More information

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9 MAT 570 REAL ANALYSIS LECTURE NOTES PROFESSOR: JOHN QUIGG SEMESTER: FALL 204 Contents. Sets 2 2. Functions 5 3. Countability 7 4. Axiom of choice 8 5. Equivalence relations 9 6. Real numbers 9 7. Extended

More information

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time ASYMPTOTIC PROPERTIES OF THE GMLE WITH CASE 2 INTERVAL-CENSORED DATA By Qiqing Yu a;1 Anton Schick a, Linxiong Li b;2 and George Y. C. Wong c;3 a Dept. of Mathematical Sciences, Binghamton University,

More information

Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001

Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001 Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001 Some Uniform Strong Laws of Large Numbers Suppose that: A. X, X 1,...,X n are i.i.d. P on the measurable

More information

MATH MEASURE THEORY AND FOURIER ANALYSIS. Contents

MATH MEASURE THEORY AND FOURIER ANALYSIS. Contents MATH 3969 - MEASURE THEORY AND FOURIER ANALYSIS ANDREW TULLOCH Contents 1. Measure Theory 2 1.1. Properties of Measures 3 1.2. Constructing σ-algebras and measures 3 1.3. Properties of the Lebesgue measure

More information

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define HILBERT SPACES AND THE RADON-NIKODYM THEOREM STEVEN P. LALLEY 1. DEFINITIONS Definition 1. A real inner product space is a real vector space V together with a symmetric, bilinear, positive-definite mapping,

More information

Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon Measures

Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon Measures 2 1 Borel Regular Measures We now state and prove an important regularity property of Borel regular outer measures: Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon

More information

Real Analysis Qualifying Exam May 14th 2016

Real Analysis Qualifying Exam May 14th 2016 Real Analysis Qualifying Exam May 4th 26 Solve 8 out of 2 problems. () Prove the Banach contraction principle: Let T be a mapping from a complete metric space X into itself such that d(tx,ty) apple qd(x,

More information

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure? MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due ). Show that the open disk x 2 + y 2 < 1 is a countable union of planar elementary sets. Show that the closed disk x 2 + y 2 1 is a countable

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Defining the Integral

Defining the Integral Defining the Integral In these notes we provide a careful definition of the Lebesgue integral and we prove each of the three main convergence theorems. For the duration of these notes, let (, M, µ) be

More information

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond Measure Theory on Topological Spaces Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond May 22, 2011 Contents 1 Introduction 2 1.1 The Riemann Integral........................................ 2 1.2 Measurable..............................................

More information

STATISTICAL INFERENCE UNDER ORDER RESTRICTIONS LIMIT THEORY FOR THE GRENANDER ESTIMATOR UNDER ALTERNATIVE HYPOTHESES

STATISTICAL INFERENCE UNDER ORDER RESTRICTIONS LIMIT THEORY FOR THE GRENANDER ESTIMATOR UNDER ALTERNATIVE HYPOTHESES STATISTICAL INFERENCE UNDER ORDER RESTRICTIONS LIMIT THEORY FOR THE GRENANDER ESTIMATOR UNDER ALTERNATIVE HYPOTHESES By Jon A. Wellner University of Washington 1. Limit theory for the Grenander estimator.

More information

Real Analysis Problems

Real Analysis Problems Real Analysis Problems Cristian E. Gutiérrez September 14, 29 1 1 CONTINUITY 1 Continuity Problem 1.1 Let r n be the sequence of rational numbers and Prove that f(x) = 1. f is continuous on the irrationals.

More information

Chapter 8. General Countably Additive Set Functions. 8.1 Hahn Decomposition Theorem

Chapter 8. General Countably Additive Set Functions. 8.1 Hahn Decomposition Theorem Chapter 8 General Countably dditive Set Functions In Theorem 5.2.2 the reader saw that if f : X R is integrable on the measure space (X,, µ) then we can define a countably additive set function ν on by

More information

Weak convergence. Amsterdam, 13 November Leiden University. Limit theorems. Shota Gugushvili. Generalities. Criteria

Weak convergence. Amsterdam, 13 November Leiden University. Limit theorems. Shota Gugushvili. Generalities. Criteria Weak Leiden University Amsterdam, 13 November 2013 Outline 1 2 3 4 5 6 7 Definition Definition Let µ, µ 1, µ 2,... be probability measures on (R, B). It is said that µ n converges weakly to µ, and we then

More information

Perfect Measures and Maps

Perfect Measures and Maps Preprint Ser. No. 26, 1991, Math. Inst. Aarhus Perfect Measures and Maps GOAN PESKI This paper is designed to provide a solid introduction to the theory of nonmeasurable calculus. In this context basic

More information

A Law of the Iterated Logarithm. for Grenander s Estimator

A Law of the Iterated Logarithm. for Grenander s Estimator A Law of the Iterated Logarithm for Grenander s Estimator Jon A. Wellner University of Washington, Seattle Probability Seminar October 24, 2016 Based on joint work with: Lutz Dümbgen and Malcolm Wolff

More information

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi Real Analysis Math 3AH Rudin, Chapter # Dominique Abdi.. If r is rational (r 0) and x is irrational, prove that r + x and rx are irrational. Solution. Assume the contrary, that r+x and rx are rational.

More information

Maximum likelihood: counterexamples, examples, and open problems. Jon A. Wellner. University of Washington. Maximum likelihood: p.

Maximum likelihood: counterexamples, examples, and open problems. Jon A. Wellner. University of Washington. Maximum likelihood: p. Maximum likelihood: counterexamples, examples, and open problems Jon A. Wellner University of Washington Maximum likelihood: p. 1/7 Talk at University of Idaho, Department of Mathematics, September 15,

More information

Maximum likelihood: counterexamples, examples, and open problems

Maximum likelihood: counterexamples, examples, and open problems Maximum likelihood: counterexamples, examples, and open problems Jon A. Wellner University of Washington visiting Vrije Universiteit, Amsterdam Talk at BeNeLuxFra Mathematics Meeting 21 May, 2005 Email:

More information

STAT 7032 Probability Spring Wlodek Bryc

STAT 7032 Probability Spring Wlodek Bryc STAT 7032 Probability Spring 2018 Wlodek Bryc Created: Friday, Jan 2, 2014 Revised for Spring 2018 Printed: January 9, 2018 File: Grad-Prob-2018.TEX Department of Mathematical Sciences, University of Cincinnati,

More information

Analysis Comprehensive Exam Questions Fall 2008

Analysis Comprehensive Exam Questions Fall 2008 Analysis Comprehensive xam Questions Fall 28. (a) Let R be measurable with finite Lebesgue measure. Suppose that {f n } n N is a bounded sequence in L 2 () and there exists a function f such that f n (x)

More information

Measurable functions are approximately nice, even if look terrible.

Measurable functions are approximately nice, even if look terrible. Tel Aviv University, 2015 Functions of real variables 74 7 Approximation 7a A terrible integrable function........... 74 7b Approximation of sets................ 76 7c Approximation of functions............

More information

Nets Hawk Katz Theorem. There existsaconstant C>so that for any number >, whenever E [ ] [ ] is a set which does not contain the vertices of any axis

Nets Hawk Katz Theorem. There existsaconstant C>so that for any number >, whenever E [ ] [ ] is a set which does not contain the vertices of any axis New York Journal of Mathematics New York J. Math. 5 999) {3. On the Self Crossing Six Sided Figure Problem Nets Hawk Katz Abstract. It was shown by Carbery, Christ, and Wright that any measurable set E

More information

Lecture 2: Uniform Entropy

Lecture 2: Uniform Entropy STAT 583: Advanced Theory of Statistical Inference Spring 218 Lecture 2: Uniform Entropy Lecturer: Fang Han April 16 Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal

More information

Introduction and Preliminaries

Introduction and Preliminaries Chapter 1 Introduction and Preliminaries This chapter serves two purposes. The first purpose is to prepare the readers for the more systematic development in later chapters of methods of real analysis

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research

More information

Nonparametric estimation under Shape Restrictions

Nonparametric estimation under Shape Restrictions Nonparametric estimation under Shape Restrictions Jon A. Wellner University of Washington, Seattle Statistical Seminar, Frejus, France August 30 - September 3, 2010 Outline: Five Lectures on Shape Restrictions

More information

MEASURE THEORY AND LEBESGUE INTEGRAL 15

MEASURE THEORY AND LEBESGUE INTEGRAL 15 MASUR THORY AND LBSGU INTGRAL 15 Proof. Let 2Mbe such that µ() = 0, and f 1 (x) apple f 2 (x) apple and f n (x) =f(x) for x 2 c.settingg n = c f n and g = c f, we observe that g n = f n a.e. and g = f

More information

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N Problem 1. Let f : A R R have the property that for every x A, there exists ɛ > 0 such that f(t) > ɛ if t (x ɛ, x + ɛ) A. If the set A is compact, prove there exists c > 0 such that f(x) > c for all x

More information

Integration on Measure Spaces

Integration on Measure Spaces Chapter 3 Integration on Measure Spaces In this chapter we introduce the general notion of a measure on a space X, define the class of measurable functions, and define the integral, first on a class of

More information

INTRODUCTION TO MEASURE THEORY AND LEBESGUE INTEGRATION

INTRODUCTION TO MEASURE THEORY AND LEBESGUE INTEGRATION 1 INTRODUCTION TO MEASURE THEORY AND LEBESGUE INTEGRATION Eduard EMELYANOV Ankara TURKEY 2007 2 FOREWORD This book grew out of a one-semester course for graduate students that the author have taught at

More information

Math 4121 Spring 2012 Weaver. Measure Theory. 1. σ-algebras

Math 4121 Spring 2012 Weaver. Measure Theory. 1. σ-algebras Math 4121 Spring 2012 Weaver Measure Theory 1. σ-algebras A measure is a function which gauges the size of subsets of a given set. In general we do not ask that a measure evaluate the size of every subset,

More information

Real Analysis Notes. Thomas Goller

Real Analysis Notes. Thomas Goller Real Analysis Notes Thomas Goller September 4, 2011 Contents 1 Abstract Measure Spaces 2 1.1 Basic Definitions........................... 2 1.2 Measurable Functions........................ 2 1.3 Integration..............................

More information

L p Spaces and Convexity

L p Spaces and Convexity L p Spaces and Convexity These notes largely follow the treatments in Royden, Real Analysis, and Rudin, Real & Complex Analysis. 1. Convex functions Let I R be an interval. For I open, we say a function

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations

Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 1, Issue 1 2005 Article 3 Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee Jon A. Wellner

More information

THEOREMS, ETC., FOR MATH 516

THEOREMS, ETC., FOR MATH 516 THEOREMS, ETC., FOR MATH 516 Results labeled Theorem Ea.b.c (or Proposition Ea.b.c, etc.) refer to Theorem c from section a.b of Evans book (Partial Differential Equations). Proposition 1 (=Proposition

More information

arxiv:math/ v2 [math.st] 17 Jun 2008

arxiv:math/ v2 [math.st] 17 Jun 2008 The Annals of Statistics 2008, Vol. 36, No. 3, 1031 1063 DOI: 10.1214/009053607000000974 c Institute of Mathematical Statistics, 2008 arxiv:math/0609020v2 [math.st] 17 Jun 2008 CURRENT STATUS DATA WITH

More information

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure? MA 645-4A (Real Analysis), Dr. Chernov Homework assignment 1 (Due 9/5). Prove that every countable set A is measurable and µ(a) = 0. 2 (Bonus). Let A consist of points (x, y) such that either x or y is

More information

Real Analysis Chapter 4 Solutions Jonathan Conder

Real Analysis Chapter 4 Solutions Jonathan Conder 2. Let x, y X and suppose that x y. Then {x} c is open in the cofinite topology and contains y but not x. The cofinite topology on X is therefore T 1. Since X is infinite it contains two distinct points

More information

Examples of Dual Spaces from Measure Theory

Examples of Dual Spaces from Measure Theory Chapter 9 Examples of Dual Spaces from Measure Theory We have seen that L (, A, µ) is a Banach space for any measure space (, A, µ). We will extend that concept in the following section to identify an

More information

MATH5011 Real Analysis I. Exercise 1 Suggested Solution

MATH5011 Real Analysis I. Exercise 1 Suggested Solution MATH5011 Real Analysis I Exercise 1 Suggested Solution Notations in the notes are used. (1) Show that every open set in R can be written as a countable union of mutually disjoint open intervals. Hint:

More information

THEOREMS, ETC., FOR MATH 515

THEOREMS, ETC., FOR MATH 515 THEOREMS, ETC., FOR MATH 515 Proposition 1 (=comment on page 17). If A is an algebra, then any finite union or finite intersection of sets in A is also in A. Proposition 2 (=Proposition 1.1). For every

More information

1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer 11(2) (1989),

1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer 11(2) (1989), Real Analysis 2, Math 651, Spring 2005 April 26, 2005 1 Real Analysis 2, Math 651, Spring 2005 Krzysztof Chris Ciesielski 1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer

More information

4 Sums of Independent Random Variables

4 Sums of Independent Random Variables 4 Sums of Independent Random Variables Standing Assumptions: Assume throughout this section that (,F,P) is a fixed probability space and that X 1, X 2, X 3,... are independent real-valued random variables

More information

7 Convergence in R d and in Metric Spaces

7 Convergence in R d and in Metric Spaces STA 711: Probability & Measure Theory Robert L. Wolpert 7 Convergence in R d and in Metric Spaces A sequence of elements a n of R d converges to a limit a if and only if, for each ǫ > 0, the sequence a

More information

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set Analysis Finite and Infinite Sets Definition. An initial segment is {n N n n 0 }. Definition. A finite set can be put into one-to-one correspondence with an initial segment. The empty set is also considered

More information

GLUING LEMMAS AND SKOROHOD REPRESENTATIONS

GLUING LEMMAS AND SKOROHOD REPRESENTATIONS GLUING LEMMAS AND SKOROHOD REPRESENTATIONS PATRIZIA BERTI, LUCA PRATELLI, AND PIETRO RIGO Abstract. Let X, E), Y, F) and Z, G) be measurable spaces. Suppose we are given two probability measures γ and

More information

The intersection axiom of

The intersection axiom of The intersection axiom of conditional independence: some new [?] results Richard D. Gill Mathematical Institute, University Leiden This version: 26 March, 2019 (X Y Z) & (X Z Y) X (Y, Z) Presented at Algebraic

More information

Summary of Real Analysis by Royden

Summary of Real Analysis by Royden Summary of Real Analysis by Royden Dan Hathaway May 2010 This document is a summary of the theorems and definitions and theorems from Part 1 of the book Real Analysis by Royden. In some areas, such as

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics

More information

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define Homework, Real Analysis I, Fall, 2010. (1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define ρ(f, g) = 1 0 f(x) g(x) dx. Show that

More information

Banach Spaces V: A Closer Look at the w- and the w -Topologies

Banach Spaces V: A Closer Look at the w- and the w -Topologies BS V c Gabriel Nagy Banach Spaces V: A Closer Look at the w- and the w -Topologies Notes from the Functional Analysis Course (Fall 07 - Spring 08) In this section we discuss two important, but highly non-trivial,

More information

Chapter 4. The dominated convergence theorem and applications

Chapter 4. The dominated convergence theorem and applications Chapter 4. The dominated convergence theorem and applications The Monotone Covergence theorem is one of a number of key theorems alllowing one to exchange limits and [Lebesgue] integrals (or derivatives

More information

Nonparametric estimation of log-concave densities

Nonparametric estimation of log-concave densities Nonparametric estimation of log-concave densities Jon A. Wellner University of Washington, Seattle Seminaire, Institut de Mathématiques de Toulouse 5 March 2012 Seminaire, Toulouse Based on joint work

More information

van Rooij, Schikhof: A Second Course on Real Functions

van Rooij, Schikhof: A Second Course on Real Functions vanrooijschikhof.tex April 25, 2018 van Rooij, Schikhof: A Second Course on Real Functions Notes from [vrs]. Introduction A monotone function is Riemann integrable. A continuous function is Riemann integrable.

More information

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by L p Functions Given a measure space (, µ) and a real number p [, ), recall that the L p -norm of a measurable function f : R is defined by f p = ( ) /p f p dµ Note that the L p -norm of a function f may

More information

Lebesgue-Radon-Nikodym Theorem

Lebesgue-Radon-Nikodym Theorem Lebesgue-Radon-Nikodym Theorem Matt Rosenzweig 1 Lebesgue-Radon-Nikodym Theorem In what follows, (, A) will denote a measurable space. We begin with a review of signed measures. 1.1 Signed Measures Definition

More information

Exercise Solutions to Functional Analysis

Exercise Solutions to Functional Analysis Exercise Solutions to Functional Analysis Note: References refer to M. Schechter, Principles of Functional Analysis Exersize that. Let φ,..., φ n be an orthonormal set in a Hilbert space H. Show n f n

More information

Part II Probability and Measure

Part II Probability and Measure Part II Probability and Measure Theorems Based on lectures by J. Miller Notes taken by Dexter Chua Michaelmas 2016 These notes are not endorsed by the lecturers, and I have modified them (often significantly)

More information

The Frobenius{Perron Operator

The Frobenius{Perron Operator (p.75) CHPTER 4 The Frobenius{Perron Operator The hero of this book is the Frobenius{Perron operator. With this powerful tool we shall study absolutely continuous invariant measures, their existence and

More information

Chapter 4. Measure Theory. 1. Measure Spaces

Chapter 4. Measure Theory. 1. Measure Spaces Chapter 4. Measure Theory 1. Measure Spaces Let X be a nonempty set. A collection S of subsets of X is said to be an algebra on X if S has the following properties: 1. X S; 2. if A S, then A c S; 3. if

More information

An elementary proof of the weak convergence of empirical processes

An elementary proof of the weak convergence of empirical processes An elementary proof of the weak convergence of empirical processes Dragan Radulović Department of Mathematics, Florida Atlantic University Marten Wegkamp Department of Mathematics & Department of Statistical

More information

An idea how to solve some of the problems. diverges the same must hold for the original series. T 1 p T 1 p + 1 p 1 = 1. dt = lim

An idea how to solve some of the problems. diverges the same must hold for the original series. T 1 p T 1 p + 1 p 1 = 1. dt = lim An idea how to solve some of the problems 5.2-2. (a) Does not converge: By multiplying across we get Hence 2k 2k 2 /2 k 2k2 k 2 /2 k 2 /2 2k 2k 2 /2 k. As the series diverges the same must hold for the

More information

Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics

Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee 1 and Jon A. Wellner 2 1 Department of Statistics, Department of Statistics, 439, West

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

A SKOROHOD REPRESENTATION THEOREM WITHOUT SEPARABILITY PATRIZIA BERTI, LUCA PRATELLI, AND PIETRO RIGO

A SKOROHOD REPRESENTATION THEOREM WITHOUT SEPARABILITY PATRIZIA BERTI, LUCA PRATELLI, AND PIETRO RIGO A SKOROHOD REPRESENTATION THEOREM WITHOUT SEPARABILITY PATRIZIA BERTI, LUCA PRATELLI, AND PIETRO RIGO Abstract. Let (S, d) be a metric space, G a σ-field on S and (µ n : n 0) a sequence of probabilities

More information

1 Weak Convergence in R k

1 Weak Convergence in R k 1 Weak Convergence in R k Byeong U. Park 1 Let X and X n, n 1, be random vectors taking values in R k. These random vectors are allowed to be defined on different probability spaces. Below, for the simplicity

More information

Math 240 (Driver) Qual Exam (9/12/2017)

Math 240 (Driver) Qual Exam (9/12/2017) 1 Name: I.D. #: Math 240 (Driver) Qual Exam (9/12/2017) Instructions: Clearly explain and justify your answers. You may cite theorems from the text, notes, or class as long as they are not what the problem

More information

An introduction to some aspects of functional analysis

An introduction to some aspects of functional analysis An introduction to some aspects of functional analysis Stephen Semmes Rice University Abstract These informal notes deal with some very basic objects in functional analysis, including norms and seminorms

More information

Consistency of the Semi-parametric MLE under the Cox Model with Linearly Time-dependent Covariates and Interval-censored Data

Consistency of the Semi-parametric MLE under the Cox Model with Linearly Time-dependent Covariates and Interval-censored Data Journal of Advanced Statistics, Vol. 4, No. 1, March 2019 https://dx.doi.org/10.22606/jas.2019.41001 1 Consistency of the Semi-parametric MLE under the Cox Model with Linearly Time-dependent Covariates

More information

The main results about probability measures are the following two facts:

The main results about probability measures are the following two facts: Chapter 2 Probability measures The main results about probability measures are the following two facts: Theorem 2.1 (extension). If P is a (continuous) probability measure on a field F 0 then it has a

More information

MATH FINAL EXAM REVIEW HINTS

MATH FINAL EXAM REVIEW HINTS MATH 109 - FINAL EXAM REVIEW HINTS Answer: Answer: 1. Cardinality (1) Let a < b be two real numbers and define f : (0, 1) (a, b) by f(t) = (1 t)a + tb. (a) Prove that f is a bijection. (b) Prove that any

More information

Logical Connectives and Quantifiers

Logical Connectives and Quantifiers Chapter 1 Logical Connectives and Quantifiers 1.1 Logical Connectives 1.2 Quantifiers 1.3 Techniques of Proof: I 1.4 Techniques of Proof: II Theorem 1. Let f be a continuous function. If 1 f(x)dx 0, then

More information

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1 Chapter 2 Probability measures 1. Existence Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension to the generated σ-field Proof of Theorem 2.1. Let F 0 be

More information

Verifying Regularity Conditions for Logit-Normal GLMM

Verifying Regularity Conditions for Logit-Normal GLMM Verifying Regularity Conditions for Logit-Normal GLMM Yun Ju Sung Charles J. Geyer January 10, 2006 In this note we verify the conditions of the theorems in Sung and Geyer (submitted) for the Logit-Normal

More information

CHAPTER I THE RIESZ REPRESENTATION THEOREM

CHAPTER I THE RIESZ REPRESENTATION THEOREM CHAPTER I THE RIESZ REPRESENTATION THEOREM We begin our study by identifying certain special kinds of linear functionals on certain special vector spaces of functions. We describe these linear functionals

More information

Deviation Measures and Normals of Convex Bodies

Deviation Measures and Normals of Convex Bodies Beiträge zur Algebra und Geometrie Contributions to Algebra Geometry Volume 45 (2004), No. 1, 155-167. Deviation Measures Normals of Convex Bodies Dedicated to Professor August Florian on the occasion

More information

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing. 5 Measure theory II 1. Charges (signed measures). Let (Ω, A) be a σ -algebra. A map φ: A R is called a charge, (or signed measure or σ -additive set function) if φ = φ(a j ) (5.1) A j for any disjoint

More information

2 Sequences, Continuity, and Limits

2 Sequences, Continuity, and Limits 2 Sequences, Continuity, and Limits In this chapter, we introduce the fundamental notions of continuity and limit of a real-valued function of two variables. As in ACICARA, the definitions as well as proofs

More information

FUNCTIONAL ANALYSIS LECTURE NOTES: WEAK AND WEAK* CONVERGENCE

FUNCTIONAL ANALYSIS LECTURE NOTES: WEAK AND WEAK* CONVERGENCE FUNCTIONAL ANALYSIS LECTURE NOTES: WEAK AND WEAK* CONVERGENCE CHRISTOPHER HEIL 1. Weak and Weak* Convergence of Vectors Definition 1.1. Let X be a normed linear space, and let x n, x X. a. We say that

More information

3. Fourier decomposition of functions

3. Fourier decomposition of functions 22 C. MOUHOT 3.1. The Fourier transform. 3. Fourier decomposition of functions Definition 3.1 (Fourier Transform on L 1 (R d )). Given f 2 L 1 (R d ) define its Fourier transform F(f)( ) := R d e 2i x

More information

MTH 404: Measure and Integration

MTH 404: Measure and Integration MTH 404: Measure and Integration Semester 2, 2012-2013 Dr. Prahlad Vaidyanathan Contents I. Introduction....................................... 3 1. Motivation................................... 3 2. The

More information

Asymptotic efficiency of simple decisions for the compound decision problem

Asymptotic efficiency of simple decisions for the compound decision problem Asymptotic efficiency of simple decisions for the compound decision problem Eitan Greenshtein and Ya acov Ritov Department of Statistical Sciences Duke University Durham, NC 27708-0251, USA e-mail: eitan.greenshtein@gmail.com

More information

Problem List MATH 5143 Fall, 2013

Problem List MATH 5143 Fall, 2013 Problem List MATH 5143 Fall, 2013 On any problem you may use the result of any previous problem (even if you were not able to do it) and any information given in class up to the moment the problem was

More information

Random Process Lecture 1. Fundamentals of Probability

Random Process Lecture 1. Fundamentals of Probability Random Process Lecture 1. Fundamentals of Probability Husheng Li Min Kao Department of Electrical Engineering and Computer Science University of Tennessee, Knoxville Spring, 2016 1/43 Outline 2/43 1 Syllabus

More information

Semiparametric Minimax Rates

Semiparametric Minimax Rates arxiv: math.pr/0000000 Semiparametric Minimax Rates James Robins, Eric Tchetgen Tchetgen, Lingling Li, and Aad van der Vaart James Robins Eric Tchetgen Tchetgen Department of Biostatistics and Epidemiology

More information

Chapter 7 Statistical Functionals and the Delta Method

Chapter 7 Statistical Functionals and the Delta Method Chapter 7 Statistical Functionals and the Delta Method. Estimators as Functionals of F n or P n 2. Continuity of Functionals of F or P 3. Metrics for Distribution Functions F and Probability Distributions

More information

Math212a1413 The Lebesgue integral.

Math212a1413 The Lebesgue integral. Math212a1413 The Lebesgue integral. October 28, 2014 Simple functions. In what follows, (X, F, m) is a space with a σ-field of sets, and m a measure on F. The purpose of today s lecture is to develop the

More information

Supplementary Notes for W. Rudin: Principles of Mathematical Analysis

Supplementary Notes for W. Rudin: Principles of Mathematical Analysis Supplementary Notes for W. Rudin: Principles of Mathematical Analysis SIGURDUR HELGASON In 8.00B it is customary to cover Chapters 7 in Rudin s book. Experience shows that this requires careful planning

More information

Measure Theory & Integration

Measure Theory & Integration Measure Theory & Integration Lecture Notes, Math 320/520 Fall, 2004 D.H. Sattinger Department of Mathematics Yale University Contents 1 Preliminaries 1 2 Measures 3 2.1 Area and Measure........................

More information