CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD

Size: px
Start display at page:

Download "CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD"

Transcription

1 The Annals of Statistics 999, Vol. 7, No. 4, CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD BY JOHN H. J. EINMAHL AND IAN W. MCKEAGUE Eindhoven University of Technology and EURANDOM and Florida State University The nonparametric empirical lielihood approach is used to obtain simultaneous confidence tubes for multiple quantile plots based on independent Ž possibly right-censored. samples. These tubes are asymptotically distribution free, except when both 3 and censoring is present. Pointwise versions of the confidence tubes, however, are asymptotically distribution free in all cases. The various confidence tubes are valid under minimal conditions. The proposed methods are applied in three real data examples.. Introduction. The quantile-quantile Ž Q-Q. plot is a well nown and attractive graphical method for comparing two distributions, especially when confidence limits are added. In this paper we develop Q-Q plot methods for the comparison of two or more distributions from randomly censored data. More specifically, we consider the problem of finding simultaneous confidence tubes for multiple quantile plots Ž for brevity, multi-q plots. from independent samples of possibly right-censored survival times. The multi-q plot is defined to be the -dimensional curve ŽQ Ž p.,..., Q Ž p.. parameterized by 0 p, where Q is the quantile function of the th distribution. It specializes to the ordinary Q-Q plot in the two-sample case. The comparison of quantile functions is particularly useful for the analysis of survival data in biomedical settings. Frail and strong individuals Žcorre- sponding to low and high values of p. often respond to different treatments in different ways, so treatment effects can be hard to determine from comparison of mean or median survival times alone; see, for example, Dosum Ž The approach developed here allows comparison of treatments simultaneously across all frailty levels. Our approach is based on the nonparametric empirical lielihood method. This method was originally developed by Thomas and Grunemeier Ž 975. and Owen Ž 988, 990. as a way of improving upon Wald-type confidence regions. There now exists a substantial literature on empirical lielihood indicating that it is widely viewed as a desirable and natural approach to Received May 998; revised April 999. Supported in part by a Fulbright grant and by European Union Grant ERB CHRX-CT AMS 99 subect classifications. Primary 6G5; secondary 6G0. Key words and phrases. Censoring, confidence region, distribution-free, -sample comparison, nonparametric lielihood ratio, quantile-quantile plot. 348

2 CONFIDENCE TUBES FOR QUANTILE PLOTS 349 statistical inference in a variety of settings. Moreover, there is considerable evidence that procedures based on the method outperform competing procedures. Empirical lielihood based confidence bands for individual quantile functions have recently been derived in Li, Hollander, McKeague and Yang Ž Nai-Nimbalar and Raarshi Ž 997. employed the approach to test for equality of medians; their test naturally extends to a test for equality of quantiles. We use the nonparametric empirical lielihood approach to derive asymptotic simultaneous confidence tubes for multi-q plots based on independent random samples, including confidence bands for ordinary Q-Q plots Ž.. The tubes are applicable to situations with or without random censoring. The limiting processes involved in the construction of the tubes are distribution free, except when 3 and censoring is present. In general, we are able to obtain asymptotically distribution-free pointwise confidence regions for the multi-q plot. The various confidence tubes are valid under minimal conditions, although for convenience we shall assume continuity of the underlying distribution functions. Q-Q plots are studied in detail using classical methods in Dosum Ž974, 977., Dosum and Sievers Ž 976. and Switzer Ž 976. for models without censoring; see Shorac and Wellner Ž 986., pages for a summary and discussion. For models with censoring, Wald-type simultaneous confidence bands for Q-Q plots are obtained in Aly Ž 986., but restrictive differentiability conditions on the underlying distribution functions are required. The -sample problem without censoring is studied in Nair Ž 978, 98., but essentially only pairwise comparisons are made there. A review of graphical methods in nonparametric statistics with extensive coverage of Q-Q plots can be found in Fisher Ž Some refined approximation results for normalized Q-Q plots with statistical applications have been established in Beirlant and Deheuvels Ž 990. for the uncensored case and Deheuvels and Einmahl Ž 99. in the censored case. The paper is organized as follows. The proposed confidence tubes and the main results are presented in Section. Our approach is illustrated in Section 3 using three real data examples. All the proofs are contained in Section 4.. Main results. We begin by specifying the setup precisely and introducing the basic notation. It is convenient first to recall the notation in the one-sample case. For the corresponding notation in the general -sample case, we use a further subscript to refer to the th sample. The random censorship model deals with n i.i.d. pairs Ž Z,. i i, i,..., n, obtained from two independent random samples X and Y, i,..., n, in the following way: Zi Xi Y i, i X Y 4. The distribution functions of X i i i and Yi are denoted F and G, respectively, and F is assumed to be continu- ous. We will wor with nonnegative Xi and Y i, but this restriction is in fact not needed anywhere; see the discussion at the end of this section. The right-continuous quantile function corresponding to F is denoted by Q. We i i

3 350 J. H. J. EINMAHL AND I. W. MCKEAGUE write n i Ł i i i i L F F Ž Z. F Ž Z. F Ž Z. i for the lielihood, where F belongs to, the space of all distribution functions on 0,.. The ordered uncensored survival times, that is, the X i with corresponding i, are written 0 T TN, and r Ý n denotes the size of the ris set at T. The empirical lielihood i Z i T4 ratio for Ft p Ž given 0 p. is defined by 4 4 sup LŽ F. : F Ž t. p, F RŽ t.. sup LŽ F. : F Note that the sup in the denominator is attained by the KaplanMeier Žor product-limit. estimator Fn Ž t. Ł. ž r / i: Tit i It can be shown with the aid of Lagrange s method see Thomas and Grunemeier Ž 975. or Li Ž 995. that ½ 5 i i log RŽ t. Ý Ž ri. log ri log, r r i: T t i where the Lagrange multiplier D max Ž r. i: T t i satisfies the equa- i tion. p. r Ł i: Tit ž i / Now we turn to the multisample setup. The samples are assumed to be independent with sample sizes denoted n,..., n ; write n Ý n. Set Ž F,..., F. and define the multi-q plot to be 4 Q Ž p.,..., Q Ž p. :0p. Observe that this is the classical Q-Q plot when. In the sequel we consider the following more convenient version of the multi-q plot: the graph Q of the function t Q F Ž t.,..., Q F Ž t. Ž. for t 0. Denote the oint lielihood by L L F Ł and the empirical lielihood ratio at t Ž t,..., t. by ½ 5 sup LŽ.: FŽ t. FŽ t. for all,...,, RŽ t. sup LŽ.: 4.

4 CONFIDENCE TUBES FOR QUANTILE PLOTS 35 Again we find, using Lagrange s method with the constraints, F Ž t. F Ž t.,,...,, that. log RŽ t. ½ 5 i i Ý Ý Ž ri. log ri log, r r i: Tit where the,,...,, satisfy the equations i i.3 ; r r Ł Ł i: Tit i: Tit Ž here we have set Ý so Ý 0. and the should satisfy D for,...,. Later we show that this system of equations indeed has a unique solution; see Lemma 4.. In the one-sample case, it is immediately clear that the corresponding Lagrange multiplier equation. has a unique solution, but it is not obvious in the multisample case. Computation of the s can be carried out using a special-purpose root-finding procedure which exploits the monotonicity of the r.h.s. of 3. as a function of Ž see Section 3 and the proof of Lemma 4... The various confidence sets we propose are easily obtained from the main theorem below and are presented in the three subsequent theorems. These confidence sets are all of the form t: RŽ. t c 4, where c is derived using asymptotic considerations. Before stating our main theorem we introduce some more notation. We assume throughout that n n p 0asn for,..., Ž although with some care this condition can be relaxed to n.. Define s df Ž u. Ž s. H. F Ž u. F Ž u. G Ž u. 0 Ž.Ž.Ž. We will need the -matrix D DŽ. t with entries iž ti. Ž t. i, for i, pp where d i i ti pi ' i Ý li li, for i, Ł l i, l l Ž tl. pl i Ý Ł Ž t. p q l q l l l Ž the empty product is defined to be.. Also define V VŽ t. to be the random -vector with th entry W Ž Ž t.. Ž t., where the W are independent standard Wiener processes. Let be such that F Ž. 0 and let be

5 35 J. H. J. EINMAHL AND I. W. MCKEAGUE such that F Ž., G Ž. and G ŽQ ŽF Ž... for,...,. We assume throughout that the F are continuous. THEOREM.. When R, D and V are evaluated at t Žt, Q ŽF Ž t..,..., Q ŽF Ž t... for t, we have.4 log R DV D on D,, with the -dimensional Euclidian norm. Write the restriction of Q to t, as Q,. In the next theorem we consider the important case, in which the multi-q plot reduces to the usual Q-Q plot. Define c s, s for 0 by ž / ss, s P sup W Ž s. s c s, s. Set ˆc c ˆ Ž., ˆ Ž., where ˆ 5 ½ ˆ t Qn F n Ž t. 5. ˆ Ž t. n, n n with ˆ.6 s n Ý i: T s i r r and with Q the Ž right-continuous. n quantile function corresponding to F n. Now we define the confidence band for Q, to be i i B t, 0,. : log RŽ t. ˆc 4. THEOREM.. In the censored case, for and 0, lim P Ž Q, B.. n 7. REMARK.. In the uncensored case Ž and. we have that Ž. Ž t. Q FŽ t. Ž t. p p ž p p / FŽ t. p p F Ž t. t.

6 CONFIDENCE TUBES FOR QUANTILE PLOTS 353 ˆ Therefore for this case we can replace the Ž t. defined in 5. by the simpler but almost equivalent estimator Fn Ž t. ˆ Ž t. n. ž n n F Ž t. / For use in ˆc, we can replace ˆ Ž t. by n F n Ž t.. F Ž t. n Observe that this last expression is not an estimator of Ž t. but of Ž t.. This however maes no difference because of 7. and the fact that for c 0 sup W Ž s. s sup W Ž s. s. D ss, s scs, cs Of course, here the KaplanMeier estimator Fn is ust the empirical distribution function of the first sample. Next we return to general, but assume that there is no censoring. Note that in this case the assumptions on reduce to F Ž.. Define C s, s for 0 by Ý ž s / s s, s 8. P sup W Ž s. C s, s. ˆ ˆ ˆ Set C C Ž., Ž., where F n Ž t. ˆ Ž t.. F Ž t. Define the confidence tube for Q, by n. ˆ 4 T t, 0, : log R t C. THEOREM.3. In the absence of censoring, for all and 0, lim P Ž Q, T.. n Now we allow censoring and but tae. Set Q Q,. Define the confidence region for Q by 4. 4 R t 0, : log R t, where is the upper -quantile of the chi-square distribution with degrees of freedom. In the case note that R amounts to a confidence interval for the F Ž.-quantile of F.

7 354 J. H. J. EINMAHL AND I. W. MCKEAGUE THEOREM.4. In the censored case, for all and 0, lim P Ž Q R.. n The asymptotic null distribution in the test for equality of medians developed by Nai-Nimbalar and Raarshi Ž 997. can be essentially derived from the proof of Theorem.4 by taing as their estimator * of the common median. Finally we establish an interval property for the confidence tube T Žwhich also applies to B and R.: one-dimensional cross-sections parallel to a given axis are intervals. This is useful for computing the various confidence sets because points belonging to them can then be found by a simple search strategy that sweeps along each axis. Žl. Ž Žl. THEOREM.5. Suppose that t t,..., t,..., t. T for l,, Ž. Ž. Ž. Ž. where t t. Then t* t,..., t,..., t T for any t t, t. In the two-sample case Ž. we have a somewhat stronger result. THEOREM.6. Let Žt Žl., t Žl.., l,, belong to the confidence band B and Ž. Ž. Ž. Ž. suppose t t and t t. Then Žt, t. also belongs to B whenever t Ž. t t Ž. and t Ž. t t Ž.. Ž. Ž. This theorem as well as Theorem.5 implies, by taing t t or t Ž. t Ž., that the intersection of the band B with a vertical or horizontal line is an interval. In addition, it shows that the bands are nondecreasing in the sense that their lower or upper boundaries are nondecreasing. Discussion. We wish to emphasize that our approach, including the definition of the multi-q plot, is new even in the uncensored case. We also remind that nonnegativity of the observations is not needed anywhere in the proofs. This is especially useful in the uncensored case, where often the samples do not represent life or failure times or when a transformation is applied to the data Ž see the third example in Section 3.. Another desirable feature of our approach is that the confidence bands and tubes are essentially invariant under permutations of the order of the samples involved. ŽOnly at the two ends of the tube does the first sample play a somewhat special role.. We did not formulate a version of our confidence tubes in the censored case for 3 since then DV in 4. is not distribution free, even when only one of the samples is subect to censoring. Our approach can, however, be generalized to this situation by estimating all the unnowns appearing in D and V and then using simulation. This means that we replace D by C Žgiven. Ž. in the proof of Theorem., t by Qn F n t for,...,, and by. The process to be simulated has the form CV, where Vˆ ˆ is the estimated version of V. Hence, approximate confidence tubes can be

8 CONFIDENCE TUBES FOR QUANTILE PLOTS 355 constructed for the censored case as well, but we do not pursue this in further detail here. The one-sample Q-Q plot, t QF Ž Ž t.. 0 with F0 nown, is essentially treated in Li, Hollander, McKeague and Yang Ž 996., since their confidence bands for QŽ p. can be transformed to bands for QF Ž Ž t.. 0 by the time change p F Ž t. 0. The present paper can be seen as a generalization of their approach to the -sample case. For uncensored data, in the two-sample Q-Q plot case, our confidence bands perform well in the tails due to the weighting which naturally arises when using the empirical lielihood method. Our bands share this property with the weighted bands Ž W bands. introduced in Dosum and Sievers Ž 976., which are based on the standardized two-sample empirical process. The bands in Switzer Ž 976. and Aly Ž 986. for the censored case are much wider in the tails, since they are based on the unweighted empirical process. All these procedures as well as our procedures are essentially based on the inversion of a distance between empirical distribution functions Žor KaplanMeier estimators.. In fact, the W bands are asymptotically equivalent to our bands in the uncensored case. 3. Applications to real data. In this section we illustrate our approach in three real data examples. First we consider a biomedical example for the two-sample case with censored survival data. The data come from a Mayo Clinic trial involving a treatment for primary biliary cirrhosis of the liver; see Fleming and Harrington Ž 99. for discussion. A total of n 3 patients participated in the randomized clinical trial, 58 receiving the treatment Ž D-penicillamine. and 54 receiving a placebo. Censoring is heavy Ž87 of the 3 observations are censored.. Figure displays the 90% confidence band Žand pointwise confidence intervals. for the Q-Q plot of treatment versus placebo for survival time in days. The standard empirical Q-Q plot based on quantiles of the KaplanMeier estimator is also displayed. Note that although the diagonal departs from the pointwise confidence region at some points, it remains within the simultaneous band, so there is no overall evidence of a difference between treatment and placebo. The second example also illustrates the two-sample case. Hollander, McKeague and Yang Ž 997. analyzed data on 43 manuscripts submitted to the Theory and Methods Section of JASA during 994. Each observation consists of the number of days between a manuscript s submission and its first review or the end of the year, along with a censoring indicator Ž if a paper received its first review by the end of the year; 0 otherwise.. Similar data Ž on 444 manuscripts. are available for 995. The censoring is light Ž330 of the 876 observations are censored. compared with the previous example. It is of interest to loo for differences in the pattern of review times for the two years. Figure displays the 95% confidence band Žand pointwise confidence intervals. for the Q-Q plot. The lower endpoints of the pointwise confidence intervals touch the diagonal between 0 and 5 days, which might suggest

9 356 J. H. J. EINMAHL AND I. W. MCKEAGUE FIG.. 90% confidence band Ž solid line. for the treatment versus placebo Q-Q plot in the Mayo Clinic trial, for 86 t 976 days; pointwise confidence intervals Ž short dashed line., empirical Q-Q plot Ž long dashed line.. that rapid reviews were faster in 994 than in 995. However, the diagonal is completely contained within the simultaneous band, so there is no overall evidence of a difference between the patterns of review times. The third example concerns times to breadown Ž in minutes. of an insulating fluid under three elevated voltage stresses, from data reported in Nair Ž 98., Table. It is important to determine whether the distribution of time to breadown changes with voltage. There are 60 uncensored observations at each voltage level Ž 34, 35 and 36 Kv.. As in Nair Ž 98. we use the 34 Kv measurements as a reference sample and put the breadown times on a log-scale. Figure 3 shows cross-sections of the 95% confidence regions for the multi-q plot at three values of the reference sample: t 0.4,.06 and.65. The confidence tube gives simultaneous coverage over the interval 0.4 t.65. The diagonal Ž t, t, t. runs above the pointwise confidence region at t.65 Ž top right plot. suggesting that increased voltage can reduce breadown time in the upper tail of the distribution. However, the diagonal falls completely inside the simultaneous tube Ž left column. so there is a lac of significant evidence for breadown time changing with voltage. In these examples, we computed the Lagrange multipliers in the system of equations 3. using the van WingaardenDeerBrent root finding algorithm Press, Teuolsy, Vetterling and Flannery Ž 99., page 359. The proof of Lemma 4. provides a constructive method to obtain the solution by repeated use of their algorithm. The thresholds c and Cˆ used in the ˆ

10 CONFIDENCE TUBES FOR QUANTILE PLOTS 357 FIG.. 95% confidence band Ž solid line. for the Q-Q plot based on the JASA time-to-first-review data, for 5 t 95 days; pointwise confidence intervals Ž short dashed line., empirical Q-Q plot Ž long dashed line.. confidence bands and tubes were computed by simulation of the Wiener processes on a fine grid. 4. Proofs. Here we present proofs of the theorems in Section. Some lemmas used in these proofs are given at the end of this section. PROOF OF THEOREM.. First we note that, by Lemma 4., below the system of equations in.3 with replaced by Ý has a unique solution for all. Define g : Ž D,. by g Ý log ž r / i i: Tit for,...,. Denote a g Ž 0. log SˆŽ t., where Sˆ is the KaplanMeier estimator of S F and b g 0 t n with as in 6. ˆ ˆ. Here t Q ŽF Ž t.. for,...,. Taylor series expansions of g and g, in conunction with Lemma 4. and the argument of Li Ž 995., proof of 5., page 0 yield Ž g Ž. g a a b b O Ž n., P

11 358 J. H. J. EINMAHL AND I. W. MCKEAGUE FIG. 3. Time to insulating fluid breadown Ž in log-scale.,36kv sample versus 35 Kv sample; cross-sections of the 95% simultaneous confidence tube Ž left column. and pointwise confidence regions Ž right column. at t 0.4,.06 and.65 in the 34 Kv reference sample Ž bottom row to top row, respectively.. uniformly in t,. Ignore the remainder term for the moment and consider the system of equations Ž 4.. b b a a for,...,, 0 with unnowns,...,. By Lemma 4.3 this system has as unique solution Ý Ž a a. i i i with the i as defined in the lemma. We now use this result to obtain an approximation for the. The remainder term in Ž 4.. consists of the remainders in the Taylor series Ž expansions of g and g, and both are of order O n. P. Attach these remainder terms to a and a, respectively, and apply Lemma 4.3. Note that

12 CONFIDENCE TUBES FOR QUANTILE PLOTS 359 Ž is a uniformly consistent estimator of, so b O n. ˆ P and i O Ž n., and it follows that P O Ž n. O Ž n. O Ž., P P P uniformly for t,. We also have that 4.3 b b OP n. Applying the Taylor series argument of Li Ž 995., page 0 to. and using Ž 4.3. then gives Ý P log RŽ t. b O Ž n.. Write the leading term above in the form Ý b Cw, where C is the -matrix with entries bb for i, c ' i i iý li li i b, for i. and w is the -vector with entries a log SŽ t. a log SŽ t. w. b b ' The proof is completed by noting that Ž./ ž W t wž t. D VŽ t.,,..., t ',..., where W,..., W are independent standard Wiener processes, and c d. i P i PROOF OF THEOREM.. Let us first simplify DV for this case. Note that for we have 0 Ž t. Ž t. Ž t. p p p D, Ž t. Ž t. Ž t. Ž t. ' ' p p p

13 360 with J. H. J. EINMAHL AND I. W. MCKEAGUE as in Remar.. So 0 Ž t. 'p D, Ž t. Ž t. ' p where a aa, and hence DV 0 0 Ž t. / 'p ž W t W t t ' p ' p ' p t Ž t. 'p D WŽ Ž t.., Ž t. Ž t. ' p so that W Ž t. DV D. Ž t. It is well nown that Ž s. Ž s. ˆ P,,, and hence with some care it can be shown that ˆ Ž. Ž. l P l, l,. Setting c c Ž., Ž., this yields ˆc P c. Combining the above we obtain W Ž Ž t.. P Ž Q, B. P log RŽ t, Q Ž F Ž t.. ˆc for all t, ž / t, ž W Ž s. / s s, P sup c Ž t. P sup c, where we used, for the convergence statement, that the random variable in the last expression has a continuous distribution. Before continuing with the proofs of the theorems let us do some calcula- tions on DV of Theorem. in general. Note that D is symmetric and by

14 CONFIDENCE TUBES FOR QUANTILE PLOTS 36 Lemma 4.4 it is idempotent of ran. Thus we may diagonalize D Dt Ž. as follows: I 0 Ž 4.4. DŽ t. PŽ t. PŽ t., ž 0 0/ where Pt is orthogonal and I is the identity matrix of order. Put Zt Pt VŽ t.. Then Ž 4.5. DŽ t. VŽ t. VŽ t. DŽ t. DŽ t. VŽ t. VŽ t. DŽ t. VŽ t. I 0 ZŽ t. ZŽ t., 0 0 where the second equality follows since D is symmetric and idempotent. The covariance structure of the process ZŽ t. is given, for two values of t, say s t, by EŽ ZŽ s. ZŽ t.. Ž. Ž Ž ž Ž. Ž./ Ž 4.6. Ž s. Q FŽ s. Q FŽ s. PŽ s. diag,,..., PŽ t.. Ž t. Q F Ž t. Q F Ž t. First observe that FŽ QŽ FŽ t... FŽ t. Ž QŽ FŽ t... Ž t., F Q F Ž t. F Ž t. PROOF OF THEOREM.3. Ž. for,...,. This implies Dt, and hence Pt, does not depend on t. Thus the r.h.s. of Ž 4.6. reduces to Ž s. Ž t. It follows that the process Zt has the same distribution as the process I. W Ž t. W Ž t. ž t t /. Ž,...,, where the W s are independent standard Wiener processes, and hence by Ž 4.5., W Ž Ž t.. DV D Ý. Ž t. Now the proof of this theorem can be completed along the same lines as that of Theorem.. In this case, use continuity of the random variable W Ž s. sup Ý, s s,

15 36 J. H. J. EINMAHL AND I. W. MCKEAGUE which follows from a property of Gaussian measures on Banach spaces, namely that the measure of a closed ball is a continuous function of its radius; apply, for example, Paulausas and Racausas ˇ Ž 989., Chapter 4, Theorem. to the Gaussian measure induced by the process Ž. s W s,..., W s on the Banach space of -valued continuous functions on Ž., Ž. endowed with the supremum norm. PROOF OF THEOREM.4. This theorem can be proved along the lines of the previous two. We only note that now the r.h.s. of Ž 4.6., with s t, reduces to the identity matrix I. Thus ZŽ. is a -vector of independent standard normal random variables. Hence from Ž 4.5. we find that DV, evaluated at, has a distribution. PROOF OF THEOREM.5. In order not to overdo the notation we restrict ourselves to proving this theorem for 3; for 3 the proof is essentially the same. W.l.o.g. we tae 3. Because the denominator of the lielihood ratio does not depend on t Ž t,..., t., we only consider the expression 3 N r i ŁŁ i Ž i. i log h h, with the h Ž 0,. defined by h F Ž T. Ž F Ž T.. i i i, i. Setting z i logž h., this becomes i 3 N ŁŁ i i i i Ž. Ž. log exp z exp z r 3 N Ý Ý i Ž i. Ž i. 4 z r log expž z. g Ž z., i with z Ž z,..., z, z,..., z, z,..., z. N N 3 3N. Observe that g is a con- 3 vex function. N N N Now g z, z, 0 3, has to be minimized under the constraints 4.7 z z and z z. Ý Ý Ý Ý i i i 3i i: Tit i: Tit i: Tit i: T3it3 Žl. Žl. Solutions of 4.7 for t t that minimize g z, are denoted with z, l,, respectively. Ž. Ž. For t t, t, define the function Ý Ž i i. Ý Ž 3i 3i. fž x. xz Ž. Ž x. z Ž. xz Ž. Ž x. z Ž. i: Tit i: T3it3 Ž. for 0 x. Since t t, we easily see that fž Similarly, using Ž. t t, we obtain f 0. Thus there exists an x 0, 3 3 such that fž x*. 0. Define z* x*z Ž. Ž x*. z Ž.,..., x z Ž. Ž x*. z Ž.. Ž 3 N 3 N. 3 3

16 CONFIDENCE TUBES FOR QUANTILE PLOTS 363 Then trivially the two equations in Ž 4.7. are satisfied for z z* and t t*. Also because g is convex, g Ž z*. x*g Ž z Ž.. Ž x*. g Ž z Ž... Žl. This implies, since log RŽ t. C ˆ, l,, that log RŽ t*. C ˆ, that is, t* T. The proof of Theorem.6 is similar to, but easier than, the previous proof. Moreover it is a straightforward extension of the proof of Theorem in Li, Hollander, McKeague and Yang Ž Therefore we will omit the proof here. We conclude by proving the four lemmas that we used earlier. LEMMA 4.. The system of equations 3.,with unnowns,...,, has a unique solution for all provided D 0 for,...,. PROOF. Define f : Ž D,. Ž 0,. by Ž 4.8. f Ł i: T t ž r / i i for,...,. We need to show that the system of equations Ý 4.9 f f,,..., has a unique solution. Note that f is continuous, strictly increasing, and vanishes as D. It then follows that there is a unique solution to Ž 4.9. when, because the decreasing function f Ž. must cross the increasing function f Ž. at exactly one value of Ž D, D.. Now consider 3. For each fixed D and 3,...,, there exists a unique Ž. such that f Ž. f Ž.. Each of these s is strictly increasing as a function of because f and f are strictly increasing. Now consider the equation Ý 3 Ž 4.0. f Ž. f Ž.. The l.h.s. of Ž 4.0. is defined whenever D D, where D is the unique solution to Note that D D because Ý 3 D D D. Ý D Ž D. D 0 D. Ý 3 Moreover, as a function of Ž D, D., the l.h.s. of Ž 4.0. is strictly decreasing and vanishes as D ; the r.h.s. is strictly increasing and

17 364 J. H. J. EINMAHL AND I. W. MCKEAGUE vanishes as D. Thus 4.0 holds for some unique D, D. Now set Ž. for 3,...,. It is then clear that Ž,...,. is the unique solution to Ž LEMMA 4.. Suppose n n p 0 for,...,. Let t Q ŽF Ž t.. for,..., and t Ž t,..., t.. Then Ž t. O Ž n. uniformly over,. P PROOF. Write the value of each side of 3. as p when t has the above form. By Li Ž 995., page 0, if 0 then n n / logž p. A ˆ Ž t., where Aˆ is the NelsonAalen estimator of A, the cumulative hazard function corresponding to F, and if 0 then the above inequality reverses. Thus for any pair, with 0, 0 Ž l l such pairs always exist, if not all the s are 0, since Ý 0. we have and hence ž / n nl Aˆ t Aˆ Ž. lž t l. ž, n n l l n Aˆ Ž t. n Aˆ Ž t. Aˆ Ž t. Aˆ Ž t. n n. l l l l l l l Note that A Ž t. A Ž t. and A Ž t. is bounded away from 0 if t. Thus by the uniform convergence of the NelsonAalen estimators A ˆ, we have that ˆ for any 0 and n sufficiently large, A Ž t. A Ž t. for all t, with probability at least, similarly for A ˆ. It then follows that 0 Ž n n. A Ž t. Aˆ Ž t. Aˆ Ž t. n n, l l l l l l with probability, for n sufficiently large. Finally, using the fact that ˆ Ž. Ž A t A t O n uniformly over,, we find that O n. P P for all,...,, uniformly for t,. LEMMA 4.3. where The system of equations Ž 4.. has solution Ý Ž ai a. i, i i 0 Ł l 0 Ý Ł l li, l ž li / i b and b. The solution is unique when all the b s are positive. l

18 CONFIDENCE TUBES FOR QUANTILE PLOTS 365 PROOF. The coefficient of a in Ý is Ý i Ý 0, i similarly for the coefficients of a,..., a. Thus Ý a in b,is and the coefficient of a in b is Ł b b Ý 0 l l Ý Ł b b i 0 l i i li so the coefficient of a in b b is Ł Ý Ł b b. 0 l 0 l l i li 0. The coefficient of The same argument shows that the coefficient of a in b b is. The coefficient of a, with q 3, in b b is q ž Ł Ł / b b b b b b 0. q q 0 l l l, lq l, lq This shows that b b a a and the same argument shows that all the other equations in Ž 4.. are satisfied. Ž. LEMMA 4.4. The -matrix D D t is idempotent, that is, D D and of ran. PROOF. Setting v t p, we have Ý i Łl vl dii, Ý Ł v q l q l ' i l i, l l vv Ł v di, i. Ý Ł v q l q l Because of the various symmetries it suffices to show that 4. d di and d d i, d i, i i Ý Ý for the idempotency of D. For the first equality we need to show that ÝŁ ÝŁ ÝŁ Ý Ł v v v vv v. l l l l l i li l l, l

19 366 J. H. J. EINMAHL AND I. W. MCKEAGUE Writing C Ý Ł v, this reduces to l l C C Łvl C v Ý v Ł v l, l l, l or, subtracting C on both sides, ÝŁ Ł Ý Ł v v v v v, l l l l l l, l which is easily seen to be true. For the second equality in 4. we have to show that ' vv Łvl Ý Łvl Ý Łvl ' vv Łv ž l l3 li l l3 / i ž ÝŁvl ' vv Łv /ž l l l3 / ' Ý Ł Ł v v v v v. i3 Dividing both sides by ' vv Ł l3 v l yields i l l l, li l, li ÝŁ ÝŁ ÝŁ ÝŁ v v v v, l l l l i li l l i3 li which is obviously true. This establishes the idempotency of D. For the second statement in the lemma, note that the ran of an idempotent matrix is equal to its trace. It is easily seen that the trace of D is. Acnowledgments. We than the reviewers for many constructive suggestions and Myles Hollander for providing the JASA time-to-first-review data. John Einmahl thans the Department of Statistics, Florida State University, for their warm hospitality during the writing of the article. REFERENCES ALY, E.-E. A. A. Ž Quantile-quantile plots under random censorship. J. Statist. Plann. Inference BEIRLANT, J. and DEHEUVELS, P. Ž On the approximation of P-P and Q-Q plot processes by Brownian bridges. Statist. Probab. Lett DEHEUVELS, P. and EINMAHL, J. H. J. Ž 99.. Approximations and two-sample tests based on P-P and Q-Q plots of the KaplanMeier estimators of lifetime distributions. J. Multivariate Anal DOKSUM, K. A. Ž Empirical probability plots and statistical inference for nonlinear models in the two-sample case. Ann. Statist DOKSUM, K. A. Ž Some graphical methods in statistics. A review and some extensions. Statist. Neerlandica

20 CONFIDENCE TUBES FOR QUANTILE PLOTS 367 DOKSUM, K. A. and SIEVERS, G. L. Ž Plotting with confidence: graphical comparisons of two populations. Biometria FISHER, N. I. Ž Graphical methods in nonparametric statistics: a review and annotated bibliography. Internat. Statist. Rev FLEMING, T. R. and HARRINGTON, D. P. Ž 99.. Counting Processes and Survival Analysis. Wiley, New Yor. HOLLANDER, M., MCKEAGUE, I. W. and YANG, J. Ž Lielihood ratio-based confidence bands for survival functions. J. Amer. Statist. Assoc LI, G. Ž On nonparametric lielihood ratio estimation of survival probabilities for censored data. Statist. Probab. Lett LI, G., HOLLANDER, M., MCKEAGUE, I. W. and YANG, J. Ž Nonparametric lielihood ratio confidence bands for quantile functions from incomplete survival data. Ann. Statist NAIK-NIMBALKAR, U. V. and RAJARSHI, M. B. Ž Empirical lielihood ratio test for equality of medians in censored data. Statist. Probab. Lett NAIR, V. N. Ž Graphical Comparisons of Populations in Some Non-linear Models. Ph.D. dissertation, Univ. California, Bereley. NAIR, V. N. Ž 98.. Q-Q plots with confidence bands for comparing several populations. Scand. J. Statist OWEN, A. Ž Empirical lielihood ratio confidence intervals for a single functional. Biometria OWEN, A. Ž Empirical lielihood ratio confidence regions. Ann. Statist PAULAUSKAS, V. and RACKAUSKAS, ˇ A. Ž Approximation Theory in the Central Limit Theorem. Exact Results in Banach Spaces. Kluwer, Dordrecht. PRESS, W. H., TEUKOLSKY, S. A., VETTERLING, W. T. and FLANNERY, B. P. Ž 99.. Numerical Recipes in C, nd ed. Cambridge Univ. Press. SHORACK, G. R. and WELLNER, J. A. Ž Empirical Processes with Applications to Statistics. Wiley, New Yor. SWITZER, P. Ž Confidence procedures for two-sample problems. Biometria THOMAS, D. R. and GRUNKEMEIER, G. L. Ž Confidence interval estimation of survival probabilities for censored data. J. Amer. Statist. Assoc DEPARTMENT OF MATHEMATICS DEPARTMENT OF STATISTICS AND COMPUTING SCIENCE FLORIDA STATE UNIVERSITY EINDHOVEN UNIVERSITY OF TECHNOLOGY TALLAHASSEE, FLORIDA P.O. BOX 53 mceague@stat.fsu.edu 5600 MB EINDHOVEN THE NETHERLANDS einmahl@win.tue.nl

Visualizing multiple quantile plots

Visualizing multiple quantile plots Visualizing multiple quantile plots Marko A.A. Boon Eindhoven University of Technology John H.J. Einmahl Tilburg University Ian W. McKeague Columbia University January 18, 2012 Abstract. Multiple quantile

More information

Visualizing multiple quantile plots

Visualizing multiple quantile plots Visualizing multiple quantile plots Boon, M.A.A.; Einmahl, J.H.J.; McKeague, I.W. Published: 01/01/2011 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and

More information

CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD. Eindhoven University of Technology and EURANDOM and Florida State University

CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD. Eindhoven University of Technology and EURANDOM and Florida State University The Annals of Statistics 1999, Vol. 27, No. 4, 1348-1367 CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD BY JOHN H. J. EINMAHL1 AND IAN W. MCKEAGUE Eindhoven University of Technology

More information

Marko A. A. Boon a, John H. J. Einmahl b & Ian W. McKeague c a Department of Mathematics and Computer Science, Eindhoven

Marko A. A. Boon a, John H. J. Einmahl b & Ian W. McKeague c a Department of Mathematics and Computer Science, Eindhoven This article was downloaded by: [Columbia University] On: 8 March 013, At: 1: Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 107954 Registered office: Mortimer

More information

substantial literature on emirical likelihood indicating that it is widely viewed as a desirable and natural aroach to statistical inference in a vari

substantial literature on emirical likelihood indicating that it is widely viewed as a desirable and natural aroach to statistical inference in a vari Condence tubes for multile quantile lots via emirical likelihood John H.J. Einmahl Eindhoven University of Technology Ian W. McKeague Florida State University May 7, 998 Abstract The nonarametric emirical

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Comparing Distribution Functions via Empirical Likelihood

Comparing Distribution Functions via Empirical Likelihood Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Inequalities Relating Addition and Replacement Type Finite Sample Breakdown Points

Inequalities Relating Addition and Replacement Type Finite Sample Breakdown Points Inequalities Relating Addition and Replacement Type Finite Sample Breadown Points Robert Serfling Department of Mathematical Sciences University of Texas at Dallas Richardson, Texas 75083-0688, USA Email:

More information

Likelihood ratio confidence bands in nonparametric regression with censored data

Likelihood ratio confidence bands in nonparametric regression with censored data Likelihood ratio confidence bands in nonparametric regression with censored data Gang Li University of California at Los Angeles Department of Biostatistics Ingrid Van Keilegom Eindhoven University of

More information

COMPUTE CENSORED EMPIRICAL LIKELIHOOD RATIO BY SEQUENTIAL QUADRATIC PROGRAMMING Kun Chen and Mai Zhou University of Kentucky

COMPUTE CENSORED EMPIRICAL LIKELIHOOD RATIO BY SEQUENTIAL QUADRATIC PROGRAMMING Kun Chen and Mai Zhou University of Kentucky COMPUTE CENSORED EMPIRICAL LIKELIHOOD RATIO BY SEQUENTIAL QUADRATIC PROGRAMMING Kun Chen and Mai Zhou University of Kentucky Summary Empirical likelihood ratio method (Thomas and Grunkmier 975, Owen 988,

More information

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky Summary The empirical likelihood ratio method is a general nonparametric

More information

A note on the asymptotic distribution of Berk-Jones type statistics under the null hypothesis

A note on the asymptotic distribution of Berk-Jones type statistics under the null hypothesis A note on the asymptotic distribution of Berk-Jones type statistics under the null hypothesis Jon A. Wellner and Vladimir Koltchinskii Abstract. Proofs are given of the limiting null distributions of the

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

Renormings of c 0 and the minimal displacement problem

Renormings of c 0 and the minimal displacement problem doi: 0.55/umcsmath-205-0008 ANNALES UNIVERSITATIS MARIAE CURIE-SKŁODOWSKA LUBLIN POLONIA VOL. LXVIII, NO. 2, 204 SECTIO A 85 9 ŁUKASZ PIASECKI Renormings of c 0 and the minimal displacement problem Abstract.

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

Good Exact Confidence Sets for a Multivariate Normal Mean

Good Exact Confidence Sets for a Multivariate Normal Mean University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 997 Good Exact Confidence Sets for a Multivariate Normal Mean Yu-Ling Tseng Lawrence D. Brown University of Pennsylvania

More information

Empirical Likelihood in Survival Analysis

Empirical Likelihood in Survival Analysis Empirical Likelihood in Survival Analysis Gang Li 1, Runze Li 2, and Mai Zhou 3 1 Department of Biostatistics, University of California, Los Angeles, CA 90095 vli@ucla.edu 2 Department of Statistics, The

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

AN INEQUALITY FOR TAIL PROBABILITIES OF MARTINGALES WITH BOUNDED DIFFERENCES

AN INEQUALITY FOR TAIL PROBABILITIES OF MARTINGALES WITH BOUNDED DIFFERENCES Lithuanian Mathematical Journal, Vol. 4, No. 3, 00 AN INEQUALITY FOR TAIL PROBABILITIES OF MARTINGALES WITH BOUNDED DIFFERENCES V. Bentkus Vilnius Institute of Mathematics and Informatics, Akademijos 4,

More information

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data Journal of Multivariate Analysis 78, 6282 (2001) doi:10.1006jmva.2000.1939, available online at http:www.idealibrary.com on Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone

More information

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time ASYMPTOTIC PROPERTIES OF THE GMLE WITH CASE 2 INTERVAL-CENSORED DATA By Qiqing Yu a;1 Anton Schick a, Linxiong Li b;2 and George Y. C. Wong c;3 a Dept. of Mathematical Sciences, Binghamton University,

More information

Almost sure limit theorems for U-statistics

Almost sure limit theorems for U-statistics Almost sure limit theorems for U-statistics Hajo Holzmann, Susanne Koch and Alesey Min 3 Institut für Mathematische Stochasti Georg-August-Universität Göttingen Maschmühlenweg 8 0 37073 Göttingen Germany

More information

Negative Multinomial Model and Cancer. Incidence

Negative Multinomial Model and Cancer. Incidence Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence S. Lahiri & Sunil K. Dhar Department of Mathematical Sciences, CAMS New Jersey Institute of Technology, Newar,

More information

Large Sample Properties of Estimators in the Classical Linear Regression Model

Large Sample Properties of Estimators in the Classical Linear Regression Model Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Minimum distance tests and estimates based on ranks

Minimum distance tests and estimates based on ranks Minimum distance tests and estimates based on ranks Authors: Radim Navrátil Department of Mathematics and Statistics, Masaryk University Brno, Czech Republic (navratil@math.muni.cz) Abstract: It is well

More information

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests Biometrika (2014),,, pp. 1 13 C 2014 Biometrika Trust Printed in Great Britain Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests BY M. ZHOU Department of Statistics, University

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

Estimation of the extreme value index and high quantiles under random censoring

Estimation of the extreme value index and high quantiles under random censoring Estimation of the extreme value index and high quantiles under random censoring Jan Beirlant () & Emmanuel Delafosse (2) & Armelle Guillou (2) () Katholiee Universiteit Leuven, Department of Mathematics,

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

Pólya s Random Walk Theorem arxiv: v2 [math.pr] 18 Apr 2013

Pólya s Random Walk Theorem arxiv: v2 [math.pr] 18 Apr 2013 Pólya s Random Wal Theorem arxiv:131.3916v2 [math.pr] 18 Apr 213 Jonathan ova Abstract This note presents a proof of Pólya s random wal theorem using classical methods from special function theory and

More information

On nonlinear weighted least squares fitting of the three-parameter inverse Weibull distribution

On nonlinear weighted least squares fitting of the three-parameter inverse Weibull distribution MATHEMATICAL COMMUNICATIONS 13 Math. Commun., Vol. 15, No. 1, pp. 13-24 (2010) On nonlinear weighted least squares fitting of the three-parameter inverse Weibull distribution Dragan Juić 1 and Darija Marović

More information

Editorial Manager(tm) for Lifetime Data Analysis Manuscript Draft. Manuscript Number: Title: On An Exponential Bound for the Kaplan - Meier Estimator

Editorial Manager(tm) for Lifetime Data Analysis Manuscript Draft. Manuscript Number: Title: On An Exponential Bound for the Kaplan - Meier Estimator Editorial Managertm) for Lifetime Data Analysis Manuscript Draft Manuscript Number: Title: On An Exponential Bound for the Kaplan - Meier Estimator Article Type: SI-honor of N. Breslow invitation only)

More information

On a Nonparametric Notion of Residual and its Applications

On a Nonparametric Notion of Residual and its Applications On a Nonparametric Notion of Residual and its Applications Bodhisattva Sen and Gábor Székely arxiv:1409.3886v1 [stat.me] 12 Sep 2014 Columbia University and National Science Foundation September 16, 2014

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

The Central Limit Theorem Under Random Truncation

The Central Limit Theorem Under Random Truncation The Central Limit Theorem Under Random Truncation WINFRIED STUTE and JANE-LING WANG Mathematical Institute, University of Giessen, Arndtstr., D-3539 Giessen, Germany. winfried.stute@math.uni-giessen.de

More information

Chain Plot: A Tool for Exploiting Bivariate Temporal Structures

Chain Plot: A Tool for Exploiting Bivariate Temporal Structures Chain Plot: A Tool for Exploiting Bivariate Temporal Structures C.C. Taylor Dept. of Statistics, University of Leeds, Leeds LS2 9JT, UK A. Zempléni Dept. of Probability Theory & Statistics, Eötvös Loránd

More information

The Skorokhod reflection problem for functions with discontinuities (contractive case)

The Skorokhod reflection problem for functions with discontinuities (contractive case) The Skorokhod reflection problem for functions with discontinuities (contractive case) TAKIS KONSTANTOPOULOS Univ. of Texas at Austin Revised March 1999 Abstract Basic properties of the Skorokhod reflection

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1998 Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ Lawrence D. Brown University

More information

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES JEREMY J. BECNEL Abstract. We examine the main topologies wea, strong, and inductive placed on the dual of a countably-normed space

More information

Asymptotic Properties of Kaplan-Meier Estimator. for Censored Dependent Data. Zongwu Cai. Department of Mathematics

Asymptotic Properties of Kaplan-Meier Estimator. for Censored Dependent Data. Zongwu Cai. Department of Mathematics To appear in Statist. Probab. Letters, 997 Asymptotic Properties of Kaplan-Meier Estimator for Censored Dependent Data by Zongwu Cai Department of Mathematics Southwest Missouri State University Springeld,

More information

Density estimators for the convolution of discrete and continuous random variables

Density estimators for the convolution of discrete and continuous random variables Density estimators for the convolution of discrete and continuous random variables Ursula U Müller Texas A&M University Anton Schick Binghamton University Wolfgang Wefelmeyer Universität zu Köln Abstract

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

Approximate Self Consistency for Middle-Censored Data

Approximate Self Consistency for Middle-Censored Data Approximate Self Consistency for Middle-Censored Data by S. Rao Jammalamadaka Department of Statistics & Applied Probability, University of California, Santa Barbara, CA 93106, USA. and Srikanth K. Iyer

More information

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS SOME CONVERSE LIMIT THEOREMS OR EXCHANGEABLE BOOTSTRAPS Jon A. Wellner University of Washington The bootstrap Glivenko-Cantelli and bootstrap Donsker theorems of Giné and Zinn (990) contain both necessary

More information

A MODIFICATION OF THE HARTUNG KNAPP CONFIDENCE INTERVAL ON THE VARIANCE COMPONENT IN TWO VARIANCE COMPONENT MODELS

A MODIFICATION OF THE HARTUNG KNAPP CONFIDENCE INTERVAL ON THE VARIANCE COMPONENT IN TWO VARIANCE COMPONENT MODELS K Y B E R N E T I K A V O L U M E 4 3 ( 2 0 0 7, N U M B E R 4, P A G E S 4 7 1 4 8 0 A MODIFICATION OF THE HARTUNG KNAPP CONFIDENCE INTERVAL ON THE VARIANCE COMPONENT IN TWO VARIANCE COMPONENT MODELS

More information

Universally divergent Fourier series via Landau s extremal functions

Universally divergent Fourier series via Landau s extremal functions Comment.Math.Univ.Carolin. 56,2(2015) 159 168 159 Universally divergent Fourier series via Landau s extremal functions Gerd Herzog, Peer Chr. Kunstmann Abstract. We prove the existence of functions f A(D),

More information

A NOTE ON SPURIOUS BREAK

A NOTE ON SPURIOUS BREAK Econometric heory, 14, 1998, 663 669+ Printed in the United States of America+ A NOE ON SPURIOUS BREAK JUSHAN BAI Massachusetts Institute of echnology When the disturbances of a regression model follow

More information

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Chanseok Park 1 Department of Mathematical Sciences, Clemson University, Clemson, SC 29634 Short Title: Smooth

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

MAT 585: Johnson-Lindenstrauss, Group testing, and Compressed Sensing

MAT 585: Johnson-Lindenstrauss, Group testing, and Compressed Sensing MAT 585: Johnson-Lindenstrauss, Group testing, and Compressed Sensing Afonso S. Bandeira April 9, 2015 1 The Johnson-Lindenstrauss Lemma Suppose one has n points, X = {x 1,..., x n }, in R d with d very

More information

UPPER DEVIATIONS FOR SPLIT TIMES OF BRANCHING PROCESSES

UPPER DEVIATIONS FOR SPLIT TIMES OF BRANCHING PROCESSES Applied Probability Trust 7 May 22 UPPER DEVIATIONS FOR SPLIT TIMES OF BRANCHING PROCESSES HAMED AMINI, AND MARC LELARGE, ENS-INRIA Abstract Upper deviation results are obtained for the split time of a

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 1, Issue 1 2005 Article 3 Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee Jon A. Wellner

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

Censoring mechanisms

Censoring mechanisms Censoring mechanisms Patrick Breheny September 3 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Fixed vs. random censoring In the previous lecture, we derived the contribution to the likelihood

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

NONPARAMETRIC TESTING FOR MULTIPLE SURVIVAL FUNCTIONS WITH NON-INFERIORITY MARGINS

NONPARAMETRIC TESTING FOR MULTIPLE SURVIVAL FUNCTIONS WITH NON-INFERIORITY MARGINS Submitted to the Annals of Statistics NONPARAMETRIC TESTING FOR MULTIPLE SURVIVAL FUNCTIONS WITH NON-INFERIORITY MARGINS By Hsin-Wen Chang, Ian W. McKeague Academia Sinica and Columbia University New nonparametric

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississippi University, MS38677 K-sample location test, Koziol-Green

More information

Prime numbers and Gaussian random walks

Prime numbers and Gaussian random walks Prime numbers and Gaussian random walks K. Bruce Erickson Department of Mathematics University of Washington Seattle, WA 9895-4350 March 24, 205 Introduction Consider a symmetric aperiodic random walk

More information

Test Volume 11, Number 1. June 2002

Test Volume 11, Number 1. June 2002 Sociedad Española de Estadística e Investigación Operativa Test Volume 11, Number 1. June 2002 Optimal confidence sets for testing average bioequivalence Yu-Ling Tseng Department of Applied Math Dong Hwa

More information

Lectures 2 3 : Wigner s semicircle law

Lectures 2 3 : Wigner s semicircle law Fall 009 MATH 833 Random Matrices B. Való Lectures 3 : Wigner s semicircle law Notes prepared by: M. Koyama As we set up last wee, let M n = [X ij ] n i,j=1 be a symmetric n n matrix with Random entries

More information

COMPLEX ANALYSIS Spring 2014

COMPLEX ANALYSIS Spring 2014 COMPLEX ANALYSIS Spring 204 Cauchy and Runge Under the Same Roof. These notes can be used as an alternative to Section 5.5 of Chapter 2 in the textbook. They assume the theorem on winding numbers of the

More information

Notes 6 : First and second moment methods

Notes 6 : First and second moment methods Notes 6 : First and second moment methods Math 733-734: Theory of Probability Lecturer: Sebastien Roch References: [Roc, Sections 2.1-2.3]. Recall: THM 6.1 (Markov s inequality) Let X be a non-negative

More information

Bahadur representations for bootstrap quantiles 1

Bahadur representations for bootstrap quantiles 1 Bahadur representations for bootstrap quantiles 1 Yijun Zuo Department of Statistics and Probability, Michigan State University East Lansing, MI 48824, USA zuo@msu.edu 1 Research partially supported by

More information

19.1 Problem setup: Sparse linear regression

19.1 Problem setup: Sparse linear regression ECE598: Information-theoretic methods in high-dimensional statistics Spring 2016 Lecture 19: Minimax rates for sparse linear regression Lecturer: Yihong Wu Scribe: Subhadeep Paul, April 13/14, 2016 In

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

On the generalized maximum likelihood estimator of survival function under Koziol Green model

On the generalized maximum likelihood estimator of survival function under Koziol Green model On the generalized maximum likelihood estimator of survival function under Koziol Green model By: Haimeng Zhang, M. Bhaskara Rao, Rupa C. Mitra Zhang, H., Rao, M.B., and Mitra, R.C. (2006). On the generalized

More information

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9 MAT 570 REAL ANALYSIS LECTURE NOTES PROFESSOR: JOHN QUIGG SEMESTER: FALL 204 Contents. Sets 2 2. Functions 5 3. Countability 7 4. Axiom of choice 8 5. Equivalence relations 9 6. Real numbers 9 7. Extended

More information

Pólya s Random Walk Theorem

Pólya s Random Walk Theorem Pólya s Random Walk Theorem Jonathan ovak Abstract. This note presents a proof of Pólya s random walk theorem using classical methods from special function theory and asymptotic analysis. 1. ITRODUCTIO.

More information

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen The Annals of Statistics 21, Vol. 29, No. 5, 1344 136 A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1 By Thomas H. Scheike University of Copenhagen We present a non-parametric survival model

More information

Closest Moment Estimation under General Conditions

Closest Moment Estimation under General Conditions Closest Moment Estimation under General Conditions Chirok Han and Robert de Jong January 28, 2002 Abstract This paper considers Closest Moment (CM) estimation with a general distance function, and avoids

More information

Submitted to the Brazilian Journal of Probability and Statistics

Submitted to the Brazilian Journal of Probability and Statistics Submitted to the Brazilian Journal of Probability and Statistics Multivariate normal approximation of the maximum likelihood estimator via the delta method Andreas Anastasiou a and Robert E. Gaunt b a

More information

ASYMPTOTIC EQUIVALENCE OF DENSITY ESTIMATION AND GAUSSIAN WHITE NOISE. By Michael Nussbaum Weierstrass Institute, Berlin

ASYMPTOTIC EQUIVALENCE OF DENSITY ESTIMATION AND GAUSSIAN WHITE NOISE. By Michael Nussbaum Weierstrass Institute, Berlin The Annals of Statistics 1996, Vol. 4, No. 6, 399 430 ASYMPTOTIC EQUIVALENCE OF DENSITY ESTIMATION AND GAUSSIAN WHITE NOISE By Michael Nussbaum Weierstrass Institute, Berlin Signal recovery in Gaussian

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Method of Conditional Moments Based on Incomplete Data

Method of Conditional Moments Based on Incomplete Data , ISSN 0974-570X (Online, ISSN 0974-5718 (Print, Vol. 20; Issue No. 3; Year 2013, Copyright 2013 by CESER Publications Method of Conditional Moments Based on Incomplete Data Yan Lu 1 and Naisheng Wang

More information

ASSESSING A VECTOR PARAMETER

ASSESSING A VECTOR PARAMETER SUMMARY ASSESSING A VECTOR PARAMETER By D.A.S. Fraser and N. Reid Department of Statistics, University of Toronto St. George Street, Toronto, Canada M5S 3G3 dfraser@utstat.toronto.edu Some key words. Ancillary;

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Point and Interval Estimation for Gaussian Distribution, Based on Progressively Type-II Censored Samples

Point and Interval Estimation for Gaussian Distribution, Based on Progressively Type-II Censored Samples 90 IEEE TRANSACTIONS ON RELIABILITY, VOL. 52, NO. 1, MARCH 2003 Point and Interval Estimation for Gaussian Distribution, Based on Progressively Type-II Censored Samples N. Balakrishnan, N. Kannan, C. T.

More information

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases Communications in Statistics Simulation and Computation, 34: 43 5, 005 Copyright Taylor & Francis, Inc. ISSN: 0361-0918 print/153-4141 online DOI: 10.1081/SAC-00055639 Distribution Theory Comparison Between

More information

AFT Models and Empirical Likelihood

AFT Models and Empirical Likelihood AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS

KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS Bull. Korean Math. Soc. 5 (24), No. 3, pp. 7 76 http://dx.doi.org/34/bkms.24.5.3.7 KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS Yicheng Hong and Sungchul Lee Abstract. The limiting

More information

1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models

1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models Draft: February 17, 1998 1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models P.J. Bickel 1 and Y. Ritov 2 1.1 Introduction Le Cam and Yang (1988) addressed broadly the following

More information

Bayesian vs frequentist techniques for the analysis of binary outcome data

Bayesian vs frequentist techniques for the analysis of binary outcome data 1 Bayesian vs frequentist techniques for the analysis of binary outcome data By M. Stapleton Abstract We compare Bayesian and frequentist techniques for analysing binary outcome data. Such data are commonly

More information

On a Class of Multidimensional Optimal Transportation Problems

On a Class of Multidimensional Optimal Transportation Problems Journal of Convex Analysis Volume 10 (2003), No. 2, 517 529 On a Class of Multidimensional Optimal Transportation Problems G. Carlier Université Bordeaux 1, MAB, UMR CNRS 5466, France and Université Bordeaux

More information

APPROXIMATING CONTINUOUS FUNCTIONS: WEIERSTRASS, BERNSTEIN, AND RUNGE

APPROXIMATING CONTINUOUS FUNCTIONS: WEIERSTRASS, BERNSTEIN, AND RUNGE APPROXIMATING CONTINUOUS FUNCTIONS: WEIERSTRASS, BERNSTEIN, AND RUNGE WILLIE WAI-YEUNG WONG. Introduction This set of notes is meant to describe some aspects of polynomial approximations to continuous

More information

EXPLICIT NONPARAMETRIC CONFIDENCE INTERVALS FOR THE VARIANCE WITH GUARANTEED COVERAGE

EXPLICIT NONPARAMETRIC CONFIDENCE INTERVALS FOR THE VARIANCE WITH GUARANTEED COVERAGE EXPLICIT NONPARAMETRIC CONFIDENCE INTERVALS FOR THE VARIANCE WITH GUARANTEED COVERAGE Joseph P. Romano Department of Statistics Stanford University Stanford, California 94305 romano@stat.stanford.edu Michael

More information

Regular finite Markov chains with interval probabilities

Regular finite Markov chains with interval probabilities 5th International Symposium on Imprecise Probability: Theories and Applications, Prague, Czech Republic, 2007 Regular finite Markov chains with interval probabilities Damjan Škulj Faculty of Social Sciences

More information

arxiv:submit/ [math.st] 6 May 2011

arxiv:submit/ [math.st] 6 May 2011 A Continuous Mapping Theorem for the Smallest Argmax Functional arxiv:submit/0243372 [math.st] 6 May 2011 Emilio Seijo and Bodhisattva Sen Columbia University Abstract This paper introduces a version of

More information

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters Communications for Statistical Applications and Methods 2017, Vol. 24, No. 5, 519 531 https://doi.org/10.5351/csam.2017.24.5.519 Print ISSN 2287-7843 / Online ISSN 2383-4757 Goodness-of-fit tests for randomly

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach By Shiqing Ling Department of Mathematics Hong Kong University of Science and Technology Let {y t : t = 0, ±1, ±2,

More information