Accepted for publication in: Comm. Statist. Theory and Methods April 14, 2015 ASYMPTOTICS OF GOODNESS-OF-FIT TESTS BASED ON MINIMUM P-VALUE STATISTICS


Veronika Gontscharuk and Helmut Finner

Institute for Biometrics and Epidemiology, German Diabetes Center, Leibniz Institute for Diabetes Research at Heinrich-Heine-University Düsseldorf, Auf'm Hennekamp 65, D Düsseldorf, Germany

veronika.gontscharuk@ddz.uni-duesseldorf.de

Key Words: equal local levels; goodness-of-fit; higher criticism; Kolmogorov-Smirnov test; minimum level attained; order statistics.

ABSTRACT

This paper provides some new results on the asymptotics of goodness-of-fit (GOF) tests based on minimum p-value statistics. In connection with detectability of sparse signals in high-dimensional data, various tests were proposed and investigated during the last decade, especially with respect to asymptotic properties. Minimum p-value GOF statistics were already investigated as minimum level attained statistics by Berk and Jones with respect to Bahadur efficiency. The distribution of minimum p-value GOF statistics is closely related to the distribution of higher criticism statistics, the distribution of the supremum of a normalized Brownian bridge and the supremum of an Ornstein-Uhlenbeck process.

1. INTRODUCTION

This paper is concerned with the asymptotics of goodness-of-fit (GOF) tests based on so-called minimum level attained statistics studied in Berk and Jones (1978) and Berk and

Jones (1979). The term level attained is a synonym for p-value. Nowadays a minimum level attained statistic is often referred to as a min-p or minp statistic. Given p-values $p_i$, $i \in I$, the minp statistic is defined by $\min_{i \in I} p_i$. The minp statistic is closely related to the union-intersection principle and has been well known in multiple hypothesis testing at least since the nineteen thirties. We refer to the minimum level attained GOF tests of Berk and Jones as minp GOF or minp tests. Recently, these tests have gained new interest and were rediscovered and studied under different names and representations by various authors, e.g., as a special case of calibration for simultaneity in Buja and Rolke (2006), tests based on the new tail-sensitive simultaneous confidence bands in Aldor-Noiman et al. (2013), GOF tests with equal local levels in Gontscharuk et al. (2014), new higher criticism (HC) tests in Gontscharuk et al. (2015) and a non-asymptotic standardization of binomial counts in Mary and Ferrari (2014).

To set up notation, let $X_1,\ldots,X_n$, $n \in \mathbb{N}$, be real-valued independent identically distributed (iid) random variables with continuous cumulative distribution function (cdf) $F$. For a continuous cdf $F_0$ we consider the testing problems

$$H_0^+ : F(x) \le F_0(x)\ \ \forall x \in \mathbb{R} \quad \text{vs.} \quad H_1^+ : F(x) > F_0(x) \text{ for some } x \in \mathbb{R}$$

and

$$H_0 : F(x) = F_0(x)\ \ \forall x \in \mathbb{R} \quad \text{vs.} \quad H_1 : F(x) \ne F_0(x) \text{ for some } x \in \mathbb{R}.$$

Since $F_0(X_i)$, $i = 1,\ldots,n$, are iid uniformly distributed on $[0,1]$ if $H_0$ is true, we restrict attention to the case where $F_0(x) = x$, $x \in [0,1]$, and $X_i$, $i = 1,\ldots,n$, are iid with values in $[0,1]$. Let $X_{1:n},\ldots,X_{n:n}$ be the order statistics of $X_1,\ldots,X_n$ and let $F_{i,n}$ denote the cdf of the beta distribution with parameters $i$ and $n-i+1$, i.e., the cdf of $X_{i:n}$ under $H_0$. The one- and two-sided versions of the minp statistic are given by

$$M_n^+ = \min_{1 \le i \le n} F_{i,n}(X_{i:n}), \qquad M_n = 2 \min_{1 \le i \le n} \min\{F_{i,n}(X_{i:n}),\, 1 - F_{i,n}(X_{i:n})\}.$$
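As an illustration of the two definitions, the following Python sketch evaluates $M_n^+$ and $M_n$ for a sample that is assumed to have been transformed to $[0,1]$ via $F_0$ already. The function names `beta_cdf_int` and `minp_statistics` are hypothetical and not part of the paper. The beta cdf with integer parameters $i$ and $n-i+1$ is computed through the standard identity $F_{i,n}(x) = P(\mathrm{Bin}(n,x) \ge i)$, which is practical only for small to moderate $n$.

```python
from math import comb


def beta_cdf_int(i, n, x):
    """cdf of Beta(i, n - i + 1) at x, i.e. P(X_{i:n} <= x) under H0.

    Uses the identity P(X_{i:n} <= x) = P(Bin(n, x) >= i).
    Illustrative only: the plain binomial sum is not meant for large n.
    """
    return sum(comb(n, k) * x**k * (1.0 - x) ** (n - k) for k in range(i, n + 1))


def minp_statistics(sample):
    """Return (M_n^+, M_n) for a sample with values in [0, 1]."""
    xs = sorted(sample)
    n = len(xs)
    # Local p-values F_{i,n}(X_{i:n}), i = 1, ..., n.
    local = [beta_cdf_int(i, n, x) for i, x in enumerate(xs, start=1)]
    m_plus = min(local)
    m_two_sided = 2.0 * min(min(p, 1.0 - p) for p in local)
    return m_plus, m_two_sided
```

For the toy sample $(0.1, 0.2)$ the local p-values are $F_{1,2}(0.1) = 0.19$ and $F_{2,2}(0.2) = 0.04$, so that $M_2^+ = 0.04$ and $M_2 = 0.08$.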

Thereby, $F_{i,n}(X_{i:n})$ and $\min\{F_{i,n}(X_{i:n}),\, 1 - F_{i,n}(X_{i:n})\}$ can be viewed as local p-values based on $X_{i:n}$. Since a minp statistic tends to smaller values under alternatives than under the null hypothesis, the related minp GOF test rejects $H_0^+$ and/or $H_0$ if $M_n^+$ and/or $M_n$, respectively, are not larger than the minp critical value $d_n$ (say). For a given $\alpha \in (0,1)$ define a critical value $\alpha_n^{loc} \equiv \alpha_n^{loc}(\alpha)$ such that the related one- or two-sided minp test is an exact level $\alpha$ test, i.e.,

$$P(M_n^+ \le \alpha_n^{loc} \mid H_0) = \alpha \quad \text{or} \quad P(M_n \le \alpha_n^{loc} \mid H_0) = \alpha, \tag{1}$$

respectively. Clearly, $\alpha/n < \alpha_n^{loc} < \alpha$ for $n \ge 2$. A characterization of the asymptotic behavior of the minp critical values was mentioned in Gontscharuk et al. (2014) and Gontscharuk et al. (2015) without proof. More precisely,

$$\lim_{n\to\infty} P(M_n^+ \le d_n \mid H_0) = \alpha \quad \text{and/or} \quad \lim_{n\to\infty} P(M_n \le d_n \mid H_0) = \alpha \tag{2}$$

if and only if

$$\lim_{n\to\infty} d_n\, \frac{2\log(\log(n))\log(n)}{-\log(1-\alpha)} = 1.$$

Among others, we provide a proof for this result.

The paper is organized as follows. In Section 2 we show a strong connection between minp and higher criticism (HC) tests and summarize some important results related to HC statistics. In Section 3 we provide three different asymptotic critical values leading to asymptotic level $\alpha$ minp GOF tests. We also present the asymptotic minp distribution under the null hypothesis and show that the HC statistic and a z-transformed version of the minp statistic coincide asymptotically in distribution. Section 4 addresses the applicability of the asymptotic results to the finite case. Proofs are deferred to an Appendix.

2. LOCAL LEVELS OF HIGHER CRITICISM AND MINP TESTS

A thorough analysis of the asymptotic behavior of HC related quantities is a key step to obtain results for the asymptotics of the minp test. We first provide some known results concerning HC tests. The HC tests considered here can be seen as normalized versions of the

well-known Kolmogorov-Smirnov (KS) tests, e.g., cf. Eicker (1979) and Jaeschke (1979) for the asymptotics of the normalized KS statistic, Donoho and Jin (2004), Hall and Jin (2008) and Donoho and Jin (2009) for the HC concept, Jager and Wellner (2007), Gontscharuk et al. (2014) and Chapter 16 in Shorack and Wellner (2009) for the relationship to the normalized Brownian bridge and Ornstein-Uhlenbeck process. The one- and two-sided HC statistics considered here are

$$HC_n^+ = \max_{1 \le i \le n} \sqrt{n}\, \frac{i/n - X_{i:n}}{\sqrt{X_{i:n}(1 - X_{i:n})}}, \tag{3}$$

$$HC_n = \max_{1 \le i \le n} \max\left\{ \sqrt{n}\, \frac{i/n - X_{i:n}}{\sqrt{X_{i:n}(1 - X_{i:n})}},\ \sqrt{n}\, \frac{X_{i:n} - (i-1)/n}{\sqrt{X_{i:n}(1 - X_{i:n})}} \right\}. \tag{4}$$

The limiting null distributions are characterized by

$$\lim_{n\to\infty} P(HC_n^+ < b_n(t) \mid H_0) = \exp(-\exp(-t)), \tag{5}$$

$$\lim_{n\to\infty} P(HC_n < b_n(t) \mid H_0) = \exp(-2\exp(-t)), \tag{6}$$

where

$$b_n(t) = \sqrt{2\log_2(n)} + \frac{\log_3(n) - \log(\pi) + 2t}{2\sqrt{2\log_2(n)}} \tag{7}$$

with $\log_2(n) = \log(\log(n))$ and $\log_3(n) = \log_2(\log(n))$, cf. Eicker (1979) and Jaeschke (1979). We consider HC tests based on the critical value $b_n(t)$, i.e., a one-sided HC test rejects the null hypothesis $H_0^+$ if $HC_n^+ \ge b_n(t)$ while a two-sided HC test rejects $H_0$ if $HC_n \ge b_n(t)$. Replacing $t$ in $b_n(t)$ in (5) and (6) by

$$t_\alpha^+ = -\log(-\log(1-\alpha)) \quad \text{and} \quad t_\alpha = -\log(-\log(1-\alpha)/2) \tag{8}$$

leads to asymptotic level $\alpha$ one- and two-sided HC tests, respectively.

Instead of considering union-intersection related GOF tests in terms of test statistics, many of them can be rewritten in terms of so-called local levels, cf. Gontscharuk et al. (2014) for this concept. In our case, the $i$th local level of a GOF test based on all order statistics is defined as the probability that the $i$th local test statistic based on the order statistic $X_{i:n}$ exceeds its critical value. For example, local levels related to the one-sided

asymptotic level $\alpha$ HC test are given by

$$\alpha_{i,n}^{HC} \equiv \alpha_{i,n}^{HC}(\alpha) = P\left( \sqrt{n}\, \frac{i/n - X_{i:n}}{\sqrt{X_{i:n}(1 - X_{i:n})}} \ge b_n(t_\alpha^+) \,\Big|\, H_0 \right). \tag{9}$$

Local levels related to the one-sided minp test with critical value $d_n \in (0,1)$ (say) are given by

$$\alpha_{i,n}^{minp} \equiv d_n = P(F_{i,n}(X_{i:n}) \le d_n \mid H_0), \tag{10}$$

i.e., all minp local levels are equal to the underlying critical value $d_n$. By construction, all minp local levels are equal in the two-sided case, too. Moreover, as shown in Gontscharuk et al. (2014), almost all local levels of the one- and two-sided HC tests are asymptotically equal in the sense that the ratio of two local levels tends to 1 for $n \to \infty$. The remaining local levels do not contribute to the asymptotic HC global level. Therefore, there is some evidence that minp and HC tests coincide asymptotically in some sense. The following lemma summarizes HC properties which are useful in order to get asymptotic results related to the minp test in Section 3.

Lemma 1. (i) The local levels of the one-sided HC asymptotic level $\alpha$ test with critical value $b_n(t_\alpha^+)$ (see (7) and (8)) satisfy

$$\alpha_{i,n}^{HC}(\alpha) = \frac{-\log(1-\alpha)}{2\log_2(n)\log(n)} \left[ 1 + O\left( \frac{\log_3(n)}{\log_2(n)} \right) \right]$$

for $\log(n) \le i \le n - \log(n)$.

(ii) The local levels $\alpha_{i,n}$ (say) of two-sided HC tests with critical value $b_n(t_\alpha^+) > 1$ from the one-sided test in (i) satisfy $\alpha_{i,n} = \alpha_{i,n}^{HC}(\alpha) + \alpha_{n-i+1,n}^{HC}(\alpha)$, $i = 1,\ldots,n$.

(iii) A restricted version of the HC statistic, where the maximum in (3) or (4) is taken over $i \in [\log(n),\, n - \log(n)]$ only, has the same asymptotic distribution as the corresponding original HC statistic, i.e., (5) and (6) are also fulfilled if $HC_n^+$ and $HC_n$ are replaced by the corresponding restricted versions.
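The HC quantities introduced in (3), (4), (7) and (8), together with the leading term of Lemma 1 (i), can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: all function names are hypothetical, the sample is assumed to lie strictly inside $(0,1)$, and $n \ge 3$ is assumed so that $\log_2(n) > 0$.

```python
from math import log, pi, sqrt


def hc_statistics(sample):
    """Return (HC_n^+, HC_n) as in (3) and (4) for a sample in (0, 1)."""
    xs = sorted(sample)
    n = len(xs)
    hc_plus = float("-inf")
    hc_two_sided = float("-inf")
    for i, x in enumerate(xs, start=1):
        scale = sqrt(n) / sqrt(x * (1.0 - x))
        upper = scale * (i / n - x)        # term appearing in (3)
        lower = scale * (x - (i - 1) / n)  # additional term in (4)
        hc_plus = max(hc_plus, upper)
        hc_two_sided = max(hc_two_sided, upper, lower)
    return hc_plus, hc_two_sided


def b_n(n, t):
    """Critical value b_n(t) from (7); requires n >= 3."""
    log2n = log(log(n))
    log3n = log(log2n)
    root = sqrt(2.0 * log2n)
    return root + (log3n - log(pi) + 2.0 * t) / (2.0 * root)


def t_alpha(alpha, two_sided=False):
    """t_alpha^+ (one-sided) or t_alpha (two-sided) from (8)."""
    c = -log(1.0 - alpha)
    return -log(c / 2.0) if two_sided else -log(c)


def hc_local_level_leading_term(n, alpha):
    """Leading term of the HC local levels in Lemma 1 (i)."""
    return -log(1.0 - alpha) / (2.0 * log(log(n)) * log(n))
```

For $n = 10^4$ and $\alpha = 0.05$ the one-sided critical value $b_n(t_\alpha^+)$ is roughly 3.43 and the leading term of the local levels is roughly $1.25 \cdot 10^{-3}$.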

3. ASYMPTOTICS OF MINP GOF TESTS

In this section we present several results for one- and two-sided minp tests. For convenience, we sometimes refer to both as minp tests. The following theorem provides a rate of convergence for the critical values related to the asymptotic level $\alpha$ minp tests and hence the asymptotic null distribution of the minp statistics.

Theorem 1. The minp test with critical value $d_n$ is an asymptotic level $\alpha$ test if and only if

$$\lim_{n\to\infty} d_n/\alpha_n^* = 1 \quad \text{with} \quad \alpha_n^* \equiv \alpha_n^*(\alpha) = \frac{-\log(1-\alpha)}{2\log_2(n)\log(n)}. \tag{11}$$

The asymptotic null distribution of the minp statistics is given by

$$\lim_{n\to\infty} P(M_n^+ \le \alpha_n^*(t) \mid H_0) = \lim_{n\to\infty} P(M_n \le \alpha_n^*(t) \mid H_0) = t, \quad t \in (0,1).$$

Some important conclusions are given in the following remarks.

Remark 1. Theorem 1 shows that a minp critical value $d_n$, which leads to an asymptotic level $\alpha$ one-sided minp test, also leads to an asymptotic level $\alpha$ two-sided minp test. However, for finite $n$ the levels of one- and two-sided tests based on the same critical value $d_n$ may differ to some extent. For example, $d_n = \ldots, \ldots, \ldots$ leads to $P(M_n^+ \le d_n \mid H_0) \approx 0.05$ and $P(M_n \le d_n \mid H_0) \approx \ldots$ for $n = 100, 500, 1000$. It looks like the two-sided minp critical values $\alpha_n^{loc}$ defined in (1) should be somewhat smaller than the one-sided counterparts.

Remark 2. Theorem 1 implies that the critical values $d_n$ related to asymptotic level $\alpha$ minp tests converge to 0 as the sample size increases. Note that all local levels of the HC test also converge to 0, cf. Theorem 5.2 in Gontscharuk et al. (2014). Compared to the Bonferroni adjustment $\alpha/n$, the local levels of the minp GOF test converge to 0 very slowly.

Remark 3. Considering the proof of Theorem 1, it can be easily seen that the sensitivity range of the asymptotic level $\alpha$ minp test, i.e., the index set of order statistics that contribute to the asymptotic level $\alpha$ under the null hypothesis $H_0$, coincides with the sensitivity range of

the HC test, cf. Eicker (1979), Jaeschke (1979) and Gontscharuk et al. (2015) for sensitivity ranges of HC tests.

Let $\Phi$ be the cdf of the standard normal distribution and $\Phi^{-1}$ be the inverse of $\Phi$. Note that $Z_i = \Phi^{-1}(1 - F_{i,n}(U_{i:n})) \sim N(0,1)$, $i = 1,\ldots,n$. Herewith, the z-transformed minp statistics are maxima of (dependent) normals/absolute normals, that is,

$$\Phi^{-1}(1 - M_n^+) = \max_{1 \le i \le n} Z_i \quad \text{and} \quad \Phi^{-1}(1 - M_n/2) = \max_{1 \le i \le n} |Z_i|.$$

The z-transformed minp statistics have some advantage compared to the HC statistics defined in (3) and (4), cf. Gontscharuk et al. (2015). For example, most of the global $\alpha$ level of the HC tests is consumed by a very small number of the most extreme order statistics, even for extremely large $n$, although they are asymptotically negligible. Loosely speaking, even for extremely large $n$ there is a great gap between finite and asymptotic HC behavior. The next theorem provides the relationship between the asymptotic distributions of the HC and z-transformed minp statistics.

Theorem 2. For the HC critical values $b_n(t)$ we get

$$\lim_{n\to\infty} P(\Phi^{-1}(1 - M_n^+) < b_n(t) \mid H_0) = \exp(-\exp(-t))$$

and

$$\lim_{n\to\infty} P(\Phi^{-1}(1 - M_n/2) < b_n(t) \mid H_0) = \exp(-2\exp(-t)),$$

i.e., the asymptotic distribution of the z-transformed minp statistics $\Phi^{-1}(1 - M_n^+)$ and $\Phi^{-1}(1 - M_n/2)$ is the same as the asymptotic distribution of $HC_n^+$ and $HC_n$, respectively.

Remark 4. Theorem 2 yields two competing choices of the minp critical values $\alpha_n'(\alpha)$ and $\alpha_n''(\alpha)$ (say), i.e.,

$$\alpha_n' \equiv \alpha_n'(\alpha) \equiv 1 - \Phi(b_n(t_\alpha^+)) \quad \text{and} \quad \alpha_n'' \equiv \alpha_n''(\alpha) \equiv 2(1 - \Phi(b_n(t_\alpha))), \tag{12}$$

which both lead to asymptotic level $\alpha$ minp tests. This is because of $\lim_{n\to\infty} \alpha_n'/\alpha_n^* = \lim_{n\to\infty} \alpha_n''/\alpha_n^* = 1$, cf. the proof of Theorem 2.
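To make the three asymptotic critical values from (11) and (12) and the z-transformation concrete, here is a short Python sketch based on the standard library only. It is meant as an illustration under stated assumptions: the names `alpha_star`, `alpha_prime`, `alpha_double_prime` and `z_transform` are hypothetical labels for $\alpha_n^*$, $\alpha_n'$, $\alpha_n''$ and $\Phi^{-1}(1 - M_n^+)$, and $n \ge 3$ is assumed.

```python
from math import log, pi, sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution, cdf Phi


def b_n(n, t):
    """HC critical value b_n(t) from (7); requires n >= 3."""
    log2n = log(log(n))
    root = sqrt(2.0 * log2n)
    return root + (log(log2n) - log(pi) + 2.0 * t) / (2.0 * root)


def alpha_star(n, alpha):
    """Asymptotic minp critical value from (11)."""
    return -log(1.0 - alpha) / (2.0 * log(log(n)) * log(n))


def alpha_prime(n, alpha):
    """1 - Phi(b_n(t_alpha^+)) from (12), computed as a lower tail."""
    t_plus = -log(-log(1.0 - alpha))
    return STD_NORMAL.cdf(-b_n(n, t_plus))


def alpha_double_prime(n, alpha):
    """2 * (1 - Phi(b_n(t_alpha))) from (12)."""
    t_two = -log(-log(1.0 - alpha) / 2.0)
    return 2.0 * STD_NORMAL.cdf(-b_n(n, t_two))


def z_transform(m_plus):
    """z-transformed one-sided minp statistic Phi^{-1}(1 - M_n^+), 0 < M_n^+ < 1."""
    return STD_NORMAL.inv_cdf(1.0 - m_plus)
```

For example, at $n = 10^4$ and $\alpha = 0.05$ this gives roughly $\alpha_n^* \approx 1.25 \cdot 10^{-3}$, $\alpha_n' \approx 3.0 \cdot 10^{-4}$ and $\alpha_n'' \approx 1.7 \cdot 10^{-4}$, so the three asymptotically equivalent choices still differ by a large factor at this sample size.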

Table 1: Local levels $\alpha_n^{loc}$ fulfilling (1) for various $n$- and $\alpha$-values such that the corresponding one- or two-sided minp test is an exact level $\alpha$ test. Columns: one-sided and two-sided values for each of $\alpha = 0.01$, $0.05$, $0.1$; one row per sample size $n$. [The numerical entries are missing from this transcription.]

4. FINITE SAMPLES AND MINP GOF TESTS

We briefly investigate the applicability of asymptotic minp critical values in the finite setting. The critical value $\alpha_n^{loc}$ related to the exact level $\alpha$ minp test, i.e., $\alpha_n^{loc}$ fulfilling (1), can be calculated numerically, e.g., via some search algorithm. Note that the probabilities in (1) can be expressed through probabilities of the form $P(X_{i:n} < c_i,\ i = 1,\ldots,n \mid H_0)$ for some $c_1 \le \cdots \le c_n$ in the one-sided case and $P(a_i < X_{i:n} < c_i,\ i = 1,\ldots,n \mid H_0)$ for some $a_1 \le \cdots \le a_n$ with $a_i < c_i$, $i = 1,\ldots,n$, in the two-sided case. Below, probabilities of the first type are calculated via Bolshev's recursion, cf. Shorack and Wellner (2009), and probabilities of the second type are calculated via a recursive procedure proposed in Khmaladze and Shinjikashvili (2001). For example, $\alpha_n^{loc}$-values for $n = 10, 10^2, 10^3, 10^4$ and $\alpha = 0.01, 0.05, 0.1$ are provided in Table 1.

Now we compare critical values leading to asymptotic level $\alpha$ minp tests, i.e., $\alpha_n^*$ defined in (11), $\alpha_n'$ and $\alpha_n''$ defined in (12), with a finite counterpart $\alpha_n^{loc}$ defined by (1). The left graph in Figure 1 shows $\alpha_n^*$, $\alpha_n'$, $\alpha_n''$ and $\alpha_n^{loc}$ for $\alpha = 0.05$ and $n = 50,\ldots,10^4$, and the right graph in Figure 1 shows related probabilities to reject the true null hypothesis $H_0$, i.e., probabilities given in (1). Similar pictures can be observed for other values of $\alpha$, e.g., $\alpha = 0.01, 0.1$. It looks like the probability to reject the true null hypothesis $H_0$ by a minp test based on an asymptotic critical value is larger in the two-sided case than in the one-sided case. Moreover, minp tests based on the asymptotic critical value $\alpha_n^*$ exceed the $\alpha$ level at least for $n \le 10^4$, while minp tests based on $\alpha_n'$ or $\alpha_n''$ are conservative and minp tests based on $\alpha_n''$ are most conservative. Surprisingly, although the minp test based on the critical value $\alpha_n^*$ is an asymptotic level $\alpha$ test, cf. Theorem 1, the finite global level, i.e., the related probability in (1), is considerably larger than $\alpha = 0.05$ and even increases in $n \in \{50,\ldots,10^4\}$, cf. the right graph in Figure 1.

Figure 1: Left graph: asymptotic minp critical values $\alpha_n^*$ (dotted curve), $\alpha_n'$ (dash-dotted curve) and $\alpha_n''$ (dashed curve) together with $\alpha_n^{loc}$ leading to finite level $\alpha$ one-sided (upper solid curve) and two-sided (lower solid curve) minp tests for $n = 50,\ldots,10^4$ and $\alpha = 0.05$. Right graph: probabilities to reject the true $H_0$ by the corresponding minp tests, i.e., $P(M_n^+ \le d_n \mid H_0)$ (lower curves) and $P(M_n \le d_n \mid H_0)$ (upper curves) for $d_n = \alpha_n^*, \alpha_n', \alpha_n''$ (dotted curves, dash-dotted curves and dashed curves, respectively).

Finally, we focus on z-transformed minp statistics, which have the same asymptotic distribution as related HC tests, cf. Theorem 2. Since for any $x \in \mathbb{R}$ we get $\Phi^{-1}(1 - M_n^+) \ge x$ and $\Phi^{-1}(1 - M_n/2) \ge x$ iff $M_n^+ \le 1 - \Phi(x)$ and $M_n \le 2(1 - \Phi(x))$, respectively, the right graph in Figure 1 also provides probabilities that z-transformed minp statistics exceed the

corresponding (asymptotic) HC critical values, that is, $P(\Phi^{-1}(1 - M_n^+) \ge b_n(t_\alpha^+) \mid H_0)$ and $P(\Phi^{-1}(1 - M_n/2) \ge b_n(t_\alpha) \mid H_0)$ are given by the lower dash-dotted and upper dashed curves, respectively. We observe that z-transformed minp GOF tests based on the corresponding asymptotic HC critical values are level $\alpha$ tests but too conservative in the finite setting. Contrary to this behavior, asymptotic level $\alpha$ HC tests, i.e., GOF tests based on statistics $HC_n^+$, $HC_n$ and critical values $b_n(t_\alpha^+)$, $b_n(t_\alpha)$, respectively, exceed the $\alpha$ level drastically for a finite sample size, e.g., cf. Jaeschke (1979), Khmaladze and Shinjikashvili (2001) and Gontscharuk et al. (2015). For example, Figure 2 shows the (simulated) cdfs of the two-sided HC statistic $HC_n$ and the two-sided z-transformed minp statistic $\Phi^{-1}(1 - M_n/2)$ together with the asymptotic (Gumbel-related) cdf $F_n(y)$ (say) for $n = 10^4, 10^6$. Thereby, the asymptotic cdf $F_n(y)$ is defined by (6) so that $F_n(y) = \exp(-2\exp(-b_n^{-1}(y)))$, where $b_n^{-1}(y)$ is the inverse function of $b_n(t)$ defined in (7).

Figure 2: The cdf of $\Phi^{-1}(1 - M_n/2)$ (dashed curves) and the cdf of $HC_n$ (dotted curves) simulated by $10^5$ repetitions together with the corresponding asymptotic cdf $F_n(y)$ (solid curves) for $n = 10^4$ (left graph) and $n = 10^6$ (right graph).

The finite distribution of z-transformed two-sided minp statistics seems to be closer to the asymptotic Gumbel-related distribution

than the finite two-sided HC distribution. In the one-sided case we get a similar picture, cf. Figure 7 in Gontscharuk et al. (2015).

We note that the minp critical values $\alpha_n^{loc}$ fulfilling (1) can be calculated exactly at least for $n \le [\ldots]$. The asymptotic critical values $\alpha_n'$ defined in (12) always lead to finite level $\alpha$ minp tests in Figure 1. Although these tests seem to be too conservative, one may prefer minp GOF tests based on $\alpha_n'$ for $n > [\ldots]$. In order to get minp critical values $d_n$ leading to an asymptotic level $\alpha$ test with improved finite behavior, we have to keep in mind that Theorem 1 requires $d_n/\alpha_n^* \to 1$ as $n \to \infty$. Motivated by (i) in Lemma 1, we can try the minp critical values defined by

$$d_n(\alpha) = \frac{-\log(1-\alpha)}{2\log_2(n)\log(n)} \left[ 1 - c_\alpha \frac{\log_3(n)}{\log_2(n)} \right], \tag{13}$$

where $c_\alpha \in \mathbb{R}$ is a suitable constant. Note that, in fact, $d_n(\alpha)/\alpha_n^* \to 1$ as $n \to \infty$. Figure 3 shows $d_n(\alpha)$ based on $c_\alpha = 1.6, 1.3, 1.1$ for $\alpha = 0.01, 0.05, 0.1$, respectively, together with the corresponding minp two-sided critical values $\alpha_n^{loc}$ fulfilling (1) for $n = 10^3,\ldots,10^4$.

Figure 3: Critical values $d_n(\alpha)$ defined in (13) (diamonds) with $c_\alpha = 1.1, 1.3, 1.6$ (from top to bottom) and two-sided critical values $\alpha_n^{loc}$ fulfilling (1) (solid curves) for $\alpha = 0.01, 0.05, 0.1$ (from bottom to top) and $n = 10^3,\ldots,10^4$.

It seems

that $d_n(\alpha)$ approximates $\alpha_n^{loc}$ very well at least for the considered $\alpha$- and $n$-values. However, one cannot expect that this approximation works well for all $n > [\ldots]$. It is also difficult to check the appropriateness of any approximation since even simulations become more and more time consuming for larger values of $n$. Table 2 provides simulated probabilities $P(M_n \le d_n(\alpha) \mid H_0)$ based on $d_n(\alpha)$ defined in (13) with $c_\alpha = 1.6, 1.3, 1.1$ for $\alpha = 0.01, 0.05, 0.1$, respectively, and some $n \le 10^4$, and, for $n = 10^4$, simulated probabilities $P(M_n \le \alpha_n^{loc} \mid H_0)$ where $\alpha_n^{loc}$ refers to two-sided exact level $\alpha$ minp tests. Taking account of the simulation inaccuracy, minp critical values defined in (13) seem to work reasonably well for the considered $n$- and $\alpha$-values.

Table 2: Probabilities $P(M_n \le d_n(\alpha) \mid H_0)$ (and $P(M_n \le \alpha_n^{loc} \mid H_0)$ for $n = 10^4$ only) simulated by $10^5$ repetitions, where $d_n(\alpha)$ is based on $c_\alpha = 1.6, 1.3, 1.1$ for $\alpha = 0.01, 0.05, 0.1$, respectively, and $\alpha_n^{loc}$ fulfills (1) in the two-sided case. Columns: $\alpha = 0.01$, $0.05$, $0.1$; one row per sample size $n$. [The numerical entries are missing from this transcription.]

In summary, the asymptotic representation of the minp critical values $\alpha_n^*$ defined in (11) gives us some idea about the magnitude of the local level w.r.t. a single p-value. In order to improve the finite sample behavior of an asymptotic minp GOF test, new results with respect to higher order asymptotics for local levels or the Gumbel approximation seem desirable. However, this seems to be a difficult issue.

APPENDIX

Proof of Lemma 1. Part (i) immediately follows from Lemma 4.3 in Gontscharuk et

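al. (2014); for (ii) see p.10 in Gontscharuk et al. (2014); Proposition 1 in Eicker (1979) implies (iii).

Proof of Theorem 1. First, we show that the one-sided minp test with critical value $d_n \equiv \alpha_n^*(\alpha)$ is an asymptotic level $\alpha$ test, i.e.,

$$\lim_{n\to\infty} P(M_n^+ \le \alpha_n^*(\alpha) \mid H_0) = \alpha. \tag{14}$$

Let $U_{1:n} \le \cdots \le U_{n:n}$ denote the order statistics of iid $U(0,1)$-distributed random variables. Setting $c_{i,n}^{minp} \equiv F_{i,n}^{-1}(\alpha_n^*(\alpha))$, $i = 1,\ldots,n$, we get for the minp local levels defined in (10) that

$$\alpha_{i,n}^{minp} = P(U_{i:n} \le c_{i,n}^{minp}) = \alpha_n^*(\alpha) \quad \text{and} \quad P(M_n^+ \le \alpha_n^*(\alpha) \mid H_0) = P\left( \bigcup_{i=1}^n \{U_{i:n} \le c_{i,n}^{minp}\} \right).$$

For notational convenience, we split the index set $I_n = \{i : i = 1,\ldots,n\}$ into $J_1 = \{i \in I_n : i < \log(n) \text{ or } i > n - \log(n)\}$ and $J_2 = I_n \setminus J_1$. By means of the Bonferroni inequality we obtain

$$P(M_n^+ \le \alpha_n^*(\alpha) \mid H_0) \le P\left( \bigcup_{i \in J_1} \{U_{i:n} \le c_{i,n}^{minp}\} \right) + P\left( \bigcup_{i \in J_2} \{U_{i:n} \le c_{i,n}^{minp}\} \right).$$

Moreover, the Bonferroni inequality also implies

$$P\left( \bigcup_{i \in J_1} \{U_{i:n} \le c_{i,n}^{minp}\} \right) \le \sum_{i \in J_1} P(U_{i:n} \le c_{i,n}^{minp}) \le \frac{-\log(1-\alpha)}{\log_2(n)}.$$

Hence,

$$\lim_{n\to\infty} P(M_n^+ \le \alpha_n^*(\alpha) \mid H_0) = \lim_{n\to\infty} P\left( \bigcup_{i \in J_2} \{U_{i:n} \le c_{i,n}^{minp}\} \right). \tag{15}$$

For an arbitrary but fixed $\varepsilon \in (0, \min(\alpha, 1-\alpha))$ we consider two one-sided HC tests at the asymptotic level $\alpha + \varepsilon$ and $\alpha - \varepsilon$, respectively. That is, these HC tests are based on the critical values $b_n(t_{\alpha+\varepsilon}^+)$ and $b_n(t_{\alpha-\varepsilon}^+)$, respectively, with $b_n$ defined in (7) and $t_\alpha^+$ defined in (8). Due to (i) in Lemma 1, the corresponding HC local levels $\alpha_{i,n}^{HC}(\alpha - \varepsilon)$ and $\alpha_{i,n}^{HC}(\alpha + \varepsilon)$, cf. (9), fulfill

$$\alpha_{i,n}^{HC}(\alpha \pm \varepsilon)/\alpha_n^*(\alpha \pm \varepsilon) \to 1$$

as $n \to \infty$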

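uniformly for $i \in J_2$. Since $\alpha_n^*(\alpha)$ is a monotonically increasing function in $\alpha \in (0,1)$, we obtain for $n \in \mathbb{N}$ large enough that

$$\alpha_{i,n}^{HC}(\alpha - \varepsilon) \le \alpha_n^*(\alpha) \le \alpha_{i,n}^{HC}(\alpha + \varepsilon), \quad i \in J_2. \tag{16}$$

Setting $g_{i,n}(u) \equiv \sqrt{n}\,(i/n - u)/\sqrt{u(1-u)}$, local levels related to the asymptotic level $\alpha$ HC test can be represented as

$$\alpha_{i,n}^{HC}(\alpha) = P(g_{i,n}(U_{i:n}) > b_n(t_\alpha^+)), \quad i = 1,\ldots,n.$$

Hence, setting $c_{i,n}^{HC}(\alpha) \equiv g_{i,n}^{-1}(b_n(t_\alpha^+))$, $i = 1,\ldots,n$, we get

$$\alpha_{i,n}^{HC}(\alpha) = P(U_{i:n} < c_{i,n}^{HC}(\alpha)), \quad i = 1,\ldots,n.$$

Therefore, (16) implies

$$c_{i,n}^{HC}(\alpha - \varepsilon) \le c_{i,n}^{minp} \le c_{i,n}^{HC}(\alpha + \varepsilon), \quad i \in J_2,$$

for $n \in \mathbb{N}$ large enough, which immediately leads to

$$P\left( \bigcup_{i \in J_2} \{U_{i:n} \le c_{i,n}^{minp}\} \right) \ge P\left( \bigcup_{i \in J_2} \{U_{i:n} \le c_{i,n}^{HC}(\alpha - \varepsilon)\} \right) \tag{17}$$

and

$$P\left( \bigcup_{i \in J_2} \{U_{i:n} \le c_{i,n}^{minp}\} \right) \le P\left( \bigcup_{i \in J_2} \{U_{i:n} \le c_{i,n}^{HC}(\alpha + \varepsilon)\} \right). \tag{18}$$

The last statement in Lemma 1 yields that order statistics $U_{i:n}$, $i \notin J_2$, do not contribute anything to the asymptotic level of the HC test in the sense

$$\lim_{n\to\infty} P\left( \bigcup_{i=1}^n \{U_{i:n} \le c_{i,n}^{HC}(\alpha)\} \right) = \lim_{n\to\infty} P\left( \bigcup_{i \in J_2} \{U_{i:n} \le c_{i,n}^{HC}(\alpha)\} \right) = \alpha$$

for any $\alpha \in (0,1)$. This together with (15), (17) and (18) implies

$$\alpha - \varepsilon \le \lim_{n\to\infty} P(M_n^+ \le \alpha_n^*(\alpha) \mid H_0) \le \alpha + \varepsilon,$$

which is true for any $\varepsilon \in (0, \min(\alpha, 1-\alpha))$ and hence (14) follows.

Now we focus on the one-sided minp test in the general case $d_n \in (\alpha/n, \alpha)$. For a given $d_n$ we define a level $\alpha_n$ (say) as a solution of $d_n = \alpha_n^*(\alpha_n)$, i.e., $\alpha_n = 1 - \exp(-2 d_n \log_2(n)\log(n))$. W.l.o.g. let $\alpha_n \to \tilde{\alpha}$ for some $\tilde{\alpha} \in [0,1]$. Obviously,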

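if $0 < \tilde{\alpha} < 1$ then for any $\varepsilon \in (0, \min(\tilde{\alpha}, 1 - \tilde{\alpha}))$ and larger $n$ we get $\alpha_n^*(\tilde{\alpha} - \varepsilon) \le d_n \le \alpha_n^*(\tilde{\alpha} + \varepsilon)$,

if $\tilde{\alpha} = 0$ then for any $\varepsilon \in (0,1)$ and larger $n$ we get $d_n \le \alpha_n^*(\varepsilon)$,

if $\tilde{\alpha} = 1$ then for any $\varepsilon \in (0,1)$ and larger $n$ we get $d_n \ge \alpha_n^*(1 - \varepsilon)$.

Since the probability $P(M_n^+ \le d_n \mid H_0)$ is a monotonically increasing function in $d_n$, we obtain by (14) that

$$\lim_{n\to\infty} P(M_n^+ \le d_n \mid H_0) \le \tilde{\alpha} + \varepsilon \quad \text{and/or} \quad \lim_{n\to\infty} P(M_n^+ \le d_n \mid H_0) \ge \tilde{\alpha} - \varepsilon$$

for any (feasible) $\varepsilon > 0$. Therefore, $\lim_{n\to\infty} P(M_n^+ \le d_n \mid H_0) = \tilde{\alpha}$. Noting that $\tilde{\alpha} = \alpha$ iff $\lim_{n\to\infty} d_n/\alpha_n^*(\alpha) = 1$, the assertion for the one-sided test follows. The two-sided case can be proved in the same way via the assertion (ii) in Lemma 1.

Proof of Theorem 2. We first restrict attention to the one-sided case. Setting $t \equiv t_\alpha^+$ given in (8), the assertion of the theorem can be rewritten as

$$\lim_{n\to\infty} P(M_n^+ \le \bar{\Phi}(b_n(t_\alpha^+)) \mid H_0) = \alpha,$$

where $\bar{\Phi}(x) = 1 - \Phi(x)$. Due to Theorem 1 it suffices to show that

$$\lim_{n\to\infty} \bar{\Phi}(b_n(t_\alpha^+))/\alpha_n^*(\alpha) = 1.$$

Applying Mills' ratio, we get $\bar{\Phi}(b_n(t_\alpha^+)) = \varphi(b_n(t_\alpha^+))/b_n(t_\alpha^+)\,[1 + o(1)]$, where $\varphi(\cdot)$ is the density of the standard normal distribution. This together with simple analysis leads to

$$\bar{\Phi}(b_n(t_\alpha^+)) = \frac{1}{\sqrt{2\log_2(n)}}\, \frac{1}{\sqrt{2\pi}} \exp\left( -\frac{b_n(t_\alpha^+)^2}{2} \right) [1 + o(1)]$$

$$= \frac{\exp\left( -\log_2(n) - \log(\sqrt{\log_2(n)}) + \log(\sqrt{\pi}) - t_\alpha^+ + o(1) \right)}{2\sqrt{\pi \log_2(n)}}\, [1 + o(1)]$$

$$= \frac{-\log(1-\alpha)}{2\log_2(n)\log(n)}\, [1 + o(1)] = \alpha_n^*(\alpha)\,[1 + o(1)],$$

which implies the assertion. The two-sided case follows by noting that $2\bar{\Phi}(b_n(t_\alpha)) = \alpha_n^*(\alpha)[1 + o(1)]$ for $t_\alpha$ in (8).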
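The limit $\bar{\Phi}(b_n(t_\alpha^+))/\alpha_n^*(\alpha) \to 1$ used in this proof is approached very slowly, which can be checked numerically. The sketch below is only an illustration under stated assumptions: the function name `tail_ratio` is hypothetical, and the sample size enters through $\log(n)$ (assumed larger than 1) so that astronomically large $n$ can be tried without overflow.

```python
from math import exp, log, pi, sqrt
from statistics import NormalDist


def tail_ratio(log_n, alpha):
    """Return (exact ratio, Mills ratio) relative to alpha_n^*(alpha).

    exact ratio: (1 - Phi(b)) / alpha_n^*(alpha)
    Mills ratio: (phi(b) / b) / alpha_n^*(alpha)
    with b = b_n(t_alpha^+) from (7) and (8); requires log_n > 1.
    """
    log2n = log(log_n)
    log3n = log(log2n)
    t_plus = -log(-log(1.0 - alpha))
    root = sqrt(2.0 * log2n)
    b = root + (log3n - log(pi) + 2.0 * t_plus) / (2.0 * root)
    a_star = -log(1.0 - alpha) / (2.0 * log2n * log_n)
    upper_tail = NormalDist().cdf(-b)              # 1 - Phi(b) by symmetry
    mills = exp(-0.5 * b * b) / sqrt(2.0 * pi) / b  # phi(b) / b
    return upper_tail / a_star, mills / a_star


for exponent in (4, 6, 100):
    exact, approx = tail_ratio(exponent * log(10.0), 0.05)
    print(f"n = 10^{exponent}: exact ratio {exact:.3f}, Mills ratio {approx:.3f}")
```

For $\alpha = 0.05$ the exact ratio is still below one quarter at $n = 10^4$ and below one half even at $n = 10^{100}$, in line with the conservative finite-sample behavior of tests based on $\alpha_n'$ reported in Section 4.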

ACKNOWLEDGEMENTS

The authors gratefully acknowledge the constructive comments and suggestions of the anonymous referee. This work was supported by the Ministry of Science and Research of the State of North Rhine-Westphalia (MIWF NRW) and the German Federal Ministry of Health (BMG).

BIBLIOGRAPHY

Aldor-Noiman, S., Brown, L., Buja, A., Rolke, W. and Stine, R. (2013). The power to see: A new graphical test of normality. Am. Stat., 67:4.

Berk, R. and Jones, D. (1978). Relatively optimal combinations of test statistics. Scand. J. Stat., 5.

Berk, R. and Jones, D. (1979). Goodness-of-fit test statistics that dominate the Kolmogorov statistics. Z. Wahrscheinlichkeit., 47.

Buja, A. and Rolke, W. (2006). Calibration for simultaneity: (Re)Sampling methods for simultaneous inference with applications to function estimation and functional data. Unpublished manuscript.

Donoho, D. and Jin, J. (2004). Higher criticism for detecting sparse heterogeneous mixtures. Ann. Stat., 32.

Donoho, D. and Jin, J. (2009). Feature selection by higher criticism thresholding achieves the optimal phase diagram. Philos. Tr. R. Soc. A, 367.

Eicker, F. (1979). The asymptotic distribution of the suprema of the standardized empirical processes. Ann. Stat., 7.

Gontscharuk, V., Landwehr, S. and Finner, H. (2014). Goodness of fit tests in terms of local levels with special emphasis on higher criticism tests. Accepted for publication in Bernoulli.

Gontscharuk, V., Landwehr, S. and Finner, H. (2015). The intermediates take it all: asymptotics of higher criticism statistics and a powerful alternative based on equal local levels. Biometrical J., 57.

Hall, P. and Jin, J. (2008). Properties of higher criticism under strong dependence. Ann. Stat., 36.

Jaeschke, D. (1979). The asymptotic distribution of the supremum of the standardized empirical distribution function on subintervals. Ann. Stat., 7.

Jager, L. and Wellner, J. (2007). Goodness-of-fit tests via phi-divergences. Ann. Stat., 35.

Khmaladze, E. and Shinjikashvili, E. (2001). Calculation of noncrossing probabilities for Poisson processes and its corollaries. Adv. Appl. Probab., 33.

Mary, D. and Ferrari, A. (2014). A non-asymptotic standardization of binomial counts in higher criticism. IEEE International Symposium on Information Theory (ISIT).

Shorack, G. and Wellner, J. (2009). Empirical Processes with Applications to Statistics. Philadelphia: Society for Industrial and Applied Mathematics.


More information

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES Sanat K. Sarkar a a Department of Statistics, Temple University, Speakman Hall (006-00), Philadelphia, PA 19122, USA Abstract The concept

More information

High Breakdown Analogs of the Trimmed Mean

High Breakdown Analogs of the Trimmed Mean High Breakdown Analogs of the Trimmed Mean David J. Olive Southern Illinois University April 11, 2004 Abstract Two high breakdown estimators that are asymptotically equivalent to a sequence of trimmed

More information

Lecture 16: Sample quantiles and their asymptotic properties

Lecture 16: Sample quantiles and their asymptotic properties Lecture 16: Sample quantiles and their asymptotic properties Estimation of quantiles (percentiles Suppose that X 1,...,X n are i.i.d. random variables from an unknown nonparametric F For p (0,1, G 1 (p

More information

Controlling Bayes Directional False Discovery Rate in Random Effects Model 1

Controlling Bayes Directional False Discovery Rate in Random Effects Model 1 Controlling Bayes Directional False Discovery Rate in Random Effects Model 1 Sanat K. Sarkar a, Tianhui Zhou b a Temple University, Philadelphia, PA 19122, USA b Wyeth Pharmaceuticals, Collegeville, PA

More information

Bahadur representations for bootstrap quantiles 1

Bahadur representations for bootstrap quantiles 1 Bahadur representations for bootstrap quantiles 1 Yijun Zuo Department of Statistics and Probability, Michigan State University East Lansing, MI 48824, USA zuo@msu.edu 1 Research partially supported by

More information

Lecture 2 One too many inequalities

Lecture 2 One too many inequalities University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 2 One too many inequalities In lecture 1 we introduced some of the basic conceptual building materials of the course.

More information

Cramér-Type Moderate Deviation Theorems for Two-Sample Studentized (Self-normalized) U-Statistics. Wen-Xin Zhou

Cramér-Type Moderate Deviation Theorems for Two-Sample Studentized (Self-normalized) U-Statistics. Wen-Xin Zhou Cramér-Type Moderate Deviation Theorems for Two-Sample Studentized (Self-normalized) U-Statistics Wen-Xin Zhou Department of Mathematics and Statistics University of Melbourne Joint work with Prof. Qi-Man

More information

Econometrica, Vol. 71, No. 1 (January, 2003), CONSISTENT TESTS FOR STOCHASTIC DOMINANCE. By Garry F. Barrett and Stephen G.

Econometrica, Vol. 71, No. 1 (January, 2003), CONSISTENT TESTS FOR STOCHASTIC DOMINANCE. By Garry F. Barrett and Stephen G. Econometrica, Vol. 71, No. 1 January, 2003), 71 104 CONSISTENT TESTS FOR STOCHASTIC DOMINANCE By Garry F. Barrett and Stephen G. Donald 1 Methods are proposed for testing stochastic dominance of any pre-specified

More information

BY JIAN LI AND DAVID SIEGMUND Stanford University

BY JIAN LI AND DAVID SIEGMUND Stanford University The Annals of Statistics 2015, Vol. 43, No. 3, 1323 1350 DOI: 10.1214/15-AOS1312 Institute of Mathematical Statistics, 2015 HIGHER CRITICISM: p-values AND CRITICISM BY JIAN LI AND DAVID SIEGMUND Stanford

More information

Two-stage stepup procedures controlling FDR

Two-stage stepup procedures controlling FDR Journal of Statistical Planning and Inference 38 (2008) 072 084 www.elsevier.com/locate/jspi Two-stage stepup procedures controlling FDR Sanat K. Sarar Department of Statistics, Temple University, Philadelphia,

More information

Bi-s -Concave Distributions

Bi-s -Concave Distributions Bi-s -Concave Distributions Jon A. Wellner (Seattle) Non- and Semiparametric Statistics In Honor of Arnold Janssen, On the occasion of his 65th Birthday Non- and Semiparametric Statistics: In Honor of

More information

Systems Simulation Chapter 7: Random-Number Generation

Systems Simulation Chapter 7: Random-Number Generation Systems Simulation Chapter 7: Random-Number Generation Fatih Cavdur fatihcavdur@uludag.edu.tr April 22, 2014 Introduction Introduction Random Numbers (RNs) are a necessary basic ingredient in the simulation

More information

Proof. We indicate by α, β (finite or not) the end-points of I and call

Proof. We indicate by α, β (finite or not) the end-points of I and call C.6 Continuous functions Pag. 111 Proof of Corollary 4.25 Corollary 4.25 Let f be continuous on the interval I and suppose it admits non-zero its (finite or infinite) that are different in sign for x tending

More information

Estimates for probabilities of independent events and infinite series

Estimates for probabilities of independent events and infinite series Estimates for probabilities of independent events and infinite series Jürgen Grahl and Shahar evo September 9, 06 arxiv:609.0894v [math.pr] 8 Sep 06 Abstract This paper deals with finite or infinite sequences

More information

Cramér type moderate deviations for trimmed L-statistics

Cramér type moderate deviations for trimmed L-statistics arxiv:1608.05015v1 [math.pr] 17 Aug 2016 Cramér type moderate deviations for trimmed L-statistics Nadezhda Gribkova St.Petersburg State University, Mathematics and Mechanics Faculty, 199034, Universitetskaya

More information

Asymptotic results for empirical measures of weighted sums of independent random variables

Asymptotic results for empirical measures of weighted sums of independent random variables Asymptotic results for empirical measures of weighted sums of independent random variables B. Bercu and W. Bryc University Bordeaux 1, France Workshop on Limit Theorems, University Paris 1 Paris, January

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Evenly sensitive KS-type inference on distributions: new computational, Bayesian, and two-sample contributions

Evenly sensitive KS-type inference on distributions: new computational, Bayesian, and two-sample contributions Evenly sensitive KS-type inference on distributions: new computational, Bayesian, and two-sample contributions Matt Goldman David M. Kaplan January 19, 2015; first version December 4, 2012 Abstract In

More information

,... We would like to compare this with the sequence y n = 1 n

,... We would like to compare this with the sequence y n = 1 n Example 2.0 Let (x n ) n= be the sequence given by x n = 2, i.e. n 2, 4, 8, 6,.... We would like to compare this with the sequence = n (which we know converges to zero). We claim that 2 n n, n N. Proof.

More information

Extreme Value for Discrete Random Variables Applied to Avalanche Counts

Extreme Value for Discrete Random Variables Applied to Avalanche Counts Extreme Value for Discrete Random Variables Applied to Avalanche Counts Pascal Alain Sielenou IRSTEA / LSCE / CEN (METEO FRANCE) PostDoc (MOPERA and ECANA projects) Supervised by Nicolas Eckert and Philippe

More information

Design of the Fuzzy Rank Tests Package

Design of the Fuzzy Rank Tests Package Design of the Fuzzy Rank Tests Package Charles J. Geyer July 15, 2013 1 Introduction We do fuzzy P -values and confidence intervals following Geyer and Meeden (2005) and Thompson and Geyer (2007) for three

More information

CPSC 531: Random Numbers. Jonathan Hudson Department of Computer Science University of Calgary

CPSC 531: Random Numbers. Jonathan Hudson Department of Computer Science University of Calgary CPSC 531: Random Numbers Jonathan Hudson Department of Computer Science University of Calgary http://www.ucalgary.ca/~hudsonj/531f17 Introduction In simulations, we generate random values for variables

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

Endogeny for the Logistic Recursive Distributional Equation

Endogeny for the Logistic Recursive Distributional Equation Zeitschrift für Analysis und ihre Anendungen Journal for Analysis and its Applications Volume 30 20, 237 25 DOI: 0.47/ZAA/433 c European Mathematical Society Endogeny for the Logistic Recursive Distributional

More information

PCA with random noise. Van Ha Vu. Department of Mathematics Yale University

PCA with random noise. Van Ha Vu. Department of Mathematics Yale University PCA with random noise Van Ha Vu Department of Mathematics Yale University An important problem that appears in various areas of applied mathematics (in particular statistics, computer science and numerical

More information

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3083-3093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko Helmut-Schmidt University Hamburg University

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

Optimal detection of heterogeneous and heteroscedastic mixtures

Optimal detection of heterogeneous and heteroscedastic mixtures J. R. Statist. Soc. B (0) 73, Part 5, pp. 69 66 Optimal detection of heterogeneous and heteroscedastic mixtures T. Tony Cai and X. Jessie Jeng University of Pennsylvania, Philadelphia, USA and Jiashun

More information

Adv. App. Stat. Presentation On a paradoxical property of the Kolmogorov Smirnov two-sample test

Adv. App. Stat. Presentation On a paradoxical property of the Kolmogorov Smirnov two-sample test Adv. App. Stat. Presentation On a paradoxical property of the Kolmogorov Smirnov two-sample test Stefan Hasselgren Niels Bohr Institute March 9, 2017 Slide 1/11 Bias of Kolmogorov g.o.f. test Draw a sample

More information

Sharp threshold functions for random intersection graphs via a coupling method.

Sharp threshold functions for random intersection graphs via a coupling method. Sharp threshold functions for random intersection graphs via a coupling method. Katarzyna Rybarczyk Faculty of Mathematics and Computer Science, Adam Mickiewicz University, 60 769 Poznań, Poland kryba@amu.edu.pl

More information

Asymptotic efficiency of simple decisions for the compound decision problem

Asymptotic efficiency of simple decisions for the compound decision problem Asymptotic efficiency of simple decisions for the compound decision problem Eitan Greenshtein and Ya acov Ritov Department of Statistical Sciences Duke University Durham, NC 27708-0251, USA e-mail: eitan.greenshtein@gmail.com

More information

Phase Transition Phenomenon in Sparse Approximation

Phase Transition Phenomenon in Sparse Approximation Phase Transition Phenomenon in Sparse Approximation University of Utah/Edinburgh L1 Approximation: May 17 st 2008 Convex polytopes Counting faces Sparse Representations via l 1 Regularization Underdetermined

More information

Convergence of Multivariate Quantile Surfaces

Convergence of Multivariate Quantile Surfaces Convergence of Multivariate Quantile Surfaces Adil Ahidar Institut de Mathématiques de Toulouse - CERFACS August 30, 2013 Adil Ahidar (Institut de Mathématiques de Toulouse Convergence - CERFACS) of Multivariate

More information

Lehrstuhl für Statistik und Ökonometrie. Diskussionspapier 87 / Some critical remarks on Zhang s gamma test for independence

Lehrstuhl für Statistik und Ökonometrie. Diskussionspapier 87 / Some critical remarks on Zhang s gamma test for independence Lehrstuhl für Statistik und Ökonometrie Diskussionspapier 87 / 2011 Some critical remarks on Zhang s gamma test for independence Ingo Klein Fabian Tinkl Lange Gasse 20 D-90403 Nürnberg Some critical remarks

More information

STAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed.

STAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed. STAT 302 Introduction to Probability Learning Outcomes Textbook: A First Course in Probability by Sheldon Ross, 8 th ed. Chapter 1: Combinatorial Analysis Demonstrate the ability to solve combinatorial

More information

ASYMPTOTIC PROPERTIES OF SOME GOODNESS-OF-FIT TESTS BASED ON THE L1-NORM

ASYMPTOTIC PROPERTIES OF SOME GOODNESS-OF-FIT TESTS BASED ON THE L1-NORM Ann. Inst. Statist. Math. Vol. 41, No. 4, 753-764 (1989) ASYMPTOTIC PROPERTIES OF SOME GOODNESS-OF-FIT TESTS BASED ON THE L1-NORM SIGEO AKI* AND NOBUHISA KASHIWAGI The Institute of Statistical Mathematics,

More information

THE ASYMPTOTICS OF L-STATISTICS FOR NON I.I.D. VARIABLES WITH HEAVY TAILS

THE ASYMPTOTICS OF L-STATISTICS FOR NON I.I.D. VARIABLES WITH HEAVY TAILS PROBABILITY AN MATHEMATICAL STATISTICS Vol. 31, Fasc. 2 2011, pp. 285 299 THE ASYMPTOTICS OF L-STATISTICS FOR NON I.I.. VARIABLES WITH HEAVY TAILS BY AAM BA R C Z Y K, ARNOL JA N S S E N AN MARKUS PAU

More information

Lower Bounds for Testing Bipartiteness in Dense Graphs

Lower Bounds for Testing Bipartiteness in Dense Graphs Lower Bounds for Testing Bipartiteness in Dense Graphs Andrej Bogdanov Luca Trevisan Abstract We consider the problem of testing bipartiteness in the adjacency matrix model. The best known algorithm, due

More information

On Rescaled Poisson Processes and the Brownian Bridge. Frederic Schoenberg. Department of Statistics. University of California, Los Angeles

On Rescaled Poisson Processes and the Brownian Bridge. Frederic Schoenberg. Department of Statistics. University of California, Los Angeles On Rescaled Poisson Processes and the Brownian Bridge Frederic Schoenberg Department of Statistics University of California, Los Angeles Los Angeles, CA 90095 1554, USA Running head: Rescaled Poisson Processes

More information

Asymptotic results for empirical measures of weighted sums of independent random variables

Asymptotic results for empirical measures of weighted sums of independent random variables Asymptotic results for empirical measures of weighted sums of independent random variables B. Bercu and W. Bryc University Bordeaux 1, France Seminario di Probabilità e Statistica Matematica Sapienza Università

More information

OHSU OGI Class ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Brainerd Basic Statistics Sample size?

OHSU OGI Class ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Brainerd Basic Statistics Sample size? ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Basic Statistics Sample size? Sample size determination: text section 2-4-2 Page 41 section 3-7 Page 107 Website::http://www.stat.uiowa.edu/~rlenth/Power/

More information

van Rooij, Schikhof: A Second Course on Real Functions

van Rooij, Schikhof: A Second Course on Real Functions vanrooijschikhofproblems.tex December 5, 2017 http://thales.doa.fmph.uniba.sk/sleziak/texty/rozne/pozn/books/ van Rooij, Schikhof: A Second Course on Real Functions Some notes made when reading [vrs].

More information

PM functions, their characteristic intervals and iterative roots

PM functions, their characteristic intervals and iterative roots ANNALES POLONICI MATHEMATICI LXV.2(1997) PM functions, their characteristic intervals and iterative roots by Weinian Zhang (Chengdu) Abstract. The concept of characteristic interval for piecewise monotone

More information

Statistical Applications in Genetics and Molecular Biology

Statistical Applications in Genetics and Molecular Biology Statistical Applications in Genetics and Molecular Biology Volume 5, Issue 1 2006 Article 28 A Two-Step Multiple Comparison Procedure for a Large Number of Tests and Multiple Treatments Hongmei Jiang Rebecca

More information

EXPLICIT NONPARAMETRIC CONFIDENCE INTERVALS FOR THE VARIANCE WITH GUARANTEED COVERAGE

EXPLICIT NONPARAMETRIC CONFIDENCE INTERVALS FOR THE VARIANCE WITH GUARANTEED COVERAGE EXPLICIT NONPARAMETRIC CONFIDENCE INTERVALS FOR THE VARIANCE WITH GUARANTEED COVERAGE Joseph P. Romano Department of Statistics Stanford University Stanford, California 94305 romano@stat.stanford.edu Michael

More information

STAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song

STAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song STAT 992 Paper Review: Sure Independence Screening in Generalized Linear Models with NP-Dimensionality J.Fan and R.Song Presenter: Jiwei Zhao Department of Statistics University of Wisconsin Madison April

More information

Recall the Basics of Hypothesis Testing

Recall the Basics of Hypothesis Testing Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE

More information

Forcing unbalanced complete bipartite minors

Forcing unbalanced complete bipartite minors Forcing unbalanced complete bipartite minors Daniela Kühn Deryk Osthus Abstract Myers conjectured that for every integer s there exists a positive constant C such that for all integers t every graph of

More information

Asymptotic statistics using the Functional Delta Method

Asymptotic statistics using the Functional Delta Method Quantiles, Order Statistics and L-Statsitics TU Kaiserslautern 15. Februar 2015 Motivation Functional The delta method introduced in chapter 3 is an useful technique to turn the weak convergence of random

More information

40.530: Statistics. Professor Chen Zehua. Singapore University of Design and Technology

40.530: Statistics. Professor Chen Zehua. Singapore University of Design and Technology Singapore University of Design and Technology Lecture 9: Hypothesis testing, uniformly most powerful tests. The Neyman-Pearson framework Let P be the family of distributions of concern. The Neyman-Pearson

More information

ENTROPY-BASED GOODNESS OF FIT TEST FOR A COMPOSITE HYPOTHESIS

ENTROPY-BASED GOODNESS OF FIT TEST FOR A COMPOSITE HYPOTHESIS Bull. Korean Math. Soc. 53 (2016), No. 2, pp. 351 363 http://dx.doi.org/10.4134/bkms.2016.53.2.351 ENTROPY-BASED GOODNESS OF FIT TEST FOR A COMPOSITE HYPOTHESIS Sangyeol Lee Abstract. In this paper, we

More information

arxiv:math/ v1 [math.st] 29 Dec 2006 Jianqing Fan Peter Hall Qiwei Yao

arxiv:math/ v1 [math.st] 29 Dec 2006 Jianqing Fan Peter Hall Qiwei Yao TO HOW MANY SIMULTANEOUS HYPOTHESIS TESTS CAN NORMAL, STUDENT S t OR BOOTSTRAP CALIBRATION BE APPLIED? arxiv:math/0701003v1 [math.st] 29 Dec 2006 Jianqing Fan Peter Hall Qiwei Yao ABSTRACT. In the analysis

More information

arxiv: v1 [math.st] 31 Mar 2009

arxiv: v1 [math.st] 31 Mar 2009 The Annals of Statistics 2009, Vol. 37, No. 2, 619 629 DOI: 10.1214/07-AOS586 c Institute of Mathematical Statistics, 2009 arxiv:0903.5373v1 [math.st] 31 Mar 2009 AN ADAPTIVE STEP-DOWN PROCEDURE WITH PROVEN

More information

Exact goodness-of-fit tests for censored data

Exact goodness-of-fit tests for censored data Exact goodness-of-fit tests for censored data Aurea Grané Statistics Department. Universidad Carlos III de Madrid. Abstract The statistic introduced in Fortiana and Grané (23, Journal of the Royal Statistical

More information

BIVARIATE P-BOXES AND MAXITIVE FUNCTIONS. Keywords: Uni- and bivariate p-boxes, maxitive functions, focal sets, comonotonicity,

BIVARIATE P-BOXES AND MAXITIVE FUNCTIONS. Keywords: Uni- and bivariate p-boxes, maxitive functions, focal sets, comonotonicity, BIVARIATE P-BOXES AND MAXITIVE FUNCTIONS IGNACIO MONTES AND ENRIQUE MIRANDA Abstract. We give necessary and sufficient conditions for a maxitive function to be the upper probability of a bivariate p-box,

More information

PAijpam.eu ADAPTIVE K-S TESTS FOR WHITE NOISE IN THE FREQUENCY DOMAIN Hossein Arsham University of Baltimore Baltimore, MD 21201, USA

PAijpam.eu ADAPTIVE K-S TESTS FOR WHITE NOISE IN THE FREQUENCY DOMAIN Hossein Arsham University of Baltimore Baltimore, MD 21201, USA International Journal of Pure and Applied Mathematics Volume 82 No. 4 2013, 521-529 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu doi: http://dx.doi.org/10.12732/ijpam.v82i4.2

More information

Ahlswede Khachatrian Theorems: Weighted, Infinite, and Hamming

Ahlswede Khachatrian Theorems: Weighted, Infinite, and Hamming Ahlswede Khachatrian Theorems: Weighted, Infinite, and Hamming Yuval Filmus April 4, 2017 Abstract The seminal complete intersection theorem of Ahlswede and Khachatrian gives the maximum cardinality of

More information

Postulate 2 [Order Axioms] in WRW the usual rules for inequalities

Postulate 2 [Order Axioms] in WRW the usual rules for inequalities Number Systems N 1,2,3,... the positive integers Z 3, 2, 1,0,1,2,3,... the integers Q p q : p,q Z with q 0 the rational numbers R {numbers expressible by finite or unending decimal expansions} makes sense

More information

Quantum query complexity of entropy estimation

Quantum query complexity of entropy estimation Quantum query complexity of entropy estimation Xiaodi Wu QuICS, University of Maryland MSR Redmond, July 19th, 2017 J o i n t C e n t e r f o r Quantum Information and Computer Science Outline Motivation

More information

arxiv: v2 [math.pr] 8 Feb 2016

arxiv: v2 [math.pr] 8 Feb 2016 Noname manuscript No will be inserted by the editor Bounds on Tail Probabilities in Exponential families Peter Harremoës arxiv:600579v [mathpr] 8 Feb 06 Received: date / Accepted: date Abstract In this

More information

University of California San Diego and Stanford University and

University of California San Diego and Stanford University and First International Workshop on Functional and Operatorial Statistics. Toulouse, June 19-21, 2008 K-sample Subsampling Dimitris N. olitis andjoseph.romano University of California San Diego and Stanford

More information

Finding Outliers in Monte Carlo Computations

Finding Outliers in Monte Carlo Computations Finding Outliers in Monte Carlo Computations Prof. Michael Mascagni Department of Computer Science Department of Mathematics Department of Scientific Computing Graduate Program in Molecular Biophysics

More information

Existence and convergence of moments of Student s t-statistic

Existence and convergence of moments of Student s t-statistic U.U.D.M. Report 8:18 Existence and convergence of moments of Student s t-statistic Fredrik Jonsson Filosofie licentiatavhandling i matematisk statistik som framläggs för offentlig granskning den 3 maj,

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Exact goodness-of-fit tests for censored data

Exact goodness-of-fit tests for censored data Ann Inst Stat Math ) 64:87 3 DOI.7/s463--356-y Exact goodness-of-fit tests for censored data Aurea Grané Received: February / Revised: 5 November / Published online: 7 April The Institute of Statistical

More information

Some Statistical Inferences For Two Frequency Distributions Arising In Bioinformatics

Some Statistical Inferences For Two Frequency Distributions Arising In Bioinformatics Applied Mathematics E-Notes, 14(2014), 151-160 c ISSN 1607-2510 Available free at mirror sites of http://www.math.nthu.edu.tw/ amen/ Some Statistical Inferences For Two Frequency Distributions Arising

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations

More information

Journal Club: Higher Criticism

Journal Club: Higher Criticism Journal Club: Higher Criticism David Donoho (2002): Higher Criticism for Heterogeneous Mixtures, Technical Report No. 2002-12, Dept. of Statistics, Stanford University. Introduction John Tukey (1976):

More information

B.N.Bandodkar College of Science, Thane. Random-Number Generation. Mrs M.J.Gholba

B.N.Bandodkar College of Science, Thane. Random-Number Generation. Mrs M.J.Gholba B.N.Bandodkar College of Science, Thane Random-Number Generation Mrs M.J.Gholba Properties of Random Numbers A sequence of random numbers, R, R,., must have two important statistical properties, uniformity

More information

Optimal Detection of Heterogeneous and Heteroscedastic Mixtures

Optimal Detection of Heterogeneous and Heteroscedastic Mixtures Optimal Detection of Heterogeneous and Heteroscedastic Mixtures T. Tony Cai Department of Statistics, University of Pennsylvania X. Jessie Jeng Department of Biostatistics and Epidemiology, University

More information

Supplement to Post hoc inference via joint family-wise error rate control

Supplement to Post hoc inference via joint family-wise error rate control Supplement to Post hoc inference via joint family-wise error rate control Gilles Blanchard Universität Potsdam, Institut für Mathemati Karl-Liebnecht-Straße 24-25 14476 Potsdam, Germany e-mail: gilles.blanchard@math.uni-potsdam.de

More information

On the Bennett-Hoeffding inequality

On the Bennett-Hoeffding inequality On the Bennett-Hoeffding inequality of Iosif 1,2,3 1 Department of Mathematical Sciences Michigan Technological University 2 Supported by NSF grant DMS-0805946 3 Paper available at http://arxiv.org/abs/0902.4058

More information

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations John R. Michael, Significance, Inc. and William R. Schucany, Southern Methodist University The mixture

More information

Normal approximation of Poisson functionals in Kolmogorov distance

Normal approximation of Poisson functionals in Kolmogorov distance Normal approximation of Poisson functionals in Kolmogorov distance Matthias Schulte Abstract Peccati, Solè, Taqqu, and Utzet recently combined Stein s method and Malliavin calculus to obtain a bound for

More information

INTRODUCTION TO INTERSECTION-UNION TESTS

INTRODUCTION TO INTERSECTION-UNION TESTS INTRODUCTION TO INTERSECTION-UNION TESTS Jimmy A. Doi, Cal Poly State University San Luis Obispo Department of Statistics (jdoi@calpoly.edu Key Words: Intersection-Union Tests; Multiple Comparisons; Acceptance

More information

Lecture 6 April

Lecture 6 April Stats 300C: Theory of Statistics Spring 2017 Lecture 6 April 14 2017 Prof. Emmanuel Candes Scribe: S. Wager, E. Candes 1 Outline Agenda: From global testing to multiple testing 1. Testing the global null

More information

An elementary proof of the weak convergence of empirical processes

An elementary proof of the weak convergence of empirical processes An elementary proof of the weak convergence of empirical processes Dragan Radulović Department of Mathematics, Florida Atlantic University Marten Wegkamp Department of Mathematics & Department of Statistical

More information

Spring 2012 Math 541B Exam 1

Spring 2012 Math 541B Exam 1 Spring 2012 Math 541B Exam 1 1. A sample of size n is drawn without replacement from an urn containing N balls, m of which are red and N m are black; the balls are otherwise indistinguishable. Let X denote

More information

Nonparametric one-sided testing for the mean and related extremum problems

Nonparametric one-sided testing for the mean and related extremum problems Nonparametric one-sided testing for the mean and related extremum problems Norbert Gaffke University of Magdeburg, Faculty of Mathematics D-39016 Magdeburg, PF 4120, Germany E-mail: norbert.gaffke@mathematik.uni-magdeburg.de

More information

On Decision Making under Interval Uncertainty: A New Justification of Hurwicz Optimism-Pessimism Approach and Its Use in Group Decision Making

On Decision Making under Interval Uncertainty: A New Justification of Hurwicz Optimism-Pessimism Approach and Its Use in Group Decision Making On Decision Making under Interval Uncertainty: A New Justification of Hurwicz Optimism-Pessimism Approach and Its Use in Group Decision Making Van Nam Huynh 1, Chenyi Hu, Yoshiteru Nakamori 1, and Vladik

More information