An Alpha-Exhaustive Multiple Testing Procedure

Size: px
Start display at page:

Download "An Alpha-Exhaustive Multiple Testing Procedure"

Transcription

1 Current Research in Biostatistics Original Research Paper An Alpha-Exhaustive Multiple Testing Procedure, Mar Chang, Xuan Deng and John Balser Boston University, Boston MA, USA Veristat, Southborough MA, USA Article history Received: Revised: Accepted: Corresponding Author: Mar Chang Boston University, Boston MA, USA and Veristat, Southborough MA, USA Abstract: A multiple testing procedure can be a single-step procedure such as Bonferroni s method or a stepwise procedure such as Hochberg s stepup method and Hommel s method. It can be an -exhaustive or - conservative approach. We develop a single -exhaustive procedure that can improve power -5% over Hochberg s and Hommel s methods in common situations when the test statistics are mutually independent. The method can also be generalized to dependent test statistics. The idea behind our method is to construct the rejection rules using the product of marginal p-values and by controlling the upper bounds of the th order terms so that is controlled for any configuration of null hypotheses. Such upper bounds or critical values are determined progressively from = towards = K, the number of null hypotheses in the problem. Keywords: Multiple Testing, Alpha-Exhaustive, Hypothesis Test Introduction Multiple testing problems are common in pharmaceutical statistics and life-sciences in general. The main goal of Multiple Testing Procedures (MTP) is to (strongly) control the family wise type-i error rate. A MTP can be a single-step data-independent procedure such as Bonferroni s method or a data-dependent stepwise procedure such as Hochberg s stepup method and Hommel s stepup method. It can be an-exhaustive or -conservative approach. Conceptually, stepwise procedures are usually more powerful than single-step procedures and -exhaustive procedures are usually more powerful than -conservative approach. However, consider these two aspects together, comparisons of testing procedures are not that simple, often depending on the configuration of the alternative "hypotheses" or more precisely, the truths. For example, the power of a fallbac procedure is dependent on the weight and "effective sizes" in the alternative hypotheses. A fixed sequence testing procedure is a special case of the fallbac procedure and its power is heavily dependent on the order of the test sequence of the hypothesis. We develop a simple single -exhaustive procedure that can improve power -5% over Hochberg s and Hommel s methods in common situations when the test statistics are independent. The method can also be generalized to dependent test statistics. The idea behind our method is to construct the rejection rules using the product of marginal p-values and by controlling the upper bounds of the th order terms so that is exhausted for any configuration of null hypotheses. Such upper bounds or critical values are determined progressively from = towards = K, the number of null hypotheses in the problem. Unlie common stepwise test procedures, in which every step in the decision rule will only involve one critical value for decision-maing, the proposed -exhaustive approach is a single-step method with multiple critical values involved in the decision rules. The paper is organized as follows. In section, we will review several important stepwise test procedures that will be used in our power comparisons. In section 3, we elaborate our progressive -exhaustive procedure for two-hypothesis testing, outline the idea, derive the formulations for critical values and provide examples of using this procedure in comparison with other methods. We also provide the power formulation for the -exhaustive procedure for two-hypothesis testing. In section, we provide power comparisons among several different methods using simulation. In section 5, we extend the -exhaustive procedure to threehypothesis testing and compare with Hommel s procedure in power under broad conditions. In section 6, we further describe the -exhaustive procedure for general K-hypothesis testing and simulation algorithms for determining the critical values. In the last section, discussion and summary are provided. We place 06 Mar Chang, Xuan Deng and John Balser. This open access article is distributed under a Creative Commons Attribution (CC-BY) 3.0 license.

2 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp mathematical derivation in the Appendix. To mae the procedure ready for practical use, we have included the SAS code in the Appendix. Multiple Testing Procedures Stepwise procedures are different from single-step procedures, in the sense that a stepwise procedure must follow a specific order to test each hypothesis. In general, stepwise procedures are more powerful than single-step procedures. There are three categories of stepwise procedures that are dependent on how the stepwise tests proceed: Stepup, stepdown and fallbac procedure. The commonly used stepwise procedures include the Bonferroni-Holm stepdown method (Holm, 979), the Holm stepdown method (Dmitrieno et al., 009, p.65), Hommel s stepup procedure (Hommel, 988), Hochberg s stepup method (Hochberg, 988), the fallbac procedure (Wiens, 003) and the sequential test with fixed sequences (Westfall et al., 999). Stepdown Procedure A stepdown procedure starts with the most significant p-value and ends with the least significant p-value. In the procedure, the p-values are arranged in an ascending order, i.e., from the smallest to the largest: p p p () ( ) ( )... ( K ) with the corresponding hypotheses: H, H,..., H () K ( ) ( ) ( ) The test proceeds from H () to H (K). If p () >C ( =,..., K), retain all H (i) (i ); otherwise, reject H () and continue to test H (+). The constants C are different for different procedures. The adjusted p-values are: pɶ = C p ( ) pɶ = max ( pɶ, C p( ) ), =,..., n (3) Therefore an alternative test procedure is to compare the adjusted p-values against the unadjusted. After adjusting p-values, one can test the hypotheses in any order. Fallbac Procedure (Wiens, 003) The Holm procedure is based on a data-driven order of testing, while the fixed-sequence procedure is based on a prefixed order of testing. A compromise between them is the so-called fallbac procedure. The fallbac procedure was introduced by Wiens (003) and was further studied by Dmitrieno et al. (006; Hommel and Bretz, 008). The test procedure can be outlined as follows. In the fallbac procedure, we allocate the overall error rate among the hypotheses according to their weights w, where w 0 and w =. For fixed sequence test, w = and w =... = w K = 0: Step : Test H at = w. If p, reject this hypothesis; otherwise retain it. Go to the next step Step i =,..., K : Test H at = + w if H is rejected and at = w if H is retained. If p, reject H ; otherwise retainit. Go to the next step Step K: Test H K at K = K + w K if H K is rejected and at K = w K if H K is retained. If p K K, reject H K ; otherwise retainit The formula for the adjusted p-value is complicated to be written explicitly. Stepup Procedure A stepup procedure starts with the least significant p- value and ends with the most significant p-value. The procedure proceeds from H (K) to H (). If, P () C ( =,..., K), reject all H (i) (i ); otherwise, retain H () and continue to test H ( ). The adjusted p-values are: pɶ K = CK p( ), K pɶ = min ( pɶ +, C p( ) ), = K,..., () Progressive -Exhaustive Testing Procedure An -exhaustive procedure is a closed testing procedure based on intersection hypothesis tests the size of which is exactly. In other words, Pr (Reject H I ) = for any intersection hypothesis H I, I {,...,K}. Put in a simple way, in an -exhaustive procedure, the supremum of the probability of false rejection in any null hypothesis configuration is equal to. Many stepwise test procedures have been developed, which are not necessarily -exhaustive. Therefore, there is a room for improvement. However, an -exhaustive procedure is not necessarily a powerful test. A fixed sequence testis an -exhaustive test, but it is often the least powerful test if the sequence of tests was inappropriately chosen. Let s discuss the situation of two-hypothesis testing: H : H H versus H : H H (5) o Here H is the negation of H, =,. In this setting, if either H or H is rejected, the null hypothesis H o is 3

3 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp rejected. Let p and p be the marginalp-values for testing H and H, respectively. The decision rules of the progressive -exhaustive testing procedure are: If p p and p, then reject H If p p and p, then reject H where critical value > 0 and > 0. The idea behind this procedure is to borrow strength among marginal p-values. In plain language, the procedure says that we don t have to mae an adjustment, as long as p and the other p-value p is small. For example, if p = and p = 0.0, we can reject H. The and are so determined that when both H and H are true, the type-i error will not exceed. The procedure can control the Familywise Error Rate (FWER) strongly but at the same time exhaust all under all the null hypothesis configurations: H, H and H H. This is done progressively as described below. Step : To control the familywise error rate at level, when only H is true and H is not true, then p p can be satisfied with probability of (if, for example, the test drug is very effective, p will be virtually always 0). Therefore, to control FWER, a necessary condition is sup Pr(p p p H H ) = sup Pr (p H ) =. Therefore, type-i error is strongly controlled and exhausted when H H is true. By the same toen, we can reach the same conclusion when H H is true. Step : Now we need to determine and to exhaust when H H is true. In this study, we only consider the case when the two test statistics, p and p, are independent. Under the global null hypothesis H 0, p and p are iid U (0, ), which is equivalent to two standard normal test statistics: z = -Φ(p ) and z =-Φ(p ) under H 0 being independent. However, woring on the p-scale, the testing procedure can be used for different endpoints (normal, binary, survival), as long as p and p are independent and stochastically equal to or larger than uniform p and p. Since T = p p < implies p <, we have the p conditional cdf for T: T p ( ) p, < = < F T p, p p (6) Fig.. Pr (p p p ) If, then Pr (p p p H,H ) = Pr (p H,H ) =. If <, then (Fig. for a geometric interpretation): Pr ( p p p ) F ( T p T p ) f ( p ) dp dp dp (7) = < = = + ln Pr Thus: ( p p p ) p, = + ln, (8) Consequently, the type-i error rate under H H is given by (for simplicity, we just use FWER (H H )): ( H ) ( p p p p p p ) ( p p p ) Pr( p p p ) ( p p ( a ) p p ) FWER H = Pr = Pr + Pr min, = + ln + + ln ( ), for min, (9) In (9), we have used the following result: if min(, ), then: 3

4 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp ( p p ( ) p p ) ( p p ) Pr min, = Pr = (0) However, if > min(a, ), then the probability becomes (Fig. ): Pr ( p p p p ) min (, )/ = dp + dp 0 min (, )/ p = min (, ) + min (, ) ln min, () ( ) To summarize the type-i error rates under various null configurations, we have: ( ) ( ) FWER H = FWER H = FWER H ( H ), min > = + ln + + ln, min + ln + + ln min + ln, min < min where, min = min (, ): () (3) The and are determined so that FWER(H H ) =. We are not interested in, because p p in the rejection criteria has no effect. In fact, FWER(H H ) = - = will have no solution for any between 0 and. We are not interested in < either, because it maes the conditions, p < and p <, have no effect in determining the rejection boundary. In fact: FWER( H H ) = + ln + + ln min + ln = min has no solution for < and <. Therefore, the only scenario that we are interested in is: FWER ( H H ) = + ln + + ln, () With (), we can determine the rejection boundaries, for given. Here are the steps: Choose so that < Let FWER (H H ) = to solve for That is, is the solution of: + ln + + ln =, min (5) Examples of critical values from (5) are presented in Table and. When =, (5) can be simplified as: ln + = (6) Fig.. Pr (p p p p ) The critical values for various with = are presented in Table 3. The rejection boundaries in Table -3 have been verified each by 0,000,000 simulations. Illustrative Example Suppose in the Statistical Analysis Plan for a cardiovascular trial with two primary endpoints (not coprimary endpoints that have to be met simultaneously), the two test statistics for the two hypotheses (H and H ) of the two endpoints are assumed to be independent and the -exhaustive procedure (with one-sided = = and = 0.05) was specified in the analysis for 33

5 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp the multiplicity adjustment to control FWER. At the end of trial, the p-values for the two endpoints are: Scenario () one-sided p = 0.0 and p = 0.05, scenario () p = 0.0 and p = 0., (3) p = 0.05 and p = 0.0, () p = 0.0 and p = 0.6 and (5) p = 0.0 and p = 0.5. Using the -exhaustive procedure for scenario (), we reject both H and H because p p = p = 0.0 to reject H and p p = p = 0.05 to reject H. For scenario (), we reject H but not H because p p = p = 0.0 and p p = p = 0. >. For scenario (3), we reject H, but not H. For scenario (), we reject H but not H. For scenario (5), we can reject neither H nor H. Using the test procedures described in the previous section, we summarize the rejection status in Table. The -exhaustive procedure can reject at least one hypothesis except for scenario 5. The reason that -Ex method (with = ) cannot reject any hypothesis for scenario (5) is that the method emphasizes the consistency of the evidence against all the hypotheses and such consistency is obviously not presented in scenario (5). The power of -Ex procedure ( = ) for the twohypothesis at one-side -level can be written as: ( z ) ( z ) ( ( ) ( )) Φ Φ Power = Pr ( min Φ z, Φ z < H, H As a comparison, the power of Hommel s procedure for the two-hypothesis at one-sided -level can be written as: ( Φ( z ) Φ( z )) ( ( ) ( )) min Power = Pr min Φ z, Φ z < / H, H Table. Critical values for one-sided = Table. Critical values for one-sided = Table 3. Critical values for one-sided test when = , Table. Rejection with different test procedures Scenario Method 3 5 Fixed Sequence H, H H H H Bonferroni H H Fallbac (w = 0.5) H H Holm-Stepdown H H Hochberg H, H H Hommel H, H H H -exhaustive H, H H H H Note: One-sided = 0.05, = = for -exhaustive Table 5. Power comparisons for two-hypothesis testing (δ = 0.3, σ = ) δ = 0.3 δ = 0.5 δ = Method Power Power Power Power Power Power Fixed Seq (H,H ) Fixed Seq (H,H ) Bonferroni Fallbac (H,H ) Fallbac (H,H ) Holm Hochberg Hommel Progressive -Ex Note: Sample size = 90, one-sided = 0.05, = = for Progressive -Ex 3

6 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp Power Comparison of Two Hypotheses There seems a general impression that whatever the test procedure to use the power of rejection cannot be improved significantly for two-hypothesis testing. However, this is not necessarily true. We have compared power of seven different testing methods described in section and presented results in Table 5, where Power is probability of simultaneously rejecting H :δ 0 and H : δ 0 and Power is the probability of rejecting either H or H. For the fallbac procedure the weights w = w = 0.5 are used. The fixed sequence method is equivalent to the fallbac procedure with w = and w = 0. The progressive -exhaustive procedure performs overall the best, while the Hommel method performs the second best. In general, Holm procedure is uniformly more powerful than the Bonferroni procedure. Hochberg s procedure is uniformly more powerful than Holm s procedure and Hommel s procedure is uniformly more powerful than Hochberg s procedure. Holm, fixed-sequence and fallbac are nonparametric and control FWER for any joint distribution of test statistics. Hommel and Hochberg procedures are semi parametric and control FWER only for some joint distributions, including positively dependent test statistics such as multivariate normal test statistics. Nonparametric procedures mae no assumptions about the joint distribution of test statistics which results in power loss (Dmitrieno, 03). For two-hypothesis testing, Hochberg s method is equivalent to Hommel s method. The power of the fallbac method depends on the weights w i and the order of the hypotheses. Comparisons of Hommel s method to use of the -Ex method with different and are presented in Table 6. For -Ex, = = ; -Ex, = ; = ; -Ex 3, = , =.6 ; -Ex, = , =.5 ; -Ex 5, = , =.5. From the table, we can see that all -Ex procedures perform very well except -Ex 5, in which the treatment effect δ is smaller than δ, but alphas were set up in a wrong direction ( =.5 > ). In general, should be chosen larger than if δ is expected larger than δ ; otherwise choose. Table 6. Power comparison for two-hypothesis testing δ = δ (σ = ) Method 0.0/ / /0.3 0./ /0.3 0./ /0.3 Hommel Ex Ex Ex Ex Ex The reason that -Ex method (with = ) has lower power than Hommel s method at the extreme case when δ = 0 and δ = 0.3 is that the former emphasizes the consistency of the evidence against all the hypotheses, while δ = 0 and δ = 0.3 are clearly inconsistent. If we believe δ is smaller than δ, we should use different and (e.g., =.6 in - Ex 3 ). However, if the directional guess is wrong, it could reduce the power as seen in -Ex 5. Formulation for Three Hypotheses We now discuss the progressive -Exhaustive procedure for three-hypothesis testing: H : H H H vsh : H H H (7) 0 3 a 3 Similar to two-hypothesis testing, the rejectionacceptance rules of the -exhaustive procedure for threehypothesis testing are: If p p p 3 p p p p 3 p, reject H ; otherwise accept H If p p p 3 p p p p 3 p, reject H, otherwise accept H If p p p 3 < p 3 p 3 p 3 p 3 p 3, reject H 3, otherwise accept H 3 Here 3. Determination of Critical Values The derivations of the critical values are placed in the Appendix. Here we described the ey steps and summarized the results. The critical values, and 3 are determined by all the paired null hypotheses: H H, H H 3 and H H 3. To exhaust familywise error rate, it necessarily requires that FWER(H H ) =, FWER(H H 3 ) = and FWER(H 3 H ) =, which are equivalent to (Appendix), respectively: + ln + + ln = + ln ln =, < 3 < (8) ln + + ln = 3 Each equation in (8) is in the same expression as (5). Therefore, the critical values in Table -3 are also valid for (8). 35

7 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp Table 7. Critical values for one-sided test ( = = 3 ) ,, Table 8. Power comparison δ = δ (δ 3 = 0.3, σ = ) Method 0.0/ / /0.3 0./0.3 0./0. 0./0. 0.3/0.3 Hommel Dunnet Step-up Graphic Approach Ex Note: = 0.05, = = 3 = , = , sample size = 60 To exhaust under H H H 3, it requires that FWER(H H H 3 ) =, that is (assume = = 3, Appendix): ln + 3 ( ) + 3 = (9) Now we can use (8) to determine, and 3 and use (9) to determine for the case when = = 3. Examples of critical values for various are presented in Table 7. The critical values can also be determined through simulations. Especially when dimension is high, simulation is a convenient way to obtain the results: For given (,, 3 ), we can use simulation by trying different until FWER(H H H 3 ) =. We have verified the critical values through simulations: For = 0.05 and = = 3 = , = ; the type-i error rate is under H H H 3 through 0,000,000 simulations. This progressive method to determine the critical values can be generalized to K-hypothesis testing. Power Comparison Let H i : δ i 0, i =,, 3. Using the rejection boundaries in Table 7, we can easily obtain the power of the -Ex method through simulations. To compare the performance of -Ex, we compared the best method, Hommel s method as the standard. Since there are three hypotheses, it is meaningful to compare our approach to other more recently developed approaches. However, the gate eeping procedure is difficult to communicate with the nonstatisticians and requires large set of tests when the number of individual hypotheses increases. An iterative graphical approach by Bretz et al. (009) deals with those weaness and constructs the Bonferroni-type tests with a simple updating algorithm that fully describes a sequentially rejective test procedure. The graphic approach was then extended by dissociating the underlying weighting strategy and applied using weighted Bonferroni tests, weighted parametric tests and weighted Simes tests (Bretz et al., 0). The existing methods controls the FWER; However, the power is addressed or compared with other methods. From Table 8, we can see that the -Ex procedure provides more power in all cases except the case when δ = δ = 0 and δ = 0.3. Again, lie in the case of two-hypothesis testing, when the parameters in the alternative hypotheses (e.g., effects of the different endpoints) are very different, we should use different, and 3 such that their trend is in opposite to the trend of parameters in the alternative hypotheses. K-Hypothesis Testing Procedure We now discuss the progressive -exhaustive procedure for K-hypothesis testing. To avoid the rejection boundary being too small, causing inconvenience, we use term p i= i instead of p i = i in the decision rules for a general K-hypothesis testing. It is obvious that these two test statistics are equivalent in terms of power. For K-hypothesis testing, the K rejection rules are specified as: We will reject H i if and only if: /3 / ( ) ( ) p p p, p p, p. i l l i i > l, i, l i i We didn t include higher order term of p-product because through simulations, we find that even we set p i p j p p m, the FWER is controlled. This means that for a multiple testing problem with more than four hypothese, the proposed procedure may not be an alpha-exhaustive one. The rejection boundaries,,... K are progressively determined: Determine = based on 36

8 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp one-hypothesis testing, then given, determine based on two-hypothesis testing; and given and, determine 3 based on three-hypothesis testing. The process continues until K is determined. For high dimensional hypothesis testing problems, Partition Principle for multiple testing (Hsu, 996) can be used to reduce the number of null configurations to be tested. Simulation is usually more convenient than numerical integration when the dimension is high. Summary and Discussion To construct a MTP, we need to consider at least three things to ensure the power: () -exhaustive, () synergize strengths among data for local hypothesis or marginal p-values and (3) be able to use correlations between local test statistics or local p-values. In principle, the proposed -exhaustive procedure has considered all three aspects. To achieve -exhaustive, we use the marginal p-value product corresponding to each null hypothesis configuration and enforce it with an upper bound in the rejection rules. Such p-value product terms in the rejection rules also ensure the synergy between the marginal p-values. The K-hypothesis testing algorithm can be applied to the test statistics with correlations with modifications of critical regions for rejection (the critical values in Table -3 are applicable for independent test statistics), but due to its complexity and larger applications in clinical trials (dose-finding, subgroup analysis, adaptive design), we are developing separate manuscripts to address that. Unlie traditional stepwise procedures, the decision rule in the progressive -exhaustive procedure explicitly uses a set of statistics (p,p p,p p 3,p p p 3,etc.) with a set of critical values in the decision rule for rejecting a single H ( =,,..., K). In this sense, the decision rules in the -exhaustive procedure are expressed in the form of "adjusted p-values" and hence the order of the tests is irrelevant. Many stepwise testing procedures also have the feature of borrowing strengths among data for local hypotheses, but such dependencies are realized through a discrete function. The-exhaustive procedure uses a continuous dependency function of marginal p- values, i.e., product of p-values, which is more effective. We have also tried other functions such as average p-values or linear combination of normalinverse p-values, the results are similar. In summary, the proposed progressive -exhaustive procedure is not only statistically powerful, but it also stresses the importance of clinical/practical meaningfulness since the method emphasizes the consistency among the evidences coming from different endpoints, different doses and different populations, that is, the totality of the evidence. The test procedure is simple and performs well in broad situations. When the true "standardized" effect size (value of the parameter) is very different for different hypothesis, the critical values don t have to the same for rejecting all the hypotheses. Instead, the critical values can be different and optimized based on the prior information on effect size or considering the importance of different endpoints. Appendices Derivations of Formulations of Critical Values for Three-Hypothesis Testing FWER H = suppr H3 H3 ( H ) ( p p p3 p p p p3 p ) ( p pp3 p p p p3 p ) (( p p p ) ( p p p )) = suppr ( p p p ) Pr ( p p p ) ( p p p p ) = Pr + Pr = + ln + + ln,, < where, FWER(H H ) is the type-i error rate under FWER(H H ) and sup is the supreme under all H3 possible H 3. To control type-i error under H H, H H 3 and H 3 H, it is required that, respectively: ln ln + ln + + ln = + ln + + ln = = (A) From (A ), we can see that among, and 3, at least two should be the same, either = or = 3 ( ). The type-i error rate under H H H 3 can be expressed as: ( 3 ) ( p p p3 p p p p3 p ) ( p p p3 p p p p3 p ) ( p p p3 p3 p 3 p3 p 3 p3 ) FWER H H H = Pr For the purpose of critical value derivation, we rewrite FWER(H H H 3 )as: 37

9 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp ( ) FWER H H H = π + π 3 + π π π π + π where, under H H H 3 : ( p p p p p p p p ) ( p p p p p p p p ) ( p p p p p p p p ) (A) π = Pr 3 3, π = Pr 3 3, (A3) π3 = Pr p p p p p Pr, 3 π = p p3 p p3 p p p p p3 p p π 3 = Pr, (A) p p3 p3 p 3 p p3 p p p3 p3 p 3 π 3 = Pr, p p3 p p p3 p and: π 3 p p p3 p p < p p3 < = Pr p3 p < p < p < p3 < (A5) We will use Fig. 3 to assist in our integration of π, π andπ 3. That is: π + = adp dp + dp dp + dp dp Ω Ω p Ω3 p Ω dp dp p p = p dpdp dp dp 0 p p dp dp p p = + ln + (A6) Similarly, we use Fig. to assist in our integration of π, π 3 and π 3. That is: π = dp dp + dp dp + dp dp Ω Ω p Ω3 p ( since ) p dpdp 0 p = + < ( ) = a + a = (A7) For π 3, we just give the result for the case when 3 = = 3. Assume, otherwise only p p p 3 has an effect in the joint probabilities, while p p and other will have no effect. Fig. 3.π = π = π 3 38

10 Mar Chang et al. / Current Research in Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp Fig..π Fig. 5. π 3 39

11 Mar Chang et al. / American Journal of Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp In fact, the is so small that the three curves p i p j = cut the cube into a smaller cube, as shown in Fig. 5. Therefore, we have: π = dp dp = (A8) 3 3 Ω To Summarize, we have: ( 3 ) FWER H H H = 3 a + ln + 3( ) + 3 = 3 + ln + 3 ( ) (A9) To control, FWER(H H H 3 ) at level, it is required that (assume we have chosen = = 3 ): 3 3a + ln 3 ( ) + 3 = (A0) After = = 3 is determined using (A), we can solve from (A0). For a general case, after is chosen between and, and 3 can be determined using (A) and can be easily obtained from simulations. The power simulations are presented in Table 8. SAS Code for Progressive Test Procedure /* Progressive Alpha-exhaustive Test Procedure for Two-Hypothesis */ %Macro aextesth(nsims, u, u, sigma, N, alpha, alpha, alpha); * nsims = the number of simulation runs; * u, u = parameters for H and H. sigma = common standard deviation; * N = sample size; * alpha, alpha, alpha = critical values on p-scale; * Power = prob of rejecting H or H, * PowerBoth = prob of rejecting H and H.; Data aexh; eep u u N PowerBoth Power; N=&N; u=&u; u=&u; sigma=σ alpha=α Power=0; PowerBoth=0; Do isim= To &nsims; z=rand("normal", &u, sigma/sqrt(n))/sigma*sqrt(n); p=-cdf("normal", z); z=rand("normal", &u, sigma/sqrt(n))/sigma*sqrt(n); p=-cdf("normal", z); sig=0; sig=0; If p*p<=&alpha And p<=alpha Then sig=; If p*p<=&alpha And p<=alpha Then sig=; If sig= OR sig= Then Power=Power+/&nSims; If sig= And sig= Then PowerBoth=PowerBoth+/&nSims; End; Output; Run; %Mend; Title "Checing Type-I Error under H and H"; %aextesth( , 0, 0.0,, 90, , , 0.05); Proc print data=aexh; Run; Title "Power under H: u=0.3 and H: u= 0.3"; %aextesth(000000, 0.3, 0.3,, 90, , , 0.05); Proc print data=aexh; Run; /* Progressive Alpha-exhaustive Test Procedure for Three-Hypothesis */ %Macro aextest3h(nsims, u, u, u3, sigma, N, alpha, alpha, alpha); * nsims = the number of simulation runs; * N = sample size; * u, u, u3 = parmeters for H, H and H3; * alpha, alpha, alpha = cretical values on p-scale; * Power = prob of rejecting H or H or H3; * PowerAll = prob of rejecting H, H and H3 simutanneuosly; Data aex3h; eep u u u3 sigma N alpha PowerAll Power; u=&u; u=&u; u3=&u3; sigma=σ N=&N; alpha=α alpha=α alpha=α Power=0; PowerAll=0; Do isim= To &nsims; z=rand("normal", u, sigma/sqrt(n))/sigma*sqrt(n); p=-cdf("normal", z); z=rand("normal", u, sigma/sqrt(n))/sigma*sqrt(n); p=-cdf("normal", z); z3=rand("normal", u3, sigma/sqrt(n))/sigma*sqrt(n); p3=-cdf("normal", z3); sig=0; sig=0; sig3=0; p=p*p*p3; If p<=alpha & p*p<=alpha & p*p3<=alpha & p<=alpha Then sig=; If p<=alpha & p*p<=alpha & p*p3<=alpha & p<=alpha Then sig=; If p<=alpha & p3*p<=alpha & p3*p<=alpha & p3<=alpha Then sig3=; If sig= OR sig= Or sig3= Then Power=Power+/&nSims; 0

12 Mar Chang et al. / American Journal of Biostatistics 06, 6 (): 30. DOI:0.38/amjbsp If sig= And sig= And sig3= Then PowerAll=PowerAll+/&nSims; End; Output; Run; %Mend; Title "Checing Type-I Error under H, H and H3"; %aextest3h( , 0, 0, 0,, 60, , , 0.05); proc print data=aex3h; Run; Title "Power when u=0, u=0.3 and u3= 0.3"; %aextest3h(000000, 0, 0.3, 0.3,, 60, , , 0.05); Proc print data=aex3h; Run; Acnowledgment We would lie to than the anonymous reviewers and editors for their reviews. Author s Contributions Mar Chang: Involved in the methodology development, draft and approval of the manuscript. Xuan Deng: Contributed to reveiw, revise and approval of the manuscript. John Balser: Discussion on idea, issues, review manuscript. Ethics This article is original and contains unpublished material. The corresponding author confirms that all of the authors have read and approved the manuscript and no ethical issues involved. Reference Bretz, F., W. Maurer, W. Brannath and M. Posch, 009. A graphical approach to sequentially rejective multiple test procedures. Stat. Med., 8: DOI: 0.00/sim.395 Bretz, F., W. Maurer and G. Hommel, 0. Test and power considerations for multiple endpoint analyses using sequentially rejective graphical procedures. Stat. Med., 30: DOI: 0.00/sim.3988 Dmitrieno, A., A.C. Tamhane and F. Bretz, 009. Multiple Testing Problems in Pharmaceutical Statistics. st Edn., CRC Press, ISBN-0: , pp: 30. Dmitrieno, A., B.L. Wiens and P.H. Westfall, 006. Fallbac tests in dose-response clinical trials. J. Biopharmaceutical Stat., 6: DOI: 0.080/ Dmitrieno, A., 03. Multiple testing procedures in clinical trials. Proceedings of the IBS Worshop, Sept. 9-0, Berlin. Hochberg, Y., 988. A sharper Bonferroni procedure for multiple tests of significance. Biometria, 75: DOI: 0.307/33635 Holm, S., 979. A simple sequentially rejective multiple test procedure. Scandinavian J. Stat., 6: Hommel, G., 988. A stagewise rejective multiple test procedure based on a modified Bonferroni test. Biometria, 75: DOI: 0.093/biomet/ Hommel, G. and F. Bretz, 008. Aesthetics and power considerations in multiple testing-a contradiction? Biom. J., 50: DOI: 0.00/bimj Hsu, J.C., 996. Multiple Comparisons: Theory and Methods. st Edn., CRC Press, ISBN-0: 0988, pp: 96. Westfall, P.H., R.D. Tobias, D. Rom, R.D. Wolfinger and Y. Hochberg, 999. Multiple comparisons and multiple tests using SAS system. SAS Institute, SAS Campus Drivew, Cary, North Carolina, USA. Wiens, B.L., 003. A fixed sequence Bonferroni procedure for testing multiple endpoints. Pharmaceutical Stat., : -5. DOI: 0.00/pst.6

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS Journal of Biopharmaceutical Statistics, 21: 726 747, 2011 Copyright Taylor & Francis Group, LLC ISSN: 1054-3406 print/1520-5711 online DOI: 10.1080/10543406.2011.551333 MULTISTAGE AND MIXTURE PARALLEL

More information

Superchain Procedures in Clinical Trials. George Kordzakhia FDA, CDER, Office of Biostatistics Alex Dmitrienko Quintiles Innovation

Superchain Procedures in Clinical Trials. George Kordzakhia FDA, CDER, Office of Biostatistics Alex Dmitrienko Quintiles Innovation August 01, 2012 Disclaimer: This presentation reflects the views of the author and should not be construed to represent the views or policies of the U.S. Food and Drug Administration Introduction We describe

More information

On Generalized Fixed Sequence Procedures for Controlling the FWER

On Generalized Fixed Sequence Procedures for Controlling the FWER Research Article Received XXXX (www.interscience.wiley.com) DOI: 10.1002/sim.0000 On Generalized Fixed Sequence Procedures for Controlling the FWER Zhiying Qiu, a Wenge Guo b and Gavin Lynch c Testing

More information

Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests.

Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests. 1/35 Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests. Martin Posch Center for Medical Statistics, Informatics and Intelligent Systems Medical University

More information

A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications

A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications Thomas Brechenmacher (Dainippon Sumitomo Pharma Co., Ltd.) Jane Xu (Sunovion Pharmaceuticals Inc.) Alex Dmitrienko

More information

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials Testing a secondary endpoint after a group sequential test Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 9th Annual Adaptive Designs in

More information

Familywise Error Rate Controlling Procedures for Discrete Data

Familywise Error Rate Controlling Procedures for Discrete Data Familywise Error Rate Controlling Procedures for Discrete Data arxiv:1711.08147v1 [stat.me] 22 Nov 2017 Yalin Zhu Center for Mathematical Sciences, Merck & Co., Inc., West Point, PA, U.S.A. Wenge Guo Department

More information

Control of Directional Errors in Fixed Sequence Multiple Testing

Control of Directional Errors in Fixed Sequence Multiple Testing Control of Directional Errors in Fixed Sequence Multiple Testing Anjana Grandhi Department of Mathematical Sciences New Jersey Institute of Technology Newark, NJ 07102-1982 Wenge Guo Department of Mathematical

More information

On adaptive procedures controlling the familywise error rate

On adaptive procedures controlling the familywise error rate , pp. 3 On adaptive procedures controlling the familywise error rate By SANAT K. SARKAR Temple University, Philadelphia, PA 922, USA sanat@temple.edu Summary This paper considers the problem of developing

More information

ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao

ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE By Wenge Guo and M. Bhaskara Rao National Institute of Environmental Health Sciences and University of Cincinnati A classical approach for dealing

More information

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials Research Article Received 29 January 2010, Accepted 26 May 2010 Published online 18 April 2011 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.4008 Mixtures of multiple testing procedures

More information

Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim

Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim Frank Bretz Statistical Methodology, Novartis Joint work with Martin Posch (Medical University

More information

A class of improved hybrid Hochberg Hommel type step-up multiple test procedures

A class of improved hybrid Hochberg Hommel type step-up multiple test procedures Biometrika (2014), 101,4,pp. 899 911 doi: 10.1093/biomet/asu032 Printed in Great Britain Advance Access publication 24 October 2014 A class of improved hybrid Hochberg Hommel type step-up multiple test

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 7, Issue 1 2011 Article 12 Consonance and the Closure Method in Multiple Testing Joseph P. Romano, Stanford University Azeem Shaikh, University of Chicago

More information

Adaptive Dunnett Tests for Treatment Selection

Adaptive Dunnett Tests for Treatment Selection s for Treatment Selection Franz König 1 Werner Brannath 1 Frank Bretz 2 Martin Posch 1 1 Section of Medical Statistics Medical University of Vienna 2 Novartis Pharma AG Basel Workshop Adaptive Designs

More information

Hochberg Multiple Test Procedure Under Negative Dependence

Hochberg Multiple Test Procedure Under Negative Dependence Hochberg Multiple Test Procedure Under Negative Dependence Ajit C. Tamhane Northwestern University Joint work with Jiangtao Gou (Northwestern University) IMPACT Symposium, Cary (NC), November 20, 2014

More information

Applying the Benjamini Hochberg procedure to a set of generalized p-values

Applying the Benjamini Hochberg procedure to a set of generalized p-values U.U.D.M. Report 20:22 Applying the Benjamini Hochberg procedure to a set of generalized p-values Fredrik Jonsson Department of Mathematics Uppsala University Applying the Benjamini Hochberg procedure

More information

On weighted Hochberg procedures

On weighted Hochberg procedures Biometrika (2008), 95, 2,pp. 279 294 C 2008 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn018 On weighted Hochberg procedures BY AJIT C. TAMHANE Department of Industrial Engineering

More information

On Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses

On Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses On Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses Gavin Lynch Catchpoint Systems, Inc., 228 Park Ave S 28080 New York, NY 10003, U.S.A. Wenge Guo Department of Mathematical

More information

Two-stage stepup procedures controlling FDR

Two-stage stepup procedures controlling FDR Journal of Statistical Planning and Inference 38 (2008) 072 084 www.elsevier.com/locate/jspi Two-stage stepup procedures controlling FDR Sanat K. Sarar Department of Statistics, Temple University, Philadelphia,

More information

INTRODUCTION TO INTERSECTION-UNION TESTS

INTRODUCTION TO INTERSECTION-UNION TESTS INTRODUCTION TO INTERSECTION-UNION TESTS Jimmy A. Doi, Cal Poly State University San Luis Obispo Department of Statistics (jdoi@calpoly.edu Key Words: Intersection-Union Tests; Multiple Comparisons; Acceptance

More information

Statistica Sinica Preprint No: SS R1

Statistica Sinica Preprint No: SS R1 Statistica Sinica Preprint No: SS-2017-0072.R1 Title Control of Directional Errors in Fixed Sequence Multiple Testing Manuscript ID SS-2017-0072.R1 URL http://www.stat.sinica.edu.tw/statistica/ DOI 10.5705/ss.202017.0072

More information

Closure properties of classes of multiple testing procedures

Closure properties of classes of multiple testing procedures AStA Adv Stat Anal (2018) 102:167 178 https://doi.org/10.1007/s10182-017-0297-0 ORIGINAL PAPER Closure properties of classes of multiple testing procedures Georg Hahn 1 Received: 28 June 2016 / Accepted:

More information

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018 High-Throughput Sequencing Course Multiple Testing Biostatistics and Bioinformatics Summer 2018 Introduction You have previously considered the significance of a single gene Introduction You have previously

More information

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints The Design of Group Sequential Clinical Trials that Test Multiple Endpoints Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Bruce Turnbull

More information

Modified Simes Critical Values Under Positive Dependence

Modified Simes Critical Values Under Positive Dependence Modified Simes Critical Values Under Positive Dependence Gengqian Cai, Sanat K. Sarkar Clinical Pharmacology Statistics & Programming, BDS, GlaxoSmithKline Statistics Department, Temple University, Philadelphia

More information

Optimal exact tests for multiple binary endpoints

Optimal exact tests for multiple binary endpoints Optimal exact tests for multiple binary endpoints Robin Ristl 1, Dong Xi, Ekkehard Glimm 3,4, Martin Posch 1 December 3, 16 arxiv:161.7561v1 [stat.me] Dec 16 Abstract In confirmatory clinical trials with

More information

Adaptive Treatment Selection with Survival Endpoints

Adaptive Treatment Selection with Survival Endpoints Adaptive Treatment Selection with Survival Endpoints Gernot Wassmer Institut für Medizinische Statisti, Informati und Epidemiologie Universität zu Köln Joint wor with Marus Roters, Omnicare Clinical Research,

More information

A note on tree gatekeeping procedures in clinical trials

A note on tree gatekeeping procedures in clinical trials STATISTICS IN MEDICINE Statist. Med. 2008; 06:1 6 [Version: 2002/09/18 v1.11] A note on tree gatekeeping procedures in clinical trials Alex Dmitrienko 1, Ajit C. Tamhane 2, Lingyun Liu 2, Brian L. Wiens

More information

Consonance and the Closure Method in Multiple Testing. Institute for Empirical Research in Economics University of Zurich

Consonance and the Closure Method in Multiple Testing. Institute for Empirical Research in Economics University of Zurich Institute for Empirical Research in Economics University of Zurich Working Paper Series ISSN 1424-0459 Working Paper No. 446 Consonance and the Closure Method in Multiple Testing Joseph P. Romano, Azeem

More information

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013 Adaptive designs beyond p-value combination methods Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013 Outline Introduction Combination-p-value method and conditional error function

More information

Group sequential designs for Clinical Trials with multiple treatment arms

Group sequential designs for Clinical Trials with multiple treatment arms Group sequential designs for Clinical Trials with multiple treatment arms Susanne Urach, Martin Posch Cologne, June 26, 2015 This project has received funding from the European Union s Seventh Framework

More information

IMPROVING TWO RESULTS IN MULTIPLE TESTING

IMPROVING TWO RESULTS IN MULTIPLE TESTING IMPROVING TWO RESULTS IN MULTIPLE TESTING By Sanat K. Sarkar 1, Pranab K. Sen and Helmut Finner Temple University, University of North Carolina at Chapel Hill and University of Duesseldorf October 11,

More information

Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample Size Re-estimation

Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample Size Re-estimation Research Article Received XXXX (www.interscience.wiley.com) DOI: 10.100/sim.0000 Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample

More information

Bonferroni - based gatekeeping procedure with retesting option

Bonferroni - based gatekeeping procedure with retesting option Bonferroni - based gatekeeping procedure with retesting option Zhiying Qiu Biostatistics and Programming, Sanofi Bridgewater, NJ 08807, U.S.A. Wenge Guo Department of Mathematical Sciences New Jersey Institute

More information

PROCEDURES CONTROLLING THE k-fdr USING. BIVARIATE DISTRIBUTIONS OF THE NULL p-values. Sanat K. Sarkar and Wenge Guo

PROCEDURES CONTROLLING THE k-fdr USING. BIVARIATE DISTRIBUTIONS OF THE NULL p-values. Sanat K. Sarkar and Wenge Guo PROCEDURES CONTROLLING THE k-fdr USING BIVARIATE DISTRIBUTIONS OF THE NULL p-values Sanat K. Sarkar and Wenge Guo Temple University and National Institute of Environmental Health Sciences Abstract: Procedures

More information

ROI ANALYSIS OF PHARMAFMRI DATA:

ROI ANALYSIS OF PHARMAFMRI DATA: ROI ANALYSIS OF PHARMAFMRI DATA: AN ADAPTIVE APPROACH FOR GLOBAL TESTING Giorgos Minas, John A.D. Aston, Thomas E. Nichols and Nigel Stallard Department of Statistics and Warwick Centre of Analytical Sciences,

More information

Multiple Testing in Group Sequential Clinical Trials

Multiple Testing in Group Sequential Clinical Trials Multiple Testing in Group Sequential Clinical Trials Tian Zhao Supervisor: Michael Baron Department of Mathematical Sciences University of Texas at Dallas txz122@utdallas.edu 7/2/213 1 Sequential statistics

More information

Procedures controlling generalized false discovery rate

Procedures controlling generalized false discovery rate rocedures controlling generalized false discovery rate By SANAT K. SARKAR Department of Statistics, Temple University, hiladelphia, A 922, U.S.A. sanat@temple.edu AND WENGE GUO Department of Environmental

More information

Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests

Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests K. Strassburger 1, F. Bretz 2 1 Institute of Biometrics & Epidemiology German Diabetes Center,

More information

A Brief Introduction to Intersection-Union Tests. Jimmy Akira Doi. North Carolina State University Department of Statistics

A Brief Introduction to Intersection-Union Tests. Jimmy Akira Doi. North Carolina State University Department of Statistics Introduction A Brief Introduction to Intersection-Union Tests Often, the quality of a product is determined by several parameters. The product is determined to be acceptable if each of the parameters meets

More information

Lecture 6 April

Lecture 6 April Stats 300C: Theory of Statistics Spring 2017 Lecture 6 April 14 2017 Prof. Emmanuel Candes Scribe: S. Wager, E. Candes 1 Outline Agenda: From global testing to multiple testing 1. Testing the global null

More information

Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and the Bootstrap

Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and the Bootstrap University of Zurich Department of Economics Working Paper Series ISSN 1664-7041 (print) ISSN 1664-705X (online) Working Paper No. 254 Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and

More information

Adaptive clinical trials with subgroup selection

Adaptive clinical trials with subgroup selection Adaptive clinical trials with subgroup selection Nigel Stallard 1, Tim Friede 2, Nick Parsons 1 1 Warwick Medical School, University of Warwick, Coventry, UK 2 University Medical Center Göttingen, Göttingen,

More information

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES Sanat K. Sarkar a a Department of Statistics, Temple University, Speakman Hall (006-00), Philadelphia, PA 19122, USA Abstract The concept

More information

Chapter 13 Section D. F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons

Chapter 13 Section D. F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons Explaining Psychological Statistics (2 nd Ed.) by Barry H. Cohen Chapter 13 Section D F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons In section B of this chapter,

More information

Two-Phase, Three-Stage Adaptive Designs in Clinical Trials

Two-Phase, Three-Stage Adaptive Designs in Clinical Trials Japanese Journal of Biometrics Vol. 35, No. 2, 69 93 (2014) Preliminary Report Two-Phase, Three-Stage Adaptive Designs in Clinical Trials Hiroyuki Uesaka 1, Toshihiko Morikawa 2 and Akiko Kada 3 1 The

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests 1999 Prentice-Hall, Inc. Chap. 8-1 Chapter Topics Hypothesis Testing Methodology Z Test

More information

Stepwise Gatekeeping Procedures in Clinical Trial Applications

Stepwise Gatekeeping Procedures in Clinical Trial Applications 984 Biometrical Journal 48 (2006) 6, 984 991 DOI: 10.1002/bimj.200610274 Stepwise Gatekeeping Procedures in Clinical Trial Applications Alex Dmitrienko *,1, Ajit C. Tamhane 2, Xin Wang 2, and Xun Chen

More information

Type-II Generalized Family-Wise Error Rate Formulas with Application to Sample Size Determination

Type-II Generalized Family-Wise Error Rate Formulas with Application to Sample Size Determination Type-II Generalized Family-Wise Error Rate Formulas with Application to Sample Size Determination Benoit Liquet 1,2, Pierre Lafaye de Micheaux 3 and Jérémie Riou 4 1 University de Pau et Pays de l Adour,

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 267 Optimizing Randomized Trial Designs to Distinguish which Subpopulations Benefit from

More information

Family-wise Error Rate Control in QTL Mapping and Gene Ontology Graphs

Family-wise Error Rate Control in QTL Mapping and Gene Ontology Graphs Family-wise Error Rate Control in QTL Mapping and Gene Ontology Graphs with Remarks on Family Selection Dissertation Defense April 5, 204 Contents Dissertation Defense Introduction 2 FWER Control within

More information

Finding Critical Values with Prefixed Early. Stopping Boundaries and Controlled Type I. Error for A Two-Stage Adaptive Design

Finding Critical Values with Prefixed Early. Stopping Boundaries and Controlled Type I. Error for A Two-Stage Adaptive Design Finding Critical Values with Prefixed Early Stopping Boundaries and Controlled Type I Error for A Two-Stage Adaptive Design Jingjing Chen 1, Sanat K. Sarkar 2, and Frank Bretz 3 September 27, 2009 1 ClinForce-GSK

More information

ON TWO RESULTS IN MULTIPLE TESTING

ON TWO RESULTS IN MULTIPLE TESTING ON TWO RESULTS IN MULTIPLE TESTING By Sanat K. Sarkar 1, Pranab K. Sen and Helmut Finner Temple University, University of North Carolina at Chapel Hill and University of Duesseldorf Two known results in

More information

Multiple Testing. Anjana Grandhi. BARDS, Merck Research Laboratories. Rahway, NJ Wenge Guo. Department of Mathematical Sciences

Multiple Testing. Anjana Grandhi. BARDS, Merck Research Laboratories. Rahway, NJ Wenge Guo. Department of Mathematical Sciences Control of Directional Errors in Fixed Sequence arxiv:1602.02345v2 [math.st] 18 Mar 2017 Multiple Testing Anjana Grandhi BARDS, Merck Research Laboratories Rahway, NJ 07065 Wenge Guo Department of Mathematical

More information

Alpha-recycling for the analyses of primary and secondary endpoints of. clinical trials

Alpha-recycling for the analyses of primary and secondary endpoints of. clinical trials Alpharecycling for the analyses of primary and secondary endpoints of Presenters: clinical trials Mohammad Huque, Ph.D. FDA/CDER/OTS/ Office of Biostatistics Sirisha Mushti, Ph.D. Division of Biometrics

More information

Statistical Applications in Genetics and Molecular Biology

Statistical Applications in Genetics and Molecular Biology Statistical Applications in Genetics and Molecular Biology Volume 5, Issue 1 2006 Article 28 A Two-Step Multiple Comparison Procedure for a Large Number of Tests and Multiple Treatments Hongmei Jiang Rebecca

More information

arxiv: v1 [math.st] 14 Nov 2012

arxiv: v1 [math.st] 14 Nov 2012 The Annals of Statistics 2010, Vol. 38, No. 6, 3782 3810 DOI: 10.1214/10-AOS829 c Institute of Mathematical Statistics, 2010 arxiv:1211.3313v1 [math.st] 14 Nov 2012 THE SEQUENTIAL REJECTION PRINCIPLE OF

More information

GENERALIZING SIMES TEST AND HOCHBERG S STEPUP PROCEDURE 1. BY SANAT K. SARKAR Temple University

GENERALIZING SIMES TEST AND HOCHBERG S STEPUP PROCEDURE 1. BY SANAT K. SARKAR Temple University The Annals of Statistics 2008, Vol. 36, No. 1, 337 363 DOI: 10.1214/009053607000000550 Institute of Mathematical Statistics, 2008 GENERALIZING SIMES TEST AND HOCHBERG S STEPUP PROCEDURE 1 BY SANAT K. SARKAR

More information

arxiv: v1 [math.st] 13 Mar 2008

arxiv: v1 [math.st] 13 Mar 2008 The Annals of Statistics 2008, Vol. 36, No. 1, 337 363 DOI: 10.1214/009053607000000550 c Institute of Mathematical Statistics, 2008 arxiv:0803.1961v1 [math.st] 13 Mar 2008 GENERALIZING SIMES TEST AND HOCHBERG

More information

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare

More information

STEPDOWN PROCEDURES CONTROLLING A GENERALIZED FALSE DISCOVERY RATE. National Institute of Environmental Health Sciences and Temple University

STEPDOWN PROCEDURES CONTROLLING A GENERALIZED FALSE DISCOVERY RATE. National Institute of Environmental Health Sciences and Temple University STEPDOWN PROCEDURES CONTROLLING A GENERALIZED FALSE DISCOVERY RATE Wenge Guo 1 and Sanat K. Sarkar 2 National Institute of Environmental Health Sciences and Temple University Abstract: Often in practice

More information

Statistical Applications in Genetics and Molecular Biology

Statistical Applications in Genetics and Molecular Biology Statistical Applications in Genetics and Molecular Biology Volume 3, Issue 1 2004 Article 14 Multiple Testing. Part II. Step-Down Procedures for Control of the Family-Wise Error Rate Mark J. van der Laan

More information

Group Sequential Trial with a Biomarker Subpopulation

Group Sequential Trial with a Biomarker Subpopulation Group Sequential Trial with a Biomarker Subpopulation Ting-Yu (Jeff) Chen, Jing Zhao, Linda Sun and Keaven Anderson ASA Biopharm Workshop, Sep 13th. 2018 1 Outline Motivation: Phase III PD-L1/PD-1 Monotherapy

More information

Lecture 7 April 16, 2018

Lecture 7 April 16, 2018 Stats 300C: Theory of Statistics Spring 2018 Lecture 7 April 16, 2018 Prof. Emmanuel Candes Scribe: Feng Ruan; Edited by: Rina Friedberg, Junjie Zhu 1 Outline Agenda: 1. False Discovery Rate (FDR) 2. Properties

More information

The Pennsylvania State University The Graduate School Eberly College of Science GENERALIZED STEPWISE PROCEDURES FOR

The Pennsylvania State University The Graduate School Eberly College of Science GENERALIZED STEPWISE PROCEDURES FOR The Pennsylvania State University The Graduate School Eberly College of Science GENERALIZED STEPWISE PROCEDURES FOR CONTROLLING THE FALSE DISCOVERY RATE A Dissertation in Statistics by Scott Roths c 2011

More information

Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Statistics Journal Club, 36-825 Beau Dabbs and Philipp Burckhardt 9-19-2014 1 Paper

More information

UNIFORMLY MOST POWERFUL TESTS FOR SIMULTANEOUSLY DETECTING A TREATMENT EFFECT IN THE OVERALL POPULATION AND AT LEAST ONE SUBPOPULATION

UNIFORMLY MOST POWERFUL TESTS FOR SIMULTANEOUSLY DETECTING A TREATMENT EFFECT IN THE OVERALL POPULATION AND AT LEAST ONE SUBPOPULATION UNIFORMLY MOST POWERFUL TESTS FOR SIMULTANEOUSLY DETECTING A TREATMENT EFFECT IN THE OVERALL POPULATION AND AT LEAST ONE SUBPOPULATION BY MICHAEL ROSENBLUM Department of Biostatistics, Johns Hopkins Bloomberg

More information

Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem

Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem Lei Yang, Ph.D. Statistical Scientist, Roche (China) Holding Ltd. Aug 30 th 2018, Shanghai Jiao Tong

More information

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika. Biometrika Trust A Stagewise Rejective Multiple Test Procedure Based on a Modified Bonferroni Test Author(s): G. Hommel Source: Biometrika, Vol. 75, No. 2 (Jun., 1988), pp. 383-386 Published by: Biometrika

More information

A Gatekeeping Test on a Primary and a Secondary Endpoint in a Group Sequential Design with Multiple Interim Looks

A Gatekeeping Test on a Primary and a Secondary Endpoint in a Group Sequential Design with Multiple Interim Looks A Gatekeeping Test in a Group Sequential Design 1 A Gatekeeping Test on a Primary and a Secondary Endpoint in a Group Sequential Design with Multiple Interim Looks Ajit C. Tamhane Department of Industrial

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Inference for Binomial Parameters

Inference for Binomial Parameters Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

Contrasts and Multiple Comparisons Supplement for Pages

Contrasts and Multiple Comparisons Supplement for Pages Contrasts and Multiple Comparisons Supplement for Pages 302-323 Brian Habing University of South Carolina Last Updated: July 20, 2001 The F-test from the ANOVA table allows us to test the null hypothesis

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,

More information

Philippe Delorme a, Pierre Lafaye de Micheaux a,

Philippe Delorme a, Pierre Lafaye de Micheaux a, Package SSDDA : Sample Size Determination and Data Analysis in the context of continuous co-primary endpoints in clinical trials. Philippe Delorme a, Pierre Lafaye de Micheaux a, Benoît Liquet c,d and

More information

Step-down FDR Procedures for Large Numbers of Hypotheses

Step-down FDR Procedures for Large Numbers of Hypotheses Step-down FDR Procedures for Large Numbers of Hypotheses Paul N. Somerville University of Central Florida Abstract. Somerville (2004b) developed FDR step-down procedures which were particularly appropriate

More information

Adaptive Designs: Why, How and When?

Adaptive Designs: Why, How and When? Adaptive Designs: Why, How and When? Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj ISBS Conference Shanghai, July 2008 1 Adaptive designs:

More information

This paper has been submitted for consideration for publication in Biometrics

This paper has been submitted for consideration for publication in Biometrics BIOMETRICS, 1 10 Supplementary material for Control with Pseudo-Gatekeeping Based on a Possibly Data Driven er of the Hypotheses A. Farcomeni Department of Public Health and Infectious Diseases Sapienza

More information

Exact and Approximate Stepdown Methods For Multiple Hypothesis Testing

Exact and Approximate Stepdown Methods For Multiple Hypothesis Testing Exact and Approximate Stepdown Methods For Multiple Hypothesis Testing Joseph P. Romano Department of Statistics Stanford University Michael Wolf Department of Economics and Business Universitat Pompeu

More information

Dose-response modeling with bivariate binary data under model uncertainty

Dose-response modeling with bivariate binary data under model uncertainty Dose-response modeling with bivariate binary data under model uncertainty Bernhard Klingenberg 1 1 Department of Mathematics and Statistics, Williams College, Williamstown, MA, 01267 and Institute of Statistics,

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

Control of Generalized Error Rates in Multiple Testing

Control of Generalized Error Rates in Multiple Testing Institute for Empirical Research in Economics University of Zurich Working Paper Series ISSN 1424-0459 Working Paper No. 245 Control of Generalized Error Rates in Multiple Testing Joseph P. Romano and

More information

Multiple Testing of General Contrasts: Truncated Closure and the Extended Shaffer-Royen Method

Multiple Testing of General Contrasts: Truncated Closure and the Extended Shaffer-Royen Method Multiple Testing of General Contrasts: Truncated Closure and the Extended Shaffer-Royen Method Peter H. Westfall, Texas Tech University Randall D. Tobias, SAS Institute Pairwise Comparisons ANOVA, g =10groups,

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Multiple Testing. Hoang Tran. Department of Statistics, Florida State University

Multiple Testing. Hoang Tran. Department of Statistics, Florida State University Multiple Testing Hoang Tran Department of Statistics, Florida State University Large-Scale Testing Examples: Microarray data: testing differences in gene expression between two traits/conditions Microbiome

More information

Analysis of Multiple Endpoints in Clinical Trials

Analysis of Multiple Endpoints in Clinical Trials Chapter 4 Analysis of Multiple Endpoints in Clinical Trials Ajit C. Tamhane Northwestern University Alex Dmitrienko Eli Lilly and Company 4.1 Introduction Most human diseases are characterized by multidimensional

More information

Controlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method

Controlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method Controlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method Christopher R. Genovese Department of Statistics Carnegie Mellon University joint work with Larry Wasserman

More information

The Design of a Survival Study

The Design of a Survival Study The Design of a Survival Study The design of survival studies are usually based on the logrank test, and sometimes assumes the exponential distribution. As in standard designs, the power depends on The

More information

Multiple testing: Intro & FWER 1

Multiple testing: Intro & FWER 1 Multiple testing: Intro & FWER 1 Mark van de Wiel mark.vdwiel@vumc.nl Dep of Epidemiology & Biostatistics,VUmc, Amsterdam Dep of Mathematics, VU 1 Some slides courtesy of Jelle Goeman 1 Practical notes

More information

A BAYESIAN STEPWISE MULTIPLE TESTING PROCEDURE. By Sanat K. Sarkar 1 and Jie Chen. Temple University and Merck Research Laboratories

A BAYESIAN STEPWISE MULTIPLE TESTING PROCEDURE. By Sanat K. Sarkar 1 and Jie Chen. Temple University and Merck Research Laboratories A BAYESIAN STEPWISE MULTIPLE TESTING PROCEDURE By Sanat K. Sarar 1 and Jie Chen Temple University and Merc Research Laboratories Abstract Bayesian testing of multiple hypotheses often requires consideration

More information

SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE SEQUENTIAL DESIGN IN CLINICAL TRIALS

SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE SEQUENTIAL DESIGN IN CLINICAL TRIALS Journal of Biopharmaceutical Statistics, 18: 1184 1196, 2008 Copyright Taylor & Francis Group, LLC ISSN: 1054-3406 print/1520-5711 online DOI: 10.1080/10543400802369053 SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE

More information

Non-specific filtering and control of false positives

Non-specific filtering and control of false positives Non-specific filtering and control of false positives Richard Bourgon 16 June 2009 bourgon@ebi.ac.uk EBI is an outstation of the European Molecular Biology Laboratory Outline Multiple testing I: overview

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Notes on Decision Theory and Prediction

Notes on Decision Theory and Prediction Notes on Decision Theory and Prediction Ronald Christensen Professor of Statistics Department of Mathematics and Statistics University of New Mexico October 7, 2014 1. Decision Theory Decision theory is

More information

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

EC2001 Econometrics 1 Dr. Jose Olmo Room D309 EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:

More information

Christopher J. Bennett

Christopher J. Bennett P- VALUE ADJUSTMENTS FOR ASYMPTOTIC CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE by Christopher J. Bennett Working Paper No. 09-W05 April 2009 DEPARTMENT OF ECONOMICS VANDERBILT UNIVERSITY NASHVILLE,

More information

On Methods Controlling the False Discovery Rate 1

On Methods Controlling the False Discovery Rate 1 Sankhyā : The Indian Journal of Statistics 2008, Volume 70-A, Part 2, pp. 135-168 c 2008, Indian Statistical Institute On Methods Controlling the False Discovery Rate 1 Sanat K. Sarkar Temple University,

More information