Resampling-Based Control of the FDR
|
|
- Phillip Mervyn Stanley
- 5 years ago
- Views:
Transcription
1 Resampling-Based Control of the FDR Joseph P. Romano 1 Azeem S. Shaikh 2 and Michael Wolf 3 1 Departments of Economics and Statistics Stanford University 2 Department of Economics University of Chicago 3 Department of Economics University of Zurich
2 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
3 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
4 General Set-Up & Notation Data X =(X 1,...,X n ) from distribution P Interest in parameter vector θ(p)=θ =(θ 1,...,θ s ) The individual hypotheses concern the elements θ i, for i=1,...,s, and can be (all) one-sided or (all) two-sided One-sided hypotheses: Two-sided hypotheses: H i : θ i θ 0,i vs. H i : θ i > θ 0,i H i : θ i = θ 0,i vs. H i : θ i θ 0,i
5 General Set-Up & Notation Data X =(X 1,...,X n ) from distribution P Interest in parameter vector θ(p)=θ =(θ 1,...,θ s ) The individual hypotheses concern the elements θ i, for i=1,...,s, and can be (all) one-sided or (all) two-sided One-sided hypotheses: Two-sided hypotheses: H i : θ i θ 0,i vs. H i : θ i > θ 0,i H i : θ i = θ 0,i vs. H i : θ i θ 0,i Test statistic T n,i =(ˆθ n,i θ 0,i )/ ˆσ n,i or T n,i = ˆθ n,i θ 0,i / ˆσ n,i ˆσ n,i is a standard error for ˆθ n,i or ˆσ n,i 1/ n ˆp n,i is an individual p-value
6 The False Discovery Rate Consider s individual tests H i vs. H i. False discovery proportion F = # false rejections; R = # total rejections False discovery rate FDR P = E P (FDP) FDP= F R 1{R>0}= F max{r, 1} Goal: (strong) asymptotic control of the FDR at level α: lim supfdr P α n for all P
7 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
8 Benjamini and Hochberg (1995) Stepup method: Ordered p-values: ˆp n,(1) ˆp n,(2)... ˆp n,(s) Let j = max { j : ˆp n,(j) α j }, where αj = jα/s Reject H (1),...,H (j ) Comments: Original proof assumes independence of the p-values Validity has been extended to certain dependence types (Benjamini and Yekutieli, 2001)
9 Modifications of BH (1995) Storey, Taylor and Siegmund (2004): Under sufficient conditions for BH (1995): FDR P s 0 s α where s 0 = I(P) =#{true hypotheses} Instead of α j = jα/s use α j = jα/ŝ 0 with ŝ 0 = #{ˆp n,i > λ}+1 1 λ for some 0<λ < 1 Proof assumes the ˆp n,i to be almost independent
10 Modifications of BH (1995) Bejamini, Krieger and Yekutieli (2006): Step 1: Apply the BH (1995) procedure at nominal level α = α/(1+α) Let r denote the number of rejected hypotheses (a) If r=0, reject nothing, and stop (b) If r=s, reject everything, and stop (c) Otherwise, continue Step 2: Apply the BH (1995) procedure at nominal level α Instead of α j = jα/s use α j = jα/ŝ 0 with ŝ 0 = s r Proof assumes independence of the ˆp n,i, but simulations show robustness against various dependence structures
11 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
12 Basic Idea (Troendle, 2000) For any stepdown procedure with critical values c 1,...,c s : [ ] F 1 FDR P = E P = max{r, 1} 1 r s r E P[F R=r]P{R=r} with P{R=r}=P{T n,(s) c s,...,t n,(s r+1) c s r+1,t n,(s r) < c s r }
13 Basic Idea (Troendle, 2000) For any stepdown procedure with critical values c 1,...,c s : [ ] F 1 FDR P = E P = max{r, 1} 1 r s r E P[F R=r]P{R=r} with P{R=r}=P{T n,(s) c s,...,t n,(s r+1) c s r+1,t n,(s r) < c s r } If all false hypotheses are rejected with p. 1, then with p. 1: FDR P = s s 0 +1 r s r s+s 0 r P{T n,s0 :s 0 c s0,...,t n,s r+1:s0 c s r+1,t n,s r:s0 < c s r } (1) Here T n,r:t is the rth largest of the test statistics T n,1,...,t n,t, and we assume w.l.o.g. that I(P)={1,...,s 0 }.
14 Basic Idea (continued) Goal: Bound (1) above by α for any P, at least asymptotically In particular, this must be ensured for any 1 s 0 s. First, consider any P such that s 0 = 1: Then (1) reduces to 1 s P{T n,1:1 c 1 } And so c 1 = inf{x R: 1 s P{T n,1:1 x} α}
15 Basic Idea (continued) Goal: Bound (1) above by α for any P, at least asymptotically In particular, this must be ensured for any 1 s 0 s. First, consider any P such that s 0 = 1: Then (1) reduces to 1 s P{T n,1:1 c 1 } And so c 1 = inf{x R: 1 s P{T n,1:1 x} α} Next, consider any P such that s 0 = 2. Then (1) reduces to: 1 s 1 P{T n,2:2 c 2,T n,1:2 < c 1 }+ 2 s P{T n,2:2 c 2,T n,1:2 c 1 } And so c 2 is the smallest x R for which 1 s 1 P{T n,2:2 x,t n,1:2 < c 1 }+ 2 s P{T n,2:2 x,t n,1:2 c 1 } α
16 Basic Idea (continued) Goal: Bound (1) above by α for any P, at least asymptotically In particular, this must be ensured for any 1 s 0 s. First, consider any P such that s 0 = 1: Then (1) reduces to 1 s P{T n,1:1 c 1 } And so c 1 = inf{x R: 1 s P{T n,1:1 x} α} Next, consider any P such that s 0 = 2. Then (1) reduces to: 1 s 1 P{T n,2:2 c 2,T n,1:2 < c 1 }+ 2 s P{T n,2:2 c 2,T n,1:2 c 1 } And so c 2 is the smallest x R for which 1 s 1 P{T n,2:2 x,t n,1:2 < c 1 }+ 2 s P{T n,2:2 x,t n,1:2 c 1 } α And so forth...
17 Estimation of the c i Since P is unknown, so are the ideal critical values c i. We sugggest a bootstrap method to estimate the c i : ˆP n is an unrestricted estimate of P with θ i ( ˆP n )= ˆθ n,i X is generated from ˆP n and the T n,i are computed from X, but centered at ˆθ n,i rather than at θ 0,i E.g., for one-sided testing: T n,i =(ˆθ n,i ˆθ n,i )/ ˆσ n,i
18 Estimation of the c i Since P is unknown, so are the ideal critical values c i. We sugggest a bootstrap method to estimate the c i : ˆP n is an unrestricted estimate of P with θ i ( ˆP n )= ˆθ n,i X is generated from ˆP n and the T n,i are computed from X, but centered at ˆθ n,i rather than at θ 0,i E.g., for one-sided testing: T n,i =(ˆθ n,i ˆθ n,i )/ ˆσ n,i Important detail: The ordering of the original T n,i determines the true null hypotheses in the bootstrap world The permutation{k 1,...,k s } of {1,...,s} is defined such that T n,k1 = T n,(1),...,t n,ks = T n,(s) Then T n,r:t is the rth smallest of the statistics T n,k 1,...,T n,k t
19 Estimation of the c i (continued) Start with c 1 : ĉ 1 = inf{x R: 1 s ˆP n {T n,1:1 x} α}
20 Estimation of the c i (continued) Start with c 1 : ĉ 1 = inf{x R: 1 s ˆP n {T n,1:1 x} α} Then move on to c 2 : ĉ 2 is the smallest x R for which 1 s 1 ˆP n {T n,2:2 x,t n,1:2 < ĉ 1 }+ 2 s ˆP n {T n,2:2 x,t n,1:2 ĉ 1 } α
21 Estimation of the c i (continued) Start with c 1 : ĉ 1 = inf{x R: 1 s ˆP n {T n,1:1 x} α} Then move on to c 2 : ĉ 2 is the smallest x R for which 1 s 1 ˆP n {T n,2:2 x,t n,1:2 < ĉ 1 }+ 2 s ˆP n {T n,2:2 x,t n,1:2 ĉ 1 } α And so forth... Unlike Troendle (2000), monotonicity ĉ i+1 ĉ i is not enforced.
22 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
23 Some Theory Assumptions (1) The sampling distribution of n( ˆθ n θ) under P converges to a limit distribution with continuous marginals (2) The bootstrap consistently estimates this limit distribution (3) n ˆσ n,i and n ˆσ n,i converge to the same constant in probability (for i=1,...,s) (4) The limiting joint distribution corresponding to the true test statistics is exchangeable
24 Some Theory Assumptions (1) The sampling distribution of n( ˆθ n θ) under P converges to a limit distribution with continuous marginals (2) The bootstrap consistently estimates this limit distribution (3) n ˆσ n,i and n ˆσ n,i converge to the same constant in probability (for i=1,...,s) (4) The limiting joint distribution corresponding to the true test statistics is exchangeable Theorem (i) Any false H i will be rejected with p. 1 as n (ii) The method asymptotically controls the FDR at level α
25 Some Practice Assumption (4) is somewhat restrictive (though less restrictive than an assumption of independence) But simulations indicate that the method appears robust to different limiting variances of the true test statistics different limiting correlations of the true test statistics
26 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
27 Set-Up I Data generating process and testing problem: I.i.d. random vectors from N(θ,Σ), with n=100 θ i = 0 or θ i = 0.2, with s=50 Σ has constant correlation ρ H i : θ i 0 vs. H i : θ i > 0 T n,i is the usual t-statistic Methods considered: (BH) Benjamini and Hochberg (1995) (STS) Storey et al. (2004) with λ = 0.5 (BKY) Benjamini et al. (2006) (Boot) Bootstrap method Criteria: Empirical FDR (nominal level α = 10%) Average number of true rejections
28 Results I ρ = 0 ρ = 0.9 BH STS BKY Boot BH STS BKY Boot All θ i = 0 Control Rejected Ten θ i = 0.2 Control Rejected Twenty five θ i = 0.2 Control Rejected All θ i = 0.2 Control Rejected
29 Set-Up II Data generating process and testing problem: I.i.d. random vectors from N(θ,Σ), with n=100 Three θ i = 0 and one θ i = 0.2 or 20, with s=4 Σ is a random correlation matrix: take 1,000 draws H i : θ i 0 vs. H i : θ i > 0 T n,i is the usual t-statistic Methods considered: (BH) Benjamini and Hochberg (1995) (STS) Storey et al. (2004) with λ = 0.5 (BKY) Benjamini et al. (2006) (Boot) Bootstrap method Criteria: Boxplot of empirical FDRs (nominal level α = 10%)
30 Results II Realized FDRs: one theta = BH STS BKY Boot Realized FDRs: one theta = BH STS BKY Boot
31 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
32 Conclusions Methodology: Bootstrap method implicitly accounts for the dependence structure of the test statistics Extended the approach of Troendle (2000) to non-normal data
33 Conclusions Methodology: Bootstrap method implicitly accounts for the dependence structure of the test statistics Extended the approach of Troendle (2000) to non-normal data Advantages: Appears more powerful than current competitors At least compared to those, that are also robust against various dependence structures
34 Conclusions Methodology: Bootstrap method implicitly accounts for the dependence structure of the test statistics Extended the approach of Troendle (2000) to non-normal data Advantages: Appears more powerful than current competitors At least compared to those, that are also robust against various dependence structures Disadvantage: Computationally more expensive than methods based on the individual p-values Should be considered negligible this day and age
35 Outline 1 Problem Formulation 2 Existing Methods 3 New Method 4 Theory & Practice 5 Simulations 6 Conclusions 7 References
36 References Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B, 57(1): Benjamini, Y., Krieger, A. M., and Yekutieli, D. (2006). Adaptive linear step-up procedures that control the false discovery rate. Biometrika, 93(3): Benjamini, Y. and Yekutieli, D. (2001). The control of the false discovery rate in multiple testing under dependency. Annals of Statistics, 29(4): Storey, J. D., Taylor, J. E., and Siegmund, D. (2004). Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. Journal of the Royal Statistical Society, Series B, 66(1): Troendle, J. F. (2000). Stepwise normal theory test procedures controlling the false discovery rate. Journal of Statistical Planning and Inference, 84(1):
Control of the False Discovery Rate under Dependence using the Bootstrap and Subsampling
Institute for Empirical Research in Economics University of Zurich Working Paper Series ISSN 1424-0459 Working Paper No. 337 Control of the False Discovery Rate under Dependence using the Bootstrap and
More informationRejoinder on: Control of the false discovery rate under dependence using the bootstrap and subsampling
Test (2008) 17: 461 471 DOI 10.1007/s11749-008-0134-6 DISCUSSION Rejoinder on: Control of the false discovery rate under dependence using the bootstrap and subsampling Joseph P. Romano Azeem M. Shaikh
More informationControl of Generalized Error Rates in Multiple Testing
Institute for Empirical Research in Economics University of Zurich Working Paper Series ISSN 1424-0459 Working Paper No. 245 Control of Generalized Error Rates in Multiple Testing Joseph P. Romano and
More informationComments on: Control of the false discovery rate under dependence using the bootstrap and subsampling
Test (2008) 17: 443 445 DOI 10.1007/s11749-008-0127-5 DISCUSSION Comments on: Control of the false discovery rate under dependence using the bootstrap and subsampling José A. Ferreira Mark A. van de Wiel
More informationMultiple Testing of One-Sided Hypotheses: Combining Bonferroni and the Bootstrap
University of Zurich Department of Economics Working Paper Series ISSN 1664-7041 (print) ISSN 1664-705X (online) Working Paper No. 254 Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and
More informationHeterogeneity and False Discovery Rate Control
Heterogeneity and False Discovery Rate Control Joshua D Habiger Oklahoma State University jhabige@okstateedu URL: jdhabigerokstateedu August, 2014 Motivating Data: Anderson and Habiger (2012) M = 778 bacteria
More informationModified Simes Critical Values Under Positive Dependence
Modified Simes Critical Values Under Positive Dependence Gengqian Cai, Sanat K. Sarkar Clinical Pharmacology Statistics & Programming, BDS, GlaxoSmithKline Statistics Department, Temple University, Philadelphia
More informationOn adaptive procedures controlling the familywise error rate
, pp. 3 On adaptive procedures controlling the familywise error rate By SANAT K. SARKAR Temple University, Philadelphia, PA 922, USA sanat@temple.edu Summary This paper considers the problem of developing
More informationPROCEDURES CONTROLLING THE k-fdr USING. BIVARIATE DISTRIBUTIONS OF THE NULL p-values. Sanat K. Sarkar and Wenge Guo
PROCEDURES CONTROLLING THE k-fdr USING BIVARIATE DISTRIBUTIONS OF THE NULL p-values Sanat K. Sarkar and Wenge Guo Temple University and National Institute of Environmental Health Sciences Abstract: Procedures
More informationExact and Approximate Stepdown Methods For Multiple Hypothesis Testing
Exact and Approximate Stepdown Methods For Multiple Hypothesis Testing Joseph P. Romano Department of Statistics Stanford University Michael Wolf Department of Economics and Business Universitat Pompeu
More informationOn Methods Controlling the False Discovery Rate 1
Sankhyā : The Indian Journal of Statistics 2008, Volume 70-A, Part 2, pp. 135-168 c 2008, Indian Statistical Institute On Methods Controlling the False Discovery Rate 1 Sanat K. Sarkar Temple University,
More informationON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao
ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE By Wenge Guo and M. Bhaskara Rao National Institute of Environmental Health Sciences and University of Cincinnati A classical approach for dealing
More informationarxiv: v1 [math.st] 31 Mar 2009
The Annals of Statistics 2009, Vol. 37, No. 2, 619 629 DOI: 10.1214/07-AOS586 c Institute of Mathematical Statistics, 2009 arxiv:0903.5373v1 [math.st] 31 Mar 2009 AN ADAPTIVE STEP-DOWN PROCEDURE WITH PROVEN
More informationApplying the Benjamini Hochberg procedure to a set of generalized p-values
U.U.D.M. Report 20:22 Applying the Benjamini Hochberg procedure to a set of generalized p-values Fredrik Jonsson Department of Mathematics Uppsala University Applying the Benjamini Hochberg procedure
More informationSTEPDOWN PROCEDURES CONTROLLING A GENERALIZED FALSE DISCOVERY RATE. National Institute of Environmental Health Sciences and Temple University
STEPDOWN PROCEDURES CONTROLLING A GENERALIZED FALSE DISCOVERY RATE Wenge Guo 1 and Sanat K. Sarkar 2 National Institute of Environmental Health Sciences and Temple University Abstract: Often in practice
More informationSTEPUP PROCEDURES FOR CONTROL OF GENERALIZATIONS OF THE FAMILYWISE ERROR RATE
AOS imspdf v.2006/05/02 Prn:4/08/2006; 11:19 F:aos0169.tex; (Lina) p. 1 The Annals of Statistics 2006, Vol. 0, No. 00, 1 26 DOI: 10.1214/009053606000000461 Institute of Mathematical Statistics, 2006 STEPUP
More informationA Simple, Graphical Procedure for Comparing Multiple Treatment Effects
A Simple, Graphical Procedure for Comparing Multiple Treatment Effects Brennan S. Thompson and Matthew D. Webb May 15, 2015 > Abstract In this paper, we utilize a new graphical
More informationStep-down FDR Procedures for Large Numbers of Hypotheses
Step-down FDR Procedures for Large Numbers of Hypotheses Paul N. Somerville University of Central Florida Abstract. Somerville (2004b) developed FDR step-down procedures which were particularly appropriate
More informationChapter 1. Stepdown Procedures Controlling A Generalized False Discovery Rate
Chapter Stepdown Procedures Controlling A Generalized False Discovery Rate Wenge Guo and Sanat K. Sarkar Biostatistics Branch, National Institute of Environmental Health Sciences, Research Triangle Park,
More informationImproving the Performance of the FDR Procedure Using an Estimator for the Number of True Null Hypotheses
Improving the Performance of the FDR Procedure Using an Estimator for the Number of True Null Hypotheses Amit Zeisel, Or Zuk, Eytan Domany W.I.S. June 5, 29 Amit Zeisel, Or Zuk, Eytan Domany (W.I.S.)Improving
More informationTwo-stage stepup procedures controlling FDR
Journal of Statistical Planning and Inference 38 (2008) 072 084 www.elsevier.com/locate/jspi Two-stage stepup procedures controlling FDR Sanat K. Sarar Department of Statistics, Temple University, Philadelphia,
More informationA GENERAL DECISION THEORETIC FORMULATION OF PROCEDURES CONTROLLING FDR AND FNR FROM A BAYESIAN PERSPECTIVE
A GENERAL DECISION THEORETIC FORMULATION OF PROCEDURES CONTROLLING FDR AND FNR FROM A BAYESIAN PERSPECTIVE Sanat K. Sarkar 1, Tianhui Zhou and Debashis Ghosh Temple University, Wyeth Pharmaceuticals and
More informationSanat Sarkar Department of Statistics, Temple University Philadelphia, PA 19122, U.S.A. September 11, Abstract
Adaptive Controls of FWER and FDR Under Block Dependence arxiv:1611.03155v1 [stat.me] 10 Nov 2016 Wenge Guo Department of Mathematical Sciences New Jersey Institute of Technology Newark, NJ 07102, U.S.A.
More informationTable of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. T=number of type 2 errors
The Multiple Testing Problem Multiple Testing Methods for the Analysis of Microarray Data 3/9/2009 Copyright 2009 Dan Nettleton Suppose one test of interest has been conducted for each of m genes in a
More informationExact and Approximate Stepdown Methods for Multiple Hypothesis Testing
Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing Joseph P. ROMANO and Michael WOLF Consider the problem of testing k hypotheses simultaneously. In this article we discuss finite-
More informationFALSE DISCOVERY AND FALSE NONDISCOVERY RATES IN SINGLE-STEP MULTIPLE TESTING PROCEDURES 1. BY SANAT K. SARKAR Temple University
The Annals of Statistics 2006, Vol. 34, No. 1, 394 415 DOI: 10.1214/009053605000000778 Institute of Mathematical Statistics, 2006 FALSE DISCOVERY AND FALSE NONDISCOVERY RATES IN SINGLE-STEP MULTIPLE TESTING
More informationMultiple Testing. Hoang Tran. Department of Statistics, Florida State University
Multiple Testing Hoang Tran Department of Statistics, Florida State University Large-Scale Testing Examples: Microarray data: testing differences in gene expression between two traits/conditions Microbiome
More informationLecture 7 April 16, 2018
Stats 300C: Theory of Statistics Spring 2018 Lecture 7 April 16, 2018 Prof. Emmanuel Candes Scribe: Feng Ruan; Edited by: Rina Friedberg, Junjie Zhu 1 Outline Agenda: 1. False Discovery Rate (FDR) 2. Properties
More informationTesting Jumps via False Discovery Rate Control
Testing Jumps via False Discovery Rate Control Yu-Min Yen August 12, 2011 Abstract Many recently developed nonparametric jump tests can be viewed as multiple hypothesis testing problems. For such multiple
More informationFalse discovery control for multiple tests of association under general dependence
False discovery control for multiple tests of association under general dependence Nicolai Meinshausen Seminar für Statistik ETH Zürich December 2, 2004 Abstract We propose a confidence envelope for false
More informationChristopher J. Bennett
P- VALUE ADJUSTMENTS FOR ASYMPTOTIC CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE by Christopher J. Bennett Working Paper No. 09-W05 April 2009 DEPARTMENT OF ECONOMICS VANDERBILT UNIVERSITY NASHVILLE,
More informationFamilywise Error Rate Controlling Procedures for Discrete Data
Familywise Error Rate Controlling Procedures for Discrete Data arxiv:1711.08147v1 [stat.me] 22 Nov 2017 Yalin Zhu Center for Mathematical Sciences, Merck & Co., Inc., West Point, PA, U.S.A. Wenge Guo Department
More informationLarge-Scale Multiple Testing of Correlations
Large-Scale Multiple Testing of Correlations T. Tony Cai and Weidong Liu Abstract Multiple testing of correlations arises in many applications including gene coexpression network analysis and brain connectivity
More informationFalse Discovery Rate
False Discovery Rate Peng Zhao Department of Statistics Florida State University December 3, 2018 Peng Zhao False Discovery Rate 1/30 Outline 1 Multiple Comparison and FWER 2 False Discovery Rate 3 FDR
More informationFDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES
FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES Sanat K. Sarkar a a Department of Statistics, Temple University, Speakman Hall (006-00), Philadelphia, PA 19122, USA Abstract The concept
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationThe Pennsylvania State University The Graduate School Eberly College of Science GENERALIZED STEPWISE PROCEDURES FOR
The Pennsylvania State University The Graduate School Eberly College of Science GENERALIZED STEPWISE PROCEDURES FOR CONTROLLING THE FALSE DISCOVERY RATE A Dissertation in Statistics by Scott Roths c 2011
More informationControlling the False Discovery Rate in Two-Stage. Combination Tests for Multiple Endpoints
Controlling the False Discovery Rate in Two-Stage Combination Tests for Multiple ndpoints Sanat K. Sarkar, Jingjing Chen and Wenge Guo May 29, 2011 Sanat K. Sarkar is Professor and Senior Research Fellow,
More informationA Simple, Graphical Procedure for Comparing. Multiple Treatment Effects
A Simple, Graphical Procedure for Comparing Multiple Treatment Effects Brennan S. Thompson and Matthew D. Webb October 26, 2015 > Abstract In this paper, we utilize a new
More informationIncorporation of Sparsity Information in Large-scale Multiple Two-sample t Tests
Incorporation of Sparsity Information in Large-scale Multiple Two-sample t Tests Weidong Liu October 19, 2014 Abstract Large-scale multiple two-sample Student s t testing problems often arise from the
More informationNew Procedures for False Discovery Control
New Procedures for False Discovery Control Christopher R. Genovese Department of Statistics Carnegie Mellon University http://www.stat.cmu.edu/ ~ genovese/ Elisha Merriam Department of Neuroscience University
More informationOn Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses
On Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses Gavin Lynch Catchpoint Systems, Inc., 228 Park Ave S 28080 New York, NY 10003, U.S.A. Wenge Guo Department of Mathematical
More informationResearch Article Sample Size Calculation for Controlling False Discovery Proportion
Probability and Statistics Volume 2012, Article ID 817948, 13 pages doi:10.1155/2012/817948 Research Article Sample Size Calculation for Controlling False Discovery Proportion Shulian Shang, 1 Qianhe Zhou,
More informationControlling Bayes Directional False Discovery Rate in Random Effects Model 1
Controlling Bayes Directional False Discovery Rate in Random Effects Model 1 Sanat K. Sarkar a, Tianhui Zhou b a Temple University, Philadelphia, PA 19122, USA b Wyeth Pharmaceuticals, Collegeville, PA
More informationSTAT 461/561- Assignments, Year 2015
STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and
More informationFalse discovery rate and related concepts in multiple comparisons problems, with applications to microarray data
False discovery rate and related concepts in multiple comparisons problems, with applications to microarray data Ståle Nygård Trial Lecture Dec 19, 2008 1 / 35 Lecture outline Motivation for not using
More informationStatistical Applications in Genetics and Molecular Biology
Statistical Applications in Genetics and Molecular Biology Volume 5, Issue 1 2006 Article 28 A Two-Step Multiple Comparison Procedure for a Large Number of Tests and Multiple Treatments Hongmei Jiang Rebecca
More informationInference for Identifiable Parameters in Partially Identified Econometric Models
Inference for Identifiable Parameters in Partially Identified Econometric Models Joseph P. Romano Department of Statistics Stanford University romano@stat.stanford.edu Azeem M. Shaikh Department of Economics
More informationON THE UNIFORM ASYMPTOTIC VALIDITY OF SUBSAMPLING AND THE BOOTSTRAP. Joseph P. Romano Azeem M. Shaikh
ON THE UNIFORM ASYMPTOTIC VALIDITY OF SUBSAMPLING AND THE BOOTSTRAP By Joseph P. Romano Azeem M. Shaikh Technical Report No. 2010-03 April 2010 Department of Statistics STANFORD UNIVERSITY Stanford, California
More informationHigh-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018
High-Throughput Sequencing Course Multiple Testing Biostatistics and Bioinformatics Summer 2018 Introduction You have previously considered the significance of a single gene Introduction You have previously
More informationA NEW APPROACH FOR LARGE SCALE MULTIPLE TESTING WITH APPLICATION TO FDR CONTROL FOR GRAPHICALLY STRUCTURED HYPOTHESES
A NEW APPROACH FOR LARGE SCALE MULTIPLE TESTING WITH APPLICATION TO FDR CONTROL FOR GRAPHICALLY STRUCTURED HYPOTHESES By Wenge Guo Gavin Lynch Joseph P. Romano Technical Report No. 2018-06 September 2018
More informationAvoiding data snooping in multilevel and mixed effects models
J. R. Statist. Soc. A (2007) 170, Part 4, pp. 1035 1059 Avoiding data snooping in multilevel and mixed effects models David Afshartous University of Miami, Coral Gables, USA and Michael Wolf University
More informationControl of Directional Errors in Fixed Sequence Multiple Testing
Control of Directional Errors in Fixed Sequence Multiple Testing Anjana Grandhi Department of Mathematical Sciences New Jersey Institute of Technology Newark, NJ 07102-1982 Wenge Guo Department of Mathematical
More informationWeb-based Supplementary Materials for. of the Null p-values
Web-based Supplementary Materials for ocedures Controlling the -FDR Using Bivariate Distributions of the Null p-values Sanat K Sarar and Wenge Guo Temple University and National Institute of Environmental
More informationA multiple testing procedure for input variable selection in neural networks
A multiple testing procedure for input variable selection in neural networks MicheleLaRoccaandCiraPerna Department of Economics and Statistics - University of Salerno Via Ponte Don Melillo, 84084, Fisciano
More informationHunting for significance with multiple testing
Hunting for significance with multiple testing Etienne Roquain 1 1 Laboratory LPMA, Université Pierre et Marie Curie (Paris 6), France Séminaire MODAL X, 19 mai 216 Etienne Roquain Hunting for significance
More informationNon-specific filtering and control of false positives
Non-specific filtering and control of false positives Richard Bourgon 16 June 2009 bourgon@ebi.ac.uk EBI is an outstation of the European Molecular Biology Laboratory Outline Multiple testing I: overview
More informationWeighted Adaptive Multiple Decision Functions for False Discovery Rate Control
Weighted Adaptive Multiple Decision Functions for False Discovery Rate Control Joshua D. Habiger Oklahoma State University jhabige@okstate.edu Nov. 8, 2013 Outline 1 : Motivation and FDR Research Areas
More information3 Comparison with Other Dummy Variable Methods
Stats 300C: Theory of Statistics Spring 2018 Lecture 11 April 25, 2018 Prof. Emmanuel Candès Scribe: Emmanuel Candès, Michael Celentano, Zijun Gao, Shuangning Li 1 Outline Agenda: Knockoffs 1. Introduction
More informationlarge number of i.i.d. observations from P. For concreteness, suppose
1 Subsampling Suppose X i, i = 1,..., n is an i.i.d. sequence of random variables with distribution P. Let θ(p ) be some real-valued parameter of interest, and let ˆθ n = ˆθ n (X 1,..., X n ) be some estimate
More informationTO HOW MANY SIMULTANEOUS HYPOTHESIS TESTS CAN NORMAL, STUDENT S t OR BOOTSTRAP CALIBRATION BE APPLIED? Jianqing Fan Peter Hall Qiwei Yao
TO HOW MANY SIMULTANEOUS HYPOTHESIS TESTS CAN NORMAL, STUDENT S t OR BOOTSTRAP CALIBRATION BE APPLIED? Jianqing Fan Peter Hall Qiwei Yao ABSTRACT. In the analysis of microarray data, and in some other
More informationLarge-Scale Multiple Testing of Correlations
University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 5-5-2016 Large-Scale Multiple Testing of Correlations T. Tony Cai University of Pennsylvania Weidong Liu Follow this
More informationProcedures controlling generalized false discovery rate
rocedures controlling generalized false discovery rate By SANAT K. SARKAR Department of Statistics, Temple University, hiladelphia, A 922, U.S.A. sanat@temple.edu AND WENGE GUO Department of Environmental
More informationConsonance and the Closure Method in Multiple Testing. Institute for Empirical Research in Economics University of Zurich
Institute for Empirical Research in Economics University of Zurich Working Paper Series ISSN 1424-0459 Working Paper No. 446 Consonance and the Closure Method in Multiple Testing Joseph P. Romano, Azeem
More informationJournal of Statistical Software
JSS Journal of Statistical Software MMMMMM YYYY, Volume VV, Issue II. doi: 10.18637/jss.v000.i00 GroupTest: Multiple Testing Procedure for Grouped Hypotheses Zhigen Zhao Abstract In the modern Big Data
More informationPHASE TRANSITION AND REGULARIZED BOOTSTRAP IN LARGE-SCALE t-tests WITH FALSE DISCOVERY RATE CONTROL
The Annals of Statistics 2014, Vol. 42, No. 5, 2003 2025 DOI: 10.1214/14-AOS1249 Institute of Mathematical Statistics, 2014 PHASE TRANSITION AND REGULARIZED BOOTSTRAP IN LARGE-SCALE t-tests WITH FALSE
More informationarxiv:math/ v1 [math.st] 29 Dec 2006 Jianqing Fan Peter Hall Qiwei Yao
TO HOW MANY SIMULTANEOUS HYPOTHESIS TESTS CAN NORMAL, STUDENT S t OR BOOTSTRAP CALIBRATION BE APPLIED? arxiv:math/0701003v1 [math.st] 29 Dec 2006 Jianqing Fan Peter Hall Qiwei Yao ABSTRACT. In the analysis
More informationGeneralized estimators for multiple testing: proportion of true nulls and false discovery rate by. Xiongzhi Chen and R.W. Doerge
Generalized estimators for multiple testing: proportion of true nulls and false discovery rate by Xiongzhi Chen and R.W. Doerge Department of Statistics, Purdue University, West Lafayette, USA. Technical
More informationDepartment of Statistics University of Central Florida. Technical Report TR APR2007 Revised 25NOV2007
Department of Statistics University of Central Florida Technical Report TR-2007-01 25APR2007 Revised 25NOV2007 Controlling the Number of False Positives Using the Benjamini- Hochberg FDR Procedure Paul
More informationHigh-throughput Testing
High-throughput Testing Noah Simon and Richard Simon July 2016 1 / 29 Testing vs Prediction On each of n patients measure y i - single binary outcome (eg. progression after a year, PCR) x i - p-vector
More informationClosure properties of classes of multiple testing procedures
AStA Adv Stat Anal (2018) 102:167 178 https://doi.org/10.1007/s10182-017-0297-0 ORIGINAL PAPER Closure properties of classes of multiple testing procedures Georg Hahn 1 Received: 28 June 2016 / Accepted:
More informationA Large-Sample Approach to Controlling the False Discovery Rate
A Large-Sample Approach to Controlling the False Discovery Rate Christopher R. Genovese Department of Statistics Carnegie Mellon University Larry Wasserman Department of Statistics Carnegie Mellon University
More informationSOME STEP-DOWN PROCEDURES CONTROLLING THE FALSE DISCOVERY RATE UNDER DEPENDENCE
Statistica Sinica 18(2008), 881-904 SOME STEP-DOWN PROCEDURES CONTROLLING THE FALSE DISCOVERY RATE UNDER DEPENDENCE Yongchao Ge 1, Stuart C. Sealfon 1 and Terence P. Speed 2,3 1 Mount Sinai School of Medicine,
More informationThe International Journal of Biostatistics
The International Journal of Biostatistics Volume 7, Issue 1 2011 Article 12 Consonance and the Closure Method in Multiple Testing Joseph P. Romano, Stanford University Azeem Shaikh, University of Chicago
More informationControlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method
Controlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method Christopher R. Genovese Department of Statistics Carnegie Mellon University joint work with Larry Wasserman
More informationEstimation of a Two-component Mixture Model
Estimation of a Two-component Mixture Model Bodhisattva Sen 1,2 University of Cambridge, Cambridge, UK Columbia University, New York, USA Indian Statistical Institute, Kolkata, India 6 August, 2012 1 Joint
More informationOn the Uniform Asymptotic Validity of Subsampling and the Bootstrap
On the Uniform Asymptotic Validity of Subsampling and the Bootstrap Joseph P. Romano Departments of Economics and Statistics Stanford University romano@stanford.edu Azeem M. Shaikh Department of Economics
More informationSimultaneous Testing of Grouped Hypotheses: Finding Needles in Multiple Haystacks
University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 2009 Simultaneous Testing of Grouped Hypotheses: Finding Needles in Multiple Haystacks T. Tony Cai University of Pennsylvania
More informationStatistical Applications in Genetics and Molecular Biology
Statistical Applications in Genetics and Molecular Biology Volume 3, Issue 1 2004 Article 14 Multiple Testing. Part II. Step-Down Procedures for Control of the Family-Wise Error Rate Mark J. van der Laan
More informationFalse Discovery Control in Spatial Multiple Testing
False Discovery Control in Spatial Multiple Testing WSun 1,BReich 2,TCai 3, M Guindani 4, and A. Schwartzman 2 WNAR, June, 2012 1 University of Southern California 2 North Carolina State University 3 University
More informationCentral Limit Theorem ( 5.3)
Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately
More informationBiostatistics Advanced Methods in Biostatistics IV
Biostatistics 140.754 Advanced Methods in Biostatistics IV Jeffrey Leek Assistant Professor Department of Biostatistics jleek@jhsph.edu Lecture 11 1 / 44 Tip + Paper Tip: Two today: (1) Graduate school
More informationAdvanced Statistics II: Non Parametric Tests
Advanced Statistics II: Non Parametric Tests Aurélien Garivier ParisTech February 27, 2011 Outline Fitting a distribution Rank Tests for the comparison of two samples Two unrelated samples: Mann-Whitney
More informationarxiv: v3 [math.st] 15 Jul 2018
A New Step-down Procedure for Simultaneous Hypothesis Testing Under Dependence arxiv:1503.08923v3 [math.st] 15 Jul 2018 Contents Prasenjit Ghosh 1 and Arijit Chakrabarti 2 1 Department of Statistics, Presidency
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will
More informationMultiple Testing in Growth Econometrics
Multiple Testing in Growth Econometrics Thomas Deckers Universität Bonn Christoph Hanck Rijksuniversiteit Groningen January 8, 2010 Abstract This paper discusses two longstanding questions in growth econometrics
More informationFDR and ROC: Similarities, Assumptions, and Decisions
EDITORIALS 8 FDR and ROC: Similarities, Assumptions, and Decisions. Why FDR and ROC? It is a privilege to have been asked to introduce this collection of papers appearing in Statistica Sinica. The papers
More informationThe miss rate for the analysis of gene expression data
Biostatistics (2005), 6, 1,pp. 111 117 doi: 10.1093/biostatistics/kxh021 The miss rate for the analysis of gene expression data JONATHAN TAYLOR Department of Statistics, Stanford University, Stanford,
More informationSequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process
Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3083-3093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko Helmut-Schmidt University Hamburg University
More informationRobust Performance Hypothesis Testing with the Variance. Institute for Empirical Research in Economics University of Zurich
Institute for Empirical Research in Economics University of Zurich Working Paper Series ISSN 1424-0459 Working Paper No. 516 Robust Performance Hypothesis Testing with the Variance Olivier Ledoit and Michael
More informationExceedance Control of the False Discovery Proportion Christopher Genovese 1 and Larry Wasserman 2 Carnegie Mellon University July 10, 2004
Exceedance Control of the False Discovery Proportion Christopher Genovese 1 and Larry Wasserman 2 Carnegie Mellon University July 10, 2004 Multiple testing methods to control the False Discovery Rate (FDR),
More informationEstimating False Discovery Proportion Under Arbitrary Covariance Dependence
Estimating False Discovery Proportion Under Arbitrary Covariance Dependence arxiv:1010.6056v2 [stat.me] 15 Nov 2011 Jianqing Fan, Xu Han and Weijie Gu May 31, 2018 Abstract Multiple hypothesis testing
More informationMarginal Screening and Post-Selection Inference
Marginal Screening and Post-Selection Inference Ian McKeague August 13, 2017 Ian McKeague (Columbia University) Marginal Screening August 13, 2017 1 / 29 Outline 1 Background on Marginal Screening 2 2
More informationStatistical testing. Samantha Kleinberg. October 20, 2009
October 20, 2009 Intro to significance testing Significance testing and bioinformatics Gene expression: Frequently have microarray data for some group of subjects with/without the disease. Want to find
More informationMultiple hypothesis testing using the excess discovery count and alpha-investing rules
Multiple hypothesis testing using the excess discovery count and alpha-investing rules Dean P. Foster and Robert A. Stine Department of Statistics The Wharton School of the University of Pennsylvania Philadelphia,
More informationEMPIRICAL BAYES METHODS FOR ESTIMATION AND CONFIDENCE INTERVALS IN HIGH-DIMENSIONAL PROBLEMS
Statistica Sinica 19 (2009), 125-143 EMPIRICAL BAYES METHODS FOR ESTIMATION AND CONFIDENCE INTERVALS IN HIGH-DIMENSIONAL PROBLEMS Debashis Ghosh Penn State University Abstract: There is much recent interest
More informationThis paper has been submitted for consideration for publication in Biometrics
BIOMETRICS, 1 10 Supplementary material for Control with Pseudo-Gatekeeping Based on a Possibly Data Driven er of the Hypotheses A. Farcomeni Department of Public Health and Infectious Diseases Sapienza
More informationLet us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided
Let us first identify some classes of hypotheses. simple versus simple H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided H 0 : θ θ 0 versus H 1 : θ > θ 0. (2) two-sided; null on extremes H 0 : θ θ 1 or
More informationDoing Cosmology with Balls and Envelopes
Doing Cosmology with Balls and Envelopes Christopher R. Genovese Department of Statistics Carnegie Mellon University http://www.stat.cmu.edu/ ~ genovese/ Larry Wasserman Department of Statistics Carnegie
More informationEstimation of the False Discovery Rate
Estimation of the False Discovery Rate Coffee Talk, Bioinformatics Research Center, Sept, 2005 Jason A. Osborne, osborne@stat.ncsu.edu Department of Statistics, North Carolina State University 1 Outline
More information