Bootstrap tests. Patrick Breheny. October 11. Bootstrap vs. permutation tests Testing for equality of location


Bootstrap tests

Patrick Breheny

October 11

STA 621: Nonparametric Statistics

Introduction

Conditioning on the observed data to obtain permutation tests is an important idea and a useful tool for hypothesis testing.

Once we condition on the observed data, we can test whether the data are independently and identically drawn from the same distribution by permuting the observed values or, equivalently, drawing without replacement from the observed values.

One could also think about drawing with replacement from the observed values, i.e., carrying out a bootstrap test.
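The distinction between the two resampling schemes can be illustrated with a minimal sketch. The data values, the seed, and the choice to resample from the pooled sample are assumptions made for illustration only; they are not taken from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)  # seed fixed only so the sketch is reproducible

# Hypothetical two-sample data (illustrative values)
x = np.array([1.2, 3.4, 2.2, 5.1])
y = np.array([0.7, 2.9, 4.0])
pooled = np.concatenate([x, y])
n = len(x)

# Permutation resample: a reordering of the pooled values (no replacement),
# so every observed value appears exactly once.
perm = rng.permutation(pooled)
x_perm, y_perm = perm[:n], perm[n:]

# Bootstrap resample: n + m draws with replacement from the pooled values,
# so some values may repeat and others may be missing.
boot = rng.choice(pooled, size=pooled.size, replace=True)
x_boot, y_boot = boot[:n], boot[n:]
```

In both cases the resampled groups keep the original sizes; only the sampling scheme differs.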

Advantages of permutation testing

What are the ramifications of this difference?

On the one hand, the p-values of a permutation test are exact conditional probabilities (up to computational limits) for all sample sizes.

Permutation tests make no effort to estimate the common distribution $F$; it is treated as a nuisance parameter.

In contrast, a bootstrap test estimates $F$ using the empirical cdf and draws from $\hat{F}$ to estimate p-values.

Bootstrap tests therefore have no interpretation as exact probabilities for any sample size.
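To make "exact conditional probability" concrete, here is a small sketch that enumerates every way of splitting the pooled sample into two groups. The function name `exact_permutation_pvalue`, the difference-in-means statistic, and the two-sided comparison are assumptions chosen for illustration; full enumeration is feasible only for small samples, which is the computational limit mentioned above.

```python
from itertools import combinations

import numpy as np


def exact_permutation_pvalue(x, y):
    """Two-sided exact permutation p-value for the difference in means.

    Hypothetical helper: enumerates all C(n+m, n) relabelings of the pooled data.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    pooled = np.concatenate([x, y])
    n, m = len(x), len(y)
    total = pooled.sum()
    obs = abs(x.mean() - y.mean())

    count = 0
    n_splits = 0
    for idx in combinations(range(n + m), n):
        sx = pooled[list(idx)].sum()
        diff = sx / n - (total - sx) / m
        # Small tolerance guards against floating-point ties
        if abs(diff) >= obs - 1e-12:
            count += 1
        n_splits += 1
    return count / n_splits
```

For x = (1, 2, 3) and y = (4, 5, 6) there are 20 possible splits, and only the observed split and its mirror image give a difference at least as extreme, so the function returns 2/20 = 0.1.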

Advantages of bootstrap testing

However, bootstrap p-values are guaranteed to be accurate as the sample size becomes large.

Furthermore, bootstrap tests are more general and can be applied to a wider range of hypotheses.

In contrast, permutation tests are typically limited to relatively simple hypotheses like $H_0$, $H_1$, and $H_2$.

Testing for equality of means

For example, consider again the two-sample problem where $X \sim F$ and $Y \sim G$, but instead of testing $H_0: F = G$, we wish to test $H_0: E(X) = E(Y)$.

In other words, $F$ and $G$ may differ in other ways (for example, $V(X) \neq V(Y)$), but as long as $E(X) = E(Y)$, the null hypothesis is still satisfied.

This hypothesis cannot be tested with a permutation test, but can be tested with a bootstrap approach.

Bootstrap approach

The bootstrap test proceeds by estimating $F$ and $G$ subject to the restriction that they must both have the same mean.

Specifically, let $\hat{F}_0$ put equal probability on the points $\tilde{x}_i = x_i - \bar{x} + \bar{z}$ and $\hat{G}_0$ put equal probability on the points $\tilde{y}_i = y_i - \bar{y} + \bar{z}$, where $\bar{z}$ is the mean of the combined sample.

This leaves everything about the estimated distributions of $X$ and $Y$ the same, with the exception of their locations, which are constrained to have the same center, $\bar{z}$.
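The recentering step can be sketched in a few lines. The function name `shift_to_common_mean` is hypothetical and introduced only for this illustration.

```python
import numpy as np


def shift_to_common_mean(x, y):
    """Recenter both samples at the combined-sample mean (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    z_bar = np.concatenate([x, y]).mean()  # mean of the combined sample
    x_tilde = x - x.mean() + z_bar         # support points of F-hat_0
    y_tilde = y - y.mean() + z_bar         # support points of G-hat_0
    return x_tilde, y_tilde


# Illustrative data with different spreads and different means
x_tilde, y_tilde = shift_to_common_mean([1.0, 2.0, 6.0], [10.0, 10.5, 11.0, 12.5])
```

After the shift both samples have mean $\bar{z}$, while their variances (and every other feature of shape) are unchanged, so the null hypothesis of equal means holds for the recentered data without forcing equal variances.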

Bootstrap p-value: Procedure

(1) Form $B$ bootstrap data sets $(x^*, y^*)$ by sampling $x^*$ with replacement from $\tilde{x}$ and $y^*$ with replacement from $\tilde{y}$.

(2) Evaluate $t^*_b$ for each data set $b = 1, \ldots, B$:

$$t^*_b = \frac{\bar{x}^*_b - \bar{y}^*_b}{\sqrt{\hat{\sigma}^2_{xb}/n + \hat{\sigma}^2_{yb}/m}},$$

where $\bar{x}^*_b$ is the mean and $\hat{\sigma}^2_{xb}$ is the variance of the $b$th bootstrap sample $x^*_b$, and so on.

(3) Approximate the achieved significance level by

$$\widehat{\mathrm{ASL}}_{\mathrm{boot}} = B^{-1} \sum_b 1\left(|t^*_b| \ge |t_{\mathrm{obs}}|\right),$$

where $t_{\mathrm{obs}}$ is the observed value of the test statistic.
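The three steps can be put together in a short sketch. Several details here are assumptions rather than anything stated on the slide: the function names `welch_t` and `boot_mean_test`, the use of the unbiased variance (`ddof=1`), the default `B=2000`, and the two-sided comparison of absolute values.

```python
import numpy as np


def welch_t(x, y):
    """Unequal-variance t statistic; ddof=1 is an assumed choice of variance estimator."""
    se = np.sqrt(x.var(ddof=1) / len(x) + y.var(ddof=1) / len(y))
    return (x.mean() - y.mean()) / se


def boot_mean_test(x, y, B=2000, seed=None):
    """Bootstrap p-value for H0: E(X) = E(Y) (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n, m = len(x), len(y)

    # Recenter both samples at the combined mean so the null holds
    z_bar = np.concatenate([x, y]).mean()
    x_tilde = x - x.mean() + z_bar
    y_tilde = y - y.mean() + z_bar

    t_obs = welch_t(x, y)

    # Steps (1) and (2): resample each group separately and recompute t
    t_star = np.empty(B)
    for b in range(B):
        xb = rng.choice(x_tilde, size=n, replace=True)
        yb = rng.choice(y_tilde, size=m, replace=True)
        t_star[b] = welch_t(xb, yb)

    # Step (3): proportion of bootstrap statistics at least as extreme
    return np.mean(np.abs(t_star) >= abs(t_obs))
```

With very small samples a bootstrap resample can consist of a single repeated value, giving a zero variance estimate; this sketch does not guard against that case.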

Simulation results: Normal distribution

[Figure: power versus Delta for the Student, Welch, Wilcoxon, and Boot mean tests, in three panels: Equal, More in HV, Fewer in HV.]

Simulation results: Double exponential distribution

[Figure: power versus Delta for the Student, Welch, Wilcoxon, and Boot mean tests, in three panels: Equal, More in HV, Fewer in HV.]

Bootstrap test vs. Welch's test

It is perhaps not surprising that the results of the bootstrap test we have developed are quite similar to those of Welch's test.

However, unlike Welch's test, we can easily extend this idea to other tests.

For example, perhaps we would have more power if we decided to test the hypothesis of equal medians.

Testing equality of medians

An alternative bootstrap test, then, is to form $\hat{F}_0$ by placing equal mass at all $x_i - \tilde{x} + \tilde{z}$, where $\tilde{x}$ is the median of the $\{x_i\}$ and $\tilde{z}$ is the median of the combined sample, and so on for $\hat{G}_0$.

A possible test statistic would then be the difference of medians between the resampled values from $\hat{F}_0$ and $\hat{G}_0$.
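A sketch of the median version follows. The function name `boot_median_test`, the default `B=2000`, and the two-sided comparison of absolute differences are assumptions for illustration; the slide specifies only the recentering and the difference-of-medians statistic.

```python
import numpy as np


def boot_median_test(x, y, B=2000, seed=None):
    """Bootstrap p-value for equal medians (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n, m = len(x), len(y)

    # Recenter both samples at the median of the combined sample
    z_med = np.median(np.concatenate([x, y]))
    x0 = x - np.median(x) + z_med
    y0 = y - np.median(y) + z_med

    d_obs = np.median(x) - np.median(y)

    # Draw all B resamples at once: one row per bootstrap data set
    xb = rng.choice(x0, size=(B, n), replace=True)
    yb = rng.choice(y0, size=(B, m), replace=True)
    d_star = np.median(xb, axis=1) - np.median(yb, axis=1)

    # Proportion of resampled differences at least as extreme as observed
    return np.mean(np.abs(d_star) >= abs(d_obs))
```

Unlike the mean-based version, this statistic is not studentized; that keeps the sketch close to the description above, though other choices of statistic are possible.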

Simulation results: Double exponential distribution

[Figure: power versus Delta for the Student, Welch, Wilcoxon, and Boot median tests, in three panels: Equal, More in HV, Fewer in HV.]

Simulation results: Mixture of normals

[Figure: power versus Delta for the Student, Welch, Wilcoxon, and Boot median tests, in three panels: Equal, More in HV, Fewer in HV.]

Homework

At a conference once, I heard someone say that Wilcoxon rank-sum tests are not valid level $\alpha$ tests for the two-group comparison problem when the variances of the two groups are different. Is this correct? Why or why not?