STAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data.

Similar documents
Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

6 Single Sample Methods for a Location Parameter

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

Basic Business Statistics, 10/e

Non-parametric methods

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Business Statistics MEDIAN: NON- PARAMETRIC TESTS

Nonparametric Statistics

STAT Section 5.8: Block Designs

16. Nonparametric Methods. Analysis of ordinal data

psychological statistics

Small n, σ known or unknown, underlying nongaussian

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

What is a Hypothesis?

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)

Lecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Textbook Examples of. SPSS Procedure

Review of Statistics 101

Exam details. Final Review Session. Things to Review

Sampling Distribution of a Sample Proportion

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats

Glossary for the Triola Statistics Series

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

Statistics Handbook. All statistical tables were computed by the author.

Psych Jan. 5, 2005

Discrete Multivariate Statistics

Statistics in medicine

A3. Statistical Inference Hypothesis Testing for General Population Parameters

Mark Scheme (Results) June 2008

Master s Written Examination - Solution

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Non-parametric tests, part A:

STATISTICS ANCILLARY SYLLABUS. (W.E.F. the session ) Semester Paper Code Marks Credits Topic

Statistics: revision

Inferential statistics

Frequency Distribution Cross-Tabulation

Package BayesNI. February 19, 2015

Basic Statistical Analysis

Nonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p.

10: Crosstabs & Independent Proportions

EXAM # 2. Total 100. Please show all work! Problem Points Grade. STAT 301, Spring 2013 Name

Diploma Part 2. Quantitative Methods. Examiners Suggested Answers

1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance?

Contents 1. Contents


Chapter 7 Comparison of two independent samples

Intro to Parametric & Nonparametric Statistics

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

ADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ

Spearman Rho Correlation

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013

Analysis of 2x2 Cross-Over Designs using T-Tests

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM

Inferences About Two Proportions

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Lecture 26 Section 8.4. Wed, Oct 14, 2009

Chapter 9 Inferences from Two Samples

Tribhuvan University Institute of Science and Technology 2065

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

Probability Distributions.

Logistic Regression - problem 6.14

= 1 i. normal approximation to χ 2 df > df

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Math Sec 4 CST Topic 7. Statistics. i.e: Add up all values and divide by the total number of values.

Chapter Three. Hypothesis Testing

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Association Between Variables Measured at the Ordinal Level

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Wilcoxon Test and Calculating Sample Sizes

3. Nonparametric methods

Module 9: Nonparametric Statistics Statistics (OA3102)

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

Design of the Fuzzy Rank Tests Package

Physics 509: Non-Parametric Statistics and Correlation Testing

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!

6.4 Type I and Type II Errors

Comparison of Two Samples

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

Introduction to Statistics

Formulas and Tables by Mario F. Triola

Chapter 8 of Devore , H 1 :

Unit 14: Nonparametric Statistical Methods

A3. Statistical Inference

Statistics 135 Fall 2008 Final Exam

Version 1: Equality of Distributions. 3. F (x) and G(x) represent the distribution functions corresponding to the Xs and Y s, respectively.

Chapter Six: Two Independent Samples Methods 1/51

ST4241 Design and Analysis of Clinical Trials Lecture 9: N. Lecture 9: Non-parametric procedures for CRBD

Transcription:

STAT 518 --- Section 3.4: The Sign Test The sign test, as we will typically use it, is a method for analyzing paired data. Examples of Paired Data: Similar subjects are paired off and one of two treatments is given to each subject in the pair. or We could have two observations on the same subject. The key: With paired data, the pairings cannot be switched around without affecting the analysis. We might label one of the variables X i and the other variable Y i. Our entire bivariate data set for n individual pairs is: The bivariate random vectors are assumed to be independent across observations. The goal may be to determine whether the X variable tends to be larger than or smaller than the corresponding Y variable.

Assuming the data are at least ordinal, we could classify each pair as + if X i < Y i or if X i > Y i. If X i = Y i then the pair is classified as 0 or tie. We further assume internal consistency: If P(+) > P( ) for one pair, then P(+) > P( ) for all pairs, and same holds for P(+) < P( ) and P(+) = P( ). Test Statistic: The null distribution of T is where n = The hypotheses of the sign test can be stated in a variety of ways. Most generally, we can test any one of: H 0 : H 0 : H 0 : H 1 : H 1 : H 1 : These could be stated in terms of comparing the population medians of X and Y: H 0 : H 0 : H 0 : H 1 : H 1 : H 1 :

The corresponding rejection rules in each case are: The P-values for each case are: where T ~ Binom(n, 0.5). The exact critical region and P-value are found using Table A3 (or the computer) similarly to the other binomial-based tests. Example 1: Six students are given two tests, one after being fed, and one on an empty stomach. Is there evidence that students perform better on a full stomach? (Use =.05.) Student Scores 1 2 3 4 5 6 X (with food) 74 71 82 77 72 81 Y (without food) 68 71 86 70 67 80

Example 2: 18 boy/girl sets of twins were scored for empathy on a personality test. In ten sets, the girl scored higher; in 7 sets, the boy scored higher, and in one set, the scores were equal. Can we conclude a difference in median empathy between the boy and girl populations? (Use =.05.) Some Notes One way to view the sign test is simply as the binomial test, where p* = 0.5. We classify each trial as + or and determine whether the probability of + is different from/greater than/ less than 0.5. Note that performing the quantile test about the median is essentially performing the sign test, where the second variable is simply the constant number x*.

The sign test is appropriate for any data measured on an ordinal or stronger scale. If the paired differences Y i X i are continuous with a symmetric distribution, the Wilcoxon signed-rank test (we will see it in Chapter 5) may be more powerful than the sign test. If the paired differences Y i X i have a normal distribution, the paired t-test is the most powerful option. Efficiency of the Sign Test Population A.R.E.(sign vs. signed-rank) A.R.E.(sign vs. paired-t) Normal Uniform (light tails) Double exponential (heavy tails)

Sec. 3.5 --- Variations of the Sign Test Tests based on the sign test can be used to answer a variety of questions. McNemar s Test Consider two paired binary variables (nominal). Both X and Y can only take the values 0 and 1, say. This type of data often arises from before vs. after experiments. X = 1 might represent having some condition a treatment is applied and Y = 1 having it the treatment is applied. Question: Is the probability of having the condition the same before and after the treatment is applied? Or does the treatment change the probability of having it? Null and Alternative hypotheses: H 0 : vs. H 1 :

The data from such a study can be summarized with a 2 2 table: Consider the (X i = 0, Y i = 1) entries to be the + observations. Let the (X i = 1, Y i = 0) entries be the observations. Then we can use the test to test H 0. Note the a and d entries are treated as. The test statistic is simply T 2 = The null distribution of T 2 is Using Table A3 with n = b + c and p = 0.5, reject H 0 if where t is the value corresponding to a probability of /2. The P-value is

Example 1: Suppose 200 subjects were asked last month and again this month whether they approved of the president s job performance. 90 said yes both times; 90 said no both times; 12 said yes the first month and no the second, and 8 said no the first month and yes the second. At = 0.05, has the president s approval rating significantly changed? Contingency Table: Hypotheses: Test Statistic: T 2 = P-value: Conclusion:

For large samples (n > 20), we can use the test statistic T 1 = which has a null distribution. Why is this? Cox-Stuart Test for Trend An ordered sequence of numbers exhibits trend if the later numbers in the sequence tend to be greater than the earlier numbers ( trend) or if the later numbers in the sequence tend to be less than the earlier numbers ( trend). In the arranged data, we essentially pair points to the left of the middle ordered value with points to the right of the middle ordered value, and perform a sign test.

We assume the data X 1, X 2,, X n are at least ordinal in scale. Pair the data as (X 1, X 1+c ), (X 2, X 2+c ),, (X n c, X n ) where c = n /2 if n is even; c = (n + 1)/2 if n is odd. Note that if n is odd, the middle value is ignored. If the first element in a pair is less than the second, we write a + for that pair. If the first element in a pair is greater than the second, we write a for that pair. If the first element in a pair equals the second, we ignore that pair. Null hypothesis: H 0 : 3 possible alternatives: Test Statistic: T = Null distribution of T is with p = and n = the number of untied pairs. The decision rule and p-value are obtained in the same way as the sign test.

Example: NASA data give the average global temperatures for the last 13 decades, from the 1880s to the 2000s: -0.493, -0.457, -0.466, -0.497, -0.315, -0.077, 0.063, -0.036, -0.025, -0.002, 0.317, 0.563, 0.923. (Temperatures given in degrees F, centered by subtracting from 1951-1980 mean). Does Cox-Stuart test find evidence (at = 0.05) of an increasing trend? Hypotheses: n = and c = Pairs: T = Look at binomial table with p = 0.5 and n = P-value = Conclusion: In order to test for a specified type of trend other than increasing or decreasing (such as periodic, alternating, etc.), the data must first be reordered to reflect the expected ordering according to the specified trend. Then the Cox-Stuart test can be implemented on the reordered data.