Version 1: Equality of Distributions. 3. F (x) and G(x) represent the distribution functions corresponding to the Xs and Y s, respectively.
|
|
- Irma George
- 5 years ago
- Views:
Transcription
1 4 Two-Sample Methods 4.1 The (Mann-Whitney) Wilcoxon Rank Sum Test Version 1: Equality of Distributions Assumptions: Given two independent random samples X 1, X 2,..., X n and Y 1, Y 2,..., Y m : 1. The measurement scale is at least ordinal. 2. The response variable of interest is continuous. 3. F (x) and G(x) represent the distribution functions corresponding to the Xs and Y s, respectively. Hypotheses: (A) Two-sided: H 0 : F (x) = G(x) for all x vs H 1 : F (x) G(x) for some x (B) One-sided: H 0 : F (x) = G(x) for all x vs H 1 : F (x) > G(x) for some x (C) One-sided: H 0 : F (x) = G(x) for all x vs H 1 : F (x) < G(x) for some x The alternative hypothesis H 1 in (B) implies that X tends to be less than Y and H 1 in (C) implies that Y tends to be less than X. Rationale: Suppose that ranks are assigned to the combined sample X 1, X 2,..., X n, Y 1, Y 2,..., Y m. If the distributions of X and Y are the same, then we would expect that the average of the ranks associated with the Xs and with the Y s would be similar. However, if the distributions of X and Y are different, and the smaller (larger) ranks tend to be associated with X and the larger (smaller) ranks with Y, then we would expect the rank averages associated with the Xs and Y s to systematically differ. Thus, if the average (or sum) of the ranks associated with the Xs or Y s is too large or small, we have evidence that H 0 : F (x) = G(x) should be rejected. Method: For a given α Assign ranks to the combined sample X 1, X 2,..., X n, Y 1, Y 2,..., Y m. Let W = Under H 0, the sampling distribution of W is determined from the randomization distribution of n + m ranks partitioned into two groups of size n and m. For each randomization, W is calculated. The distribution of W is then determined. Let w p be a p th quantile of W. That is, w p is the largest value of test statistic W such that P (W w p ) p. In the table of critical values, selected lower and upper quantiles w p are given for various common values of p and for n, m
2 Decision Rule (A) For two-sided test H 1 : F (x) G(x), reject H 0 if W < w L or W > w U where w L and w U are the lower and upper α/2 tabled critical values. Otherwise, fail to reject H 0. (B) For one-sided test H 1 : F (x) > G(x),, Reject H 0 if W < w U (the lower α tabled critical value). Otherwise, Fail to Reject H 0. 9 Appendix (C) For one-sided : Wilcoxon test H 1 Rank-Sum : F (x) < G(x), Table, Reject H 0 if W > w U (the upper α tabled critical Probabilities value). relate Otherwise, to distribution Fail to Reject of W A H, 0 the. rank sum for group A when H 0 : A = B is true. The tabulated value for the lower tail is the largest value of w A for which pr(w A w A ) prob. The tabulated value for the upper tail is the smallest Table of value Critical of w A for Values whichfor pr(w W A for wsmall A ) prob. m and n Lower Tail Upper Tail prob prob n A n B
3 Example of the Distribution of W under H 0 when n = 4, m = 4 The following table lists the ( 8 4) = 70 possible assignments of ranks (R1, R 2, R 3, R 4 ) and the corresponding ranksum W. R 1 R 2 R 3 R 4 W R 1 R 2 R 3 R 4 W R 1 R 2 R 3 R 4 W The following table describes the distribution of W under H 0 when n = 4, m = 4. The cumulative probabilities are then used to calculate an exact p-value for a given value of W. Distribution of W with n = 4 and m = 4 P (W = w L ) P (W w L ) w L w U Frequency = P (W = w U ) = P (W w U )
4 Version 2: Shift in Location In addition to Assumptions 1-3 in Version 1, assume that the distributions of X and Y differ only with respect to a shift in location, that is; (4) G(x) = F (x + ) for all x This assumption states that the two distributions are identical in shape but are centered at different values. Hypotheses By including assumption (4), the MWW-test can be used to compare means or medians: For means (θ x = µ X, θ y = µ Y ) or for medians (θ x = M x, θ y = M y ) (A) Two-sided: H 0 : θ x = θ y vs H 1 : θ x θ y. (B) One-sided: H 0 : θ x = θ y vs H 1 : θ x < θ y. (C) One-sided: H 0 : θ x = θ y vs H 1 : θ x > θ y. Rationale: If the distribution of X and Y differ only with respect to a shift in location, then = and =. Consider the following cases: (i) If > 0, then the X s will tend to be larger than the Y s the ranks of the X s will tend to be larger than the ranks of the Y s. (ii) If < 0, then the X s will tend to be smaller than the Y s the ranks of the X s will tend to be smaller than the ranks of the Y s. (iii) If = 0, then the distributions of X and Y are identical. Or, the ranks of the X s will tend to be similarly distributed to the ranks of the Y s. Method and Decision Rule: Follow the same Method and Decision Rule for Version 1. Large Sample Approximation: When either n 1 or n 2 or both are large and there are no ties, the Central Limit Theorem for ranks applies. That is, z = W n 1(n 1 + n 2 + 1)/2 ± c n1 n 2 (n 1 + n 2 + 1)/12 = W E(W ) ± c V ar(w ) d N(0, 1). A value of z can be used to approximate p-values. If no continuity correction is used, then c = 0. If a continuity correction is used, then c =.5 if W > 0 or c = +.5 if W < 0. Both SAS and R use this correction. 76
5 In there are ties, see page 94 of Daniel for the adjustment to W. Later, we will use see a Randomization Test as an alternative approach but does not need a special formula to handle ties (Higgins p.41-42). Example: In a study of 11 patients with Huntington s disease and a control group of 10 subjects, the response (milligrams percent) to a 5-hour oral glucose treatment was studied. Perform a Mann-Whitney (Wilcoxon) Rank Sum Test for a two-sided alternative comparing medians. Control (X) Rank HD patient (Y ) Rank 77
6 R code for Mann-Whitney (Wilcoxon) Rank Sum Test # Mann-Whitney (Wilcoxon) Rank Sum Test HDsubject <- c(85,89,86,91,77,93,100,82,92,86,86) Control <- c(83,73,65,65,90,77,78,97,85,75) # Two-sided alternative with confidence interval for median difference wilcox.test(hdsubject,control,conf.int=t) # One-sided alternative: Median 1 > Median 2 wilcox.test(hdsubject,control,alternative="greater") # One-sided alternative: Median 1 < Median 2 wilcox.test(hdsubject,control,alternative="less") R output for Mann-Whitney (Wilcoxon) Rank Sum Test Warning: R does not report the W statistic, but an equivalent U statistic where U = min(u 1, U 2 ) and U 1 = U 2 = U is referred to as the Mann-Whitney U-statistic. It is also mathematically equivalent to either the number of pairs (X i, Y j ) for which X i < Y j or the number of pairs (X i, Y j ) for which Y j < X i (whichever is smaller). See Siegel p. 120 for details. If there are any tied values (X i = Y j for some i, j), then U can be adjusted by assigning 1/2 to each tied pair. Wilcoxon rank sum test with continuity correction data: HDsubject and Control W = 87, p-value = <-- p-value alternative hypothesis: true location shift is not equal to 0 95 percent confidence interval: sample estimates: difference in location Wilcoxon rank sum test with continuity correction data: HDsubject and Control W = 87, p-value = alternative hypothesis: true location shift is greater than 0 Wilcoxon rank sum test with continuity correction data: HDsubject and Control W = 87, p-value = alternative hypothesis: true location shift is less than 0 78
7 SAS output for Mann-Whitney (Wilcoxon) Rank Sum Test Mann-Whitney (Wilcoxon) Rank Sum Test The NPAR1WAY Procedure Wilcoxon Scores (Rank Sums) for Variable glucose Classified by Variable treatmnt Sum of Expected Std Dev Mean treatmnt N Scores Under H0 Under H0 Score HD subj Control Average scores were used for ties. Wilcoxon Two-Sample Test Statistic (S) Normal Approximation Z One-Sided Pr < Z Two-Sided Pr > Z t Approximation One-Sided Pr < Z Two-Sided Pr > Z Exact Test One-Sided Pr <= S Two-Sided Pr >= S - Mean Z includes a continuity correction of 0.5. Hodges-Lehmann Estimation Location Shift (Control - HD subj) Interval Asymptotic Type 95% Confidence Limits Midpoint Standard Error Asymptotic (Moses) Exact SAS code for Mann-Whitney (Wilcoxon) Rank Sum Test DM LOG;CLEAR;OUT;CLEAR; ; ODS LISTING; OPTIONS LS=72 PS=54 NONUMBER NODATE; data in; do i = 1 to 11; treatmnt = HD subj ; input output; end; do i = 1 to 10; treatmnt = Control ; input output; end; lines; ; PROC NPAR1WAY DATA=in WILCOXON; CLASS treatmnt; VAR glucose; EXACT WILCOXON HL; TITLE Mann-Whitney (Wilcoxon) Rank Sum Test ; RUN; 79
8 4.1.1 Hodges-Lehman Confidence Intervals for the Shift Parameter The following procedure generates an approximate 100(1 α)% confidence interval for. Calculate all mn paired differences U ij = X i Y j Arrange the U ij in increasing order. Use the table of critical values for W to find k = w α/2 n(n + 1)/2. Let L and U bet the k th smallest and largest U ij values. The approximate 100(1 α)% confidence interval for the shift parameter θ is (L, U). Thus, it is also a confidence interval for µ X µ Y and M x M y. This is the procedure used by SAS and R. In the previous example, the output from SAS indicated the 95% confidence interval was 17 1 while R indicated the 95% confidence interval was 1 17 (milligrams percent). You need to keep track of the direction of the calculated differences (e.g., Control - HD Patient in SAS or HD Patient - Control in R). 4.2 The Median Test for Differences in Population Medians Assumptions: Given two independent random samples X 1, X 2,..., X m and Y 1, Y 2,..., Y n : The measurement scale is at least ordinal. The variable of interest is continuous. The two populations have the same shape. Hypotheses about medians: H 0 : M x = M y vs H 1 : M x M y or H 1 : M x < M y or H 1 : M x > M y Rationale: If the two populations have the same median M: For a random X or Y, P (X M) = P (Y M) =.5. Assuming H 0 is true, we estimate the common median M by computing the sample median M of the combined sample X 1, X 2,..., X m, Y 1, Y 2,..., Y n of N = m + n observations. We would expect about half of the X 1, X 2,..., X m sample and half of the Y 1, Y 2,..., Y n sample to be above (or below) M. Method: For a given α: Compute M and summarize the results in a 2 2 table: Sample X s Y s Total Relationship to Above A B A + B Sample Median Equal or Below C D C + D Total m n N 80
9 Under H 0, the sampling distribution of A is hypergeometric. That is, for a sample of size A + B from a population of size N comprised of two types of items (m of Type X and n = N m of Type Y ), the probability of observing exactly A of the Type X and B of Type Y is P(A, B m, n) = ( m )( n A B) ( N ) A+B These probabilities can be determined using SAS or R or can be approximated by the normal distribution when N is large. For a two-sided alternative, the exact p-value is the sum of the extreme tail probabilities from hypergeometric distributions: p value = P (X min(a, B)) + P (X max(c, D)) For a one-sided alternative, use the appropriate single extreme tail probability to find the exact p-value. An approximate p-value can be generated from a normal approximation with z-statistic: z = (A/m) (B/n) p(1 p)(1/m + 1/n) where p = (A + B)/N. Decision Rule using p-values If p-value α, Reject H 0. Otherwise, Fail to Reject H 0. Example: In a study of 11 patients with Huntington s disease and a control group of 10 subjects, the response (milligrams percent) to a 5-hour oral glucose treatment was studied. Perform a Median Test for a two-sided alternative. HD patient (X) Control (Y )
10 R output from generating a hypergeometric distribution > # Hypergeometric Distribution for A,B row xh pdfh cdfh rev_cdfh [1,] [2,] [3,] < [4,] [5,] [6,] [7,] [8,] [9,] [10,] [11,] > # Hypergeometric Distribution C,D row xh pdfh cdfh rev_cdfh [1,] [2,] [3,] [4,] [5,] [6,] [7,] [8,] [9,] < [10,] [11,] [12,] R code for generating a hypergeometric distribution Nh <- 21 AB <- 10 # Hypergeometric Distribution for A,B row mh <- 10 xh <- (0:mh) pdfh <- dhyper(xh,mh,nh-mh,ab) pdfh <- round(pdfh,10) cdfh <- phyper(xh,mh,nh-mh,ab) cdfh <- round(cdfh,10) rev_cdfh <- phyper(ab-xh,mh,nh-mh,ab) rev_cdfh <- round(rev_cdfh,10) hypergeo <- cbind(xh,pdfh,cdfh,rev_cdfh) hypergeo # Hypergeometric Distribution C,D row nh <- 11 xh <- (0:nh) pdfh <- dhyper(xh,nh,nh-nh,ab) pdfh <- round(pdfh,10) cdfh <- phyper(xh,nh,nh-nh,ab) cdfh <- round(cdfh,10) rev_cdfh <- phyper(ab-xh,nh,nh-nh,ab) rev_cdfh <- round(rev_cdfh,10) hypergeo <- cbind(xh,pdfh,cdfh,rev_cdfh) hypergeo 82
11 SAS output for Median Test to compare two medians Median Test to Compare Two Medians The NPAR1WAY Procedure Median Scores (Number of Points Above Median) for Variable glucose Classified by Variable treatmnt Sum of Expected Std Dev Mean treatmnt N Scores Under H0 Under H0 Score HD subj Control Average scores were used for ties. Median Two-Sample Test Statistic (S) Normal Approximation Z One-Sided Pr < Z Two-Sided Pr > Z Exact Test One-Sided Pr <= S Two-Sided Pr >= S - Mean SAS code for Median Test to compare two medians DM LOG;CLEAR;OUT;CLEAR; ; OPTIONS LS=72 PS=54 NONUMBER NODATE; ODS LISTING; data in; do i = 1 to 11; treatmnt = HD subj ; input output; end; do i = 1 to 10; treatmnt = Control ; input output; end; lines; ; PROC NPAR1WAY DATA=in MEDIAN; CLASS treatmnt; VAR glucose; EXACT MEDIAN; TITLE Median Test to Compare Two Medians ; RUN; 83
12 R code for Median Test to compare two medians # Two-Sample Median Test library(modeltools) library(coin) # x are the HD subjects x <- c(85,89,86,91,77,93,100,82,92,86,86) # y are the control subjects y <- c(83,73,65,65,90,77,78,97,85,75) xy <- c(x,y) group <- c(rep(1,length(x)),rep(2,length(y))) group <- as.factor(group) two_sample <- data.frame(cbind(xy,group)) # two_sample # Two-sided alternative with confidence interval for median difference median_test(xy~group,conf.int=t,conf.level=.95,distribution="exact") # One-sided alternatives median_test(xy~group,conf.int=t,distribution="exact",alternative="l") median_test(xy~group,conf.int=t,distribution="exact",alternative="g") R output for Median Test to compare two medians # Two-Sample Median Test > # Two-sided alternative with confidence interval for median difference Exact Median Test Z = , p-value = alternative hypothesis: true mu is not equal to 0 95 percent confidence interval: 1 19 sample estimates: difference in location 0.5 > # One-sided alternatives Exact Median Test Z = , p-value = alternative hypothesis: true mu is less than 0 95 percent confidence interval: -Inf 19 sample estimates: difference in location 0.5 Exact Median Test Z = , p-value = alternative hypothesis: true mu is greater than 0 95 percent confidence interval: 1 Inf sample estimates: difference in location
6 Single Sample Methods for a Location Parameter
6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually
More informationNonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I
1 / 16 Nonparametric tests Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I Nonparametric one and two-sample tests 2 / 16 If data do not come from a normal
More informationData are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)
BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data
More informationDistribution-Free Procedures (Devore Chapter Fifteen)
Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal
More information5.3 Three-Stage Nested Design Example
5.3 Three-Stage Nested Design Example A researcher designs an experiment to study the of a metal alloy. A three-stage nested design was conducted that included Two alloy chemistry compositions. Three ovens
More informationSTAT 135 Lab 8 Hypothesis Testing Review, Mann-Whitney Test by Normal Approximation, and Wilcoxon Signed Rank Test.
STAT 135 Lab 8 Hypothesis Testing Review, Mann-Whitney Test by Normal Approximation, and Wilcoxon Signed Rank Test. Rebecca Barter March 30, 2015 Mann-Whitney Test Mann-Whitney Test Recall that the Mann-Whitney
More informationWilcoxon Test and Calculating Sample Sizes
Wilcoxon Test and Calculating Sample Sizes Dan Spencer UC Santa Cruz Dan Spencer (UC Santa Cruz) Wilcoxon Test and Calculating Sample Sizes 1 / 33 Differences in the Means of Two Independent Groups When
More informationNon-parametric Tests
Statistics Column Shengping Yang PhD,Gilbert Berdine MD I was working on a small study recently to compare drug metabolite concentrations in the blood between two administration regimes. However, the metabolite
More informationNonparametric Statistics
Nonparametric Statistics Nonparametric or Distribution-free statistics: used when data are ordinal (i.e., rankings) used when ratio/interval data are not normally distributed (data are converted to ranks)
More informationE509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.
E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189
More informationDesign of the Fuzzy Rank Tests Package
Design of the Fuzzy Rank Tests Package Charles J. Geyer July 15, 2013 1 Introduction We do fuzzy P -values and confidence intervals following Geyer and Meeden (2005) and Thompson and Geyer (2007) for three
More informationSTAT Section 5.8: Block Designs
STAT 518 --- Section 5.8: Block Designs Recall that in paired-data studies, we match up pairs of subjects so that the two subjects in a pair are alike in some sense. Then we randomly assign, say, treatment
More information7.3 Ridge Analysis of the Response Surface
7.3 Ridge Analysis of the Response Surface When analyzing a fitted response surface, the researcher may find that the stationary point is outside of the experimental design region, but the researcher wants
More informationPSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests
PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution
More informationNonparametric Statistics Notes
Nonparametric Statistics Notes Chapter 5: Some Methods Based on Ranks Jesse Crawford Department of Mathematics Tarleton State University (Tarleton State University) Ch 5: Some Methods Based on Ranks 1
More information4.8 Alternate Analysis as a Oneway ANOVA
4.8 Alternate Analysis as a Oneway ANOVA Suppose we have data from a two-factor factorial design. The following method can be used to perform a multiple comparison test to compare treatment means as well
More informationNonparametric Location Tests: k-sample
Nonparametric Location Tests: k-sample Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)
More informationChapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics
Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely
More information5 Introduction to the Theory of Order Statistics and Rank Statistics
5 Introduction to the Theory of Order Statistics and Rank Statistics This section will contain a summary of important definitions and theorems that will be useful for understanding the theory of order
More informationNonparametric hypothesis tests and permutation tests
Nonparametric hypothesis tests and permutation tests 1.7 & 2.3. Probability Generating Functions 3.8.3. Wilcoxon Signed Rank Test 3.8.2. Mann-Whitney Test Prof. Tesler Math 283 Fall 2018 Prof. Tesler Wilcoxon
More informationIn many situations, there is a non-parametric test that corresponds to the standard test, as described below:
There are many standard tests like the t-tests and analyses of variance that are commonly used. They rest on assumptions like normality, which can be hard to assess: for example, if you have small samples,
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationFrequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=
A frequency distribution is a kind of probability distribution. It gives the frequency or relative frequency at which given values have been observed among the data collected. For example, for age, Frequency
More informationNonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown
Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding
More informationContents 1. Contents
Contents 1 Contents 2 Two-Sample Methods 3 2.1 Classic Method...................... 7 2.2 A Two-sample Permutation Test............. 11 2.2.1 Permutation test................. 11 2.2.2 Steps for a two-sample
More informationST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data
ST4241 Design and Analysis of Clinical Trials Lecture 7: Non-parametric tests for PDG data Department of Statistics & Applied Probability 8:00-10:00 am, Friday, September 2, 2016 Outline Non-parametric
More informationModule 9: Nonparametric Statistics Statistics (OA3102)
Module 9: Nonparametric Statistics Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 15.1-15.6 Revision: 3-12 1 Goals for this Lecture
More informationNon-parametric methods
Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish
More informationResampling Methods. Lukas Meier
Resampling Methods Lukas Meier 20.01.2014 Introduction: Example Hail prevention (early 80s) Is a vaccination of clouds really reducing total energy? Data: Hail energy for n clouds (via radar image) Y i
More informationBiostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test
Biostatistics 270 Kruskal-Wallis Test 1 ORIGIN 1 Kruskal-Wallis Test The Kruskal-Wallis is a non-parametric analog to the One-Way ANOVA F-Test of means. It is useful when the k samples appear not to come
More informationSTAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data.
STAT 518 --- Section 3.4: The Sign Test The sign test, as we will typically use it, is a method for analyzing paired data. Examples of Paired Data: Similar subjects are paired off and one of two treatments
More informationStatistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data
Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data 999 Prentice-Hall, Inc. Chap. 9 - Chapter Topics Comparing Two Independent Samples: Z Test for the Difference
More informationAnalyzing Small Sample Experimental Data
Analyzing Small Sample Experimental Data Session 2: Non-parametric tests and estimators I Dominik Duell (University of Essex) July 15, 2017 Pick an appropriate (non-parametric) statistic 1. Intro to non-parametric
More information1 ONE SAMPLE TEST FOR MEDIAN: THE SIGN TEST
NON-PARAMETRIC STATISTICS ONE AND TWO SAMPLE TESTS Non-parametric tests are normally based on ranks of the data samples, and test hypotheses relating to quantiles of the probability distribution representing
More informationSession 3 The proportional odds model and the Mann-Whitney test
Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session
More informationFrequency Distribution Cross-Tabulation
Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape
More informationAnalysis of 2x2 Cross-Over Designs using T-Tests
Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous
More informationExst7037 Multivariate Analysis Cancorr interpretation Page 1
Exst7037 Multivariate Analysis Cancorr interpretation Page 1 1 *** C03S3D1 ***; 2 ****************************************************************************; 3 *** The data set insulin shows data from
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More informationUnit 14: Nonparametric Statistical Methods
Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based
More information3. Nonparametric methods
3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests
More informationTMA4255 Applied Statistics V2016 (23)
TMA4255 Applied Statistics V2016 (23) Part 7: Nonparametric tests Signed-Rank test [16.2] Wilcoxon Rank-sum test [16.3] Anna Marie Holand April 19, 2016, wiki.math.ntnu.no/tma4255/2016v/start 2 Outline
More informationHYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă
HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and
More informationNonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p.
Nonparametric s Mark Muldoon School of Mathematics, University of Manchester Mark Muldoon, November 8, 2005 Nonparametric s - p. 1/31 Overview The sign, motivation The Mann-Whitney Larger Larger, in pictures
More informationComparison of Two Population Means
Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have
More informationContents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47
Contents 1 Non-parametric Tests 3 1.1 Introduction....................................... 3 1.2 Advantages of Non-parametric Tests......................... 4 1.3 Disadvantages of Non-parametric Tests........................
More informationWeek 7.1--IES 612-STA STA doc
Week 7.1--IES 612-STA 4-573-STA 4-576.doc IES 612/STA 4-576 Winter 2009 ANOVA MODELS model adequacy aka RESIDUAL ANALYSIS Numeric data samples from t populations obtained Assume Y ij ~ independent N(μ
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationContents 1. Contents
Contents 1 Contents 1 One-Sample Methods 3 1.1 Parametric Methods.................... 4 1.1.1 One-sample Z-test (see Chapter 0.3.1)...... 4 1.1.2 One-sample t-test................. 6 1.1.3 Large sample
More informationVisual interpretation with normal approximation
Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation
More informationThis is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!
Two sample tests (part II): What to do if your data are not distributed normally: Option 1: if your sample size is large enough, don't worry - go ahead and use a t-test (the CLT will take care of non-normal
More informationNon-Parametric Statistics: When Normal Isn t Good Enough"
Non-Parametric Statistics: When Normal Isn t Good Enough" Professor Ron Fricker" Naval Postgraduate School" Monterey, California" 1/28/13 1 A Bit About Me" Academic credentials" Ph.D. and M.A. in Statistics,
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationGROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION
FOR SAMPLE OF RAW DATA (E.G. 4, 1, 7, 5, 11, 6, 9, 7, 11, 5, 4, 7) BE ABLE TO COMPUTE MEAN G / STANDARD DEVIATION MEDIAN AND QUARTILES Σ ( Σ) / 1 GROUPED DATA E.G. AGE FREQ. 0-9 53 10-19 4...... 80-89
More informationRank-sum Test Based on Order Restricted Randomized Design
Rank-sum Test Based on Order Restricted Randomized Design Omer Ozturk and Yiping Sun Abstract One of the main principles in a design of experiment is to use blocking factors whenever it is possible. On
More informationEXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"
EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically
More informationTesting Independence
Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1
More information16. Nonparametric Methods. Analysis of ordinal data
16. Nonparametric Methods 數 Analysis of ordinal data 料 1 Data : Non-interval data : nominal data, ordinal data Interval data but not normally distributed Nonparametric tests : Two dependent samples pair
More informationWhat to do today (Nov 22, 2018)?
What to do today (Nov 22, 2018)? Part 1. Introduction and Review (Chp 1-5) Part 2. Basic Statistical Inference (Chp 6-9) Part 3. Important Topics in Statistics (Chp 10-13) Part 4. Further Topics (Selected
More informationRelating Graph to Matlab
There are two related course documents on the web Probability and Statistics Review -should be read by people without statistics background and it is helpful as a review for those with prior statistics
More informationInference for Binomial Parameters
Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for
More informationAdvanced Statistics II: Non Parametric Tests
Advanced Statistics II: Non Parametric Tests Aurélien Garivier ParisTech February 27, 2011 Outline Fitting a distribution Rank Tests for the comparison of two samples Two unrelated samples: Mann-Whitney
More informationTHE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future
More informationNonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health
Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,
More informationFormulas and Tables by Mario F. Triola
Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or
More informationIntroduction to Statistical Data Analysis III
Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationNon-parametric tests, part A:
Two types of statistical test: Non-parametric tests, part A: Parametric tests: Based on assumption that the data have certain characteristics or "parameters": Results are only valid if (a) the data are
More informationViolating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test?
Violating the normal distribution assumption So what do you do if the data are not normal and you still need to perform a test? Remember, if your n is reasonably large, don t bother doing anything. Your
More informationMATH4427 Notebook 5 Fall Semester 2017/2018
MATH4427 Notebook 5 Fall Semester 2017/2018 prepared by Professor Jenny Baglivo c Copyright 2009-2018 by Jenny A. Baglivo. All Rights Reserved. 5 MATH4427 Notebook 5 3 5.1 Two Sample Analysis: Difference
More informationThis is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
EXST3201 Chapter 13c Geaghan Fall 2005: Page 1 Linear Models Y ij = µ + βi + τ j + βτij + εijk This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
More informationInferential Statistics
Inferential Statistics Eva Riccomagno, Maria Piera Rogantin DIMA Università di Genova riccomagno@dima.unige.it rogantin@dima.unige.it Part G Distribution free hypothesis tests 1. Classical and distribution-free
More informationData analysis and Geostatistics - lecture VII
Data analysis and Geostatistics - lecture VII t-tests, ANOVA and goodness-of-fit Statistical testing - significance of r Testing the significance of the correlation coefficient: t = r n - 2 1 - r 2 with
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More information6 Designs with Split Plots
6 Designs with Split Plots Many factorial experimental designs are incorrectly analyzed because the assumption of complete randomization is not true. Many factorial experiments have one or more restrictions
More informationLecture 4. Checking Model Adequacy
Lecture 4. Checking Model Adequacy Montgomery: 3-4, 15-1.1 Page 1 Model Checking and Diagnostics Model Assumptions 1 Model is correct 2 Independent observations 3 Errors normally distributed 4 Constant
More informationEpidemiology Principle of Biostatistics Chapter 11 - Inference about probability in a single population. John Koval
Epidemiology 9509 Principle of Biostatistics Chapter 11 - Inference about probability in a single population John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is
More informationFish SR P Diff Sgn rank Fish SR P Diff Sng rank
Nonparametric tests Distribution free methods require fewer assumptions than parametric methods Focus on testing rather than estimation Not sensitive to outlying observations Especially useful for cruder
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number
More informationBiostatistics Quantitative Data
Biostatistics Quantitative Data Descriptive Statistics Statistical Models One-sample and Two-Sample Tests Introduction to SAS-ANALYST T- and Rank-Tests using ANALYST Thomas Scheike Quantitative Data This
More informationNONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12
NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major
More informationIntroduction to Crossover Trials
Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain
More informationMeasuring relationships among multiple responses
Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.
More informationSAS/STAT 14.1 User s Guide. Introduction to Nonparametric Analysis
SAS/STAT 14.1 User s Guide Introduction to Nonparametric Analysis This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows:
More information4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures
Non-parametric Test Stephen Opiyo Overview Distinguish Parametric and Nonparametric Test Procedures Explain commonly used Nonparametric Test Procedures Perform Hypothesis Tests Using Nonparametric Procedures
More informationIntroduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.
Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of
More informationIntroduction to Nonparametric Analysis (Chapter)
SAS/STAT 9.3 User s Guide Introduction to Nonparametric Analysis (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for
More informationNon-parametric Hypothesis Testing
Non-parametric Hypothesis Testing Procedures Hypothesis Testing General Procedure for Hypothesis Tests 1. Identify the parameter of interest.. Formulate the null hypothesis, H 0. 3. Specify an appropriate
More informationHypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =
Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,
More informationBasic Business Statistics, 10/e
Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,
More informationChapter 16. Nonparametric Tests
Chapter 16 Nonparametric Tests The statistical tests we have examined so far are called parametric tests, because they assume the data have a known distribution, such as the normal, and test hypotheses
More informationBIO 682 Nonparametric Statistics Spring 2010
BIO 682 Nonparametric Statistics Spring 2010 Steve Shuster http://www4.nau.edu/shustercourses/bio682/index.htm Lecture 8 Example: Sign Test 1. The number of warning cries delivered against intruders by
More informationWhat is a Hypothesis?
What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:
More informationTwo Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests
Chapter 59 Two Correlated Proportions on- Inferiority, Superiority, and Equivalence Tests Introduction This chapter documents three closely related procedures: non-inferiority tests, superiority (by a
More information7 The Analysis of Response Surfaces
7 The Analysis of Response Surfaces Goal: the researcher is seeking the experimental conditions which are most desirable, that is, determine optimum design variable levels. Once the researcher has determined
More informationS D / n t n 1 The paediatrician observes 3 =
Non-parametric tests Paired t-test A paediatrician measured the blood cholesterol of her patients and was worried to note that some had levels over 00mg/100ml To investigate whether dietary regulation
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationContents 1. Contents
Contents 1 Contents 4 Paired Comparisons & Block Designs 3 4.1 Paired Comparisons.................... 3 4.1.1 Paired Data.................... 3 4.1.2 Existing Approaches................ 6 4.1.3 Paired-comparison
More informationDr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)
Dr. Maddah ENMG 617 EM Statistics 10/12/12 Nonparametric Statistics (Chapter 16, Hines) Introduction Most of the hypothesis testing presented so far assumes normally distributed data. These approaches
More informationContents. Acknowledgments. xix
Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables
More information