16. Nonparametric Methods. Analysis of ordinal data
|
|
- Norah Patrick
- 6 years ago
- Views:
Transcription
1 16. Nonparametric Methods 數 Analysis of ordinal data 料 1
2 Data : Non-interval data : nominal data, ordinal data Interval data but not normally distributed Nonparametric tests : Two dependent samples pair t-test 1. Sign Test : Only the information of + or - is used Also can be used to test the median of a population 2. Wilcoxon Signed-Rank Test : Use the rank of the data. Two independent samples : t/z test Wilcoxon Rank-Sum Test More than 2 independent samples : ANOVA F-test Kruskal-Wallis Test Spearman s coefficient of rank correlation r s r Test the significance of r s 2
3 Sign test Data: Binary, + or Problem: H0 : P(+) = P( ) = 0.5 In contrast, if + is preferred, H1 : P( ) > P( ) Or, if is preferred, H1 : P( + ) < P( ) 3
4 Sign test Application 1 1. Whether there is change after an intervention? Before/after experiments : two dependent samples Define increase = +; decrease = If there is no change after the event, P(+)=P(-)=1/2 H0 : P(+) = P( ) = 0.5 In contrast, if there is an increase, P( )>1/2>P( ). Example. 識 Experiment: Data: X=, if > ; X=-, if < Recall : for continuous data, the paired t-test is used under normal population assumption. 4
5 Sign test Application 2 Multiple choices: A or B is preferred? 2. Whether A is more popular than B? Single population : a sample data Define prefer A = +, prefer B = No difference H0: P(prefer A) = P(prefer B) = 0.5 H0: P(+) = P( ) In contrast, if A is more preferred, H1: P(+)>P( ) Example. 諾 樂 樂 Q A 諾 B 樂 5
6 Sign test Application 3 3. Whether the population median = m0? Single population : a sample data < m0 = ; > m0 = + Whether P(X>m0) = P(X<m0) = 0.5? Example. 數 80? Recall : for continuous data, Ch. 9~10 emphasize only on the population mean. 6
7 Exact Sign Test Let P(+) = Step 1. Null hypothesis : H0 : = 0.5 Step 2. Significance level Step 3. Test statistic : B = number of + 7
8 Step 4. Rejection region : Direction : coincident with H1 H1 : > 0.5, B c, (large B reject H0) H1 : < 0.5, B c, (small B reject H0) H1 : 0.5, B c2 or B c1 (large or small B) Critical value(s) : c? c1? c2? B~ distribution? Check table? 8
9 Step 4. Rejection region : Critical value(s): Tie = no change or x=m0 Let n = untied number = (total + ) + (total ) Then under null hypothesis H0: = 0.5 B ~ Binomial( n, 0.5) check Binomial(n,0.5) table 9
10 Under H0, B ~ Binomial(n, =0.5), the critical value is such that the type I error rate For example, if H1 : >0.5, H0 is rejected if B is too large. Type I error rate =tail prob. =0.1 Thus, rejection region is {B 10}. 10
11 Step 4. Rejection region, critical value : If H1 : > 0.5, find c in Binomial(n, 0.5) table such that P( B c ) If H1 : < 0.5, find c in Binomial(n, 0.5) table such that P( B c ) If H1 : 0.5, find c1, c2 in Binomial(n, 0.5) table such that P(B c 1) /2 and P(B c2 ) /2 Step 5. Draw conclusion 11
12 Example. P548 Samuelson Chemicals. Intervention A computer training program Sample n=15 managers was selected at random Experiment competence and understanding wrt computer before/after the intervention of these managers Ordinal outcome {poor, fair, good, excellent, outstanding} Q : Are the managers more competent after the program? Q : 練 識 Define : Sign of difference : + = improved; = worse. Table 16-1 Q : P(improved)=P(+)= > 0.5? Note: if continuous outcome, one may use a pair-t test. 12
13 A tie 13
14 Step 1. H0 : =0.5, H1 : > 0.5 Step 2. = 0.1 Step 3. Test statistic = B = no of + in the sample Step 4. Rejection region The 5 th pair observation is a tie. Thus, n = n-1=15-1=14 Direction : B c Critical value : find c such that P(B c) =0.1 (Table 16-2) 14
15 Table Bin(14,0.5) Exact critical value : c = 10, Exact rejection region : H0 is rejected at =0.1 if B 10 15
16 CHART Bin(14,0.5), under H0 Type I error rate
17 Step 5. Since there are 11 + out of the 14 managers. B=11 10! H0 is rejected at =0.1. Conclusion : the computer course was effective. 17
18 Normal Approximation: Under H0, d B ~ Binomial(n ', 0.5), B N(n ' 0.5, n ' 0.5 (1 0.5)) Validity : n 0 = 0.5 n 5; or equivalently, n 10 Continuity correction ( 連 ): 0.5 Step 3. Test statistic Z B n ', B > 0.5n ' 0.5 n ' = B n ', B < 0.5n ' 0.5 n ' 18
19 Normal Approximation Step 4. Rejection region : Z ~ N(0,1), check N(0,1) table. If H1 : > 0.5, Z Z If H1 : < 0.5, Z -Z If H1 : 0.5, Z -Z /2, Z Z /2 19
20 Example. P553 Cola, Inc. Two versions of the drink are considered : A and B Experiment : a sample of 64 consumers were selected. Each will taste both A and B and indicate a preference. Data : 42 preferred A. Which one is preferred? Let A= +, B= 20
21 Step 1. Hypotheses H0 : no difference in consumer preference Or let =P(A)=P(+), then, H0 : =0.5, H1 : 0.5 Step 2. = 0.05 Step 3. Test statistic B = no of + (A) in the sample No tie, n =n=64, Since n =64 >10 normal approximation Z-test is used. B n ', B > 0.5n ' 0.5 n ' Z = B n ', B < 0.5n ' 0.5 n ' 21
22 Step 4. Rejection region :Z = 1.96, H0 is rejected if Z 1.96 or Z Step 5. Since B = 42 > 0.5(64) = 32, B n ' (64) Z = = = 2.38 > n ' Reject H0 at level Note that : p-value=2p(z>2.38)=2(0.0087)= <
23 Example. P Customer Research Department of Superior Grocers found the median weekly spent on grocery by young couples was $123. The CEO would like to repeat the research to determine whether the median has changed. Data : a random sample of n=102 young couples, 60 spent more than $123, 40 spent less, and 2 spent exactly $123. At the 0.10 significance level, is it reasonable to conclude that the median has changed? Let >123 = +, <123 = 23
24 Step 1. Hypotheses H0 : median = 123, H1 : median 123 Let =P(Spent >123)=P(+), Step 2. = 0.1 H0 : =0.5 H1 : 0.5 Step 3. Test statistic : sign test B = no of + ( >123) in the sample Ignore the 2 ties (exactly x = 123), since n =100 >10 normal approximation Z-test is used. B n', B > 0.5n ' 0.5 n ' Z = B n ', B < 0.5n ' 0.5 n ' 24
25 Step 4. Rejection region :Z 0.05 = 1.65, H0 is rejected if Z 1.65 or Z Step 5. Since B = 60 > 0.5(100) = 50, B n ' (100) Z= = = 1.9> n ' Reject H0 at level Note that : p-value=2p(z>1.9)=2(0.0287)= <
26 Wilcoxon signed-rank test In comparing two dependent sample, the sign test uses only the information of progression(+) or regression( ). The information of magnitudes are lost. Example. Data: A:0 1(+), B:0 1(+), C:0-20(-), D:0-20(-) There are 2 + s and 2 s. The sign test has an insignificant result. However, the decrements are more severe. To embed the information of magnitudes, one may consider a Wilcoxon signed-rank test. 26
27 Wilcoxon signed-rank test Problem: Comparing two populations Data: Two dependent samples Before/After Match pair Measurement: Ordinal or continuous Nonparametric tests: Sign test Wilcoxon signed-rank test Sign test vs. Wilcoxon signed-rank test : Sign Test : categorize data into binary outcomes, either + or -. Wilcoxon Signed-Rank Test : fully using the rank of the data. 27
28 Wilcoxon signed-rank test statistic T statistic 1. Compute the difference(diff) between each pair: 2. If the DIFF = 0, this is a tie. Drop ties from the sample. 3. Classify the data into two groups : Group : DIFF>0 Group : DIFF<0 4. Calculate the absolute DIFF and pool the two groups 5. Order the DIFF s from smallest to largest, assign rank for each. 6. R+ = rank sum for group 7. R- = rank sum for group 8. The test statistic T = the smaller one of R+ and R- 28
29 Rationale of using T H0 : No difference between two populations Recall for the sign test : Under H0, P(+)=P(-)=0.5 The proportions of the two groups are about the same. 29
30 Moreover, H0 : No difference between two populations For the Wilcoxon signed-rank test, If H0 is true, The rank sums should be about the same, R+ R-. The total rank sum = S =? S=( n)=n(n+1)/2. R+ and R- are expected to be equal and close to S/2 If H0 is false, One of R+ and R- is extremely large T = min (R+, R-) should be sufficiently small 30
31 Wilcoxon signed-rank test statistic Step 1. Hypotheses H0 : No difference in the two populations Step 2. Step 3. Wilcoxon signed-rank test statistic, T Step 4. Decision rule : H0 is rejected if T is too small Critical value(s) : Appendix H The row is used for one-tailed tests and 2 row for twotailed tests. Step 5. Draw conclusion 31
32 Example. P557 Mr. Frick, the owner of a family restaurant, developed a new recipe. He wants to conduct some tests to find that if the new recipe is better. Data : a random sample of n=15 customers Each one rates new/old flavors on a scale of 1 to 20. At =
33 Step 1. Hypotheses H0 : no difference in the ratings of the two flavors H1 : the new recipe have higher ratings Step 2. At =0.05 Step 3. Test statistic : Table Compute the DIFF between each pair: 2. If the DIFF= 0, this is a tie. Drop ties from the sample. 3. Classify the data into two groups : Group : DIFF>0 Group : DIFF<0 4. Calculate DIFF and pool the two groups 5. Order the DIFF s from smallest to largest, assign rank for each. 6. R+ = rank sum for group + 7. R- = rank sum for group 8. The test statistic T = the smaller one of R+ and R- 33
34 Table DIFF Remove the sign Pool rank 34
35 What if there are more than one observations which have same value? In the example, the pool DIFF data set is, 2, 3, 4, 7, 8, 8, 8, 9, 9, 9, 10, 12, 14, 16 there are 3 replicated 8 s and 3 replicated 9 s. How to assign rank for these values? These are also ties. same value = a tie When assigning ranks, break the ties by giving the average ranks. For example. The corresponding ranks for the three 8 s should be 5,6,7. Then the average rank is (5+6+7)/3=6, one gives each observation rank 6. 35
36 Step 4. Decision Rule : Appendix H Remove one tie, n=14 At =0.05, H0 is rejected if T 25 Step 5. Since R- = 30 < R+ = 75, T = 30 > 25, thus H0 of no difference is not rejected at level
37 Wilcoxon Rank-Sum Test Problem: Comparing two populations Data: Two independent samples Measurement: Ordinal or continuous Without normal assumption Wilcoxon rank-sum test. Recall : for two continuous independent samples, see Ch. 11 Normal + known variances : Z-test Large sample sizes : Z-test Normal + unknown equal variances : t- test. 37
38 Wilcoxon Rank-Sum Test Wilcoxon rank-sum test statistic, W : 1. Pool the two samples from different populations 2. Order the pooled sample from smallest to largest and assign the rank 3. Calculate the rank sum of each sample 4. W = rank sum of the first sample The critical value(s) of an exact Wilcoxon rank-sum test can be found on some other books. Here, an approximated test is used. 38
39 If H0 is true, W has Mean = Wilcoxon Rank-Sum Test n (n1 + n ) Variance = n n 2(n1 + n ) Approximately normal, if n1, n2 are large. 39
40 The mean of the W-statistic The total rank sum =(1+2+ +(n1+n2))=(n1+n2)(n1+n2+1)/2 If H0 is true, there is no difference, W is expected to be closed to n1 (n1+ n2)(n1+ n2 + 1) n1(n1+ n2 + 1) = n1+ n
41 Step 1. Hypotheses Wilcoxon Rank-Sum Test H0 : No difference in two populations Step 2. At significance level Step 3. Wilcoxon rank-sum test statistic : Z = W n 1 n n 1 2 (n1 + n 2 (n1 + n ) + 1) Where W = sum of the ranks of the 1 st sample from the 1 st population, n1 = sample size of the 1 st sample, n2 = sample size of the 2 nd sample
42 Step 4. Decision Rule : Under null hypothesis, Wilcoxon rank-sum test N(0,1) As a Z-test, find the critical value(s) in N(0,1) table Step 5. Draw conclusion : Note : for small sample sizes, some books provide the exact critical values of the Wilcoxon rank-sum test. 42
43 Example. P561 The president of CEO airlines recently noted an increase in the number of no-shows for flights out of Atlanta. Determine whether there are more no-shows for flights from Atlanta compared with from Chicago. Data : Table 16-4 Sample 1 : number of no-shows of 9 flights from Atlanta Sample 2 : number of no-shows of 8 flights from Chicago At 0.05 significance level. Departure from N o. of no-shows Atlanta Chicago
44 Wilcoxon Rank-Sum Test Step 1. Hypotheses H0 : The distribution of no-shows is the same for Atlanta and Chicago H1 : The distribution of no-shows is larger for Atlanta than for Chicago. Step 2. At =0.05 significance level Step 3. Wilcoxon rank-sum test statistic : Z = W n 1 n n 1 2 (n1 + n 2 (n1 + n ) + 1) Where W = sum of the ranks of the 1 st sample from Atlanta, n1 = sample size of the Altanta sample = 9 n2 = sample size of the Chicago sample = 8 44
45 Step 4. Decision Rule : one-tailed z-test H0 is rejected if Z > Z 0.05 =1.65 Step 5. Draw conclusion : w=96.5 Z = W n 1 n n 1 2 (n1 + n 2 (n1 + n ) + 1) 9( ) 96.5 = 2 9 8( ) 12 = 1.49 < 1.65 Since z = 1.49 < 1.65, the null hypothesis is not rejected at =0.05. Also the p-value = P(Z>1.49)=0.0681>0.05, same conclusion is obtained. 45
46 From No-shows(pool) Rank-1 Chicago Chicago Atlanta Chicago Atlanta Atlanta Chicago Chicago Atlanta Chicago Chicago Atlanta Atlanta Chicago Atlanta Atlanta Atlanta Table W=96.5 Atlanta Chicago No-shows Rank No-shows Rank Chicago ranksum Atlanta ranksum 46
47 Kruskal-Willis Test : ANOVA by ranks Recall to test the equality of several independent, continuous population means, under normal + equal variance assumption, ANOVA is used. See Ch. 12 For ordinal/continuous data, without any assumption about the populations, the Kruskal-Willis one-way analysis of variance by ranks is proposed. 47
48 Kruskal-Wallis Test Step 1. Hypotheses H0 : Several population distributions are the same H1 : The distributions are not all equal Step 2. At significance level Step 3. Kruskal-Wallis test statistic : H 2 2 ( R ) ( R ) ( R ) 12 2 = 1 2 k + + L+ 3(n + 1) n(n + 1) n1 n2 nk Where ( R 1 ),, ( R k ) = sum of the ranks of sample 1,..,k. n 1,,n k = sample size of sample 1,,k n = total sample size = n i 48
49 Step 4. Decision Rule : Under null hypothesis, when n1, nk 5, H has approximately a chisquare distribution with df = (k-1) H d χ 2 k 1 H0 is rejected if H χ 2 (k 1, α) Step 5. Draw conclusion : 49
50 Example. P565 A manager seminar consists of executives from manufacturing, finance, and the trades. Before the seminar, the seminar leader is interested in whether the three groups are equally knowledgeable about management principles. Data : 3 independent samples on scores of a test. Manufacturing executives group : 56, 39, 48, 38, 73, 50, 62 Finance executives group : 102, 87, 51, 95, 68, 42, 107, 89 Trade executives group : 42, 38, 89, 75, 35, 61 50
51 Step 1. Hypotheses Kruskal-Wallis Test H0 : The three population scores have same distributions H1 : The distributions are not all equal Step 2. At =0.05 significance level Step 3. Kruskal-Wallis test statistic : H 2 2 ( R ) ( R ) ( R ) 12 2 = (n + 1) n(n + 1) n1 n2 n3 Where ( R 1 ),( R 2 ),( R 3 ) = rank sums of sample 1,2,3. n 1 =7, n 2 =8, n 3 =6, n = 7+8+6=21 51
52 Step 4. Decision Rule : Since n1, n2, n3 5, df = (k-1)=(3-1)=2, =0.05 H0 is rejected if H χ ( 2,0.05) = Step 5. Draw conclusion : ( R ) ( R ) ( R ) 12 H = (n n(n + 1) n1 n2 n ( 57.5) ( 121) ( 52.5) = + + 3(21+ 1) 21(21+ 1) = < ) Since H=5.736<5.991, the null hypothesis of no difference is not rejected at level However, P-value = is close to
53 Division Test Scores rank sum sample size Trade Manufacturing Manufacturing Finance Trade Trade Manufacturing Finance Trade Manufacturing Manufacturing Finance Manufacturing Trade Manufacturing Finance Manufacturing Trade Finance Finance Trade Finance Finance Finance 53
54 Note. P567 Compare to usual ANOVA F-test in ANOVA can be used if Normal population Equal variances/standard deviations Independent samples Testing H 0 : µ E = µ F = µ T 54
55 異數 EXCEL : ANOVA 數 異數 度 臨 Since p-value=0.034<0.05, H0 is rejected at =0.05 different conclusion! However, the difference in p-values = =0.023, not large. 55
56 Spearman s coefficient of correlation Recall between continuous variables, the Pearson correlation coefficient is considered. See Ch. 13 For ordinal-level data, Spearman s coefficient of rank correlation is used to describe the relationship. r s 6 (R = 1 n(n 1 2 R 2 1) R 1, R 2 = the ranks of each pair of observation ) 2 56
57 Rank-Order Correlation -1 r s 1-1 : perfect negative correlation 1 : perfect positive correlation. 0 : no strong association among the ranks. 57
58 Example. 570 A rating is given by executives o each college graduate joining a plastics manufacturing firm. The rating is an expression of the future potential. Another rating of an training program is also given. Find the correlation between these two ratings. 58
59 Table
60 Then r 2 E RT ) s = 2 6 (R = 1 n(n 1) 6(78.5) = 1 12(144 1).726 There is a strong positive association between these two ratings. 60
61 Test the significance of rank correlation: n 10 Step 1. Hypotheses H0 : The population rank correlation = 0 Step 2. At significance level Step 3. A t-test statistic : t = r s n 2 1 r 2 s Step 4. Decision rule : Under null hypothesis, t~ t-distribution with df = n-2 This is a t-test, find critical values in Appendix F Step 5. Draw conclusion. 61
62 Step 1. Hypotheses Example. 570 H0 : The population rank correlation = 0 H1 : The population rank correlation > 0 Step 2. At =0.05 significance level Step 3. A t-test statistic : t = r s n 2 1 r 2 s 62
63 Example. P570 Continued Step 4. Decision rule : a one-tailed t-test Since df = n-2=12-2=10, =0.05, H0 is rejected if t Step 5. Draw conclusion. t = n r s =.726 = r > s Since t = > 1.812, the null hypothesis of zero correlation is rejected at significance level
64 Sign test Preference : 31 Before/after : 34 Median : 32 Exercise Wilcoxon Signed rank test : 34 Wilcoxon Rank sum test : 36 Kruskal-Wallis test : 35, 37 Spearman s coefficient of rank correlation : 39 64
Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)
BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data
More informationNonparametric Statistics
Nonparametric Statistics Nonparametric or Distribution-free statistics: used when data are ordinal (i.e., rankings) used when ratio/interval data are not normally distributed (data are converted to ranks)
More informationWhat is a Hypothesis?
What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:
More informationChapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics
Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More information3. Nonparametric methods
3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests
More informationNon-parametric (Distribution-free) approaches p188 CN
Week 1: Introduction to some nonparametric and computer intensive (re-sampling) approaches: the sign test, Wilcoxon tests and multi-sample extensions, Spearman s rank correlation; the Bootstrap. (ch14
More informationChapter 18 Resampling and Nonparametric Approaches To Data
Chapter 18 Resampling and Nonparametric Approaches To Data 18.1 Inferences in children s story summaries (McConaughy, 1980): a. Analysis using Wilcoxon s rank-sum test: Younger Children Older Children
More informationBasic Business Statistics, 10/e
Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationBiostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test
Biostatistics 270 Kruskal-Wallis Test 1 ORIGIN 1 Kruskal-Wallis Test The Kruskal-Wallis is a non-parametric analog to the One-Way ANOVA F-Test of means. It is useful when the k samples appear not to come
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationIntroduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.
Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of
More informationNonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health
Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,
More informationStatistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data
Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data 1999 Prentice-Hall, Inc. Chap. 10-1 Chapter Topics The Completely Randomized Model: One-Factor
More information4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures
Non-parametric Test Stephen Opiyo Overview Distinguish Parametric and Nonparametric Test Procedures Explain commonly used Nonparametric Test Procedures Perform Hypothesis Tests Using Nonparametric Procedures
More informationStatistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data
Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data 999 Prentice-Hall, Inc. Chap. 9 - Chapter Topics Comparing Two Independent Samples: Z Test for the Difference
More informationDr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)
Dr. Maddah ENMG 617 EM Statistics 10/12/12 Nonparametric Statistics (Chapter 16, Hines) Introduction Most of the hypothesis testing presented so far assumes normally distributed data. These approaches
More informationLecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks
More informationPSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests
PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution
More informationHYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă
HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and
More informationSEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics
SEVERAL μs AND MEDIANS: MORE ISSUES Business Statistics CONTENTS Post-hoc analysis ANOVA for 2 groups The equal variances assumption The Kruskal-Wallis test Old exam question Further study POST-HOC ANALYSIS
More informationFormulas and Tables by Mario F. Triola
Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationLecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks
More informationSTAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data.
STAT 518 --- Section 3.4: The Sign Test The sign test, as we will typically use it, is a method for analyzing paired data. Examples of Paired Data: Similar subjects are paired off and one of two treatments
More informationCHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC
CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests
More informationNon-parametric tests, part A:
Two types of statistical test: Non-parametric tests, part A: Parametric tests: Based on assumption that the data have certain characteristics or "parameters": Results are only valid if (a) the data are
More informationTextbook Examples of. SPSS Procedure
Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of
More informationBIO 682 Nonparametric Statistics Spring 2010
BIO 682 Nonparametric Statistics Spring 2010 Steve Shuster http://www4.nau.edu/shustercourses/bio682/index.htm Lecture 8 Example: Sign Test 1. The number of warning cries delivered against intruders by
More informationRank-Based Methods. Lukas Meier
Rank-Based Methods Lukas Meier 20.01.2014 Introduction Up to now we basically always used a parametric family, like the normal distribution N (µ, σ 2 ) for modeling random data. Based on observed data
More informationNonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown
Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding
More informationNon-parametric methods
Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish
More informationHypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true
Hypothesis esting Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Statistical Hypothesis: conjecture about a population parameter
More informationIntroduction to Nonparametric Statistics
Introduction to Nonparametric Statistics by James Bernhard Spring 2012 Parameters Parametric method Nonparametric method µ[x 2 X 1 ] paired t-test Wilcoxon signed rank test µ[x 1 ], µ[x 2 ] 2-sample t-test
More informationStatistics Handbook. All statistical tables were computed by the author.
Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance
More informationSTATISTIKA INDUSTRI 2 TIN 4004
STATISTIKA INDUSTRI 2 TIN 4004 Pertemuan 11 & 12 Outline: Nonparametric Statistics Referensi: Walpole, R.E., Myers, R.H., Myers, S.L., Ye, K., Probability & Statistics for Engineers & Scientists, 9 th
More informationUnit 14: Nonparametric Statistical Methods
Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based
More informationANOVA - analysis of variance - used to compare the means of several populations.
12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.
More informationNonparametric Statistics Notes
Nonparametric Statistics Notes Chapter 5: Some Methods Based on Ranks Jesse Crawford Department of Mathematics Tarleton State University (Tarleton State University) Ch 5: Some Methods Based on Ranks 1
More informationChapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing
Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity
More information1.0 Hypothesis Testing
.0 Hypothesis Testing The Six Steps of Hypothesis Testing Rejecting or Accepting the Null Samples and Data. Parametric Tests for Comparing Two Populations Testing Equality of Population Means (T Test!
More informationCorrelation. A statistics method to measure the relationship between two variables. Three characteristics
Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction
More informationFormulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion
Formulas and Tables for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. Ch. 3: Descriptive Statistics x Sf. x x Sf Mean S(x 2 x) 2 s Å n 2 1 n(sx 2 ) 2 (Sx)
More informationCORELATION - Pearson-r - Spearman-rho
CORELATION - Pearson-r - Spearman-rho Scatter Diagram A scatter diagram is a graph that shows that the relationship between two variables measured on the same individual. Each individual in the set is
More informationBivariate Relationships Between Variables
Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationContents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47
Contents 1 Non-parametric Tests 3 1.1 Introduction....................................... 3 1.2 Advantages of Non-parametric Tests......................... 4 1.3 Disadvantages of Non-parametric Tests........................
More informationNonparametric Tests. Mathematics 47: Lecture 25. Dan Sloughter. Furman University. April 20, 2006
Nonparametric Tests Mathematics 47: Lecture 25 Dan Sloughter Furman University April 20, 2006 Dan Sloughter (Furman University) Nonparametric Tests April 20, 2006 1 / 14 The sign test Suppose X 1, X 2,...,
More informationContents. Acknowledgments. xix
Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables
More informationIntroduction to Statistical Data Analysis III
Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The
More informationFormulas and Tables for Elementary Statistics, Eighth Edition, by Mario F. Triola 2001 by Addison Wesley Longman Publishing Company, Inc.
Formulas and Tables for Elementary Statistics, Eighth Edition, by Mario F. Triola 2001 by Addison Wesley Longman Publishing Company, Inc. Ch. 2: Descriptive Statistics x Sf. x x Sf Mean S(x 2 x) 2 s 2
More informationGlossary for the Triola Statistics Series
Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling
More informationHYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC
1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare
More informationAnalysis of variance (ANOVA) Comparing the means of more than two groups
Analysis of variance (ANOVA) Comparing the means of more than two groups Example: Cost of mating in male fruit flies Drosophila Treatments: place males with and without unmated (virgin) females Five treatments
More informationModule 9: Nonparametric Statistics Statistics (OA3102)
Module 9: Nonparametric Statistics Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 15.1-15.6 Revision: 3-12 1 Goals for this Lecture
More informationPooled Variance t Test
Pooled Variance t Test Tests means of independent populations having equal variances Parametric test procedure Assumptions Both populations are normally distributed If not normal, can be approximated by
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationTribhuvan University Institute of Science and Technology 2065
1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)
More informationAgonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?
Agonistic Display in Betta splendens: Data Analysis By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether
More informationCHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:
CHAPTER 9, 10 Hypothesis Testing Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: The person is guilty. The person is innocent. To
More informationLecture Slides for INTRODUCTION TO. Machine Learning. ETHEM ALPAYDIN The MIT Press,
Lecture Slides for INTRODUCTION TO Machine Learning ETHEM ALPAYDIN The MIT Press, 2004 alpaydin@boun.edu.tr http://www.cmpe.boun.edu.tr/~ethem/i2ml CHAPTER 14: Assessing and Comparing Classification Algorithms
More informationMeasuring Associations : Pearson s correlation
Measuring Associations : Pearson s correlation Scatter Diagram A scatter diagram is a graph that shows that the relationship between two variables measured on the same individual. Each individual in the
More informationStatistics: revision
NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers
More informationCan you tell the relationship between students SAT scores and their college grades?
Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower
More informationST4241 Design and Analysis of Clinical Trials Lecture 9: N. Lecture 9: Non-parametric procedures for CRBD
ST21 Design and Analysis of Clinical Trials Lecture 9: Non-parametric procedures for CRBD Department of Statistics & Applied Probability 8:00-10:00 am, Friday, September 9, 2016 Outline Nonparametric tests
More informationNon-parametric Tests
Statistics Column Shengping Yang PhD,Gilbert Berdine MD I was working on a small study recently to compare drug metabolite concentrations in the blood between two administration regimes. However, the metabolite
More informationpsychological statistics
psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,
More informationWeek 14 Comparing k(> 2) Populations
Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.
More informationNonparametric hypothesis tests and permutation tests
Nonparametric hypothesis tests and permutation tests 1.7 & 2.3. Probability Generating Functions 3.8.3. Wilcoxon Signed Rank Test 3.8.2. Mann-Whitney Test Prof. Tesler Math 283 Fall 2018 Prof. Tesler Wilcoxon
More informationNonparametric Methods
Nonparametric Methods Marc H. Mehlman marcmehlman@yahoo.com University of New Haven Nonparametric Methods, or Distribution Free Methods is for testing from a population without knowing anything about the
More informationCh14. Multiple Regression Analysis
Ch14. Multiple Regression Analysis 1 Goals : multiple regression analysis Model Building and Estimating More than 1 independent variables Quantitative( 量 ) independent variables Qualitative( ) independent
More informationExam details. Final Review Session. Things to Review
Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit
More informationCDA Chapter 3 part II
CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X
More informationStatistiek I. Nonparametric Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen.
Statistiek I Nonparametric Tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/23 Nonparametric Tests NONPARAMETRIC, DISTRIBUTION-FREE
More informationNONPARAMETRICS. Statistical Methods Based on Ranks E. L. LEHMANN HOLDEN-DAY, INC. McGRAW-HILL INTERNATIONAL BOOK COMPANY
NONPARAMETRICS Statistical Methods Based on Ranks E. L. LEHMANN University of California, Berkeley With the special assistance of H. J. M. D'ABRERA University of California, Berkeley HOLDEN-DAY, INC. San
More informationInferential Statistics
Inferential Statistics Eva Riccomagno, Maria Piera Rogantin DIMA Università di Genova riccomagno@dima.unige.it rogantin@dima.unige.it Part G Distribution free hypothesis tests 1. Classical and distribution-free
More informationChapter 8 of Devore , H 1 :
Chapter 8 of Devore TESTING A STATISTICAL HYPOTHESIS Maghsoodloo A statistical hypothesis is an assumption about the frequency function(s) (i.e., PDF or pdf) of one or more random variables. Stated in
More informationLecture 14: ANOVA and the F-test
Lecture 14: ANOVA and the F-test S. Massa, Department of Statistics, University of Oxford 3 February 2016 Example Consider a study of 983 individuals and examine the relationship between duration of breastfeeding
More informationST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data
ST4241 Design and Analysis of Clinical Trials Lecture 7: Non-parametric tests for PDG data Department of Statistics & Applied Probability 8:00-10:00 am, Friday, September 2, 2016 Outline Non-parametric
More information6 Single Sample Methods for a Location Parameter
6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually
More informationNonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p.
Nonparametric s Mark Muldoon School of Mathematics, University of Manchester Mark Muldoon, November 8, 2005 Nonparametric s - p. 1/31 Overview The sign, motivation The Mann-Whitney Larger Larger, in pictures
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More informationHow do we compare the relative performance among competing models?
How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model
More informationAnalysis of variance
Analysis of variance Tron Anders Moger 3.0.007 Comparing more than two groups Up to now we have studied situations with One observation per subject One group Two groups Two or more observations per subject
More informationDistribution-Free Procedures (Devore Chapter Fifteen)
Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal
More informationTypes of Statistical Tests DR. MIKE MARRAPODI
Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample
More informationIntroduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution
Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis
More informationAnalysis of Variance
Analysis of Variance Chapter 12 McGraw-Hill/Irwin Copyright 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Learning Objectives LO 12-1 List the characteristics of the F distribution and locate
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationEcon 325: Introduction to Empirical Economics
Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population
More informationTHE PEARSON CORRELATION COEFFICIENT
CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationTransition Passage to Descriptive Statistics 28
viii Preface xiv chapter 1 Introduction 1 Disciplines That Use Quantitative Data 5 What Do You Mean, Statistics? 6 Statistics: A Dynamic Discipline 8 Some Terminology 9 Problems and Answers 12 Scales of
More information= 1 i. normal approximation to χ 2 df > df
χ tests 1) 1 categorical variable χ test for goodness-of-fit ) categorical variables χ test for independence (association, contingency) 3) categorical variables McNemar's test for change χ df k (O i 1
More informationA3. Statistical Inference Hypothesis Testing for General Population Parameters
Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Exploring Data: Distributions Look for overall pattern (shape, center, spread) and deviations (outliers). Mean (use a calculator): x = x 1 + x
More informationAnalyzing Small Sample Experimental Data
Analyzing Small Sample Experimental Data Session 2: Non-parametric tests and estimators I Dominik Duell (University of Essex) July 15, 2017 Pick an appropriate (non-parametric) statistic 1. Intro to non-parametric
More information