Non-parametric tests, part A:

Two types of statistical test:

Parametric tests:
Based on the assumption that the data have certain characteristics or "parameters". Results are only valid if:
(a) the data are normally distributed;
(b) the data show homogeneity of variance;
(c) the data are measurements on an interval or ratio scale.

[Figure: scores for group 1 and group 2, each labelled with its mean (M) and standard deviation (s.d.)]

Non-parametric tests:
Make no such assumptions about the data's characteristics. Use if any of the three properties below are true:
(a) the data are not normally distributed (e.g. skewed);
(b) the data show inhomogeneity of variance;
(c) the data are measurements on an ordinal scale (ranks).

Examples of parametric tests and their non-parametric equivalents:

Parametric test:                                Non-parametric counterpart:
Pearson correlation                             Spearman's correlation
(No equivalent test)                            Chi-Square test
Independent-means t-test                        Mann-Whitney test
Dependent-means t-test                          Wilcoxon test
One-way Independent Measures ANOVA              Kruskal-Wallis test
One-way Repeated-Measures ANOVA                 Friedman's test
Non-parametric tests for comparing two groups or conditions:

(a) The Mann-Whitney test:
Used when you have two conditions, each performed by a separate group of subjects. Each subject produces one score. Tests whether there is a statistically significant difference between the two groups.

Mann-Whitney test, step-by-step:
Does it make any difference to students' comprehension of statistics whether the lectures are in English or in Serbo-Croat?
Group 1: statistics lectures in English.
Group 2: statistics lectures in Serbo-Croat.
DV: lecturer intelligibility ratings by students, on a scale running from "unintelligible" to "highly intelligible".
The scores are ratings, so the Mann-Whitney test is appropriate.

[Table: raw scores and ranks for the English group and the Serbo-Croat group, with the mean, S.D. and median of each group's raw scores]

Step 1: Rank all the scores together, regardless of group.

Revision of how to rank scores:
Same method as for Spearman's correlation.
(a) Lowest score gets a rank of 1; next lowest gets 2; and so on.
(b) Two or more scores with the same value are tied.
  (i) Give each tied score the rank it would have had, had it been different from the other scores.
  (ii) Add the ranks for the tied scores, and divide by the number of tied scores. Each of the ties gets this average rank.
  (iii) The next score after the set of ties gets the rank it would have obtained, had there been no tied scores.
e.g. four scores, of which the second and third lowest are both 34:
original rank:  1    2    3    4
actual rank:    1    2.5  2.5  4
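The ranking rules above can be sketched in a few lines of Python. This is only an illustration: the function name rank_with_ties and the example scores are hypothetical and are not the lecture's data.

```python
def rank_with_ties(scores):
    """Return the rank of each score (1 = lowest); tied scores share their average rank."""
    # Indices of the scores, ordered from lowest score to highest.
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next score in sorted order has the same value.
        while end + 1 < len(order) and scores[order[end + 1]] == scores[order[pos]]:
            end += 1
        # Sorted positions pos..end would have had ranks pos+1 .. end+1;
        # every tied score gets the average of those ranks.
        average_rank = ((pos + 1) + (end + 1)) / 2
        for k in range(pos, end + 1):
            ranks[order[k]] = average_rank
        pos = end + 1
    return ranks


# Hypothetical scores (not the lecture's data): the two 34s are tied.
example_scores = [12, 34, 34, 40]
example_ranks = rank_with_ties(example_scores)
print(example_ranks)  # [1.0, 2.5, 2.5, 4.0]
```

The score after the tie still receives rank 4, as rule (iii) requires.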
Step 2: Add up the ranks for group 1, to get T1. Here, T1 = 83. Add up the ranks for group 2, to get T2. Here, T2 = 70.

Step 3: N1 is the number of subjects in group 1; N2 is the number of subjects in group 2. Here, N1 = 8 and N2 = 9.

Step 4: Call the larger of these two rank totals Tx. Here, Tx = 83. Nx is the number of subjects in this group; here, Nx = 8.

Step 5: Find U:

U = N1 * N2 + [Nx (Nx + 1) / 2] - Tx

In our example,

U = 8 * 9 + [8 * (8 + 1) / 2] - 83
U = 72 + 36 - 83 = 25

If there are unequal numbers of subjects (as in the present case), calculate U for both rank totals and then use the smaller U. In the present example, for T1, U = 25, and for T2, U = 47. Therefore, use 25 as U.

Step 6: Look up the critical value of U (e.g. with the table on my website), taking into account N1 and N2. If our obtained U is smaller than the critical value of U, we reject the null hypothesis and conclude that our two groups do differ significantly.

[Table: excerpt of the critical values of U, indexed by N1 and N2]

Here, our obtained U of 25 is larger than the critical value of U for N1 = 8 and N2 = 9, and so we conclude that there is no significant difference between our two groups.

Conclusion: ratings of lecturer intelligibility are unaffected by whether the lectures are given in English or in Serbo-Croat.
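As a check on the arithmetic in Step 5, here is a minimal Python sketch of the U formula. The function name is hypothetical. The inputs assume rank totals of 83 and 70 for groups of 8 and 9 subjects, which are the values consistent with the example's T2 = 70, N2 = 9 and U = 47.

```python
def mann_whitney_u(rank_total, n_group, n_other):
    """U for one group from its rank total: N1*N2 + Nx(Nx + 1)/2 - Tx."""
    return n_group * n_other + n_group * (n_group + 1) / 2 - rank_total


# Assumed inputs: T1 = 83 with N1 = 8, and T2 = 70 with N2 = 9.
u_1 = mann_whitney_u(83, 8, 9)   # 72 + 36 - 83 = 25.0
u_2 = mann_whitney_u(70, 9, 8)   # 72 + 45 - 70 = 47.0

# The smaller of the two values is the U that is compared with the critical value.
u = min(u_1, u_2)
print(u_1, u_2, u)
```

A handy sanity check is that the two U values always add up to N1 * N2 (here 25 + 47 = 72 = 8 * 9); if they do not, a rank total has been added up incorrectly.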
(b) The Wilcoxon test:
Used when you have two conditions, both performed by the same subjects. Each subject produces two scores, one for each condition. Tests whether there is a statistically significant difference between the two conditions.

Wilcoxon test, step-by-step:
Does background music affect the mood of factory workers?
Eight workers: each tested twice.
Condition A: background music.
Condition B: silence.
DV: worker's mood rating, on a scale running from "extremely miserable" to "euphoric".
The scores are ratings, so use the Wilcoxon test.

[Table: mood ratings of the eight workers under silence and under music, with the mean, S.D. and median for each condition]

Step 1: Find the difference between each pair of scores, keeping track of the sign of the difference.

Step 2: Rank the differences, ignoring their sign. Lowest = 1. Tied scores are dealt with as before. Ignore zero difference-scores.

[Table: difference and rank for each worker; the one zero difference is ignored]

Step 3: Add together the positive-signed ranks. Separately, add together the negative-signed ranks.

Step 4: "W" is the smaller of these two sums of ranks. N is the number of differences, omitting zero differences. Here, N = 8 - 1 = 7.

Step 5: Use a table (e.g. on my website) to find the critical value of W, for your N. Your obtained W has to be smaller than this critical value, for it to be statistically significant.
[Table: critical values of W by N, for one-tailed and two-tailed significance levels]

Our obtained W is bigger than the critical value of W for an N of 7. Our two conditions are not significantly different.

Conclusion: workers' mood appears to be unaffected by presence or absence of background music.

Mann-Whitney using SPSS - procedure:

[Figure: SPSS procedure screenshots]

Mann-Whitney using SPSS - output:

Ranks (Intelligibility):
Language        N    Mean Rank   Sum of Ranks
English         8    10.38       83.00
Serbo-croat     9     7.78       70.00
Total          17

Test Statistics (b) (Intelligibility):
Mann-Whitney U                       25.000
Wilcoxon W                           70.000
Z
Asymp. Sig. (2-tailed)
Exact Sig. [2*(1-tailed Sig.)]               (a)
a. Not corrected for ties.
b. Grouping Variable: Language
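The Z and "Asymp. Sig." rows of such output come from a normal approximation to U. The sketch below shows the textbook version of that approximation, assuming U = 25 with groups of 8 and 9 subjects. It applies no correction for ties or continuity, so it should not be expected to match a statistics package's Z exactly. The function name is hypothetical.

```python
import math


def mann_whitney_z(u, n1, n2):
    """Normal approximation to U, without tie or continuity corrections."""
    mean_u = n1 * n2 / 2                             # expected U under the null hypothesis
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)   # standard deviation of U under the null
    z = (u - mean_u) / sd_u
    # Two-tailed probability under the standard normal curve.
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2))
    return z, p_two_tailed


# Assumed inputs: U = 25, N1 = 8, N2 = 9.
z, p = mann_whitney_z(25, 8, 9)
print(round(z, 3), round(p, 3))
```

With samples this small, the table of critical values (or an exact significance value) is usually preferred to the approximation.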
Wilcoxon using SPSS - procedure:

[Figure: SPSS procedure screenshots]

Wilcoxon using SPSS - output:

Ranks (silence - music):
                        N    Mean Rank   Sum of Ranks
Negative Ranks          4 (a)
Positive Ranks          3 (b)
Ties                    1 (c)
Total                   8
a. silence < music
b. silence > music
c. silence = music

Test Statistics (b) (silence - music):
Z                                (a)
Asymp. Sig. (2-tailed)
a. Based on positive ranks.
b. Wilcoxon Signed Ranks Test
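For hand calculation, Steps 1 to 4 of the Wilcoxon test described earlier can be sketched in Python as follows. The function name and the two lists of ratings are hypothetical; they are not the lecture's data, and were chosen only so that there is one zero difference and some tied differences.

```python
def wilcoxon_w(condition_a, condition_b):
    """Return (W, N): the smaller signed-rank sum and the number of non-zero differences."""
    # Step 1: difference for each pair, keeping the sign; zero differences are dropped.
    diffs = [a - b for a, b in zip(condition_a, condition_b) if a != b]

    # Step 2: rank the differences ignoring sign; tied values share their average rank.
    ordered = sorted(abs(d) for d in diffs)

    def average_rank(value):
        first = ordered.index(value) + 1          # rank of the first occurrence
        last = first + ordered.count(value) - 1   # rank of the last occurrence
        return (first + last) / 2

    # Step 3: sum the ranks of positive and of negative differences separately.
    positive_sum = sum(average_rank(abs(d)) for d in diffs if d > 0)
    negative_sum = sum(average_rank(abs(d)) for d in diffs if d < 0)

    # Step 4: W is the smaller sum; N excludes the zero differences.
    return min(positive_sum, negative_sum), len(diffs)


# Hypothetical ratings for eight subjects under two conditions (not the lecture's data).
ratings_a = [5, 7, 6, 9, 4, 8, 6, 7]
ratings_b = [7, 7, 3, 10, 8, 3, 5, 9]
w, n = wilcoxon_w(ratings_a, ratings_b)
print(w, n)  # 13.5 7
```

Here the positive ranks sum to 13.5 and the negative ranks to 14.5, so W = 13.5 with N = 7; this W would then be compared with the critical value for N = 7.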