Increasing Power in Paired-Samples Designs. by Correcting the Student t Statistic for Correlation. Donald W. Zimmerman. Carleton University

Similar documents
Psicológica ISSN: Universitat de València España

A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions

Conventional And Robust Paired And Independent-Samples t Tests: Type I Error And Power Rates

COMPARISON OF THE ESTIMATORS OF THE LOCATION AND SCALE PARAMETERS UNDER THE MIXTURE AND OUTLIER MODELS VIA SIMULATION

AN IMPROVEMENT TO THE ALIGNED RANK STATISTIC

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF INDEPENDENT GROUPS ABSTRACT

8/29/2015 Connect Math Plus

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

NONPARAMETRICS. Statistical Methods Based on Ranks E. L. LEHMANN HOLDEN-DAY, INC. McGRAW-HILL INTERNATIONAL BOOK COMPANY

Formulas and Tables by Mario F. Triola

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY

A comparison study of the nonparametric tests based on the empirical distributions

r(equivalent): A Simple Effect Size Indicator

Chapter 15 Confidence Intervals for Mean Difference Between Two Delta-Distributions

Reliability and Quality Mathematics

IMPACT OF ALTERNATIVE DISTRIBUTIONS ON QUANTILE-QUANTILE NORMALITY PLOT

Five-Point Likert Items: t test versus Mann-Whitney-Wilcoxon

Comparison of Power between Adaptive Tests and Other Tests in the Field of Two Sample Scale Problem

Unit 14: Nonparametric Statistical Methods

Distribution-Free Tests for Two-Sample Location Problems Based on Subsamples

Non-parametric (Distribution-free) approaches p188 CN

Assessing the relation between language comprehension and performance in general chemistry. Appendices

ROBUST ESTIMATION OF A CORRELATION COEFFICIENT: AN ATTEMPT OF SURVEY

What p values really mean (and why I should care) Francis C. Dane, PhD

Formulation of L 1 Norm Minimization in Gauss-Markov Models

Model Estimation Example

MATH Notebook 3 Spring 2018

Testing For Aptitude-Treatment Interactions In Analysis Of Covariance And Randomized Block Designs Under Assumption Violations

Extending the Robust Means Modeling Framework. Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie

Multiple Comparison Procedures, Trimmed Means and Transformed Statistics. Rhonda K. Kowalchuk Southern Illinois University Carbondale

Correlation Interpretation on Diabetes Mellitus Patients

Testing for a unit root in an ar(1) model using three and four moment approximations: symmetric distributions

Presented to the Graduate Council of the. North Texas State University. in Partial. Fulfillment of the Requirements. For the Degree of.

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Practical Statistics for the Analytical Scientist Table of Contents

Robust Outcome Analysis for Observational Studies Designed Using Propensity Score Matching

Supporting Information for Estimating restricted mean. treatment effects with stacked survival models

A bias improved estimator of the concordance correlation coefficient

Simulation-based robust IV inference for lifetime data

ScienceDirect. Who s afraid of the effect size?

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption

Formulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion

On Polynomial Transformations For Simulating Multivariate Non-normal Distributions

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Zero-Mean Difference Discrimination and the. Absolute Linear Discriminant Function. Peter A. Lachenbruch

THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

An Alternative to Cronbach s Alpha: A L-Moment Based Measure of Internal-consistency Reliability

Robustness. James H. Steiger. Department of Psychology and Human Development Vanderbilt University. James H. Steiger (Vanderbilt University) 1 / 37

Inferences About the Difference Between Two Means

Structural equation modeling

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 16 Introduction

Contents 1. Contents

Methodology Review: Applications of Distribution Theory in Studies of. Population Validity and Cross Validity. James Algina. University of Florida

Transition Passage to Descriptive Statistics 28

ECE 516: System Control Engineering

Introduction to Biostatistics: Part 5, Statistical Inference Techniques for Hypothesis Testing With Nonparametric Data

MONTE CARLO ANALYSIS OF CHANGE POINT ESTIMATORS

Empirical Power of Four Statistical Tests in One Way Layout

Factor Analysis of Data Matrices

An Overview of the Performance of Four Alternatives to Hotelling's T Square

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

1 Overview. Coefficients of. Correlation, Alienation and Determination. Hervé Abdi Lynne J. Williams

Generalized nonparametric tests for one-sample location problem based on sub-samples

APPENDICES APPENDIX A. STATISTICAL TABLES AND CHARTS 651 APPENDIX B. BIBLIOGRAPHY 677 APPENDIX C. ANSWERS TO SELECTED EXERCISES 679

Weighted empirical likelihood estimates and their robustness properties

Assessing Normality: Applications in Multi-Group Designs

Comparing Two Dependent Groups: Dealing with Missing Values

On Selecting Tests for Equality of Two Normal Mean Vectors

Confidence Intervals for the Process Capability Index C p Based on Confidence Intervals for Variance under Non-Normality

A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION. Bercedis Peterson

OPTIMAL B-ROBUST ESTIMATORS FOR THE PARAMETERS OF THE GENERALIZED HALF-NORMAL DISTRIBUTION

TEST POWER IN COMPARISON DIFFERENCE BETWEEN TWO INDEPENDENT PROPORTIONS

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

A nonparametric two-sample wald test of equality of variances

Simulating Uniform- and Triangular- Based Double Power Method Distributions

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

THE EFFECTS OF NONNORMAL DISTRIBUTIONS ON CONFIDENCE INTERVALS AROUND THE STANDARDIZED MEAN DIFFERENCE: BOOTSTRAP AND PARAMETRIC CONFIDENCE INTERVALS

Multicollinearity and A Ridge Parameter Estimation Approach

Sample size calculations for logistic and Poisson regression models

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

A note on adaptation in garch models Gloria González-Rivera a a

A semi-bayesian study of Duncan's Bayesian multiple

arxiv: v1 [stat.me] 2 Mar 2015

A Signed-Rank Test Based on the Score Function

Modeling Hydrologic Chanae

Statistical comparison of univariate tests of homogeneity of variances

The Effect of Multicollinearity on Prediction in Regression Models Daniel J. Mundfrom Michelle DePoy Smith Lisa W. Kay Eastern Kentucky University

A3. Statistical Inference Hypothesis Testing for General Population Parameters

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Physics 509: Non-Parametric Statistics and Correlation Testing

Probability Distributions.

6 Single Sample Methods for a Location Parameter

Non-parametric confidence intervals for shift effects based on paired ranks

A Convenient Way of Generating Gamma Random Variables Using Generalized Exponential Distribution

A Test of Homogeneity Against Umbrella Scale Alternative Based on Gini s Mean Difference

Transcription:

Power in Paired-Samples Designs Running head: POWER IN PAIRED-SAMPLES DESIGNS Increasing Power in Paired-Samples Designs by Correcting the Student t Statistic for Correlation Donald W. Zimmerman Carleton University Key words: Student t test, Wilcoxon signed-ranks test, rank transformation, skewed distribution, symmetric distribution, nonparametric test, correlation, paired samples, repeated measures, Type I error, Type II error, power Send correspondence to: Donald W. Zimmerman 978 4A Street Surrey, BC V4A 6B6 Canada Phone: (64) 5-9 Fax: (64 ) 55-554 E-mail: dwzimm@telus.net

Power in Paired-Samples Designs Abstract Increasing Power in Paired-Samples Designs by Correcting the Student t Statistic for Correlation Donald W. Zimmerman Carleton University In order to circumvent the influence of correlation in paired-samples and repeated measures experimental designs, researchers typically perform a one-sample Student t test on difference scores. This procedure entails some loss of power, because it employs n degrees of freedom instead of the n degrees of freedom of the independent-samples t test. In the case of non-normal distributions, researchers typically substitute the Wilcoxon signed-ranks test for the one-sample t test. The present study explored an alternate strategy, using a modified two-samples t test with a correction for correlation. For non-normal distributions, the same modified t test was performed on rank-transformed data. Simulations disclosed that this procedure, which retains n degrees of freedom, protects the Type I error rate for moderate and large sample sizes, maintains power for normal distributions, and substantially increases power for many non-normal distributions.

Power in Paired-Samples Designs Increasing Power in Paired-Samples Designs by Correcting the Student t Statistic for Correlation Donald W. Zimmerman Carleton University Statistical analysis of paired-samples and repeated-measures experimental designs typically makes use of the one-sample Student t test on difference scores. In the first part of the last century, before modern statistical methods began to be used widely by researchers in psychology, education, and social sciences, paired-samples data sometimes were handled in a different way. Many introductory textbooks in this early period, focusing mainly on largesample studies, presented methods for dealing with what were called correlated samples, using a modification of the familiar two-sample z test. These textbooks calculated the standard deviation of a difference between means using the formula σ = σ + σ ρ σ σ, () X X X X X X where ρ is the correlation between X and X, and based the standard error on this value when calculating the large-sample z statistic (see, example, Guilford & Fruchter, 97; p. 54; McNemar, 955, p. 85). Modern textbooks (e.g., Hays, 988) occasionally include these formulas, although they now recommend the paired-samples t test, not the z test for correlated samples, as a practical method. Furthermore, after nonparametric methods became widely used to circumvent non-normality, the Wilcoxon signed-ranks test typically was used in place of the t test on difference scores when the normality assumption was questionable. The present study re-examined some two-sample significance tests for paired samples data, using formulas containing correlation coefficients. Instead of the large-sample z test, however, it employed a version of the two-sample Student t tests modified to allow for correlation. Thus, tests were based on n degrees of freedom, instead of the n degrees

Power in Paired-Samples Designs 4 of freedom of the one-sample t test. And in the case of non-normal distributions, it employed the same modified two-sample t test on rank-transformed data, instead of the Wilcoxon signed-ranks test. It turned out that, for a wide variety of non-normal distributions, this strategy brought about improvements in control of Type I error rates, as well as an increase in power to detect differences. A Modified Two-Sample t Test with a Correction for Correlation It is possible to derive a t test for correlated samples analogous to the obsolete z test for correlated samples. The derivation incorporates the more general standard deviation of a difference between means given by equation (), instead of the conventional formula for the standard error of a difference (Zimmerman, Williams, & Zumbo, 99). The result is t = x X + X nn ( ) x ( ρ ), () where n is the number of pairs, x and x are sums of squares, and ρ is the population correlation. Otherwise expressed, if t is the conventional statistic based on two independent samples of n observations each, then t = t/c, where c = ρ. However, in many, if not most, research studies that analyze paired data, the value of a population correlation ρ is not known. In order to make use of equation () in practical significance testing, it is necessary to substitute an obtained sample correlation coefficient, r, for the population correlation. The present simulation study investigated how this modified test compares to the paired-samples t test with regard to Type I error rates and power, and how substitution of a sample estimate, r, for the population correlation, ρ, affects the accuracy of the result.

Power in Paired-Samples Designs 5 Method The present study compared Type I error rates and power of several significance tests performed on correlated samples from normal and non-normal distributions. Three of the nonnormal distributions were symmetric and 7 were skewed. The significance tests were () the usual independent-samples Student t test, () the usual paired-samples Student t test, () the Wilcoxon signed-ranks test, (4) the modified t test, using sample correlation coefficients, r, in place of the population correlation ρ, and (5) the modified t test, using a fixed value of ρ. The random number generator in this study was introduced by Marsaglia, Zaman, and Tsang (99) and has been described by Pashley (99, pp. 95-45). Normal variates, N(,), were generated by the rejection method of Marsaglia and Bray (964) and were transformed to have various distribution shapes using inverse distribution functions. The sample values were further transformed to have mean and standard deviation. In some cases, constants were added to all the scores in one sample in increments of.σ,.σ or.5σ to produce systematic differences in means and to determine the power of the tests. The algorithms for generating the various non-normal distributions are described in Table A in the appendix. For further details concerning simulation of non-normal variates, see, for example, Evans, Hastings, & Peacock (), Gentle (998), Morgan (99), and Patel, Kapadia, & Owen (976). There were 5, iterations of the sampling procedure for each condition in the study, except for the ones in Table, where there were, iterations. All significance tests were non-directional. Results of Simulations Table compares Type I error rates of 4 significance tests performed on samples from normal distributions. Sample sizes ranged from 5 to and population correlations were

Power in Paired-Samples Designs 6 ----------------------------------------------------------------------- Insert Table about here -----------------------------------------------------------------------,.5, and.5. The first column in each section of the table, labeled t, shows results of the independent-samples t test and the second column, labeled t P, shows results of the paired-samples t test, that is, the one-sample t test on difference scores. As expected, the paired-samples test maintained the Type I error rate close to the significance level for all sample sizes and all degrees of correlation, while the Type I error rate of the independent-samples test declined below the nominal significance level as the correlation increased. The same was true for both the.5 and. significance levels. These results are consistent with many previous studies. The next column in each section, t C (r), gives results for the modified two-sample t test given by equation (), using the correlation coefficients between sample values (r) obtained on each iteration of the sampling procedure. For relatively small sample sizes 5,, and 5 the probabilities of Type I errors were noticeably inflated. However, the discrepancy became progressively less as the sample size increased. The next column, t C (ρ), gives results for the same modified two-sample test, using the fixed population correlation (ρ) in the formula. In this case, the control of the Type I error rate improved to some extent, although for the smaller sample sizes there still was some discrepancy. For the larger sample sizes, or more, there was little or no difference in the performance of t P, t C (r), and t C (ρ) for all degrees of correlation. A different picture emerges when the power of the tests to detect differences is considered. Table presents probabilities of rejecting H for normal and various non-normal ---------------------------------------------------------------------------- Insert Table about here ----------------------------------------------------------------------------

Power in Paired-Samples Designs 7 distributions, as the difference between population means varied from to.5σ in increments of.5σ. Population correlations were.5 and.5, sample sizes were 5 and 5, and significance levels were.5 and.. The first column in each section, t P, shows results for the paired-samples t test. The next column, W, shows results for the large-sample normal-approximation version of the Wilcoxon signed-ranks test. The next two columns, t C (r), and t C (ρ), are results of the modified t test applied to rank-transformed data. That is, scores in the two samples were combined and transformed to ranks, after which the modified t test given by equation () was performed on the ranks replacing the original scores in their respective groups. As before, the column labeled t C (r) is the result of using sample correlation coefficients in the equation, and the column labeled t C (ρ) is the result of using the population ρ. Comparing the first two columns in each section, it is clear that the outcomes for the paired-samples t test and the Wilcoxon signed-ranks test were consistent with many previous studies using non-normal distributions. For all distributions, the Wilcoxon test protected the Type I error rate just as well as the t test, and for many of the non-normal distributions, especially skewed distributions, the Wilcoxon test was more powerful than the t test. However, the last two columns indicate that the modified t test on rank-transformed data in many instances was even more powerful than the Wilcoxon test. Furthermore, this difference is noticeable for the same distributions where the Wilcoxon test was a significant improvement over the t test. In the case of the 4 symmetric distributions in Table the normal, Laplace, rectangular, and bimodal distributions there are minimal differences among t P, t C (r), and t C (ρ). However, for the other 7 distributions, which are skewed, the power differences are more evident. The last column of Table A contains coefficients of skewness and kurtosis. Figures and give a more detailed picture of some power functions for 4 skewed

Power in Paired-Samples Designs 8 ------------------------------------------------------------------------- Insert Figures and about here ------------------------------------------------------------------------- distributions (exponential, lognormal, mixed-normal, and truncated normal), where differences between means increased in increments of.σ. It is evident that the power function of each version of the modified t test dominated both the Wilcoxon signed-ranks test and the paired samples t test. Figures and 4 show how these power differences depend on sample sizes for ------------------------------------------------------------------------- Insert Figures, 4, 5, and 6 about here ----------------------------------------------------------------------------- several distributions. In the simulations represented by these figures, there was a fixed difference between means of.σ. In all cases the power of the tests increased as sample size increased, as expected. In addition, the power differences of the various tests increased as sample size increased from about 75 to and then declined somewhat as sample sizes increased further to 5. Furthermore, the rank order of the four significance tests, from least powerful to most powerful, remained the same over the entire range of sample sizes, and the modified t test on rank-transformed data dominated in the case of the four skewed distributions (exponential, lognormal, mixed-normal and truncated normal in Figures,,, and 5), where there is considerable separation of the curves, and slightly dominated in the case of the Laplace distribution (Figure 6). For the normal distribution (Figure 4 ), the paired-samples t test was slightly superior, as expected. Further Comments Researchers typically employ the one-sample Student t test on difference scores when they believe data obtained from paired-samples or repeated-measures designs to be normally

Power in Paired-Samples Designs 9 distributed. This test protects the Type I error rate and has good power to detect differences. The present study revealed that, for large samples say 5 or larger from normal distributions, the modified t test using a correction for correlation performs very nearly the same as the pairedsamples t test. This is true when a known population correlation, ρ, is substituted in equation () and also when a sample correlation, r, obtained on each replication is used. On the other hand, the test with a sample r does not perform well for small sample sizes 5,, 5, or, where the Type I error rates are seriously inflated. This result can be attributed to the sampling variability of r for these small n s. As sample size increases, the variability of r declines, and the t statistic becomes more stable. The test based on a known population correlation, ρ, performs well even for the small sample sizes. In the case of non-normal distributions, the modified t test on rank-transformed data showed a substantial power gain relative to the paired-samples t test. The Wilcoxon signed-ranks test, the paired-samples t test, and the modified t test on rank-transformed data, all had about the same power for the symmetric non-normal distributions in the present study. This was true for both degrees of correlation and both significance levels. However, for the 7 skewed non-normal distributions, the modified t test on rank-transformed data was substantially more powerful than the Wilcoxon signed-ranks test. Comparisons of the paired-samples t test and the Wilcoxon test were consistent with many previous studies (see, for example, Andrews, et. al., 97; Hodges & Lehmann, 956; Randles & Wolfe, 979; Zimmerman & Zumbo, 99). After conversion of scores to ranks, the modified t test on ranks replacing scores displayed the same advantages possessed by the Wilcoxon signed-ranks test for symmetric distributions, and it performed better than the Wilcoxon test for skewed distributions. The superiority of the modified t test can be explained in part because it is evaluated at

Power in Paired-Samples Designs n degrees of freedom, instead of n degrees of freedom. Figures, 4, 5, and 6 indicate that the power difference of the modified test, for skewed distributions, is somewhat greater for the smaller sample sizes. For the small n s, a given difference in degrees of freedom corresponds to a rather large difference in critical values of the t statistic. However, for larger sample sizes, the difference in critical values is smaller, so the gain is somewhat less. Under some conditions, therefore, the modified t test on rank-transformed data can be useful in practical significance testing. In paired-samples designs, when the number of pairs is, say 5 or larger, and the population distribution is known or suspected to be non-normal, this test is a good choice. When samples are correlated, the modified t test on rank-transformed data and the Wilcoxon signed-ranks test both effectively protect the Type I error rate. If the distribution is symmetric, the two tests are about equally powerful, and if the distribution is skewed, the modified t test is substantially more powerful. For non-normal distributions, the modified t test retains the advantage of familiar nonparametric rank methods and, at the same time, the greater number of degrees of freedom of a two-sample test.

Power in Paired-Samples Designs References Andrews, D.F., Bickel, P.J., Hampel, F.R., Huber, P.J., Rogers, W.H., & Tukey, J.W. (97). Robust estimates of location: Survey and advances. Princeton, NJ: Princeton University Press. Evans, M., Hastings, N., & Peacock, B. (). Statistical distributions (rd ed.). New York: Wiley. Gentle, J.E. (998). Random number generation and Monte Carlo methods. New York: Springer. Guilford, J. P., & Fruchter, B. (97). Fundamental statistics in psychology and education (5th ed.). New York: McGraw-Hill. p. 54 Hays, W.L. (988). Statistics (4th ed.). New York: Holt, Rinehart, & Winston. Hodges, J. & Lehmann, E. (956). The efficiency of some nonparametric competitors of the t test. Annals of Mathematical Statistics, 7, 4-5. Marsaglia, G., Zaman, A., & Tsang, W.W. (99). Toward a universal random number generator. SIAM Review, 6, 6-64. McNemar, Q. (955). Psychological statistics (nd ed.). New York: Holt, Rinehart, & Winston. p. 85 Mehrens, W.A., & Lehmann, I.J. (99). Measurement and evaluation in education and psychology (4th ed.). Chicago: Holt, Rinehart, & Winston. Morgan, B.J.T. (99). Elements of Simulation. London: Chapman & Hall. Pashley, P.J. (99). On generating random sequences. In G. Keren & C. Lewis (Eds.), A handbook for data analysis in the behavioral sciences: Methodological issues. Hillsdale, NJ: Lawrence Erlbaum Associates. (pp. 95-45) Patel, J.K., Kapadia, C.H., & Owen, D.B. (976). Handbook of statistical distributions. Randles, R.H., & Wolfe, D.A. (979). Introduction to the theory of nonparametric statistics. New York: Wiley.

Power in Paired-Samples Designs Zimmerman, D.W., & Zumbo, B.D. (99). In G. Keren & C. Lewis (Eds.), A handbook for data analysis in the behavioral sciences: Methodological issues. Hillsdale, NJ: Lawrence Erlbaum Associates. (pp. 48-57) Zimmerman, D.W., Williams, R.H., & Zumbo, B.D. (99). Effect of nonindependence of sample observations on parametric and nonparametric statistical tests. Communications in Statistics: Simulation and Computation,, 779-789.

Power in Paired-Samples Designs Appendix A Let U be a unit rectangular variate and Y a unit normal variate. The distributions in the study were generated using the transformations in Table A (inverse distribution functions). After these transformations, X was standardized by subtracting the mean and dividing by the standard deviation, resulting in a common mean of and standard deviation for all distributions. Table A. Transformations (inverse distribution functions) used to generate variates. Distribution Transformation Coefficients of skewness and kurtosis exponential λ = X = log(u) 9 lognormal shape parameter scale parameter X = exp(y) 6.56 4.5 chi-square shape parameter 4 X = X 4 = Yi, Y is N(,).44 i= 6 Gumbel (extreme value) location parameter scale parameter power function shape parameter.5 scale parameter X = log( log U).4 5.4 X = U.64.4 truncated normal X = N(,), if X >.5 then X =.5.6.478 mixed-normal with outliers (skewed) X = N(,) with probability.75 X=abs(X) with probability.5.56.6

Power in Paired-Samples Designs 4 Table A (continued). Distribution Laplace (double exponential) location parameter scale parameter Transformation X = log(u /U ), where U and U are unit rectangular Coefficients of Skewness and Kurtosis 6 rectangular X = U.8 bimodal (symmetric) X = N(,) with probability.5 X = N(,) with probability.5.9

Power in Paired-Samples Designs 5 Footnote. The computer program used in this study, written in PowerBasic, version.5, PowerBasic, Inc., Carmel, CA, can be obtained by writing to the author.

Power in Paired-Samples Designs 6 Table. Probability of rejecting H for various sample sizes (n) and population correlations (ρ) with significance levels of.5 and. (normal distribution). α =.5 ρ = ρ =.5 ρ =.5 n t t P t C (r) t C (ρ) t t P t C (r) t C (ρ) t t P t C (r) t C (ρ) 5.5.5.95.5..5..5.4.5..55.5.5.67.5.6.5.7.5..5.7.5 5.5.5.6.5.6.5.6.5.9.5.6.5.5.5.58.5.5.5.59.5.7.49.6.5 5.5.5.57.5.4.49.55.49.7.5.58.5 5.49.5.5.49.4.5.5.5.7.5.55.5 75.5.5.5.5.4.5.5.5.6.49.5.5.5.5.5.5.4.5.5.5.6.49.5.5 5.5.5.5.5.4.5.5.5.6.5.5.5.49.49.49.49.4.5.5.5.6.5.5.5 α =. 5...8..5..4....48....9..4....... 5...6..4..6....7....4..4..5....7. 5......9.....4. 5............ 75...................... 5......................

Power in Paired-Samples Designs 7 Table. Probability of rejecting H as a function of difference in population means (α =.5, ρ =.5). n = 5 n = 5 Distribution μ μ t P W t C (r) t C (ρ) t P W t C (r) t C (ρ) exponential.46.7.65.94.47.66.78.944.55..796.974.44.94.785.974.5.9.6.9.5.75.74.964.54.4.8.98.47.95.84.98 lognormal chi-square Gumbel power function truncated normal mixed-normal normal Laplace rectangular bimodal.8.96.7.9.47.7.6.96.49.6.68.9.49.96.6.95.5.6.67.94.5.6.67.97.5..6.95.48..64.99.49.96.596.98.5.97.6.9.48.48.896.986.47..645.95.5..64.9.47.9.567.887.49..6.96.49.65.688.97.48.89.579.9.47.4.646.96.46.79.549.885.48.87.576.898.57.6.965.998.56.54.7.95.58.6.667.94.56.8.66.899.59.46.656.94.57.9.78.969.57.6.595.97.56.56.7.95.55.9.55.88.56..594.9.4.56.965.998.46.8.696.95.5..66.94.49.7.64.96.5..65.96.45.8.767.969.5..595.9.47.9.694.95.5.88.56.886.5.99.59.98.4.64.687.96.49.7.7.95.5.8.67.95.5..6.9.48..6.9.5.5.6.96.5..69.99.5.5.64.94.5.98.66.9.49.99.6.98.5.57.947.997.49.8.666.94.5.7.645.9.49..59.896.49.6.67.98.5.68.74.956.5.9.587.97.5.5.665.94.49.8.559.888.48.9.586.94.54.67.98..54.47.76.958.54..67.944.54.5.68.99.5.46.666.96.5..794.98.5.98.59.95.55.48.75.956.5.84.549.878.5.97,59.9.4.585.98..49.7.7.958.5.6.668.946.5..66.9.47.6.66.97.45.77.78.98.5.97.597.99.5.7.7.956.54.89.559.886.5.97.597.98

Power in Paired-Samples Designs 8 Table (continued) α =.5 ρ =.5 n = 5 n = 5 Distribution μ μ t P W t C (r) t C (ρ) t P W t C (r) t C (ρ) exponential.47.8.64.94.48.68.78.94.57.9.748.96.57.87.76.967.47.4.67.94.47.77.748.965.5.84.759.97.56.94.775.976 lognormal chi-square Gumbel power function truncated normal mixed-normal normal Laplace rectangular bimodal.9.9.74.9.48.8.64.9.5.6.64.98.5.96.597.96.5..66.9.5.7.6.99.49..6.94.49.4.6.96.5.96.598.96.5.98.6.9.47.474.896.986.47..65.97.5..6.9.48.9.566.887.47.7.598.95.5.6.69.99.47.9.578.899.48.8.644.95.49.8.55.884.48.87.577.897.55.5.94.996.56.8.67.96.58..645.95.57..59.895.57.4.6.9.57.8.74.954.55..587.899.55..658.9.58.9.56.885.57.99.58.897.56.58.954.998.54.5.679.94.55.5.649.9.5.5.596.9.55.9.6.97.57.8.74.954.5..59.94.55..667.99.5.89.564.887.5..589.9.4.64.687.98.49.9.6.97.48.4.69.97.5.99.6.9.5..69.98.5.6.66.97.5..6.97.5.6.68.94.5..6.9.5..64.9.49.5.947.997.49..67.94.48.5.649.94.5.98.585.897.49..6.96.5.7.79.956.49.9.589.96.5.8.668.94.49.87.565.889.5.94.589.95.5.58.965.999.5..67.94.5.8.64.9.54.5.6.95.5..6.98.54.76.75.968.54.97.585.9.55.8.66.99.5.9.56.886.5.98.586.9.59.557.97..57.4.685.95.54.6.654.98.55.8.6.9.54.7.644.95.58.89.769.974.56..598.98.58.9.678.947.5.94.57.89.55..597.97

Power in Paired-Samples Designs 9 Table (continued) α =. ρ =.5 n = 5 n = 5 Distribution μ μ t P W t C (r) t C (ρ) t P W t C (r) t C (ρ) exponential.8.75.77.75.8.9.4.79..48.589.97.8.8.56.97..76.77.77..5.499.868..4.6.95.9..587.9 lognormal chi-square Gumbel power function truncated normal mixed-normal normal Laplace rectangular bimodal.5..5.798.9.7.55.74.9.7.5.74..67..79..7.47.745.8.7.5.79..68.6.745.9.7.59.747..6.9.74..66.4.74.8.9.75.9.8.7.6.75.8.65.4.79.7.6.87.67.9.67..7.8.9.44.74.8.56.95.698.8.7.64.75.8.5.7.664.8.55.9.69.4.7.89.99..4.47.847.4.95.4.8..87.67.74.4..49.795..9.574.894.4.78.55.75..8.47.84.4.69.6.7.4.77.5.747.8.7.88.99.9.88.447.844..8.44.8..74.58.748..86.4.794.9..54.89..7.49.757..9.45.844..65.6.7..69.47.75.7.7.48.8.9.74.68.767.9.75.65.77..7.57.77..7.6.77..74.6.76..7.5.769..7.66.768..7.5.767..7.55.768.9.7.87.985.9.8.49.8.9.7.8.796.9.66..7.8.74.58.76..4.484.844.9.64.7.77..8.46.8.9.58..7.9.64.5.77..8.97.998...475.868..8.4.86..8.77.765..98.47.88...587.97..7.5.756..99.47.86..65..7..7.48.754.8..98.998.9.9.458.865..85.44.8..77.7.769.9.88.45.88.8..56.9..7.5.76..89.456.86..66.7.74..7.5.758

Power in Paired-Samples Designs Table (continued). α =. ρ =.5 n = 5 n = 5 Distribution μ μ t P W t C (r) t C (ρ) t P W t C (r) t C (ρ) exponential.8.76.74.75.7.94.4.79..6.5.874..4.548.894.8.74.8.774.8.5.5.87..7.59.9...56.95 lognormal chi-square Gumbel power function truncated normal mixed-normal normal Laplace rectangular bimodal.5..5.795.8.7.57.748.8.7.5.748..66.9.74..67.4.74.9.7.54.74..68.8.74.9.68.5.74..67.8.79..67.5.74.8.5.7.9.7.7.6.756.8.68.44.744.8.58.86.67.8.65.6.77.9.9.44.744.8.57.96.696.8.68.6.749.8.5.7.666.8.56.9.694..7.84.98..95.4.86..9.44.79.4.8.5.7..9.88.76..4.5.86..77.45.78..9.4.8.4.7.8.78..77.4.74..4.868.99..95.444.85..88.44.8..78.6.747..9.94.78..5.59.885..77.55.754..95.47.87..7.8.7..76.5.75.6.4.477.8..7.7.77.9.7.65.77..69.55.77..69.57.769..7.66.77..7.56.77.9.74.67.768..7.57.77..7.59.77.9.7.87.98.9.8.4.86.9.7.8.796.9.66.7.7..7.54.756..4.49.848.9.64.8.74..8.49.8.9.6.5.76..64.9.79..7.888.995..89.4.8..8.4.86..77.58.75..89.9.78..6.5.894..7.44.746..9.44.8..68..78..7.44.74.4.4.96.997..94.447.847..85.44.8..79.67.76..9.4.794...55.9..74.55.76..96.44.84..69.9.75..74.54.755

Power in Paired-Samples Designs Figure Captions Figure. Power functions of four significance tests on paired data for exponential and mixednormal distributions. Figure. Power functions of four significance tests on paired data for lognormal and truncated normal distributions. Figure. Power of four significance tests on paired data, for a fixed difference between population means, as a function of sample size (exponential distribution). Figure 4. Power of four significance tests on paired data, for a fixed difference between population means, as a function of sample size (normal distribution). Figure 5. Power of four significance tests on paired data, for a fixed difference between population means, as a function of sample size (lognormal distribution). Figure 6. Power of four significance tests on paired data, for a fixed difference between population means, as a function of sample size (Laplace distribution).

Power in Paired-Samples Designs Probability of Rejecting H..9.8.7.6.5.4.. Exponential Distribution n = α =.5 Student t Wilcoxon modified t, sample r modified t, population ρ.. 4 5 6 7 8 9 Difference in Standard Units. Mixed-Normal Distribution n = α =. Probability of Rejecting H.9.8.7.6.5.4.. Student t Wilcoxon modified t, sample r modified t, population ρ.. 4 5 6 7 8 9 Difference in Standard Units

Power in Paired-Samples Designs..9 Lognormal Distribution ρ =.5 n = α =.5 Probability of Rejecting H.8.7.6.5.4... Student t Wilcoxon modified t, sample r modified t, population ρ. 4 5 6 7 8 9 Difference in Standard Units. Truncated Normal Distribution ρ =.5 n = α =. Probability of Rejecting H.9.8.7.6.5.4.. Student t Wilcoxon modified t, sample r modified t, population ρ.. 4 5 6 7 8 9 Difference in Standard Units

Power in Paired-Samples Designs 4..9 Exponential Distribution ρ =.5 α =.5 μ μ =. Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)..9 Exponential Distribution ρ =.5 α =. μ μ =. Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)

Power in Paired-Samples Designs 5. Normal Distribution ρ =.5 α =.5 μ μ =..9 Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)..9 Normal Distribution ρ =.5 α =. μ μ =. Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)

Power in Paired-Samples Designs 6. Lognormal Distribution ρ =.5 α =.5 μ μ =..9 Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)..9 Lognormal Distribution ρ =.5 α =. μ μ =. Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)

Power in Paired-Samples Designs 7. Laplace Distribution ρ =.5 α =.5 μ μ =..9 Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)..9 Laplace Distribution ρ =.5 α =. μ μ =. Probability of Rejecting H.8.7.6.5.4... paired samples t Wilcoxon signed ranks modified, sample modified, population. 5 5 75 5 5 Sample Size (n)