Comparison of Power between Adaptive Tests and Other Tests in the Field of Two Sample Scale Problem

Similar documents
Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Introduction to Statistical Data Analysis III

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF INDEPENDENT GROUPS ABSTRACT

Distribution-Free Tests for Two-Sample Location Problems Based on Subsamples

5 Introduction to the Theory of Order Statistics and Rank Statistics

GENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE

Non-parametric Inference and Resampling

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption

Statistical Procedures for Testing Homogeneity of Water Quality Parameters

APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV

A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions

The Chi-Square Distributions

Extending the Robust Means Modeling Framework. Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Distribution-Free Procedures (Devore Chapter Fifteen)

Chapter 23: Inferences About Means

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Conventional And Robust Paired And Independent-Samples t Tests: Type I Error And Power Rates

Applications of Basu's TheorelTI. Dennis D. Boos and Jacqueline M. Hughes-Oliver I Department of Statistics, North Car-;'lina State University

Two-sample scale rank procedures optimal for the generalized secant hyperbolic distribution

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures

The Chi-Square Distributions

Tentative solutions TMA4255 Applied Statistics 16 May, 2015

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Transition Passage to Descriptive Statistics 28

A comparison study of the nonparametric tests based on the empirical distributions

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47

Adaptive Procedures for the Wilcoxon Mann Whitney Test: Seven Decades of Advances

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY

TESTS BASED ON EMPIRICAL DISTRIBUTION FUNCTION. Submitted in partial fulfillment of the requirements for the award of the degree of

6 Single Sample Methods for a Location Parameter

Does k-th Moment Exist?

A nonparametric two-sample wald test of equality of variances

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Frequency Distribution Cross-Tabulation

TMA4255 Applied Statistics V2016 (23)

Generalized nonparametric tests for one-sample location problem based on sub-samples

Non-parametric tests, part A:

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

STAT 135 Lab 8 Hypothesis Testing Review, Mann-Whitney Test by Normal Approximation, and Wilcoxon Signed Rank Test.

Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)

Multiple Comparison Procedures, Trimmed Means and Transformed Statistics. Rhonda K. Kowalchuk Southern Illinois University Carbondale

Section 3: Permutation Inference

f (1 0.5)/n Z =

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Testing homogeneity of variances with unequal sample sizes

psychological statistics

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

ADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ

Section 3 : Permutation Inference

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

A Monte-Carlo study of asymptotically robust tests for correlation coefficients

TESTS FOR MEAN EQUALITY THAT DO NOT REQUIRE HOMOGENEITY OF VARIANCES: DO THEY REALLY WORK?

An adaptive test for the two-sample scale problem based on U-statistics

AN IMPROVEMENT TO THE ALIGNED RANK STATISTIC

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Unit 14: Nonparametric Statistical Methods

NONPARAMETRICS. Statistical Methods Based on Ranks E. L. LEHMANN HOLDEN-DAY, INC. McGRAW-HILL INTERNATIONAL BOOK COMPANY

Contents 1. Contents

Better Bootstrap Confidence Intervals

YUN WU. B.S., Gui Zhou University of Finance and Economics, 1996 A REPORT. Submitted in partial fulfillment of the requirements for the degree

Increasing Power in Paired-Samples Designs. by Correcting the Student t Statistic for Correlation. Donald W. Zimmerman. Carleton University

MATH4427 Notebook 4 Fall Semester 2017/2018

Computer simulation on homogeneity testing for weighted data sets used in HEP

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Resampling and the Bootstrap

A Test of Cointegration Rank Based Title Component Analysis.

Contents. Acknowledgments. xix

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

The Number of Bootstrap Replicates in Bootstrap Dickey-Fuller Unit Root Tests

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Advanced Experimental Design

Non-parametric (Distribution-free) approaches p188 CN

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

A Test of Symmetry. Abdul R. Othman. Universiti Sains Malaysia. H. J. Keselman. University of Manitoba. Rand R. Wilcox

Inferential statistics

THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

COMPARISON OF THE ESTIMATORS OF THE LOCATION AND SCALE PARAMETERS UNDER THE MIXTURE AND OUTLIER MODELS VIA SIMULATION

Kumaun University Nainital

Finite Population Correction Methods

An Overview of the Performance of Four Alternatives to Hotelling's T Square

Bios 6648: Design & conduct of clinical research

Glossary for the Triola Statistics Series

An Omnibus Consistent Adaptive Percentile Modified. Wilcoxon Rank Sum Test with Applicaitions in Gene. Expression Studies

STATISTIKA INDUSTRI 2 TIN 4004

A Signed-Rank Test Based on the Score Function

Statistics for Managers using Microsoft Excel 6 th Edition

Problem 1 (20) Log-normal. f(x) Cauchy

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Bayesian Predictive Modeling for Exponential-Pareto Composite Distribution

Transcription:

Comparison of Power between Adaptive Tests and Other Tests in the Field of Two Sample Scale Problem Chikhla Jun Gogoi 1, Dr. Bipin Gogoi 2 1 Research Scholar, Department of Statistics, Dibrugarh University, Dibrugarh, PIN- 786004, Assam, India 2 Professor, Department of Statistics, Dibrugarh University, Dibrugarh, PIN- 786004, Assam, India Abstract: Usually when we have to deal with an analysis of a particular data set, we often choose a statistical procedure on the reference of the analysis. Now if in this analysis the parametric assumptions are fulfilled we will prefer F test for the two sample scale problem. But if the behaviour of the data set does not fulfil these assumptions then we have to focus on some other tests. In this paper it is tried to throw light on the use of Adaptive tests on the context of this problem. This paper comprises of power comparison between F test, some traditional Non- Parametric tests and proposed Adaptive tests. Under the experiment equal and unequal sample sizes are taken considering different alternatives. This is done by the use of Monte Carlo simulation technique. At the end of the experiment, inspite of many limitations, some satisfactory results are obtained. All those results are displayed in tabular and graphical form. Keywords: F test, Mood Test, Sum of Squared Rank Test, Klotz test, Seigel Tukey test, Ansari Bredley Test, Adaptive test, Monte Carlo Simulation, size and power 1. Introduction and be To test the hypothesis of equality of two scale parameters, there is always a problem that which test is to use. According to literature if the parent distribution is normal, then F test is preferred. But if the normality assumptions are violated specially when N is large then F test become extremely sensitive. In that situation a researcher has to focus on Non Parametric tests. Various Non Parametric tests are available for using as an alternative of F test such as mood test, Klotz test, Seigel Tukey test, Sum of Squared Rank Test, Ansary Bredly test and so on. But still there is a problem to decide the accurate test for the particular set of data so that to get a valid result and draw a valid inference.. In this paper it is tried to get the solution by using the adaptive procedure. In this procedure at the very beginning we will collect some reasonable information from the data and then we will decide which test will be better for the data for further approach. In this paper power comparisons are made between different parametric and non parametric tests and then it is decided which test can be considered as our Adaptive test. 2. Parametric Test Procedures 2.1 F test Let X 1,..., X n and Y 1,..., Y m be independent and identically distributed samples from two populations which each have a normal distribution. The expected values for the two populations can be different, and the hypothesis to be tested is that the variances are equal. Let = and be the sample means. Let the sample variances. Then the test statistic F= has an F-distribution with n 1 and m 1 degrees of freedom if the null hypothesis of equality of variances is true. Otherwise it has a non-central F-distribution. The null hypothesis is rejected if F is either too large or too small. 3. Non- Parametric Test Procedures for Two Sample Scale Problem 3.1 Mood test Mood test is completely analogous to the parametric F-test mentioned above. Let X 1,X 2,..., Xn be an independent random sample from a distribution F and Y 1, Y 2,..., Ym be an independent random sample from a distribution G. Now we combine the two sample and rank them in ascending order. So the array of ordered scores will be v 1 <v 2 <...<v N. Here our hypothesis (H0) is to test that the two distributions, F and G, are identical with respect to scale, against some ordered alternative. We Replace the scores in the pooled sample by their corresponding ranks. Knowing that the mean of a set of ranks from 1 to N is (N+1)/2,we determine the sum of squared rank deviations about this mean for the X sample. Let Zi be the indicator variable. Z i = 1 if the rank score i belongs to X Sample and 0 otherwise. The test statistic for N<20 is M = Paper ID: ART20171203 143

A large value of M implies that the variability of the X sample is significantly greater than the variability of the Y sample. For small values of M, one draws the opposite conclusion. A table of critical values for N<20 is not available at the present time. W= For large sample the test statistics is : Z= When N is greater than 20, values of M approximate a normal distribution with E(M) = n(n 2-1)/12 and Var(M)= nm(n+1)(n 2-4)/180..In this case the test statistic is Z = 3.2 Klotz Test Let X 1,X 2,..., Xn be an independent random sample from a distribution F and Y 1, Y 2,..., Ym be an independent random sample from a distribution G. Now we combine the two sample and rank them in ascending order. So the array of ordered scores will be v 1 <v 2 <...<v N. Here our hypothesis (H0) is to test that the two distributions, F and G, are identical with respect to scale, against some ordered alternative. We Replace the scores in the pooled sample by their corresponding ranks. Knowing that the mean of a set of ranks from 1 to N is (N+1)/2,we determine the sum of squared rank deviations about this mean for the X sample. Let Zi be the indicator variable. Z k = 1 if the rank score k belongs to X Sample and 0 otherwise. The test statistic is: Klotz= When N is large then the values of Klotz statistic approximate a normal distribution with E(Kl) = (1/N) = mean and Var(Kl)=..In this case the test statistic is Z = Where E(W) = Var(W) = if m + n is even. = if m + n is odd if m+n is even = if m+n is odd 3.4 Siegel-Tukey Test This test replaces the pooled data from the two samples with are ordering of the ranks (i) from 1 to N.to illustrate the ranking procedure, note the following table when N is assumed to be an even number. Ordered score v 1 v 2 v 3 v 4... v N/2... v N-3 v N-2 v N-1 v N Rank Replacement 1 4 5 8... N... 7 6 3 2 The indicator random variable is Z i =1, if ith replacement score is associated with the X sample. =0, otherwise The test statistics is- ST= In order to determine the significance of ST tables have been developed by Siegel Tukey(1960). For N>20 the distribution of ST approximates a normal distribution withe(st)= n (N+1)/2 and Var (ST) = nm (N+1)/12.The test statistic becomes Z= 3.3Ansari Bradley Test To compute the Ansari- Bradley statistic special scores are attributed to the observations: Let Z=(X 1,X 2,... X m,y 1,Y 2,...,Y n ) denote the combined sample. If N= m + n is even then Sample Z: Z (1), Z (2),... Z (N/2-1), Z (N/2), Z (N/2+1), Z (N/2+2),..., Z (N-1), Z (N) Scores A i : 1 2... N/2-1 N/2 N/2 N/2-1... 2 1 If N= m + n is odd then Sample Z: Z (1), Z (2),... Z ((N-1)/2), Z ((N+1)/2), Z ((N+3)/2),... Z (N-1), Z (N) Scores A i : 1 2... (N-1)/2 (N+1)/2 (N- 1)/2... 2 1 The Test Statistics is: Sum of Squared Rank Test: let be the random observations, i= 1,2,...,g and j= 1,2,...,. is the mean and are the standard deviation. Now the required test statistics is L= Where and For large sample size L follows chi square distribution with (g-1) d.f. 4. Aadaptive Procedures The objective of this paper is to develop an adaptive test for scale for testing independent distributions of the continuous Paper ID: ART20171203 144

type. It is already mentioned that an adaptive test uses the data first to select a model and then makes an inference based on the model chosen. The models chosen ranges from a light-tailed one (uniform) to a heavy-tailed one (like the Cauchy). There are many nonparametric tests to deal with comparison tests in two sample cases for scale problems. For example in case of skewed distributions we use the Squared-Ranks test statistics. on the other hand when the distributions are medium-tailed or heavy-tailed such as the double exponential or Cauchy distributions, respectively, then suitable tests would be the Mood and Ansari-Bradley tests, respectively. But if the parent distribution is normal we use F test. Now let us assume we have two samples from the c.d.f.'s F x (x) and F y (x), respectively, in order to test the null hypothesis H 0 : = 1. Then we combine the two samples and order the observations in increasing order of magnitude. To measure the skewness we use the statistics Q 1 = where, and the averages of the largest five percent of the Z's, the middle 50 percent of the Z's, and the smallest five percent of the Z's where the Z's denote the order statistics of the combined sample of all N values. The measure of kurtosis is- Q 2 = This is to determine whether the tail weight of the underlying distribution is light or heavy. For different distributions, Q 1 and Q 2 have been evaluated in Table 2.1. This adaptive test will depend on what values of Q 1 and Q 2 are found. On the scheme we propose, we are assuming that the distributions we encounter are not skewed to the left. With this in mind we propose a classification procedure. First, Q 1 and Q 2 are computed using the combined sample of all N values. If Q 1 < 1.5 and Q 2 < 7, then we will use a medium-tailed model. If Q 1 > 1.5 and Q 2 < 2 we will use a right-skewed model. If Q 1 > 1.5 and 2 < Q 2 < 7 then we will use a very badly right-skewed model. If Q 2 > 7, then we will use a heavier-tailed model. Figure 1.1: Diagram of above division The adaptive test A will be defined as follows..if the data indicate a skewed model, we will use the Squared-Ranks statistic denoted by L N.If the data indicate a medium-tailed model, then we can one of the following tests such as Mood statistic or Klotz Statistics. When the data suggest a very heavy-tailed model, then we propose the Ansari-Bradley statistic, A N. Table 1.1: Values of Q 1 and Q 2 for some selected Distributions Distributions Q 1 Q 2 Uniform Normal Exponential Double Exponential Logistic 5. The Monte Carlo Study 1 1 4.5 1 1 1.9 2.563 2.86 3.3 2.86 For the simulation study, F- test, Mood test, Ansary Bradley test, Sum of Squared Rank test, Klotz test, Seigel Tukey test and adaptive tests are selected. Here six family of distributions are considered and these are the Normal, the Cauchy, the Double exponential, the Logistic, the Lognormal. The study was conducted on computer at the Department of Statistics, Dibrugarh University. To generate the standard normal deviate, the method described in Monte Carlo Method by Hammersly and Handscomb(1964) were used and deviate from the other distributions were generated by using the inverse distribution function on uniform deviates. In studying the significant levels, we first considered distributions with scale parameter equal to one. For each set of sample N = n i, i =1,2 the experiment was repeated i 5,000 times and proportion of rejection of the true null hypothesis was recorded and presented in table 1.2 to 2.11. For the power study of the tests, random deviates were generated as above for each group and added to i. Proportion of rejections based on 5000 replications at the levels.10,.05 and.01 for different combinations of i were recorded and presented in the table 1.2 to table 1.11. Paper ID: ART20171203 145

Table 1.2: Empirical Level and Power of Tests under Normal distribution Table 1.3: Empirical Level and Power of Tests under Normal distribution Paper ID: ART20171203 146

Table 1.2: Empirical Level and Power of Tests under Cauchy Distribution Table 1.3: Empirical Level and Power of Tests under Cauchy Distribution Paper ID: ART20171203 147

Table 1.2: Empirical Level and Power of Tests under Logistic Distribution Table 1.3: Empirical Level and Power of Tests under logistic Distribution Paper ID: ART20171203 148

Table 1.2: Empirical Level and Power of Tests under Double Exopnential Distribution Table 1.3: Empirical Level and Power of Tests under Double Exponential Distribution: Paper ID: ART20171203 149

Table 1.3: Empirical Level and Power of Tests under Lognormal Distribution: Table 1.3: Empirical Level and Power of Tests under Lognormal Distribution: Paper ID: ART20171203 150

Table 1.2: Empirical Level and Power of Tests under Exponential distribution: Table 1.3: Empirical Level and Power of Tests under Exponential distribution: Figure 1.2: Comparison of Power between Different Tests Under Normal Distribution with Sample Sizes (20,20) at 10% Level of Significance Paper ID: ART20171203 151

Figure 1.3: Comparison of power Between Different Tests under Cauchy Distribution with Sample Size (10,10) at 10% level Of Significance Figure 1.4: Comparison of Power between Different Tests under Logistic Distribution with Sample Sizes (10,15) at 10% Level of Significance Figure 1.5: Comparison of Power between Different Tests Under Double Exponential distribution with Sample Sizes (10,10) at 10% Level of Significance Figure 1.6: Comparison of Power between Different Tests Under Lognormal distribution with Sample Sizes (10,10) at 10% Level of Significance Paper ID: ART20171203 152

Figure 1.7: Comparison of Power between Different Tests under Exponential Distribution with Sample Sizes (20,20) at 10% Level of Significance 6. Discussion For comparison purposes we have considered various combinations of sample sizes with equal and unequal sample sizes. We have also considered different sets of ϭ i s for the study. From Tables 1.2 1.13 it is observed that the parametric F-test maintain the nominal level except the Cauchy and skew distribution lognormal. In these cases F- test seems to be conservatives. Table 1.2 and table 1.3 shows the power of tests under normal distribution for equal and unequal sample sizes respectively. We have seen that power of F- test is higher than the other tests in this distribution in presence of various combinations of Scale parameters and sample sizes. So here we may conclude that for Normal Distribution F test is our Adaptive test. Table 1.4 and Table 1.5 gives the power of tests statistics under Cauchy distribution for equal and unequal sample sizes respectively. Here,we observe that Sum of Squared Rank Test is more powerful than other tests both in case of equal and unequal sample sizes at 10%, 5% and 1% level of significance. So if the parent distribution is Cauchy then Sum of Squared Rank test is the Adaptive Test. Table 1.6 and Table 1.7 displays the power of tests under logistic distribution. We have seen that Mood Test and Klotz test are more powerful than other tests with both equal and unequal sample sizes and at 10%, 5% and 1% level. In this case Mood Test and Klotz test is our Adaptive Test. Table 1.8 and Table 1.9 shows the empirical power of tests under double exponential distribution. Here it is seen that Klotz test and F test are more powerful than the other tests both in case of equal and unequal sample sizes and at10%, 5% and 1% level.. here our Adaptive Test is the Klotz Test. Table 1.10 and Table 1.11 shows the power of tests under lognormal distribution. Here,Sum of Squared Rank Test is more powerful than the all other tests both in case of equal and unequal sample sizes at 10%, 5% and 1% level. So in this particular Distribution Sum of Squared Test is our Adaptive test. Table 1.12 and Table 1.13 shows the power of tests under Exponential distribution. Here,Sum of Squared Rank Test is more powerful than the all other tests both in case of equal and unequal sample sizes at 10%, 5% and 1% level. So in this particular Distribution Sum of Squared Test is our Adaptive test. References [1] Ansari, A. R. and Bradley, R. A. (1960). Ranksumtests for dispersion. Ann. Math. Statist. 31., 1174-89. [2] Bandyopadhyay,U. and Dutta,D.(2010), Adaptive nonparametric tests for the two-sample scale problem under symmetry,stat Methods Appl,19:153 170 [3] Basu, A.P. and Woodworth,G.(1967), A Note on Nonparametric Tests for Scale The Annals of Mathematical Statistics, 38, 274-277 [4] Fligner,M.A.( 1979), A Class of Two-Sample Distribution-Free Tests for Scale, Journal of the American Statistical Association, 74, 889-893 [5] Fligner,M.A. and Killeen,T.J.,( 1976) Distribution- Free Two-Sample Tests for Scale Journal of the American Statistical Association, 71, 210-213 [6] Hall, P. and Padmanabhan, A.R.(1997), Adaptive Inference for the Two-Sample Scale Problem, Journal of American Statistical Association and American Society for Quality, 39, 412-422 [7] Hogg, R.V.,Randles,R.H. and Fisher,D.M.(1975), A Two Sample Adaptive Distribution Free Test, Journal of the American Statistical Association, 70,656-661 [8] Klotz. J,(1962), Nonparametric Tests for Scale, The Annals of Mathematical Statistics, 33, 498-512 [9] Moses, L.E.(1963), Rank Tests of Dispersion, The Annals of Mathematical Statistics, 34, 973-983 [10] Puri, M.L.,(1972), Some Aspects of Nonparametric Inference, International Statistical Review / Revue Internationale de Statistique, 40, 99-327 [11] Raghavachari,M(1965), The Two-Sample Scale Problem When Locations are Unknown, The Annals of Mathematical Statistics, 36, 1236-1242 Paper ID: ART20171203 153