Testing the Equality of Two Pareto Distributions

Similar documents
If Y is normally Distributed, then and 2 Y Y 10. σ σ

Suggested Answers To Exercises. estimates variability in a sampling distribution of random means. About 68% of means fall

[Saxena, 2(9): September, 2013] ISSN: Impact Factor: INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

Social Studies 201 Notes for March 18, 2005

Comparing Means: t-tests for Two Independent Samples

A FUNCTIONAL BAYESIAN METHOD FOR THE SOLUTION OF INVERSE PROBLEMS WITH SPATIO-TEMPORAL PARAMETERS AUTHORS: CORRESPONDENCE: ABSTRACT

Lecture 7: Testing Distributions

Alternate Dispersion Measures in Replicated Factorial Experiments

Z a>2 s 1n = X L - m. X L = m + Z a>2 s 1n X L = The decision rule for this one-tail test is

Social Studies 201 Notes for November 14, 2003

Source slideplayer.com/fundamentals of Analytical Chemistry, F.J. Holler, S.R.Crouch. Chapter 6: Random Errors in Chemical Analysis

A Bluffer s Guide to... Sphericity

Lecture 4 Topic 3: General linear models (GLMs), the fundamentals of the analysis of variance (ANOVA), and completely randomized designs (CRDs)

ON THE APPROXIMATION ERROR IN HIGH DIMENSIONAL MODEL REPRESENTATION. Xiaoqun Wang

1. The F-test for Equality of Two Variances

Jan Purczyński, Kamila Bednarz-Okrzyńska Estimation of the shape parameter of GED distribution for a small sample size

MINITAB Stat Lab 3

Clustering Methods without Given Number of Clusters

μ + = σ = D 4 σ = D 3 σ = σ = All units in parts (a) and (b) are in V. (1) x chart: Center = μ = 0.75 UCL =

Stratified Analysis of Probabilities of Causation

Asymptotics of ABC. Paul Fearnhead 1, Correspondence: Abstract

Beta Burr XII OR Five Parameter Beta Lomax Distribution: Remarks and Characterizations

Green-Kubo formulas with symmetrized correlation functions for quantum systems in steady states: the shear viscosity of a fluid in a steady shear flow

By Xiaoquan Wen and Matthew Stephens University of Michigan and University of Chicago

CHAPTER 6. Estimation

Inferences Based on Two Samples: Confidence Intervals and Tests of Hypothesis Chapter 7

Lecture 21. The Lovasz splitting-off lemma Topics in Combinatorial Optimization April 29th, 2004

Optimal Coordination of Samples in Business Surveys

Assessing the Discriminatory Power of Credit Scores under Censoring

Asymptotic Values and Expansions for the Correlation Between Different Measures of Spread. Anirban DasGupta. Purdue University, West Lafayette, IN

Bio 112 Lecture Notes; Scientific Method

On the Robustness of the Characteristics Related to (M\M\1) ( \FCFS) Queue System Model

NEGATIVE z Scores. TABLE A-2 Standard Normal (z) Distribution: Cumulative Area from the LEFT. (continued)

Correction of Overlapping Template Matching Test Included in NIST Randomness Test Suite

SMALL-SIGNAL STABILITY ASSESSMENT OF THE EUROPEAN POWER SYSTEM BASED ON ADVANCED NEURAL NETWORK METHOD

( ) ( Statistical Equivalence Testing

Confidence Intervals and Hypothesis Testing of a Population Mean (Variance Known)

SIMPLE LINEAR REGRESSION

NON-GAUSSIAN ERROR DISTRIBUTIONS OF LMC DISTANCE MODULI MEASUREMENTS

arxiv: v1 [math.mg] 25 Aug 2011

Network based Sensor Localization in Multi-Media Application of Precision Agriculture Part 2: Time of Arrival

Combining allele frequency uncertainty and population substructure corrections in forensic DNA calculations

On the Isomorphism of Fractional Factorial Designs 1

PARAMETERS OF DISPERSION FOR ON-TIME PERFORMANCE OF POSTAL ITEMS WITHIN TRANSIT TIMES MEASUREMENT SYSTEM FOR POSTAL SERVICES

THE HAUSDORFF MEASURE OF SIERPINSKI CARPETS BASING ON REGULAR PENTAGON

Estimation of Peaked Densities Over the Interval [0,1] Using Two-Sided Power Distribution: Application to Lottery Experiments

Suggestions - Problem Set (a) Show the discriminant condition (1) takes the form. ln ln, # # R R

Research Article Goodness-of-Fit Based Secure Cooperative Spectrum Sensing for Cognitive Radio Network

STUDY OF THE INFLUENCE OF CONVECTIVE EFFECTS IN INCIDENT RADIATIVE HEAT FLUX DENSITY MEASUREMENT UNCERTAINTY

Estimation of Current Population Variance in Two Successive Occasions

RELIABILITY OF REPAIRABLE k out of n: F SYSTEM HAVING DISCRETE REPAIR AND FAILURE TIMES DISTRIBUTIONS

Tests of Statistical Hypotheses with Respect to a Fuzzy Set

Evolutionary Algorithms Based Fixed Order Robust Controller Design and Robustness Performance Analysis

Research Article Reliability of Foundation Pile Based on Settlement and a Parameter Sensitivity Analysis

Chapter 12 Simple Linear Regression

Unified Design Method for Flexure and Debonding in FRP Retrofitted RC Beams

Acceptance sampling uses sampling procedure to determine whether to

CHAPTER 4 DESIGN OF STATE FEEDBACK CONTROLLERS AND STATE OBSERVERS USING REDUCED ORDER MODEL

Testing for a unit root in noncausal autoregressive models

An Unbiased Estimator For Population Mean Using An Attribute And An Auxiliary Variable

The Use of MDL to Select among Computational Models of Cognition

Finding the location of switched capacitor banks in distribution systems based on wavelet transform

White Rose Research Online URL for this paper: Version: Accepted Version

Factor Analysis with Poisson Output

New bounds for Morse clusters

Adaptive Extended Kalman Filter for Ballistic Missile Tracking

ASSESSING EXPECTED ACCURACY OF PROBE VEHICLE TRAVEL TIME REPORTS

Codes Correcting Two Deletions

CHAPTER 8 OBSERVER BASED REDUCED ORDER CONTROLLER DESIGN FOR LARGE SCALE LINEAR DISCRETE-TIME CONTROL SYSTEMS

A Constraint Propagation Algorithm for Determining the Stability Margin. The paper addresses the stability margin assessment for linear systems

(3) A bilinear map B : S(R n ) S(R m ) B is continuous (for the product topology) if and only if there exist C, N and M such that

ARTICLE Overcoming the Winner s Curse: Estimating Penetrance Parameters from Case-Control Data

Confusion matrices. True / False positives / negatives. INF 4300 Classification III Anne Solberg The agenda today: E.g., testing for cancer

EC381/MN308 Probability and Some Statistics. Lecture 7 - Outline. Chapter Cumulative Distribution Function (CDF) Continuous Random Variables

Soil water electrical conductivity determination based on the salinity index concept

Determination of the local contrast of interference fringe patterns using continuous wavelet transform

Unified Correlation between SPT-N and Shear Wave Velocity for all Soil Types

Chapter 4. The Laplace Transform Method

Efficient Methods of Doppler Processing for Coexisting Land and Weather Clutter

Microblog Hot Spot Mining Based on PAM Probabilistic Topic Model

Performance Analysis of Wald s SPRT with Independent but Non-Stationary Log-Likelihood Ratios

STRAIN LIMITS FOR PLASTIC HINGE REGIONS OF CONCRETE REINFORCED COLUMNS

Finite Element Analysis of a Fiber Bragg Grating Accelerometer for Performance Optimization

The continuous time random walk (CTRW) was introduced by Montroll and Weiss 1.

Automatic Control Systems. Part III: Root Locus Technique

Matching Feature Distributions for Robust Speaker Verification

Standard Guide for Conducting Ruggedness Tests 1

A Note on the Sum of Correlated Gamma Random Variables

A NEW LOAD MODEL OF THE PEDESTRIANS LATERAL ACTION

Introduction The CLEO detector and Y(5S) data sample Analysis techniques: Exclusive approach Inclusive approach Summary

Detection and Estimation Theory

Stochastic Optimization with Inequality Constraints Using Simultaneous Perturbations and Penalty Functions

Efficient Global Optimization Applied to Multi-Objective Design Optimization of Lift Creating Cylinder Using Plasma Actuators

Performance Measures for BSkSP-3 with BMChSP-1 as a reference plan

An Image-encoded Mach-Zehnder Joint Transform Correlator for Polychromatic Pattern Recognition with Multi-level Quantized Reference Functions

Theoretical Computer Science. Optimal algorithms for online scheduling with bounded rearrangement at the end

SOURCE CHARACTERIZATION OF INLAND CRUSTAL EARTHQUAKES FOR NEAR-SOURCE GROUND MOTIONS

Gain and Phase Margins Based Delay Dependent Stability Analysis of Two- Area LFC System with Communication Delays

CHEAP CONTROL PERFORMANCE LIMITATIONS OF INPUT CONSTRAINED LINEAR SYSTEMS

Transitional behaviors in well-graded coarse granular soils. Associate professor, State Key Laboratory of Coal Mine Disaster Dynamics and Control,

Transcription:

Proceeding of the World Congre on Engineering 07 Vol II WCE 07, July 5-7, 07, London, U.K. Teting the Equality of Two Pareto Ditribution Huam A. Bayoud, Member, IAENG Abtract Thi paper propoe an overlapping-baed tet for the equality of two Pareto ditribution with different cale and hape parameter. The propoed tet tatitic i defined a the maximum likelihood etimate of Weitzman overlapping coefficient, which etimate the agreement of two denitie. Simulated critical point are provided for the propoed tet for variou ample ize and ignificance level. Statitical power of the propoed tet are computed via imulation tudie. Type- I error robutne of the propoed tet i tudied alo via imulation tudie when the underlying ditribution are not Pareto. Index Term MLE Monte Carlo Simulation, Overlapping coefficient, Simulated Powe T I. INTRODUCTION eting the imilarity of two population i needed in variou practical field including pharmaceutical cience, laboratory management, engineering cience, food technology, management cience, quality control and many other field. Such tet are needed, for example, to ae the acceptability of a newly developed proce\deign\approach to a gold tandard one. In thi paper, we propoe a tet for the equality of two Pareto ditribution of the firt kind. Pareto ditribution i important in practice becaue of it adequacy in modeling variou ditribution, for example, in the commercial ector, including city population ditribution, tock price fluctuation, and oil field location. In addition, thi ditribution i highly applicable in the Air Force ector for modeling, for example, the failure time of equipment component, ee Davi and Michael (979), maintenance ervice time, ee Harri (968), nuclear fallout particle' ditribution, ee Freiling (966), and error cluter in communication circuit, ee Berger and Mandelbrot (963). Manucript received Jan 6, 07; revied Feb 7, 07. Thi work wa financially upported by Fahad Bin Sultan Univerity, Tabuk, Saudi Arabia. Huam Awni Bayoud, College of Science and Humanitie, Fahad Bin Sultan Univerity, Tabuk, Saudi Arabia, P.O. Box 5700 Tabuk 7454. Phone: 00966 5 8300008; fax: 00966 4 4 76 99; e-mail: huam.awni@yahoo.com. A decribed by Johnon and Kotz (970), the probability denity function (pdf) and the cumulative ditribution function (cdf) of Pareto ditribution with hape and cale parameter and, repectively, are: f ( x, ) ; x x 0, 0, () and F ( x, ) ; x 0, 0, () x Throughout thi article, we denote Pareto ditribution by Pareto,. Conider for teting the imilarity of two Pareto population: H0 : F( x, ) F( x, ) v H : F( x, ) F( x, ), (3) which i equivalent to tet: H 0 : and v. H : or. (4) In general, teting the imilarity of two ditribution can be tatitically linked to one-ided teting problem. For intance, teting the hypothei: H o : F G at a ignificance level can be done by teting the hypothei: H o : SF, G ( ) c, where S F, G i a uitable meaure of imilarity (diimilarity) of F and G, and c i the cut-off point of the meaure S at a ignificance level. In thi paper we propoe an overlapping (OVL) coefficientbaed tet for the imilarity of two Pareto ditribution. The propoed tet i defined a the Maximum Likelihood Etimate (MLE) of the Weitzman OVL coefficient; thi coefficient wa firt propoed by Weitzman (970) to tudy the agreement of two income ditribution. Thi OVL coefficient ha received the attention of different author in the literature for it importance in meauring the agreement of two tatitical ditribution. Helu and Samawi (0) derived three Weitzman' coefficient (970), Matuita Coefficient (955) and Moriita Coefficient (959), for two Pareto ditribution of kind II, and propoed point and confidence interval etimate for thee coefficient baed on Simple Random and Ranked Set Sample, eparately. Inman and Bradley (989) ued the OVL coefficient to meaure the agreement between two normal ditribution auming equal variance. Mulekar and Mihra (994) tudied variou OVL coefficient for two normal population with the ame mean. Reier and Faraggi (999) derived the confidence interval of the OVL coefficient of two normal ditribution auming equal variance in term of non central t and non central F ditribution eparately. Chaubey et. al. (008) tudied the OVL coefficient for two invere Gauian population. Al- Saleh and Samawi (007) tudied OVL coefficient for two exponential population. Recently, OVL-baed tet have been propoed for teting the equality of two exponential ditribution and two normal ditribution eparately by Bayoud (05) and Bayoud and Kittaneh (06), ISBN: 978-988-4048-3- ISSN: 078-0958 (Print); ISSN: 078-0966 (Online) WCE 07

Proceeding of the World Congre on Engineering 07 Vol II WCE 07, July 5-7, 07, London, U.K. repectively. Thi paper propoe the ame technique for teting the equality of two Pareto ditribution. The ret of thi paper i organized a follow: In ection, the propoed tet tatitic i derived in analytical form. Simulated critical point for the propoed tet tatitic are provided in Section 3 for variou ample ize and level of ignificance. An approximation to the null ditribution of the propoed tet tatitic i propoed in Section 4. In ection 5, tatitical power of the propoed tet i tudied under variou cenario. Variou real dataet are analyzed in Section 6. The paper i concluded in Section 7. II. PROPOSED TEST STATISTIC The propoed tet tatitic i defined a the etimated OVL coefficient. The OVL coefficient of f ( x, ) and f x, ) i defined a: ( min( f ( x, ), f ( x, )) dx (5) x The coefficient range from zero to one. A value cloe to zero indicate no common area between the two denity function. A value cloe to one indicate that the two denitie are identical. It can be eaily proved that two Pareto denitie f x, and f x, with different parameter either interect at only one point in the upport,, where Max,, or they do not interect at any point in,. Note that, if and, then the two denitie are clearly identical, and then. If and, then the two denitie do not interect. If then, regardle the value of and, the two denitie interect only at x 0 given by: x 0, (6) provided that thi value belong to,. Algebraically, the value of x 0 in Eq(8) might be outide the interval, at ome value of,, and, For example, when.,, 5 and, the value of x 0 0. 603, which doe not belong to.,, thi mean that the two denitie do not interect in their upport; and hence, there i no interection point. In uch a cae, we conclude that one of the denitie lie above the other at all value in,. Therefore, two different Pareto denitie either interect at x only one point 0 in the interval,, or they do not interect at all value x,. If the pdf f x, and f x, interect at x0,, then in Eq (7) can be repreented a: -, if f( ) f ( ) x0 x0, (7) -, if f( ) f ( ) x0 x0 where fi ( x) f x i, i defined in Eq (), Fi ( x) Fx i, i defined in Eq (), for i,, and x 0 i the interection point defined in Eq(8). On the other hand, if f x and x, f, do not interect in,, then one of the denitie i greater than the other at all value in,, then in Eq (7) can be implified to: f ( x, ) dx, if f( ) f ( ) f ( x, ) dx, if f( ) f ( ), if, if f ( ) f f ( ) f ( ) ( ). (8) Clearly, Eq(0) i the limit of Eq(9) when x 0 approache infinity. However, the propoed tet tatitic for teting the null hypothei in Eq (3) i defined a the MLE of the overlapping coefficient in Eq (7) which can be implified either to Eq (9) or Eq (0) depending on the interection point of f x, and f x,. n, Let, x,..., x ~ Pareto x and, x,..., xn ~ Pareto, x be two random ample, not necearily independent, from Pareto ditribution. The MLE of the cale and hape parameter baed on the random ample above are: ˆ x() minx, x,..., xn, ˆ x() minx, x,..., xn n n, ˆ and ˆ n. n ln x i ln x ln x i ln x i () By the invariant property of the MLE, the propoed tet tatitic, denoted by ˆ, can be obtained by replacing the parameter,,,,, f, f and x 0 in Eq (9) or Eq (0) by their MLE. The null hypothei H o in Eq (4) i rejected at ignificance level if ˆ ˆ, n, n, where Pˆ ˆ, n, H0 n ; i.e. ˆ th, n, n i the percentile of the ditribution of ˆ at n and n under the aumption that H o i true. Since, deriving the null ditribution of ˆ in analytic form i complicated. Therefore, percentile of ˆ ( ˆ, n, n ) are imulated in Section 3 at different ample ize and i () ISBN: 978-988-4048-3- ISSN: 078-0958 (Print); ISSN: 078-0966 (Online) WCE 07

Proceeding of the World Congre on Engineering 07 Vol II WCE 07, July 5-7, 07, London, U.K. ignificance level. Moreover, the null ditribution of the propoed tet tatitic i approximated by Beta ditribution in Section 4. III. SIMULATED CRITICAL POINTS OF ˆ Since the null ditribution of the propoed tet tatitic could not be obtained in cloed form. Therefore, imulated percentile for thi ditribution are generated at different ignificant level, and for different ample ize auming, for implicity, n n. Table how the imulated t,.5th, 5th, and 0th percentile of the null ditribution of the propoed tet tatitic, thee percentile give the critical point of the propoed tet tatitic. Thee percentile have been computed baed on 0,000 Monte Carlo Simulation generated at variou ignificance level under the aumption of equal ample ize n n n by uing the following algorithm: Algorithm I: i) Arbitrary, chooe and, baed on the aumption that H in Eq (3) i true. ii) Generate, x,..., ~ Pareto x,..., xn ~ Pareto, o x n, x,. x and iii) Find the MLE ˆ ˆ,, ˆ ˆ, ˆ Max ˆ ˆ, from the ample obtained in tep. Hence, ˆ and ˆ are not necearily equal, and ˆ and ˆ a well. iv) Compute the MLE of x 0 defined in Eq (8) by replacing the parameter by their MLE thoe have been computed in tep (3). v) If x 0 belong to, then the propoed tet tatitic and ˆ i computed uing Eq (9). Otherwie, the two denitie do not interect in,, and then the propoed tet tatitic ˆ i computed uing Eq (0). vi) Repeat tep (-5) m 0, 000 iteration, to get ˆ, ˆ,..., ˆ m. vii) t,.5th, 5th, and 0th percentile are computed for the et obtained in tep (6) which are the imulated critical point of the propoed tet tatitic at ignificance level of 0. 0, 0. 05, 0. 05 and 0., repectively. IV. APPROXIMATED NULL DISTRIBUTION of ˆ In hi ection, we propoe an approximation for the null ditribution of the propoed tet. Under the aumption that the null hypothei i true, it wa oberved from the imulation tudie in the previou ection that the ditribution of the tet tatitic ˆ i kewed to the left with value between zero and, far away from zero and cloe to. Eventually, Table how that 00 th percentile are much cloer to than zero for fixed n. Moreover, the percentile become much cloer to when n increae. Conequently, one can approximate the null ditribution of ˆ by beta ditribution with parameter p and q auming p q 0, to inure kewed to left ditribution, with pdf: ˆ p q ˆ p q f ˆ provided that p q 0 and p q 0 ˆ. TABLE I SIMULATED 00 th PERCENTILE ˆ,n, n OF THE NULL DISTRIBUTION of ˆ n 0. 0 0. 05 0. 0 0.464 0.58 0.6383 0.5008 0.608 0.6590 0.5386 0.639 0.6808 3 0.565 0.649 0.6960 4 0.589 0.6693 0.770 5 0.59 0.6850 0.739 6 0.66 0.6988 0.734 7 0.696 0.7038 0.7483 8 0.643 0.73 0.7536 9 0.655 0.74 0.769 0 0.6594 0.7330 0.7734 0.6673 0.7443 0.7789 0.6749 0.7497 0.7854 3 0.685 0.7580 0.7895 4 0.697 0.7658 0.797 5 0.7056 0.769 0.799 6 0.76 0.773 0.8034 7 0.795 0.7775 0.8097 8 0.785 0.780 0.848 9 0.799 0.786 0.869 30 0.7357 0.7890 0.8 3 0.746 0.7945 0.85 3 0.746 0.798 0.878 33 0.7484 0.80 0.833 34 0.750 0.8056 0.8355 35 0.7505 0.804 0.8375 36 0.7588 0.89 0.8393 37 0.7630 0.834 0.847 38 0.7653 0.868 0.8466 39 0.7673 0.805 0.8479 40 0.776 0.855 0.8489 50 0.80 0.8456 0.8679 00 0.8600 0.893 0.9079 00 0.9036 0.96 0.9363 Clearly, the parameter p and q are function in n and n. A imulation tudy i performed by uing Algorithm II to tet whether beta ditribution adequately fit the null ditribution of the tet tatitic ˆ. For implicity, we aume n n n. Algorithm II: i) For a fixed n, Step (i-v) in Algorithm I are performed with m 5000 iteration to get ˆ ˆ,..., ˆ, m, a et of 5000 ˆ value. ii) The parameter p and q of beta ditribution are etimated by the method of moment etimation from the et ˆ ˆ,..., ˆ, m obtained in Step (i). iii) The frequency hitogram for the imulated et ˆ ˆ,..., ˆ, m i plotted along with the fitted beta pdf uing the etimated p and q in Step (ii). Table how the frequency hitogram along with the fitted beta pdf obtained from Algorithm II for variou value of n. It i evident from Table that beta ditribution adequately fit the null ditribution of ˆ. Obviouly, there i a poitive linear relationhip between n and p, and a ISBN: 978-988-4048-3- ISSN: 078-0958 (Print); ISSN: 078-0966 (Online) WCE 07

Proceeding of the World Congre on Engineering 07 Vol II WCE 07, July 5-7, 07, London, U.K. negative linear relationhip between n and q. Hence, one can approximate the parameter p and q by fitting uitable function for p and q in term of n. For illutrative purpoe, we tudy the relationhip between n, p and q by analyzing a ample of the form n p, q;, n 0(5)00 that i generated by uing Algorithm II for n 0(5) 00. For thi ample, the etimated linear correlation coefficient between n and p and between n and q are 0.98 and - 0.9, repectively. Thi how that linear model could adequately fit p and q in term of n a follow: p ( n) 9. 869 0. 348n and q(n). 5660. 0030067n Therefore, the null ditribution of the tet tatitic ˆ can be approximated by a beta ditribution with parameter p ( n) 9. 869 0. 348n and q(n). 5660. 0030067n, where n i the ample ize, i.e. Beta9. 869 0. 348n,. 566 0. 0030067n. For intance, at n n n 30, the parameter p and q are etimated by 9. 4507 and. 664, repectively. Hence, the null ditribution of ˆ i approximately 9.4507,.664 TABLE II FITTED BETA TO THE NULL DISTRIBUTION OF ˆ n p q Beta. Beta, Fitted Beta Curve 0 Beta 8.4,.3 0 Beta 5.53,.4 30 Beta 0.48,. 50 Beta 7.74,. 0 Beta 39.6,.96 0 Note that the 5th and 0th percentile of Beta 9.4507,.664 are 0.7785 and 0.833 which are very cloe to the imulated 5th and 0th percentile ˆ 0.7890 and ˆ 0. 8, repectively. 0.05,30,30 0.0,30,30 It i worth mentioning that nonlinear model might alo adequately fit p and q in term of n. However, more tudie are needed in thi direction to tudy other model when n n. V. SIMULATED POWERS Statitical power of a tet i defined a the probability of rejecting the null hypothei H 0 when the null hypothei i fale. Unfortunately, power function of the propoed tet cannot be obtained in explicit form, then imulated tatitical power i computed under different alternative, keeping Type-I error of each tet at the ame level. However, Table 3 how the imulated power of the propoed tet baed on 5000 Monte Carlo imulation for two ample ize n 0 and 30 and variou alternative auming the ignificance level 0. 05, and the Pareto parameter and. TABLE III SIMULATED POWERS OF ˆ, ASSUMING, AND 0.05 LEVEL OF SIGNIFICANCE 0.5.00.5.5.0 n n 0 0.5 0.960 0.7846 0.975 0.9874.0000.0 0.9798 0.68 0.664 0.9583 0.9998.5 0.9980 0.087 0.5443 0.9573 0.9998.0 0.9998 0.0507 0.5358 0.97 0.9999.5.0000 0.07 0.5863 0.984.0000 3.0.0000 0.06 0.6684 0.9889 0.9999 n n 0 0.5.0000 0.9866 0.9999.0000.0000.0.0000 0.57 0.9759 0.9999.0000.5.0000 0.85 0.968.0000.0000.0.0000 0.048 0.976.0000.0000.5.0000 0.099 0.9868.0000.0000 3.0.0000 0.60 0.994.0000.0000 n n 30 0.5.0000 0.9995.0000.0000.0000.0.0000 0.76 0.9999.0000.0000.5.0000 0.755 0.9988.0000.0000.0.0000 0.0460 0.9993.0000.0000.5.0000 0.4.0000.0000.0000 3.0.0000 0.304.0000.0000.0000 It i obviou from Table 3 that the tatitical power of the propoed are atifactory. Clearly, the tatitical power of thi tet increae a the ample ize increae. Furthermore, it appear from Table 3 that thi tet efficiently control the ignificance level, i.e. tatitical power achieve the correct ignificance level in imulation under the aumption that the null hypothei i true. However, ince the propoed tet i derived under the aumption that the ditribution are Pareto, it i ignificant to tudy the Type-I error robutne of the propoed tet for teting the imilarity of non-pareto ditribution. Table 4 ummarize the ignificance (nominal) level of the propoed tet baed on 0,000 imulated pair of ample ISBN: 978-988-4048-3- ISSN: 078-0958 (Print); ISSN: 078-0966 (Online) WCE 07

Proceeding of the World Congre on Engineering 07 Vol II WCE 07, July 5-7, 07, London, U.K. when the null hypothei H0 i true for the elected population. Pareto, Normal, Gamma, Chi-quare, and Exponential ditribution are conidered. Variou aumption of the ample ize and the ignificance level are conidered. It i worth mentioning that Pareto ditribution i conidered again for the ake of teting the nominal level of the tet at variou Type-I error and ample ize. The robutne of a tet could be aeed by the cloene of the imulated nominal level to the correct Type-I erro It i evident from Table 4 that the propoed tet i highly enitive for violation of the Pareto aumption; which mean that the tet i not robut againt departure from the Pareto aumption. TABLE IV NOMINAL LEVELS FOR ˆ WHEN H0 IS TRUE FOR THE GIVEN POPULATION Significance Level Population 0.0 0.05 0.0 n n 0 Pareto (9,) 0.00 0.0533 0.09 N (5,) 0.37 0.464 0.587 Gamma (,3) 0.49 0.493 0.5390 0 0.884 0.395 0.4896 Exp (9) 0.759 0.464 0.567 n n 0 Pareto (9,) 0.0085 0.0485 0.059 N (5,) 0.434 0.574 0.6478 Gamma (,3) 0.4335 0.584 0.6507 0 0.3848 0.54 0.669 Exp (9) 0.4754 0.6038 0.684 n n 30 Pareto (9,) 0.0 0.054 0.007 N (5,) 0.580 0.6353 0.7090 Gamma (,3) 0.58 0.6430 0.6937 0 0.4689 0.5897 0.6677 Exp (9) 0.5546 0.6655 0.733 VI. REAL APPLICATION In thi ection, a real data obtained from Watthanacheewakul and Suwattee (00) i analyzed, thi data ummarize the major rice crop (in Kilogram) in the crop year 00/00 (April t, 00 to March 3t, 00) from two Tambol, Nongyang and Nongjom, of Amphoe Sanai in the Chiang Mai province, Thailand. Sample (hown in Table 5) of ize 8 and 30 were drawn from Nongyang and Nongjom Tambol, repectively. Firt, it wa checked whether Pareto ditribution can be ued or not to analyze thee data et. For Nongyang' data, the MLE of and are. 75765 and 3000, repectively. The Kolmogorov-Smirnov (KS) ditance between the empirical ditribution function and the fitted ditribution function ha been ued to check the goodne of fit. The KS tatitic value i 0.07, and the KS critical value i 0.50 at at n 8 and 0. 05. Accordingly, one cannot reject the hypothei that the data are coming from T-L ditribution. For Nongjom' data, the MLE of and are 3. 0544 and 3600, repectively. The KS tatitic value i 0.33, and the KS critical value i 0.76 at at n 30 and 0. 05. Accordingly, one cannot reject the hypothei that the data are coming from T-L ditribution. TABLE V THE MAJOR RICE CROP IN KILOGRAMS FROM THE TWO TAMBOLS FOR THE CROP YEAR 00/00 (APRIL ST, 00 TO MARCH 3ST, 00) Nongyang Nongjom 3440 3600 300 5000 5400 7500 3800 7800 4300 3600 7000 4000 3700 4000 6000 4800 350 4900 3500 400 3000 4500 3400 400 5000 6800 3600 4000 8000 7000 350 4500 8500 8300 4500 3800 450 4000 3500 5800 3000 370 3600 6000 5000 5400 3000 3600 4000 4500 3600 40 470 3600 5800 4800 400 43 Now, we need to tet the null hypothei that the rice crop of Nongyang and Nongjom are the ame by uing the propoed tet. The propoed tet tatitic i equal to 0.6048 which i le than the critical value ˆ 0.7857, thi mean that the hypothei that the 0.05,8,30 two crop are the ame i rejected at ignificance level of 0.05, which agree with what wa concluded by Watthanacheewakul and Suwattee (00). VII. CONCLUSIONS In thi paper, a non claical tet ha been propoed to tet the imilarity of two Pareto population. The propoed tet tatitic i defined a the MLE of Weitzman OVL coefficient, which etimate the agreement of two denitie. Thi tet tatitic ha been derived in cloed form. Since deriving the exact null ditribution of the propoed tet tatitic i not imple, imulated percentile have been provided at variou equal ample ize and ignificance level. However, the null ditribution of the propoed tet tatitic wa approximated by beta ditribution, but till further tudie are needed in thi direction. Simulated power of the propoed tet have been computed. It ha been concluded that the propoed tet almot ha atifactory power and it efficiently control the ignificance level. Furthermore, we tudied Type-I error robutne of the propoed tet when the underlying ditribution are non- ISBN: 978-988-4048-3- ISSN: 078-0958 (Print); ISSN: 078-0966 (Online) WCE 07

Proceeding of the World Congre on Engineering 07 Vol II WCE 07, July 5-7, 07, London, U.K. Pareto. It ha been noticed that thi tet the propoed tet i highly enitive for violation of the Pareto aumption; which mean that the tet i not robut againt departure from the Pareto aumption. REFERENCES [] H. T. Davi, and M. L. Feldtein, "The Generalized Pareto Law a a Model for Progreively Cenored Survival Data," Biometrika, vol. 66, pp.99-306, 979. [] C. M. Harri, "The Pareto Ditribution a a Queue Service Dicipline," Operation Reearch, vol. ii, pp. 307-33, January- February 968. [3] E. C. Freiling, A Comparion of the Fallout Ma-Size Ditribution Calculated by Lognormal and Power-Law Model, San Francico: U.S. Naval Radiological Defene Laboratory, 966 (AD-646 09). [4] J. M. Berger and B. Mandelbrot, "A New Model for Error Clutering in Telephone Circuit," IBM Journal of Reearch and Development, vol. 7, pp. 4-36, July 963. [5] N. L. Johnon and S. Kotz, Continuou Uniwriute Ditri- hution-i, Houghton Mifflin Co. 970. [6] M.S. Weitzman, Meaure of overlap of income ditribution of white and Negro familie in the United State, Technical paper No., Department of Commerce, Bureau of Cenu, Wahington. U.S. (970). [7] A. Helu and H. Samawi, On Inference of Overlapping Coefficient in Two Lomax Population Uing Different Sampling Method, Journal of Statitical Theory and Practice, vol. 5(4), pp. 683-696, 0. [8] K. Matuita, "Deciion rule baed on the ditance for problem of fir, two ample, and Etimation," Ann. Math. Statit. vol. 6, pp. 63 640, 955. [9] M. Moriita, "Meauring interpecific aociation and imilarity between communitie. Memoir of the faculty of Kyuhu Univerity," Serie E. Biology, vol. 3, pp. 36 80, 959. [0] H.F. Inman and E.L. Bradley, The overlapping coefficient a a meaure of agreement between probability ditribution and point etimation of the overlap of two normal denitie, Commun Stat Theory Method, vol. 8(0), pp. 385 3874, 008. [] M. Mulekar and S. N. Mihra, Overlap coefficient of two normal denitie: equal mean cae, J. Japan. Statit. Soc., vol. 4, pp. 69-80, 994. [] B. Reier and D. Faraggi, Confidence Interval for the overlapping coefficient: the normal equal variance cae, The Statitician, vol. 48 (3), pp. 43-48, 999. [3] Y. P. Chaubey, D. Sen, and S. N. Mihra, Inference on Overlap for Two Invere Gauian Population: Equal Mean Cae, Communication in Statitic-Theory and Method, vol. 37,, 880-894, 008. [4] M. F. AL-Saleh and H. M. Samawi, "Inference on overlapping coefficient in two exponential population," J. of Modern Statit. Sci., vol. 6 (), pp. 503 56, 007. [5] H. A. Bayoud and O. A. Kittaneh, "Teting the equality of Two exponential Ditribution," Communication in Statitic-Simulation and Computation, vol. 45(7), pp. 49-56, 06. [6] H. A. Bayoud, "Teting the Similarity of Two Normal Population with Application to the Bioequivalence Problem," Journal of Applied Statitic, vol. 43(7), pp. 3-334, 06. [7] L. Watthanacheewakul and S. Suwattee, "Comparion of Several Pareto Population Mean," Proceeding of International MultiConference of Engineer and Computer Scientit, vol III, 00. ISBN: 978-988-4048-3- ISSN: 078-0958 (Print); ISSN: 078-0966 (Online) WCE 07