Non-Parametric Weighted Tests for Change in Distribution Function

Similar documents
Estimate by the L 2 Norm of a Parameter Poisson Intensity Discontinuous

Comparing Different Estimators of Reliability Function for Proposed Probability Distribution

Variations. ECE 6540, Lecture 02 Multivariate Random Variables & Linear Algebra

Worksheets for GCSE Mathematics. Quadratics. mr-mathematics.com Maths Resources for Teachers. Algebra

Lecture 3. STAT161/261 Introduction to Pattern Recognition and Machine Learning Spring 2018 Prof. Allie Fletcher

Extreme value statistics: from one dimension to many. Lecture 1: one dimension Lecture 2: many dimensions

ECE 6540, Lecture 06 Sufficient Statistics & Complete Statistics Variations

Review for Exam Hyunse Yoon, Ph.D. Assistant Research Scientist IIHR-Hydroscience & Engineering University of Iowa

A Step Towards the Cognitive Radar: Target Detection under Nonstationary Clutter

2.4 Error Analysis for Iterative Methods

General Strong Polarization

Chapter 5: Spectral Domain From: The Handbook of Spatial Statistics. Dr. Montserrat Fuentes and Dr. Brian Reich Prepared by: Amanda Bell

General Strong Polarization

Terms of Use. Copyright Embark on the Journey

Time Domain Analysis of Linear Systems Ch2. University of Central Oklahoma Dr. Mohamed Bingabr

Math 171 Spring 2017 Final Exam. Problem Worth

10.4 Controller synthesis using discrete-time model Example: comparison of various controllers

Integrating Rational functions by the Method of Partial fraction Decomposition. Antony L. Foster

Conharmonically Flat Vaisman-Gray Manifold

Mathematics Paper 2 Grade 12 Preliminary Examination 2017

Work, Energy, and Power. Chapter 6 of Essential University Physics, Richard Wolfson, 3 rd Edition

1. The graph of a function f is given above. Answer the question: a. Find the value(s) of x where f is not differentiable. Ans: x = 4, x = 3, x = 2,

Rotational Motion. Chapter 10 of Essential University Physics, Richard Wolfson, 3 rd Edition

Multiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Continuous Random Variables

International Journal of Scientific & Engineering Research, Volume 7, Issue 8, August ISSN MM Double Exponential Distribution

On The Cauchy Problem For Some Parabolic Fractional Partial Differential Equations With Time Delays

Property Testing and Affine Invariance Part I Madhu Sudan Harvard University

CHAPTER 5 Wave Properties of Matter and Quantum Mechanics I

Solving Fuzzy Nonlinear Equations by a General Iterative Method

(1) Correspondence of the density matrix to traditional method

APPROXIMATING A COMMON SOLUTION OF A FINITE FAMILY OF GENERALIZED EQUILIBRIUM AND FIXED POINT PROBLEMS

10.4 The Cross Product

CS Lecture 8 & 9. Lagrange Multipliers & Varitional Bounds

Stochastic Modeling of a Computer System with Software Redundancy

Review for Exam Hyunse Yoon, Ph.D. Adjunct Assistant Professor Department of Mechanical Engineering, University of Iowa

Wave Motion. Chapter 14 of Essential University Physics, Richard Wolfson, 3 rd Edition

SECTION 7: FAULT ANALYSIS. ESE 470 Energy Distribution Systems

Worksheets for GCSE Mathematics. Algebraic Expressions. Mr Black 's Maths Resources for Teachers GCSE 1-9. Algebra

Module 7 (Lecture 25) RETAINING WALLS

MONTE CARLO ANALYSIS OF CHANGE POINT ESTIMATORS

3.2 A2 - Just Like Derivatives but Backwards

M.5 Modeling the Effect of Functional Responses

Independent Component Analysis and FastICA. Copyright Changwei Xiong June last update: July 7, 2016

Module 4 (Lecture 16) SHALLOW FOUNDATIONS: ALLOWABLE BEARING CAPACITY AND SETTLEMENT

Approximate Second Order Algorithms. Seo Taek Kong, Nithin Tangellamudi, Zhikai Guo

Gradient expansion formalism for generic spin torques

Chapter 22 : Electric potential

A linear operator of a new class of multivalent harmonic functions

(2) Orbital angular momentum

Estimation of cumulative distribution function with spline functions

Module 7 (Lecture 27) RETAINING WALLS

Quantum Mechanics. An essential theory to understand properties of matter and light. Chemical Electronic Magnetic Thermal Optical Etc.

By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Haar Basis Wavelets and Morlet Wavelets

BHASVIC MαTHS. Skills 1

Mathematics Ext 2. HSC 2014 Solutions. Suite 403, 410 Elizabeth St, Surry Hills NSW 2010 keystoneeducation.com.

International Journal of Mathematical Archive-5(3), 2014, Available online through ISSN

COMPRESSION FOR QUANTUM POPULATION CODING

Unit II. Page 1 of 12

Heteroskedasticity ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Diffusive DE & DM Diffusve DE and DM. Eduardo Guendelman With my student David Benisty, Joseph Katz Memorial Conference, May 23, 2017

Problem 3.1 (Verdeyen 5.13) First, I calculate the ABCD matrix for beam traveling through the lens and space.

A Numerical Integration for Solving First Order Differential Equations Using Gompertz Function Approach

Math 3 Unit 3: Polynomial Functions

Exam Programme VWO Mathematics A

Support Vector Machines. CSE 4309 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington

CHAPTER 2 Special Theory of Relativity

Review for Exam Hyunse Yoon, Ph.D. Adjunct Assistant Professor Department of Mechanical Engineering, University of Iowa

Heat, Work, and the First Law of Thermodynamics. Chapter 18 of Essential University Physics, Richard Wolfson, 3 rd Edition

Rational Expressions and Functions

INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION

Lecture 11. Kernel Methods

Math 3 Unit 3: Polynomial Functions

Introduction to Density Estimation and Anomaly Detection. Tom Dietterich

7.3 The Jacobi and Gauss-Seidel Iterative Methods

Charge carrier density in metals and semiconductors

A A A A A A A A A A A A. a a a a a a a a a a a a a a a. Apples taste amazingly good.

Patterns of soiling in the Old Library Trinity College Dublin. Allyson Smith, Robbie Goodhue, Susie Bioletti

Lecture 3 Optimization methods for econometrics models

Gravitation. Chapter 8 of Essential University Physics, Richard Wolfson, 3 rd Edition

Question 1 carries a weight of 25%; question 2 carries 25%; question 3 carries 20%; and question 4 carries 30%.

IT 403 Statistics and Data Analysis Final Review Guide

" = Y(#,$) % R(r) = 1 4& % " = Y(#,$) % R(r) = Recitation Problems: Week 4. a. 5 B, b. 6. , Ne Mg + 15 P 2+ c. 23 V,

Expectation Propagation performs smooth gradient descent GUILLAUME DEHAENE

General Strong Polarization

Physics 371 Spring 2017 Prof. Anlage Review

Math 3 Unit 4: Rational Functions

Lesson 10: Solving Inequalities

On Forecast Strength of Some Linear and Non Linear Time Series Models for Stationary Data Structure

Lesson 24: True and False Number Sentences

General Strong Polarization

Chemical Engineering 412

(1) Introduction: a new basis set

On the Compounds of Hat Matrix for Six-Factor Central Composite Design with Fractional Replicates of the Factorial Portion

Yang-Hwan Ahn Based on arxiv:

MATH 1080: Calculus of One Variable II Fall 2018 Textbook: Single Variable Calculus: Early Transcendentals, 7e, by James Stewart.

Analysis for Parallel Repairable System with Degradation Facility

Creeping Movement across a Long Strike-Slip Fault in a Half Space of Linear Viscoelastic Material Representing the Lithosphere-Asthenosphere System

Bayesian Estimation and Inference for the Generalized Partial Linear Model

Transcription:

American Journal of Mathematics Statistics 03, 3(3: 57-65 DOI: 0.593/j.ajms.030303.09 Non-Parametric Weighted Tests for Change in Distribution Function Abd-Elnaser S. Abd-Rabou *, Ahmed M. Gad Statistics Department, Faculty of Economics Political Science, Cairo University, Cairo, Egypt Abstract For a given data set it may be required to discover if a change has been occurred. This is can be conducted using change-point analysis. Let X, X,,X n be independent romvariables with respective continuous distribution functions F, F,, F n such that F i (0=0 for all i.we consider the problem of testing the null hypothesis that F = F = = F n against the alternative of r-changes in the distribution functions of this sequence at unknown times <[nτ <[nτ <. <[nτ r <n, where[ y is the integer part of y. We study the asymptotic theory of change-point processes which are defined in terms of the empirical process. We propose study new weighted non-parametric change-point test statistics for a possible change in distribution function of a data set. Keywords Change Point Problem, EmpiricalProcesses, Kiefer Process, Limit Theorems, Monte Carlo Simu lation. Introduction In many applications it is necessary to discover if a data set comes from a single distribution or there is a change in the distribution function. The change-point inference is an effective powerful statistical tool for determining if when a change in a set of data has occurred. Let X, X,,X n, be independent rom variables with continuous distribution functions (DF F, F,, F n, respectively such that F i (0=0for all i. We are interested in testing the null hypothesis of no change HH oo : FF = FF = = FF = FF, ( where F is unknown against the alternative of at most r-changes (AMRC, GG, ii [ττ GG, [ττ < ii < [ττ (rr.. HH : FFii = (.... GG rr +, [ττ rr < ii, where rr is specified, the distribution functions GG ii, ii =,, rr + the change-point positions ττ ii, ii =,, rr are unknown GG ii GG ii +, ii =,,rr. Note that[ττ denotes the integer part of ττ. The aim of this article is to introduce weighted non-parametric tests for distributional change in a data set. The asymptotic distributions of these test statistics are derived. These tests also can be applied to the mean change ina data set. * Corresponding author: abdrfeps@feps.edu.eg (Abd-ElnasrAbd-Rabou Published online at http://journal.sapub.org/ajms Copyright 03 Scientific & Academic Publishing. All Rights Reserved The paper is organized as follows. In Section we will consider the above multiple change-point problem in the case of at most two change-points (AMTC, i.e. r=. In Section 3 we generalize the AMTC results to the case of rr. The proposed new change-point test statistics are presented in Section 4. Also, the asymptotic distributions of the proposed test statistics are derived in Section 4. In Section 5, we propose new test statistics for the case of at most one change point. We study the applicability of the proposed tests through a Monte Carlo study in section 6.. The Case of at Most Two Change Points (AMTC In this section we treat the case of at most -changes (AMTC, i.e. testing HH 0 of ( against the alternative of (, when r =. Many authors have discussed the change-point problem, testing (detection, estimation using both Bayesian non-bayesian approaches. Most of the work done in the change-point analysis is concerned with the case of at most one change (AMOC. Csörgő Horváth[ gave avery excellent extensive treatment review for the related work. Forthe AMTC, recent works are done by[,3. They propose weighted CUSUM tests for the multiple changes in the variance of a sequence of independent rom variables. By using the general form of U-statistic, they studied some CUSUM test statistics for the AMTC in variance. Hawkins[4 developed a dynamic program to test for a mu ltip le change point in the parameters of the general exponential family using repeated maximum likelihood algorithm. His algorithm, involves the application of the at most one change-point ma ximu m like lihood detector for every

58 Abd-Elnaser S. Abd-Rabou et al.: Non-Parametric Weighted Tests for Change in Distribution Function segmented group of data available. Syzsykowicz[5studied weighted approximations for various versions of the empirical processes under the null continuous alternatives of the AMOC. She approximated these processes by appropriate Gaussian processes. Let the smoothed two-parameter empirical process be given by (ss, tt = [(+tt {II[FF(XX ii ii = ss ss}, 0 ss, tt. (3 Let tt = (tt, tt, where 0 tt tt <. For 0 tt tt <, 0 ss,, define ββ ss, tt = [( + tt αα (ss, tt + [( + tt {αα (ss, tt αα (ss, }, (4 ss, tt = {( tt KK(ss, tt + tt KK(ss, tt tt KK(ss, }/, (5 where {KK(ss, yy; 0 ss, 0 yy < } is a Kiefer process, i.e., a mean zero two-parameters Gaussian process with EE(KK(ss,yy KK(ss,yy = (ss ss ss ss (yy yy. (6 The following result follows from Theorem 8.. of[5. Theorem. Assume that HH 0 of Eq. ( holds true. Then, there exists a Kiefer process K(.,. such that as, ββ (ss, tt (ss, tt = oo pp (, 0 ss 0<tt tt < whereββ (.,. is as in Eq. (4 ππ (.,. is as in Eq. (5. Let (.,. be as in Eq. (3 ββ (ss, tt be as in Eq. (4. Define AA (ss, yy = (ss, yy [( +yy (ss,. (7 Note that ββ ss, tt = [ ( + tt AA (ss, tt + [( + tt AA (ss, tt. Let Q be the class of positive functions on (0,, which are non-decreasing in a neighbourhood of zero non-increasing in the neighbourhood of one. A function q defined on (0, is called positive if inf δδ tt δδ qq(tt > 0, δδ 0,. Let qq, qq QQ, the AMTC weighted-process, ββ ww (.,. is defined as ββ ww(ss, tt = [ (+ tt AA (ss,tt + qq [ (+ tt AA (ss,tt. (8 qq (tt Let K(.,. be a Kiefer process. Define Γ (ss, tt = KK (ss, tttt(ss,, 0 ss, tt, Γ ww ss,tt = ( tt Γ ( ss,tt qq + tt Γ (ss,tt. (9 qq (tt Note that Γ (ss, tt Γ(ss, tt = KK(ss, tt tttt(ss, BB (ssbb (tt, 0 ss, tt, (0 where BB (. BB (. are two independent Brownian bridge. Now we give the main theorem of this section. Theorem. Under HH oo of (, there exists a Kiefer process K(.,. such that as,. if for i=, tt( tt log log (tt( tt / lim = 0, tt 0 qq ii (tt then tt( tt llllll llllll (tt( tt / lim = 0, tt 0 qq ii (tt 0 ss 0 tt,tt <. if for i=, then 0 ss 0 ss ββ ww ss, tt ΓΓ ww ss, tt = oo pp ( tt( tt log log (tt( tt / lim <, tt 0 qq ii (tt tt( tt log log tt( tt lim <, tt qq ii (tt 0 tt,tt < ββ ww ss, tt ΓΓ ww ss, tt = OO pp ( ββ ww ss, tt DD 0 tt,tt < 0 ss Γ ww ss, tt. 0 tt,tt < Proof of Theorem (Sketch First we can easily notice that AA (ss, tt ii αα (ss, tt ii, 0 ss, tt ii, ii =, ( where the right-h side is the two-time parameter empirical process of[5. Second if we put qq ii (tt ii =, ii =, in (6..3 of[5, we get as 0 ttii tt ii [( +tt ii = oo(. ( Using the definition of the processes of Eq. (8 Eq. (9, the statements of Eq. (, Eq. ( Theorem 8.3. of[5, we complete the proof of this theorem. Now, let ββ ww.,. be the process defined in Eq. (8, then by Theorem ( the relations in Eq. (9 Eq. (0, we have ββ ww ss, tt DD Γ ww ss, tt, where ββ ww ss, tt = BB (ss ( tt BB + tt BB (tt, (3 qq qq (tt BB ii (., ii =, are as in Eq. (0. As in Pouliot[3, we may use the one weight function ww tt = qq ( tt +qq, 0 < tt tt <, for the whole process. It is very easy to see that Theorem remains true under the one-weight function ww.. In this case the corresponding limiting process of Eq. (3 becomes Γ ww ss, tt = ww tt BB ( ss BB tt, where BB tt = ( tt BB + tt BB (tt,

American Journal of Mathematics Statistics 03, 3(3: 57-65 59 BB ii (., ii =, are independent Brownian bridge. It is clear that Γ ww ss, tt is a mean zero Gaussian process with covariance function where CC ss,tt, yy = EE Γ ww ss,tt Γ ww ss, yy = (ss ss ss ss CC BB tt, yy, ww tt ww yy CC BB tt, yy = ( tt [( rr yy + yy yy + tt [( rr (tt yy + yy (tt yy tt yy. 3. The Case of at Most r Change Points (AMRC We consider here the general case of rr. Following the definition of the change-point processes in (6.5 of Pouliot (00, we define the weighted r-change point empirical process its corresponding weighted Gaussian process as follows. Assume that qq jj (. QQ, jj =,, rr, satisfy the two assumptions of part ( of Theorem, we define the weighted r-time parameter empirical process 3/ ([(+ tt mm + [(+tt mm rr mm = qq mm MM ww ss, tt AA (ss, tt mm + ( ttrr ( ttrr[(+ttrrqqrr(ttrr (ss,, (4 where 0 = tt 0 tt tt tt rr tt rr+ = the processes αα (.,. AA (.,. are defined by Eq. (3 Eq. (7 respectively. We also define the weighted r-time parameter limiting Gaussian process as follows; Λ ww ss, tt : = rr + tt mm KK (ss,tt mm mm= + ttrr ttrrttrrqqrr(ttrrkk(ss,, (5 where K(.,. is the Kiefer process of Eq. (6 KK (ss, = KK(ss, tttt(ss,, 0 ss, tt. Now, following the steps of the proof of Theorem, we can state the general weighted- metric approximation for the r-time parameter empirical process of Eq. (4. Theorem 3. Under the null hypothesis HH 0 of (, there exists a Kiefer process K(.,. such that with the sequence of processes MM ww (.,. ww (.,. of (4 (5 respectively, we have as MM ww (ss, tt ΛΛ ww (ss, tt = oo pp (. 0 ss 0<tt tt tt rr < Under the conditions of Theorem 3, we obtain 0 ss 0<tt tt tt rr < MM ww ss, tt DD ΛΛ ww (ss, tt, where ΛΛ ww (ss,. is the process in Eq. (5 when n=. 4. The Proposed AMRC Test Statistics To introduce our proposed multiple change-point test statistics, we need the following integrated processes. Let αα (.,. be the smoothed two-parameters empirical process defined by Eq. (3.Then integrating over 0 ss, we get αα (tt = αα (ss, ttdddd = [ (+tt [(+ tt FF(XX ii = ii 0 (6 define the integrated empirical process difference of Eq. (4 as; AA (tt ii = αα (tt ii [ (+tt ii αα (, ii =,,., rr. (7 The generalized test statistics integrated processes in the case of rr, AMRC, tt = (tt 0, tt,, tt rr, tt rr+, such that 0 = tt 0 < tt tt rr < tt rr+ = are given by MM tt = MM ss, tt dddd 0 rr = 3 ([( + tt mm + [( + tt mm AA mm = + ( [( + tt rr ( [( + tt rr ([( + tt rr αα ( where αα (. is defined by Eq. (6 its corresponding limiting Gaussian process is Λ tt = Λ ss, tt dddd 0 rr = BB (ssdddd + tt mm BB, 0 mm = where Λ(.,. is defined by Eq. (5 B(. is a stard Brownian bridge defined on the same probability space. Next, we define the weighted processes MM ww (. Λ ww (., that are needed to construct the AMRC test statistics. For tt = (tt 0, tt,, tt rr,tt rr +, such that 0 = tt 0 < tt tt rr < tt rr+ =, we define the following weighted processes. ([(+ tt mm + [(+tt mm qq mm MM tt = 3 rr mm = AA + ( [( + tt rr ( [( + tt rr ([(+tt rr Λ ww tt 0 BB (ssdddd qq rr (tt rr αα ( } (8 + tt mm rr mm = BB, (9 qq mm where αα (. AA (. are given by Eq. (6 Eq. (7 respectively qq jj (. QQ, jj =,,, rr are the weight functions of Theorem 3. Theorem 4. Let B(. be a Brownian bridge assume that HH oo of ( holds. Then as MM ww tt DD ΛΛ ww tt, where tt = (tt 0, tt,, tt rr,tt rr +, such that 0 = tt 0 < tt tt rr < tt rr+ = MM ww. ΛΛ ww. are the processes given by (8 (9 respectively. The proof of this theorem can be deducted easily from that of Theorem 3.

60 Abd-Elnaser S. Abd-Rabou et al.: Non-Parametric Weighted Tests for Change in Distribution Function Corollary. By the continuous Mapping Theorem for rr, we have TT (rr = tt MM ww (tt DD ΛΛ ww (tt = TT tt TT (rr = MM ww tt = ΛΛ ww tt tt The asymptotic distribution of TT (rr TT (rr are, up to our knowledge, are unknown. For this reason we present the special case; at most one change point test statistics. Then we study the applicability of the proposed teststhrough a Monte Carlo simulation study. 5. The at Most One Change (AMOC Test Statistics First, we present the two-weight function test statistics for the CDF on change-point change. For 0 yy, consider the following weight functions; WW (yy = yy( yy, (0 WW (yy = yy( yy log log. ( yy( yy Let αα (. is the empirical counterpart of the process in Eq. (6, defined by replacing the CDF, F(., by its sample one FF (.. We define the test process {AA (kk; kk }, by AA (kk αα (kk kk αα (, = TT 4 ( kk tt FF (XX ii kk ii = FF (XX ii = ( ii where FF (tt = II(XX ii = ii tt, tt RR, is the sample empirical distribution function. The above test process is the natural cidate in case of testing for a change in the CDF of a sequence of independent rom variables. Let II ( (aa, bb = (, II ( (aa, bb = [, [, where (a, b = (0.07033, 0.98966, see[3. Now, we p ropose the follo wing AMOC test statistics; TT ( max kk AA (kk, (3 AA TT ( max (kk kk WW ( kk / (4 ( (kk kk II (aa, bb TT 3 ( max kk AA (kk (, kk II (aa, bb, WW kk (5 AA (kk kk II ( WW ( kk (aa, bb / AA (kk kk II ( WW ( kk (aa, bb, / (6 TT 5 ( AA (kk ( II (aa,bb + AA (kk ( II (aa,bb / WW ( kk WW ( kk / / WW ( kk (7 TT 6 ( AA (kk ( II (aa,bb + AA (kk ( II (aa,bb (8 where WW (., WW (. AA (. are given by Eq. (0, Eq. ( Eq. ( respectively. Note that the first four test statistics are CDF change point versions analogues to Pouliot (00. The last two TT 5 ( TT 6 ( are new proposed test statistics. The limiting distributions of the above test statistics are unknown in literature. Thus we conduct a Monte Carlo study to determine the performance of these test statistics. 6. Estimated Critical Values Powers The critical values of the proposed tests in (3-(8 have been evaluated via simulation. Also, the power of the proposed tests have been estimated. These estimation tasks are conducted using three simulation studies. 6.. Simulation The aim of this simu lation is two-folds. First, to estimate the critical values of each test at different sample sizes under different distributions.second, to show that the critical values are stable. The upper 5% critical values for each test of the six tests given in (3-(8 have been obtained via simulation study. A sample of each distribution has been simulated. The sample sizes are fixed at 5, 0, 5, 30, 40, 50, 00, 00, 500, 000. The underlying distributions are the normal distribution, the chi-square distribution, the exponential distribution the uniform distribution. Each test value is evaluated for each sample. This process is replicated 0000 times. Then, each test values are sorted the upper 95% percentile is obtained. The simulation results are displayed in Table Table. The other distributions critical values have a similar behaviour. From these results we can notice that the second test has a higher critical values followed by the third test, for all the four distributions. The fifth test has the lowest critical values across all the four distributions. The critical values of the first, fourth sixth tests are close. Generally, the critical values of all tests starts at a higher (lower levelfor smaller sample size; n = 5, but they converge reasonably as the sample size increase, see the table. This convergence appears from a sample size as large as 00.

American Journal of Mathematics Statistics 03, 3(3: 57-65 6 Ta ble. The estimated upper 5% critical values of the normal chi-square distributions The normal distribution n T T T3 T4 T5 T6 5 0.3443.783 0.4780 0.4780 0.933 0.37 0 0.35.863 0.467 0.467 0.390 0.935 5 0.3600.5 0.57 0.4507 0.78 0.30 30 0.365.33 0.6035 0.445 0.803 0.33 40 0.3656.359 0.635 0.4089 0.73 0.34 50 0.3649.360 0.643 0.4057 0.88 0.3364 00 0.3765.787 0.6900 0.3860 0.877 0.3336 00 0.3797.80 0.743 0.3667 0.870 0.338 500 0.3847.3049 0.757 0.357 0.909 0.3343 000 0.3900.370 0.7738 0.345 0.880 0.3376 The chi-square distribution n T T T3 T4 T5 T6 5 0.3443.783 0.4780 0.4780 0.38 0.933 0 0.3578.6 0.467 0.467 0.40 0.890 5 0.3600.43 0.57 0.4507 0.753 0.397 30 0.36.90 0.6035 0.445 0.80 0.374 40 0.3676.500 0.604 0.4089 0.70 0.36 50 0.379.585 0.643 0.4057 0.973 0.340 00 0.3735.633 0.6900 0.3860 0.878 0.3343 00 0.38.933 0.743 0.3706 0.853 0.340 500 0.3863.33 0.754 0.357 0.889 0.3335 000 0.3890.3065 0.7647 0.349 0.855 0.336 Ta ble. The estimated upper 5% critical values of the exponential uniform distributions The exponential distribution n T T T3 T4 T5 T6 5 0.3443.783 0.4780 0.4780 0.90 0.353 0 0.3578.9 0.467 0.467 0.38 0.93 5 0.3600.05 0.57 0.4507 0.77 0.395 30 0.36.90 0.6035 0.445 0.80 0.35 40 0.3656.486 0.604 0.4089 0.69 0.335 50 0.3705.585 0.6494 0.4057 0.94 0.389 00 0.3740.68 0.690 0.3860 0.879 0.3389 00 0.3794.869 0.77 0.3667 0.860 0.399 500 0.3853.3076 0.7657 0.3503 0.90 0.3330 000 0.394.376 0.7837 0.349 0.9 0.3439 The uniform distribution n T T T3 T4 T5 T6 5 0.3443.00 0.4780 0.4780 0.98 0.357 0 0.3578.6 0.467 0.467 0.398 0.883 5 0.3600.306 0.57 0.4507 0.793 0.39 30 0.36.46 0.6035 0.445 0.84 0.35 40 0.3656.36 0.635 0.430 0.733 0.33 50 0.369.470 0.643 0.4057 0.95 0.3406 00 0.3770.804 0.6979 0.3860 0.887 0.394 00 0.3774.803 0.769 0.3667 0.877 0.3359 500 0.3839.997 0.755 0.357 0.87 0.3350 000 0.3864.309 0.7705 0.349 0.874 0.354

6 Abd-Elnaser S. Abd-Rabou et al.: Non-Parametric Weighted Tests for Change in Distribution Function 6.. Simulation The aim of this simulation study is to estimate the power of the tests (3 - (8 assuming there is one change point in the mean. A sample of fixed size is generated fromeach distribution. The sample sizes are fixed at 0, 50, 00 units, to cover small, moderate large sample size. The samples are generated from the normal distribution, the chi-square distribution, the exponential distribution the uniform distribution. The change point positions are fixed at the first tail (5%n of the sample in the middle (50%n of the sample. Different change shifts have been used, name ly = 0.5, =.0 =.5. The replications number is 0000. The percentage of times the test statistic exceeds the estimated critical values is reported for each change, test statistic, sample size. The results are displayed in Table 3 Table 4. The simulation results show that the estimated power of each test undereach distribution increase as the change position moves to the middle of the sample. The estimated powers of all tests increase as the change shift increases the sample size increases. From the results we can see that the fourth test has the highest powerfollowed by the sixth test for all distributions in the different settings. The third test has the lowest power in the different setting. The estimated powers of the first the second tests are comparable in the different settings. The powers under the uniform distribution has the highest values, whereas the lowest powers are under the chi-square distribution. This is not surprising because any change in the mean of the uniform rom variable affect the distribution boundaries too. However, the chi-square distribution will change its shape very slowly with such minor location changes. Ta ble 3. The estimated powers of the normal chi-square distributions (change in the mean The normal distribution n Δ position T T T3 T4 T5 T6 0 0.5 0.5 6.3 6.7 0.4 66.4 6.6 7.0 0.50 4.6 5.9.7 7.4 8.4.7.0 0.5 9. 9. 0.6 78.8 0.4 3. 0.50 44.6 47.7 0.5 90.0 9.4 38.6.5 0.5 3.9 3.6.0 90.5 5.. 0.50 79.6 8. 35.3 98.6 38. 69.7 50 0.5 0.5 0.8 0.7 6.8 9.9 7.7.9 0.50 33.7 35. 7. 95.9 3. 6.0.0 0.5 30.6 7.9 5.4 98.8 0. 35.6 0.50 87.0 88.5 8. 99.8 4.9 76.7.5 0.5 63.7 58.6 3.5 99.9 40. 67.3 0.50 99.8 99.8 60.5 00.0 78.9 98.5 00 0.5 0.5 6.0 4.8 4. 98.0 6.3.6 0.50 59. 60.7 0.9 99.3 4.7 49.3.0 0.5 57.8 5.9 45.7 99.9 53.3 69. 0.50 99.3 99.4 56. 00.0 8.3 97.8.5 0.5 94.7 9.6 8.9 00.0 87.0 95.8 0.50 00.0 00.0 98.8 00.0 99.6 00.0 The chi-square distribution n Δ position T T T3 T4 T5 T6 0 0.5 0.5 4.3 4.4 0. 6.6 4.9 5.4 0.50 4.7 5. 0.3 6.8 5. 5..0 0.5 4.9 5. 0.3 63.0 5.7 6.3 0.50 6.8 7. 0.7 63. 5.6 7.3.5 0.5 4.8 5.0 0.3 64. 6.3 6.9 0.50 9.4 9.9. 67.0 6.5 9.5 50 0.5 0.5 5.5 5.4 5.5 89.0 4.5 5. 0.50 6.3 6.4 5.0 89.0 4.6 5.7.0 0.5 5.8 5.5 5.3 89.7 5.6 6.7 0.50 0.7. 4.8 90.8 6. 9.4.5 0.5 7.8 7.4 6.8 9.9 7.0 9.4 0.50 8.7 9.5 5.8 9.7 7.9 4.9 00 0.5 0.5 5.8 5.7 5.8 94.4 6.6 6.0 0.50 8. 8. 5.4 94.3 6.6 7.5.0 0.5 7.5 7.4 7.9 95.6 8.9 8.9 0.50 8.5 9.3 6.3 96.4 0.0 5.7.5 0.5 0.9 0.4. 96.7.6 4.4 0.50 35.4 37.0 8.3 98.0 5.5 8.5

American Journal of Mathematics Statistics 03, 3(3: 57-65 63 Table 4. The estimated powers of the exponential the uniform distributions (change in the mean The exponential distribution n Δ position T T T3 T4 T5 T6 0 0.5 0.5 6.9 7.0 0.5 7.6 0..4 0.50 5.5 7. 4.5 80.9.5 4.4.0 0.5. 0.7 0.8 8.8 6.. 0.50 6. 65.5 4.0 95.6 9.0 56.4.5 0.5 3.7 3.4. 90.6 0.5 3. 0.50 85.8 88.0 49.3 99.4 46. 80.3 50 0.5 0.5 8.3 6. 6.7 96.8 9. 5.9 0.50 60.0 6. 9.8 98.4 4.8 50.9.0 0.5 46.9 4.3 36.3 99.5 4.9 57. 0.50 96.7 97. 35.3 00.0 65.9 93.3.5 0.5 7.5 65.8 54.0 99.9 6.4 77.6 0.50 00.0 00.0 76.5 00.0 90.6 99.6 00 0.5 0.5 34.0 3.6 33. 99.5 34.5 4. 0.50 89. 9. 3.5 00.0 49.4 80..0 0.5 9.7 77..0 00.0 7.6 83.4 0.50 00.0 00.0 88.6 00.0 96.3 99.9.5 0.5 96. 95. 86.6 00.0 90. 96.5 0.50 00.0 00.0 99.8 00.0 00.0 00.0 The uniform distribution n Δ position T T T3 T4 T5 T6 0 0.5 0.5 3...0 93.4 6.7 5.0 0.50 85.3 87. 43. 99.7 4. 77.8.0 0.5.4 9.4.7 00.0 8. 48. 0.50 00.0 00.0 00.0 00.0 8. 00.0.5 0.5.4 9.5.7 00.0 8.4 48.8 0.50 00.0 00.0 00.0 00.0 8.7 00.0 50 0.5 0.5 69.7 63.8 3.8 00.0 45.4 73.3 0.50 99.9 00.0 73. 00.0 85.9 99.4.0 0.5 00.0 00.0 98.5 00.0 96.6 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0.5 0.5 00.0 00.0 98. 00.0 96.3 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 00 0.5 0.5 98. 97.0 86.0 00.0 9.9 98.3 0.50 00.0 00.0 99.7 00.0 99.9 00.0.0 0.5 00.0 00.0 00.0 00.0 00.0 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0.5 0.5 00.0 00.0 00.0 00.0 00.0 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 6.3. Simulation 3 The aim of this simulation is to estimate the power of the proposed testassuming that there is change point in the distribution rather than only the distribution mean. A sub-sample is simulated from a given distribution augmented by another sub-sample of another distribution. This means that there is a change in distribution. The change positions are the first 5% point of the sample the middle of the sample. The sample sizes are fixed at n = 0, n = 50 n= 00. Four different distributions have been used; namely the normal distribution, the chi-square distribution, the exponential distribution the uniform distribution. The results in Table 6 show the estimated power of the tests for the normal distribution against the other three distributions (the chi-square distribution, the exponential distribution the uniform distribution. Also, the table shows the estimated power of the tests assuming the chi-square distribution against the exponential the uniform distribution in addition to the exponential distribution against the uniform distribution. From the results we can notice that the fourth test has the highest powerfollowed by the sixth test. Generally, the third test has the lowest estimated powers. The highestpowers are obtained for the chi-square distribution against the rest of the distributions. 7. Discussion Conclusions In this paper we presented new non-parametric weighted type test statisticsfor a change in the cumulative distribution function of a set of data. These proposed test statistics are based on the empirical processes. The asymptotic distributions of these test statistics are unknown intractable to be studied theoretically.

64 Abd-Elnaser S. Abd-Rabou et al.: Non-Parametric Weighted Tests for Change in Distribution Function We conducted a simulation study to estimate the critical values powersof the proposed tests in the at most one change point. Our weighted proposed tests have good performance in all settings; different distributions, different Table 5. The estimated powers of a distribution change sample sizes different change positions. The difficulty of tracing the limiting distributions of the proposed weighted test statistics encouragethe search for a simple new weighted test statistics. The normal distribution against the chi-square distribution n position T T T3 T4 T5 T6 0 0.5 5.3.3.9 00.0 8. 47.0 0.50 00.0 00.0 00.0 00.0 80.8 00.0 50 0.5 00.0 00.0 98.0 00.0 95.7 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 00 0.5 00.0 00.0 00.0 00.0 00.0 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 The normal distribution against the exponential distribution n position T T T3 T4 T5 T6 0 0.5 9.4 9.4 0.5 78.0.9 6. 0.50 46.6 49.7.3 9.0 0.6 38.8 50 0.5 33.0 30.3 4. 98.4 6.6 39.0 0.50 88.0 89. 9.8 99.7 4.8 76.4 00 0.5 57.6 53.4 50.4 99.9 53.0 66.6 0.50 99.4 99.5 57.3 00.0 8. 97.6 The normal distribution against the uniform distribution n position T T T3 T4 T5 T6 0 0.5 7.7 7.4 0.4 7.9 3. 4. 0.50 7.5 9. 5.6 80.0.5.5 50 0.5 9.8 8.6.8 97.7.9 7. 0.50 55.4 57.0 9.9 98. 0. 44. 00 0.5 3.3 9.0 38.4 99.4 34.7 4.0 0.50 8.9 83.8.7 99.8 44.7 73.5 The chi-square distribution against the exponential distribution n position T T T3 T4 T5 T6 0 0.5 0.8 8.6.3 00.0 6.7 48.3 0.50 00.0 00.0 99.9 00.0 79. 00.0 50 0.5 00.0 00.0 94.7 00.0 94. 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 00 0.5 00.0 00.0 00.0 00.0 00.0 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 The chi-square distribution against the uniform distribution n position T T T3 T4 T5 T6 0 0.5. 8.8.6 00.0 6.9 48.0 0.50 00.0 00.0 00.0 00.0 78.9 00.0 50 0.5 00.0 00.0 98. 00.0 95.0 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 00 0.5 00.0 00.0 00.0 00.0 00.0 00.0 0.50 00.0 00.0 00.0 00.0 00.0 00.0 The exponential distribution against the uniform distribution n position T T T3 T4 T5 T6 0 0.5 5.7 6.0 0.4 67.6 9.8 9.6 0.50 3.3 4.6.8 7. 7.9 3.6 50 0.5.0 9.8.8 94.9 4. 6.4 0.50 3.0 3.9 7.0 95.0.4 5.0 00 0.5 6.7 5.3. 98.3 0.8 3.4 0.50 55. 57..3 99.0 3.0 44.4

American Journal of Mathematics Statistics 03, 3(3: 57-65 65 REFERENCES [ Csörgő, M. Horváth, L., 979, Limit Theorems in Change-Point Analysis, John Wiley & Sons, New York. [ Orasch, M. R., 999, Multiple change-points with an application to financial modelling, Ph.D. thesis, Carleton University, Ottawa, Canada. [3 Pouliot, W. J., 00, Non-parametric, Non-sequential change-points analysis, Ph.D. thesis, Carleton University, Ottawa, Canada. [4 Hawkins, D. M., 00, Fitting multiple change-point models to data, Computational Statistics & Data Analysis, 37, 33-34. [5 Syzsykowicz, B., 99, Weak convergence of stochastic processes in weighted metrics their applications to contiguous change-points analysis, Ph.D. thesis, Carleton University, Ottawa, Canada. [6 Aly, E.-E., Abd-Rabou, A. S. Al-Kari, N. M., 003, Tests for multiplechange points under ordered alternatives, Metrika, 57, 09-.