Investigation of goodness-of-fit test statistic distributions by random censored samples

Similar documents
Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

Statistic Distribution Models for Some Nonparametric Goodness-of-Fit Tests in Testing Composite Hypotheses

Chapter 31 Application of Nonparametric Goodness-of-Fit Tests for Composite Hypotheses in Case of Unknown Distributions of Statistics

Application of Homogeneity Tests: Problems and Solution

GOODNESS-OF-FIT TEST FOR RANDOMLY CENSORED DATA BASED ON MAXIMUM CORRELATION. Ewa Strzalkowska-Kominiak and Aurea Grané (1)

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip

A note on vector-valued goodness-of-fit tests

Non-parametric Tests for Complete Data

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A comparison study of the nonparametric tests based on the empirical distributions

Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions

Goodness-of-Fit Tests for Uniformity of Probability Distribution Law

1 Glivenko-Cantelli type theorems

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests

On the Goodness-of-Fit Tests for Some Continuous Time Processes

Lecture 2: CDF and EDF

Modified Kolmogorov-Smirnov Test of Goodness of Fit. Catalonia-BarcelonaTECH, Spain

UNIVERSITÄT POTSDAM Institut für Mathematik

Parametric Evaluation of Lifetime Data

Quantile Regression for Residual Life and Empirical Likelihood

Distribution Fitting (Censored Data)

Statistical Analysis of Competing Risks With Missing Causes of Failure

Fall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

One-Sample Numerical Data

Semi-Competing Risks on A Trivariate Weibull Survival Model

CHAPTER 1 CLASSES OF FIXED-ORDER AND ADAPTIVE SMOOTH GOODNESS-OF-FIT TESTS WITH DISCRETE RIGHT-CENSORED DATA

Recall the Basics of Hypothesis Testing

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study

PROPERTIES OF THE GENERALIZED NONLINEAR LEAST SQUARES METHOD APPLIED FOR FITTING DISTRIBUTION TO DATA

11 Survival Analysis and Empirical Likelihood

Exact Statistical Inference in. Parametric Models

Chapter 11. Hypothesis Testing (II)

APPLIED METHODS OF STATISTICAL ANALYSIS. APPLICATIONS IN SURVIVAL ANALYSIS, RELIABILITY AND QUALITY CONTROL

Exact goodness-of-fit tests for censored data

On the Comparison of Fisher Information of the Weibull and GE Distributions

Goodness of Fit Tests for Rayleigh Distribution Based on Phi-Divergence

arxiv: v1 [stat.me] 2 Mar 2015

Weibull Reliability Analysis

Non-parametric Tests for Complete Data

New goodness-of-fit plots for censored data in the package fitdistrplus

Exact goodness-of-fit tests for censored data

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Asymptotic Statistics-VI. Changliang Zou

Modeling the Goodness-of-Fit Test Based on the Interval Estimation of the Probability Distribution Function

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS

Testing Goodness-of-Fit of a Uniform Truncation Model

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data

Empirical Likelihood in Survival Analysis

AFT Models and Empirical Likelihood

14.30 Introduction to Statistical Methods in Economics Spring 2009

Burr Type X Distribution: Revisited

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

and Comparison with NPMLE

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

Analytical Bootstrap Methods for Censored Data

University of California, Berkeley

Dr. Maddah ENMG 617 EM Statistics 10/15/12. Nonparametric Statistics (2) (Goodness of fit tests)

Stat 710: Mathematical Statistics Lecture 31

Joseph O. Marker Marker Actuarial a Services, LLC and University of Michigan CLRS 2010 Meeting. J. Marker, LSMWP, CLRS 1

Step-Stress Models and Associated Inference

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Approximate Self Consistency for Middle-Censored Data

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

Reliability analysis of power systems EI2452. Lifetime analysis 7 May 2015

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

New mixture models and algorithms in the mixtools package

Inference on reliability in two-parameter exponential stress strength model

APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

Survival Analysis Math 434 Fall 2011

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Weibull Reliability Analysis

Goodness-of-fit test for the Cox Proportional Hazard Model

Stochastic Simulation

Hypothesis testing:power, test statistic CMS:

Multistate Modeling and Applications

ICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

Testing Exponentiality by comparing the Empirical Distribution Function of the Normalized Spacings with that of the Original Data

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

Spline Density Estimation and Inference with Model-Based Penalities

NAG Library Chapter Introduction. G08 Nonparametric Statistics

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Analysis of Middle Censored Data with Exponential Lifetime Distributions

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data

Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued

Bayesian estimation of the discrepancy with misspecified parametric models

Nonparametric Bayes Estimator of Survival Function for Right-Censoring and Left-Truncation Data

STAT 6350 Analysis of Lifetime Data. Probability Plotting

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution.

Analysis of Statistical Algorithms. Comparison of Data Distributions in Physics Experiments

TMA 4275 Lifetime Analysis June 2004 Solution

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

Censoring and Truncation - Highlighting the Differences

Transcription:

d samples Investigation of goodness-of-fit test statistic distributions by random censored samples Novosibirsk State Technical University November 22, 2010

d samples Outline 1 Nonparametric goodness-of-fit tests for complete data 2 Modified nonparametric goodness-of-fit tests for random censored data 3 Investigation of test statistic distributions for different distributions of censoring times 4 Transformation of censored sample to complete sample by means of randomization 5 RRN χ 2 test for censored data 6 Test power

d samples Outline 1 Nonparametric goodness-of-fit tests for complete data 2 Modified nonparametric goodness-of-fit tests for random censored data 3 Investigation of test statistic distributions for different distributions of censoring times 4 Transformation of censored sample to complete sample by means of randomization 5 RRN χ 2 test for censored data 6 Test power

d samples Outline 1 Nonparametric goodness-of-fit tests for complete data 2 Modified nonparametric goodness-of-fit tests for random censored data 3 Investigation of test statistic distributions for different distributions of censoring times 4 Transformation of censored sample to complete sample by means of randomization 5 RRN χ 2 test for censored data 6 Test power

d samples Outline 1 Nonparametric goodness-of-fit tests for complete data 2 Modified nonparametric goodness-of-fit tests for random censored data 3 Investigation of test statistic distributions for different distributions of censoring times 4 Transformation of censored sample to complete sample by means of randomization 5 RRN χ 2 test for censored data 6 Test power

d samples Outline 1 Nonparametric goodness-of-fit tests for complete data 2 Modified nonparametric goodness-of-fit tests for random censored data 3 Investigation of test statistic distributions for different distributions of censoring times 4 Transformation of censored sample to complete sample by means of randomization 5 RRN χ 2 test for censored data 6 Test power

d samples Outline 1 Nonparametric goodness-of-fit tests for complete data 2 Modified nonparametric goodness-of-fit tests for random censored data 3 Investigation of test statistic distributions for different distributions of censoring times 4 Transformation of censored sample to complete sample by means of randomization 5 RRN χ 2 test for censored data 6 Test power

d samples Nonparametric goodness-of-fit tests for complete data Nonparametric goodness-of-fit tests for complete data The Kolmogorov test statistic D n = sup F n (x) F (x, θ), (1) x < F n (x) is the empirical distribution function. The distribution of the statistic (1) in testing simple hypotheses obeys the Kolmogorov distribution law K(S). The Cramer-von Mises-Smirnov test statistic W 2 n = and in the Anderson-Darling test the statistic A 2 n = (F n (t) F (t)) 2 df (t), (2) (F n (t) F (t)) 2 df (t) F (t)(1 F (t)). (3) In testing a simple hypothesis, statistic (2) has the distribution a1(s), and statistic (3) has the distribution a2(s).

d samples Nonparametric goodness-of-fit tests for complete data Approximations of test statistic distributions It should be noted that in case of composite hypotheses, test statistic distributions G(S H 0 ) are affected by a number of factors, such as the form of the tested distribution F (t; θ), the number of estimated parameters, the estimation method used. In the papers by Lemeshko (2009) the approximations of statistic distribution models were obtained for testing composite hypotheses for a wide range of distribution laws using the maximum likelihood estimates of unknown parameters. These papers are available in the Internet http://ami.nstu.ru/ headrd/seminar/publik html/ Models Part I eng.pdf http://ami.nstu.ru/ headrd/seminar/publik html/ Models Part II eng.pdf

d samples Modified nonparametric goodness-of-fit tests for random censored data Independent random censoring Let lifetime T and censoring time C are independent random variables from distribution functions F (t) and F C (t) respectively. All lifetimes and censoring times are assumed mutually independent, and it is assumed that F C (t) does not depend on any of the parameters of F (t). So, t i = min (T i, C i ) and δ i = 1 {T i C i }, i = 1,..., n.

d samples Modified nonparametric goodness-of-fit tests for random censored data Modified Kolmogorov test The Kolmogorov test statistic D n = sup ˆF n (t) F (t; θ), t< where ˆF n (t) is the Kaplan-Meier estimator. Formulas for calculation: D n = max ( D n +, Dn ), D + n D n ( ) ( = max {ˆFn t(i) F t(i), θ )}, i: δ i =1 { = max F ( t (i), θ ) ( ) ˆF } n t(i 1). i: δ i =1

d samples Modified nonparametric goodness-of-fit tests for random censored data Modified Cramer-von Mises-Smirnov test The Cramer-von Mises-Smirnov test statistic (Koziol and Green (1976)) W 2 n = W 2 n = r j: δ j =1 ) 2 (ˆF n (t) F (t; θ) df (t; θ) {ˆF 2 n ( t(j) ) (F ( t(j+1) ; θ ) F ( t (j) ; θ ) ) ˆF n ( t(j 1) ) ( F 2 ( t (j+1) ; θ ) F 2 ( t (j) ; θ )) } + r 3 where r is the number of complete observations.

d samples Modified nonparametric goodness-of-fit tests for random censored data Modified Anderson-Darling test The Anderson-Darling test statistic A 2 n = (ˆF n (t) F (t; θ) A 2 n = r + r j: δ j =1 ( ( 1 ˆF n ( t(j 1) ) 2 df (t;θ) F (t;θ)(1 F (t;θ)), { (ˆF n 2 ( ) t(j 1) ˆF n 2 ( ) ) t(j) log F ( t (j) ; θ ) ) ) 2 ( 1 ˆF n ( t(j) ) ) 2 ) log ( 1 F ( t (j) ; θ )) }, where r is the number of complete observations.

d samples Modified nonparametric goodness-of-fit tests for random censored data These modified goodness-of-fit tests are mentions in many papers on statistical analysis of censored data. For example, Anderson (1952), Hjort (1992), Nair (1981) Reineke (2004), Lawless (2003) and many others. And it is assumed that when testing a goodness-of-fit hypothesis p-value can be calculated basing on a simulated statistic distribution G(S H 0 ). The main purpose of the paper is to investigate with computer simulation technique the distributions of modified test statistics for various distributions of censoring times.

d samples Modified nonparametric goodness-of-fit tests for random censored data Simulation study of statistic distributions Case 1 F (t; θ) is the Weibull distribution with parameters (2, 2) - red curve F C (t) is the Beta-I distribution Figure: Considered distributions F (t; θ) and F C (t)

d samples Modified nonparametric goodness-of-fit tests for random censored data Simulation study of statistic distributions Figure: Kolmogorov test statistic distributions for different censoring degrees when testing composite hypothesis of goodness-of-fit with the Weibull distribution, n = 100

d samples Modified nonparametric goodness-of-fit tests for random censored data Simulation study of statistic distributions Case 2 F (t; θ) is the Weibull distribution with parameters (2, 2) - red curve F C (t) is the Weibull distribution with other values of parameters Figure: Considered distributions F (t; θ) and F C (t)

d samples Modified nonparametric goodness-of-fit tests for random censored data Simulation study of statistic distributions Figure: Kolmogorov test statistic distributions for different distributions of censoring times when testing the composite hypothesis of goodness-of-fit with the Weibull distribution, censoring degree is 60%

d samples Modified nonparametric goodness-of-fit tests for random censored data Simulation study of statistic distributions Algorithm for simulation of random censored sample 1 Generate a complete sample of the size n from the hypothetical distribution: T i = F 1 (ξ i ; ˆθ n ), i = 1, n, where ξ Uni (0, 1). 2 Calculate the Kaplan-Meier estimate of the censoring distribution ˆF c (t) by the inversed original sample. 3 Generate censoring times C i, i = 1, n by the following formula. ξ i c 1 ˆF c (c 1 ), 0 < ξ i ˆF c (c 1 ) ( ) ξ i ˆF c (c j ) (c j+1 c j ) C i = c j + (ˆF c (c j+1 ) ˆF, ˆF c (c j )) c (c j ) < ξ i ˆF c (c j+1 ), j = 1, k c k + c k (ξ i ˆF ) c (c k ), ξ i > ˆF c (c k ) where c 1,..., c k are the increase-ordered different censoring observations in the original sample, k is the number of different censoring observations in the original sample. 4 t i = min (T i, C i ), δ i = 1 {T i C i }, i = 1, n

d samples Modified nonparametric goodness-of-fit tests for random censored data Simulation study of statistic distributions Inversion of a censored sample Inversed sample is the original sample in which δ i = 1 have been replaced with δ i = 0 and vice versa

d samples Transformation of censored sample to complete sample by means of randomization Randomization In the sample of observations (t 1, δ 1 ), (t 2, δ 2 ),..., (t n, δ n ) we replace all censored observations (t i, δ i = 0) = C i by simulated times ˆT i from the hypothetical distribution.

d samples Transformation of censored sample to complete sample by means of randomization Such replacement enables to obtain a complete sample, for which one can apply the goodness-of-fit tests with statistics (1), (2), (3) for complete data. After this transformation it is necessary to estimate unknown parameters of hypothetical distribution by obtained complete sample. Then the distributions of statistics (1), (2), (3) by transformed samples are the same as in the case of originally complete data. It is possible to use the approximations of statistic distributions obtained in papers of Lemeshko (2009) for calculation of the p-value.

d samples Transformation of censored sample to complete sample by means of randomization An example of testing goodness-of-fit hypothesis Consider a sample of observations of the size n = 100 and it contains 10 right censored observations. H 0 : F (t) is from the family of lognormal distributions

d samples Transformation of censored sample to complete sample by means of randomization An example of testing goodness-of-fit hypothesis (continued) Figure: Kaplan-Meier estimate by considered censored sample, Weibull distribution and Lognormal distribution (MLEs are used)

d samples RRN χ 2 test for censored data RRN χ 2 test for censored data In the papers by Nikulin et al.(2010) the χ 2 test with the statistic Y 2 n (ˆθ n ) = Z T ˆV Z has been suggested for random censored data. The limit distribution of statistic Y 2 n (ˆθ n ) under condition of true hypothesis H 0 is χ 2 distribution with r = rank(v ) degrees of freedom.

d samples RRN χ 2 test for censored data Simulation study of statistic distributions Figure: RRN χ 2 test statistic distributions for different censoring degrees when testing the composite hypothesis of goodness-of-fit with the Weibull distribution, n = 100, K = 5

d samples RRN χ 2 test for censored data Simulation study of statistic distributions Figure: RRN χ 2 test statistic distributions for sample sizes n = 100 and n = 500 when testing the composite hypothesis of goodness-of-fit with the Weibull distribution, censoring degree is about 30%, K = 5

d samples RRN χ 2 test for censored data Simulation study of statistic distributions Figure: RRN χ 2 test statistic distributions for different distributions of censoring times when testing the composite hypothesis of goodness-of-fit with the Weibull distribution, censoring degree is about 50%, n = 100, K = 5

d samples RRN χ 2 test for censored data Simulation study of statistic distributions Figure: RRN χ 2 test statistic distributions for different distributions of censoring times when testing the composite hypothesis of goodness-of-fit with the Weibull distribution, censoring degree is about 50%, n = 500, K = 5

d samples RRN χ 2 test for censored data Simulation study of statistic distributions Tests power H 0 : Weibull distribution H 1 : Lognormal distribution The power of Kolmogorov, Cramer-von Mises-Smirnov and Anderson-Darling tests was calculated by completed samples with randomization. Sample size n = 200, α = 0.1 Table: The test power comparison Goodness-of-fit test 10% 20% 30% 40% 50% 60% 70% 80% Kolmogorov test 0.74 0.64 0.54 0.45 0.35 0.25 0.19 0.13 Cramer-von Mises-Smirnov test 0.84 0.74 0.63 0.52 0.39 0.28 0.21 0.14 Anderson-Darling test 0.88 0.78 0.67 0.56 0.44 0.32 0.24 0.16 RRN χ 2 test, K = 3 0.87 0.81 0.76 0.69 0.62 0.56 0.46 0.35 RRN χ 2 test, K = 5 0.90 0.86 0.81 0.75 0.69 0.58 0.47 0.34

d samples RRN χ 2 test for censored data Simulation study of statistic distributions Tests power H 0 : Weibull distribution H 1 : Lognormal distribution

d samples Conclusions The distributions of modified Kolmogorov, Cramer-von Mises-Smirnov and Anderson-Darling test statistics strongly depend on the distribution of censoring times. This fact doesn t enable to recommend using these tests in practice. Randomization procedure enables to obtain a complete sample, for which one can apply the goodness-of-fit tests with statistics (1), (2), (3) for complete data. The distributions of these statistics by completed samples are the same as in the case of originally complete data. RRN χ 2 test has a number of advantages comparing with the considered nonparametric tests.

d samples Conclusions The distributions of modified Kolmogorov, Cramer-von Mises-Smirnov and Anderson-Darling test statistics strongly depend on the distribution of censoring times. This fact doesn t enable to recommend using these tests in practice. Randomization procedure enables to obtain a complete sample, for which one can apply the goodness-of-fit tests with statistics (1), (2), (3) for complete data. The distributions of these statistics by completed samples are the same as in the case of originally complete data. RRN χ 2 test has a number of advantages comparing with the considered nonparametric tests.

d samples Conclusions The distributions of modified Kolmogorov, Cramer-von Mises-Smirnov and Anderson-Darling test statistics strongly depend on the distribution of censoring times. This fact doesn t enable to recommend using these tests in practice. Randomization procedure enables to obtain a complete sample, for which one can apply the goodness-of-fit tests with statistics (1), (2), (3) for complete data. The distributions of these statistics by completed samples are the same as in the case of originally complete data. RRN χ 2 test has a number of advantages comparing with the considered nonparametric tests.

d samples References [1] Anderson, T.W. Asymptotic Theory of Certain Goodness of fit Criteria based on Stochastic Processes / T.W. Anderson, D.A. Darling // The Annals of Mathematical Statistics. - 1952. - Vol. 23, No. 3. - P. 193-212. [2] Nair, V. Plots and tests for goodness of fit with randomly censored data / Nair, V. // Biometrika. - 1981. - Vol. 68. - P. 99-103. [3] Reineke, D. Estimation of Hazard, Density and Survivor Functions for Randomly Censored Data / D. Reineke, J. Crown // Journal of Applied Statistics. - 2004. - Vol. 31, No. 10. - P. 1211-1225. [4] Koziol, J.A. A Cramer-von Mises Statistic for Randomly Censored Data / J.A. Koziol, S.B. Green // Biometrika. - 1976. - Vol. 63, No. 3. - P. 465-474. [5] Lawless, J.F. Statistical model and methods for lifetime data / J.F. Lawless. - New Jersey : Wiley-Interscience, 2003. - 630 p. [6] Hjort, N.L. On Inference in Parametric Survival Data / Hjort, N.L. // International Statistical Review. - 1992. - Vol. 60, No. 3. - P. 355-387. [7] Lemeshko, B.Yu. Distribution models for nonparametric tests for fit in verifying complicated hypotheses and maximum-likelihood estimators. Part 1 / B.Yu. Lemeshko, S.B. Lemeshko // Measurement Techniques. - 2009. - Vol. 52, No. 6. - P.555-565. [8] Lemeshko, B.Yu. Models for statistical distributions in nonparametric fitting tests on composite hypotheses based on maximum-likelihood estimators. Part II / B.Yu. Lemeshko, S.B. Lemeshko // Measurement Techniques. - 2009. - Vol. 52, No. 8. - P.799-812. [9] V. Bagdonavicius, M. Nikulin Chi-square goodness-of-fit test for right censored data.- The International Journal of Applied Mathematics and Statistics (IJAMAS) (accepted for publication). [10] V. Bagdonavicus, J. Kruopis, M. Nikulin Nonparametric Tests for Censored Data. - Wiley-ISTE, 2010.