Nonparametric Estimation for Semi-Markov Processes Based on K-Sample Paths with Application to Reliability

Similar documents
Efficient estimation for semiparametric semi-markov processes

Confidence intervals for risk indicators in semi-markov models: an application to wind energy production

Chapter 2 SOME ANALYTICAL TOOLS USED IN THE THESIS

Asymptotic Properties of Kaplan-Meier Estimator. for Censored Dependent Data. Zongwu Cai. Department of Mathematics

Journal de la Société Française de Statistique Vol. 153 No. 3 (2014) Interval reliability for semi-markov systems in discrete time

Multistate Modeling and Applications

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

Parametric and Non Homogeneous semi-markov Process for HIV Control

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

Philosophy and Features of the mstate package

Exponential martingales: uniform integrability results and applications to point processes

Efficiency of Profile/Partial Likelihood in the Cox Model

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes

Product-limit estimators of the survival function with left or right censored data

3.1 Markov Renewal Chains and Semi-Markov Chains

[1] Thavaneswaran, A.; Heyde, C. C. A note on filtering for long memory processes. Stable non-gaussian models in finance and econometrics. Math.

The Transition Probability Function P ij (t)

1 Glivenko-Cantelli type theorems

Introduction CONTRIBUTED RESEARCH ARTICLE 1. The R Journal Vol. XX/YY, AAAA 20ZZ ISSN

Statistical Analysis of Competing Risks With Missing Causes of Failure

STAT Sample Problem: General Asymptotic Results

Density estimators for the convolution of discrete and continuous random variables

BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1. By Per Aslak Mykland University of Chicago

EXPLICIT DISTRIBUTIONAL RESULTS IN PATTERN FORMATION. By V. T. Stefanov 1 and A. G. Pakes University of Western Australia

Multi-state models: prediction

On the estimation of the entropy rate of finite Markov chains

Lecture 3. Truncation, length-bias and prevalence sampling

Multi-state Models: An Overview

Central limit theorems for ergodic continuous-time Markov chains with applications to single birth processes

e 4β e 4β + e β ˆβ =0.765

Lecture 10: Semi-Markov Type Processes

CHAPTER 1 A MAINTENANCE MODEL FOR COMPONENTS EXPOSED TO SEVERAL FAILURE MECHANISMS AND IMPERFECT REPAIR

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

Institute of Mathematical Statistics LECTURE NOTES MONOGRAPH SERIES PLUG-IN ESTIMATORS IN SEMIPARAMETRIC STOCHASTIC PROCESS MODELS

An Overview of Methods for Applying Semi-Markov Processes in Biostatistics.

P i [B k ] = lim. n=1 p(n) ii <. n=1. V i :=

STAT STOCHASTIC PROCESSES. Contents

Karaliopoulou Margarita 1. Introduction

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Introduction to Reliability Theory (part 2)

and Comparison with NPMLE

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

Frailty Models and Copulas: Similarities and Differences

A multistate additive relative survival semi-markov model

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS

Statistics 150: Spring 2007

L n = l n (π n ) = length of a longest increasing subsequence of π n.

Citation Osaka Journal of Mathematics. 41(4)

Statistical Inference and Methods

University of California, Berkeley

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

Asymptotic normality of the L k -error of the Grenander estimator

Simulation of truncated normal variables. Christian P. Robert LSTA, Université Pierre et Marie Curie, Paris

Understanding product integration. A talk about teaching survival analysis.

Diffusion Approximation of Branching Processes

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin

Introduction to repairable systems STK4400 Spring 2011

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals

Product-limit estimators of the gap time distribution of a renewal process under different sampling patterns

Likelihood Construction, Inference for Parametric Survival Distributions

Convergence Rates for Renewal Sequences

STAT 331. Martingale Central Limit Theorem and Related Results

Approximate Self Consistency for Middle-Censored Data

EXACT CONVERGENCE RATE AND LEADING TERM IN THE CENTRAL LIMIT THEOREM FOR U-STATISTICS

Asymptotic properties of the maximum likelihood estimator for a ballistic random walk in a random environment

An application of hidden Markov models to asset allocation problems

1 Introduction This work follows a paper by P. Shields [1] concerned with a problem of a relation between the entropy rate of a nite-valued stationary

Asymptotics for posterior hazards

Exact Linear Likelihood Inference for Laplace

Variance Estimation in the Central Limit Theorem for Markov chains

Scan Statistics: Theory and Applications

HEAVY-TRAFFIC EXTREME-VALUE LIMITS FOR QUEUES

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Nonhomogenous Poisson process application to modelling accidents number at Baltic Sea waters and ports

Invariant HPD credible sets and MAP estimators

Discrete-Time Renewal Processes

Consistency of the maximum likelihood estimator for general hidden Markov models

Lecture 5 Models and methods for recurrent event data

On the Midpoint Method for Solving Generalized Equations

Reward Algorithms for Semi-Markov Processes

Step-Stress Models and Associated Inference

Multivariate Survival Data With Censoring.

Kesten s power law for difference equations with coefficients induced by chains of infinite order

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Multivariate Survival Analysis

SMSTC (2007/08) Probability.

Operations Research Letters. Instability of FIFO in a simple queueing system with arbitrarily low loads

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

INEQUALITY FOR VARIANCES OF THE DISCOUNTED RE- WARDS

Stochastic process. X, a series of random variables indexed by t

Chapter 2 Inference on Mean Residual Life-Overview

Geometric ρ-mixing property of the interarrival times of a stationary Markovian Arrival Process

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

On the Goodness-of-Fit Tests for Some Continuous Time Processes

LIST OF MATHEMATICAL PAPERS

Transcription:

Nonparametric Estimation for Semi-Markov Processes Based on K-Sample Paths with Application to Reliability Nikolaos Limnios 1 and Brahim Ouhbi 2 1 Laboratoire de Mathématiques Appliquées, Université de Technologie de Compiègne, B.P. 2529, 625 Compiègne Cedex, France 2 Ecole Nationale Supérieure d Arts et Métiers Marjane II, Meknès Ismailia, Béni M Hamed, Meknès, Maroc, Abstract. The problem concerned here is the estimation of ergodic finite semi- Markov processes from data observed by considering K independent censored sample paths with application in the reliability. 1 Introduction Semi-Markov modeling, as a generalization of Markov modeling, is a an active area in research. See, e.g., [Alvarez, 25]-[Voelkel and Cronwley, 1984]. In our previous work [Ouhbi and Limnios, 1999], we have considered one trajectory in the time interval [, T], and given the estimators and their asymptotic properties, as T. In the present work, we consider K trajectories in the time interval [, T], generated by K independent semi-markov processes having the same semi-markov kernel Q and initial distribution α. We obtain asymptotic properties of the estimators when K. In this case the time T is finite and fixed. This type of observation can be viewed as a generalization of the fixed (or type I) censoring of a single failure time. Our method, as in our previous works [Ouhbi and Limnios, 1999, Ouhbi and Limnios, 23, Ouhbi and Limnios, 21], consists in obtaining estimators of the semi-markov kernel, by using a maximum likelihood estimator (MLE) of the hazard rate function of transitions between states, and then considering estimators of other quantities, as the semi-markov transition function, Markov renewal function, and reliability functions as statistical functionals of the semi-markov kernel via analytic explicit formula. 2 Estimation of the hazard rate function of transitions We will consider in this paper a semi-markov process with a finite state space, E = {1, 2,..., s} say, with irreducible embedded Markov chain and finite sojourn time in all states [Limnios and Oprişan, 21]. In this section, we will derive and study the maximum likelihood estimator of the hazard rate functions of piecewise constant type estimator (PEE).

162 Limnios and Ouhbi Let us suppose that the semi-markov kernel Q is absolutely continuous with respect to the Lebesgue measure on R + and denote by q its Radon-Nikodym derivative, that is, for any i, j E, Q ij(dt) dt =: q ij(t). (1) So, we can write also q ij(t) = P(i, j)f ij(t), where f ij is the density function of the distribution F ij. For any i and j in E, let us define the hazard rate function of transition distributions between states, λ ij(t), t, of a semi-markov kernel by λ ij(t) = ( qij (t) 1 H i (t) if P(i, j) > and H i(t) < 1, otherwise. (2) Let us also define the cumulative hazard rate from state i to state j at time t by Λ ij(t) = R t λij(u)du and the total cumulative hazard rate of state i at time t by Λ i(t) = Σ j EΛ ij(t). We have also Q ij(t) = Z t exp[ Λ i(u)]λ ij(u)du. (3) Let us consider now a family of K independent E-valued Markov renewal processes (J r n, S r n, n ), 1 r K, defined by the same semi-markov kernel Q, and the initial distribution α, that is, for any r, 1 r K, Q ij(t) := P(J r n+1 = j, S r n+1 S r n t J r n = i), i, j E, t R +, n N, α(i) = P(J r = i), i E. For any r, let us denote by N r i (t),n r ij(t),n r (t),... the corresponding quantities N i(t), N ij(t), N(t),..., and define further N i(t, K) := Ni r (t), N ij(t, K) := Nij(t), r N(t) := N r (t). (4) If t = T fixed, then we will note simply N i, N ij,... The maximum likelihood estimator of the hazard rate functions will be based upon the observation of the above K independent MRP {(J r, S r ) = [(J r n, S r n) n ], 1 r K}. We assume hereafter that we observe each MRPs over the period of time [, T] for some finite and fixed T. A sample or history for the r-th MRP is given by H r (K) = (J r,j r 1,..., J r N r (T), r 1, r 2,..., r N r (T), U r T), (5) where U r T = T S r N r (T) is the backward recurrence time. The log-likelihood function associated to (H r (T),1 r K) is: l(k)=log L(K)= n Nr (T) o [log λ J r l 1,J l r(r l ) Λ J r l 1 (l r )] Λ J r N r (T) (UT) r. (6) l=1 In the sequel of this paper, we will approximate the hazard rate function λ ij(t) by the piecewise constant function λ ij(t) defined by λ ij(t) = λ ij(v k ) = λ ijk R +

Nonparametric Estimation for Semi-Markov Processes and Reliability 163 for t (v k 1, v k ] = I k, 1 k M, where (v k ) k M is a regular subdivision of [, T], that is, v k = k, k M, M = M(K), with step := T/M, such that, as K,, and K. Hence, λ ij(t) = M λ ijk 1 (vk 1,v k ](t), (7) k=1 where 1 (vk 1,v k ](t) is equal to 1 if t (v k 1, v k ], and otherwise. We get where d ijk = N r (T) l=1 l(k) = i,j E k=1 M (d ijk log λ ijk λ ijk ν ik ), (8) 1 {J r l 1 =i,j r l =j,r l I k} is the number of transitions from state i to state j for which the observed sojourn time in state i belongs to I k, and ν ik is the trace of the sojourn time in state i on the interval time I k, given for N(T) 1. The r.v. ν ik can be represented by the sum of two r.v. as follows ν ik := ν 1 ik + ν 2 ik, where ν 1 ik is the trace of the sojourn time on the interval I k, of the sojourn times in state i, and ν 2 ik is the trace of the cumulated censored time T greater than v k, in state i. So, the maximum likelihood estimator b λ ijk of λ ijk is given by: and bλ ijk = j dijk /ν ik if ν ik > otherwise. Thus, the estimator b λ ij(t, K) of λ ij(t) is then given by Let us also define bλ i(t, K) = j E Z t bλ ij(t, K) = M bλ ijk 1 (vk 1,v k ](t). (9) k=1 bλ ij(u, K)du, ν l i(t) = b Λik = b Λ i(v k, K) = j E M νik1 l (vk 1,v k ](t), l = 1, 2. k=1 3 Maximum likelihood and empirical estimators of the semi-markov kernel k bλ ijl, (1) Let us define estimators of the semi-markov kernel by putting estimators (9) to (1), as follows bq ij(t,k) := e Λ b ikλijk b. {k: v k t} l=1

164 Limnios and Ouhbi Consider now the empirical kernel function defined by eq ij(t, K) := 1 N i 1 {J r l 1 =i,j l r=j,r l t}, (11) K N r l=1 and the empirical density kernel given by: Q eq ij(t, K) = e ij(v k, K) Q e ij(v k 1, K), if t I k. Define also the estimator e H i(t,k) by eh i(t,k) := j E eq ij(t, K). (12) We define a function G i(, K), for t I k, 1 k M, by G i(t,k) := where n N r l=1 r l v k 1 N i Now, let us write estimator (9), as follows bλ ij(t, K) = 1 {J l 1 r =i,r l I k, l r t} + Ur T v o k 1 N 1 {J i N r r (T) =i,ur T I k}. eq ij(v k, K) 1 { e H i(v k, K) G i(v k, K)} + h r i (t, K), if t I k, h r i (t,k) := ν2 ik N i = 1 N i 1 {J r N r (T) =i,u T r >v k}. In order to obtain a consistent estimator, we will neglect the term h r i (t, K) from the denominator of estimator b λ ij(t, K), and obtain a new modified estimator denoted by b λ ij(t, K). That is, bλ ij(t, K) = eq ij(v k, K) 1 { eh i(v k, K) G i(v k, K)}, if t I k. (13) Denote the corresponding cumulative hazard rates estimator by b Λ i (t), and b Λ ij(t). Lemma 1 The estimator b λ ij(t, K), is a consistent estimator of λ ij(t), as K. Since h r i (t, K) converges to a positive quantity, it is clear the estimator b λ ij(t, K) is not consistent. In the sequel of this paper, we will consider only the estimator b λ ij(t,k). So, the MLE b Q ij(t, K) in (11) is obtained by using this estimator. In the remaining of this section we will study the asymptotic properties of the semi-markov kernel estimator given by (11). Theorem 1 The empirical estimator of the semi-markov kernel is uniformly strongly consistent, in the sense that, as K, sup eq ij(t,k) Q ij(t) a.s.. max i,j t [,T]

Nonparametric Estimation for Semi-Markov Processes and Reliability 165 We will prove now that the semi-markov kernel estimator, obtained from modified PEE of the hazard rate function b λ ij(t, K), is asymptotically uniformly equivalent to the empirical estimator e Q ij(t, K). Lemma 2 Let i and j be any two fixed states. Then we have, for any t [, T], bq ij(t, K) e Q ij(t, K) = O(K 1 ), as K. From the previous lemma, we conclude that the estimator of the semi-markov kernel is asymptotically uniformly a.s. equivalent to the empirical estimator of the semi-markov kernel for which we will prove the uniform strong consistency and derive a central limit theorem. Corollary 1 The estimator of the semi-markov kernel Qij(t, b K) is uniformly strongly consistent, that is, when K tends to infinity, sup bq ij(t,k) Q ij(t) a.s.. max i,j t [,T] Theorem 2 For any i, j E and t [, T] fixed, K 1/2 [ Q b ij(t, K) Q ij(t)] converges in distribution, as K, to a zero mean normal random variable with variance Q ij(t)(1 Q ij(t))[(αψ)(t)1]. 4 The estimator of the reliability function and its asymptotic properties After having outlined the problem of estimating the semi-markov transition matrix, it is appropriate to give some concrete applications of these processes as models of evolution of the reliability function of some system. Let the state space, E, be partitioned into two sets, U = {1,..., r} the patient is in good health and D = {r + 1,..., s} the patient is ill due to some causes or the component is failed and under repair. Reliability models whose state space is partitioned in the above manner will be considered here. As indicated above, it is of interest to estimate the distribution function of the waiting time to hit down states (failure). We focus on the estimation of the reliability function for a semi-markov process which describes the stochastic evolution of system. The general definition of the reliability function in the case of semi-markov processes is R(t) = P(Z u U, The reliability function R(t) is given by: u t). R(t) = i U α(i)r i(t), (14) where R i(t) is the conditional reliability function, that the hitting time to D, starting from a state i U, is greater than the time t. It is easy to show, by a renewal argument, that R i(t) satisfies the following Markov renewal equation R i(t) i U Z t R j(t u)q ij(du) = 1 H i(t), i U.

166 Limnios and Ouhbi The solution of this MRE, see Section 2, together with (14), in matrix form, gives R(t) = α (I Q (t)) ( 1) (I H (t))1, (15) where 1 = (1,..., 1) is an r-dimensional column vector. Index means restriction for matrices on U U, and for vectors on U. We will give an estimator of the reliability function of semi-markov processes and prove its uniform strong consistency and weak convergence properties as K. Let bq be the modified MLE of PEE type of the transition probability of the semi-markov kernel Q. Then we propose the following estimator for the reliability function br(t,k) = bα (I b Q (t, K)) ( 1) (I b H (t, K))1, (16) and we will prove now its uniform strong consistency and central limit theorems. Theorem 3 The estimator of the the reliability function of the semi-markov process is uniformly strongly consistent in the sense that, sup br(t, K) R(t), K. t [,T] a.s. Set B ij(t) := n U α(n)b nijk (1 H k )(t). k U Theorem 4 For any fixed t [, T], the r.v. K 1/2 [ R(t,K) b R(t)] converges in distribution to a zero mean normal random variable with variance σs(t) 2 := µ ii{[b ij (αψ) i] 2 Q ij(t) [(B ij (αψ) i) Q ij(t)] 2 }. i U j U 5 Numerical Application In this section we present a numerical example for a three state semi-markov process for which we will consider K = 5 censored trajectories. The time interval is [, T], with T = 1. The conditional transition functions F ij(t) are the following F 12(t), and F 31(t) are exponential with parameters respectively.1 and.2, and F 21(t), F 23(t) are Weibull with parameters respectively (.3, 2), and (.1, 2) (scale and shape parameter). The other functions are identically. The transition probabilities P(2, 1) and P(2, 3) are: P(2,1) = 1 P(2,3) = Z [1 F 23(t)]dF 23(t). The results obtained here are illustrated in figure 1. These results concern the reliability function.

Nonparametric Estimation for Semi-Markov Processes and Reliability 167 1 + : Estimation ; : Theoritical.9.8.7.6 Reliability.5.4.3.2.1 5 1 15 2 25 3 35 4 Time Fig.1. Reliability estimation References [Alvarez, 25]E.E.E. Alvarez (25). Smothed nonparametric estimation in window censored semi-markov processes, J. Statist. Plann. Infer., 131, 29 229. [Andersen et al., 1993]P.K. Andersen, O. Borgan, R.D. Gill, N. Keiding (1993). Statistical Models Based on Counting Processes, Springer, N.Y. [Dabrowska et al., 1994]D. Dabrowska, G. Horowitz, M. Sun (1994). Cox regression in a Markov renewal model: an application to the analysis of bone marrow transplant data. J. Amer. Statist. Ass., 89, 867-877. [Gill, 198]R.D. Gill (198). Nonparametric estimation based on censored observations of Markov renewal process. Z. Wahrsch. verw. Gebiete. 53, 97 116. [Greenwood and Wefelmeyer, 1996]P. E. Greenwood, W. Wefelmeyer (1996). Empirical estimators for semi-markov processes. Math. Methods Statist. 5(3), 299 315. [Lagakos et al., 1978]S.W. Lagakos, C.J. Sommer, M. Zelen (1978). Semi-Markov models for partially censored data, Biometrika, 65(2), 311 317. [Limnios, 24]N. Limnios (24). A functional central limit theorem for the empirical estimator of a semi-markov kernel, J. Nonparametric Statist., 16(1-2), pp 13-18. [Limnios and Oprişan, 21]N. Limnios, G. Oprişan (21). Semi-Markov Processes and Reliability, Birkhäuser, Boston. [Limnios and Ouhbi, 23]N. Limnios, B. Ouhbi, Empirical estimators of reliability and related functions for semi-markov systems, In Mathematical and

168 Limnios and Ouhbi Statistical Methods in Reliability, B. Lindqvist, K. Doksum (Eds.), World Scientific, 23. [Ouhbi and Limnios, 1996]B. Ouhbi, N. Limnios (1996). Non-parametric estimation for semi-markov kernels with application to reliability analysis, Appl. Stoch. Models Data Anal., 12, 29 22. [Ouhbi and Limnios, 1999]B. Ouhbi, N. Limnios (1999). Non-parametric estimation for semi-markov processes based on their hazard rate. Statist. Infer. Stoch. Processes, 2(2), 151 173. [Ouhbi and Limnios, 21]B. Ouhbi, N. Limnios (21). The rate of occurrence of failures for semi-markov processes and estimation. Statist. Probab. Lett., 59(3), 245 255. [Ouhbi and Limnios, 23]B. Ouhbi, N. Limnios (23). Nonparametric reliability estimation of semi-markov processes. J. Statist. Plann. Infer., 19(1/2), 155 165. [Phelan, 199]M.J.Phelan (199). Estimating the transition probabilities from censored Markov renewal processes. Statist. Probab. Letter., 1, pp 43 47. [Pyke, 1961]R. Pyke (1961). Markov renewal processes: definitions and preliminary properties, Ann. Math. Statist., 32, 1231 1241. [Ruiz-Castro and Pérez-Ocon, 24]J. E. Ruiz-Castro, R. Pérez-Ocon (24). A semi-markov model in biomedical studies, Commun. Stat. - Theor. Methods, 33(3). [Voelkel and Cronwley, 1984]J.G. Voelkel, J. Cronwley (1984). Nonparametric inference for a class of semi-markov processes with censored observations, Ann. Statist., 12(1), pp 142 16.