Parametric Maximum Likelihood Estimation of Cure Fraction Using Interval-Censored Data

Size: px
Start display at page:

Download "Parametric Maximum Likelihood Estimation of Cure Fraction Using Interval-Censored Data"

Transcription

1 Columbia International Publishing Journal of Advanced Computing (2013) 1: doi:107726/jac Research Article Parametric Maximum Likelihood Estimation of Cure Fraction Using Interval-Censored Data Bader Ahmad I Aljawadi 1*, Mohd Rizam A Bakar 1, Noor Akma Ibrahim 2, and Mohamad Al-Omari 2 Received 26 November 2012; Published online 15 December 2012 The author(s) 2012 Published with open access at wwwusciporg Abstract A significant proportion of patients in cancer clinical trials can be cured That is, the symptoms of the disease disappear completely and the disease never recurs In this article, the focus is on estimation of the proportion of patients who are cured The parametric maximum likelihood estimation method was used for estimation of the cure fraction based on application of the bounded cumulative hazard (BCH) model to interval-censored data We ran the analysis using the EM algorithm considering two cases: i) when no covariates were involved in the estimation, and ii) when some covariates were involved This paper shows derivation of the estimation equations for the cure rate parameter followed by a simulation study AMS Subject Classification: 62N02 Keywords: Cure fraction; BCH model; Interval censoring; Covariates; MLE method; EM algorithm 1 Introduction Most of the survival models typically assume that all individuals are susceptible to a predetermined event with sufficient follow-up However, this assumption may be violated by a group of individuals such that not all of them will experience the particular event Such group is usually described in the related literature as non-susceptible or cured Several decades ago, the survival models started to include the cured portion in the analysis, and a new group of models hence has been developed, briefly called cure models and broadly used in analyzing data from cancer and other disease clinical trials *Corresponding Bader_Aljawadi@yahoocom 1* Institute for Mathematical Research, Universiti Putra Malaysia 2 Department of Mathematics, Universiti Putra Malaysia and Institute for Mathematical Research, Universiti Putra Malaysia 43

2 The first cure model was developed by Boag (1949) and three years later modified by Berkson and Gage (1952) This type of model is known as a mixture cure rate model and can be written as follows: =+1, (1) where and are the survival functions for the entire population and the uncured patients, respectively, and is the cured fraction In this model, the failure time function of the uncured patients can be estimated either parametrically, thus producing parametric survival models, or non-parametrically, hence producing semi-parametric survival models In the parametric group of cure models, we assume a particular distribution for the failure time of uncured patients, such as the exponential Elaborations on the theory and applications of mixture models can be found in Goldman (1984), Farewell (1986), Gamelet et al (1990), Kuk and Chen (1992), Taylor (1995), Peng and Dear (2000), Sy and Taylor (2000), Peng and Carriere (2003), Uddin et al (2006a and 2006b) and Abu Bakaret et al (2009), amongst others Although the mixture model is widely used in survival analysis, Chen et al (1999) discussed some drawbacks in it and proposed an alternative: the Bounded Cumulative Hazard BCH model, which was originally developed by Yakovlev et al in 1993 (Yakovlev et al, 1993) This model has been examined and applied in medical research, such as by Aljawadi et al (2011) and (2012), where it was employed in the estimation of the cure fraction in cancer trials under various censoring types (ie right, left and interval censoring) and the estimation procedure was handled via the parametric and nonparametric techniques However, the covariates were not involved in the analysis either in the parametric or nonparametric approaches under the interval censoring case, which increased the necessity of such research In the BCH model it is assumed that for cancer patients, a number of cancer cells (N), referred to as clonogens, remains active even after the initial treatment and that the cells grow rapidly and replace the normal tissues In addition, this model posits that N follows a Poisson distribution with a mean of θ Thus, the survival function for the BCH model is: =exp Probability to survive beyond the time t, (2) where is the cumulative distribution function such that =1 (Yakovlev et al, 1993; Chen et al, 1999) Based on the survival function given in (12), the cure fraction can be defined as follows: = lim ==0 =lim exp =exp (3) In the BCH model, represents the lifetime of the individual where in common situations this lifetime is either an exact survival time or a censoring time Other situations, however, can occur 44

3 where the individuals follow-up after a pre-fixed time period or make periodic visits for a fixed number of times In this paper, the lifetime,=1,,, is not exactly known, but it is known that it falls within an interval, ie, between the two visit times and One problem associated with the cure rate models is identifiability This arises in parametric and semiparametric models, as investigated by Farwell (1986), Taylor (1995) and Peng and Dear (2000), due to the lack of information at the end of the follow-up period, since a significant proportion of subjects are censored before the end of the follow-up period A simple working solution in these models is to assume that those patients who have censored times greater than the last uncensored time are cured, which is quite an arbitrary decision and lacks justification However, since the estimated survival function under the BCH model will approach zero at the largest uncensored time, these patients are treated as cured Therefore, decreasing the number of visiting times in case of interval survival data or increasing the length of the follow-up intervals might cause ambiguity in the real proportion of the cured patients and make the estimation of the true cure fraction a challenge This issue is revealed in the simulation study Furthermore, cure fraction estimates can be quite sensitive to the choice of latency distributions (distribution of the survival times for uncured patients) The cure fraction estimates from the model with generalized gamma distribution is found to be quite robust (Yu et al, 2004) Therefore, several cautions on the general use of cure models are advised 2 Methodology The focus of this paper was on parametric analysis of the estimation of the cure rate by means of the BCH model using interval-censored data However, the analysis was applied considering two scenarios; one with covariates and one without them 21 Data Without Covariates Suppose that T is a random variable with a probability density function ;, to be estimated and that,,, is a random sample of size n, then the joint probability density function is given by,,, ;= ; (4) In the parametric maximum likelihood method the cumulative distribution function and the probability density function For the entire population are known Given that is unknown, let be an indicator of censoring having a value of zero if is the censoring time and one if otherwise; be an indicator of the cure status of the patient, namely, is zero if the patient is cured and one otherwise; and =1,2,, If =1, then =1 However, if =0, will not be observed and it can be either one or zero We assume throughout the paper that the censoring is independent of failure times Given and (ie, the complete data are available), then the complete log-likelihood function is: =log 1 1, (5) 45

4 where and are the probability density function and the survival function for the uncured patients, respectively Many of the common distributions can be used in parametric statistical inference with survival data, including the exponential distribution, which is the most commonly used distributions due to its lack of memory property and constant failure time (Jeevanand et al, 2008) It has the survival function = for 0, and therefore the probability density function = In the case of interval-censored data, this function assumes the formula = = (Klein and Moeschberger, 2003) Thus, for left censoring, the formula becomes =0 =1 since 0=1, whereas for right censoring the formula is = = since =0 Regarding uncensored individuals, is not observed and the only information available is that it falls in an interval, thus we will estimate it by means of the mid-point of the observed intervals As a result the log-likelihood function can be obtained from: =log 1 1 =log log1 + 1 log (6) Note that the formula of the log-likelihood in equation (6) is valid only for data in which no tied events occur, ie, for data where no two or more events occur simultaneously Theoretically, a tied event may occur if the event time scale is discrete and/or if the event time is grouped into intervals In such a case, an alternative formula for the likelihood function is needed to handle this event However, discussion of such a formula is beyond the scope of this study but could be considered in subsequent works The solutions of =0 and =0 are the desired estimates of and, such that 1 1 =0 or =log And +1 (7) + 1 =0 (8) As the cure status ( ) is not fully observed, we need to implement the EM algorithm to estimate the desired parameters Before implementation of the EM algorithm, we have to address the main concern related to the EM algorithm, which is its slow convergence rate upon poor choice of initial values The rate of convergence depends on the proportion of missing information in the observed data If the portion of the missing data is large, then this algorithm can be quite slow However, in this work it was not too difficult to assign the proper initial values for the parameters and hence accelerate the 46

5 convergence of the EM algorithm since we utilized simulated data to assess the efficiency of the proposed methods Therefore, the initial values could be expected based on the corresponding censored rates used in each generated data set For example, if we were interested in a censoring rate equal to 20%, then the initial value of the parameter would be set close to log 02 since the cure fraction is defined as =exp For the other parameter, however, it was possible to pick some random values In real settings, on the other hand, the initial values of the parameters could be assigned to them in the same manner but not as much specifically as they are assigned in simulation Accordingly, in a real-world study the initial values for would be set as to be less than log censoring rate since the censoring rate is known and the cure fraction will not exceed the censoring proportion 211 The EM Algorithm The EM algorithm is a very general iterative algorithm for parameter estimation by maximum likelihood when some of the random variables involved are not observed, ie, are considered missing or incomplete For the interval survival data given in the form (,,, ), the cure indicator is partially missing and hence will be handled via the EM algorithm If the complete-data vector =,,, is observed, it is of interest to compute the maximum likelihood estimate of based on the log-likelihood function defined by equation (5) In the presence of missing cure status for censored individuals, however, only a function of the complete-data vector is observed We will denote this by expressing as,, where denotes the observed but incomplete data and denotes the unobserved data set Thus, the probability density function can be written as follows: ;=, ;= ; ; (9) Where is the joint density of and is the joint density of given the observed data, respectively It follows that the log-likelihood function can be expressed by: ;=log;=log ; ; ] = log ; ] + log ; ] (10) Suppose that we have individuals where for = 1,, both and are observed and are equal to 1, while for = + 1,,, only is observed and is equal to 0, whereas is not observed and therefore needs to be estimated (ie = {, ],,, = 1,, = 1,, } and = {, = + 1,, } Since is not completely observed, cannot be evaluated and hence maximized The EM algorithm attempts to maximize iteratively by replacing it by its conditional expectation given the observed data Thus, the E-step of the EM algorithm calculates the expectation of the log-likelihood function defined by equation (6) for given values of, and, ], such that: 47

6 ] = /, ],,, = 1,, ] + /, ],, = + 1, ], (11) Where /, ],, ] = log + log1, And /, ], ] = 1 + log1 + log This equation demonstrates that the expected value of the log-likelihood function cannot be calculated unless the expressions 1,, and log are computed since the cure status for the + 1 censored individuals is not provided These three expressions are collectively called the sufficient statistics and an estimation of the expected value of each is necessary So, let = = 1 and = log (12) Peng and Dear (2000) defined as the expected value of for the patient who would be uncured provided that the values of the current estimates of and the survival function of the uncured patients,, are given Accordingly, the value of will be used throughout this study to represent the expected value of the cure status,, for the censored individual where = + 1 ], = + 1,, (13) For simplicity, let = 1 = 1 where = + 1,, represent the expected value of 1 for the patient to be cured such that for censored individuals = 0 and 1 = 1 = = (14) Then, the formulae used to calculate the sufficient statistics can be rewritten as: = = 1 48

7 and = 1 log (15) Given the sufficient statistics, the maximum likelihood estimates of the parameters given by equations (7) and (8) can be written as follows: =log + 1 =log + 1 =log + 1 (16) and + 1 = = = 0 (17) In the M-step of the EM algorithm we can solve for,, and where for some initial values assigned to (, ), is the solution of equation (16) and is the numerical solution of equation (17), and we repeat the above sequence of steps until convergence 22 Datasets with Covariates Individual databases of cancer clinical trials contain information that may affect the event time distribution This information can be managed through some baseline variables called covariates, such as gender, type of treatment, year of cancer diagnosis, etc The covariates for individuals in these studies are valuable for assessment of the survival function and therefore the cure fraction So, this part of the current study will focus on estimation of the cure fraction when covariates are involved in the analysis It is assumed that the cure fraction is not to depend on the observed group of the covariates and hence they are modeled via the scale parameter of the latent distribution only, even though it is more realistic to incorporate the covariates via the cure rate parameter too When covariates are given and involved in the analysis, the scale parameter of the exponential distribution () given these covariates can be expressed as: = exp, Where and are the covariates and unknown coefficients vectors, respectively Therefore, the log-likelihood function given in (6) can be obtained from the equation: = exp log1 + 1 log (18) 49

8 Where the solutions of the equations = 0, = 0, and and, such that = 0 are the desired estimates of = = 0 (19) θ=log c α 1 c = exp + 1 = 0 = exp + 1 = 0 (20) where: = exp and = exp = exp and = exp The system of non-linear equations in (20) can be simplified and re-written as: (21) = 1 ] + 1 = 0, (22) = 1 ] + 1 = 0 where = exp Since the cure status ( ) is not fully observed, we will employ the EM algorithm to get an estimate of the interested parameters 221 The EM Algorithm In the presence of covariates, the data vector will take the form (, ],,, ) where = 1 Based on the data partition defined in 211, the only unobserved data is the cure status,, for =

9 For the E-Step, the log-likelihood function given in equation 18 can be obtained from: /, c, t, = /,,,, 1 + /,,,, + 1 (23) such that /,,,, 1 = + log1 exp (24) And /,,,, + 1 = 1 + log1 log (25) Where 1,, and log are the sufficient statistics Let = 1 = and = log Based on the definition of given in 211, the can, in such case, be estimated using the equation: 1 = 1 = = (26) Furthermore, the equations giving sufficient statistics can be re-written as follows: = = 1 and = 1 log (27) In light of this, the maximum likelihood estimate of given by equation (7) can be simplified to: =log + 1 =log + 1 =log + 1 (28) 51

10 For the M-step, a pre-determined initial value for can be used to solve the system of non-linear equations given in (22) with respect to using an appropriate numerical approach such as the Newton-Raphson method This way, the values of, besides the initial value of, can be used to solve for the sufficient statistics, and then to solve equation (28) and get a new value for Then, the whole sequence of steps is repeated until convergence 3 Simulation and Results Simulation studies based on interval-censored data involve further steps in comparison with the other types of censoring In this study, we were quite interested in varying the censoring rates as this enables identification of the pattern that the cure rate estimation will assume So, for the sake of flexibility in finding out how the pattern of the cure rate would progress as the censoring rate increased both slowly and rapidly, we studied several scale parameters of the exponential distribution that can be determined based on the criteria defined below Each data set comprised 100 interval observations and a number of censoring rates depending on the various possible values of Here we ignored the left-censoring case and considered the right censoring one to represent the entire censoring rate, which also represents the real fraction of cured individuals as a special case To control the generation process, we assumed that the true survival time,, followed an exponential distribution Subsequently, the steps used for data generation were as follows: 1 Generation of the covariates from a binomial distribution with pre-assigned probabilities where only two covariates were considered: gender, which was derived from a binomial distribution with a probability of 05; and type of treatment, ie, chemotherapy or radiotherapy, with a probability of 05 too 2 Generation of from an exponential distribution with different scale parameter values such that in the case of covariates the different values of the scale parameter could be assigned on the basis of the link function defined in section 22, = exp, where the initial values of the covariates parameters and were set to 01 and 01, respectively, just to produce a scale parameter within the interval 01, 2] that will produce a number of censoring rates based on the subsequent steps of data generation Consequently, the same values of the scale parameter considered in the case of covariates may be used to generate the lifetimes when the covariates are excluded 3 Creation of a vector for the clinic visits assuming that 20 clinic visits are possible In the case of exponential distribution, the first visit,, was generated from a uniform distribution; 0,01 Afterwards, the next visit,, was generated from the distribution, + 01 The subsequent visit times were generated in the same manner 4 Production of a100 2 empty matrix named bound for each data set The entries of the bound matrix were the intervals endpoints for each individual after comparing the true survival time with the 20 visit times In the case of right censoring, the right end point can 52

11 be assigned in such a way as to be a large number beyond the last visit time The formulae used for end point determination were: For = 1,,100, = 1,,20 0 ] < 1] bound, 1] = ] ] < ] < + 1] 20] ] > 20] 1] ] < 1] bound, 2] = + 1] ] < ] < + 1] ] > 20] (29) (30) 5 Construction of a100 2 empty matrix named status Based on the bound matrix, let: 0 bound, 2] = 100 status, 1] censoring indicator, = 1 otherwise and status, 1] cured indicator, = 0 = 0 1 otherwise In this simulation we were interested in the bias, which can indicate the performance of the methods under evaluation, and in the mean square error (MSE) Bias is the deviation of an estimate from the true quantity while, on the other hand, the MSE provides a useful measure of the overall accuracy as it incorporates the measures of both bias and variability By using standard notation for scalar parameter, bias and MSE can be expressed as follows: Bias = = + (31) where is the maximum likelihood estimate for In light of this, the simulation was carried out using the R statistical software to generate raw and bootstrapped data and a part of the final results are presented by Figure 31 53

12 Bias 14% 13% Without Covariates 12% With Covariates 11% 10% 9% 8% 7% 6% 5% 4% 3% 2% 1% 0% 15% 18% 21% 24% 27% 30% 33% 36% 39% 42% 45% 48% 51% 54% Censoring Rate Fig 31 Censoring rate versus bias based on the two scenarios of parametric estimation of the cure fraction The previous considerations suggest the importance of introducing distinct assumptions to the data sets from the proposed distributions to examine the performance of maximum likelihood estimators derived under the BCH model However, it is quite appropriate to consider some departures from idealized assumptions and to study the behavior of estimators by assuming different conditions and circumstances This can be due to: 1) Changing the number of clinic visits Different numbers of clinic visits lead to different numbers of censored individuals between each two visits, which in turn imply different parametric maximum likelihood estimates Therefore, creation of the vector for the clinic visits was based on the assumption that only 15 clinic visits are possible where the visit times could be generated in the same manner defined above A part of the results are presented by Figure 32 54

13 Bias 28% 26% With covariates 24% 22% Without covariates 20% 18% 16% 14% 12% 10% 8% 6% 4% 2% 0% 35% 38% 41% 44% 47% 50% 53% 56% 59% 62% 65% 68% Censoring Rate Fig 32 Censoring rate versus bias based on 15 clinic visits scenario 2) Estimation of the cure fraction when covariates are involved in the analysis due to different joint regression models A common joint model between the scale parameter of the exponential distribution and covariates has been employed to regularize the data generation In practice, the true relationship between the parameters and covariates is almost unknown Therefore, many other joint models of the covariates can be employed However, as an alternative, we present another joint distribution model of the covariates to describe the manner in which the estimation procedure acts upon various link functions Within this context, the effect of accommodating covariates on the scale parameter of the exponential distribution can be described by the following model: = exp Following the data generation steps defined above, the alternative link function produced limited scale parameter values and hence inflexible censoring rates In particular, accommodation of gender and type of treatment in the proposed joint model yielded scale parameter values which oscillated between zero and 1 (ie 0 < < 1 ) and hence resulted in high censoring rates(greater than 35%) This consequently produced defective cure fractions, ie, cure fractions characterized by high bias and MSE values However, employing this joint model provided intensive computational solutions for the cure fraction due to the inconvenient parameter estimation 55

14 4 Discussion The above figures and results present the performance of parametric estimation of the cure fraction when covariates were included in the analysis and when they were not The bias and MSE values for the various given rates of censoring indicate that the proposed method of cure rate estimation was in both cases more efficient when the censoring rate was low than when it was high and that the estimation started to diverge in the case of heavy censoring rates A large number of censored individuals decreases the equivalent number of subjects exposed (at risk), making the cure rate estimates less reliable than it could be for the same number of subjects but with less censoring Moreover, if there is heavy censoring, the estimated is nothing more than a poor approximation since the sufficient statistics estimates become considerably unacceptable as a result of extra error in the estimation procedure due to the overuse of the probability of the cure estimator defined by Equation 14 Hence, increasing the proportion of censored data will distort the estimated parameters, and vice versa Furthermore, the efficiency of cure rate estimation when the covariates were considered in the analysis was higher than when no covariates were incorporated in the estimation procedure Consequently, it is essential in this situation to develop parametric estimation of the cure fraction with covariates incorporated in the analysis Since the covariates that arise in many studies, especially clinical trials are secondary variables that provide extra information, inclusion of covariates in this study allowed for improvement in the estimates of the parameters of primary interest, especially the cure fraction Moreover, it should be emphasized that the proposed estimation procedure is very sensitive to the follow-up period for the cancer patients involved in the clinical trial An insufficient following up may distort the estimate of the parameters of interest Additionally, the estimated cure fraction may not be as reliable when estimated based on insufficient clinical follow-up data since cancer care involves regular medical checkups that include a review of the patient s medical history and a physical examination, which may help in further prevention of the incidence of cancer and in monitoring those patients who may be at risk However, in short-period trials a high proportion of the patients are missing, ie, they are censored Nonetheless, they are no longer considered at risk and the researcher was not in the position to record the precise time at which the event of concern may occur This situation can be detected easily when overviewing the bias behavior in Figure 32 where the bias pertaining to the very high censoring rates cases exceed the common level, which is an incidence that can occur as a result of short period trials 5 Conclusion We have shown that parametric estimation based on the BCH model can lead to better inferential performance (less bias and lower MSE) of the cure fraction in the interval censoring case when the censoring rate is low than when it is high The work examined two scenarios: one when covariates were not included in the estimation procedures and the other when some covariates were involved Our results demonstrate that cure fraction estimation based on the proposed procedure was more realistic when covariates were involved in the estimation procedures than when they were not The R codes used to realize the computations are available upon request 56

15 Acknowledgment The authors would like to express their sincere thanks to the University Research Board of University Putra Malaysia (UPM) for their generous support of this study References Abu Bakar, M R, Salah, K A, Ibrahim, N A, Haron, K, 2009 Bayesian Approach for Joint Longitudinal and Time-to-Event Data with Survival Fraction Bull MalaysMath Sci Soc 32, Aljawadi, B A, Bakar, M R, Ibrahim, N A, and Midi, H 2011 Parametric estimation of the immunes based on BCH model and exponential distribution using left censored data Journal of Applied Sciences 11(15), Aljawadi, B A, Bakar, M R, Ibrahim, N A, and Midi, H 2011 Parametric estimation of the cure fraction based on BCH model using left censored data with covariates Modern Applied Science Journal5(3), Aljawadi, B A, Bakar, M R, and Ibrahim, N A 2012 Parametric versus nonparametric estimation of the cure fraction using interval censored data Communications in Statistics-Theory and Methods41, Berkson, J, Gage, R P, 1952 Survival curves for cancer patients following treatment Journal of the American Statistical Association 47, Boag, J W, 1949 Maximum likelihood estimates of the proportion of patients cured by cancer therapy Journal of the Royal Statistical Society 11,15-44 Burton, A, Altman, D G, Royston, P, Holder, R L, 2006 The design of simulation studies in medical statistics Statistics in Medicine 25, PMid: Chen, M H, Ibrahim, J G, Sinha, D, 1999 A new Bayesian model for survival data with a surviving fractionjournal of the American Statistical Association 94, Davidson, R, 1997 Bootstrap Confidence Intervals Based on Inverting Hypothesis Tests Unpublished manuscript Farewell, V T, 1986 Mixture models in survival analysis: Are they worth the risk? The Canadian Journal of Statistics 14, Gamel, J W, McLean, I W, Rosenberg, S H, 1990 Proportion cured and mean log survival time as functions of tumor size Statistics in Medicine 9, PMid: Goldman, A I, 1984 Survivorship analysis when cure is a possibility: A Monte Carlo study Statistics in Medicine 3, PMid: Jeevanand, E S, Alice, P M and Hitha, N 2008 Semi-parametric Estimation of PX,Y(X > Y) Economic Quality Control 2, Klein, J P, Moeschberger, M L, 2003 Survival Analysis Techniques for Censored and Truncated Data, Second Edition, Springer, New York Kuk, A Y C, Chen, C H, 1992 A mixture model combining logistic regression with proportional hazards regressionbiometrika 79,

16 Peng, Y, Dear, K B G, 2000 A non-parametic mixture model for cure rate estimation Biometrics 56, PMid: PengY, Carriere K, C, 2002 An Empirical Comparison of Parametric and Semiparametric Cure ModelsBiometrical Journal Sy, J P, Taylor, J M, 2000 Estimation in a Cox proportional hazard cure model Biometrics 54, Uddin, M, Islam, M N, Ibrahim, Q I, 2006aAn Analytical Approach on Cure Rate Estimation Based on Uncensored DataJournal of Applied Sciences 6(3), Uddin, M, Sen A, Noor, M S, Islam, M N, Chowdhury, Z I, 2006bAn Analytical Approach on Non-parametric Estimation of Cure Rate Based on Uncensored DataJournal of Applied Sciences 6, Taylor, J M G, 1995 Semi-parametric estimation in failure time mixture modelsbiostatistics 51, Yakovlev, A Y, Asselain, B, Bardou, V J, Fourquet, A, Hoang, T, Rochefediere, A, and Tsodikov, A D, 1993 A Simple Stochastic Model of Tumor Recurrence and Its Applications to Data on pre-menopausal Breast CancerIn BiometrieetAnalyse de DormeesSpatio Temporelles, 12(Eds B Asselain, M Boniface, C Duby, C Lopez, J PMasson, and JTranchefort)SociétéFrancaise de Biométrie, ENSA Renned, France, Yu, B, Tiwari, R C and Cronin, K Z, 2004Cure fraction estimation from the mixture cure models for grouped survival times Statistics in Medicine 23, PMid:

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Analysis of Cure Rate Survival Data Under Proportional Odds Model

Analysis of Cure Rate Survival Data Under Proportional Odds Model Analysis of Cure Rate Survival Data Under Proportional Odds Model Yu Gu 1,, Debajyoti Sinha 1, and Sudipto Banerjee 2, 1 Department of Statistics, Florida State University, Tallahassee, Florida 32310 5608,

More information

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -

More information

3003 Cure. F. P. Treasure

3003 Cure. F. P. Treasure 3003 Cure F. P. reasure November 8, 2000 Peter reasure / November 8, 2000/ Cure / 3003 1 Cure A Simple Cure Model he Concept of Cure A cure model is a survival model where a fraction of the population

More information

Semi-parametric Inference for Cure Rate Models 1

Semi-parametric Inference for Cure Rate Models 1 Semi-parametric Inference for Cure Rate Models 1 Fotios S. Milienos jointly with N. Balakrishnan, M.V. Koutras and S. Pal University of Toronto, 2015 1 This research is supported by a Marie Curie International

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

The nltm Package. July 24, 2006

The nltm Package. July 24, 2006 The nltm Package July 24, 2006 Version 1.2 Date 2006-07-17 Title Non-linear Transformation Models Author Gilda Garibotti, Alexander Tsodikov Maintainer Gilda Garibotti Depends

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

Adaptive Prediction of Event Times in Clinical Trials

Adaptive Prediction of Event Times in Clinical Trials Adaptive Prediction of Event Times in Clinical Trials Yu Lan Southern Methodist University Advisor: Daniel F. Heitjan May 8, 2017 Yu Lan (SMU) May 8, 2017 1 / 19 Clinical Trial Prediction Event-based trials:

More information

Multi-state Models: An Overview

Multi-state Models: An Overview Multi-state Models: An Overview Andrew Titman Lancaster University 14 April 2016 Overview Introduction to multi-state modelling Examples of applications Continuously observed processes Intermittently observed

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models

Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models 26 March 2014 Overview Continuously observed data Three-state illness-death General robust estimator Interval

More information

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

Bivariate Weibull Distributions Derived From Copula Functions In The Presence Of Cure Fraction And Censored Data

Bivariate Weibull Distributions Derived From Copula Functions In The Presence Of Cure Fraction And Censored Data Journal of Data Science 14(2016), 295-316 Bivariate Weibull Distributions Derived From Copula Functions In The Presence Of Cure Fraction And Censored Data Em ılio A. Coelho-Barros 1*, Jorge Alberto Achcar

More information

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution Semiparametric Mixed Effects Models with Flexible Random Effects Distribution Marie Davidian North Carolina State University davidian@stat.ncsu.edu www.stat.ncsu.edu/ davidian Joint work with A. Tsiatis,

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data Journal of Modern Applied Statistical Methods Volume 13 Issue 2 Article 22 11-2014 Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data Jayanthi Arasan University Putra Malaysia,

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Longitudinal + Reliability = Joint Modeling

Longitudinal + Reliability = Joint Modeling Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Censoring mechanisms

Censoring mechanisms Censoring mechanisms Patrick Breheny September 3 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Fixed vs. random censoring In the previous lecture, we derived the contribution to the likelihood

More information

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution Communications for Statistical Applications and Methods 2016, Vol. 23, No. 6, 517 529 http://dx.doi.org/10.5351/csam.2016.23.6.517 Print ISSN 2287-7843 / Online ISSN 2383-4757 A comparison of inverse transform

More information

Semiparametric Models for Joint Analysis of Longitudinal Data and Counting Processes

Semiparametric Models for Joint Analysis of Longitudinal Data and Counting Processes Semiparametric Models for Joint Analysis of Longitudinal Data and Counting Processes by Se Hee Kim A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Promotion Time Cure Rate Model with Random Effects: an Application to a Multicentre Clinical Trial of Carcinoma

Promotion Time Cure Rate Model with Random Effects: an Application to a Multicentre Clinical Trial of Carcinoma www.srl-journal.org Statistics Research Letters (SRL) Volume Issue, May 013 Promotion Time Cure Rate Model with Random Effects: an Application to a Multicentre Clinical Trial of Carcinoma Diego I. Gallardo

More information

Constrained estimation for binary and survival data

Constrained estimation for binary and survival data Constrained estimation for binary and survival data Jeremy M. G. Taylor Yong Seok Park John D. Kalbfleisch Biostatistics, University of Michigan May, 2010 () Constrained estimation May, 2010 1 / 43 Outline

More information

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University

More information

Parameter Estimation of Power Lomax Distribution Based on Type-II Progressively Hybrid Censoring Scheme

Parameter Estimation of Power Lomax Distribution Based on Type-II Progressively Hybrid Censoring Scheme Applied Mathematical Sciences, Vol. 12, 2018, no. 18, 879-891 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2018.8691 Parameter Estimation of Power Lomax Distribution Based on Type-II Progressively

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

Vertical modeling: analysis of competing risks data with a cure proportion

Vertical modeling: analysis of competing risks data with a cure proportion Vertical modeling: analysis of competing risks data with a cure proportion M A Nicolaie a and JMG Taylor b and C Legrand a a Institute of Statistics, Biostatistics and Actuarial Sciences, Catholic University

More information

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks Y. Xu, D. Scharfstein, P. Mueller, M. Daniels Johns Hopkins, Johns Hopkins, UT-Austin, UF JSM 2018, Vancouver 1 What are semi-competing

More information

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment

More information

CURE MODEL WITH CURRENT STATUS DATA

CURE MODEL WITH CURRENT STATUS DATA Statistica Sinica 19 (2009), 233-249 CURE MODEL WITH CURRENT STATUS DATA Shuangge Ma Yale University Abstract: Current status data arise when only random censoring time and event status at censoring are

More information

Package threg. August 10, 2015

Package threg. August 10, 2015 Package threg August 10, 2015 Title Threshold Regression Version 1.0.3 Date 2015-08-10 Author Tao Xiao Maintainer Tao Xiao Depends R (>= 2.10), survival, Formula Fit a threshold regression

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

FAILURE-TIME WITH DELAYED ONSET

FAILURE-TIME WITH DELAYED ONSET REVSTAT Statistical Journal Volume 13 Number 3 November 2015 227 231 FAILURE-TIME WITH DELAYED ONSET Authors: Man Yu Wong Department of Mathematics Hong Kong University of Science and Technology Hong Kong

More information

Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation

Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation H. Zhang, E. Cutright & T. Giras Center of Rail Safety-Critical Excellence, University of Virginia,

More information

The Design of a Survival Study

The Design of a Survival Study The Design of a Survival Study The design of survival studies are usually based on the logrank test, and sometimes assumes the exponential distribution. As in standard designs, the power depends on The

More information

Models for Multivariate Panel Count Data

Models for Multivariate Panel Count Data Semiparametric Models for Multivariate Panel Count Data KyungMann Kim University of Wisconsin-Madison kmkim@biostat.wisc.edu 2 April 2015 Outline 1 Introduction 2 3 4 Panel Count Data Motivation Previous

More information

Modelling geoadditive survival data

Modelling geoadditive survival data Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model

More information

Statistical Data Mining and Machine Learning Hilary Term 2016

Statistical Data Mining and Machine Learning Hilary Term 2016 Statistical Data Mining and Machine Learning Hilary Term 2016 Dino Sejdinovic Department of Statistics Oxford Slides and other materials available at: http://www.stats.ox.ac.uk/~sejdinov/sdmml Naïve Bayes

More information

Simple Step-Stress Models with a Cure Fraction

Simple Step-Stress Models with a Cure Fraction Simple Step-Stress Models with a Cure Fraction Nandini Kannan & Debasis Kundu Abstract In this article, we consider models for time-to-event data obtained from experiments in which stress levels are altered

More information

Frailty Modeling for clustered survival data: a simulation study

Frailty Modeling for clustered survival data: a simulation study Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University

More information

Step-Stress Models and Associated Inference

Step-Stress Models and Associated Inference Department of Mathematics & Statistics Indian Institute of Technology Kanpur August 19, 2014 Outline Accelerated Life Test 1 Accelerated Life Test 2 3 4 5 6 7 Outline Accelerated Life Test 1 Accelerated

More information

Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA

Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA P. VENKATESAN* K. VISWANATHAN + R. PRABHAKAR* * Tuberculosis Research Centre,

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

Statistical Modeling and Analysis for Survival Data with a Cure Fraction

Statistical Modeling and Analysis for Survival Data with a Cure Fraction Statistical Modeling and Analysis for Survival Data with a Cure Fraction by Jianfeng Xu A thesis submitted to the Department of Mathematics and Statistics in conformity with the requirements for the degree

More information

The influence of categorising survival time on parameter estimates in a Cox model

The influence of categorising survival time on parameter estimates in a Cox model The influence of categorising survival time on parameter estimates in a Cox model Anika Buchholz 1,2, Willi Sauerbrei 2, Patrick Royston 3 1 Freiburger Zentrum für Datenanalyse und Modellbildung, Albert-Ludwigs-Universität

More information

Advanced Methodology Developments in Mixture Cure Models

Advanced Methodology Developments in Mixture Cure Models University of South Carolina Scholar Commons Theses and Dissertations 1-1-2013 Advanced Methodology Developments in Mixture Cure Models Chao Cai University of South Carolina Follow this and additional

More information

Semiparametric Generalized Linear Models

Semiparametric Generalized Linear Models Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student

More information

MODELING COUNT DATA Joseph M. Hilbe

MODELING COUNT DATA Joseph M. Hilbe MODELING COUNT DATA Joseph M. Hilbe Arizona State University Count models are a subset of discrete response regression models. Count data are distributed as non-negative integers, are intrinsically heteroskedastic,

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

The Accelerated Failure Time Model Under Biased. Sampling

The Accelerated Failure Time Model Under Biased. Sampling The Accelerated Failure Time Model Under Biased Sampling Micha Mandel and Ya akov Ritov Department of Statistics, The Hebrew University of Jerusalem, Israel July 13, 2009 Abstract Chen (2009, Biometrics)

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Statistical Methods for Alzheimer s Disease Studies

Statistical Methods for Alzheimer s Disease Studies Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations

More information

Dynamic Prediction of Disease Progression Using Longitudinal Biomarker Data

Dynamic Prediction of Disease Progression Using Longitudinal Biomarker Data Dynamic Prediction of Disease Progression Using Longitudinal Biomarker Data Xuelin Huang Department of Biostatistics M. D. Anderson Cancer Center The University of Texas Joint Work with Jing Ning, Sangbum

More information

Longitudinal breast density as a marker of breast cancer risk

Longitudinal breast density as a marker of breast cancer risk Longitudinal breast density as a marker of breast cancer risk C. Armero (1), M. Rué (2), A. Forte (1), C. Forné (2), H. Perpiñán (1), M. Baré (3), and G. Gómez (4) (1) BIOstatnet and Universitat de València,

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

Review Article Analysis of Longitudinal and Survival Data: Joint Modeling, Inference Methods, and Issues

Review Article Analysis of Longitudinal and Survival Data: Joint Modeling, Inference Methods, and Issues Journal of Probability and Statistics Volume 2012, Article ID 640153, 17 pages doi:10.1155/2012/640153 Review Article Analysis of Longitudinal and Survival Data: Joint Modeling, Inference Methods, and

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Physician Performance Assessment / Spatial Inference of Pollutant Concentrations

Physician Performance Assessment / Spatial Inference of Pollutant Concentrations Physician Performance Assessment / Spatial Inference of Pollutant Concentrations Dawn Woodard Operations Research & Information Engineering Cornell University Johns Hopkins Dept. of Biostatistics, April

More information

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), 1605 1620 Comparing of some estimation methods for parameters of the Marshall-Olkin generalized exponential distribution under progressive

More information

A general mixed model approach for spatio-temporal regression data

A general mixed model approach for spatio-temporal regression data A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression

More information

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modeling for small area health data Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modelling for Small Area

More information

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Pertanika J. Sci. & Technol. 17 (2): 313 323 (2009) ISSN: 0128-7680 Universiti Putra Malaysia Press Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Jayanthi Arasan Department

More information

Point and Interval Estimation for Gaussian Distribution, Based on Progressively Type-II Censored Samples

Point and Interval Estimation for Gaussian Distribution, Based on Progressively Type-II Censored Samples 90 IEEE TRANSACTIONS ON RELIABILITY, VOL. 52, NO. 1, MARCH 2003 Point and Interval Estimation for Gaussian Distribution, Based on Progressively Type-II Censored Samples N. Balakrishnan, N. Kannan, C. T.

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions

Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS040) p.4828 Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

1 The problem of survival analysis

1 The problem of survival analysis 1 The problem of survival analysis Survival analysis concerns analyzing the time to the occurrence of an event. For instance, we have a dataset in which the times are 1, 5, 9, 20, and 22. Perhaps those

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

CURE FRACTION MODELS USING MIXTURE AND NON-MIXTURE MODELS. 1. Introduction

CURE FRACTION MODELS USING MIXTURE AND NON-MIXTURE MODELS. 1. Introduction ØÑ Å Ø Ñ Ø Ð ÈÙ Ð Ø ÓÒ DOI: 10.2478/v10127-012-0001-4 Tatra Mt. Math. Publ. 51 (2012), 1 9 CURE FRACTION MODELS USING MIXTURE AND NON-MIXTURE MODELS Jorge A. Achcar Emílio A. Coelho-Barros Josmar Mazucheli

More information

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline: CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity Outline: 1. NIEHS Uterine Fibroid Study Design of Study Scientific Questions Difficulties 2.

More information

Multistate models and recurrent event models

Multistate models and recurrent event models and recurrent event models Patrick Breheny December 6 Patrick Breheny University of Iowa Survival Data Analysis (BIOS:7210) 1 / 22 Introduction In this final lecture, we will briefly look at two other

More information

A brief note on the simulation of survival data with a desired percentage of right-censored datas

A brief note on the simulation of survival data with a desired percentage of right-censored datas Journal of Data Science 14(2016), 701-712 A brief note on the simulation of survival data with a desired percentage of right-censored datas Edson Zangiacomi Martinez 1*, Jorge Alberto Achcar 1, Marcos

More information

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Part III Measures of Classification Accuracy for the Prediction of Survival Times Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010 Session Three Outline Examples

More information

Multistate models and recurrent event models

Multistate models and recurrent event models Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information