Fahrmeir: Discrete failure time models
|
|
- Ronald Underwood
- 6 years ago
- Views:
Transcription
1 Fahrmeir: Discrete failure time models Sonderforschungsbereich 386, Paper 91 (1997) Online unter: Projektpartner
2 Discrete failure time models Ludwig Fahrmeir, Universitat Munchen Introduction Most methods for analyzing failure time or event history data are based on time as a continuously measured variate. A basic assumption for large parts of theory is that failure times are untied, see Andersen et al. [2]. In practice, there is always some smallest time unit, so that ties can occur. A moderate number of ties, while banned in theory, can be treated by appropriate modications. If many ties occur, e.g. due to grouping in larger time units or intervals, or if time is truly discrete, then discrete survival or failure time models are more consistent with the data. Such situations arise in medical work when patients are followed up at xed intervals like months, in certain biostatistical problems, for example human fertility studies and time to pregnancy (Scheike and Jensen [19]), or in labor market studies where duration of unemployment is measured in weeks, at best, or in months. We review parametric models and outline recent nonparametric approaches. More details, in particular for parametric models, are given e.g. in Fahrmeir and Tutz [11], ch. 9, and further references cited there. Basic concepts Let time be divided into intervals [a 0 a 1 ),[a 1 a 2 ),:::,[a q;1 a q ),[a q 1). Usually a 0 = 0 is assumed, and a q denotes the nal follow up. Identifying the discrete time index t with interval [a t;1 a t ), a discrete failure time T is considered, where T = t denotes failure within interval t =[a t;1 a t ). The basic quantity characterizing T is the discrete hazard function (t) =Pr(T = tjt t) t =1 ::: q (0.1) which is the conditional probability for the risk of failure in interval t given the interval is reached. The discrete survivor function for reaching interval t
3 2 is S(t) =Pr(T t) = t;1 Y s=1 [1 ; (s)] (0.2) and the unconditional probability for failure at t is Pr(T = t) = (t)s(t). For a homogeneous population, discrete failure time data are given by (t i i ) i =1 ::: n,wheret i =min(t i C i ) is the minimum of survival time T i and censoring time C i, and i is the indicator variable for failure ( i =1) or censoring ( i =0). For the following we assume that censoring occurs at the end of the intervals, otherwise appropriate modications have tobe made. Simple estimates for (t) arecrude death rates ^(t) =d t =n t, where n t is the size of the population at risk and d t the number of observed failures in [a t;1 a t ). The so-called standard life table estimate replaces n t by n t ; w t =2, where w t is the number of censored observations in [a t;1 a t ), thereby assuming that censored observations are under risk for half the interval. In particular for large t, where the size n t of the risk set often becomes small, these estimates may be quite unsteady, and smoothing by one of the nonparametric methods outlined further below will be appropriate. This is illustrated in Figure 1. Figure 0.1: Posterior mean estimates (solid line) and pointwise two standard deviation condence bands together with crude death rates (plus) It shows crude death rates at age t in years for a population of retired American white females together with a smoothed estimate. A look at the data
4 3 (Green and Silverman [13], p. 101) shows that n t becomes rather small for higher age t, resulting in unstable estimates towards the end of the observation period. Discrete failure time data can also be described by discrete-time counting processes N i (t) i=1 ::: n, dened by N i (0) = 0 and ( 1 if individual i is at risk and fails at t N i (t) =N i (t) ; N i (t ; 1) = 0 else for t 1, see e.g. Arjas and Haara [3]. Thus, for every individual i under risk at t, the value N i (t) can be considered as the outcome P of a binary experiment, with Pr(N i (t) = 1) = (t). The sum N(t) = N i i(t) counts the number of observed failures up to t, and crude death rates can be derived as nonparametric maximum likelihood estimators, in analogy to the Nelson- Aalen estimator for continuous time. In most studies a vector of possibly time-dependent basic or derived covariates x it is observed in addition to failure times. Time-dependent components of x it are assumed to be xed within interval t. Then the hazard function for survival time T i of individual i will generally depend on covariates and is dened by i (tjx it) =Pr(T i = tjt i t x it) t =1 ::: q (0.3) where x =(x it i1 ::: x it ) denotes the history of covariates up to time t. Expressions for the survivor function (0.2) and for Pr(T = t) have to be modied accordingly. Also, the sequence of binary experiments for N i (t) t 1, will depend on x it. Unless separate analysis for homogeneous subgroups can be carried out, it is natural to describe the dependence of conditional probabilities of failure by binary regression models. Let F t; denote the history of events registered up to time t, but excluding the failure at t, and let r i (t) denote a risk indicator with r i (t) = 1 if individual i is at risk in interval t, r i (t) = 0 otherwise. Then it will be assumed that the conditional probability of failures can be expressed as Pr(N i (t) =1jF t; )=r i (t) i (tjx it)
5 4 with hazard functions linked to a time-varying predictor it by i (tjx it )=h( it) (0.4) through a suitable link function h, for example the logistic function. The predictor it is modelled parametrically or nonparametrically as a function of time t and basic or derived covariates x it. Parametric models This section deals mainly with parametric models (0.4), where the predictor has the common linear parametric form it = z 0 it (0.5) as in generalized linear models, with the design vector z it formed from basic covariates. In many applications the linear predictor is chosen as it = 0t + x 0 it x (0.6) where x is a vector of covariate eects and 0t t=1 ::: qis a time-varying baseline eect. Model (0.6) can be written in the form (0.5) by dening z 0 it = (0 ::: 1 ::: 0 x 0 ), 0 =( it 01 ::: 0q 0 ). Other predictors are discussed x further below. Dierent discrete-time failure models are determined by choice of the link function h. Most common are discrete proportional hazards and logistic models. The discrete proportional hazards model Suppose that an underlying continuous failure time obeys a proportional hazard or relativeriskmodel 0 (t) exp(x 0 it x ). If time T can only be observed as a discrete random variable, T = t denoting failure in [a t;1 a t ), this yields the discrete proportional hazards model (tjx it )=1; exp[; exp( 0t + x 0 it x )] (0.7)
6 5 with baseline eects Z at 0t =log 0 (t)dt at;1 derived from the baseline function 0 (u) (see e.g. Kalbeisch and Prentice [17]). An alternativeformulation of (0.7) is the complementary log-log model logf; log[1;(tjx 0 it )]g = 0t +x 0 it x. The parameter vector x is unchanged by the transition to the discrete version, so that the same analysis as with the proportional hazard model is possible as far as the inuence of covariates is concerned. However, x and time-varying eects 0t have nowto be estimated jointly. If the number of intervals is large, then the dimension of 01 ::: 0q may become dangerously high, often even leading to nonexistence of ML estimates. Then more parsimonious parametric forms like polynomials 0t = 0 + :::+ k t k, piecewise constant eects or regression splines with only a few cut points are preferable. Often, cubic-linear splines of the form 0t = t + 2 (t ; t c ) 2 ; + 3 (t ; t c ) 3 ;, are useful, where (t;t c ) ; =min(t;t c 0) and t c is a cut-point. The baseline eect is cubic before t c and linear after t c.such a simple spline model is more robust against few data at the end of the observation period than polynomials, and it is a smooth function, compared to piecewise constant modelling. Of course, other forms of regression splines may be considered. Also one may use the numerically more stable B-spline basis instead of the truncated power form, compare Sleeper and Harrington [20] in a continuous-time setting. By appropriate denition of the design vector, regression spline models can also be written in linear parametric from (0.5). The logistic model An alternative model is the logistic model for the discrete hazard (tjx it )= exp( 0t + x 0 it x ) 1+exp( 0t + x 0 it x ) (0.8) considered by Thompson [21] and, in slightly dierent form, by Cox[5]. For short intervals, this model becomes rather similar to the discrete proportional hazards model. An advantage of the logistic model is that the covariate eects x can be estimated semiparametrically, considering baseline eects 0t as
7 6 nuisance parameters and leaving them unspecied as in the continuous-time proportional hazards model, see Cox and Oakes [6]. Other discrete-time failure models result for other choices of h. Very exible models are obtained if the link is an element of a parametric family of link functions. Examples are the model of Aranda-Ordaz (see Fahrmeir and Tutz [11], p. 318) or the families considered by Czado [7]. Although choice of the link function is an important issue, we feel that careful modelling of the predictor is often even more essential. To simplify discussion, we consider only two covariates x and w, where x is a continuous variable like tumor size or hormone concentration and w is binary, indicating for example sex or treatment group. Models with time-varying eects are obtained by assuming it = 0t + 1 x i + 2t w i (0.9) where 2t could be the time-varying eect of a therapy, possibly decreasing with time. Alternatively, the term 2t w i may be considered as a particular form of interaction between time t and the covariate w. The function 2t may be modelled parametrically similarly as the baseline eect 0t.Amore detailed discussion of parametric time-varying eects is in Yamaguchi [24]. If the simple linear form 1 x i for the inuence of x is too restrictive, one may also try to replace it by a nonlinear smooth function 1 (x) like in generalized additive models. As in Hastie and Tibshirani [15], one may go a step further and consider varying coecient models of the form it = 0t + 1 (x i )+ 2 (x i )w i + 3t w i : (0.10) Here the smooth function 2 may be viewed as an eect of w varying over x, or it is interpreted as an interaction term between the continuous covariate x and the binary covariate w. Without further prior knowledge it will often be dicult to specify certain parametric forms for the smooth functions in (0.9), (0.10). Instead, it will be reasonable to explore patterns with nonparametric approaches outlined in the next section and to proceed then with a simpler parametric likelihood-based inference.
8 7 Likelihood inference Under appropriate conditions on censoring and covariate processes, the loglikelihood reduces to the common form known for binary regression models. Introducing the indicators ( (0 ::: 0) i =0 y i =(y i1 ::: y it )= (0 ::: 0 1) i =1 it is proportional to l() = nx t i X i=1 s=1 fy is log i (sjx is)+(1; y is ) log[1 ; i (sjx is)]g : Arjas and Haara [3] give a careful discussion of assumptions leading to l() as a (partial) log-likelihood. They will generally hold for noninformative random censoring and time-independent orexternalcovariates, but can become critical for internal covariates. In particular, the likelihood is valid in the presence of ties, by making the weak assumption that failures at t are conditionally independent given covariates and past failures. By appropriate construction of design vectors z it, the parameters can then be estimated with software for binary regression models, and other tools of likelihood inference for these models may be adopted, see Fahrmeir and Tutz [11], ch. 9. Nonparametric approaches Often, the common assumptions of linearity, additivity and time-constancy of eects are denitely violated and parametric specications of more exible models like (0.9) or (0.10) may be dicult. In this situation nonparametric approaches provide useful tools for detecting and exploring nonlinear or timedependent eects. We outline the roughness penalty approach, leading to spline-type smoothing and related Bayesian nonparametric techniques. Other methods are based on discrete kernels (e.g. Fahrmeir and Tutz [11], ch. 5, 9), or local likelihoods (Wu and Tuma [23], in a continuous-time setting, Tutz [22]). Consider models like (0.9) or (0.10) with unknown parameter vector and unknown "smooth functions" 1 2 ::: q of time or continuous covariates. The roughness penalty approach maximizes a penalized log-likelihood
9 8 criterion pl( 1 ::: p )=l( 1 ::: p ) ; px j=1 j J( j ) where J( j ) are roughness penalties and j are smoothing parameters. A simple roughness penalty for a time-varying eect jt t=1 ::: q,is J( j )= qx s=2 ( jt ; j t;1 ) 2 : (0.11) a t ; a t;1 The same form may be used for a function j (x) of some continuous covariate x. Another common penalty is J( j )= Z [ j (x) 00 ] 2 dx leading to cubic smoothing splines (see e.g. Green and Silverman [13]). Kiefer [18] proposes a discrete proportional hazards model with time-varying eects jt and penalty function (0.11). Dannegger, Klinger and Ulm [8] usethe roughness penalty approach to explore nonlinear and time-varying eects of risk factors in a breast cancer study with monthly data. Related Bayesian nonparametric approaches put smoothness priors on jt or j (x) and estimation is based on posteriors given the data. If, for example, a random walk of rst order jt = j t;1 +(a t ; a t;1 ) 1=2 v t v t N(0 1= j ) is taken as smoothness prior for the sequence f jt g, then the posterior mode or MAP estimate is identical to the penalized likelihood estimate with penalty (0.11), see Fahrmeir [9]. Full posterior analyses can be carried out with MCMC (Markov Chain Monte Carlo) techniques, see Fahrmeir and Knorr- Held [10] and, in the related context of generalized additive models, Biller and Fahrmeir [4]. Nonparametric methods are also useful for smoothing hazard functions in the absence of covariates. The smooth curve in Figure 1 is the posterior mean estimate obtained from a Bayesian MCMC approach. The corresponding cubic spline smoother is very close, see Green and Silverman [13].
10 9 Some extensions More complex discrete-time event history data Discrete failure time models can be extended similarly as continuous-time models. Often one may distinguish between several types R 2f1 ::: mg of failure or terminating events. For example, in a medical study there may be several causes of death, or in studies on unemployment duration one may consider full time and part time jobs that end the unemployment duration. The basic quantities for models with multiple modes of failure are now causespecic hazard functions ir (tjx )=Pr(T = t R = rjt t it x ) (0.12) it i.e. conditional probabilities for failure of type r in interval t. Multicategorical response models can be used for regression analysis of cause-specic hazard functions. A common candidate for unordered events is the multinomial logit model (e.g. Allison [1]). Other discrete choice models like a probit or a nested multinomial logit model (Hill et al. [16]) may also be considered. If events are ordered, ordinal response models (e.g. Fahrmeir and Tutz [11]) are appropriate. Again parametric and nonparametric approaches are possible. Penalized likelihood and Bayesian smoothing techniques with models for time-varying eects are applied to unemployment durations in Fahrmeir and Wagenpfeil [12], Fahrmeir and Knorr-Held [10]. Discrete failure time models can also be extended to general multiepisodemultistate models or, in counting process terminology, marked point processes. Hamerle [14] studies parametric regression analysis for such discrete event history data, but generally much less theoretical or applied work has been done here. Unobserved heterogeneity and frailty models The above model specications assume that individual heterogeneity can be described by observed variables. However, it is likely that not all relevant variables are included in a regression model. The conventional approach to account for neglected heterogeneity or frailty is to include individual-specic
11 10 parameters into the predictor, i.e. modifying it to h it = it + i, and to assume that the individual-specic parameters are i.i.d. random variables from a prior density function f, i.e. a normal density. Estimation can then be based on approaches for generalized mixed models with random eects, and recent MCMC methods seem particularly well suited. However, for singlespell failure time models the estimates can be very dependent on the choice of the prior specication. More experience is needed here. The problem becomes less severe with repeated events. References [1] Allison, P. D. (1982). Discrete{time methods for the analysis of event histories. In Sociological Methodology, Vol.13, S. Leinhardt, ed., Jossey{ Bass, San Francisco, pp. 61{98. [2] Andersen, P. K., Borgan, O., Gill, R. D., and Keiding, N. (1993). Statistical models based oncounting processes. Springer Verlag, New York. [3] Arjas, E., and Haara, P. (1987). A logistic regression model for hazard: Asymptotic results. Scandinavian Journal of Statistics 14, 1{18. [4] Biller, C., and Fahrmeir, L. (1997). Bayesian spline-type smoothing in generalized regression models. To appear in Computational Statistics. [5] Cox, D. R. (1972). Regression models and life tables. Journal of the Royal Statistical Society B34, 187{220. (with discussion). [6] Cox, D. R., and Oakes, D. (1984). Analysis of survival data. Chapman and Hall, London. [7] Czado, C. (1992). On link selection in generalized linear models. Springer Lecture Notes in Statistics 78, 60{65. [8] Dannegger, F., Klinger, A., and Ulm, K. (1995). Identication of prognostic factors with censored data. Discussion Paper 11, Sonderforschungsbereich 386, Ludwig-Maximilians Universitat Munchen.
12 11 [9] Fahrmeir, L. (1994). Dynamic modelling and penalized likelihood estimation for discrete time survival data. Biometrika 81, 317{330. [10] Fahrmeir, L., and Knorr-Held, L. (1997). Dynamic discrete-time duration models: estimation via Markov Chain Monte Carlo. To appear in Sociological Methodology. [11] Fahrmeir, L., and Tutz, G. (1994). Multivariate Statistical Modelling Based on Generalized Linear Models. Springer{Verlag, New York. [12] Fahrmeir, L., and Wagenpfeil, S. (1996). Smoothing hazard functions and time-varying eects in discrete duration and competing risk models. To appear in Journal of the American Statistical Association. [13] Green, P. J., andsilverman, B. W. (1994). Nonparametric Regression and Generalized Linear Models. Chapman and Hall, London. [14] Hamerle, A. (1986). Regression analysis for discrete event history or failure time data. Statistical Papers 27, 207{225. [15] Hastie, T., and Tibshirani, R. (1993). Varying{coecient models. Journal of the Royal Statistical Society B 55, 757{796. [16] Hill, D. H., Axinn, W. G., and Thornton, A. (1993). Competing hazards with shared unmeasured risk factors. In Sociological Methodology, Vol.23, P. V. Marsden, ed., Basil Blackwell, Oxford, pp. 245{277. [17] Kalbeisch, J. D., and Prentice, R. L. (1980). The Statistical Analysis of Failure Time Data. John Wiley & Sons, New York. [18] Kiefer, N. M. (1990). Econometric methods for grouped duration data. In Panel data and labor market studies, J. Hartog, G. Ridder, and J. Theeuwes, eds., Elsevier Science Publisher, pp. 97{117. [19] Scheike, T. H., and Jensen, T. K. (1997). A discrete survival model with random eects: An application to time to pregnancy. Biometrics 53, 349{360.
13 12 [20] Sleeper, L. A., and Harrington, D. P. (1990). Regression splines in the Cox model with application to covariate eects in liver disease. Journal of the American Statistical Association 85, 941{949. [21] Thompson Jr., W. A. (1977). On the treatment of grouped observations in life studies. Biometrics 33, 463{470. [22] Tutz, G. (1995). Dynamic modelling of discrete duration data: a local likelihood approach. Report 95-15, Institut fur Quantitative Methoden, Tech. Univ. Berlin. [23] Wu, L. L., and Tuma, N. B. (1991). Assessing bias and t of global and local hazard models. Sociological Methods and Research 19, 354{387. [24] Yamaguchi, K. (1993). Modeling time{varying eects of covariates in event{history analysis using statistics from the saturated hazard rate model. In Sociological Methodology, Vol.23, P. V. Marsden, ed., Basil Blackwell, Oxford, pp. 279{317.
Fahrmeir, Knorr-Held: Dynamic discrete-time duration models. (REVISED)
Fahrmeir, Knorr-Held: Dynamic discrete-time duration models. (REVISED) Sonderforschungsbereich 386, Paper 14 (1996) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Dynamic discrete{time duration
More informationModelling geoadditive survival data
Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model
More informationFahrmeir: Recent Advances in Semiparametric Bayesian Function Estimation
Fahrmeir: Recent Advances in Semiparametric Bayesian Function Estimation Sonderforschungsbereich 386, Paper 137 (1998) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Recent Advances in Semiparametric
More informationPENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA
PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University
More informationFahrmeir, Klinger: A nonparametric multiplicative hazard model for event history analysis
Fahrmeir, Klinger: A nonparametric multiplicative hazard model for event history analysis Sonderforschungsbereich 386, Paper 12 (1995) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner A nonparametric
More informationRegularization in Cox Frailty Models
Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University
More informationAnalysis of competing risks data and simulation of data following predened subdistribution hazards
Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013
More informationAugustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models
Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Sonderforschungsbereich 386, Paper 196 (2000) Online
More informationKneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"
Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/
More informationA general mixed model approach for spatio-temporal regression data
A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression
More informationAnalysing geoadditive regression data: a mixed model approach
Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL
October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach
More informationKüchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models
Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models Sonderforschungsbereich 386, Paper 27 (996) Online unter: http://epububuni-muenchende/ Projektpartner An
More informationPower and Sample Size Calculations with the Additive Hazards Model
Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationPruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization
Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization Sonderforschungsbereich 386, Paper 66 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationVariable Selection and Model Choice in Survival Models with Time-Varying Effects
Variable Selection and Model Choice in Survival Models with Time-Varying Effects Boosting Survival Models Benjamin Hofner 1 Department of Medical Informatics, Biometry and Epidemiology (IMBE) Friedrich-Alexander-Universität
More informationDuration Analysis. Joan Llull
Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu Introduction Duration Analysis 2 Duration analysis Duration data: how long has an individual
More informationStatistical Inference and Methods
Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and
More informationGoodness-of-Fit Tests With Right-Censored Data by Edsel A. Pe~na Department of Statistics University of South Carolina Colloquium Talk August 31, 2 Research supported by an NIH Grant 1 1. Practical Problem
More informationMixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina
Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -
More informationMultivariate Survival Analysis
Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in
More information* * * * * * * * * * * * * * * ** * **
Generalized Additive Models Trevor Hastie y and Robert Tibshirani z Department of Statistics and Division of Biostatistics Stanford University 12th May, 1995 Regression models play an important role in
More informationOn the Breslow estimator
Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,
More informationRene Tabanera y Palacios 4. Danish Epidemiology Science Center. Novo Nordisk A/S Gentofte. September 1, 1995
Estimation of variance in Cox's regression model with gamma frailties. Per Kragh Andersen 2 John P. Klein 3 Kim M. Knudsen 2 Rene Tabanera y Palacios 4 Department of Biostatistics, University of Copenhagen,
More informationToutenburg, Nittner: The Classical Linear Regression Model with one Incomplete Binary Variable
Toutenburg, Nittner: The Classical Linear Regression Model with one Incomplete Binary Variable Sonderforschungsbereich 386, Paper 178 (1999) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationInference on the Univariate Frailty Model: An Approach Bayesian Reference Analysis
Inference on the Univariate Frailty Model: An Approach Bayesian Reference Analysis Vera Lucia D. Tomazella and Camila Bertini Martins Universidade Federal de São Carlos, SP-Brasil (vera@ufscar.br; cacamila-bertini@yahoo.com.br)
More informationA COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky
A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationLecture 1. Introduction Statistics Statistical Methods II. Presented January 8, 2018
Introduction Statistics 211 - Statistical Methods II Presented January 8, 2018 linear models Dan Gillen Department of Statistics University of California, Irvine 1.1 Logistics and Contact Information Lectures:
More informationPruscha: Semiparametric Point Process and Time Series Models for Series of Events
Pruscha: Semiparametric Point Process and ime Series Models for Series of Events Sonderforschungsbereich 386, Paper 4 (998) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Semiparametric Point
More informationBiller: Adaptive Bayesian Regression Splines in Semiparametric Generalized Linear Models
Biller: Adaptive Bayesian Regression Splines in Semiparametric Generalized Linear Models Sonderforschungsbereich 386, Paper 133 (1998) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Adaptive
More informationLecture 7 Time-dependent Covariates in Cox Regression
Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the
More informationPrediction of ordinal outcomes when the association between predictors and outcome diers between outcome levels
STATISTICS IN MEDICINE Statist. Med. 2005; 24:1357 1369 Published online 26 November 2004 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/sim.2009 Prediction of ordinal outcomes when the
More informationLecture 22 Survival Analysis: An Introduction
University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which
More informationVariable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting
Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting Andreas Groll 1 and Gerhard Tutz 2 1 Department of Statistics, University of Munich, Akademiestrasse 1, D-80799, Munich,
More informationThe Poisson transform for unnormalised statistical models. Nicolas Chopin (ENSAE) joint work with Simon Barthelmé (CNRS, Gipsa-LAB)
The Poisson transform for unnormalised statistical models Nicolas Chopin (ENSAE) joint work with Simon Barthelmé (CNRS, Gipsa-LAB) Part I Unnormalised statistical models Unnormalised statistical models
More informationLongitudinal + Reliability = Joint Modeling
Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,
More informationPower Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis
Power Calculations for Preclinical Studies Using a K-Sample Rank Test and the Lehmann Alternative Hypothesis Glenn Heller Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center,
More informationYou know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?
You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David
More informationSSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work
SSUI: Presentation Hints 1 Comparing Marginal and Random Eects (Frailty) Models Terry M. Therneau Mayo Clinic April 1998 SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that
More informationLecture 5 Models and methods for recurrent event data
Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.
More informationInversion Base Height. Daggot Pressure Gradient Visibility (miles)
Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:
More informationSyllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools
Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The
More informationFahrmeir, Knorr-Held: Dynamic and semiparametric models
Fahrmeir, Knorr-Held: Dynamic and semiparametric models Sonderforschungsbereich 386, Paper 76 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Dynamic and semiparametric models Ludwig
More informationSemiparametric Regression
Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under
More informationDynamic Models Part 1
Dynamic Models Part 1 Christopher Taber University of Wisconsin December 5, 2016 Survival analysis This is especially useful for variables of interest measured in lengths of time: Length of life after
More informationMicroeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.
Cheng Hsiao Microeconometrics Required Text: C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press. A.C. Cameron and P.K. Trivedi (2005), Microeconometrics, Cambridge University
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationUNIVERSITY OF CALIFORNIA, SAN DIEGO
UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department
More informationGROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin
FITTING COX'S PROPORTIONAL HAZARDS MODEL USING GROUPED SURVIVAL DATA Ian W. McKeague and Mei-Jie Zhang Florida State University and Medical College of Wisconsin Cox's proportional hazard model is often
More informationfor function values or parameters, which are neighbours in the domain of a metrical covariate or a time scale, or in space. These concepts have been u
Bayesian generalized additive mied models. study A simulation Stefan Lang and Ludwig Fahrmeir University of Munich, Ludwigstr. 33, 8539 Munich email:lang@stat.uni-muenchen.de and fahrmeir@stat.uni-muenchen.de
More informationLecture 3. Truncation, length-bias and prevalence sampling
Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in
More informationtype of the at; and an investigation of duration of unemployment in West Germany from 1980 to 1995, with spatio-temporal data from the German Federal
Bayesian Inference for Generalized Additive Mixed Models Based on Markov Random Field Priors Ludwig Fahrmeir and Stefan Lang University of Munich, Ludwigstr. 33, 80539 Munich email: fahrmeir@stat.uni-muenchen.de
More informationGeneralized Additive Models
By Trevor Hastie and R. Tibshirani Regression models play an important role in many applied settings, by enabling predictive analysis, revealing classification rules, and providing data-analytic tools
More informationNORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET
NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET Investigating posterior contour probabilities using INLA: A case study on recurrence of bladder tumours by Rupali Akerkar PREPRINT STATISTICS NO. 4/2012 NORWEGIAN
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear
More informationModelling Survival Data using Generalized Additive Models with Flexible Link
Modelling Survival Data using Generalized Additive Models with Flexible Link Ana L. Papoila 1 and Cristina S. Rocha 2 1 Faculdade de Ciências Médicas, Dep. de Bioestatística e Informática, Universidade
More information1 The problem of survival analysis
1 The problem of survival analysis Survival analysis concerns analyzing the time to the occurrence of an event. For instance, we have a dataset in which the times are 1, 5, 9, 20, and 22. Perhaps those
More information1. Introduction Over the last three decades a number of model selection criteria have been proposed, including AIC (Akaike, 1973), AICC (Hurvich & Tsa
On the Use of Marginal Likelihood in Model Selection Peide Shi Department of Probability and Statistics Peking University, Beijing 100871 P. R. China Chih-Ling Tsai Graduate School of Management University
More informationImproving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates
Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/
More informationSelection of Variables and Functional Forms in Multivariable Analysis: Current Issues and Future Directions
in Multivariable Analysis: Current Issues and Future Directions Frank E Harrell Jr Department of Biostatistics Vanderbilt University School of Medicine STRATOS Banff Alberta 2016-07-04 Fractional polynomials,
More information3003 Cure. F. P. Treasure
3003 Cure F. P. reasure November 8, 2000 Peter reasure / November 8, 2000/ Cure / 3003 1 Cure A Simple Cure Model he Concept of Cure A cure model is a survival model where a fraction of the population
More informationA NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL
Discussiones Mathematicae Probability and Statistics 36 206 43 5 doi:0.75/dmps.80 A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Tadeusz Bednarski Wroclaw University e-mail: t.bednarski@prawo.uni.wroc.pl
More informationLOGISTIC REGRESSION Joseph M. Hilbe
LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH
FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter
More informationContents. Part I: Fundamentals of Bayesian Inference 1
Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian
More informationST745: Survival Analysis: Cox-PH!
ST745: Survival Analysis: Cox-PH! Eric B. Laber Department of Statistics, North Carolina State University April 20, 2015 Rien n est plus dangereux qu une idee, quand on n a qu une idee. (Nothing is more
More informationJoint Modeling of Longitudinal Item Response Data and Survival
Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,
More informationEstimating subgroup specific treatment effects via concave fusion
Estimating subgroup specific treatment effects via concave fusion Jian Huang University of Iowa April 6, 2016 Outline 1 Motivation and the problem 2 The proposed model and approach Concave pairwise fusion
More informationIndividualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models
Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models Nicholas C. Henderson Thomas A. Louis Gary Rosner Ravi Varadhan Johns Hopkins University July 31, 2018
More informationFull likelihood inferences in the Cox model: an empirical likelihood approach
Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:
More informationSurvival Distributions, Hazard Functions, Cumulative Hazards
BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationConsider Table 1 (Note connection to start-stop process).
Discrete-Time Data and Models Discretized duration data are still duration data! Consider Table 1 (Note connection to start-stop process). Table 1: Example of Discrete-Time Event History Data Case Event
More informationThe Bias-Variance dilemma of the Monte Carlo. method. Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel
The Bias-Variance dilemma of the Monte Carlo method Zlochin Mark 1 and Yoram Baram 1 Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel fzmark,baramg@cs.technion.ac.il Abstract.
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationBIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY
BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationProblem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM
Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods MIT 14.385, Fall 2007 Due: Wednesday, 07 November 2007, 5:00 PM 1 Applied Problems Instructions: The page indications given below give you
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan
More informationIgnoring the matching variables in cohort studies - when is it valid, and why?
Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association
More informationOther Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model
Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);
More informationContinuous Time Survival in Latent Variable Models
Continuous Time Survival in Latent Variable Models Tihomir Asparouhov 1, Katherine Masyn 2, Bengt Muthen 3 Muthen & Muthen 1 University of California, Davis 2 University of California, Los Angeles 3 Abstract
More informationSURVIVAL ANALYSIS WITH MULTIPLE DISCRETE INDICATORS OF LATENT CLASSES KLAUS LARSEN, UCLA DRAFT - DO NOT DISTRIBUTE. 1.
SURVIVAL ANALYSIS WITH MULTIPLE DISCRETE INDICATORS OF LATENT CLASSES KLAUS LARSEN, UCLA DRAFT - DO NOT DISTRIBUTE Abstract.... 1. Introduction In many studies the outcome of primary interest is the time
More informationSTA 216, GLM, Lecture 16. October 29, 2007
STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural
More informationAn Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models
Proceedings 59th ISI World Statistics Congress, 25-30 August 2013, Hong Kong (Session CPS023) p.3938 An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models Vitara Pungpapong
More informationPart IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation
Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples
More informationPruscha, Göttlein: Regression Analysis for Forest Inventory Data with Time and Space Dependencies
Pruscha, Göttlein: Regression Analysis for Forest Inventory Data with Time and Space Dependencies Sonderforschungsbereich 386, Paper 173 (1999) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationPricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model
Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model Quentin Guibert Univ Lyon, Université Claude Bernard Lyon 1, ISFA, Laboratoire SAF EA2429, F-69366, Lyon,
More informationReconstruction of individual patient data for meta analysis via Bayesian approach
Reconstruction of individual patient data for meta analysis via Bayesian approach Yusuke Yamaguchi, Wataru Sakamoto and Shingo Shirahata Graduate School of Engineering Science, Osaka University Masashi
More informationNELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation
NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable
More informationStatistical Analysis of Competing Risks With Missing Causes of Failure
Proceedings 59th ISI World Statistics Congress, 25-3 August 213, Hong Kong (Session STS9) p.1223 Statistical Analysis of Competing Risks With Missing Causes of Failure Isha Dewan 1,3 and Uttara V. Naik-Nimbalkar
More informationSurvival Analysis Math 434 Fall 2011
Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup
More informationEfficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data
Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationTowards stratified medicine instead of dichotomization, estimate a treatment effect function for a continuous covariate
Towards stratified medicine instead of dichotomization, estimate a treatment effect function for a continuous covariate Willi Sauerbrei 1, Patrick Royston 2 1 IMBI, University Medical Center Freiburg 2
More informationHornsteiner, Hamerle, Michels: Parametric versus Nonparametric Treatment of Unobserved Heterogeneity in Multivariate Failure Times
Hornsteiner, Hamerle, Michels: Parametric versus Nonparametric Treatment of Unobserved Heterogeneity in Multivariate Failure Times Sonderforschungsbereich 386, Paper 80 (1997) Online unter: http://epub.ub.uni-muenchen.de/
More informationBuilding a Prognostic Biomarker
Building a Prognostic Biomarker Noah Simon and Richard Simon July 2016 1 / 44 Prognostic Biomarker for a Continuous Measure On each of n patients measure y i - single continuous outcome (eg. blood pressure,
More information