ST745: Survival Analysis: Parametric


1 ST745: Survival Analysis: Parametric
Eric B. Laber
Department of Statistics, North Carolina State University
January 13, 2015

2 ...the statistician knows... that in nature there never was a normal distribution, there never was a straight line, yet with normal and linear assumptions, known to be false, he can often derive results which match, to a useful approximation, those found in the real world.
Nicole Polizzi

3 Warm-up
Define and explain:
1. Survivor function
2. Hazard function
3. Cumulative hazard function
4. How does the density relate to the hazard function?
True or false:
(T/F) Hazard fn is always positive
(T/F) Hazard fn is always less than one
(T/F) Hazard fn integrates to one
What is censoring?

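4 Fact from a past life
Recall that if $X \sim \mathrm{Gamma}(\alpha, \beta)$ then $X$ has density
$f(x) = \frac{\beta^{\alpha}}{\Gamma(\alpha)} x^{\alpha - 1} \exp\{-\beta x\}$, $x > 0$,
where $\Gamma(u)$ is the gamma function
$\Gamma(u) = \int_0^{\infty} x^{u - 1} \exp\{-x\}\, dx$.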

5 Parametric models
The first part of this course will focus on parametric models
Important role in statistics:
  Data-impoverished settings
  Gain intuition
  Reference distributions for test cases
  Sometimes justified by underlying process
With parametric models we can focus on the usual likelihood-based approaches:
  Maximum likelihood estimation
  Wald-type confidence intervals

6 Example: Exponential distribution
If you can't get your estimator to work assuming an exponential generative model, you're in a bad way.
Adam Baldwin as Animal Mother in Full Metal Jacket
Exponential distribution defined by $h(t) \equiv \lambda$
Exercise: Find $f(t)$ and $S(t)$
Note: sometimes parameterized by $\theta = 1/\lambda$
$T \sim \exp(\theta)$ implies $E T = \theta$ and $\mathrm{Var}\, T = \theta^2$
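One way to work the exercise, sketched under the standard relations $H(t) = \int_0^t h(u)\,du$, $S(t) = \exp\{-H(t)\}$ and $f(t) = h(t) S(t)$ from the warm-up:

```latex
\begin{align*}
H(t) &= \int_0^t \lambda \, du = \lambda t, \\
S(t) &= \exp\{-H(t)\} = \exp\{-\lambda t\}, \\
f(t) &= h(t)\, S(t) = \lambda \exp\{-\lambda t\}, \qquad t \ge 0.
\end{align*}
```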

7 Example: Weibull distribution
Very widely used, esp. in manufacturing
Closed under minimum operation:
  Minimum of independent Weibulls with same shape parameter is Weibull
  Think weakest link in a chain
$T \sim \mathrm{Weib}(\lambda, \beta)$ then $h(t) = \lambda \beta (\lambda t)^{\beta - 1}$
Exercise: find $f(t)$ and $S(t)$
On board: for $T \sim \mathrm{Weib}(\lambda, \beta)$ find $E T^r$
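A possible route to the board exercise, assuming the density $f(t) = \lambda\beta(\lambda t)^{\beta-1}\exp\{-(\lambda t)^{\beta}\}$ implied by the hazard above and using the substitution $u = (\lambda t)^{\beta}$, so that $du = \lambda\beta(\lambda t)^{\beta-1}\,dt$ and $t = u^{1/\beta}/\lambda$:

```latex
\begin{align*}
E\,T^r &= \int_0^{\infty} t^r \,\lambda\beta(\lambda t)^{\beta-1}
          \exp\{-(\lambda t)^{\beta}\}\, dt \\
       &= \lambda^{-r} \int_0^{\infty} u^{r/\beta} \exp\{-u\}\, du \\
       &= \lambda^{-r}\, \Gamma\!\left(1 + \frac{r}{\beta}\right).
\end{align*}
```

Setting $\beta = 1$ and $r = 1$ recovers the exponential mean $1/\lambda$.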

8 Example: Weibull distribution cont'd
Weibull distribution gives a reasonable amount of flexibility:
  β > 1 increasing hazard
  β < 1 decreasing hazard
  β = 1 constant hazard
Combination of flexibility and analytic tractability makes the Weibull an appealing choice in practice
[Figure: three panels of h(t) versus t: Weibull hazard, β=1/2; Weibull hazard, β=1; Weibull hazard, β=2]

9 Example: Weibull distribution cont'd
Book sometimes uses $T \sim \mathrm{Weib}(\alpha, \beta)$ where $\lambda = 1/\alpha$
β is sometimes called the shape parameter
λ can be seen to compress or stretch the time axis
$f(t; \lambda, \beta) = \lambda \beta (\lambda t)^{\beta - 1} \exp\{-(\lambda t)^{\beta}\}$
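A minimal sketch of how the lecture's Weib(λ, β) parameterization lines up with R's built-in Weibull functions. The mapping shape = β, scale = 1/λ follows from R's documented density; the helper name `weib_hazard` and the numeric values are illustrative assumptions, not part of the slides.

```r
## Assumption: the lecture's Weib(lambda, beta) corresponds to
## R's shape = beta and scale = 1 / lambda.
weib_hazard <- function(t, lambda, beta) {
  # Hazard in the lecture's parameterization: lambda * beta * (lambda t)^(beta - 1)
  lambda * beta * (lambda * t)^(beta - 1)
}

lambda <- 0.5   # illustrative value
beta   <- 2     # illustrative value
t_grid <- seq(0.1, 5, length.out = 50)

## Hazard computed as f(t) / S(t) from R's built-in Weibull functions
f_t <- dweibull(t_grid, shape = beta, scale = 1 / lambda)
S_t <- pweibull(t_grid, shape = beta, scale = 1 / lambda, lower.tail = FALSE)
h_builtin <- f_t / S_t

## The two versions should agree up to floating-point error
max_abs_diff <- max(abs(h_builtin - weib_hazard(t_grid, lambda, beta)))
```

Under this mapping, the book's α is simply R's `scale` argument.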

10 Code break
## Generate 100 samples from exp distn with mean 3
n = 100;
lam = 1/3;
x = rexp(n, rate=lam);
hist(x, freq=FALSE, col="gray", main="");
## Plot density and survivor function
x = seq(0, 5, length=1000);
plot(x, dexp(x, rate=lam), lwd=4, type="l");
plot(x, 1-pexp(x, rate=lam), lwd=4, type="l");


18 Code break II
## Generate 1000 points from a Weibull with shape 1 and scale 1
n = 1000;
shp = 1;
scl = 1;
x = rweibull(n, shape=shp, scale=scl);
hist(x, col="gray", freq=FALSE);
## Plot density and cumulative hazard
x = seq(0, max(x), length=1000);
plot(x, dweibull(x, shape=shp, scale=scl));
ST = 1-pweibull(x, shape=shp, scale=scl);
plot(x, -log(ST), type="l");


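28 Example: extreme value distribution
$T \sim$ Weibull then $\log T \sim$ EV
$Y = \log T$ takes values in $\mathbb{R}$ (why?)
$f(y) = b^{-1} \exp\left\{ \frac{y - u}{b} - \exp\left( \frac{y - u}{b} \right) \right\}$
$S(y) = \exp\left\{ -\exp\left( \frac{y - u}{b} \right) \right\}$,
where $b > 0$, $u \in \mathbb{R}$.
Does the form of $f(y)$ look familiar?
If $T \sim \mathrm{Weib}(\alpha, \beta)$ then we say $Y = \log T \sim \mathrm{EV}(u, b)$ with $b = 1/\beta$ and $u = \log \alpha$.
If $Y \sim \mathrm{EV}(u, b)$ then $(Y - u)/b \sim \mathrm{EV}(0, 1)$; EV(0, 1) is called the standard extreme value distribution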

29 Example: extreme value distribution cont'd
Question: let $X$ denote covariates; how can we (conveniently) build a model so that $T \mid X$ follows a Weibull distribution?
Let $u(x)$ and $b(x)$ be mean and standard deviation functions
E.g., $u(x) = x^{\top}\theta$ and $b(x) = \exp\{x^{\top}\gamma\}$
Assume $\frac{\log T - u(X)}{b(X)} \sim \mathrm{EV}(0, 1)$;
this is an example of our old friend the location-scale model!

30 Example: log-normal distribution (aka the Lormal distn)
$T$ is said to be log-normally distributed if $Y = \log T \sim N(\mu, \sigma^2)$, which we denote $T \sim \log N(\mu, \sigma^2)$
A log-normal has density
$f(t) = \frac{1}{\sqrt{2\pi\sigma^2}\, t} \exp\left\{ -\frac{1}{2} \left( \frac{\log t - \mu}{\sigma} \right)^2 \right\}$;
how can we derive this density?
Derive the hazard function and show $h(0) = 0$ and $\lim_{t \to \infty} h(t) = 0$.
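One way to derive the density is a change of variables through the CDF. The sketch below writes $\Phi$ and $\phi$ for the standard normal CDF and density, which is notation assumed here rather than taken from this slide:

```latex
\begin{align*}
F_T(t) &= P(\log T \le \log t)
        = \Phi\!\left(\frac{\log t - \mu}{\sigma}\right), \qquad t > 0, \\
f(t)   &= \frac{d}{dt} F_T(t)
        = \frac{1}{\sigma t}\,\phi\!\left(\frac{\log t - \mu}{\sigma}\right)
        = \frac{1}{\sqrt{2\pi\sigma^2}\, t}
          \exp\left\{-\frac{1}{2}\left(\frac{\log t - \mu}{\sigma}\right)^2\right\}, \\
h(t)   &= \frac{f(t)}{S(t)}
        = \frac{f(t)}{1 - \Phi\!\left((\log t - \mu)/\sigma\right)}.
\end{align*}
```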

31 Example: log-normal distribution cont'd
$h(t)$ is continuous and satisfies $h(0) = \lim_{t \to \infty} h(t) = 0$
Corresponds to a mix of individuals with short and long lifetimes:
  Some forms of cancer
  Marriages
  Some electronics
  Waiting time of my dog

32 Code break III
Question: Using only what we've already covered in class, how to generate data from a standard EV in R?
Question: Using only what we've already covered in class, how to generate data from an $\mathrm{EV}(u, b)$?
Using the function rnorm, which generates data from a standard normal distribution, how would you generate data from a $\log N(\mu, \sigma^2)$?
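One possible approach to these questions, sketched under the assumption that a unit exponential is the Weibull with α = β = 1, so its log is a standard EV draw. The seed and the values of `u`, `b`, `mu` and `sigma` are illustrative only.

```r
set.seed(745)   # arbitrary seed, for reproducibility only
n <- 10000

## Standard EV: log of a unit exponential (a Weibull with alpha = beta = 1)
y_std <- log(rexp(n, rate = 1))

## EV(u, b): shift and scale a standard EV draw
u <- 1; b <- 0.5            # illustrative values
y_ev <- u + b * log(rexp(n, rate = 1))

## log N(mu, sigma^2): exponentiate a shifted, scaled standard normal
mu <- 0; sigma <- 1         # illustrative values
t_ln <- exp(mu + sigma * rnorm(n))

## Sanity check: empirical survivor function of y_std at 0
## versus the standard EV value S(0) = exp(-exp(0))
emp_S0  <- mean(y_std > 0)
theo_S0 <- exp(-exp(0))
```

The two survivor values should agree up to Monte Carlo error.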

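33 Example: log-logistic distribution
Also called the Fisk distribution
Density, survivor, and hazard:
$f(t) = \frac{(\beta/\alpha)(t/\alpha)^{\beta - 1}}{\left[1 + (t/\alpha)^{\beta}\right]^2}$,
$S(t) = \left[1 + (t/\alpha)^{\beta}\right]^{-1}$,
$h(t) = \frac{(\beta/\alpha)(t/\alpha)^{\beta - 1}}{1 + (t/\alpha)^{\beta}}$,
where $\alpha, \beta > 0$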

34 Example: log-logistic distribution
As the name suggests, $\log T$ follows a logistic distribution
The hazard function can be monotone or non-monotone depending on the choice of β
α shrinks or expands the time axis
[Figure: log-logistic hazard h(t) versus t for several values of β, including β = 1/2, 1, 2, 4]

35 Log location-scale models
Recall a parametric location-scale model for $Y \in \mathbb{R}$:
$f(y) = \frac{1}{b} f_0\left( \frac{y - u}{b} \right)$,
where $u$ is the location and $b$ is the scale parameter
$F(y) = F_0((y - u)/b)$ and $S(y) = 1 - F_0((y - u)/b) = S_0((y - u)/b)$
Common examples:
  $S_0(y) = \exp\{-e^{y}\}$ extreme value
  $S_0(y) = 1 - \Phi(y)$ normal
  $S_0(y) = (1 + e^{y})^{-1}$ logistic

36 Log location-scale models cont'd
Any location-scale model can be made into a lifetime distn through exponentiation: $T = \exp\{Y\}$
Thus $P(T > t) = S_0((\log t - u)/b)$
Log location-scale models are useful for simulating data:
1. Generate $Y \mid X \sim (\mu(X), \sigma^2(X))$
2. Set $T = \exp\{Y\}$
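The two-step recipe can be sketched in R as follows. The covariate distribution and the particular location and scale functions are hypothetical choices for illustration, and the standard extreme value error is just one of the common examples of $S_0$.

```r
set.seed(2015)   # arbitrary seed, for reproducibility only
n <- 500
x <- runif(n)    # hypothetical scalar covariate

## Hypothetical location and scale functions (not from the slides)
mu_x    <- 1 + 2 * x
sigma_x <- exp(-0.5 * x)

## Step 1: Y | X = mu(X) + sigma(X) * Z, with Z standard extreme value
z <- log(rexp(n, rate = 1))
y <- mu_x + sigma_x * z

## Step 2: exponentiate to obtain lifetimes
t_sim <- exp(y)
```

With a standard extreme value error, T given X = x is Weibull with shape 1/sigma(x); swapping `z` for `rnorm(n)` would give log-normal lifetimes instead.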

37 Piecewise constant hazard functions
Recall that a piecewise constant function takes the form
$h(t) = \lambda_j$ if $a_{j-1} \le t < a_j$,
where $0 = a_0 < a_1 < \cdots < a_m = \infty$
[Figure: piecewise constant hazard h(t) versus t]

38 Piecewise constant hazard functions, cont'd
A piecewise hazard can closely* approximate a large class of functions
These will play a role later on when we study nonparametrics
Note that the cumulative hazard function is given by
$H(t) = \sum_{j=1}^{m(t)-1} \lambda_j (a_j - a_{j-1}) + \lambda_{m(t)} (t - a_{m(t)-1})$,
where $m(t)$ satisfies $a_{m(t)-1} \le t < a_{m(t)}$
What is $S(t)$ in this case? What is $f(t)$ in this case?
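A small sketch of the cumulative hazard formula in R, which also gives the survivor function through $S(t) = \exp\{-H(t)\}$. The function names `cum_hazard` and `surv`, the cut points and the rates are all illustrative assumptions.

```r
## Cut points a_0 = 0 < a_1 < ... < a_{m-1}; a_m = Inf is implicit.
cuts    <- c(0, 1, 3)         # a_0, a_1, a_2 (illustrative)
lambdas <- c(0.5, 0.2, 1.0)   # lambda_1, lambda_2, lambda_3 (illustrative)

## Cumulative hazard: for each interval [a_{j-1}, a_j), multiply lambda_j
## by the time spent in that interval before t, then sum over intervals.
cum_hazard <- function(t, cuts, lambdas) {
  upper <- c(cuts[-1], Inf)
  sapply(t, function(ti) {
    exposure <- pmax(0, pmin(ti, upper) - cuts)
    sum(lambdas * exposure)
  })
}

## Survivor function via S(t) = exp{-H(t)}
surv <- function(t, cuts, lambdas) exp(-cum_hazard(t, cuts, lambdas))

H_2 <- cum_hazard(2, cuts, lambdas)   # 0.5 * 1 + 0.2 * 1 = 0.7
H_4 <- cum_hazard(4, cuts, lambdas)   # 0.5 * 1 + 0.2 * 2 + 1.0 * 1 = 1.9
```

Summing interval exposures avoids locating m(t) explicitly, at the cost of touching every interval for each t.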

39 Piecewise constant hazard functions, cont'd
Piecewise constant hazards are discontinuous, which may not be appealing in some settings. Common alternatives are:
1. Linear spline: $h(t) = \alpha_0 + \sum_{j=1}^{m-1} \lambda_j (t - a_j)_+$, where $(u)_+ = \max(0, u)$
2. Cubic spline: $h(t) = \alpha_0 + \alpha_1 t + \alpha_2 t^2 + \alpha_3 t^3 + \sum_{j=1}^{m-1} \lambda_j (t - a_j)_+^3$
Another approach is to model $\log h(t)$ with a nonlinear basis expansion

40 Regression models
A common question of interest is how covariates affect survival:
  Tumor stage and cancer survival
  Treatment received and time to relapse
  Manufacturing conditions and time to failure
Link parametric models with covariates by making parameters fns of covariates
E.g., $S(t \mid x) = \exp\{-\lambda(x) t\}$
E.g., $P(Y > y \mid x) = S_0((y - \beta^{\top} x)/b)$

ST495: Survival Analysis: Maximum likelihood

ST495: Survival Analysis: Maximum likelihood ST495: Survival Analysis: Maximum likelihood Eric B. Laber Department of Statistics, North Carolina State University February 11, 2014 Everything is deception: seeking the minimum of illusion, keeping

More information

ST745: Survival Analysis: Cox-PH!

ST745: Survival Analysis: Cox-PH! ST745: Survival Analysis: Cox-PH! Eric B. Laber Department of Statistics, North Carolina State University April 20, 2015 Rien n est plus dangereux qu une idee, quand on n a qu une idee. (Nothing is more

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS330 / MAS83 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-0 8 Parametric models 8. Introduction In the last few sections (the KM

More information

ST495: Survival Analysis: Hypothesis testing and confidence intervals

ST495: Survival Analysis: Hypothesis testing and confidence intervals ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014 I remember that one fateful day when Coach took

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

Chapter 4: Continuous Probability Distributions

Chapter 4: Continuous Probability Distributions Chapter 4: Continuous Probability Distributions Seungchul Baek Department of Statistics, University of South Carolina STAT 509: Statistics for Engineers 1 / 57 Continuous Random Variable A continuous random

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

Chapter 4: Continuous Random Variable

Chapter 4: Continuous Random Variable Chapter 4: Continuous Random Variable Shiwen Shen University of South Carolina 2017 Summer 1 / 57 Continuous Random Variable A continuous random variable is a random variable with an interval (either finite

More information

STAT 6350 Analysis of Lifetime Data. Probability Plotting

STAT 6350 Analysis of Lifetime Data. Probability Plotting STAT 6350 Analysis of Lifetime Data Probability Plotting Purpose of Probability Plots Probability plots are an important tool for analyzing data and have been particular popular in the analysis of life

More information

The Weibull Distribution

The Weibull Distribution The Weibull Distribution Patrick Breheny October 10 Patrick Breheny University of Iowa Survival Data Analysis (BIOS 7210) 1 / 19 Introduction Today we will introduce an important generalization of the

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Advanced Methods for Data Analysis (36-402/36-608 Spring 2014 1 Generalized linear models 1.1 Introduction: two regressions So far we ve seen two canonical settings for regression.

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require Chapter 5 modelling Semi parametric We have considered parametric and nonparametric techniques for comparing survival distributions between different treatment groups. Nonparametric techniques, such as

More information

Statistics for Engineers Lecture 4 Reliability and Lifetime Distributions

Statistics for Engineers Lecture 4 Reliability and Lifetime Distributions Statistics for Engineers Lecture 4 Reliability and Lifetime Distributions Chong Ma Department of Statistics University of South Carolina chongm@email.sc.edu February 15, 2017 Chong Ma (Statistics, USC)

More information

Lecture 4. Continuous Random Variables and Transformations of Random Variables

Lecture 4. Continuous Random Variables and Transformations of Random Variables Math 408 - Mathematical Statistics Lecture 4. Continuous Random Variables and Transformations of Random Variables January 25, 2013 Konstantin Zuev (USC) Math 408, Lecture 4 January 25, 2013 1 / 13 Agenda

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

10 Introduction to Reliability

10 Introduction to Reliability 0 Introduction to Reliability 10 Introduction to Reliability The following notes are based on Volume 6: How to Analyze Reliability Data, by Wayne Nelson (1993), ASQC Press. When considering the reliability

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued Chapter 3 sections Chapter 3 - continued 3.1 Random Variables and Discrete Distributions 3.2 Continuous Distributions 3.3 The Cumulative Distribution Function 3.4 Bivariate Distributions 3.5 Marginal Distributions

More information

Duration Analysis. Joan Llull

Duration Analysis. Joan Llull Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu Introduction Duration Analysis 2 Duration analysis Duration data: how long has an individual

More information

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching Mayya Komisarchik Harvard University April 13, 2016 1 Heartfelt thanks to all of the Gov 2001 TFs of yesteryear; this section draws heavily

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What? You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David

More information

Dynamic Models Part 1

Dynamic Models Part 1 Dynamic Models Part 1 Christopher Taber University of Wisconsin December 5, 2016 Survival analysis This is especially useful for variables of interest measured in lengths of time: Length of life after

More information

CSci 8980: Advanced Topics in Graphical Models Gaussian Processes

CSci 8980: Advanced Topics in Graphical Models Gaussian Processes CSci 8980: Advanced Topics in Graphical Models Gaussian Processes Instructor: Arindam Banerjee November 15, 2007 Gaussian Processes Outline Gaussian Processes Outline Parametric Bayesian Regression Gaussian

More information

Lecture 4 - Survival Models

Lecture 4 - Survival Models Lecture 4 - Survival Models Survival Models Definition and Hazards Kaplan Meier Proportional Hazards Model Estimation of Survival in R GLM Extensions: Survival Models Survival Models are a common and incredibly

More information

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution.

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution. POWER-LAW ADJUSTED SURVIVAL MODELS William J. Reed Department of Mathematics & Statistics University of Victoria PO Box 3060 STN CSC Victoria, B.C. Canada V8W 3R4 reed@math.uvic.ca Key Words: survival

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Accelerated Failure Time Models

Accelerated Failure Time Models Accelerated Failure Time Models Patrick Breheny October 12 Patrick Breheny University of Iowa Survival Data Analysis (BIOS 7210) 1 / 29 The AFT model framework Last time, we introduced the Weibull distribution

More information

3003 Cure. F. P. Treasure

3003 Cure. F. P. Treasure 3003 Cure F. P. reasure November 8, 2000 Peter reasure / November 8, 2000/ Cure / 3003 1 Cure A Simple Cure Model he Concept of Cure A cure model is a survival model where a fraction of the population

More information

MATH c UNIVERSITY OF LEEDS Examination for the Module MATH2715 (January 2015) STATISTICAL METHODS. Time allowed: 2 hours

MATH c UNIVERSITY OF LEEDS Examination for the Module MATH2715 (January 2015) STATISTICAL METHODS. Time allowed: 2 hours MATH2750 This question paper consists of 8 printed pages, each of which is identified by the reference MATH275. All calculators must carry an approval sticker issued by the School of Mathematics. c UNIVERSITY

More information

Multistate models and recurrent event models

Multistate models and recurrent event models and recurrent event models Patrick Breheny December 6 Patrick Breheny University of Iowa Survival Data Analysis (BIOS:7210) 1 / 22 Introduction In this final lecture, we will briefly look at two other

More information

Foundations of Statistical Inference

Foundations of Statistical Inference Foundations of Statistical Inference Jonathan Marchini Department of Statistics University of Oxford MT 2013 Jonathan Marchini (University of Oxford) BS2a MT 2013 1 / 27 Course arrangements Lectures M.2

More information

Textbook: Survivial Analysis Techniques for Censored and Truncated Data 2nd edition, by Klein and Moeschberger

Textbook: Survivial Analysis Techniques for Censored and Truncated Data 2nd edition, by Klein and Moeschberger Lecturer: James Degnan Office: SMLC 342 Office hours: MW 12:00 1:00 or by appointment E-mail: jamdeg@unm.edu Please include STAT474 or STAT574 in the subject line of the email to make sure I don t overlook

More information

Definition 1.1 (Parametric family of distributions) A parametric distribution is a set of distribution functions, each of which is determined by speci

Definition 1.1 (Parametric family of distributions) A parametric distribution is a set of distribution functions, each of which is determined by speci Definition 1.1 (Parametric family of distributions) A parametric distribution is a set of distribution functions, each of which is determined by specifying one or more values called parameters. The number

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

10. Composite Hypothesis Testing. ECE 830, Spring 2014

10. Composite Hypothesis Testing. ECE 830, Spring 2014 10. Composite Hypothesis Testing ECE 830, Spring 2014 1 / 25 In many real world problems, it is difficult to precisely specify probability distributions. Our models for data may involve unknown parameters

More information

Beyond GLM and likelihood

Beyond GLM and likelihood Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence

More information

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued Chapter 3 sections 3.1 Random Variables and Discrete Distributions 3.2 Continuous Distributions 3.3 The Cumulative Distribution Function 3.4 Bivariate Distributions 3.5 Marginal Distributions 3.6 Conditional

More information

Continuous random variables

Continuous random variables Continuous random variables Continuous r.v. s take an uncountably infinite number of possible values. Examples: Heights of people Weights of apples Diameters of bolts Life lengths of light-bulbs We cannot

More information

Multistate models and recurrent event models

Multistate models and recurrent event models Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

STAT 509 Section 3.4: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

STAT 509 Section 3.4: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. STAT 509 Section 3.4: Continuous Distributions Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. A continuous random variable is one for which the outcome

More information

ST5212: Survival Analysis

ST5212: Survival Analysis ST51: Survival Analysis 8/9: Semester II Tutorial 1. A model for lifetimes, with a bathtub-shaped hazard rate, is the exponential power distribution with survival fumction S(x) =exp{1 exp[(λx) α ]}. (a)

More information
