Estimation for Modified Data
|
|
- Brett Gray
- 5 years ago
- Views:
Transcription
1 Definition. Estimation for Modified Data 1. Empirical distribution for complete individual data (section 11.) An observation X is truncated from below ( left truncated) at d if when it is at or below d (X d) it is not recorded, but when it is above d it is recorded at its observed value X. An observation X is truncated from above ( left truncated) at u if when it is at or above u (X u) it is not recorded, but when it is below u it is recorded at its observed value X. An observation X is censored from below ( left censored) at d if when it is at or below d (X d) it is recorded as being equal to d, but when it is above d it is recorded at its observed value X. An observation X is censored from above ( right censored) at u if when it is at or above u (X u) it is recorded as being equal to u, but when it is below u it is recorded at its observed value X. The most common occurrences are left truncation and right censoring. Left truncation occurs when an ordinary deductible of d is applied. When a policyholder has a loss below d, he or she knows no benefits will be paid and so does not inform the insurer. When the loss is above d, the amount of the loss will be reported. A policy limit is an example of right censoring. When the amount of the loss exceeds u, benefits beyond that value are not paid, and so the exact value is not recorded. However, it is known that a loss of at least u has occurred. In chapters 11 and 1, truncated means truncated from below and censored always means censored from above. 1
2 For the observed values {x 1,,..., x n } let y 1 < y < < y k be the unique values of the x i s. For each j we associate a so-called risk set r j as follows: For survival/mortality data, the risk set is the number of people that are alive at age y j. For loss amount data, the risk set is the number of policies with observed loss amounts (either the actual amount or the maximum amount due to a policy limit) larger than or equal to y j less those with deductibles greater than or equal to y j. Let s j be the number of times the uncensored observation y j appears in the sample. We also use the following notations: For an individual data, let d j denote the truncation point of the j-th observation with d j = 0 in the absence of truncation. In a mortality table, it is the entry time. u j denote the censored point of the j-th observation. In the mortality table, it is the withdrawal time. x j denote the uncensored value (loss amount). In a mortality table, it is the death time. Definition. In terms of a mortality study, risk set at the j-th ordered observation y j, denoted by r j, is the number of individuals who are under observation at that age. r j = #(x i y j ) + #(u i y j ) #(d i y j ) Since the total number of d i s equals the sum of the number of x i s and the number of u i s, we can calculate r j in the following way too: r j = #(d i < y j ) #(x i < y j ) #(u i < y j ) = number of those who have entered prior to y j number of those who have left prior to y j For data consisting of loss amounts:
3 Risk set = the number of policies with observed loss amounts (either the actual amount or the maximum amount due to a policy limit) greater than or equal to y j less those with deductibles greater than or equal to y j. Example (Finan s notes - Example 51.1). The following mortality table is given: Life Time of Entry Time of exit Reason of exit End of Study Death Leaving Study End of Study End of Study 6 1. Death Death 8 3 Leaving Study End of Study Death (a) Complete the following table: (b) Create a table summarizing y j, s j, and r j 3
4 i d i x i u i Solution for part (a). Life Time of Entry Time of exit Reason of exit End of Study Death Leaving Study End of Study End of Study 6 1. Death Death 8 3 Leaving Study End of Study Death i d i x i u i entry death withdraw Solution for part (b). 4
5 The unique values of x i s are y j {0.5,, 3.}. Now form the extended table with asterisk assigned to y j values: all numbers in the table number of d i s number of x i s 1 1 number of u i s Then from the formula r j = #(x i y j ) + #(u i y j ) #(d i y j ) we can complete the following table j y j s j r j = = = 5 5
6 Definition. The Kaplan-Meier (product-limit) estimate for the survival function is given by 1 0 t < y 1 S n (t) = j 1 i=1 ( ) 1 s i r i y j 1 t < y j j =,..., k k i=1 ( ) 1 s i r i or 0 t y k where for the case of s k = r k, which means that nobody survives beyond age y k, we set S n (t) = 0 for t y k. Note. This estimator is a point estimation, as it gives a value for Ŝ(t) for each time t. Note. Note that the s i = #(x i : x i = y j ) gives the number of those who die at age y j, while r j = #(x i y j ) + #(u i y j ) #(d i y j ) gives the number of those who are at y j under study before any event occurs (death or withdrawal). So, the difference 1 s j r j gives the probability of surviving past time y j. = r j s j r j Example. Find the Kaplan-Meier estimate of the survival function for the data in the previous example. j y j s j r j = = = 5 6
7 S 10 (t) = 1 0 t < s 1 r 1 = t < ( ) (0.75) 1 s r = 0.45 t < 3. ( ) (0.45) 1 s 3 r 3 = 0.36 t 3. Note. The Kaplan-Meier estimation applied to a complete individual data is the same as the empirical survival function defined by S n (x) = 1 number of observations x n Note. For a survival function S(x) associated with a variable X we have this is because P (X x) = lim t x S(t) = S(x ) P (X x) = 1 P (X < x) = 1 F (x ) = S(x ) Example ( ). You are studying the length of time attorneys are involved in settling bodily injury lawsuits. T represents the number of months from the time an attorney is assigned such a case to the time the case is settled. Nine cases were observed during the study period, two of which were not settled at the conclusion of the study. For those two cases, the time spent up to the conclusion of the study, 4 months and 6 months, was recorded instead. The observed values of T for the other seven cases are as follows: Estimate P r(3 T 5) using the Product-Limit estimator. 7
8 i d i x i u i entry death withdraw j y j s j r j Ŝ(t) = t < = t < 3 ( ) 1 8 = 3 3 t < 5 ( ) = t < 8 ( ) 1 3 = t < ( 1 1 1) = 0 t 9 P (3 T 5) = P (T 3) P (T > 5) = Ŝ(3 ) Ŝ(5) = = Example. The claim payments on a sample of ten policies are: indicates that the loss exceeded the policy limit 8
9 Using the Product-Limit estimator, calculate the probability that the loss on a policy exceeds 8. i d i x i u i entry event censored j y j s j r j Ŝ(t) = 1 0 t < = 0.9 t < 3 (0.9) ( 1 9) = t < 5 (0.7) ( 1 9) = t < 6 (0.6) ( 1 1 5) = t < 7 (0.45) ( 1 4) 1 = t < 9 (0.36) ( 1 ) 1 = 0.18 t 9 Ŝ(8) = Ŝ(7) =
10 Definition. As in the case of complete data, the Nelson-Åalen Estimator cumulative hazard rate function is defined by: 0 0 t < y 1 Ĥ(t) = j 1 i=1 s i r i y j 1 t < y j j =,..., k k i=1 s i r i t y k and then set Ŝ(t) = e Ĥ(t) for the estimation of the survival function. Example. Find the Nelson-Åalen estimate of the survival function. 0 0 t < = t < Ĥ(t) = = 0.65 t < = 0.85 t t < 0.5 e 0.5 = t < Ŝ(t) = e 0.65 = 0.50 t < 3. e 0.85 = t 3. 10
11 Mean and Variance of Nelson-Åalen Estimator (from section 1.) Suppose that events have occurred at times t i. In the Nelson-Åalen estimation j i=1 s i r i Ĥ(y j ) the value s i r i is an estimation for the hazard rate function h(t i ). If we assume a Poisson distribution for the number of events at time t i, this distribution has mean r i h(t i ), which is approximated by r i ( si r i ) = s i. So, if S i represents the random number of events at time t i, then Var(S i ) s i as S i is assumed to follow a Poisson distribution with mean approximately s i. If we assume independence between the random variables S i, then Var(Ĥ(y j)) = Var ( j i=1 S i r i ) = Note. This formula is recursive: j i=1 ( ) Si Var = r i ( ) j Var(Si = i=1 r i j i=1 ( si r i ) for Var(Ĥ(y j)) = Var(Ĥ(y j 1)) + s j r j Example (from the Finan s note) For a survival study with censored and truncated data, you are given: Time (t) Number at risk Failures at at time t time t Estimate the variance of Ĥ(4). 11
12 Var(Ĥ(y 4 4)) = s i r i=1 i = = Example. Fifteen cancer patients were observed from the time of diagnosis until the earlier of death or 36 months from diagnosis. Deaths occurred during the study as follows: Time in Months Number of Since Diagnosis Deaths d The Nelson-Aalen estimate Ĥ(35) is Calculate the Nelson-Aalen estimate of the variance of Ĥ(35). First of all, note that d 8 because prior to t = 30 only 8 survivals remain in the group d d = After simplifying, this reduces to: d d = 1 By multiplying both sides by 8(8 d) gives: d(8 d) + 16 = 8(8 d) d 16d + 48 = 0 d = 4, 1 d 8 d = 4 The estimation for the variance will be: 1
13 =
14 Variance of Kaplan-Meier Estimator (from section 1.) The vairiance of the Kaplan-Meier estimator is: Var[Ŝ(y j)] = Ŝ(y j) j i=1 s i r i (r i s i ) and for t s other than y j s we have: { s Var[Ŝ(t)] = Ŝ(t) i r i (r i s i ) : } y i t We note that the quantity Ŝ(t) estimates the probability P (X > t). Now if one wants to estimate the variance of the estimator of a conditional probability t p s = P (X > t + s X > s), where s is one of the y j values, then the formula would be Var[P (X > t + s X > s)] = P (X > t + s X > s) { s i r i (r i s i ) : } s + 1 y i s + t Note. If the data is a complete one, then this formula reduces to the empirical approximation of the variance: Var[S n (x)] = Sn(x)(1 Sn(x)) n Example. The following payment data is given: 1500, , 550, 800, 800, , 5600 in here the plus sign means that the policy limit has been reached (note that in absence of deductible, the policy limit is the same as the maximum covered loss). Find the Greenwood s approximation for the variance of the Kaplan-Meier estimate of S(4000). 14
15 all numbers in the table number of x i s number of u i s 1 1 The unique values of y j s are 1500, 550, 800, 5600 The formula r j = #(x y j ) + #(u y j ) #(d y j ) still works fine, but since there is no d-value, it reduces to the formula r j = #(x y j ) + #(u y j ) j y j s j r j = = = S(4000) = ( 6 7 )(4 5 )( 4 ) = ) 3 Var (Ŝ(4000) = S(4000) i=1 [ s i r i (r i s i ) = (7)(6) + 1 (5)(4) + 1 (4)() ] = Example. In a five year mortality study, you are given: y j s j r j
16 Calculate Greenwood s approximation of the conditional variance of the product limit estimator of S(4). Ŝ(4) = ( 1 15 ) ( ) ( 0 5 ) ( ) 54 = [ ] Var(Ŝ(4)) = (15)(1) + 4 (80)(56) + 5 (5)(0) + 6 = (60)(54) Example given:. A mortality study is conducted on 50 lives observed from time zero. You are Time Number of Deaths Number Censored t d t c t (i) c (ii) Ŝ(35) is the Product-Limit estimate of S(35). (iii) Var(Ŝ(35)) ˆ is the estimate of Ŝ(35) using Greenwood s formula. (iv) Var(Ŝ(35)) ˆ ( Ŝ(35) ) = Determine c 30, the number censored at
17 all numbers in the table number of x i s 4 8 number of u i s 3 c number of d i s 50 Note that the initial time we d 0 = 50. We use the following formula to calculate r j s: r j = #(d < y j ) #(x < y j ) #(u < y j ) t j s j r j c c = ˆ Var(Ŝ(35)) (Ŝ(35) ) = (50)(48) + 4 (45)(41) + 8 (41 c)(33 c) (41 c)(33 c) = 945 solve quadratic equation c = 6 Note. For mortality data, the provability P (T > s + t T > s) is denoted by t p s and the probability of failing at or before time s + t given survival past s is denoted by t q s. The Kaplan-Meier estimation estimates the probabilities t p 0. If you want a conditional probability tp s, then you must calculate the Kaplan-Meier product starting at time s + 1 since tp s = P (T > s + t) P (T > s) = the product starting at s + 1 Example. For a survival study with censored and truncated data, you are given: 17
18 Number at Risk Time (t) at Time t Failures at Time t The probability of failing at or before Time 4, given survival past Time 1, is 3 q 1. Calculate Greenwood s approximation of the variance of 3 q 1. 3p 1 = ( 18 7 ) ( 6 3 ) ( ) 0 = (7)(18) + 6 (3)(6) + 5 (5)(5) = Var( 3 q 1 ) = Var(1 3 p 1 ) = Var( 3 p 1 ) = (0.4333) ( ) = Definition. If S(t) is the Kaplan-Meier estimator for the survival function (the empirical estimate is a special case of this), then the 100(1 α)% linear confidence interval is defined to be the interval with endpoints: S(t) ± z p+1 Var(S(t)) Similarly, if H(t) is Nelson-Aalen estimate of the cumulative hazard function, we have the 100(1 α)% linear confidence interval H(t) ± z p+1 Var(H(t)) But, this confidence interval may include negative values, and the linear confidence interval for S(t) may include numbers larger than 1. To avoid these problems, the delta method can be 18
19 used (details omitted) to find the following so-call log-transformed confidence intervals : (S(t) 1 U, S(t) U ) U = exp ( z p+1 Var(S(t)) S(t) ln S(t) ) ( H(t) U, U H(t) ) U = exp ( z p+1 Var(H(t)) H(t) ) Example. In the previous example, calculate the 95% log-transformed confidence interval for H(3), based on the Nelson-Aalen estimate. Ĥ(3) = = Var(Ĥ(3)) = = U = exp ( 1.96 ) = exp(0.4395) = confidence interval ( ) ( ) H U, HU = , (0.6875)(1.5519) = (0.4430, ) Example. Twelve policyholders were monitored from the starting date of the policy to the time of first claim. The observed data are as follows: Time of First claim Number of Claims 1 1 Using the Nelson-Aalen estimator, calculate the 95% linear confidence interval for the cumulative hazard rate function H(4.5). 19
20 y j r i s i H(4.5) = s 1 r 1 + s r + s 3 r 3 + s 4 r 4 = = Var(H(4.5)) = s 1 r1 + s r + s 3 r3 + s 4 r4 = = linear confidence interval : ± = (0.189, 1.361) Example. For a survival study, you are given: (i) The Product-Limit estimator Ŝ(t 0) is used to construct confidence intervals for S(t 0 ). (ii) The 95% log-transformed confidence interval for S(t 0 ) is (0.695, 0.843). Determine Ŝ(t 0). S 1 U = S U = U ln(s) = ln(0.695) U ln(s) = ln(0.843) Dividing the second row by the first row gives us: U = ln(0.843) ln(0.695) S = U = ( S 1 U ) U = = ln(0.843) ln(0.695) =
Nonparametric Model Construction
Nonparametric Model Construction Chapters 4 and 12 Stat 477 - Loss Models Chapters 4 and 12 (Stat 477) Nonparametric Model Construction Brian Hartman - BYU 1 / 28 Types of data Types of data For non-life
More informationAnalysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time
Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term
More informationPractice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:
Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation
More informationChapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n
Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 4 Fall 2012 4.2 Estimators of the survival and cumulative hazard functions for RC data Suppose X is a continuous random failure time with
More informationEstimation, Evaluation, and Selection of Actuarial Models 1
Estimation, Evaluation, and Selection of Actuarial Models 1 Stuart A. Klugman November 20, 2002 1 Copyright 2002 by Stuart A. Klugman. Reprinted with permission of the author. ii Contents 1 Introduction
More informationSurvival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University
Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation
More informationST745: Survival Analysis: Nonparametric methods
ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate
More informationSPRING 2007 EXAM C SOLUTIONS
SPRING 007 EXAM C SOLUTIONS Question #1 The data are already shifted (have had the policy limit and the deductible of 50 applied). The two 350 payments are censored. Thus the likelihood function is L =
More informationStatistics 262: Intermediate Biostatistics Non-parametric Survival Analysis
Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve
More informationCensoring and Truncation - Highlighting the Differences
Censoring and Truncation - Highlighting the Differences Micha Mandel The Hebrew University of Jerusalem, Jerusalem, Israel, 91905 July 9, 2007 Micha Mandel is a Lecturer, Department of Statistics, The
More informationTMA 4275 Lifetime Analysis June 2004 Solution
TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,
More informationLecture 4 - Survival Models
Lecture 4 - Survival Models Survival Models Definition and Hazards Kaplan Meier Proportional Hazards Model Estimation of Survival in R GLM Extensions: Survival Models Survival Models are a common and incredibly
More information18.465, further revised November 27, 2012 Survival analysis and the Kaplan Meier estimator
18.465, further revised November 27, 2012 Survival analysis and the Kaplan Meier estimator 1. Definitions Ordinarily, an unknown distribution function F is estimated by an empirical distribution function
More informationStat 475. Solutions to Homework Assignment 1
Stat 475 Solutions to Homework Assignment. Jimmy recently purchased a house for he and his family to live in with a $3, 3-year mortgage. He is worried that should he die before the mortgage is paid, his
More informationMultistate Modeling and Applications
Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)
More informationMulti-state models: prediction
Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications
More informationStatistical Inference and Methods
Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and
More informationSurvival Analysis. Stat 526. April 13, 2018
Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined
More informationSTAT Sample Problem: General Asymptotic Results
STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative
More informationCourse 4 Solutions November 2001 Exams
Course 4 Solutions November 001 Exams November, 001 Society of Actuaries Question #1 From the Yule-Walker equations: ρ φ + ρφ 1 1 1. 1 1+ ρ ρφ φ Substituting the given quantities yields: 0.53 φ + 0.53φ
More informationMAS3301 / MAS8311 Biostatistics Part II: Survival
MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the
More informationLecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL
Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates
More informationLet us use the term failure time to indicate the time of the event of interest in either a survival analysis or reliability analysis.
10.2 Product-Limit (Kaplan-Meier) Method Let us use the term failure time to indicate the time of the event of interest in either a survival analysis or reliability analysis. Let T be a continuous random
More informationKernel density estimation in R
Kernel density estimation in R Kernel density estimation can be done in R using the density() function in R. The default is a Guassian kernel, but others are possible also. It uses it s own algorithm to
More informationExam C Solutions Spring 2005
Exam C Solutions Spring 005 Question # The CDF is F( x) = 4 ( + x) Observation (x) F(x) compare to: Maximum difference 0. 0.58 0, 0. 0.58 0.7 0.880 0., 0.4 0.680 0.9 0.93 0.4, 0.6 0.53. 0.949 0.6, 0.8
More informationLecture 7. Poisson and lifetime processes in risk analysis
Lecture 7. Poisson and lifetime processes in risk analysis Jesper Rydén Department of Mathematics, Uppsala University jesper.ryden@math.uu.se Statistical Risk Analysis Spring 2014 Example: Life times of
More informationChapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample
Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,
More informationExercises. (a) Prove that m(t) =
Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for
More informationLecture 3. Truncation, length-bias and prevalence sampling
Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in
More informationLog-linearity for Cox s regression model. Thesis for the Degree Master of Science
Log-linearity for Cox s regression model Thesis for the Degree Master of Science Zaki Amini Master s Thesis, Spring 2015 i Abstract Cox s regression model is one of the most applied methods in medical
More information9. Estimating Survival Distribution for a PH Model
9. Estimating Survival Distribution for a PH Model Objective: Another Goal of the COX model Estimating the survival distribution for individuals with a certain combination of covariates. PH model assumption:
More information1 The problem of survival analysis
1 The problem of survival analysis Survival analysis concerns analyzing the time to the occurrence of an event. For instance, we have a dataset in which the times are 1, 5, 9, 20, and 22. Perhaps those
More informationEstimating transition probabilities for the illness-death model The Aalen-Johansen estimator under violation of the Markov assumption Torunn Heggland
Estimating transition probabilities for the illness-death model The Aalen-Johansen estimator under violation of the Markov assumption Torunn Heggland Master s Thesis for the degree Modelling and Data Analysis
More informationMay 2013 MLC Solutions
May 3 MLC Solutions. Key: D Let k be the policy year, so that the mortality rate during that year is q 3 + k. The objective is to determine the smallest value of k such that k k v ( k p3) ( P3) < v ( k
More informatione 4β e 4β + e β ˆβ =0.765
SIMPLE EXAMPLE COX-REGRESSION i Y i x i δ i 1 5 12 0 2 10 10 1 3 40 3 0 4 80 5 0 5 120 3 1 6 400 4 1 7 600 1 0 Model: z(t x) =z 0 (t) exp{βx} Partial likelihood: L(β) = e 10β e 10β + e 3β + e 5β + e 3β
More informationMLC Fall Model Solutions. Written Answer Questions
MLC Fall 216 Model Solutions Written Answer Questions 1 MLC Fall 216 Question 1 Model Solution Learning Outcome: 1(a), 1(b), 1(d) Chapter References: AMLCR Chapter 8 Notes: 1. Part (a) was done very well
More informationRight-truncated data. STAT474/STAT574 February 7, / 44
Right-truncated data For this data, only individuals for whom the event has occurred by a given date are included in the study. Right truncation can occur in infectious disease studies. Let T i denote
More informationCIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis
CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation
More informationSurvival Regression Models
Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant
More informationSolutions to the Spring 2015 CAS Exam ST
Solutions to the Spring 2015 CAS Exam ST (updated to include the CAS Final Answer Key of July 15) There were 25 questions in total, of equal value, on this 2.5 hour exam. There was a 10 minute reading
More informationMulti-state Models: An Overview
Multi-state Models: An Overview Andrew Titman Lancaster University 14 April 2016 Overview Introduction to multi-state modelling Examples of applications Continuously observed processes Intermittently observed
More informationNotes for Math 324, Part 20
7 Notes for Math 34, Part Chapter Conditional epectations, variances, etc.. Conditional probability Given two events, the conditional probability of A given B is defined by P[A B] = P[A B]. P[B] P[A B]
More informationEmpirical Processes & Survival Analysis. The Functional Delta Method
STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1 Objectives By the end of this lecture, you will
More informationCourse 1 Solutions November 2001 Exams
Course Solutions November Exams . A For i =,, let R = event that a red ball is drawn form urn i i B = event that a blue ball is drawn from urn i. i Then if x is the number of blue balls in urn, ( R R)
More informationSurvival Analysis I (CHL5209H)
Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really
More information1. If X has density. cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. f(x) =
1. If X has density f(x) = { cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. 2. Let X have density f(x) = { xe x, 0 < x < 0, otherwise. (a) Find P (X > 2). (b) Find
More informationSurvival Distributions, Hazard Functions, Cumulative Hazards
BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution
More informationThe Weibull in R is actually parameterized a fair bit differently from the book. In R, the density for x > 0 is
Weibull in R The Weibull in R is actually parameterized a fair bit differently from the book. In R, the density for x > 0 is f (x) = a b ( x b ) a 1 e (x/b) a This means that a = α in the book s parameterization
More informationStat 476 Life Contingencies II. Multiple Life and Multiple Decrement Models
Stat 476 Life Contingencies II Multiple Life and Multiple Decrement Models Multiple Life Models To this point, all of the products we ve dealt with were based on the life status (alive / dead or multi-state)
More information4 Testing Hypotheses. 4.1 Tests in the regression setting. 4.2 Non-parametric testing of survival between groups
4 Testing Hypotheses The next lectures will look at tests, some in an actuarial setting, and in the last subsection we will also consider tests applied to graduation 4 Tests in the regression setting )
More informationSPRING 2007 EXAM MLC SOLUTIONS
SPRING 7 EXAM MLC SOLUTIONS Question # Key: E p p.95 3 7 7 p7.96 75 μ 7 d.7.9896 4p7 e e.8985 5p 7.9896.8985.889 Question # Key: B A μ/ μ+ δ μ/ μ+.8.3443 μ.4 A μ.79 μ+ δ Var a T A A.79.3443 δ.8 3.96 Question
More information10 Introduction to Reliability
0 Introduction to Reliability 10 Introduction to Reliability The following notes are based on Volume 6: How to Analyze Reliability Data, by Wayne Nelson (1993), ASQC Press. When considering the reliability
More informationDefinitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen
Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation
More informationLecture 5 Models and methods for recurrent event data
Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.
More informationNovember 2000 Course 1. Society of Actuaries/Casualty Actuarial Society
November 2000 Course 1 Society of Actuaries/Casualty Actuarial Society 1. A recent study indicates that the annual cost of maintaining and repairing a car in a town in Ontario averages 200 with a variance
More information11 Survival Analysis and Empirical Likelihood
11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with
More informationSurvival Analysis. STAT 526 Professor Olga Vitek
Survival Analysis STAT 526 Professor Olga Vitek May 4, 2011 9 Survival Data and Survival Functions Statistical analysis of time-to-event data Lifetime of machines and/or parts (called failure time analysis
More informationSubject CT4 Models. October 2015 Examination INDICATIVE SOLUTION
Institute of Actuaries of India Subject CT4 Models October 2015 Examination INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the aim of helping candidates.
More informationIntroduction to repairable systems STK4400 Spring 2011
Introduction to repairable systems STK4400 Spring 2011 Bo Lindqvist http://www.math.ntnu.no/ bo/ bo@math.ntnu.no Bo Lindqvist Introduction to repairable systems Definition of repairable system Ascher and
More informationTypical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction
Outline CHL 5225H Advanced Statistical Methods for Clinical Trials: Survival Analysis Prof. Kevin E. Thorpe Defining Survival Data Mathematical Definitions Non-parametric Estimates of Survival Comparing
More informationREGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520
REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU
More informationUNIVERSITY OF CALIFORNIA, SAN DIEGO
UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department
More informationThe Design of a Survival Study
The Design of a Survival Study The design of survival studies are usually based on the logrank test, and sometimes assumes the exponential distribution. As in standard designs, the power depends on The
More informationDescription Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see
Title stata.com stcrreg postestimation Postestimation tools for stcrreg Description Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see
More informationErrata for the ASM Study Manual for Exam P, Fourth Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA
Errata for the ASM Study Manual for Exam P, Fourth Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA (krzysio@krzysio.net) Effective July 5, 3, only the latest edition of this manual will have its
More informationFall 2003 Society of Actuaries Course 3 Solutions = (0.9)(0.8)(0.3) + (0.5)(0.4)(0.7) (0.9)(0.8)(0.5)(0.4) [1-(0.7)(0.3)] = (0.
Fall 3 Society of Actuaries Course 3 Solutions Question # Key: E q = p p 3:34 3:34 3 3:34 p p p 3 34 3:34 p 3:34 p p p p 3 3 3 34 3 3:34 3 3:34 =.9.8 =.7 =.5.4 =. =.7. =.44 =.7 +..44 =.776 =.7.7 =.54 =..3
More informationAnalysis of Time-to-Event Data: Chapter 6 - Regression diagnostics
Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Residuals for the
More informationINSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS
INSTITUTE AND FACULTY OF ACTUARIES Curriculum 09 SPECIMEN SOLUTIONS Subject CSA Risk Modelling and Survival Analysis Institute and Faculty of Actuaries Sample path A continuous time, discrete state process
More informationAsymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.
Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical
More informationMLC Fall 2015 Written Answer Questions. Model Solutions
MLC Fall 215 Written Answer Questions Model Solutions 1 MLC Fall 215 Question 1 Model Solution Learning Outcome: 1(a) Chapter References: AMLCR 2.2-2.5 Overall, this question was done reasonably well,
More informationSurvival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek
Survival Analysis 732G34 Statistisk analys av komplexa data Krzysztof Bartoszek (krzysztof.bartoszek@liu.se) 10, 11 I 2018 Department of Computer and Information Science Linköping University Survival analysis
More informationTextbook: Survivial Analysis Techniques for Censored and Truncated Data 2nd edition, by Klein and Moeschberger
Lecturer: James Degnan Office: SMLC 342 Office hours: MW 12:00 1:00 or by appointment E-mail: jamdeg@unm.edu Please include STAT474 or STAT574 in the subject line of the email to make sure I don t overlook
More informationRMSC 2001 Introduction to Risk Management
RMSC 2001 Introduction to Risk Management Tutorial 4 (2011/12) 1 February 20, 2012 Outline: 1. Failure Time 2. Loss Frequency 3. Loss Severity 4. Aggregate Claim ====================================================
More informationSurvival Analysis Math 434 Fall 2011
Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup
More informationChapter 7: Hypothesis testing
Chapter 7: Hypothesis testing Hypothesis testing is typically done based on the cumulative hazard function. Here we ll use the Nelson-Aalen estimate of the cumulative hazard. The survival function is used
More information2011/04 LEUKAEMIA IN WALES Welsh Cancer Intelligence and Surveillance Unit
2011/04 LEUKAEMIA IN WALES 1994-2008 Welsh Cancer Intelligence and Surveillance Unit Table of Contents 1 Definitions and Statistical Methods... 2 2 Results 7 2.1 Leukaemia....... 7 2.2 Acute Lymphoblastic
More informationNotes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes.
Unit 2: Models, Censoring, and Likelihood for Failure-Time Data Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Ramón
More informationCreating New Distributions
Creating New Distributions Section 5.2 Stat 477 - Loss Models Section 5.2 (Stat 477) Creating New Distributions Brian Hartman - BYU 1 / 18 Generating new distributions Some methods to generate new distributions
More informationMeei Pyng Ng 1 and Ray Watson 1
Aust N Z J Stat 444), 2002, 467 478 DEALING WITH TIES IN FAILURE TIME DATA Meei Pyng Ng 1 and Ray Watson 1 University of Melbourne Summary In dealing with ties in failure time data the mechanism by which
More informationErrata and updates for ASM Exam C/Exam 4 Manual (Sixth Edition) sorted by date
Errata for ASM Exam C/4 Study Manual (Sixth Edition) Sorted by Date 1 Errata and updates for ASM Exam C/Exam 4 Manual (Sixth Edition) sorted by date Please note that the SOA has announced that when using
More informationConstrained estimation for binary and survival data
Constrained estimation for binary and survival data Jeremy M. G. Taylor Yong Seok Park John D. Kalbfleisch Biostatistics, University of Michigan May, 2010 () Constrained estimation May, 2010 1 / 43 Outline
More informationMachine Learning. Module 3-4: Regression and Survival Analysis Day 2, Asst. Prof. Dr. Santitham Prom-on
Machine Learning Module 3-4: Regression and Survival Analysis Day 2, 9.00 16.00 Asst. Prof. Dr. Santitham Prom-on Department of Computer Engineering, Faculty of Engineering King Mongkut s University of
More informationFrequency Distributions
Mahlerʼs Guide to Frequency Distributions Joint Exam 4/C prepared by Howard C. Mahler, FCAS Copyright 2012 by Howard C. Mahler. Study Aid 2012-4-1 Howard Mahler hmahler@mac.com www.howardmahler.com/teaching
More informationErrata and updates for ASM Exam C/Exam 4 Manual (Sixth Edition) sorted by page
Errata for ASM Exam C/4 Study Manual (Sixth Edition) Sorted by Page Errata and updates for ASM Exam C/Exam 4 Manual (Sixth Edition) sorted by page Please note that the SOA has announced that when using
More informationPhilosophy and Features of the mstate package
Introduction Mathematical theory Practice Discussion Philosophy and Features of the mstate package Liesbeth de Wreede, Hein Putter Department of Medical Statistics and Bioinformatics Leiden University
More informationKaplan-Meier in SAS. filename foo url "http://math.unm.edu/~james/small.txt"; data small; infile foo firstobs=2; input time censor; run;
Kaplan-Meier in SAS filename foo url "http://math.unm.edu/~james/small.txt"; data small; infile foo firstobs=2; input time censor; run; proc print data=small; run; proc lifetest data=small plots=survival;
More informationAnalysis of competing risks data and simulation of data following predened subdistribution hazards
Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013
More informationExam C. Exam C. Exam C. Exam C. Exam C
cumulative distribution function distribution function cdf survival function probability density function density function probability function probability mass function hazard rate force of mortality
More informationCox s proportional hazards model and Cox s partial likelihood
Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.
More informationLecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016
Proportional Hazards Model - Handling Ties and Survival Estimation Statistics 255 - Survival Analysis Presented February 4, 2016 likelihood - Discrete Dan Gillen Department of Statistics University of
More informationSTAT 6385 Survey of Nonparametric Statistics. Order Statistics, EDF and Censoring
STAT 6385 Survey of Nonparametric Statistics Order Statistics, EDF and Censoring Quantile Function A quantile (or a percentile) of a distribution is that value of X such that a specific percentage of the
More informationASM Study Manual for Exam P, Second Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata
ASM Study Manual for Exam P, Second Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA (krzysio@krzysio.net) Errata Effective July 5, 3, only the latest edition of this manual will have its errata
More informationf X (x) = λe λx, , x 0, k 0, λ > 0 Γ (k) f X (u)f X (z u)du
11 COLLECTED PROBLEMS Do the following problems for coursework 1. Problems 11.4 and 11.5 constitute one exercise leading you through the basic ruin arguments. 2. Problems 11.1 through to 11.13 but excluding
More informationIntroduction to BGPhazard
Introduction to BGPhazard José A. García-Bueno and Luis E. Nieto-Barajas February 11, 2016 Abstract We present BGPhazard, an R package which computes hazard rates from a Bayesian nonparametric view. This
More informationPart III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data
1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions
More informationLecture 22 Survival Analysis: An Introduction
University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which
More informationStatistical Methods for Alzheimer s Disease Studies
Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations
More informationExtensions of Cox Model for Non-Proportional Hazards Purpose
PhUSE Annual Conference 2013 Paper SP07 Extensions of Cox Model for Non-Proportional Hazards Purpose Author: Jadwiga Borucka PAREXEL, Warsaw, Poland Brussels 13 th - 16 th October 2013 Presentation Plan
More informationRMSC 2001 Introduction to Risk Management
RMSC 2001 Introduction to Risk Management Tutorial 4 (2011/12) 1 February 20, 2012 Outline: 1. Failure Time 2. Loss Frequency 3. Loss Severity 4. Aggregate Claim ====================================================
More information