ST495: Survival Analysis: Hypothesis testing and confidence intervals

Similar documents
ST495: Survival Analysis: Maximum likelihood

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

ST745: Survival Analysis: Nonparametric methods

Mathematical Statistics

ST745: Survival Analysis: Cox-PH!

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Composite Hypotheses and Generalized Likelihood Ratio Tests

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

Statistical Inference. Hypothesis Testing

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

14.30 Introduction to Statistical Methods in Economics Spring 2009

Lecture 28: Asymptotic confidence sets

Statistical Data Analysis Stat 3: p-values, parameter estimation

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

Hypothesis testing: theory and methods

Chapter 8 - Statistical intervals for a single sample

Statistical Inference

Primer on statistics:

Introductory Econometrics

ST745: Survival Analysis: Parametric

10. Composite Hypothesis Testing. ECE 830, Spring 2014

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

A Very Brief Summary of Statistical Inference, and Examples

Math 494: Mathematical Statistics

Lecture 21: October 19

Statistical Methods in Particle Physics

parameter space Θ, depending only on X, such that Note: it is not θ that is random, but the set C(X).

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

Lecture 10: Generalized likelihood ratio test

Robustness and Distribution Assumptions

Can we do statistical inference in a non-asymptotic way? 1

STAT 6350 Analysis of Lifetime Data. Probability Plotting

Cherry Blossom run (1) The credit union Cherry Blossom Run is a 10 mile race that takes place every year in D.C. In 2009 there were participants

Confidence Intervals for Normal Data Spring 2014

ISQS 5349 Final Exam, Spring 2017.

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS

Estimation of Quantiles

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE

Interval Estimation. Chapter 9

Central Limit Theorem ( 5.3)

Math 494: Mathematical Statistics

Hypothesis Testing - Frequentist

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

General Linear Model: Statistical Inference

BIOS 2083 Linear Models c Abdus S. Wahed

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Goodness-of-Fit Testing with. Discrete Right-Censored Data

Bootstrap and Parametric Inference: Successes and Challenges

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

E. Santovetti lesson 4 Maximum likelihood Interval estimation

Loglikelihood and Confidence Intervals

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

New Bayesian methods for model comparison

Institute of Actuaries of India

STAT 461/561- Assignments, Year 2015

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

11 Survival Analysis and Empirical Likelihood

Comparing two independent samples

Inference for the Pareto, half normal and related. distributions

Theory of Statistical Tests

Frequentist Statistics and Hypothesis Testing Spring

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

Further Remarks about Hypothesis Tests

[y i α βx i ] 2 (2) Q = i=1

STAT 830 Non-parametric Inference Basics

Step-Stress Models and Associated Inference

Lecture 32: Asymptotic confidence sets and likelihoods

Ch. 5 Hypothesis Testing

Chapter 5: HYPOTHESIS TESTING

Review. December 4 th, Review

Basic Concepts of Inference

First Year Examination Department of Statistics, University of Florida

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Classical Inference for Gaussian Linear Models

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

Rank-Based Methods. Lukas Meier

One-parameter models

Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence

Pubh 8482: Sequential Analysis

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Lecture 26: Likelihood ratio tests

SPRING 2007 EXAM C SOLUTIONS

Goodness of Fit Goodness of fit - 2 classes

Confidence Intervals for Normal Data Spring 2018

Masters Comprehensive Examination Department of Statistics, University of Florida

Multiple Regression Analysis

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

simple if it completely specifies the density of x

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

EECS564 Estimation, Filtering, and Detection Exam 2 Week of April 20, 2015

Fitting a Straight Line to Data

Introduction to Statistical Inference

β j = coefficient of x j in the model; β = ( β1, β2,

Transcription:

ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014

I remember that one fateful day when Coach took me aside. I knew what was coming. You don t have to tell me, I said. I m off the team, aren t I? Well, said Coach, you never were really ON the team. You made that uniform you re wearing out of rags and towels, and your helmet is a toy space helmet. You show up at practice and then either steal the ball and make us chase you to get it back, or you try to tackle people at inappropriate times. It was all true what he was saying. And yet, I thought something is brewing inside the head of this Coach. He sees something in me, some kind of raw talent that he can mold. But that s when I felt the handcuffs go on. Philip Glass

Then and now Last time we discussed graphical methods QQ-plots PP-plots Linearized plots Today we ll discuss Who is Philip Glass Inference with parametric models Prediction intervals

Warm-up Explain to your stat buddy 1. What is a likelihood ratio test? 2. How can you invert a hypothesis test to form a confidence interval? 3. What is a prediction interval? How does this difference from a confidence interval? 4. Who was Philip Glass? True or false: (T/F) A likelihood ratio test is uniformly most powerful for simple hypotheses? (T/F) The ratio of two (scaled) independent chi-squared random variables has an F-distribution (T/F) The most expensive Donkey cheese in the world sells for several thousand dollars a pound

Inference and prediction intervals Quantities estimated from data should (must!) be accompanied by measures of uncertainty Hypothesis tests and confidence intervals regard population-level parameters, assess uncertainty and strength of evidence Prediction intervals reflect both estimation uncertainty and variability in the generative distribution

Shaking off the rust Suppose T 1,..., T n are i.i.d. with density f (t; θ) Inference and hypothesis testing typically concern θ Prediction regards uncertainty in a new draw Tn+1 from f (t; θ) Suppose X 1,..., X n are i.i.d. N(µ, 1), with your stat buddy: 1. Give a 95% confidence interval for µ 2. Construct a test of H 0 : µ = µ 0 vs. H a : µ µ 0 with α Type I error 3. Give a 95% prediction interval for X n+1

Confidence intervals Let {(T i, δ i )} n i=1 denote censored observation times, survival distn f (t; θ) ML approach for CIS: Construct MLE θ n Construct observed fisher information I ( θ) Use asymptotically normality of MLE I 1/2 ( θ n )( θ n θ) N(0, 1), to obtain θ n ± z 1 α/2 I 1/2 ( θ n )

The example you love to hate Let {(T i, δ i )} n i=1 denote observation times subject to non-informative right-censoring Assume the true survival distribution is exponential with density f (t; θ) = θ 1 exp t/θ construct a 95% CI for θ Answer: on board Problem! The log-lh is not approximated well by a quadratic in small-samples when the amount of censoring is large Recall precision varies inversely with number of uncensored observations (Why?) Normal approximation may be poor

The example you love to hate cont d Local approximation of log-lh by a quadratic, T exp(1) Log LH with n=100, Σδ i = 10 Log LH with n=100, Σδ i = 25 Log LH with n=100, Σδ i = 50 l(θ) 200 150 100 50 l(θ) 30 28 26 24 22 l(θ) 53 52 51 50 49 48 47 0.0 0.5 1.0 1.5 2.0 θ 0.4 0.6 0.8 1.0 1.2 1.4 1.6 θ 0.6 0.8 1.0 1.2 1.4 θ

Inverting a test: review Consider testing the null H 0 : θ = θ 0 vs. H a : θ θ 0 Let T n (θ 0 ) be a test statistic and c 1 α an α-level critical value so that the test rejects when T n (θ 0 ) > c 1 α E.g., H0 : µ = µ 0 vs. H a : µ µ 0 and T n (µ 0 ) = n( X n µ 0 ) 2 / σ 2, and c 1 α = χ 2 n 1,1 α If θ is the true value of θ then P (T n (θ ) c 1 α ) 1 α, thus {θ : T n (θ) c 1 α } is a valid (1 α) 100% CI for θ (Why?)

Inverting a test: likelihood Likelihood ratio statistic is Λ(θ) = 2l( θ) 2l(θ) If θ is the true parameter value then Λ(θ) is asymptotically χ 2 p where p = dim θ (1 α) 100% LH ratio CI for θ { θ : Λ(θ) χ 2 p,1 α } Often preferred to the standard MLE interval in small samples

LH ratio CI for an exponential RV {(T i, δ i )} n i=1 observation times subj to non-informative right-censoring, assume f (t; θ) = θ 1 exp{ t/θ} With your stat buddy: compute a LR CI for θ

LH ratio CI for an exponential RV cont d See lrtci.r

Sprott s adjustment Sprott showed that the transformation φ = θ 3 has a symmetrizing effect on the log-lh Idea: Construct usual ML based interval for φ using I 1/2 1 ( φ)( φ φ) N(0, 1), where I 1 ( φ) is the observed Fisher info for φ, then solve for a CI for θ This method performs similarly to the LRT and does not extend generally so we will not consider it further

Exact methods the exponential distribution Under certain censoring schemes exact confidence intervals are possible Type II censoring Testing with replacement These are somewhat specialized examples but you should know they exist as they are useful when applicable (see pp. 153-154)

Likelihood ratio test Goal is to test null hypothesis H 0 Restricted MLE is θ = arg max θ satisfies H0 L(θ) Ex. X 1,..., X n N(µ, σ 2 ) compute the restricted MLE under H 0 : µ = 1 Ex. X 1,..., X n N(µ, σ 2 ) compute the restricted MLE under H 0 : µ = σ LRT statistic Λ = 2l( θ) 2l( θ), when H 0 is true Λ χ 2 d where d is difference in the degrees of freedom between restricted and unrestricted models

LRT example Suppose we observe right-censored failure times from m different groups that we believe have exponential lifetime distns. Observed data {(Tij, δ ij )}, j = 1,..., n i, i = 1,..., m Want to test H0 : θ 1 = = θ m With your stat buddy: derive unrestricted and restricted MLEs Test statistic Λ = 2l( θ) 2l( θ) reject H0 if Λ > χ 2 m 1,1 α (Why?)

LRT example cont d See lrtfourmachines.r

Other distributions and censoring mechanisms As long as we have a likelihood we can apply these methods The book gives additional details for other distributions Gamma, Inverse-gaussian In more general settings numerical methods must be used to maximize the LH Since we know how to compute the LH for other censoring mechanisms (e.g., left, interval) we can apply the proposed methods directly Don t forget about bootstrap CIs! 1 1 Bootstrap tests are also possible but we won t discuss them in this class. See Bickel and Ren for info.

Prediction intervals Confidence intervals quantify uncertainty in estimated coefficients, the do not include uncertainty about future observations Give prognosis to newly diagnosed patient Predict failure time of new manufacturing process Implementing redundancies...

Prediction intervals cont d Let X 1,..., X n, X n+1 be iid, a (1 α) level prediction interval for X n+1 based on X 1,..., X n are functions L(X 1,..., X n ) and U(X 1,..., X N ) so that P {L(X 1,..., X n ) X n+1 U(X 1,..., X n )} 1 α E.g., in the warm-up example X 1,..., X n, X n+1 iid N(µ, 1), L(X 1,..., X n ) = X n z 1 α/2 1 + 1/n, and U(X 1,..., X n ) = X n + z 1 α/2 1 + 1/n

Pivots G(X 1,..., X n, X n+1 ) that does not depend on θ is called pivot X n X n+1 is a pivot the N(µ, 1) example Solve for a, b such that P (a G(X 1,..., X n, X n+1 ) b) 1 α, then {x : G(X 1,..., X n, x) b} is (1 α) 100% PI In survival problems obtaining a pivot can be difficult, instead: Plug-in intervals Projection intervals Calibrate the interval using simulation

Plug-in intervals Compute θ and take percentiles of distn f (t; θ) Asymptotically correct when θ is consistent Ignores uncertainty in estimated θ Ex. {(T i, δ i )} n i=1 censored lifetimes assumed exp(θ) Solve S(t; θ) = 1 p for t to obtain t p (θ) = log(1 p)θ 2 Approximate ( prediction interval) log(1 α/2) θ, log(α/2) θ See approxatepi.r 2 Here we re using f (t; θ) = θ 1 exp{ t/θ}

Projection intervals Plugin PIs ignore uncertainty in θ and thus may be anti-conservative Projection intervals incorporate uncertainty in θ but can be conservative Let C 1 α (θ 0 ) be a (1 α) 100% plugin PI using assumed parameter θ 0 Let ζ 1 η be a (1 η) 100% confidence region for θ Projection interval is a valid 1 α η PI (Why?) θ 0 ζ 1 η C 1 α (θ 0 ),

Calibration Investigate the performance of PI procedure view α as tuning parameter Choose α so that the coverage is near nominal levels in simulated examples