Duration Analysis. Joan Llull

Size: px
Start display at page:

Download "Duration Analysis. Joan Llull"

Transcription

1 Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu

2 Introduction Duration Analysis 2

3 Duration analysis Duration data: how long has an individual been in a state when exiting from it (e.g. weeks unemployed). Examples: unemployment duration, marriage duration, life expectancy, rms exit from the market,... We want to analyze: Why durations dier across individuals? How and why do exit probabilities vary over time? Long tradition in biometrics: surviving probabilities, hazard functions,... Duration Analysis 3

4 Duration data We denote these durations for the N observations as t 1, t 2,..., t N. These data are typically censored. E.g.: Individuals may not nd a job before the interview (we observe that t > t, but not t). Individuals may nd a job between selection and interview, and we cannot interview them (we observe t < t < t) One of the main motivations for duration analysis: dealing with censoring Duration Analysis 4

5 Figure I. Two examples of censored observations i. Example 1 ii. Example 2 Individual Jan90 Jul90 Jan91 Jul91 Jan92 Date Individual Jan90 Jul90 Jan91 Jul91 Jan92 Date Note: Black lines represent the time when the individual was unemployed. A dot indicates that the individual is still unemployed at that date, but we do not have further information about him/her. Vertical red dashed lines in Example 2 are interview dates. Duration Analysis 5

6 The Hazard Function Duration Analysis 6

7 The hazard function Hazard function: probability (or density) of exiting at t conditional on being alive. It can be time-varying (e.g. mortality) or constant. We analyze discrete and continuous hazard functions. Why do we care? Theoretically appealing (e.g. job arrival rate). Empirically convenient (binomial discrete choice+censoring). We start from unconditional hazards and then add regressors. Duration Analysis 7

8 Figure II. Mortality Hazard Rate Hazard function Age Note: The line depicts the hazard mortality rate, i.e. the probability of dying at age a conditional on survival until that age. Duration Analysis 8

9 Hazard function for a discrete variable Probability mass function for t: p(τ) = Pr(t = τ) for τ = 1, 2,... Cdf for t: F (t) = p(1) + p(2) p(t). Hazard function for t: Pr(t = τ) h(τ) = Pr(t = τ t τ) = Pr(t τ) = p(τ) 1 F (τ 1) We can recover p(t) and F (t) from h(t) recursively: F (τ) F (τ 1) =. 1 F (τ 1) 1 h(t) = 1 F (t) t 1 F (t 1) F (t) = 1 (1 h(s)) (Interpretation of these expressions) s=1 t 1 p(t) = (1 F (t 1))h(t) =h(t) (1 h(s)). s=1 Duration Analysis 9

10 Hazard function for a continuous variable Pdf function for t: f(τ) = lim dt 0 Pr(τ t<τ+dt) dt. Cdf for t: F (t) = t 0 f(u)du. Hazard function for t: / Pr(τ t < τ + dt t τ) Pr(τ t < τ + dt) h(τ) = lim = lim dt = f(τ) dt 0 dt dt 0 Pr(t τ) 1 F (τ). We can recover p(t) and F (t) with the integrated hazard H(t) = t 0 h(s)ds: H(t) = ln[1 F (t)] F (t) = 1 exp[ H(t)] f(t) = F (t) t =h(t) exp[ H(t)] =h(t)(1 F (t)). (Comparison with discrete through ln[1 F (t)]) Duration Analysis 10

11 Some frequently used hazard functions In discrete duration, we can estimate semi-parametric hazards. In continuous duration (and sometimes in discrete), we usually assume a given parametric function. Two frequent examples: Constant hazard. Weibull distribution. Duration Analysis 11

12 Constant hazard: the exponential distribution The simplest hazard function is a constant hazard: h(t) = λ. In the continuous case: H(t) = t 0 λdu = λt F (t) = 1 e λt f(t) = λe λt. In the discrete case: F (t) = 1 (1 λ) t p(t) = λ(1 λ) t 1. Having a constant hazard is the memoryless property of the exponential. Also E[T ] = 1/λ. Duration Analysis 12

13 Figure III. Examples of Hazard Functions (I): Constant i. h(t) ii. H(t) iii. f(t) iv. F (t) Hazard rate [h(t)] Duration [t] Integrated hazard [H(t)] Duration [t] Prob. density function [f(t)] Duration [t] Cum. distribution function [F(t)] Duration (t) Note: Black: λ = 0.12; gray: λ = These examples depict the hazard function (left), the integrated hazard (center-left), the pdf or probability mass function (center-right) and the cdf (right) of a constant hazard model with hazard equal to λ Duration Analysis 13

14 The Weibull distribution A generalization of the exponential distribution is: F (t) = 1 e (λt)α f(t) = αλ α t α 1 e (λt)α. This gives the following hazard function: h(t) = αλ α t α 1. It is monotonically increasing, decreasing, or constant depending on whether α ><= 1 Duration Analysis 14

15 Figure IV. Examples of Hazard Functions (II): Weibull i. h(t) ii. H(t) iii. f(t) iv. F (t) Hazard rate [h(t)] Duration [t] Integrated hazard [H(t)] Duration [t] Prob. density function [f(t)] Duration [t] Cum. distribution function [F(t)] Duration (t) Note: Black: λ = 0.12; gray: λ = Solid: α = 1; dotted: α = 0.5; dashed: α = 1.5; dot-dashed: α = 3. These examples depict the hazard function (left), the integrated hazard (center-left), the pdf or probability mass function (center-right) and the cdf (right) of a Weibull hazard model with parameters λ and α. Duration Analysis 15

16 Conditional Hazard Functions Duration Analysis 16

17 Including covariates to the hazard model We want to see how duration is aected by covariates x. We are interested in the conditional hazard: h(t, x) = f(t x) 1 F (t x). Duration Analysis 17

18 Proportional hazard model It was introduced by Cox (1972). It is a.k.a. Cox regression or Cox model. Main advantage: simplicity. It is a factorized model: h(t, x) = λ(t) exp(x β). λ(t) can be either constant λ(t) = 1 or Weibull λ(t) = αt α 1 (the scale cannot be separately identied from the constant) λ(t) is called the baseline hazard. Duration Analysis 18

19 Discrete durations Now conditional hazard rates are probabilities. One approach: treat spells as continuous and use the PH model. However, it is natural to use discrete choice models to specify the exit rate: h(τ, x) = Pr(t = τ t τ, x) = G(γ τ + x β τ ), where G(.) is a cdf (e.g. normal or logistic). Dierent ways to specify baseline hazard: Leave them free: γ t = T j=1 γj 1{t = j}. (why T?) Specify them as polynomials: e.g. γ t = γ 0 + γ 1 ln t + γ 2(ln t) Assuming γ t free delivers a semi-parametric estimation of λ(t) that allows us to test parametric assumptions. Duration Analysis 19

20 Likelihood functions Duration Analysis 20

21 Complete durations Assume that we observe {t 1, t 2,..., t N }. The log-likelihood function is: L N = where: N ln f(t i x i ), i=1 f(t x) = h(t, x) exp( H(t, x)) = λ(t) exp(x β) exp { Λ(t) exp(x β)}. Dierent expressions for λ(t) and Λ(t) depending on baseline hazards. Duration Analysis 21

22 Our two examples of censored durations Example 1: We observe t if t < t; otherwise we only observe that t > t. The log-likelihood function is: N { L N = wi ln f(t i x i) + (1 w i) ln(1 F (t } i x i)), w i = 1{t i < t i}. i=1 Example 2: We observe whether whether t < t 1, t 1 < t < t 2, or t > t 2. The log-likelihood function is: N { ( ) L N = wi 1 ln F (t 1 i x i) + wi 2 ln F (t 2 i x i) F (t 1 i x i) i=1 where w 1 i = 1{t i < t 1 i }, and w 2 i = 1{t 1 i < t i < t 2 i }. ( )} +(1 wi 1 wi 2 ) ln 1 F (t 2 i x i), Duration Analysis 22

23 An additional example Example 2 modied: Same as example 2, but with initial (known) durations d 1, d 2,..., d N. The log-likelihood function is: { N L N = wi 1 ln F (di + t1 i x i) F (d i x i) 1 F (d i x i) i=1 + wi 2 ln F (di + t2 i x i) F (d i + t 1 i x i) 1 F (d i x i) } +(1 wi 1 wi 2 ) ln 1 F (di + t2 i x i), 1 F (d i x i) where w 1 i = 1{d i < t i < d i + t 1 i }, and w 2 i = 1{d i + t 1 i < d i + t i < t 2 i }. Duration Analysis 23

24 Discrete durations The log-likelihood of a discrete duration model can be written as: L N = N T w iτ {y iτ ln G(γ τ + x iβ τ ) + (1 y iτ ) ln(1 G(γ τ + x iβ τ ))}, i=1 τ=1 where y iτ = 1{t i = τ}, and w iτ = 1{t i τ}. Two types of contributions: Spells that end at time τ: τ 1 ln Pr(t = τ x) = ln h(τ, x) + ln(1 h(s, x)). Spells that are incomplete at time T : s=1 ln Pr(t > T x) = ln(1 h(s, x)). T s=1 This problem is a sequence of binary choice models. If censoring occurs at dierent durations: T i = min{ t i, T }. Duration Analysis 24

25 Unobserved Heterogeneity Duration Analysis 25

26 State dependence vs unobserved heterogeneity Consider a regressor x = {0, 1}, and a constant proportional hazard model such that: h(t, x = 0) = h 0 h(t, x = 1) = h 1 h 1 > h 0. The aggregate hazard is: h(τ) = h 1 Pr(x = 1 t τ) + h 0 Pr(x = 0 t τ). This aggregate hazard is not constant anymore given that Pr(x = 1 t τ) decreases over time spurious state dependence. Duration Analysis 26

27 Figure V. An Example with Unobserved Heterogeneity Hazard function Note: Duration (t) Black dashed: h 1 (hazard rate when x = 1); gray dashed: h 0 (hazard rate when x = 0); gray dashed: observed (unconditional) hazard. Duration Analysis 27

28 The PH with unobserved heterogeneity Lancaster (1979) includes unobserved heterogeneity as a multiplicative random eect: h(t, x, ν) = λ(t) exp(x β)ν, where ν is independent of x with E[ν] = 1 and with pdf g(ν) (e.g. gamma). The cdf conditional on ν is: F (t x, ν) = 1 exp [ t 0 ] h(u, x, ν)du, and the unconditional cdf is: F (t x) = 0 F (t x, v)g(v)dv, Important remark: identication requires inclusion of exogenous regressors! Duration Analysis 28

29 Unobserved heterogeneity with discrete data Similarly to the continuous case: Pr(t = τ x) = Pr(t = τ x, v)g(v)dv = τ 1 h(τ, x, v) [1 h(s, x, v)]g(v)dv, s=1 where, for instance: h(t, x, ν) = G(γ t + x β t + ν). A frequently used specication for g(v) is a discrete-support mass point distribution: {ν 1,..., ν m} with probabilities {p 1,.., p m}: { } m τ 1 Pr(t = τ x) = h(τ, x, ν j) [1 h(s, x, ν j)]p j, j=1 s=1 and ν j and p j are treated as additional parameters. Duration Analysis 29

30 Multiple-exit Discrete Duration Models Duration Analysis 30

31 Transition intensities vs conditional hazard rates Let d 1 and d 2 be indicators for the two potential alternatives where to exit (e.g temporary or permanent job). We can dene the following intensities of transition to each state: φ j(τ) = Pr(t = τ, d j = 1 t τ), j = 1, 2. The unconditional hazard rates are: h(τ) = Pr(t = τ t τ) = φ 1(τ) + φ 2(τ). Also, we can dene conditional hazard rates as: h j(τ) = Pr(y jτ = 1 t τ, y kτ = 0), j k where y jτ = y τ d j = 1{t = τ, d j = 1}. The mapping between the two expressions is as follows: h j(τ) = Pr(yjτ = 1 t τ) Pr(y kτ = 0 t τ) = φj(t) 1 φ k (t). We can model either of the two. For instance, a MNL for φ's delivers a binary logit for h's with the same parameters. Duration Analysis 31

32 Competing risk models The models for conditional hazards h 1 (t) and h 2 (t) are often called competing risk models. This name derives from considering two latent variables t 1 and t 2, such that the observed duration is t = min{t 1, t 2}. If t 1 and t 2 are independent, h 1 (t) and h 2 (t) can be interpreted as exit rates of latent durations: h j (τ) = Pr(t j = τ t j τ). This implies that when we analyze exits to alternative 1, we take exits to 2 as censored observations. Duration Analysis 32

33 FIML estimation The log-likelihood is analogous to the discrete case: N T L N = w iτ {y iτ (d 1i ln φ 1(τ, x i) + d 2i ln φ 2(τ, x i)) i=1 τ=1 where y iτ = 1{t i = τ}, and w iτ = 1{t i τ}. Three types of contributions: Spells that end at time τ exiting to option 1: +(1 y iτ ) ln(1 φ 1(τ, x i) φ 2(τ, x i))}, τ 1 ln Pr(t = τ, d 1 = 1 x) = ln φ 1(τ, x) + ln(1 φ 1(s, x) φ 2(s, x)). s=1 Spells that end at time τ exiting to option 2: τ 1 ln Pr(t = τ, d 2 = 1 x) = ln φ 2(τ, x) + ln(1 φ 1(s, x) φ 2(s, x)). Spells that are incomplete at time T : s=1 ln Pr(t > T x) = ln(1 φ 1(s, x) φ 2(s, x)). T s=1 Duration Analysis 33

34 LIML estimation Alternatively, we can estimate the model separately for the two alternatives in a competing risk fashion. In this case, the conditional log-likelihood function for alternative j is: N T L Nj = w iτ {y ijτ ln h j(τ, x i) + (1 y ijτ ) ln(1 h j(τ, x i))}, or, equivalently: i=1 τ=1 N T L Nj = w iτ {y iτ d j ln h j(τ, x i) + (1 y iτ d j) ln(1 h j(τ, x i))}. i=1 τ=1 where y ijτ = 1{t i = τ, d ij = 1}, y iτ = 1{t i = τ}, and w iτ = 1{t i τ}, with t i = min{t 1i, t 2i}. Duration Analysis 34

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Stock Sampling with Interval-Censored Elapsed Duration: A Monte Carlo Analysis

Stock Sampling with Interval-Censored Elapsed Duration: A Monte Carlo Analysis Stock Sampling with Interval-Censored Elapsed Duration: A Monte Carlo Analysis Michael P. Babington and Javier Cano-Urbina August 31, 2018 Abstract Duration data obtained from a given stock of individuals

More information

ECON 721: Lecture Notes on Duration Analysis. Petra E. Todd

ECON 721: Lecture Notes on Duration Analysis. Petra E. Todd ECON 721: Lecture Notes on Duration Analysis Petra E. Todd Fall, 213 2 Contents 1 Two state Model, possible non-stationary 1 1.1 Hazard function.......................... 1 1.2 Examples.............................

More information

Dynamic Models Part 1

Dynamic Models Part 1 Dynamic Models Part 1 Christopher Taber University of Wisconsin December 5, 2016 Survival analysis This is especially useful for variables of interest measured in lengths of time: Length of life after

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching Mayya Komisarchik Harvard University April 13, 2016 1 Heartfelt thanks to all of the Gov 2001 TFs of yesteryear; this section draws heavily

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Search frictions: some empirical evidence

Search frictions: some empirical evidence 1 Search frictions: some empirical evidence Models of the labor market with search frictions deliver a number of testable implications for labor market flows unemployment inflows and outflows equilibrium

More information

7.1 The Hazard and Survival Functions

7.1 The Hazard and Survival Functions Chapter 7 Survival Models Our final chapter concerns models for the analysis of data which have three main characteristics: (1) the dependent variable or response is the waiting time until the occurrence

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Consider Table 1 (Note connection to start-stop process).

Consider Table 1 (Note connection to start-stop process). Discrete-Time Data and Models Discretized duration data are still duration data! Consider Table 1 (Note connection to start-stop process). Table 1: Example of Discrete-Time Event History Data Case Event

More information

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

Incorporating Time-Invariant Covariates into the Pareto/NBD and BG/NBD Models

Incorporating Time-Invariant Covariates into the Pareto/NBD and BG/NBD Models Incorporating Time-Invariant Covariates into the Pareto/NBD and BG/NBD Models Peter S Fader wwwpetefadercom Bruce G S Hardie wwwbrucehardiecom August 2007 1 Introduction This note documents how to incorporate

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Frailty Modeling for clustered survival data: a simulation study

Frailty Modeling for clustered survival data: a simulation study Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Economics 508 Lecture 22 Duration Models

Economics 508 Lecture 22 Duration Models University of Illinois Fall 2012 Department of Economics Roger Koenker Economics 508 Lecture 22 Duration Models There is considerable interest, especially among labor-economists in models of duration.

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

THE SINGULARITY OF THE INFORMATION MATRIX OF THE MIXED PROPORTIONAL HAZARD MODEL

THE SINGULARITY OF THE INFORMATION MATRIX OF THE MIXED PROPORTIONAL HAZARD MODEL Econometrica, Vol. 71, No. 5 (September, 2003), 1579 1589 THE SINGULARITY OF THE INFORMATION MATRIX OF THE MIXED PROPORTIONAL HAZARD MODEL BY GEERT RIDDER AND TIEMEN M. WOUTERSEN 1 This paper presents

More information

Step-Stress Models and Associated Inference

Step-Stress Models and Associated Inference Department of Mathematics & Statistics Indian Institute of Technology Kanpur August 19, 2014 Outline Accelerated Life Test 1 Accelerated Life Test 2 3 4 5 6 7 Outline Accelerated Life Test 1 Accelerated

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

Semiparametric Estimation of a Panel Data Proportional Hazards Model with Fixed Effects

Semiparametric Estimation of a Panel Data Proportional Hazards Model with Fixed Effects Semiparametric Estimation of a Panel Data Proportional Hazards Model with Fixed Effects Joel L. Horowitz Department of Economics Northwestern University Evanston, IL 60208 and Sokbae Lee Department of

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS330 / MAS83 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-0 8 Parametric models 8. Introduction In the last few sections (the KM

More information

Fahrmeir: Discrete failure time models

Fahrmeir: Discrete failure time models Fahrmeir: Discrete failure time models Sonderforschungsbereich 386, Paper 91 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Discrete failure time models Ludwig Fahrmeir, Universitat

More information

Semiparametric Estimation with Mismeasured Dependent Variables: An Application to Duration Models for Unemployment Spells

Semiparametric Estimation with Mismeasured Dependent Variables: An Application to Duration Models for Unemployment Spells Semiparametric Estimation with Mismeasured Dependent Variables: An Application to Duration Models for Unemployment Spells Jason Abrevaya University of Chicago Graduate School of Business Chicago, IL 60637,

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

Estimation of discrete time (grouped duration data) proportional hazards models: pgmhaz

Estimation of discrete time (grouped duration data) proportional hazards models: pgmhaz Estimation of discrete time (grouped duration data) proportional hazards models: pgmhaz Stephen P. Jenkins ESRC Research Centre on Micro-Social Change University of Essex, Colchester

More information

Duration Models and Point Processes

Duration Models and Point Processes DISCUSSION PAPER SERIES IZA DP No. 2971 Duration Models and Point Processes Jean-Pierre Florens Denis Fougère Michel Mouchart August 27 Forschungsinstitut zur Zukunft der Arbeit Institute for the Study

More information

Multistate models and recurrent event models

Multistate models and recurrent event models Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,

More information

A Regression Model for the Copula Graphic Estimator

A Regression Model for the Copula Graphic Estimator Discussion Papers in Economics Discussion Paper No. 11/04 A Regression Model for the Copula Graphic Estimator S.M.S. Lo and R.A. Wilke April 2011 2011 DP 11/04 A Regression Model for the Copula Graphic

More information

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

Longitudinal and Multilevel Methods for Multinomial Logit David K. Guilkey

Longitudinal and Multilevel Methods for Multinomial Logit David K. Guilkey Longitudinal and Multilevel Methods for Multinomial Logit David K. Guilkey Focus of this talk: Unordered categorical dependent variables Models will be logit based Empirical example uses data from the

More information

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements [Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements Aasthaa Bansal PhD Pharmaceutical Outcomes Research & Policy Program University of Washington 69 Biomarkers

More information

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods MIT 14.385, Fall 2007 Due: Wednesday, 07 November 2007, 5:00 PM 1 Applied Problems Instructions: The page indications given below give you

More information

Survival analysis in R

Survival analysis in R Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Maximum likelihood estimation of mark-recapture-recovery models in the presence of continuous covariates

Maximum likelihood estimation of mark-recapture-recovery models in the presence of continuous covariates Maximum likelihood estimation of mark-recapture-recovery models in the presence of continuous covariates & Ruth King University of St Andrews........ 1 Outline of the problem 2 MRR data in the absence

More information

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables

More information

Multistate models and recurrent event models

Multistate models and recurrent event models and recurrent event models Patrick Breheny December 6 Patrick Breheny University of Iowa Survival Data Analysis (BIOS:7210) 1 / 22 Introduction In this final lecture, we will briefly look at two other

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

THESIS for the degree of MASTER OF SCIENCE. Modelling and Data Analysis

THESIS for the degree of MASTER OF SCIENCE. Modelling and Data Analysis PROPERTIES OF ESTIMATORS FOR RELATIVE RISKS FROM NESTED CASE-CONTROL STUDIES WITH MULTIPLE OUTCOMES (COMPETING RISKS) by NATHALIE C. STØER THESIS for the degree of MASTER OF SCIENCE Modelling and Data

More information

DAGStat Event History Analysis.

DAGStat Event History Analysis. DAGStat 2016 Event History Analysis Robin.Henderson@ncl.ac.uk 1 / 75 Schedule 9.00 Introduction 10.30 Break 11.00 Regression Models, Frailty and Multivariate Survival 12.30 Lunch 13.30 Time-Variation and

More information

Survival Models. Patrick Lam. February 1, 2008

Survival Models. Patrick Lam. February 1, 2008 Survival Models Patrick Lam February 1, 2008 Outline Basics Underlying Math Parametric Survival Models The Cox Proportional Hazards Model Beck, Katz, and Tucker 1998 Conclusion Outline Basics Underlying

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Identification of the timing-of-events model with multiple competing exit risks from single-spell data

Identification of the timing-of-events model with multiple competing exit risks from single-spell data COHERE - Centre of Health Economics Research Identification of the timing-of-events model with multiple competing exit risks from single-spell data By: Bettina Drepper, Department of Econometrics and OR,

More information

Joint Modeling of Longitudinal Item Response Data and Survival

Joint Modeling of Longitudinal Item Response Data and Survival Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,

More information

Remarks on Structural Estimation The Search Framework

Remarks on Structural Estimation The Search Framework Remarks on Structural Estimation The Search Framework Christopher Flinn NYU and Collegio Carlo Alberto November 2009 1 The Estimation of Search Models We develop a simple model of single agent search set

More information

Graduate Econometrics I: What is econometrics?

Graduate Econometrics I: What is econometrics? Graduate Econometrics I: What is econometrics? Yves Dominicy Université libre de Bruxelles Solvay Brussels School of Economics and Management ECARES Yves Dominicy Graduate Econometrics I: What is econometrics?

More information

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions Econ 513, USC, Department of Economics Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions I Introduction Here we look at a set of complications with the

More information

3003 Cure. F. P. Treasure

3003 Cure. F. P. Treasure 3003 Cure F. P. reasure November 8, 2000 Peter reasure / November 8, 2000/ Cure / 3003 1 Cure A Simple Cure Model he Concept of Cure A cure model is a survival model where a fraction of the population

More information

Cox s proportional hazards model and Cox s partial likelihood

Cox s proportional hazards model and Cox s partial likelihood Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.

More information

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU

More information

Optimum Hybrid Censoring Scheme using Cost Function Approach

Optimum Hybrid Censoring Scheme using Cost Function Approach Optimum Hybrid Censoring Scheme using Cost Function Approach Ritwik Bhattacharya 1, Biswabrata Pradhan 1, Anup Dewanji 2 1 SQC and OR Unit, Indian Statistical Institute, 203, B. T. Road, Kolkata, PIN-

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Separate Appendix to: Semi-Nonparametric Competing Risks Analysis of Recidivism

Separate Appendix to: Semi-Nonparametric Competing Risks Analysis of Recidivism Separate Appendix to: Semi-Nonparametric Competing Risks Analysis of Recidivism Herman J. Bierens a and Jose R. Carvalho b a Department of Economics,Pennsylvania State University, University Park, PA 1682

More information

A general mixed model approach for spatio-temporal regression data

A general mixed model approach for spatio-temporal regression data A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression

More information

Definition 1.1 (Parametric family of distributions) A parametric distribution is a set of distribution functions, each of which is determined by speci

Definition 1.1 (Parametric family of distributions) A parametric distribution is a set of distribution functions, each of which is determined by speci Definition 1.1 (Parametric family of distributions) A parametric distribution is a set of distribution functions, each of which is determined by specifying one or more values called parameters. The number

More information

An Overview of Methods for Applying Semi-Markov Processes in Biostatistics.

An Overview of Methods for Applying Semi-Markov Processes in Biostatistics. An Overview of Methods for Applying Semi-Markov Processes in Biostatistics. Charles J. Mode Department of Mathematics and Computer Science Drexel University Philadelphia, PA 19104 Overview of Topics. I.

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

Continuous case Discrete case General case. Hazard functions. Patrick Breheny. August 27. Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21

Continuous case Discrete case General case. Hazard functions. Patrick Breheny. August 27. Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21 Hazard functions Patrick Breheny August 27 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21 Introduction Continuous case Let T be a nonnegative random variable representing the time to an event

More information

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Right censored

More information

ST745: Survival Analysis: Parametric

ST745: Survival Analysis: Parametric ST745: Survival Analysis: Parametric Eric B. Laber Department of Statistics, North Carolina State University January 13, 2015 ...the statistician knows... that in nature there never was a normal distribution,

More information

Lecture 4 - Survival Models

Lecture 4 - Survival Models Lecture 4 - Survival Models Survival Models Definition and Hazards Kaplan Meier Proportional Hazards Model Estimation of Survival in R GLM Extensions: Survival Models Survival Models are a common and incredibly

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

The exponential distribution and the Poisson process

The exponential distribution and the Poisson process The exponential distribution and the Poisson process 1-1 Exponential Distribution: Basic Facts PDF f(t) = { λe λt, t 0 0, t < 0 CDF Pr{T t) = 0 t λe λu du = 1 e λt (t 0) Mean E[T] = 1 λ Variance Var[T]

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

On robust causality nonresponse testing in duration studies under the Cox model

On robust causality nonresponse testing in duration studies under the Cox model Stat Papers (2014) 55:221 231 DOI 10.1007/s00362-013-0523-0 REGULAR ARTICLE On robust causality nonresponse testing in duration studies under the Cox model Tadeusz Bednarski Received: 26 March 2012 / Revised:

More information

Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation

Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation H. Zhang, E. Cutright & T. Giras Center of Rail Safety-Critical Excellence, University of Virginia,

More information

Applied Probability Models in Marketing Research: Extensions

Applied Probability Models in Marketing Research: Extensions Applied Probability Models in Marketing Research: Extensions Bruce G S Hardie London Business School bhardie@londonedu Peter S Fader University of Pennsylvania faderp@whartonupennedu 12th Annual Advanced

More information

ST5212: Survival Analysis

ST5212: Survival Analysis ST51: Survival Analysis 8/9: Semester II Tutorial 1. A model for lifetimes, with a bathtub-shaped hazard rate, is the exponential power distribution with survival fumction S(x) =exp{1 exp[(λx) α ]}. (a)

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

Unobserved Heterogeneity

Unobserved Heterogeneity Unobserved Heterogeneity Germán Rodríguez grodri@princeton.edu Spring, 21. Revised Spring 25 This unit considers survival models with a random effect representing unobserved heterogeneity of frailty, a

More information

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -

More information

Seminar on Longitudinal Analysis

Seminar on Longitudinal Analysis Seminar on Longitudinal Analysis James Heckman University of Chicago This draft, May 20, 2007 1 / 191 Review of Data Collection Techniques Before addressing issues regarding the modeling and estimation

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Continuous Time Survival in Latent Variable Models

Continuous Time Survival in Latent Variable Models Continuous Time Survival in Latent Variable Models Tihomir Asparouhov 1, Katherine Masyn 2, Bengt Muthen 3 Muthen & Muthen 1 University of California, Davis 2 University of California, Los Angeles 3 Abstract

More information

CTDL-Positive Stable Frailty Model

CTDL-Positive Stable Frailty Model CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland

More information

Exam D0M61A Advanced econometrics

Exam D0M61A Advanced econometrics Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is

More information

stcrmix and Timing of Events with Stata

stcrmix and Timing of Events with Stata stcrmix and Timing of Events with Stata Christophe Kolodziejczyk, VIVE August 30, 2017 Introduction I will present a Stata command to estimate mixed proportional hazards competing risks models (stcrmix).

More information

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution.

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution. POWER-LAW ADJUSTED SURVIVAL MODELS William J. Reed Department of Mathematics & Statistics University of Victoria PO Box 3060 STN CSC Victoria, B.C. Canada V8W 3R4 reed@math.uvic.ca Key Words: survival

More information

STAT 479: Short Term Actuarial Models

STAT 479: Short Term Actuarial Models STAT 479: Short Term Actuarial Models Jianxi Su, FSA, ACIA Purdue University, Department of Statistics Week 1. Some important things about this course you may want to know... The course covers a large

More information

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes: Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation

More information

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks Y. Xu, D. Scharfstein, P. Mueller, M. Daniels Johns Hopkins, Johns Hopkins, UT-Austin, UF JSM 2018, Vancouver 1 What are semi-competing

More information

Introduction to Event History Analysis

Introduction to Event History Analysis Introduction to Ulrich Pötter Götz Rohwer August 27 Ruhr University Bochum Contents Notation 3 2 Basic descriptions of durations 6 2. Distribution function, density, and hazard rate......... 6 2.2 Censoring

More information

Accelerated life testing in the presence of dependent competing causes of failure

Accelerated life testing in the presence of dependent competing causes of failure isid/ms/25/5 April 21, 25 http://www.isid.ac.in/ statmath/eprints Accelerated life testing in the presence of dependent competing causes of failure Isha Dewan S. B. Kulathinal Indian Statistical Institute,

More information

ST495: Survival Analysis: Maximum likelihood

ST495: Survival Analysis: Maximum likelihood ST495: Survival Analysis: Maximum likelihood Eric B. Laber Department of Statistics, North Carolina State University February 11, 2014 Everything is deception: seeking the minimum of illusion, keeping

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction Outline CHL 5225H Advanced Statistical Methods for Clinical Trials: Survival Analysis Prof. Kevin E. Thorpe Defining Survival Data Mathematical Definitions Non-parametric Estimates of Survival Comparing

More information

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1 Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes 1 JunXuJ.ScottLong Indiana University 2005-02-03 1 General Formula The delta method is a general

More information

Count and Duration Models

Count and Duration Models Count and Duration Models Stephen Pettigrew April 2, 2014 Stephen Pettigrew Count and Duration Models April 2, 2014 1 / 61 Outline 1 Logistics 2 Last week s assessment question 3 Counts: Poisson Model

More information

The Weibull Distribution

The Weibull Distribution The Weibull Distribution Patrick Breheny October 10 Patrick Breheny University of Iowa Survival Data Analysis (BIOS 7210) 1 / 19 Introduction Today we will introduce an important generalization of the

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require Chapter 5 modelling Semi parametric We have considered parametric and nonparametric techniques for comparing survival distributions between different treatment groups. Nonparametric techniques, such as

More information