Methodological challenges in research on consequences of sickness absence and disability pension?

Similar documents
Follow-up data with the Epi package

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

One-stage dose-response meta-analysis

Federated analyses. technical, statistical and human challenges

Flexible modelling of the cumulative effects of time-varying exposures

Introduction to logistic regression

Meta-analysis of epidemiological dose-response studies

Multi-state Models: An Overview

Lecture 7 Time-dependent Covariates in Cox Regression

Point-wise averaging approach in dose-response meta-analysis of aggregated data

UNIVERSITY OF TORONTO Faculty of Arts and Science

Case-control studies

Survival Analysis I (CHL5209H)

Checking model assumptions with regression diagnostics

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics

Exam Applied Statistical Regression. Good Luck!

BMI 541/699 Lecture 22

Statistics in medicine

Correlation and regression

PAPER 30 APPLIED STATISTICS

Lecture 12: Interactions and Splines

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.

Direct likelihood inference on the cause-specific cumulative incidence function: a flexible parametric regression modelling approach

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study

BIOSTATS Intermediate Biostatistics Spring 2017 Exam 2 (Units 3, 4 & 5) Practice Problems SOLUTIONS

The coxvc_1-1-1 package

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

Package SimSCRPiecewise

Joint longitudinal and time-to-event models via Stan

Poisson Regression. The Training Data

Multistate models and recurrent event models

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) GRADUATE DIPLOMA, 2007

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016

Lecture 14: Introduction to Poisson Regression

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Testing and Model Selection

An Analysis. Jane Doe Department of Biostatistics Vanderbilt University School of Medicine. March 19, Descriptive Statistics 1

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

Survival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek

Correlation and Simple Linear Regression

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.

STATS216v Introduction to Statistical Learning Stanford University, Summer Midterm Exam (Solutions) Duration: 1 hours

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

Generalized linear models for binary data. A better graphical exploratory data analysis. The simple linear logistic regression model

Multistate models and recurrent event models

Practice in analysis of multistate models using Epi::Lexis in

STAT 510 Final Exam Spring 2015

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

Recap. HW due Thursday by 5 pm Next HW coming on Thursday Logistic regression: Pr(G = k X) linear on the logit scale Linear discriminant analysis:

Generating new models from design matrix function apc package

Analysing categorical data using logit models

Dependence structures with applications to actuarial science

PubHlth Intermediate Biostatistics Spring 2015 Exam 2 (Units 3, 4 & 5) Study Guide

Ph.D. course: Regression models. Introduction. 19 April 2012

Models for Count and Binary Data. Poisson and Logistic GWR Models. 24/07/2008 GWR Workshop 1

ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS

Variable Selection and Model Choice in Survival Models with Time-Varying Effects

especially with continuous

Unit 5 Logistic Regression Practice Problems

Instantaneous geometric rates via Generalized Linear Models

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Lecture 2: Poisson and logistic regression

Statistics GIDP Ph.D. Qualifying Exam Methodology

Analysing longitudinal data when the visit times are informative

Sample Size/Power Calculation by Software/Online Calculators

On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm

Multiple imputation to account for measurement error in marginal structural models

Chapter 22: Log-linear regression for Poisson counts

A Handbook of Statistical Analyses Using R 2nd Edition. Brian S. Everitt and Torsten Hothorn

A Reliable Constrained Method for Identity Link Poisson Regression

Distributed analysis in multi-center studies

Prediction of Bike Rental using Model Reuse Strategy

PAPER 206 APPLIED STATISTICS

Outline. Mixed models in R using the lme4 package Part 3: Longitudinal data. Sleep deprivation data. Simple longitudinal data

A note on R 2 measures for Poisson and logistic regression models when both models are applicable

Multivariable Fractional Polynomials

Lecture 5: Poisson and logistic regression

Statistics 191 Introduction to Regression Analysis and Applied Statistics Practice Exam

Power and Sample Size Calculations with the Additive Hazards Model

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

Introduction to the Analysis of Tabular Data

Sample solutions. Stat 8051 Homework 8

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Using P-splines to smooth two-dimensional Poisson data

Journal of Asian Scientific Research PREDICTION OF FATAL ROAD TRAFFIC CRASHES IN IRAN USING THE BOX-JENKINS TIME SERIES MODEL. Ayad Bahadori Monfared

Ordinary Least Squares Regression Explained: Vartanian

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

Faculty of Science FINAL EXAMINATION Mathematics MATH 523 Generalized Linear Models

Goodness of Fit Tests: Homogeneity

Case-control studies C&H 16

Classification. Chapter Introduction. 6.2 The Bayes classifier

Today. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News:

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Multivariable Fractional Polynomials

Generalized Linear Models in R

Duration of Unemployment - Analysis of Deviance Table for Nested Models

Ph.D. course: Regression models

QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018

Transcription:

Methodological challenges in research on consequences of sickness absence and disability pension? Prof., PhD Hjelt Institute, University of Helsinki 2

Two methodological approaches Lexis diagrams and Poisson regression Multi state modeling 3

Poisson regression The Cox proportional hazards model use only one time scale Alternative is to use Poisson regression with a piecewise-constant hazard Typically 5-years timebands of age and calendar year are used. 4

Poisson regression Quite sophisticated analysis could be carried out by Poisson regression Follow-up time could be split by event in interest e.g.: E.g. by first prescrition of certain drug first long sickness leave first hospitalization Times could be modeled using splines that provides good tools for data-analysis 5

Sick-leave data time scales Time since start of follow-up Time since start of antidepressants Time since cessation of antidepressants Time since cessation of last sick-leave Age Calendar time 6

Lexis and multiple time scales Two sample individuals Healthy (H) Sickness absence Mental disorder (after first hospitalisation) Death Time since start of follow-up 7

Time-scales Time since start of antidepressants Time since start of first antidepressants Time since cessation of antidepressants Time since cessation of last sick-leave Healthy (H) Sickness absence Mental disorder (hospitalisation) Antidepressant medication (AD) Time since start of follow-up Other time scales: Disability pension Age, Calendar time, Time since stop of hospitalization 8

Lexis diagrams Examples from : Carstensen B, Plummer M. Using Lexis Objects for Multi-State Models in R. Journal of Statistical Software 2011;38:1 18. Carstensen B. Age period cohort models for the Lexis diagram. Statistics in Medicine 2007;26:3018 45. Plummer M, Carstensen B. Lexis: An R Class for Epidemiological Studies with Long-Term Follow-Up. Journal of Statistical Software 2011;38:1 12. Example data:the data concern a cohort of nickel smelting workers in South Wales and are taken from Breslow and Day, Volume 2. For comparison purposes, England and Wales mortality rates (per 1,000,000 per annum) from lung cancer (ICDs 162 and 163), nasal cancer (ICD 160), and all causes, by age group and calendar period, are supplied in the dataset ewrates. 9

Nickel data Example data > head(nickel) id icd exposure dob age1st agein ageout 1 3 0 5 1889.019 17.4808 45.2273 92.9808 2 4 162 5 1885.978 23.1864 48.2684 63.2712 3 6 163 10 1881.255 25.2452 52.9917 54.1644 4 8 527 9 1886.340 24.7206 47.9067 69.6794 5 9 150 0 1879.500 29.9575 54.7465 76.8442 6 10 163 2 1889.915 21.2877 44.3314 62.5413 > 10

Lexis, one time scale 11

Lexis, two time scales 12

Lexis, two time scales, 5 yrs 13

Modeling data (1) Variables for three time scale, viz. period, age and time since first exposure > subset(nics2,id==4,select = c("case","period","age", "tfe", "lex.dur")) case period age tfe lex.dur 7 FALSE 1934.246 48.2684 25.0820 1.7316 8 FALSE 1935.978 50.0000 26.8136 3.1864 9 FALSE 1939.164 53.1864 30.0000 6.8136 10 TRUE 1945.978 60.0000 36.8136 3.2712 14

Modeling data (2) library( Hmisc ) library( rms ) library(epi) # Model with age and time since first exposure # NOTE! # function rcs applies restricted cubic spline with # user specified knots tmp.m1.2<-glm(case ~ rcs(age,parms=c(30,50,70)) + rcs(tfe,parms=c(10,20,30,40)) + offset(log(pyar)), family = poisson(), subset = (age >= 40), data = nics2) 15

Modeling data (3) > (tmp.m1.2) Call: glm(formula = case ~ rcs(age, parms = c(30, 50, 70)) + rcs(tfe, parms = c(10, 20, 30, 40)) + offset(log(pyar)), family = poisson(), data = nics2, subset = (age >= 40)) Coefficients: (Intercept) -13.23426 rcs(age, parms = c(30, 50, 70))age 0.13467 rcs(age, parms = c(30, 50, 70))age' -0.09365 rcs(tfe, parms = c(10, 20, 30, 40))tfe 0.11148 rcs(tfe, parms = c(10, 20, 30, 40))tfe' -0.10678 rcs(tfe, parms = c(10, 20, 30, 40))tfe'' 0.08234 Interpretation of spline parameters is difficult. -> Present results as graphs Degrees of Freedom: 2757 Total (i.e. Null); 2752 Residual Null Deviance: 999.4 Residual Deviance: 968.7 AIC: 1255 16

Modeling data (4) Rate ratios (RR) with 95% CI presented in graphics. RR 0.1 0.5 2.0 5.0 RR 0.1 0.5 2.0 5.0 40 50 60 70 80 Age (y) 10 20 30 40 50 Time since first exposure (y) 17

Multi state modeling 18

Multi state modeling Multi state models describe how individual moves between series of states in continuous time 19

Multi state modeling Multi state models describe how individual moves between series of states in continuous time Figure 1. Model for transitions between the four states: working (W), sickness absence (S), unemployment (U) and disability pension (D). The sickness absence state is labeled entry, to indicate that only sick-listed persons enter the study. (Pedersen J, Bjorner JB, Burr H, et al. Transitions between sickness absence, work, unemployment, and disability in Denmark 2004-2008. Scand J Work Environ Health 2012;38:516 26.) 20

Analysing multi-state data (1) One approach is that we are interested in transition pattern between states. We apply Markov model, where transition intensities between states are presented in matrix Q Q=( q 11 q 12 q 13 q 21 q 22 q 23 q 31 q 32 q 33 ) 21

Transition example (1) q 21 q 12 q 24 q 32 q 14 q 23 q 13 q 31 q 34 Figure 1 Transition states between labour market outcomes in Denmark. Work, sickness absence and unemployment covers persons in the workforce while retirement independent of the reason (disability or age) is an irreversible state, where persons are considered to leave the workforce forever. (Carlsen K, Harling H, Pedersen J, et al. The transition between work, sickness absence and pension in a cohort of Danish colorectal cancer survivors. BMJ Open 2013;3. doi:10.1136/bmjopen-2012-002259) 22

Q with some constrains 1.Working 2.Sickness absense 3.Unemployed 4.Retired Q=( q q q q 11 12 13 14 q q q q 21 22 23 24 q q q q 31 32 33 34 0 0 0 q 44 ) Intensity matrix for Carlsen K, Harling H, Pedersen J (2013) 23

Analysing multi-state data Using matrix Q we are able to calculate transition matrix P between states. p ij describes probability to be in state j after certain time when starting state is i q 11 q 12 q 13 q 14 p 11 p 12 p 13 p 14 q p P=( 21 p 22 q 23 p 24 p 31 p 32 q 33 p Q=( 21 q 22 q 23 q 24 ) q 31 q 32 q 33 q 34 34 0 0 0 q 44 p 41 p 42 p 43 p 44 Intensity matrix for Carlsen K, Harling H, Pedersen J (2013) ) 24

States in nocturia data to from 0-1 2 3-5 1 333 113 20 2 119 272 93 3 30 124 432 119 0-1 30 113 20 NOTE! All transitions between states are possible. Here are number of transitions 124 2 3-5 93 26

Example We use R package msm for analysis # Initial values for Q matrix > twoway2.q 0-1 2 3-5 0-1 0.000 0.166 0.166 2 0.166 0.000 0.166 3-5 0.166 0.166 0.000 # Model without covariates tmp.m1.msm <- msm(noct.3.num ~ Week.num, subject = PATIENT, gen.inits = TRUE, qmatrix = twoway2.q, data = pla.data.1.v2, obstype = 1, analyticp = TRUE, opt.method = "optim", control = list(maxit = 1000, trace = 0, fnscale = 2000)) 27

Simple model results ## ## Call: ## msm(formula = Noct.3.num ~ Week.num, subject = PATIENT, data = pla.data.1.v2, qmatrix = twoway2.q, gen.inits = TRUE, obstype = 1, opt.method = "optim", analyticp = TRUE, control = list(maxit = 1000, trace = 0, fnscale = 2000)) ## ## Maximum likelihood estimates: ## Transition intensity matrix ## ## 0-1 2 ## 0-1 -0.05529 (-0.0685,-0.04463) 0.05298 (0.042,0.06682) ## 2 0.05591 (0.04434,0.07049) -0.09766 (-0.1153,-0.0827) ## 3-5 0.00321 (0.001195,0.008623) 0.04537 (0.04053,0.05078) ## 3-5 ## 0-1 0.002316 (0.0005702,0.009405) ## 2 0.04176 (0.03352,0.05203) ## 3-5 -0.04858 (-0.05445,-0.04333) ## ## -2 * log-likelihood: 2658 28

Simple model, intensity matrix Intensity matrix (Q) 0-1 2 3-5 0-1 -0.0553 0.0530 0.0023 2 0.0559-0.0977 0.0418 3-5 0.0032 0.0454-0.0486 29

Adding covariates It is possible to add individual level covariates into model. We also might be interested in if transition intensities are stable or are they time-inhomogenius (not shown here). msm(formula = Noct.3.num ~ Week.num, subject = PID, data = pla.data.2, qmatrix = twoway2.q, gen.inits = TRUE, obstype = 1, pci = c(27), opt.method = "optim", control = list(maxit = 1000, trace = 0, fnscale = 2000)) 30

# Add BMI as covariate Model with covarites tmp.m2.msm <- msm(noct.3.num ~ Week.num, subject = PATIENT, gen.inits = TRUE, qmatrix = twoway2.q, data = pla.data.1.v2[tmp.i., ], obstype = 1, covariates = ~BMI..kg.m2., opt.method = "optim", control = list(maxit = 1000, trace = 0, fnscale = 2000)) 31

Model comparison Now we have two models, viz. tmp.m1.msm and tmp.m2.msm. We use likelihood ratio test to check if covarite (BMI) should be included. It turns out that BMI as covariate should be included # Check model, adding BMI as covariate gives better # model lrtest.msm(tmp.m1.msm, tmp.m2.msm) ## -2 log LR df p ## tmp.m2.msm 20.19 6 0.00256 32

Interpretation of results In order to interpret results it is usually sensible to calculate estimates transition matrices (TM). In this example we calculate TM for two covariate values: BMI=25 and BMI=30 Transition matrix is calculated for time period of 24 week (6 months) 33

Transition matrices estimates Predicted probability of transition (%) in 24 weeks. With two covariate values (BMI=25 and BMI=30) BMI=25 0-1 2 3-5 0-1 51 (45,57) 33 (28,37) 16 (13,21) 2 37 (32,42) 38 (33,42) 26 (22,31) 3-5 23 (20,28) 34 (30,37) 43 (37,48) BMI=30 0-1 2 3-5 0-1 45 (38,51) 32 (28,38) 23 (19,28) 2 33 (28,39) 35 (30,39) 32 (27,37) 3-5 22 (18,26) 31 (27,35) 47 (42,52) Transition prob. to state 3-5 is much higher for patients with higher BMI 34

24 weeks transitions BMI=25 0-1 2 3-5 0-1 51 (45,57) 33 (28,37) 16 (13,21) 2 37 (32,42) 38 (33,42) 26 (22,31) 3-5 23 (20,28) 34 (30,37) 43 (37,48) 51% 0-1 37% 33% 16% 23% 34% 2 3-5 38% 26% 43% 35

24 weeks transitions BMI=30 0-1 2 3-5 0-1 45 (38,51) 32 (28,38) 23 (19,28) 2 33 (28,39) 35 (30,39) 32 (27,37) 3-5 22 (18,26) 31 (27,35) 47 (42,52) 45% 0-1 33% 32% 23% 31% 22% 2 3-5 35% 32% 47% 36

Model with covariate(s) Effect of covariate(s) could be presented as hazard ratios (HR) for each transition Here HR presented for five units of BMI print(hazard.msm(tmp.m2.msm, hazard.scale = 5)) ## $BMI..kg.m2. ## HR L U ## 0-1 - 2 1.1617 0.8475 1.593 ## 0-1 - 3-5 1.8966 0.8879 4.051 ## 2-0-1 1.0637 0.8199 1.380 ## 2-3-5 1.2701 0.9502 1.698 ## 3-5 - 0-1 1.0751 0.4494 2.572 ## 3-5 - 2 0.9455 0.6939 1.288 37

Question was: Methodological challenges in research on consequences of sickness absence and disability pension? Answer: Yes, there are such challenges. Two approaches, Lexis data with Poisson regression and multi state models, Conclusion might help researcher to have better grasp to data and the transitions of sickness absence and disability pension. 38

References Carlsen K, Harling H, Pedersen J, et al. The transition between work, sickness absence and pension in a cohort of Danish colorectal cancer survivors. BMJ Open 2013;3. doi:10.1136/bmjopen-2012-002259 Carstensen B, Plummer M. Using Lexis Objects for Multi-State Models in R. Journal of Statistical Software 2011;38:1 18. Carstensen B, Plummer M, Laara E, et al. Epi: A package for statistical analysis in epidemiology. 2008. http://www.pubhealth.ku.dk/ bxc/epi/ Jackson C. Multi-State Models for Panel Data: The msm Package for R. Journal of Statistical Software 2011;38:1 28. Pedersen J, Bjorner JB, Burr H, et al. Transitions between sickness absence, work, unemployment, and disability in Denmark 2004-2008. Scand J Work Environ Health 2012;38:516 26. Plummer M, Carstensen B. Lexis: An R Class for Epidemiological Studies with Long-Term Follow-Up. Journal of Statistical Software 2011;38:1 12. 39