Models for longitudinal data

Size: px

Start display at page:

Download "Models for longitudinal data"

Barry Carpenter
5 years ago
Views:

Faculty of Health Sciences Contents Models for longitudinal data Analysis of repeated measurements, NFA 016 Julie Lyng Forman & Lene Theil Skovgaard

regression) Suggested reading: FLW chapters 6, 7, and 8.

statistics The linear growth model (random regression) Unbalanced data Two or more groups of subjects E.g. two different treatments, possibly randomized.

1 Faculty of Health Sciences Contents Models for longitudinal data Analysis of repeated measurements, NFA 016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Longitudinal designs Models for the mean Models for the covariance Linear growth model (aka random regression) Suggested reading: FLW chapters 6, 7, and 8. / 67 Outline Typical set-up for longitudinal measurements Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data Two or more groups of subjects E.g. two different treatments, possibly randomized. Repeated measurements over time for each subject. calender time / age / duration of treatment planned or ad hoc times of measurement. ATT: statistical results may be biased unless we account for correlation between measurements on the same subject. 3 / 67 4 / 67

2 Pros and cons of longitudinal designs Longitudinal vs cross sectional effect Example: Reading ability, as a function of age and cohort: Pros Thay have much more power for detecting changes over time. (data are paired with the subject as its own control) We may discover that subjects have different time courses In designs with only cross-sectional data, this may also be the case, but we have no way of knowing! Cons Analysis by traditional ANOVA or GLM-models is infeasible because the independence assumption is violated. We need to model both the mean and the covariance / 67 6 / 67 Characterizing your longitudinal study Study types Type of study: Randomized or observational? Time schedule: Fixed or ad hoc observations? Type of data: Continuous, binary, coundt, ordinal, or categorical? Data structure: Wide or long format? Randomized: One homogeneous population is studied. Randomization to two (or more) treatment groups One or more follow-up measurements + usually a measurement at baseline. Observational: Two (or more) populations are studied. E.g. men and women or two different groups of patients A well defined starting point (e.g. diagnosis). Sample size: How many subjects? How many time points? 7 / 67 8 / 67

3 Observation schedules Fixed time points (balanced design): Measurements collected at prespecified time points. Equidistant: 5, 10, 15, 0 minutes, or every month. Non-equidistant: 5, 10, 0, and 60 minutes. Ad hoc time points (unbalanced design): Subejcts were seen e.g. when the doctor decided or when the patient felt the need. Beware of selection bias. What are the time points representative of? Case: Calcium supplements Randomized study: including 11 girls at age 11. Treatment: calcium supplement or placebo. Outcome: BMD=bone mineral density, in g/cm 3 Planned follow-up: every 6 months in two years 5 visits in total including baseline. Does calcium increase bone gain in adolescent women? BUT: In practice, designs planned to be balanced often turn out more or less unbalanced... 9 / / 67 Time points in the calcium study Visit as time line (ignore deviations from time schedule) visit: Number 1,, 3, 4, or 5. time: Scheduled follow-up 0, 1/, 1, 1 1/, years. obstime: Actual time of follow-up (individual). Analysis Variable : obstime time NObs Mean Std Dev Minimum Maximum / 67 1 / 67

4 Repetition: Analysis of response profiles Comparison of change over n time points (visits) within g groups (treatment) of subjects. Similar to two-way ANOVA, only with correlated data. An unstructured covariance is assumed. Drawbacks: Can only handle balanced designs. Not good with many groups or time points, due to having too many model parameters. Do not make use of a priori known data patterns, e.g. correlation decreasing with time. monotone growth. Today: Parametric models for the mean and the covariance. 13 / 67 Repetition: Analysis of response profiles Note: Do baseline adjustment since treatment was randomized. title1 Response profiles (clmm) ; data calcium; set calcium; treat = grp; if visit EQ 1 then treat = P ; run; proc mixed data=calcium plots=all; class grp girl time (ref= 0 ) treat (ref= P ); model bmd = time treat*time / ddfm=kr solution cl; repeated time / type=un subject=girl; run; 14 / 67 Repetition: Analysis of response profiles Conclusion Solution for Fixed Effects Standard Effect treat time Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < time < time < time < time < time time*treat C time*treat P time*treat C time*treat P time*treat C time*treat P time*treat C time*treat P time*treat P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time <.0001 time*treat Mean gain in BMD (g/cm 3 ) since baseline with calcium supplement or placebo years Calcium group Placebo group Difference 1/ 0.07 (0.00;0.033) 0.00 (0.014;0.07) (0.000;0.013) (0.047;0.065) (0.035;0.054) 0.01 (0.00;0.01) 1 1/ (0.07;0.094) (0.060;0.08) 0.01 (0.001;0.03) (0.094;0.118) (0.075;0.099) (0.006;0.031) We see a significantly higher gain in BMD with calcium at last follow-up (P=0.003) 15 / / 67

5 Outline Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data 17 / 67 Models for the mean Changes over time usually appear gradually and often following a distinct pattern. We gain power by incorporating this in our models. We want to estimate key parameters such as growth rates. Model the mean as a continuous function of time: Linear Exponential (log-linear) Piecewise linear (linear spline) Flexible (polynomium, smoothing spline, etc) Nonlinear (cyclic, dose-response, etc) (many possibilities, not all treated in this course). 18 / 67 Examples of mean curves Calcium: estimated response profiles 19 / 67 0 / 67 Looks as mean BMD increases linearly with time.

6 Calcium: linear model To fit at linear model for the mean, use the continuous variable time (scheduled visit in years since baseline). title1 Linear trend ; proc mixed data=calcium method=ml plots=all; class grp (ref= P ) girl visit; model bmd = time grp*time / ddfm=kr solution cl; repeated visit / type=un subject=girl r rcorr; run; 1 / 67 Main effect of grp omitted (no difference at baseline). Use the categorical variable visit to specify the unstructured model for the covariance. Use the method=ml option for model comparisons. Calcium: Estimates Fit Statistics - Log Likelihood AIC (Smaller is Better) AICC (Smaller is Better) BIC (Smaller is Better) Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < time < time*grp C time*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time <.0001 time*grp Extra increase in BMD with calcium of g/cm 3 per year, (95%CI: 0.006;0.0151, / 67 P=0.006). Calcium: Estimated response profiles Comparison of models for the mean Goodness of fit is measured by likelihood. Better fitting models have large values of likelihood and therefore small values of deviance: - log Likelihood. ATT: Use the method=ml-option in proc mixed. Compute the difference in deviances (called - log Q) and compare to a χ -distribution with df= no. params. Only nested models can be compared this way. Example: linear trend vs unrestricted response profiles. log Q = = 13.7 χ (9 3) = χ (6) P = BUT: Is linear evolution at all plausible? 3 / 67 4 / 67

7 Technical note on likelihood-types When testing a submodel for the mean, the deviances of the compared models must be computed with the full (or conventional) likelihood proc mixed data=calcium method=ml; not the residual likelihood (reml) which is default in mixed. Residuals for linear trend model But don t forget: Most hypothesis about the mean can be tested using just the default F-tests (optimal choice). proc mixed data=calcium; class grp girl visit treat; model bmd = time grp*time treat*visit/ ddfm=kr solution; repeated visit / type=un subject=girl; run; Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time 0... time*grp 0... visit*treat / 67 6 / 67 Deviation from linearity is not that pronounced. Outline The unstructured covariance Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data 7 / 67 So far we have made no assumptions about covariance. Advantages We make no wrong assumptions about the covariance of our observations. No need to think more about them. We gain insight in the actual structure of the covariance by looking at the estimates. Drawbacks 8 / 67 We use quite a lot of parameters to describe the covariance structure. Thus our analysis becomes less efficient. No good with small data sets; The results may be unstable. It can only be used in case of balanced data, i.e. all subjects have to be measured at identical times.

8 To get a view of the covariance Output for unstructured covariance To get an unbiased view of the covariance we must ensure that out model for the mean is correct. We can use an unrestricted model as a starting point. Now we want to see the estimated covariances and correlations, so we use the r and rcorr options in proc mixed: repeated visit / type=un subject=girl r rcorr; Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col / / 67 Models for the covariance Stationary covariance patterns Most often covariance display distinct features. E.g. decreasing correlation with increasing time span between observations as in the calcium data. We gain power by incorporating these features in our model. Possibilities: Unstructured covariance (lecture 1) Covariance pattern models (lecture ) Variance components / random effects (lecture 3) A huge selection is available with proc mixed. Large selection of models for equidistant observations. Assumption: variances and correlations are stationary: The variance is constant over time. Correlation depend only on the time-distance between the observations not the specific times of measurements. Examples: compound symmetry, autoregressive, autoregressive moving average, and the Toeplitz models. proc mixed type Cov(Y ij, Y ik ) no.par CS σ [I {j = k} + ρ I {j k}] AR(1) σ ρ k j ARMA(1,1) σ [I {j = k} + γ ρ k j 1 I {j k}] 3 TOEP σ [I {j = k} + ρ k j I {j k}] n 31 / 67 3 / 67

9 Examples of stationary correlations Example: Compound symmetry Also called exchangeable covariance (more on this in lecture 3) The variance σ is the same at all time points. All paired observations are correlated with the same correlation ρ, regardles of the specific time points. The 5x5 covariance and correlation matrices are Cov = σ σ ρ σ ρ σ ρ σ ρ σ ρ σ σ ρ σ ρ σ ρ σ ρ σ ρ σ σ ρ σ ρ σ ρ σ ρ σ ρ σ σ ρ σ ρ σ ρ σ ρ σ ρ σ Cor = 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 In proc mixed we write: repeated visit / type=cs subject=girl r rcorr; 33 / / 67 Output for Compound Symmetry Example: AR(1)-covariance matrix Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col Covariance Parameter Estimates Cov Parm Subject Estimate CS girl Residual Note: Cov Parms are not σ = and ρ = (lecture 35 / 67 3). The so-called autoregressive covariance structure has 36 / 67 Constant variance σ over time. Correlation decreasing exponentially with the distance between the observations, Cor(Y ij, Y ik ) = ρ k j The 5x5 covariance and correlation matrices are: Cov = σ σ ρ σ ρ σ ρ 3 σ ρ 4 σ ρ σ σ ρ σ ρ σ ρ 3 σ ρ σ ρ σ σ ρ σ ρ σ ρ 3 σ ρ σ ρ σ σ ρ σ ρ 4 σ ρ 3 σ ρ σ ρ σ In proc mixed we write: Co4 = repeated visit / type=ar(1) subject=girl r rcorr; 1 ρ ρ ρ ρ 1 ρ ρ ρ ρ 1 ρ ρ 3 ρ ρ 1 ρ 4 ρ 3 ρ ρ

10 Output from TYPE=AR(1) structure Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col Covariance Parameter Estimates Cov Parm Subject Estimate AR(1) girl Residual Here σ 37 / 67 = and ρ = Heterogeneous covariance patterns The assumption that the variance does not change with time can be dropped when assuming a heterogeneous covariance pattern. No restrictions on the variances σ 1, σ,... Correlations are assumed to be stationary; They depend only on the time-distance between observations not the specific time points. Examples: the heterogeneous compound symmetry, heterogeneous autoregressive, the heterogeneous Toeplitz, and the antedependence covariance sturctures. 38 / 67 proc mixed type Cov(Y ij, Y ik ) no.par CSH σ j σ k [I {j = k} + ρ I {j k}] n + 1 ARH(1) σ j σ k ρ k j n + 1 TOEPH σ j σ k [I {j = k} + ρ k j I {j k}] n 1 k 1 ANTE(1) σ j σ k l=j ρ l n 1 Output ARH(1) structure Comparison of covariance structures Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col Covariance Parameter Estimates CovParm Subject Estimate Var(1) girl Var() girl Var(3) girl Var(4) girl Var(5) girl ARH(1) girl Model - log L par. log Q df P-value UN ARH(1) AR(1) CS < For comparison of covariance patterns use either of the two likelihood-types (ml or reml), just don t compare one to the other. Estimates of σ1, σ,..., σ 39 / 67 5, and ρ. 40 / 67

11 Predicted response profiles Tests of treatment effect BUT: Confidence intervals and tests depend on the covariance. Tests of treatment effect at last follow-up Covariance pattern Esitmate (95% CI) P-value Assumed independence (0.000;0.0578) Compound symmetry (0.0110;0.083) < Autoregressive (0.0078;0.034) Heterogeneous autoregr (0.0073;0.0315) The estimated response profiles are almost identical for all our choices of covariance patterns -!!! 41 / 67 If data was balanced and complete we would get perfect agreement (all would be equal to the treat*time-averages). Unstructured (0.0065;0.0314) / 67 Modeling strategy As suggested by Fitzmaurice et al. (011): 1. Put up a plausible (e.g. saturated) model for the mean. Fit the data so far ignoring correlation (GLM). 3. Check the residuals for assessing the adequacy of the model for the mean and in order to get an impression of the error covariance. 4. Pick a reasonable model for the covariance (if possible test against the unstructured model). 5. Re-check the model fit. 6. Do the analysis. Note: Lecture 4 on model diagnostics among others. Alternative: Robust standard errors Goodness of fit test for the covariance might select an incorrect model due to lack of power. If data is balanced and complete estimates are unbiased but the standard errors may be biased. Instead one could use the robust sandwich covariance estimator: proc mixed empirical data=calcium; However: The sandwich covariance estimator... is useless in small samples because it is anti-conservative. needs additional weighting to get unbiased estimates when there are missing data. 43 / 67 (More 44 / 67 about sandwich covariance estimators in lectures 5 and 6.)

12 Outline Many time points and few subjects Longitudinal designs In this situation choosing a reliable model for the covariance is just about impossible. Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data Unstructured covariance has too many parameters. Compound symmetry underestimates correlation between observations close in time and overestimates correlation between observations far apart in time. We have no crystal ball for choosing a simple yet correct covariance pattern. Robust standard errors are anti-conservative. So what can we do? 45 / / 67 Reduction to independent data Case: Calcium supplements By analyzing carefully chosen characteristics for each individual we can resolve to simple analyses which have no repeated measurements issues. Maybe not optimal, but feasible and easy! Examples of useful summary statistics: The changes from baseline to endpoint The slopes for individual time effects The area under the curve (AUC) The time to peak or peak value Note: Comparing measurements for each time point in turn is not recommended without adjustment for multiple testing. 47 / 67 The overall time course looks reasonably linear, but maybe the girls have individual growth rates. 48 / 67

Individual regression Fit an ordinary linear regression for each girl: Y ij = A i + B i t j + ε ij, ε ij N (0, σ ) Analysis of summary statistics Comparison of individual intercepts and slopes: Just

13 Individual regression Fit an ordinary linear regression for each girl: Y ij = A i + B i t j + ε ij, ε ij N (0, σ ) Analysis of summary statistics Comparison of individual intercepts and slopes: Just using the two-sample t-test (for independent data). Group Level at baseline (g/cm 3 ) Slope (g/cm 3 per year) P (0.8515;0.8880) (0.0361;0.046) C (0.8650;0.8985) (0.0438;0.0556) Difference (-0.015;0.0365) (0.0009;0.016) P-value Are 49 / 67 the slopes of the Calcium group systematically bigger? 50 / 67 Limitations of summary statistics Outline No baseline adjustment, hence less power. Longitudinal designs Some of the girls in the calcium study dropped out: We get less accurate slope-estimates from girls with few observations. No slope at all if drop out was rigth after baseline. And maybe those with low BMD are more likely to drop out; They think they need supplement but might have got the placebo. Can we make better use of the full data? Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data 51 / 67 5 / 67

14 Random regression We let each girl have her own level A i and her own slope B i We assume these individual parameters (A i and B i ) follow a bivariate normal distribution in the population ( Ai B i ) N (( αg(i) β g(i) The covariance is the so-called G-matrix. 53 / 67 ), ( τ a ω ab ω ab τ b )) G describes the population variance of the regression parameters, the inter-individual variation. Note the correlation ρ ab = ω ab τ a τ b ; Subjects with lower baseline might tend to also have smaller (or larger) growth rates. PROC MIXED: random regression proc mixed data=calcium plots=all; class grp girl; model bmd = time grp*time / ddfm=kr solution cl; random intercept time / type=un subject=girl g v vcorr; run; Individual intercepts and slopes are so-called random effects. They must be specified in a random-statement. Note: The intercept refers to time=0 (baseline). Here type=un refers to a unstructured specification of the G-matrix. If it is omitted, we may experience convergence problems and sometimes totally incomprehensible results. Option g prints the estimated G-matrix. (More about random effects in lecture 3). 54 / 67 Output from random regression Output from random regression Estimated G Matrix Row Effect girl Col1 Col 1 Intercept time Covariance Parameter Estimates Cov Parm Subject Estimate UN(1,1) girl UN(,1) girl UN(,) girl Residual Fit Statistics - Res Log Likelihood AIC (smaller is better) AICC (smaller is better) BIC (smaller is better) Note: ρ ab = = 0.1. Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < time < time*grp C time*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time <.0001 time*grp We find an extra increase in BMD of g/cm 3 per year, with calcium supplement, (95% CI: , P=0.0063). 55 / / 67

15 Implied covariance The random regression implies the covariance pattern: Cov(Y ij, Y ik ) = τa + (t j + t k )ω ab + t j t k τb as a function of the G-matrix parameters and the time points. Does this fit the data well? Random regression vs UN (linear trend for the mean) log Q = = 50.5 χ (15 3) = χ (1) P = No, appearantly not. 57 / 67 Implied covariance Options v and vcorr prints the covariance and correlation. Estimated V Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated V Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col5 58 / Outline Nonequidistant time points Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data In the calcium study the girls are only seen approximately twice a year. Perhaps we get better estimates of the slopes when replacing planned time of visit with the actual individual times? But we loose the option of an unstructured covariance. Other covariance patterns can still be fitted e.g. the autoregressive pattern, the random regression model, and the compound symmetry pattern. 59 / / 67

BMD vs actual time of visit Non-equidistant observations Only a limited number of covariance patterns are available in case time points are individual or non-equidistant.

16 BMD vs actual time of visit Non-equidistant observations Only a limited number of covariance patterns are available in case time points are individual or non-equidistant. They are all stationary: The variance is constant over time. The correlation depend only on the time-distance between the observations, not the specific time points. The ctime-variable must be a numerical variable in SAS proc mixed Cov(Y ij, Y ik ) no. type= param CS σ [I {j = k} + ρ I {j k}] SP(POW)(ctime) σ ρ t ij t ik SP(GAU)(ctime) σ e t ij t ik /γ SP(LIN)(ctime) σ (1 ρ t k t j ) I {ρ t k t j 1}] 61 / 67 6 / 67 Random regression in observed time Random regression, using actual duration Estimated G Matrix Row Effect girl Col1 Col Syntax is exactly the same as with scheduled time points, only the name of the time-varible changes. title1 random regression (actual time) ; proc mixed data=calcium plots=all; class grp girl; model bmd = obstime grp*obstime / ddfm=kr solution cl; random intercept obstime / type=un subject=girl g; run; 1 Intercept obstime Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < obstime < obstime*grp C obstime*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F obstime <.0001 obstime*grp / 67 We find an extra increase in BMD of g/cm 3 per year with 64 / 67 calcium supplement, 95% Ci: to , P=

17 Tests of treatment effect Scheduled vs observed times Comparison of estimates for different covariance structures: Covariance pattern Estimate (95% CI) P-value Assumed independence (0.0071;0.07) Compound symmetry (0.0053;0.0138) < Autoregressive (0.0038;0.016) Random regression (0.006;0.0149) Estimates and tests depend on the covariance! Estimated slopes from the two random regressions: Group Scheduled time Actual time P (0.0405;0.0493) (0.0411;0.0497) C (0.0493;0.058) (0.0498;0.0585) Difference (0.006; ) (0.006;0.0149) P-value Slightly steeper mean slopes in both groups with actual times. Slightly smaller standard errors with actual times. This would have been more pronounced if actual times of visit deviated more from the scheduled times. 65 / / 67 Concluding remarks Results depend on choice of covariance pattern Obvious bias for unrealistic models (independence, CS). More similar results for the more complex models. Not much impact of using exact times of measurement instead of planned times (visit) - WHY? Gain in modeling flexibility by rounding the times. There are sophisticated statistical arguments implying that rounding to the nearest scheduled time do not cause bias (Berkson error) But it increases variance. Choosing an apropriate model is a compromise between practical feasibility, realistic model assumptions, and interpretable results. 67 / 67

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of