Repeated Measures Data

Similar documents
over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

Repeated Measures Design. Advertising Sales Example

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED.

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures

SAS Syntax and Output for Data Manipulation:

Models for longitudinal data

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA

Introduction to SAS proc mixed

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar

ANOVA Longitudinal Models for the Practice Effects Data: via GLM

A SAS Macro to Find the Best Fitted Model for Repeated Measures Using PROC MIXED

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study

POWER ANALYSIS TO DETERMINE THE IMPORTANCE OF COVARIANCE STRUCTURE CHOICE IN MIXED MODEL REPEATED MEASURES ANOVA

Topic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects

Introduction to SAS proc mixed

Introduction to Within-Person Analysis and RM ANOVA

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

Dynamic Determination of Mixed Model Covariance Structures. in Double-blind Clinical Trials. Matthew Davis - Omnicare Clinical Research

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1

Answer to exercise: Blood pressure lowering drugs

Lecture 4. Random Effects in Completely Randomized Design

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED

Covariance Structure Approach to Within-Cases

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

STAT 5200 Handout #23. Repeated Measures Example (Ch. 16)

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology

Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.

Chapter 9. Multivariate and Within-cases Analysis. 9.1 Multivariate Analysis of Variance

Longitudinal Studies. Cross-Sectional Studies. Example: Y i =(Y it1,y it2,...y i,tni ) T. Repeatedly measure individuals followed over time

Step 2: Select Analyze, Mixed Models, and Linear.

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

Biostatistics 301A. Repeated measurement analysis (mixed models)

Review of Multilevel Models for Longitudinal Data

Review of Unconditional Multilevel Models for Longitudinal Data

Introduction to Random Effects of Time and Model Estimation

This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.

Linear Mixed Models with Repeated Effects

Confidence Intervals for One-Way Repeated Measures Contrasts

Review of CLDP 944: Multilevel Models for Longitudinal Data

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Describing Change over Time: Adding Linear Trends

Analysis of variance and regression. May 13, 2008

Lecture 10: Experiments with Random Effects

Modelling the Covariance

SAS Program Part 1: proc import datafile="y:\iowa_classes\stat_5201_design\examples\2-23_drillspeed_feed\mont_5-7.csv" out=ds dbms=csv replace; run;

17. Example SAS Commands for Analysis of a Classic Split-Plot Experiment 17. 1

2 >1. That is, a parallel study design will require

36-402/608 Homework #10 Solutions 4/1

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models:

STAT3401: Advanced data analysis Week 10: Models for Clustered Longitudinal Data

WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS

Multi-factor analysis of variance

Hypothesis Testing for Var-Cov Components

REPEATED MEASURES. Copyright c 2012 (Iowa State University) Statistics / 29

Some general observations.

Graphical Procedures, SAS' PROC MIXED, and Tests of Repeated Measures Effects. H.J. Keselman University of Manitoba

Odor attraction CRD Page 1

dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" = -/\<>*"; ODS LISTING;

Statistics 910, #5 1. Regression Methods

Mixed Models Lecture Notes By Dr. Hanford page 199 More Statistics& SAS Tutorial at

Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

One-Way Repeated Measures Contrasts

This is a Split-plot Design with a fixed single factor treatment arrangement in the main plot and a 2 by 3 factorial subplot.

Variance component models part I

Chapter 11. Analysis of Variance (One-Way)

Accounting for Correlation in the Analysis of Randomized Controlled Trials with Multiple Layers of Clustering

Longitudinal Invariance CFA (using MLR) Example in Mplus v. 7.4 (N = 151; 6 items over 3 occasions)

Sample Size / Power Calculations

Modeling the Covariance

Mixed models with correlated measurement errors

Topic 23: Diagnostics and Remedies

Model comparison and selection

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

Implementation of Pairwise Fitting Technique for Analyzing Multivariate Longitudinal Data in SAS

Analyzing the Behavior of Rats by Repeated Measurements

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30

How to fit longitudinal data in PROC MCMC?

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual

STK4900/ Lecture 10. Program

Random Effects. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology. university of illinois at urbana-champaign

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Random Intercept Models

Introduction and Background to Multilevel Analysis

Chapter 1 Statistical Inference

Chapter 5: Multivariate Analysis and Repeated Measures

Stat 579: Generalized Linear Models and Extensions

Longitudinal Data Analysis of Health Outcomes

Independence (Null) Baseline Model: Item means and variances, but NO covariances

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA

4 Introduction to modeling longitudinal data

EXST 7015 Fall 2014 Lab 11: Randomized Block Design and Nested Design

Now consider the case where E(Y) = µ = Xβ and V (Y) = σ 2 G, where G is diagonal, but unknown.

a = 4 levels of treatment A = Poison b = 3 levels of treatment B = Pretreatment n = 4 replicates for each treatment combination

Models for binary data

Statistical Inference: The Marginal Model

Transcription:

Repeated Measures Data Mixed Models Lecture Notes By Dr. Hanford page 1 Data where subjects are measured repeatedly over time - predetermined intervals (weekly) - uncontrolled variable intervals between measurements Reasons for repeated measures - ensure treatment is effective over a specified period - monitor safety aspects of the treatment over a specified period - measure length of time for treatment to become effective Objectives of repeated measures analysis - measure average treatment effect over time - assess possible time*treatment interactions - assess treatment response profile (e.g. rea Under Curve (UC), tmax, cmax) - identify possible covariance patterns in the repeated measurements - determine an appropriate model to describe the time and measurement relationship. Fixed effects approaches lthough it is possible to treat the patient/animal as a fixed effect in special situations, normally the patient/animal is treated as random. The book briefly covers a few of these situations in Chapter 6 and covers the advantages and disadvantages of these. Most of them require fixed time intervals and no missing data. Mixed model approaches dvantages of using mixed models with repeated measures data - a single model can be used to estimate overall treatment effects and to estimate treatment effects at each time point. - Missing data does not cause problems as long as they can be assumed missing at random. - The covariance pattern of the repeated measurements can be determined and taken into account. Covariance Pattern Models Several ways to analyze repeated measures data using mixed models are available. The first one we will cover is the random effects model with animal effects treated as random. It allows for constant correlation between all observations on the same animal. However,

Mixed Models Lecture Notes By Dr. Hanford page 1 correlations between observations on the same animal are not always constant. Observations recorded on consecutive visits may be more highly correlated than observations measured on visits further apart. covariance pattern model can be used to model this. The basic structure of the covariance pattern model is R1 R R R R5 R R6 R7 R8 R9 R1 Where the submatrix R i covariance block correspond to the ith animal where the dimension of the submatrix correspond to the number of repeated measurements for animal i. The s are also block matrices that correspond between observations of different animals and assumes that the correlation between observations among animals are zero. Covariance patterns There are a large number of possible covariance patterns for each of the submatrices R i available for mixed models. We will present several of the more commonly used patterns which are available in SS. Simple covariance patterns For our example, we will include animals, where nimal 1 had visits, nimal had visits and nimal had visits, where the visits are at fixed times. Independent covariance pattern If the within-subjects correlation is zero, the covariance model is called independent and is the simplest model. The number of parameters that need to be estimated is equal to the number of visits and assumes that the visits are independent. This covariance structure is also referred to as heterogeneous uncorrelated because it allows for differing variances at each visit. For our example, the independent covariance pattern would be

Mixed Models Lecture Notes By Dr. Hanford page 1 1 1 R 1 General or unstructured covariance pattern However, in most experiments there is more than likely some correlation present. If the correlation between each pair of visits (θ ij ) (within-subjects correlation) is allowed to differ and the variance for each time point (σ i ) is allowed to differ, then the covariance pattern is called general or unstructured. This is the most complex of the simple correlation patterns. The number of parameters that need to be estimated is (N*(N+1))/. For example, if there are visits (N=), then the number of parameters would be (*5)/=1. The general covariance pattern for our example follows: 1 1 1 1 1 1 1 1 1 R 1 1 1 1 1 1 First-order autoregressive covariance pattern When time periods are evenly spaced, the first-order autoregressive model is sometimes appropriate. This model is also referred to as R(1). In this model, the variances are the same for all of the visits and the covariances decrease exponentially depending on their distance (θ ij = ρ i-j σ ). Note that the covariance is a function of the variance so the scalar variance can be pulled out of the matrix and that there are only two parameters that need to be estimated. For our example, the first-order autoregressive would be:

Mixed Models Lecture Notes By Dr. Hanford page 15 1 1 1 1 R 1 1 1 1 1 Compound symmetry covariance pattern If all of the covariances among the visits are thought to be the same, then the compound symmetry covariance pattern is used. This pattern does not make any assumption about the relationship between the variance and covariance and only has two parameters that need to be estimated. For our example, this would be: R Toeplitz covariance pattern The Toeplitz covariance is similar to the first-order autoregressive covariance pattern in the way that the pairs separated by the same distance share the same correlations. But with the Toeplitz covariance pattern there is no known function relating the ρ values to the distance. This covariance pattern makes sense when visits are equally spaced. The number of parameters that need to be estimated is equal to the maximum number of visits, which is greater than the two parameters needed for R(1). For our example, the Toeplitz covariance pattern would be

R Mixed Models Lecture Notes By Dr. Hanford page 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Heterogeneous variances among visits The first-order autoregressive, compound symmetry and Toeplitz covariance patterns all assumed that the variances for all of the visits were the same. However, sometimes variability will differ between the visits for the trait being measured. In other words, the variances are heterogeneous. The independent and general covariance patterns allowed for heterogeneity among the variances. The R(1), compound symmetry and Toeplitz covariance patterns can also be modified to allow for heterogeneous variances and are presented for our example. Heterogeneous first-order autoregressive The number of parameters needed for the heterogeneous R(1) is the number of visits plus 1. In our example, with visits, we have 5 parameters that need to be estimated. R 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Heterogeneous compound symmetry The number of parameters needed for the heterogeneous compound symmetry is also the number of visits plus 1.

R Mixed Models Lecture Notes By Dr. Hanford page 17 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Heterogeneous Toeplitz Just as with the homogeneous covariance patterns, the number of parameters that need to be estimated for the heterogeneous Toeplitz covariance pattern is greater than for heterogeneous R(1) or compound symmetry. gain, this is because there is no known function relating the ρ values to the distance. For our example, the heterogeneous Toeplitz covariance pattern would be 1 1 1 1 1 1 1 1 1 1 1 1 1 1 R 1 1 1 1 1 1 1 1 1 1 1 1 1 First-order ante dependence covariance pattern If visits are not equally spaced and the correlation structure changes over time, the R(1) and Toeplitz covariance patterns no longer make the most sense. The first-order ante dependence covariance pattern NTE(1) is a more general model that keeps the main features of the R(1) and Toeplitz, but allows for unequal visit spacing and changes in the correlation structure over time.

Mixed Models Lecture Notes By Dr. Hanford page 18 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 R 1 1 1 1 1 1 1 1 1 1 1 Note that the NTE(1) covariance pattern - permits the variance among observations to change over time - permits the correlations to change over time (correlations between pairs of observations is the product of the correlations between adjacent times between observations) - N-1 parameters estimated. For our example *-1=7. Separate covariance patterns for each treatment Response to treatment may differ in amount of variability, so different sets of covariance parameters may be used for each treatment group. For our example, nimals 1 and received Treatment and nimal received Treatment B. Previously, when we assumed homogeneous treatment variances, the number of parameters that needed to be estimated was two. ssuming heterogeneous treatment variances increases the number of parameters multiplicatively by the number of treatments. For our two treatment example, this would be *= parameters. If the compound symmetry covariance structure is used, the structure we would now have would be R B B B B B B B B B B B B B B B B In our example, we present only animals. If we assume that there are additional animals in the study and that the maximum number of visits for the study is, for the

Mixed Models Lecture Notes By Dr. Hanford page 19 general or unstructured pattern, the number of parameters in our example would be *1=.,1,1,1,1,,,1,, B,1 B,1 B,1 B,1 R B,1 B, B,, B B,1 B, B, B, B,1 B, B, B,,1,1,1, Banded covariances With the exception of the independent structure, all of the above covariance pattern structures assumed that there was some correlation among visits. s the distance in visit number between two different visits increases, the correlation between the observations from the visits in most cases decreases. The correlations between widely separated observations is close to zero and banding the covariance matrices by setting correlations between these observations to zero may be appropriate. The advantage of using a banded covariance structure is that it reduces the number of parameters that need to be estimated. Banding is very useful in experiments where there are a large number of visits or measurement time points. For example, if in our example, we used a band size of with a general covariance pattern, we would get the following, where the θ correlation parameter is assumed to be zero and is not estimated: 1 1 1 1 1 1 1 1 R 1 1 1 1 1 Selecting an appropriate covariance pattern There are many choices of covariance patterns for modeling repeated measures data. Choosing the most appropriate pattern is important in order to draw accurate conclusions.

Mixed Models Lecture Notes By Dr. Hanford page 1 If the covariance pattern chosen is too simple, then you risk increasing Type I error rate and underestimating standard errors. If the covariance pattern chosen is too complex, you sacrifice power and efficiency. s more covariance parameters are included, the chances of over fitting increase. There are tools available to help select a covariance pattern. First, there are graphical tools to help you visualize patterns of correlation between observations at different times. We will go through examples of graphical tools later. Secondly, there are measures of model fit, including information criteria and statistical comparison between models. Information criteria Information criteria measure the relative fit of competing covariance patterns. s more parameters are included in the model, the likelihood statistic is expected to become larger. There are two statistics based on the likelihood that make allowance for the number of covariance parameters fitted and can be used to compare two models which fit the same fixed effects. kaike s information criterion (IC) IC=log(L)-q Where q is the number of covariance parameters and log(l) is the log of the likelihood for the model. SS calculates the IC as -(log(l))+q. The second statistic is Schwarz s information criterion (SIC) SIC=log(L)-(q*log(N-p))/ Where p is the number of fixed effects, N is the number of observations and q is the number of covariance parameters. This is also referred to as the BIC (Bayesian information criterion) or SBC (Schwarz's Bayesian information criterion). SS calculates the BIC as -(log(l))+(q*log(n-p)) where N is the number of subjects. Models with larger values of IC and SIC as presented in the text mean better fit. With SS PROC MIXED, smaller means a better fit. Whether IC or BIC is better is not clear. Statistical comparison between models The likelihood ratio test can also be used to compare models which fit the same fixed effects and whose covariance patterns are nested. Nesting is when the covariance pattern in the simpler model can be obtained by restricting some of the parameters in the more complex model. n example of this would be as follows: R compound symmetry pattern is nested within

Mixed Models Lecture Notes By Dr. Hanford page 11 R 1 1 1 1 Toeplitz pattern, because a restriction on the Toeplitz pattern that θ 1 =θ would be the same as the compound symmetry pattern. However, the compound symmetry pattern is not nested within the R(1) R 1 1 1 because the off diagonal elements in the R(1) are a function of both the σ and the ρ. The likelihood ratio test statistic is (log(l 1 )-log(l )) ~ χ df where df=difference in number of covariance parameters fitted. Which covariance patterns to consider? good strategy is to start with simple patterns such as compound symmetry or R(1). Once the general covariance pattern is determined, more complex patterns based on the selected general covariance pattern can be investigated. The more complex pattern would be accepted only if using them leads to a significant improvement in the likelihood. If there are only a few repeated measurements, there will be little difference between models using different covariance patterns on overall treatment effects. The compound symmetry pattern is likely to be robust and can be used with reasonable confidence. Example: Covariance pattern models for normal data We will use the blood pressure data again, but this time consider the repeated measures data. The table below summarizes the number of patients that attended each visit. Visit Treatment Treatment B Treatment C Total 1 16 11 1 11 16 1 1 8 1 96 9 9 95 91 9 8 5 87 88 9 68 6 8 8 91 58

Mixed Models Lecture Notes By Dr. Hanford page 1 Visualizing correlation patterns graphically The first graphical tool is visualizing the correlation structure by plotting changes in covariance and correlation among residuals on the same subject at different times over distance between times of observations. The following PROC MIXED program allows us to obtain the correlations and covariances among the residuals. proc mixed data=dbp; class trt pat visit; model dbp=trt visit; repeated/type=un subject=pat(trt) sscp rcorr; ods output covparms=cov; ods output rcorr=corr; run; The repeated statement determines the form of the covariance, type=un specifies an unstructured correlation pattern within each pat(trt). subject=pat(trt) specifies that errors are correlated within each pat(trt). In other words, the observations on different pat(trt) levels are independent, but observations on the same pat(trt) are not. The sscp option causes the cov(e ijk,e ijk' ) to be computed directly from the corrected sums of squares and cross products matrix rather than the default REML procedure. The rcorr option causes the correlations to be computed. The two ODS statements create new SS data sets containing the covariances and correlations. The following code sets up the dataset for producing the plot for visualizing correlation patterns data times; do time1=1 to ; do time=1 to time1; dist=time1-time; output; end; end; run; Obs time1 time dist CovParm Subject Estimate 1 1 1 UN(1,1) pat(trt) 8.89 1 1 UN(,1) pat(trt) 5.199 UN(,) pat(trt) 78.96 1 UN(,1) pat(trt) 5.571 5 1 UN(,) pat(trt) 5.51 6 UN(,) pat(trt) 91.986 7 1 UN(,1) pat(trt) 6.7 8 UN(,) pat(trt) 8.91 9 1 UN(,) pat(trt) 6.75 1 UN(,) pat(trt) 71.796 These are the estimated covariances among residuals on the same subject at different times over distance between times of observations. For example UN(,1) is the

Mixed Models Lecture Notes By Dr. Hanford page 1 covariance between the first and fourth time points. The following SS code plots the above covariances. data complot; merge times cov; proc print; run; axis1 value=(font=swiss h=) label=(angle=9 f=swiss h= 'Covariance of between Subj effects'); axis value=(font=swiss h=) label=(f=swiss h= 'Distance'); legend1 value=(font=swiss h=) label=(f=swiss h= 'From Time'); symbol1 color=black interpol=join line=1 value=square; symbol color=black interpol=join line= value=circle; symbol color=black interpol=join line= value=triangle; symbol color=black interpol=join line= value=star; proc gplot data=covplot; plot estimate*dist=time/vaxis=axis1 haxis=axis legend=legend1; run; The values plotted at distance= are the variances among the observations at each of the four visits. These range from about 7 to 9. This small range along with no trend of increasing or decreasing variance with visit, suggests that a covariance pattern with a constant variance over time, such as a Toeplitz or R(1) will probably be adequate. s

Mixed Models Lecture Notes By Dr. Hanford page 1 the distance between pairs of observations increases, the covariance tends to decrease, again suggesting a Toeplitz or R(1) covariance pattern. Even though the plot narrows the choice of which covariance patterns may be appropriate, we will fit and compare six different covariance patterns to our data. Model Covariance pattern 1 compound symmetry first order autoregressive (R(1)) Toeplitz General 5 Separate compound symmetry for each treatment group 6 Separate Toeplitz pattern for each treatment group Treatment, time, treatment*time and baseline effects will be included as fixed effects. The following SS code fits the compound symmetry model. proc mixed data=dbp; class trt pat visit; model dbp=trt visit dbp; repeated visit/type=cs subject=pat r rcorr; run; The following output is produced. Model Information Data Set Dependent Variable Covariance Structure Subject Effect Estimation Method Residual Variance Method Fixed Effects SE Method Degrees of Freedom Method WORK.DBP dbp Compound Symmetry pat REML Profile Model-Based Between-Within Dimensions Covariance Parameters Columns in X 1 Columns in Z Subjects 88 Max Obs Per Subject Number of Observations Number of Observations Read 19 Number of Observations Used 19 Number of Observations Not Used Iteration History Iteration Evaluations - Res Log Like Criterion

Mixed Models Lecture Notes By Dr. Hanford page 15 1 779.69176 1 76.5958.9 1 76.515697. Convergence criteria met. Estimated R Matrix for pat 1 1 76.68.169.169.169.169 76.68.169.169.169.169 76.68.169.169.169.169 76.68 Estimated R Correlation Matrix for pat 1 1 1..559.559.559.559 1..559.559.559.559 1..559.559.559.559 1. Covariance Parameter Estimates Cov Parm Subject Estimate CS pat.169 Residual 6.1 Fit Statistics - Res Log Likelihood 76. IC (smaller is better) 767. ICC (smaller is better) 767. BIC (smaller is better) 77.7 The next three models use the same SS code, except the type= is modified as follows: type=r(1) for R(1) type=toep for Toeplitz type=un for general or unstructured The following are the covariance parameters and information criterion R(1) Residual 76.189 Estimated R Correlation Matrix for R(1) 1..578.81.1879.578 1..578.81.81.578 1..578.1879.81.578 1. - Res Log Likelihood 785. IC (smaller is better) 789. ICC (smaller is better) 789. BIC (smaller is better) 796.6 Toeplitz Residual 76.91 Estimated R Correlation Matrix for Toeplitz 1..57.818.6.57 1..57.818.818.57 1..57

Mixed Models Lecture Notes By Dr. Hanford page 16.6.818.57 1. - Res Log Likelihood 75.6 IC (smaller is better) 758.6 ICC (smaller is better) 758.6 BIC (smaller is better) 77. General or Unstructured UN(1,1) pat 75.8677 UN(,) pat 7.91 UN(,) pat 86.167 UN(,) pat 7.58 Estimated R Correlation Matrix for General or Unstructured 1..515.781.55.515 1..618.986.781.618 1..666.55.986.666 1. - Res Log Likelihood 7. IC (smaller is better) 76. ICC (smaller is better) 76.5 BIC (smaller is better) 798.9 The following table summarizes the fit information for the four models Model - Res Log (L) IC BIC 1 compound symmetry 76. 767. 77.7 first order autoregressive (R(1)) 785. 789. 796.6 Toeplitz 75.6 758.6 77. General 7. 76. 798.9 For all of the models, correlations between visits are positive, indicating that it is important to take into account the correlations between repeated measures. Comparing the fit statistics among these first four covariance patterns, it appears that the compound symmetry (the simplest model with covariance parameters) and the Toeplitz (with covariance parameters) covariance patterns fit the data better than either the R(1) or general covariance patterns. The compound symmetry model is nested within the Toeplitz pattern, so they can be compared statistically using the likelihood ratio test. The likelihood ratio test statistic would be (log(l 1 )-log(l )) ~ χ df = (.5*76.-.5*75.6)=76.-75.6=1.8 =. The p-value associated with this =., indicating that the Toeplitz structure is a significant improvement over the simpler compound symmetry structure. Model 5 s covariance pattern allows for differences in variances and covariances among the treatments. The SS code for fitting this model with a compound symmetry covariance pattern follows proc mixed noclprint data=dbp; class trt pat visit;

Mixed Models Lecture Notes By Dr. Hanford page 17 model dbp=trt visit dbp/ddfm=satterth; repeated visit/type=cs subject=pat group=trt r=1,, rcorr=1,,; run; There are a couple of additions/changes in the repeated statement. group=trt allows for estimation of the parameters by treatment. r=1,, rcorr=1,, indicates that the covariance matrices and correlation matrices for Patients 1( Trt C), (Trt B), and (Trt ) be printed out. Following are the results for Model 5 Estimated R Matrix for pat 1 - Treatment C 1 76.18 7.99 7.99 7.99 7.99 76.18 7.99 7.99 7.99 7.99 76.18 7.99 7.99 7.99 7.99 76.18 Estimated R Correlation Matrix for pat 1 - Treatment C 1 1..6.6.6.6 1..6.6.6.6 1..6.6.6.6 1. Estimated R Matrix for pat - Treatment B 1 68.197 6.777 6.777 6.777 6.777 68.197 6.777 6.777 6.777 6.777 68.197 6.777 6.777 6.777 6.777 68.197

Mixed Models Lecture Notes By Dr. Hanford page 18 Estimated R Correlation Matrix for pat - Treatment B 1 1..96.96.96.96 1..96.96.96.96 1..96.96.96.96 1. Estimated R Matrix for pat Treatment 1 85.57 5.8115 5.8115 5.8115 5.8115 85.57 5.8115 5.8115 5.8115 5.8115 85.57 5.8115 5.8115 5.8115 5.8115 85.57 Estimated R Correlation Matrix for pat Treatment 1 1..586.586.586.586 1..586.586.586.586 1..586.586.586.586 1. Covariance Parameter Estimates Cov Parm Subject Group Estimate Variance pat trt 9.9 CS pat trt 5.8115 Variance pat trt B 1.195 CS pat trt B 6.777 Variance pat trt C 8.158 CS pat trt C 7.99 Fit Statistics - Res Log Likelihood 77.5 IC (smaller is better) 759.5 ICC (smaller is better) 759.6 BIC (smaller is better) 781.5 Looking at the results, the pattern for compound symmetry differ among the treatments, indicating that separate covariances for each treatment group may be necessary. Because Model (Toeplitz) fit significantly better than Model 1 (compound symmetry), a Toeplitz model fitting for separate covariances for each treatment group should be fitted next (Model 6) with the following SS code and results. proc mixed data=dbp; class trt pat visit; model dbp=trt visit dbp; repeated visit/type=toep subject=pat group=trt r=1,, rcorr=1,,; lsmeans trt/ diff pdiff cl; Estimated R Matrix for pat 1 - Treatment C 1 76.1169 5.76 6.95 5.75 5.76 76.1169 5.76 6.95 6.95 5.76 76.1169 5.76 5.75 6.95 5.76 76.1169 Estimated R Correlation Matrix for pat 1 1 1..69.618.67.69 1..69.618

Mixed Models Lecture Notes By Dr. Hanford page 19.618.69 1..69.67.618.69 1. Estimated R Matrix for pat - Treatment B 1 68.1 8.9.76 8.691 8.9 68.1 8.9.76.76 8.9 68.1 8.9 8.691.76 8.9 68.1 Estimated R Correlation Matrix for pat 1 1...95.6. 1...95.95. 1...6.95. 1. Estimated R Matrix for pat 1 8.989 9.5186 1.77.17 9.5186 8.989 9.5186 1.77 1.77 9.5186 8.989 9.5186.17 1.77 9.5186 8.989 Estimated R Correlation Matrix for pat 1 1..587.8.967.587 1..587.8.8.587 1..587.967.8.587 1. Covariance Parameter Estimates Cov Parm Subject Group Estimate Variance pat trt 8.989 TOEP() pat trt 9.5186 TOEP() pat trt 1.77 TOEP() pat trt.17 Variance pat trt B 68.1 TOEP() pat trt B 8.9 TOEP() pat trt B.76 TOEP() pat trt B 8.691 Variance pat trt C 76.1169 TOEP() pat trt C 5.76 TOEP() pat trt C 6.95 TOEP() pat trt C 5.75 Fit Statistics - Res Log Likelihood 7. IC (smaller is better) 78. ICC (smaller is better) 78. BIC (smaller is better) 791.9 Both the simple Toeplitz model (Model ), and the compound symmetry with treatment model (Model 5) are nested within Model 6, so we can test to see if Model 6 fits significantly better than the other two models. Model ( parameters) vs Model 6 (1 parameters) (log(l 1 )-log(l )) ~ χ df = 75.6-7.=6.6= 8 (p=.8) Model 5 (6 parameters) vs Model 6 (1 parameters)

Mixed Models Lecture Notes By Dr. Hanford page 15 (log(l 1 )-log(l )) ~ χ df = 77.5-7.=.5= 6 (p=.6) Therefore, we are statistically justified in using the more complex covariance pattern. This may partly be due to the large size of the trial, which allows us to estimate the covariances with reasonable accuracy.