Analysis of variance and regression. May 13, 2008

Size: px
Start display at page:

Download "Analysis of variance and regression. May 13, 2008"

Transcription

1 Analysis of variance and regression May 13, 2008

2 Repeated measurements over time Presentation of data Traditional ways of analysis Variance component model (the dogs revisited) Random regression Baseline considerations

3 Lene Theil Skovgaard, Dept. of Biostatistics, Institute of Public Health, University of Copenhagen

4 Repeated measurements, May Traditional presentation of longitudinal data: Ex: Aspirin absorption for healthy and ill subjects (Matthews et.al.,1990) Comparison of groups for each time: mass significance problem tests are not independent interpretation may be difficult

5 Repeated measurements, May What is the purpose of the investigation? Description of time course Comparison of groups in which respect? level, trend,... overall pattern

6 Repeated measurements, May Why is this difficult? or at least different from usual analyses We have several measurements on each individual traditional independence assumption is violated repeated observations on the same individual are correlated (look alike) ignoring this correlation may lead to bias, wrong standard error and therefore potentially misleading conclusions Time course may be quite irregular, with no obvious structure, to be treated as a class-variable (using many parameters) in ANOVA-type models (variance component models) Time course may vary between individuals Random regression

7 Repeated measurements, May Notation from multi-level models: level unit covariate 1 single observations time effects 2 individuals treatment effects If we fail to take this correlation into account, we will experience: possible bias in the mean value structure low efficiency (type 2 error) for evaluation of level 1 covariates (time-related effects) too small standard errors (type 1 error) for estimates of level 2 effects (treatments)

8 Repeated measurements, May Possible bias? Individual time courses Average curve sometimes referred to as the healthy worker effect

9 Repeated measurements, May Missing values MCAR Missing completely at random MAR Missing at random - may depend on past observations NR Informative missing (non-random) - depends on the missing value itself

10 Repeated measurements, May Level 1 covariates (unit: single observations), i.e. Time itself Covariates varying with time: blood pressure, heart rate, age If correlation is not taken into account, we ignore the paired situation, leading to low efficiency, i.e. too large P-values (type 2 error) Effects may go undetected!

11 Repeated measurements, May Level 2 covariates (unit: individuals), i.e. Treatment Gender, age If correlation is ignored, we act as if we have (a lot) more information than we actually have, leading to too small P-values (type 1 error) Noise may be taken to be real effects!

12 Repeated measurements, May Average curves may hide important structures! They give no indication of the variation in the time profiles Comparisons between groups should not be performed for each time point separately Comparisons between time points cannot be judged from the curves (they are paired)

13 Repeated measurements, May The model must describe the characteristic differences between individuals, and the rest (noise, error) should be of an unsystematic, random nature. Do not average over individual profiles, unless these have identical shapes, i.e. only shifts in level are seen between individuals. Alternative: Calculate individual characteristics

14 Repeated measurements, May Individual time profiles (spaghettiogram) - divided into groups Do we see time profiles of identical shape? Are the averages representative?

15 Repeated measurements, May Commonly used characteristics The response for selected times, e.g. endpoint Average over a specific period of time The slope, perhaps for a specific period Peak value Time to peak The area under the curve (AUC). A measure of cyclic behaviour. These are analysed as new observations.

16 Repeated measurements, May Ex: Aspirin time to peak peak value Conclusion: P=0.02 for identity of peak values. Quantifications!

17 Repeated measurements, May Example: 2 groups of dogs (5 resp. 6 dogs). Average profiles: of osmolality, measured 4 times (including treatments along the way)

18 Repeated measurements, May Do we have identical repetitions (except for level)?

19 Repeated measurements, May Model control Residual plot for 2-way ANOVA in (dog, treatment) We see a clear trumpet shape, because dogs with a high level also vary more than dogs with a low level. Multiplicative structure Solution: Make a logarithmic transformation!

20 Repeated measurements, May Profiles on logarithmic scale, with corresponding residual plot:

21 Repeated measurements, May Multilevel model structure: level/niveau 1 2 unit single measurements individuals variation within individuals between individuals σ 2 W ω 2 B covariates x z time, grp*time grp Multilevel models are part of the broader class of models: variance component models (which are not necessarily hierarchical)

22 Repeated measurements, May Two-level model: Observations Y gdt (group, dog, time) Random dog-level, Var(a gd ) = ωb 2 Residual variation, within dogs, Var(ε gdt ) = σw 2 Systematic effect of time and grp proc mixed data=dog; class grp time_no dog; model losmol=grp time_no grp*time_no / ddfm=satterth; random dog(grp); run;

23 Repeated measurements, May This model assumes the socalled compound symmetry, i.e. that all measurements on the same individual are equally correlated: Corr(Y gdt1, Y gdt2 ) = ρ = ω 2 B ω 2 B + σ2 W This means that the distance in time is not taken into account!!

24 Repeated measurements, May Two-level model with random dog level: Class Levels Values grp time_no dog Covariance Parameter Estimates Standard Z Cov Parm Estimate Error Value Pr Z dog(grp) Residual P=0.08 for test of interaction, i.e. no convincing indication of this. Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time_no <.0001 grp*time_no

25 Repeated measurements, May Factor diagram: [I] = [Dog Time] [Dog] Grp Time Grp Time We have used the notation [ ] for the random effects, corresponding to variance components. We may note the following: The effect of Grp*Time is evaluated against Dog*Time If Grp*Time is not considered significant, we thereafter evaluate Time against Dog*Time Grp against Dog(Grp)

26 Repeated measurements, May The variance component model with random dog level specifies the covariance structure: 0 ω 2 B + σ2 W ω 2 B ω 2 B ω 2 B ω 2 B ω 2 B + σ2 W ω 2 B ω 2 B ω 2 B ω 2 B ω 2 B + σ2 W ω 2 B ω 2 B ω 2 B ω 2 B ω 2 B + σ2 W 1 0 C A = (ω2 B +σ2 W ) 1 ρ ρ ρ ρ 1 ρ ρ ρ ρ 1 ρ ρ ρ ρ 1 1 C A called the compound symmetry structure. The correlation ρ is here estimated to ρ = Corr(Y gdt1, Y gdt2 ) = ω 2 B ωb 2 + σ2 W = 0.65

27 Repeated measurements, May Note, that the specification random dog(grp); can be written in two other ways: random intercept / subject=dog(grp); repeated time / type=cs subject=dog(grp); In the following, we shall see generalisations of the constructions above.

28 Repeated measurements, May Compound symmetry analysis proc mixed data=dog; class grp time dog; model losmol=grp time grp*time / ddfm=satterth; repeated time / type=cs subject=dog(grp) rcorr; run; Covariance Parameter Estimates Cov Parm Subject Estimate CS dog(grp) Residual Fit Statistics -2 Res Log Likelihood 14.8 AIC (smaller is better) 18.8 Estimated R Correlation Matrix for dog(grp) 1 1 Row Col1 Col2 Col3 Col Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time_no <.0001 grp*time_no

29 Repeated measurements, May The option ddfm=satterth (- or kenwardrogers): When the distributions are exact, they have no effect in balanced situations When approximations are necessary, these are considered best in unbalanced situations, i.e for almost all observational designs in case of missing observations It may give rise to fractional degrees of freedom The computations may require a little more time, but in most cases this will not be noticable When in doubt, use it!

30 Repeated measurements, May Since the interaction was not significant, we omit it from the model: Covariance Parameter Estimates Standard Z Cov Parm Estimate Error Value Pr Z dog(grp) Residual <.0001 Solution for Fixed Effects Standard Effect grp time_no Estimate Error DF t Value Pr > t Intercept grp grp time_no time_no time_no <.0001 time_no Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time_no <.0001

31 Repeated measurements, May The variance component model (compound symmetry) with random dog level specifies the covariance structure: 0 (ωb 2 + σw) 2 1 ρ ρ ρ ρ 1 ρ ρ ρ ρ 1 ρ ρ ρ ρ 1 1 C A But: The assumption of equal correlation for all pairs of observations taken on the same individual is not necessarily reasonable! Observations taken close to each other in time will often be more closely correlated than observations taken further apart!

32 Repeated measurements, May In the dog example, the empirical correlation matrix is Rather large differences are seen between individual correlations. So what?

33 Repeated measurements, May Unstructured covariance If we do not assume any special structure for the covariance, we may let it be arbitrary = unstructured This is done in MIXED by using type=un and remembering the option hlm: proc mixed data=dog; class grp dog time_no; model losmol=grp time_no grp*time_no / ddfm=satterth; repeated time_no / type=un hlm subject=dog(grp) rcorr; run;

34 Repeated measurements, May Estimated R Correlation Matrix for dog(grp) 1 1 Row Col1 Col2 Col3 Col Fit Statistics -2 Res Log Likelihood 2.3 AIC (smaller is better) 22.3 Type 3 Hotelling-Lawley-McKeon Statistics Num Den Effect DF DF F Value Pr > F time <.0001 grp*time_no

35 Repeated measurements, May Advantages with unstructured covariance We do not force a wrong covariance structure upon our observations. We gain some insight in the actual structure of the covariance. Drawbacks of the unstructured covarianc We use quite a lot of parameters to describe the covariance structure. The result may therefore be unstable. It cannot be used for small data sets It can only be used in case of balanced data (all subjects have to be measured at identical times) Can we do something in between?

36 Repeated measurements, May Comparison of models, using the likelihood Default likelihood is the REML-likelihood, where the mean value structure has been eliminated The traditional likelihood may be obtained using an extra option: proc mixed method=ml; Use differences in 2 log L ( = 2 log Q) and compare to χ 2 with degrees of freedom equal to the difference in parameters Comparison of covariance structures: Use either of the two likelihoods Comparison of mean value structures: Use only the traditional likelihood (ML) Comparison of compound symmetry and unstructured covariance: 2log Q = = 12.5 χ 2 (10 2) = χ 2 (8) P = 0.13.

37 Repeated measurements, May Autoregressive structure of first order In case of equidistant times, this specifies the following covariance structure σ 2 1 ρ ρ 2 ρ 3 ρ 1 ρ ρ 2 ρ 2 ρ 1 ρ ρ 3 ρ 2 ρ 1 i.e. the correlation decreases (in powers) with the distance between observations. The non-equidistant analogue is Corr(Y gdt1, Y gdt2 ) = ρ t 1 t 2

38 Repeated measurements, May Autoregressive structure of first order (TYPE=AR(1)) Estimated R Correlation Matrix for dog(grp) 1 1 Row Col1 Col2 Col3 Col Covariance Parameter Estimates Standard Z Cov Parm Subject Estimate Error Value Pr Z AR(1) dog(dog) <.0001 Residual Fit Statistics -2 Res Log Likelihood 9.8 AIC (smaller is better) 13.8 Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time_no <.0001 grp*time_no

39 Repeated measurements, May Note: Comparison of models with different covariance structures using a χ 2 -test on 2 log Q, the difference between 2 log L s requires, that the models are nested This is not the case for CS and AR(1)! Therefore, we have to compare both of them with the model which combines the two covariance structures: proc mixed data=dog; class grp dog time_no; model losmol = grp time_no grp*time_no / ddfm=satterth; random intercept / subject=dog(grp) vcorr; repeated time_no / type=ar(1) subject=dog(grp); run;

40 Repeated measurements, May Estimated V Correlation Matrix for dog(grp) 1 1 Row Col1 Col2 Col3 Col Covariance Parameter Estimates Cov Parm Subject Estimate dog(grp) AR(1) dog(grp) Residual Fit Statistics -2 Res Log Likelihood 9.8 AIC (smaller is better) 15.8 Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time_no <.0001 grp*time_no

41 Repeated measurements, May Comparison of covariance structures cov. Model -2 log L par. df P CS=random dog AR(1) both UN Conclusions? The autoregressive structure is probably the best! Our data set is too small!

42 Repeated measurements, May What, if we had had double or triple measurements at each time? If we always have the same number of repetitions, the correct and optimal approach is to analyze averages If the number of repetitions vary, analysis of averages may still be valid (depends on the reason for the unbalance), although not optimal The optimal approach is to modify the random-statement to: random dog dog*time_no;

43 Repeated measurements, May Actually, the times are not equidistant! Then what?? The non-equidistant analogue to the autoregressive structure is Corr(Y gdt1, Y gdt2 ) = ρ t 1 t 2 which is written as TYPE=SP(POW)(time) For technical reasons, we have to rescale time to hours=time/60 proc mixed covtest data=dog; class grp hours dog; model losmol=grp hours grp*hours / s ddfm=satterth; repeated hours / subject=dog(grp) type=sp(exp)(hours) r; run;

44 Repeated measurements, May Class Level Information Class Levels Values grp hours dog Estimated R Matrix for dog(grp) 1 1 Row Col1 Col2 Col3 Col Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp hours <.0001 grp*hours

45 Repeated measurements, May Example: Calcium supplement for adolescent women, to improve the rate of bone gain A total of year old girls were randomized to receive either calcium or placebo. Outcome: BMD=bone mineral density, in g cm, 2 measured every 6 months (5 visits)

46 Repeated measurements, May Factor diagram: [I] = [Girl V isit] [Girl] Grp V isit Grp V isit Two-level model with: Observations Y git (group, girl=individual, visit) Random girl-level, Var(a gi ) = ωb 2 Residual variation, within girls, Var(ε git ) = σw 2

47 Repeated measurements, May Variance component model: where Y git = µ + α g + β t + γ gt + a gi + ε git Var(a gi ) = ω 2 B, Var(ε git ) = σ 2 W Like previously, we have assumed compound symmetry, i.e. that all measurements on the same girl are equally correlated: Corr(Y git1, Y git2 ) = ρ = ω 2 B ω 2 B + σ2 W

48 Repeated measurements, May Empirical correlation structure: Row COL1 COL2 COL3 COL4 COL Is compound symmetry reasonable? Other possibilities: Unstructured: T(T+1) 2 covariance parameters patterned, e.g. an autoregressive structure random regression

49 Repeated measurements, May Compound symmetry results for the calcium example: Covariance Parameter Estimates (REML) Cov Parm Estimate GIRL(GRP) Residual Tests of Fixed Effects Source NDF DDF Type III F Pr > F GRP VISIT GRP*VISIT No doubt, we see an interaction GRP*VISIT, or?

50 Repeated measurements, May Autoregressive covariance structure: Results: Covariance Parameter Estimates (REML) σ 2 1 ρ ρ 2 ρ 3 ρ 4 ρ 1 ρ ρ 2 ρ 3 ρ 2 ρ 1 ρ ρ 2 ρ 3 ρ 2 ρ 1 ρ ρ 4 ρ 3 ρ 2 ρ 1 Cov Parm Subject Estimate AR(1) GIRL(GRP) Residual Source Tests of Fixed Effects NDF DDF Type III F Pr > F GRP VISIT GRP*VISIT

51 Repeated measurements, May Comparison of test results for the test of no interaction GRP*VISIT: Covariance structure Test statistic distribution P value Independence 0.35 F(4,491) 0.84 Compound symmetry 5.30 F(4,382) Autoregressive 5.30 F(4,381) Unstructured 2.72 F(4,107) 0.034

52 Repeated measurements, May Predicted profiles for the unstructured covariance: The evolution over time looks pretty linear Include time=visit as a quantitative covariate? What about the baseline difference?

53 Repeated measurements, May Test of linear time trend (time=visit): proc mixed data=calcium; class grp girl visit; model bmd=grp time grp*time visit grp*visit / ddfm=satterth; repeated visit / type=un subject=girl(grp) r; run; proc mixed data=calcium; class grp girl visit; model bmd=grp time grp*time visit / s ddfm=satterth; repeated visit / type=un subject=girl(grp) r; run; Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time 0... time*grp 0... visit grp*visit

54 Repeated measurements, May Solution for Fixed Effects Standard Effect grp visit Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P time <.0001 time*grp C time*grp P visit visit visit visit visit Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time 0... time*grp visit There is some deviation from linearity (P=0.0151)

55 Repeated measurements, May The time course is reasonably linear, but maybe the girls have have different growth rates (slopes)? If we let Y gpt denote BMD for the p th girl (in the g th group) at time t (t=1,,5), we have the model: y gpt = a gp + b gp t + ε gpt, ε gpt N(0, σ 2 W)

56 Repeated measurements, May We generalize the idea of a random level to Random regression We let each individual (girl) have her own level a gp her own slope b gp

57 Repeated measurements, May But we bind these individual parameters (a gp and b gp ) together by normal distributions a gp b gp N 2 α g β g, G G = τ2 a ω ω τ 2 b = τ2 a ρτ a τ b ρτ a τ b τb 2 G describes the population variation of the lines, i.e. the inter-individual variation.

58 Repeated measurements, May We estimate in this model by writing: proc mixed covtest data=calcium; class grp girl; model bmd=grp time time*grp / ddfm=satterth s; random intercept time / type=un subject=girl(grp) g v vcorr; run; Estimated G Matrix Row Effect grp girl Col1 Col2 1 Intercept C E-6 2 time C E Estimated V Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col

59 Repeated measurements, May Estimated V Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col Covariance Parameter Estimates Standard Z Cov Parm Subject Estimate Error Value Pr Z UN(1,1) girl(grp) <.0001 UN(2,1) girl(grp) 3.733E UN(2,2) girl(grp) E <.0001 Residual <.0001 Fit Statistics -2 Res Log Likelihood AIC (smaller is better)

60 Repeated measurements, May Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P time <.0001 time*grp C time*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time <.0001 time*grp Thus, we find an extra increase in BMD of (0.0016) g pr. cm 3 pr. half year, when giving calcium supplement.

61 Repeated measurements, May Note concerning MIXED-notation It is necessary to use TYPE=UN in the RANDOM-statement in order to allow intercept and slope to be arbitrarily correlated Default option in RANDOM is TYPE=VC, which only specifies variance components with different variances If TYPE=UN is omitted, we may experience convergence problems and sometimes totally incomprehensible results. In this particular case, the correlation between intercept and slope is not that impressive (intercept is not completely out of range in this example).

62 Repeated measurements, May It turns out, that the girls are only seen approximately twice a year the actual dates are available and are translated into ctime, the internal date representation in SAS, denoting days since... We can no longer use the construction type=un, but still the random-statement (CS). A lot of other covariance structures will still be possible, e.g. the autoregressive type=ar(1) the girls were not precisely 11 years at the first visit As a covariate, we ought to have the specific age of the girl, but unfortunately, these are not available.

63 Repeated measurements, May If we use the newly constructed ctime: proc mixed covtest data=calcium; class grp girl; model bmd=grp ctime ctime*grp / ddfm=satterth s; random intercept ctime / type=un subject=girl(grp) g; run; Iteration History Iteration Evaluations -2 Res Log Like Criterion WARNING: Did not converge.

64 Repeated measurements, May The variablen ctime has much too large values, with a very small range, and we get numerical instability. We normalise, to approximate age or age11: age=(ctime-11475)/ ; age11=age-11; Variable N Mean Std Dev Minimum Maximum ctime bmd visit age age

65 Repeated measurements, May Random regression, covariate age: y gpt = a gp + b gp (age-11) + ε gpt

66 Repeated measurements, May Random regression, using actual age (age11=age-11): proc mixed covtest data=calcium; class grp girl; model bmd=grp age11 age11*grp / ddfm=satterth s outpm=predicted_mean; random intercept age11 / type=un subject=girl(grp) g vcorr; run; Estimated G Matrix Row Effect grp girl Col1 Col2 1 Intercept C age11 C Estimated V Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col

67 Repeated measurements, May Covariance Parameter Estimates Standard Z Cov Parm Subject Estimate Error Value Pr Z UN(1,1) girl(grp) <.0001 UN(2,1) girl(grp) UN(2,2) girl(grp) <.0001 Residual <.0001 Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P age <.0001 age11*grp C age11*grp P In this model, we quantify the effect of a calcium supplement to (0.0031) g per cm 3 per year.

68 Repeated measurements, May Results from random regression: Group level at age 11 slope P (0.0087) (0.0022) C (0.0088) (0.0022) difference (0.0124) (0.0031) P

69 Repeated measurements, May Comparison of slopes for different covariance structures: Covariance 2 log L cov.par. AIC Difference structure of slopes P Independence (0.0086) 0.27 Compound (0.0020) < symmetry Exponential (0.0032) (autoregressive) Random (0.0031) regression

70 Repeated measurements, May Predicted values from random regression It looks as if there is a difference right from the start (although we have previously seen this to be insignificant, P=0.37). Baseline adjustment?

71 Repeated measurements, May It the first visit is a baseline measurement: The two groups are known to be equal at baseline To include this measurement in the comparison between groups may therefore weaken a possible difference between these (type 2 error) Dissimilarities may be present in small studies For slowly varying outcomes, even a small difference may produce non-treatment related differences, i.e. bias

72 Repeated measurements, May Approaches for handling baseline differences: Use follow-up data only (exclude baseline from analysis) - only reasonable if correlation between repeated measurements is very low Subtract baseline from successive measurements - only reasonable if correlation between repeated measurements is very high Use baseline measurement as a covariate - may be used for any degree of correlation

73 Repeated measurements, May Baseline included as a covariate will hardly change the results for the slopes since these are within-individual quantities A small change is expected because of the exclusion of visit 1 from the analysis. may affect the difference between groups at fixed ages e.g. endpoint age of 13 years

74 Repeated measurements, May Excluding baseline (4 visits only), without baseline as covariate: proc mixed covtest noclprint data=calcium; where visit>1; class grp girl; model bmd=grp age13 grp*age13 / ddfm=satterth s; random intercept age13 / type=un subject=girl(grp) g; run; Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P age <.0001 age13*grp C age13*grp P Estimated gain at the age 13: (0.0138) g per cm 3

75 Repeated measurements, May Including baseline as covariate proc mixed covtest noclprint data=calcium; where visit>1; class grp girl; model bmd=baseline grp age13 grp*age13 / ddfm=satterth s; random intercept age13 / type=un subject=girl(grp) g; run; Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept baseline <.0001 grp C grp P age <.0001 age13*grp C age13*grp P Estimated gain at the age 13: (0.0062) g per cm 3

76 Repeated measurements, May Including baseline as covariate explains some (but not all) of the difference between groups at age 13 without baseline: (0.0138) baseline as covariate: (0.0062) increases the precision of the estimated difference (standard error becomes smaller)

77 Repeated measurements, May Example from Vickers, A.J. & Altman, D.G.: Analysing controlled clinical trials with baseline and follow-up measurements. British Medical Journal 2001; 323: : 52 patients with shoulder pain are randomized to either Acupuncture (n=25) Placebo (n=27) Pain is evaluated on a 100 point scale before and after treatment. High scores are good

78 Repeated measurements, May Results: Pain score (mean and SD) placebo acupuncture difference (n=27) (n=25) (95% CI) P-value baseline 53.9 (14.0) 60.4 (12.3) Type of analysis follow-up 62.3 (17.9) 79.6 (17.1) 17.3 (7.5; 27.1) changes* 8.4 (14.6) 19.2 (16.1) 10.8 (2.3; 19.4) ancova 12.7 (4.1; 21.3) * results published

79 Repeated measurements, May Baseline The placebo group lies somewhat below acupuncture Follow-up We would expect the placebo group to be lower also after treatment Therefore, the comparison is unreasonable (we see too big a difference) Change Low baseline implies an expected large positive change (regression to the mean) The placebo group is therefore expected to increase the most Therefore, the comparison is unreasonable (we see too small a difference)

80 Repeated measurements, May What to do in such a situation? Ancova Analysis of covariance, a special case of multiple regression: Outcome: follow-up data Covariates treatment (factor: acupuncture/placebo) baseline measurement (quantitative)

81 Repeated measurements, May When can we use follow-up data? when we have a control group and proper randomisation when the correlation is low When can we use differences? when we have a control group and proper randomisation when the correlation is large When can we use analysis of covariance? allways

82 Repeated measurements, May Specification of mixed models: Systematic variation: Between-individual covariates: treatment, sex, age, baseline value... Within-individual covariates: time, cumulative dose, temperature... Random variation

83 Repeated measurements, May Sources of random variation: 1. Random effects: 2. Serial correlation: 3. Measurement error:

84 Repeated measurements, May SAS, PROC MIXED model describes the systematic part (fixed effects, mean value structure) random describes the random effects repeated describes the process covariance structure local adds an additional measurement error

85 Repeated measurements, May Explained variation in percent, R 2 We have two (or more) different variances to explain! residual variation (variation within individuals, σ 2 W ) decreases when we include an important x covariate (level 1) may decrease when we include an important z covariate (level 2) variation between individuals, ω 2 B decreases when we include an important z covariate (level 2) may increase, when we include an important x covariate (level 1)

86 Repeated measurements, May Hypothetical example: The x s vary between individuals, but the average outcomes (ȳ) are almost identical: Levels of y, for fixed x are very different!

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of

More information

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences Faculty of Health Sciences Longitudinal data Correlated data Longitudinal measurements Outline Designs Models for the mean Covariance patterns Lene Theil Skovgaard November 27, 2015 Random regression Baseline

More information

Models for longitudinal data

Models for longitudinal data Faculty of Health Sciences Contents Models for longitudinal data Analysis of repeated measurements, NFA 016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 1 / 96 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 One-way anova with random variation The rabbit example Hierarchical

More information

Analysis of variance and regression. December 4, 2007

Analysis of variance and regression. December 4, 2007 Analysis of variance and regression December 4, 2007 Variance component models Variance components One-way anova with random variation estimation interpretations Two-way anova with random variation Crossed

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 27, 2018 1 / 84 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models One-way anova with random variation The rabbit example Hierarchical models with several levels Random regression Lene Theil

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Variance components and LMMs Analysis of repeated measurements, 4th December 2014 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Topics for today Variance components and LMMs Analysis of repeated measurements, 4th December 04 Leftover from 8/: Rest of random regression example. New concepts for today:

More information

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard Faculty of Health Sciences Linear mixed models Analysis of repeated measurements, 10th March 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 80 Program

More information

Variance component models part I

Variance component models part I Faculty of Health Sciences Variance component models part I Analysis of repeated measurements, 30th November 2012 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Varians- og regressionsanalyse

Varians- og regressionsanalyse Faculty of Health Sciences Varians- og regressionsanalyse Variance component models Lene Theil Skovgaard Department of Biostatistics Variance component models Definitions and motivation One-way anova with

More information

SAS Syntax and Output for Data Manipulation:

SAS Syntax and Output for Data Manipulation: CLP 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (2015) chapter 5. We will be examining the extent

More information

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015 university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s Program Faculty of Health Sciences Topics: Linear mixed models

More information

Answer to exercise: Blood pressure lowering drugs

Answer to exercise: Blood pressure lowering drugs Answer to exercise: Blood pressure lowering drugs The data set bloodpressure.txt contains data from a cross-over trial, involving three different formulations of a drug for lowering of blood pressure:

More information

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104 Faculty of Health Sciences Correlated data More about LMMs Lene Theil Skovgaard December 4, 2015 1 / 104 Further topics Model check and diagnostics Cross-over studies Paired T-tests with missing values

More information

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */ CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

More information

Statistics for exp. medical researchers Regression and Correlation

Statistics for exp. medical researchers Regression and Correlation Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

Multi-factor analysis of variance

Multi-factor analysis of variance Faculty of Health Sciences Outline Multi-factor analysis of variance Basic statistics for experimental researchers 2015 Two-way ANOVA and interaction Mathed samples ANOVA Random vs systematic variation

More information

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences Faculty of Health Sciences Variance component models Definitions and motivation Correlated data Variance component models, I Lene Theil Skovgaard November 29, 2013 One-way anova with random variation The

More information

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples

More information

Variance component models

Variance component models Faculty of Health Sciences Variance component models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Topics for

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen Outline Data in wide and long format

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 29, 2016 One-way anova with random variation The rabbit example Hierarchical

More information

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1 CLDP 944 Example 3a page 1 From Between-Person to Within-Person Models for Longitudinal Data The models for this example come from Hoffman (2015) chapter 3 example 3a. We will be examining the extent to

More information

Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars)

Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars) STAT:5201 Applied Statistic II Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars) School math achievement scores The data file consists of 7185 students

More information

Analysis of variance and regression. November 22, 2007

Analysis of variance and regression. November 22, 2007 Analysis of variance and regression November 22, 2007 Parametrisations: Choice of parameters Comparison of models Test for linearity Linear splines Lene Theil Skovgaard, Dept. of Biostatistics, Institute

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen 2 / 28 Preparing data for analysis The

More information

More about linear mixed models

More about linear mixed models Faculty of Health Sciences Contents More about linear mixed models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1 Psyc 945 Example page Example : Unconditional Models for Change in Number Match 3 Response Time (complete data, syntax, and output available for SAS, SPSS, and STATA electronically) These data come from

More information

Correlated data. Further topics. Sources of random variation. Specification of mixed models. Faculty of Health Sciences.

Correlated data. Further topics. Sources of random variation. Specification of mixed models. Faculty of Health Sciences. Faculty of Health Sciences Further topics Correlated data Further topics Lene Theil Skovgaard December 11, 2012 Specification of mixed models Model check and diagnostics Explained variation, R 2 Missing

More information

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Today s Class: The Big Picture ACS models using the R matrix only Introducing the G, Z, and V matrices ACS models

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology Serial Correlation Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 017 Model for Level 1 Residuals There are three sources

More information

Introduction to Random Effects of Time and Model Estimation

Introduction to Random Effects of Time and Model Estimation Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =

More information

STAT 5200 Handout #23. Repeated Measures Example (Ch. 16)

STAT 5200 Handout #23. Repeated Measures Example (Ch. 16) Motivating Example: Glucose STAT 500 Handout #3 Repeated Measures Example (Ch. 16) An experiment is conducted to evaluate the effects of three diets on the serum glucose levels of human subjects. Twelve

More information

Independence (Null) Baseline Model: Item means and variances, but NO covariances

Independence (Null) Baseline Model: Item means and variances, but NO covariances CFA Example Using Forgiveness of Situations (N = 1103) using SAS MIXED (See Example 4 for corresponding Mplus syntax and output) SAS Code to Read in Mplus Data: * Import data from Mplus, becomes var1-var23

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA Paper 188-29 Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA ABSTRACT PROC MIXED provides a very flexible environment in which to model many types

More information

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED.

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Maribeth Johnson, Medical College of Georgia, Augusta, GA ABSTRACT Longitudinal data refers to datasets with multiple measurements

More information

Describing Change over Time: Adding Linear Trends

Describing Change over Time: Adding Linear Trends Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study 1.4 0.0-6 7 8 9 10 11 12 13 14 15 16 17 18 19 age Model 1: A simple broken stick model with knot at 14 fit with

More information

Lab 11. Multilevel Models. Description of Data

Lab 11. Multilevel Models. Description of Data Lab 11 Multilevel Models Henian Chen, M.D., Ph.D. Description of Data MULTILEVEL.TXT is clustered data for 386 women distributed across 40 groups. ID: 386 women, id from 1 to 386, individual level (level

More information

Chapter 3 ANALYSIS OF RESPONSE PROFILES

Chapter 3 ANALYSIS OF RESPONSE PROFILES Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over

More information

Review of Unconditional Multilevel Models for Longitudinal Data

Review of Unconditional Multilevel Models for Longitudinal Data Review of Unconditional Multilevel Models for Longitudinal Data Topics: Course (and MLM) overview Concepts in longitudinal multilevel modeling Model comparisons and significance testing Describing within-person

More information

Covariance Structure Approach to Within-Cases

Covariance Structure Approach to Within-Cases Covariance Structure Approach to Within-Cases Remember how the data file grapefruit1.data looks: Store sales1 sales2 sales3 1 62.1 61.3 60.8 2 58.2 57.9 55.1 3 51.6 49.2 46.2 4 53.7 51.5 48.3 5 61.4 58.7

More information

ANOVA Longitudinal Models for the Practice Effects Data: via GLM

ANOVA Longitudinal Models for the Practice Effects Data: via GLM Psyc 943 Lecture 25 page 1 ANOVA Longitudinal Models for the Practice Effects Data: via GLM Model 1. Saturated Means Model for Session, E-only Variances Model (BP) Variances Model: NO correlation, EQUAL

More information

Repeated Measures Design. Advertising Sales Example

Repeated Measures Design. Advertising Sales Example STAT:5201 Anaylsis/Applied Statistic II Repeated Measures Design Advertising Sales Example A company is interested in comparing the success of two different advertising campaigns. It has 10 test markets,

More information

Models for binary data

Models for binary data Faculty of Health Sciences Models for binary data Analysis of repeated measurements 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 63 Program for

More information

Random Intercept Models

Random Intercept Models Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept

More information

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016 Faculty of Health Sciences Correlated data Count variables Lene Theil Skovgaard & Julie Lyng Forman December 6, 2016 1 / 76 Modeling count outcomes Outline The Poisson distribution for counts Poisson models,

More information

Chapter 9. Multivariate and Within-cases Analysis. 9.1 Multivariate Analysis of Variance

Chapter 9. Multivariate and Within-cases Analysis. 9.1 Multivariate Analysis of Variance Chapter 9 Multivariate and Within-cases Analysis 9.1 Multivariate Analysis of Variance Multivariate means more than one response variable at once. Why do it? Primarily because if you do parallel analyses

More information

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models

More information

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Paper S02-2007 RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Eli Lilly & Company, Indianapolis, IN ABSTRACT

More information

Review of Multilevel Models for Longitudinal Data

Review of Multilevel Models for Longitudinal Data Review of Multilevel Models for Longitudinal Data Topics: Concepts in longitudinal multilevel modeling Describing within-person fluctuation using ACS models Describing within-person change using random

More information

Correlated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models

Correlated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models Faculty of Health Sciences Overview Correlated data More on variance component models Variance component models, II Cross-over studies Non-normal data Comparing measurement devices Lene Theil Skovgaard

More information

Statistical Inference: The Marginal Model

Statistical Inference: The Marginal Model Statistical Inference: The Marginal Model Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Inference for fixed

More information

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad 1 Supplemental Materials Graphing Values for Individual Dyad Members over Time In the main text, we recommend graphing physiological values for individual dyad members over time to aid in the decision

More information

Repeated Measures Data

Repeated Measures Data Repeated Measures Data Mixed Models Lecture Notes By Dr. Hanford page 1 Data where subjects are measured repeatedly over time - predetermined intervals (weekly) - uncontrolled variable intervals between

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages

More information

Describing Within-Person Change over Time

Describing Within-Person Change over Time Describing Within-Person Change over Time Topics: Multilevel modeling notation and terminology Fixed and random effects of linear time Predicted variances and covariances from random slopes Dependency

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual change across time 2. MRM more flexible in terms of repeated

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Here we provide syntax for fitting the lower-level mediation model using the MIXED procedure in SAS as well as a sas macro, IndTest.sas

More information

Model Assumptions; Predicting Heterogeneity of Variance

Model Assumptions; Predicting Heterogeneity of Variance Model Assumptions; Predicting Heterogeneity of Variance Today s topics: Model assumptions Normality Constant variance Predicting heterogeneity of variance CLP 945: Lecture 6 1 Checking for Violations of

More information

36-402/608 Homework #10 Solutions 4/1

36-402/608 Homework #10 Solutions 4/1 36-402/608 Homework #10 Solutions 4/1 1. Fixing Breakout 17 (60 points) You must use SAS for this problem! Modify the code in wallaby.sas to load the wallaby data and to create a new outcome in the form

More information

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements: Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported

More information

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D. Designing Multilevel Models Using SPSS 11.5 Mixed Model John Painter, Ph.D. Jordan Institute for Families School of Social Work University of North Carolina at Chapel Hill 1 Creating Multilevel Models

More information

Introduction to Crossover Trials

Introduction to Crossover Trials Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain

More information

Correlated data. Non-normal outcomes. Reminder on binary data. Non-normal data. Faculty of Health Sciences. Non-normal outcomes

Correlated data. Non-normal outcomes. Reminder on binary data. Non-normal data. Faculty of Health Sciences. Non-normal outcomes Faculty of Health Sciences Non-normal outcomes Correlated data Non-normal outcomes Lene Theil Skovgaard December 5, 2014 Generalized linear models Generalized linear mixed models Population average models

More information

Estimation: Problems & Solutions

Estimation: Problems & Solutions Estimation: Problems & Solutions Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline 1. Introduction: Estimation of

More information

6. Multiple regression - PROC GLM

6. Multiple regression - PROC GLM Use of SAS - November 2016 6. Multiple regression - PROC GLM Karl Bang Christensen Department of Biostatistics, University of Copenhagen. http://biostat.ku.dk/~kach/sas2016/ kach@biostat.ku.dk, tel: 35327491

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

Accounting for Correlation in the Analysis of Randomized Controlled Trials with Multiple Layers of Clustering

Accounting for Correlation in the Analysis of Randomized Controlled Trials with Multiple Layers of Clustering Duquesne University Duquesne Scholarship Collection Electronic Theses and Dissertations Spring 2016 Accounting for Correlation in the Analysis of Randomized Controlled Trials with Multiple Layers of Clustering

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Cross-over Designs #: DESIGNING CLINICAL RESEARCH The subtraction of measurements from the same subject will mostly cancel or minimize effects

More information

An R # Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM

An R # Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM An R Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM Lloyd J. Edwards, Ph.D. UNC-CH Department of Biostatistics email: Lloyd_Edwards@unc.edu Presented to the Department

More information

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d. Research Design: Topic 8 Hierarchical Linear Modeling (Measures within Persons) R.C. Gardner, Ph.d. General Rationale, Purpose, and Applications Linear Growth Models HLM can also be used with repeated

More information

Swabs, revisited. The families were subdivided into 3 groups according to the factor crowding, which describes the space available for the household.

Swabs, revisited. The families were subdivided into 3 groups according to the factor crowding, which describes the space available for the household. Swabs, revisited 18 families with 3 children each (in well defined age intervals) were followed over a certain period of time, during which repeated swabs were taken. The variable swabs indicates how many

More information

Analyzing the Behavior of Rats by Repeated Measurements

Analyzing the Behavior of Rats by Repeated Measurements Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics 5-3-007 Analyzing the Behavior of Rats by Repeated Measurements Kenita A. Hall

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Gov 2000: 9. Regression with Two Independent Variables

Gov 2000: 9. Regression with Two Independent Variables Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics

More information

Step 2: Select Analyze, Mixed Models, and Linear.

Step 2: Select Analyze, Mixed Models, and Linear. Example 1a. 20 employees were given a mood questionnaire on Monday, Wednesday and again on Friday. The data will be first be analyzed using a Covariance Pattern model. Step 1: Copy Example1.sav data file

More information

STAT 705 Generalized linear mixed models

STAT 705 Generalized linear mixed models STAT 705 Generalized linear mixed models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 24 Generalized Linear Mixed Models We have considered random

More information

Overview. Prerequisites

Overview. Prerequisites Overview Introduction Practicalities Review of basic ideas Peter Dalgaard Department of Biostatistics University of Copenhagen Structure of the course The normal distribution t tests Determining the size

More information

De-mystifying random effects models

De-mystifying random effects models De-mystifying random effects models Peter J Diggle Lecture 4, Leahurst, October 2012 Linear regression input variable x factor, covariate, explanatory variable,... output variable y response, end-point,

More information

Rerandomization to Balance Covariates

Rerandomization to Balance Covariates Rerandomization to Balance Covariates Kari Lock Morgan Department of Statistics Penn State University Joint work with Don Rubin University of Minnesota Biostatistics 4/27/16 The Gold Standard Randomized

More information

Multi-factor analysis of variance

Multi-factor analysis of variance Faculty of Health Sciences Outline Multi-factor analysis of variance Basic statistics for experimental researchers 2016 Two-way ANOVA and interaction Matched samples ANOVA Random vs systematic variation

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 1 DAVID C. HOWELL 4/26/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 1 DAVID C. HOWELL 4/26/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 1 DAVID C. HOWELL 4/26/2010 FOR THE SECOND PART OF THIS DOCUMENT GO TO www.uvm.edu/~dhowell/methods/supplements/mixed Models Repeated/Mixed Models for

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: Summary of building unconditional models for time Missing predictors in MLM Effects of time-invariant predictors Fixed, systematically varying,

More information

Dynamic Determination of Mixed Model Covariance Structures. in Double-blind Clinical Trials. Matthew Davis - Omnicare Clinical Research

Dynamic Determination of Mixed Model Covariance Structures. in Double-blind Clinical Trials. Matthew Davis - Omnicare Clinical Research PharmaSUG2010 - Paper SP12 Dynamic Determination of Mixed Model Covariance Structures in Double-blind Clinical Trials Matthew Davis - Omnicare Clinical Research Abstract With the computing power of SAS

More information

Analysis of repeated measurements (KLMED8008)

Analysis of repeated measurements (KLMED8008) Analysis of repeated measurements (KLMED8008) Eirik Skogvoll, MD PhD Professor and Consultant Institute of Circulation and Medical Imaging Dept. of Anaesthesiology and Emergency Medicine 1 Day 2 Practical

More information

STAT3401: Advanced data analysis Week 10: Models for Clustered Longitudinal Data

STAT3401: Advanced data analysis Week 10: Models for Clustered Longitudinal Data STAT3401: Advanced data analysis Week 10: Models for Clustered Longitudinal Data Berwin Turlach School of Mathematics and Statistics Berwin.Turlach@gmail.com The University of Western Australia Models

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA Faculty of Health Sciences Outline Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA Lene Theil Skovgaard Sept. 14, 2015 Paired comparisons: tests and confidence intervals

More information

Estimation: Problems & Solutions

Estimation: Problems & Solutions Estimation: Problems & Solutions Edps/Psych/Soc 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline 1. Introduction: Estimation

More information