Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

Size: px
Start display at page:

Download "Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data"

Transcription

1 Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of Copenhagen Presentation of data. Simple approaches. Correlation structures. Random regression. Baseline considerations. Home page: / 84 2 / 84 Typical set-up for repeated measurements Traditional presentation of data Two or more groups of subjects (typically receiving different treatments) Randomization at baseline Longitudinal measurements of the same quantity over time for each subject, typically as a function of time (duration of treatment) age cumulative dose of some drug Level 1: Single observations Level 2: Patients/Subjects Example: Aspirin absorption for healthy and ill subjects (Matthews et.al.,1990) 3 / 84 4 / 84

2 Problems with the traditional presentation of data Take care with average curves Comparison of groups for each time point separately is inefficient has a high risk of leading to chance significance Tests are not independent, since they are carried out on the same subjects Interpretation may be difficult Changes over time cannot be evaluated by eye because we cannot see the pairing They may hide important structures They give no indication of the variation in the time profiles Always make a picture of individual time profiles Do not average over individual profiles, unless these have identical shapes, i.e. only shifts in level are seen between individuals. Alternative: Calculate individual characteristics 5 / 84 6 / 84 Individual time profiles Commonly used characteristics Spaghettiplot - divided into groups Do we see time profiles of identical shape? Are the averages representative? The response for selected times, e.g. endpoint Average The slope, perhaps for a specific period Peak value Time to peak The area under the curve (AUC). A measure of cyclic behaviour. These are analyzed as new observations. 7 / 84 8 / 84

3 Example: Aspirin absorption Repeated measurement designs Characteristics: time to peak peak value Conclusion: P=0.02 for identity of peak values. Remember quantifications! Merits It is much more powerful in detecting time changes (data are paired with the subject as its own control) We may discover that subjects have different time courses (In designs with only cross-sectional data, this may also be the case, but we have no way of knowing!) We may identify important characteristics of the time courses, specific for each subject (trend, peak etc.) 9 / / 84 Repeated measurement designs Example: Calcium supplements Drawbacks Traditional independence assumption is violated since repeated observations on the same individual are correlated (look alike) Traditional anova-models become impossible Comparison of time averages (or other characteristics) cannot incorporate time dependent covariates A total of year old girls were randomized to receive either calcium or placebo. Outcome: BMD=bone mineral density, in measured every 6 months (5 visits) g cm 2, Scientific question: Does calcium improve the rate of bone gain for adolescent women? 11 / / 84

4 Individual profiles Factor diagram [Girl] Grp [I ] = [Girl Visit] Grp Visit Visit Two-level model with: Observations Y git (group=g, girl=individual=i, visit=time=t) Systematic (fixed) effects of group and visit, with a possible interaction Random girl-level, Var(a gi ) = ωb 2, i.e., random intercept Residual variation, within girls, Var(ε git ) = σ 2 W 13 / / 84 Multilevel model structure Random girl level in SAS Y git = µ gt + A gi + ε git Level 1 2 Unit single measurements girls Variation within girls between girls σw 2 ωb 2 Covariates x z visit, grp*visit grp proc mixed data=calcium; class grp girl visit; model bmd=grp visit grp*visit / ddfm=satterth s; random girl(grp); Girls are nested in groups, specified by the notation random girl(grp); satterth could be replaced by kenwardroger 15 / / 84

5 The options Random girl level, output ddfm=satterth (- or kenwardrogers): When the distributions are exact, they have no effect in balanced situations When approximations are necessary, these two are considered best in unbalanced situations, i.e for almost all observational designs in case of missing observations It may give rise to fractional degrees of freedom The computations may require a little more time, but in most cases this will not be noticable When in doubt, use it! 17 / 84 Covariance Parameter Estimates (REML) Cov Parm Estimate GIRL(GRP) Residual Tests of Fixed Effects Source NDF DDF Type III F Pr > F GRP VISIT GRP*VISIT No doubt, we see an interaction GRP*VISIT 18 / 84 Random girl level, output, continued Model checks Solution for Fixed Effects Standard Effect grp visit Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P visit <.0001 visit <.0001 visit <.0001 visit <.0001 visit grp*visit C <.0001 grp*visit C grp*visit C grp*visit C grp*visit C grp*visit P grp*visit P grp*visit P grp*visit P grp*visit P Two types of residuals: Ordinary Observed minus predicted group mean (only systematic effects) Y git ˆµ gt Conditional Observed minus predicted individual mean value (systematic and random effects) ˆε git = Y git ˆµ gt ˆ A gi Difference between groups at visit 5 Difference increases over time 19 / / 84

6 Model check, ordinary residuals Model check, conditional residuals 21 / / 84 Correlated observations Level 1 covariates (unit: single observations) If we fail to take the correlation into account, we will experience: possible bias in the mean value structure in unbalanced situations low efficiency (type 2 error) for estimation of level 1 covariates (time-related effects) too small standard errors (type 1 error) for estimates of level 2 effects (treatments) Time itself Covariates varying with time: blood pressure, heart rate, age Interaction between group and time If correlation is not taken into account, we ignore the paired situation, leading to low efficiency, i.e. too large P-values Type 2 error Effects may go undetected! 23 / / 84

7 Incorrect analysis Level 2 covariates (unit: individuals) Correlation ignored proc glm data=calcium; class grp visit; model bmd=grp visit grp*visit / solution; Source DF Type III SS Mean Square F Value Pr > F grp visit <.0001 grp*visit The interaction is not detected since we forgot the pairing Treatment Gender, age If correlation is ignored, we act as if we have more information than we actually have, leading to too small P-values Type 1 error Noise may be taken to be real effects! 25 / / 84 Model synonyms Compound symmetry = Exchangeability Two-level model Model with random subject levels Model with random intercepts Model with compound symmetry correlation structure Model with exchangeability correlation structure The two-level model assumes compound symmetry, i.e., that all measurements on the same individual are equally correlated: Corr(Y git1, Y git2 ) = ρ = ω 2 B ω 2 B + σ2 W This means that the distance in time is not taken into account!! Observations are exchangeable 27 / / 84

8 Compound symmetry structure Estimation of correlation ρ ωb 2 + σ2 W ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 + σ2 W ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 + σ2 W ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 + σ2 W ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 ωb 2 + σ2 W = (ω 2 B + σ2 W ) 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 But: Observations taken close to each other in time will often be more closely correlated than observations taken further apart! The correlation ρ is estimated by using output from p. 18 ρ = Corr(Y gdt1, Y gdt2 ) = ω 2 B ω 2 B + σ2 W = High tracking, meaning that each girl stays more or less at the same rank in the group 29 / / 84 Alternative specifications Compound symmetry analysis (just checking...) Note, that the specification random girl(grp); can be written in two other ways: random intercept / subject=girl(grp); repeated visit / type=cs subject=girl(grp); CS: Compound symmetry In the following, we shall see generalizations of the constructions above. proc mixed data=calcium; class grp girl visit; model bmd=grp visit grp*visit / ddfm=satterth outpredm=fit_cs; repeated visit / type=cs subject=girl(grp) rcorr; Estimated R Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col / / 84

9 Compound symmetry analysis, continued Empirical correlation structure Covariance Parameter Estimates Cov Parm Subject Estimate CS girl(grp) Residual Row COL1 COL2 COL3 COL4 COL Fit Statistics -2 Res Log Likelihood < used later (p ) Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp visit <.0001 grp*visit Is compound symmetry reasonable? Other possibilities: Hardly... Unstructured: T(T+1) 2 = 15 covariance parameters (T = 5) Patterned, e.g. an autoregressive structure Random regression 33 / / 84 Unstructured covariance Output from TYPE=UN model If we do not assume any specific structure for the covariance, we may let it be arbitrary = unstructured This is done in MIXED by using type=un proc mixed data=calcium; class grp girl visit; model bmd=grp visit grp*visit / ddfm=satterth outpredm=fit_un; repeated visit / type=un subject=girl(grp) rcorr; Estimated R Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col Fit Statistics -2 Res Log Likelihood < used later (p ) Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp visit <.0001 grp*visit / / 84

10 Unstructured covariance Advantages with unstructured covariance We do not force a wrong covariance structure upon our observations. We gain some insight in the actual structure of the covariance. Drawbacks of the unstructured covariance We use quite a lot of parameters to describe the covariance structure. The result may therefore be unstable. It cannot be used for small data sets It can only be used in case of balanced data (all subjects have to be measured at identical times) Can we do something in between? 37 / 84 Comparison of covariance structures Use the likelihood: Good models have large values of likelihood L and therefore small values of deviance: 2 log L Use differences in deviances ( = 2 log Q) and compare to χ 2 with degrees of freedom equal to the difference in parameters Comparison of CS and UN: 2 log Q = = χ 2 (15 2) = χ 2 (13) P < Compound symmetry is not suitable 38 / 84 Different forms of the likelihood Autoregressive structure - of first order Default likelihood is the REML-likelihood, where the mean value structure has been eliminated The traditional likelihood may be obtained using an extra option: proc mixed method=ml; Comparison of covariance structures: Use either of the two likelihoods Comparison of mean value structures: Use only the traditional likelihood (ML) only! But mostly, just use the default F-tests In case of equidistant times, this specifies the following covariance structure σ 2 1 ρ ρ 2 ρ 3 ρ 4 ρ 1 ρ ρ 2 ρ 3 ρ 2 ρ 1 ρ ρ 2 ρ 3 ρ 2 ρ 1 ρ ρ 4 ρ 3 ρ 2 ρ 1 i.e. the correlation decreases (in powers) with the distance between observations. The non-equidistant analogue is Corr(Y git1, Y git2 ) = ρ t 1 t 2 39 / / 84

11 Autoregressive correlation Autoregressiove covariance structure in SAS as a function of distance between the measurements for ρ = 0.1,..., 0.9 proc mixed data=calcium; class grp girl visit; model bmd=grp visit grp*visit / ddfm=satterth outpredm=fit_ar1; repeated visit / type=ar(1) subject=girl(grp) rcorr; 41 / / 84 Output from TYPE=AR(1) structure Estimated R Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col Covariance Parameter Estimates Cov Parm Subject Estimate AR(1) girl(grp) Residual Fit Statistics -2 Res Log Likelihood < used later (p. 47) Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp visit <.0001 grp*visit Note: Comparison of models with different covariance structures requires, that the models are nested This is not the case for CS and AR(1)! Therefore, we have to compare both of them with the model which combines the two covariance structures: proc mixed data=calcium; class grp girl visit; model bmd=grp visit grp*visit / ddfm=satterth outpredm=fit_ar1; random intercept / subject=girl(grp) g; repeated visit / type=ar(1) subject=girl(grp) rcorr; 43 / / 84

12 Combination of CS and AR(1) In case of equidistant times, this combined model specifies the following covariance structure ω 2 + σ 2 ω 2 + σ 2 ρ ω 2 + σ 2 ρ 2 ω 2 + σ 2 ρ 3 ω 2 + σ 2 ρ 4 ω 2 + σ 2 ρ ω 2 + σ 2 ω 2 + σ 2 ρ ω 2 + σ 2 ρ 2 ω 2 + σ 2 ρ 3 ω 2 + σ 2 ρ 2 ω 2 + σ 2 ρ ω 2 + σ 2 ω 2 + σ 2 ρ ω 2 + σ 2 ρ 2 ω 2 + σ 2 ρ 3 ω 2 + σ 2 ρ 2 ω 2 + σ 2 ρ ω 2 + σ 2 ω 2 + σ 2 ρ ω 2 + σ 2 ρ 4 ω 2 + σ 2 ρ 3 ω 2 + σ 2 ρ 2 ω 2 + σ 2 ρ ω 2 + σ 2 Dividing all entries with ω 2 + σ 2 gives the correlation matrix, with correlations ω 2 + σ 2 ρ d 45 / 84 ω 2 + σ 2 Output from CS+AR(1) Estimated R Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col Covariance Parameter Estimates Cov Parm Subject Estimate Intercept girl(grp) 0 AR(1) girl(grp) Residual Fit Statistics -2 Res Log Likelihood < used later (p. 47) Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp visit <.0001 grp*visit / 84 Comparison of covariance structures Test of no interaction GRP*VISIT cov. = Model -2 log L par. 2 log Q df P UN AR(1) + CS AR(1) CS < Conclusions? The autoregressive structure is definitely better than CS, but not good quite enough... for various choices of covariance structure Covariance structure Test statistic distribution P value Independence 0.35 F(4,491) 0.84 Compound symmetry 5.30 F(4,382) Autoregressive 2.86 F(4,382) Unstructured 2.72 F(4,107) Not quite identical conclusions! 47 / / 84

13 Adding an extra level Predicted mean time profiles What, if we had had double or triple measurements at each visit? If we always have the same number of repetitions, a correct and optimal approach is to analyze averages If the number of repetitions vary, analysis of averages may still be valid (depends on the reason for the unbalance), although not optimal The easiest approach is to modify the random-statement to: random girl girl*visit; In SAS: Use the outpredm=newdata-option and make pictures using this new data set Profiles are almost identical for all choices of covariance structures For balanced designs, they agree completely and equal simple averages They agree for time points with no missing values (here the first visit) 49 / / 84 Predicted profiles for the unstructured covariance Individual growth rates? The time course is reasonably linear, but maybe the girls have different growth rates (slopes)? If we let Y git denote BMD for the i th girl (in the g th group) at time t (t=1,,5), we could look at the model: The evolution over time looks pretty linear Include time=visit as a quantitative covariate? What about the baseline difference? later... y git = A gi + B gi t + ε git, ε git N (0, σ 2 ) i.e., with different intercepts and different slopes for each girl 51 / / 84

14 Fit a straight line for each girl Results from individual regression Scatterplot of slopes vs. levels at first visit: Estimates with standard errors in brackets: Group Level at age 11 Slope P (0.0086) (0.0014) C (0.0088) (0.0014) Difference (0.0123) (0.0019) P-value Slopes in the Calcium-group (blue dots) seem to be bigger / / 84 Random regression a generalization of the idea of a random level We let each individual (girl) have her own level A gi... we bind these individual parameters (A gi and B gi ) together by normal distributions G = ( Agi B gi ) ( τ 2 a ω ω τ 2 b N 2 (( αg ) = β g ), G ) ( τ 2 a ρτ a τ b ρτ a τ b τ 2 b ) her own slope B gi but... G describes the population variation of the lines, i.e. the inter-individual variation (as seen on p. 53). 55 / / 84

15 Estimation in random regression Output from random regression proc mixed covtest data=calcium; class grp girl; model bmd=grp time time*grp / ddfm=satterth s; random intercept time / type=un subject=girl(grp) g vcorr; type=un in the random-statement refers to the matrix G on the previous slide Covariance Parameter Estimates Standard Z Cov Parm Subject Estimate Error Value Pr Z UN(1,1) girl(grp) <.0001 UN(2,1) girl(grp) 3.733E UN(2,2) girl(grp) E <.0001 Residual <.0001 Fit Statistics -2 Res Log Likelihood Estimated G Matrix Row Effect grp girl Col1 Col2 1 Intercept C E-6 2 time C E / / 84 Output II: Estimated covariance and correlation Output III: Estimated mean value structure Estimated V Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col Estimated V Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P time <.0001 time*grp C time*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp time <.0001 time*grp Thus, we find an extra increase in BMD of (0.0016) g per cm 3 per half year, when giving calcium supplement. 59 / / 84

16 Note concerning MIXED-notation Individual regressions approach It is necessary to use TYPE=UN in the RANDOM-statement in order to allow intercept and slope to be arbitrarily correlated Default option in RANDOM is TYPE=VC, which only specifies variance components with different variances If TYPE=UN is omitted, we may experience convergence problems and sometimes totally incomprehensible results. In this particular case, the correlation between intercept and slope is not that impressive - actually only (intercept is not completely out of range in this example, since it refers to visit=0). Merits: Easy to understand and interpret Drawbacks: Suboptimal in case of unequal sample sizes Only simple models feasible Difficult/impossible to include covariates 61 / / 84 Random regression approach What to choose in this example? Merits: Uses all available information Optimal procedure if the model holds Easy to include covariates Drawbacks: Biased in case of informative missing values or informative sample sizes Random regression gives steeper slope almost identical levels at age 11 The girls with flat low profiles tend to be shorter: Why?? 63 / / 84

17 Actually as it always happens Furthermore, The girls are only seen approximately twice a year The actual dates are available and are translated into ctime, the internal date representation in SAS, denoting days since... We can no longer use the construction type=un, but still the random-statement and the CS in the repeated-statement. A lot of other covariance structures will still be possible, e.g. The non-equidistant analogue to the autoregressive structure is Corr(Y git1, Y git2 ) = ρ t 1 t 2 which is written as TYPE=SP(POW)(ctime) the girls were not precisely 11 years at the first visit As a covariate, we ought to have the specific age of the girl, but unfortunately, these are not available. Note, that this will mostly affect the intercept estimates! 65 / / 84 Using the newly constructed ctime as covariate proc mixed covtest data=calcium; class grp girl; model bmd=grp ctime ctime*grp / ddfm=satterth s; random intercept ctime / type=un subject=girl(grp) g; Iteration History Numerical instability The variable ctime has much too large values, with a very small range. We normalize, to approximate age or age11: age=(ctime-11475)/ ; age11=age-11; /* intercept at age 11 */ Iteration Evaluations -2 Res Log Like Criterion 67 / WARNING: Did not converge. Variable N Mean Minimum Maximum ctime bmd visit age age / 84

18 Random regression, using actual age Intercept at age 11: age11=age-11: proc mixed covtest data=calcium; class grp girl; model bmd=grp age11 age11*grp / ddfm=satterth s outpm=predicted_mean; random intercept age11 / type=un subject=girl(grp) g vcorr; Estimated G Matrix Row Effect grp girl Col1 Col2 1 Intercept C age11 C Random regression, covariate age11: Estimated V Correlation Matrix for girl(grp) 101 C Row Col1 Col2 Col3 Col4 Col5 69 / 84 y git = A gi + B gi (age-11) + ε git / 84 Results from random regression Covariance Parameter Estimates Standard Z Cov Parm Subject Estimate Error Value Pr Z UN(1,1) girl(grp) <.0001 UN(2,1) girl(grp) UN(2,2) girl(grp) <.0001 Residual <.0001 Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept <.0001 / grp C grp P \ age <.0001 / age11*grp C age11*grp P \ In this model, we quantify the effect of a calcium supplement to (0.0031) g per cm 3 per year. Group Level at age 11 Slope P (0.0087) (0.0022) C (0.0088) (0.0022) Difference (0.0124) (0.0031) P Compare to results from individual regressions (p. 54): 71 / / 84

19 Comparison of slopes for different covariance structures Reparametrization, intercept at age 13 Covariance 2 log L Cov.par. Difference structure in slopes P Independence (0.0086) 0.27 Compound (0.0020) < Symmetry Power (0.0032) (Autoregressive) Random (0.0031) Regression Using age13=age-13 as covariate, and no baseline yet: proc mixed covtest noclprint data=calcium; class grp girl; model bmd=grp age13 grp*age13 / ddfm=satterth s; random intercept age13 / type=un subject=girl(grp) g; Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P age <.0001 age13*grp C age13*grp P Estimated gain at the age 13: (0.0140) g per cm 3 73 / / 84 Predicted values from random regression It looks as if there is a difference right from the start (although we have previously seen this to be insignificant, P=0.37). It the first visit is a baseline measurement (which it is), and randomization has been performed: The two groups are known to be equal at baseline To include this measurement in the comparison between groups may weaken a possible difference between these (type 2 error) may convert a treatment effect to an interaction Dissimilarities may be present in small studies For slowly varying outcomes, even a small difference may produce non-treatment related differences, i.e. bias Baseline adjustment? 75 / / 84

20 Hypothetical comparison of two treatment groups, A Hypothetical comparison of two treatment groups, B Truth: Constant difference between the treatments Finding: Interaction between time and treatment Truth: No effect of treatment Finding: Constant difference between treatments 77 / / 84 Baseline difference I: Observational studies Research question: Compare the outcomes for individuals from different groups (e.g. gender or illness groups): The groups are likely to differ in many respects, including baseline outcome value. Differences in the outcome may be due to any of these characteristics, and the answer will depend on which of these are included in the model. Adjust for the covariates that are sensible in the context. The scientific question answered depends upon the model 79 / 84 Baseline difference II: Randomized studies Research question: Compare the outcomes for individuals treated differently, but otherwise identical (with respect to all baseline characteristics, including baseline outcome value): There ought to be no difference in either covariates or baseline outcome. Even so, small chance differences in baseline may create important outcome differences that may erroneously be taken to be treatment effects, if the covariate (or baseline) is highly predictive of the outcome. Use baseline measurement as a covariate (Ancova) to adjust for chance differences. Adjust also for other important covariates. The scientific question answered depends upon the model 80 / 84

21 Approaches for handling baseline Excluding baseline (4 follow-up visits only) in randomized studies Use follow-up data only (exclude baseline from analysis) - most reasonable if correlation between repeated measurements is very low Subtract baseline from successive measurements - most reasonable if correlation between repeated measurements is very high Use baseline measurement as a covariate (Ancova) - may be used for any degree of correlation and gives the most sensible interpretation without baseline as covariate: proc mixed covtest noclprint data=calcium; where visit>1; class grp girl; model bmd=grp age13 grp*age13 / ddfm=satterth s; random intercept age13 / type=un subject=girl(grp) g; Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept <.0001 grp C grp P age <.0001 age13*grp C age13*grp P Estimated gain at the age 13: (0.0138) g per cm 3 81 / / 84 Including baseline as covariate Effect of including baseline as covariate proc mixed covtest noclprint data=calcium; where visit>1; class grp girl; model bmd=baseline grp age13 grp*age13 / ddfm=satterth s; random intercept age13 / type=un subject=girl(grp) g; Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Intercept baseline <.0001 grp C grp P age <.0001 age13*grp C age13*grp P explains some (but not all) of the difference between groups at age 13 without baseline: (0.0138) baseline as covariate: (0.0062) increases the precision of the estimated difference (standard error becomes smaller) It even becomes significant! Estimated gain at the age 13: (0.0062) g per cm 3 83 / / 84

Analysis of variance and regression. May 13, 2008

Analysis of variance and regression. May 13, 2008 Analysis of variance and regression May 13, 2008 Repeated measurements over time Presentation of data Traditional ways of analysis Variance component model (the dogs revisited) Random regression Baseline

More information

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences Faculty of Health Sciences Longitudinal data Correlated data Longitudinal measurements Outline Designs Models for the mean Covariance patterns Lene Theil Skovgaard November 27, 2015 Random regression Baseline

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 1 / 96 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 One-way anova with random variation The rabbit example Hierarchical

More information

Models for longitudinal data

Models for longitudinal data Faculty of Health Sciences Contents Models for longitudinal data Analysis of repeated measurements, NFA 016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 27, 2018 1 / 84 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models One-way anova with random variation The rabbit example Hierarchical models with several levels Random regression Lene Theil

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Variance components and LMMs Analysis of repeated measurements, 4th December 2014 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Topics for today Variance components and LMMs Analysis of repeated measurements, 4th December 04 Leftover from 8/: Rest of random regression example. New concepts for today:

More information

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard Faculty of Health Sciences Linear mixed models Analysis of repeated measurements, 10th March 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 80 Program

More information

Variance component models part I

Variance component models part I Faculty of Health Sciences Variance component models part I Analysis of repeated measurements, 30th November 2012 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Answer to exercise: Blood pressure lowering drugs

Answer to exercise: Blood pressure lowering drugs Answer to exercise: Blood pressure lowering drugs The data set bloodpressure.txt contains data from a cross-over trial, involving three different formulations of a drug for lowering of blood pressure:

More information

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */ CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

More information

Analysis of variance and regression. December 4, 2007

Analysis of variance and regression. December 4, 2007 Analysis of variance and regression December 4, 2007 Variance component models Variance components One-way anova with random variation estimation interpretations Two-way anova with random variation Crossed

More information

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015 university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s Program Faculty of Health Sciences Topics: Linear mixed models

More information

SAS Syntax and Output for Data Manipulation:

SAS Syntax and Output for Data Manipulation: CLP 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (2015) chapter 5. We will be examining the extent

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen Outline Data in wide and long format

More information

Statistics for exp. medical researchers Regression and Correlation

Statistics for exp. medical researchers Regression and Correlation Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen 2 / 28 Preparing data for analysis The

More information

Varians- og regressionsanalyse

Varians- og regressionsanalyse Faculty of Health Sciences Varians- og regressionsanalyse Variance component models Lene Theil Skovgaard Department of Biostatistics Variance component models Definitions and motivation One-way anova with

More information

Multi-factor analysis of variance

Multi-factor analysis of variance Faculty of Health Sciences Outline Multi-factor analysis of variance Basic statistics for experimental researchers 2015 Two-way ANOVA and interaction Mathed samples ANOVA Random vs systematic variation

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf

More information

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences Faculty of Health Sciences Variance component models Definitions and motivation Correlated data Variance component models, I Lene Theil Skovgaard November 29, 2013 One-way anova with random variation The

More information

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104 Faculty of Health Sciences Correlated data More about LMMs Lene Theil Skovgaard December 4, 2015 1 / 104 Further topics Model check and diagnostics Cross-over studies Paired T-tests with missing values

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

Models for binary data

Models for binary data Faculty of Health Sciences Models for binary data Analysis of repeated measurements 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 63 Program for

More information

Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars)

Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars) STAT:5201 Applied Statistic II Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars) School math achievement scores The data file consists of 7185 students

More information

Introduction to Random Effects of Time and Model Estimation

Introduction to Random Effects of Time and Model Estimation Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =

More information

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Today s Class: The Big Picture ACS models using the R matrix only Introducing the G, Z, and V matrices ACS models

More information

Variance component models

Variance component models Faculty of Health Sciences Variance component models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Topics for

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 29, 2016 One-way anova with random variation The rabbit example Hierarchical

More information

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1 CLDP 944 Example 3a page 1 From Between-Person to Within-Person Models for Longitudinal Data The models for this example come from Hoffman (2015) chapter 3 example 3a. We will be examining the extent to

More information

Describing Change over Time: Adding Linear Trends

Describing Change over Time: Adding Linear Trends Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

More about linear mixed models

More about linear mixed models Faculty of Health Sciences Contents More about linear mixed models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples

More information

STAT 5200 Handout #23. Repeated Measures Example (Ch. 16)

STAT 5200 Handout #23. Repeated Measures Example (Ch. 16) Motivating Example: Glucose STAT 500 Handout #3 Repeated Measures Example (Ch. 16) An experiment is conducted to evaluate the effects of three diets on the serum glucose levels of human subjects. Twelve

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1 Psyc 945 Example page Example : Unconditional Models for Change in Number Match 3 Response Time (complete data, syntax, and output available for SAS, SPSS, and STATA electronically) These data come from

More information

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED.

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Maribeth Johnson, Medical College of Georgia, Augusta, GA ABSTRACT Longitudinal data refers to datasets with multiple measurements

More information

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad 1 Supplemental Materials Graphing Values for Individual Dyad Members over Time In the main text, we recommend graphing physiological values for individual dyad members over time to aid in the decision

More information

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016 Faculty of Health Sciences Correlated data Count variables Lene Theil Skovgaard & Julie Lyng Forman December 6, 2016 1 / 76 Modeling count outcomes Outline The Poisson distribution for counts Poisson models,

More information

Analysis of variance and regression. November 22, 2007

Analysis of variance and regression. November 22, 2007 Analysis of variance and regression November 22, 2007 Parametrisations: Choice of parameters Comparison of models Test for linearity Linear splines Lene Theil Skovgaard, Dept. of Biostatistics, Institute

More information

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA Paper 188-29 Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA ABSTRACT PROC MIXED provides a very flexible environment in which to model many types

More information

ANOVA Longitudinal Models for the Practice Effects Data: via GLM

ANOVA Longitudinal Models for the Practice Effects Data: via GLM Psyc 943 Lecture 25 page 1 ANOVA Longitudinal Models for the Practice Effects Data: via GLM Model 1. Saturated Means Model for Session, E-only Variances Model (BP) Variances Model: NO correlation, EQUAL

More information

Describing Within-Person Change over Time

Describing Within-Person Change over Time Describing Within-Person Change over Time Topics: Multilevel modeling notation and terminology Fixed and random effects of linear time Predicted variances and covariances from random slopes Dependency

More information

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology Serial Correlation Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 017 Model for Level 1 Residuals There are three sources

More information

Repeated Measures Data

Repeated Measures Data Repeated Measures Data Mixed Models Lecture Notes By Dr. Hanford page 1 Data where subjects are measured repeatedly over time - predetermined intervals (weekly) - uncontrolled variable intervals between

More information

Chapter 3 ANALYSIS OF RESPONSE PROFILES

Chapter 3 ANALYSIS OF RESPONSE PROFILES Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over

More information

Review of Unconditional Multilevel Models for Longitudinal Data

Review of Unconditional Multilevel Models for Longitudinal Data Review of Unconditional Multilevel Models for Longitudinal Data Topics: Course (and MLM) overview Concepts in longitudinal multilevel modeling Model comparisons and significance testing Describing within-person

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

Covariance Structure Approach to Within-Cases

Covariance Structure Approach to Within-Cases Covariance Structure Approach to Within-Cases Remember how the data file grapefruit1.data looks: Store sales1 sales2 sales3 1 62.1 61.3 60.8 2 58.2 57.9 55.1 3 51.6 49.2 46.2 4 53.7 51.5 48.3 5 61.4 58.7

More information

Lab 11. Multilevel Models. Description of Data

Lab 11. Multilevel Models. Description of Data Lab 11 Multilevel Models Henian Chen, M.D., Ph.D. Description of Data MULTILEVEL.TXT is clustered data for 386 women distributed across 40 groups. ID: 386 women, id from 1 to 386, individual level (level

More information

Independence (Null) Baseline Model: Item means and variances, but NO covariances

Independence (Null) Baseline Model: Item means and variances, but NO covariances CFA Example Using Forgiveness of Situations (N = 1103) using SAS MIXED (See Example 4 for corresponding Mplus syntax and output) SAS Code to Read in Mplus Data: * Import data from Mplus, becomes var1-var23

More information

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study 1.4 0.0-6 7 8 9 10 11 12 13 14 15 16 17 18 19 age Model 1: A simple broken stick model with knot at 14 fit with

More information

Repeated Measures Design. Advertising Sales Example

Repeated Measures Design. Advertising Sales Example STAT:5201 Anaylsis/Applied Statistic II Repeated Measures Design Advertising Sales Example A company is interested in comparing the success of two different advertising campaigns. It has 10 test markets,

More information

Review of Multilevel Models for Longitudinal Data

Review of Multilevel Models for Longitudinal Data Review of Multilevel Models for Longitudinal Data Topics: Concepts in longitudinal multilevel modeling Describing within-person fluctuation using ACS models Describing within-person change using random

More information

Model Assumptions; Predicting Heterogeneity of Variance

Model Assumptions; Predicting Heterogeneity of Variance Model Assumptions; Predicting Heterogeneity of Variance Today s topics: Model assumptions Normality Constant variance Predicting heterogeneity of variance CLP 945: Lecture 6 1 Checking for Violations of

More information

Chapter 9. Multivariate and Within-cases Analysis. 9.1 Multivariate Analysis of Variance

Chapter 9. Multivariate and Within-cases Analysis. 9.1 Multivariate Analysis of Variance Chapter 9 Multivariate and Within-cases Analysis 9.1 Multivariate Analysis of Variance Multivariate means more than one response variable at once. Why do it? Primarily because if you do parallel analyses

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages

More information

Random Intercept Models

Random Intercept Models Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Cross-over Designs #: DESIGNING CLINICAL RESEARCH The subtraction of measurements from the same subject will mostly cancel or minimize effects

More information

Correlated data. Non-normal outcomes. Reminder on binary data. Non-normal data. Faculty of Health Sciences. Non-normal outcomes

Correlated data. Non-normal outcomes. Reminder on binary data. Non-normal data. Faculty of Health Sciences. Non-normal outcomes Faculty of Health Sciences Non-normal outcomes Correlated data Non-normal outcomes Lene Theil Skovgaard December 5, 2014 Generalized linear models Generalized linear mixed models Population average models

More information

Correlated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models

Correlated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models Faculty of Health Sciences Overview Correlated data More on variance component models Variance component models, II Cross-over studies Non-normal data Comparing measurement devices Lene Theil Skovgaard

More information

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Here we provide syntax for fitting the lower-level mediation model using the MIXED procedure in SAS as well as a sas macro, IndTest.sas

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

36-402/608 Homework #10 Solutions 4/1

36-402/608 Homework #10 Solutions 4/1 36-402/608 Homework #10 Solutions 4/1 1. Fixing Breakout 17 (60 points) You must use SAS for this problem! Modify the code in wallaby.sas to load the wallaby data and to create a new outcome in the form

More information

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual change across time 2. MRM more flexible in terms of repeated

More information

Parametrisations, splines

Parametrisations, splines / 7 Parametrisations, splines Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression_2 Marc Andersen, mja@statgroup.dk Analysis of variance and regression for health researchers,

More information

Estimation: Problems & Solutions

Estimation: Problems & Solutions Estimation: Problems & Solutions Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline 1. Introduction: Estimation of

More information

Statistical Inference: The Marginal Model

Statistical Inference: The Marginal Model Statistical Inference: The Marginal Model Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Inference for fixed

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE Biostatistics Workshop 2008 Longitudinal Data Analysis Session 4 GARRETT FITZMAURICE Harvard University 1 LINEAR MIXED EFFECTS MODELS Motivating Example: Influence of Menarche on Changes in Body Fat Prospective

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA Topics: Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA What are MI and DIF? Testing measurement invariance in CFA Testing differential item functioning in IRT/IFA

More information

Introduction to Crossover Trials

Introduction to Crossover Trials Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain

More information

Analyzing the Behavior of Rats by Repeated Measurements

Analyzing the Behavior of Rats by Repeated Measurements Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics 5-3-007 Analyzing the Behavior of Rats by Repeated Measurements Kenita A. Hall

More information

Estimation: Problems & Solutions

Estimation: Problems & Solutions Estimation: Problems & Solutions Edps/Psych/Soc 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline 1. Introduction: Estimation

More information

A Re-Introduction to General Linear Models

A Re-Introduction to General Linear Models A Re-Introduction to General Linear Models Today s Class: Big picture overview Why we are using restricted maximum likelihood within MIXED instead of least squares within GLM Linear model interpretation

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Multilevel Methodology

Multilevel Methodology Multilevel Methodology Geert Molenberghs Interuniversity Institute for Biostatistics and statistical Bioinformatics Universiteit Hasselt, Belgium geert.molenberghs@uhasselt.be www.censtat.uhasselt.be Katholieke

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

VIII. ANCOVA. A. Introduction

VIII. ANCOVA. A. Introduction VIII. ANCOVA A. Introduction In most experiments and observational studies, additional information on each experimental unit is available, information besides the factors under direct control or of interest.

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials CHL 55 H Crossover Trials The Two-sequence, Two-Treatment, Two-period Crossover Trial Definition A trial in which patients are randomly allocated to one of two sequences of treatments (either 1 then, or

More information

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D. Designing Multilevel Models Using SPSS 11.5 Mixed Model John Painter, Ph.D. Jordan Institute for Families School of Social Work University of North Carolina at Chapel Hill 1 Creating Multilevel Models

More information

STK4900/ Lecture 10. Program

STK4900/ Lecture 10. Program STK4900/9900 - Lecture 10 Program 1. Repeated measures and longitudinal data 2. Simple analysis approaches 3. Random effects models 4. Generalized estimating equations (GEE) 5. GEE for binary data (and

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: Summary of building unconditional models for time Missing predictors in MLM Effects of time-invariant predictors Fixed, systematically varying,

More information

Longitudinal data: simple univariate methods of analysis

Longitudinal data: simple univariate methods of analysis Longitudinal data: simple univariate methods of analysis Danish version by Henrik Stryhn, June 1996 Department of Mathematcis and Physics, KVL Translation (and rewriting) by Ib Skovgaard, March 1998 (the

More information

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models

More information

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA Faculty of Health Sciences Outline Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA Lene Theil Skovgaard Sept. 14, 2015 Paired comparisons: tests and confidence intervals

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Paper S02-2007 RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Eli Lilly & Company, Indianapolis, IN ABSTRACT

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Mixed effects models - II Henrik Madsen, Jan Kloppenborg Møller, Anders Nielsen April 16, 2012 H. Madsen, JK. Møller, A. Nielsen () Chapman & Hall

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information