Models for longitudinal data

Size: px
Start display at page:

Download "Models for longitudinal data"

Transcription

1 Faculty of Health Sciences Contents Models for longitudinal data Analysis of repeated measurements, NFA 016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Longitudinal designs Models for the mean Models for the covariance Linear growth model (aka random regression) Suggested reading: FLW chapters 6, 7, and 8. / 67 Outline Typical set-up for longitudinal measurements Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data Two or more groups of subjects E.g. two different treatments, possibly randomized. Repeated measurements over time for each subject. calender time / age / duration of treatment planned or ad hoc times of measurement. ATT: statistical results may be biased unless we account for correlation between measurements on the same subject. 3 / 67 4 / 67

2 Pros and cons of longitudinal designs Longitudinal vs cross sectional effect Example: Reading ability, as a function of age and cohort: Pros Thay have much more power for detecting changes over time. (data are paired with the subject as its own control) We may discover that subjects have different time courses In designs with only cross-sectional data, this may also be the case, but we have no way of knowing! Cons Analysis by traditional ANOVA or GLM-models is infeasible because the independence assumption is violated. We need to model both the mean and the covariance / 67 6 / 67 Characterizing your longitudinal study Study types Type of study: Randomized or observational? Time schedule: Fixed or ad hoc observations? Type of data: Continuous, binary, coundt, ordinal, or categorical? Data structure: Wide or long format? Randomized: One homogeneous population is studied. Randomization to two (or more) treatment groups One or more follow-up measurements + usually a measurement at baseline. Observational: Two (or more) populations are studied. E.g. men and women or two different groups of patients A well defined starting point (e.g. diagnosis). Sample size: How many subjects? How many time points? 7 / 67 8 / 67

3 Observation schedules Fixed time points (balanced design): Measurements collected at prespecified time points. Equidistant: 5, 10, 15, 0 minutes, or every month. Non-equidistant: 5, 10, 0, and 60 minutes. Ad hoc time points (unbalanced design): Subejcts were seen e.g. when the doctor decided or when the patient felt the need. Beware of selection bias. What are the time points representative of? Case: Calcium supplements Randomized study: including 11 girls at age 11. Treatment: calcium supplement or placebo. Outcome: BMD=bone mineral density, in g/cm 3 Planned follow-up: every 6 months in two years 5 visits in total including baseline. Does calcium increase bone gain in adolescent women? BUT: In practice, designs planned to be balanced often turn out more or less unbalanced... 9 / / 67 Time points in the calcium study Visit as time line (ignore deviations from time schedule) visit: Number 1,, 3, 4, or 5. time: Scheduled follow-up 0, 1/, 1, 1 1/, years. obstime: Actual time of follow-up (individual). Analysis Variable : obstime time NObs Mean Std Dev Minimum Maximum / 67 1 / 67

4 Repetition: Analysis of response profiles Comparison of change over n time points (visits) within g groups (treatment) of subjects. Similar to two-way ANOVA, only with correlated data. An unstructured covariance is assumed. Drawbacks: Can only handle balanced designs. Not good with many groups or time points, due to having too many model parameters. Do not make use of a priori known data patterns, e.g. correlation decreasing with time. monotone growth. Today: Parametric models for the mean and the covariance. 13 / 67 Repetition: Analysis of response profiles Note: Do baseline adjustment since treatment was randomized. title1 Response profiles (clmm) ; data calcium; set calcium; treat = grp; if visit EQ 1 then treat = P ; run; proc mixed data=calcium plots=all; class grp girl time (ref= 0 ) treat (ref= P ); model bmd = time treat*time / ddfm=kr solution cl; repeated time / type=un subject=girl; run; 14 / 67 Repetition: Analysis of response profiles Conclusion Solution for Fixed Effects Standard Effect treat time Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < time < time < time < time < time time*treat C time*treat P time*treat C time*treat P time*treat C time*treat P time*treat C time*treat P time*treat P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time <.0001 time*treat Mean gain in BMD (g/cm 3 ) since baseline with calcium supplement or placebo years Calcium group Placebo group Difference 1/ 0.07 (0.00;0.033) 0.00 (0.014;0.07) (0.000;0.013) (0.047;0.065) (0.035;0.054) 0.01 (0.00;0.01) 1 1/ (0.07;0.094) (0.060;0.08) 0.01 (0.001;0.03) (0.094;0.118) (0.075;0.099) (0.006;0.031) We see a significantly higher gain in BMD with calcium at last follow-up (P=0.003) 15 / / 67

5 Outline Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data 17 / 67 Models for the mean Changes over time usually appear gradually and often following a distinct pattern. We gain power by incorporating this in our models. We want to estimate key parameters such as growth rates. Model the mean as a continuous function of time: Linear Exponential (log-linear) Piecewise linear (linear spline) Flexible (polynomium, smoothing spline, etc) Nonlinear (cyclic, dose-response, etc) (many possibilities, not all treated in this course). 18 / 67 Examples of mean curves Calcium: estimated response profiles 19 / 67 0 / 67 Looks as mean BMD increases linearly with time.

6 Calcium: linear model To fit at linear model for the mean, use the continuous variable time (scheduled visit in years since baseline). title1 Linear trend ; proc mixed data=calcium method=ml plots=all; class grp (ref= P ) girl visit; model bmd = time grp*time / ddfm=kr solution cl; repeated visit / type=un subject=girl r rcorr; run; 1 / 67 Main effect of grp omitted (no difference at baseline). Use the categorical variable visit to specify the unstructured model for the covariance. Use the method=ml option for model comparisons. Calcium: Estimates Fit Statistics - Log Likelihood AIC (Smaller is Better) AICC (Smaller is Better) BIC (Smaller is Better) Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < time < time*grp C time*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time <.0001 time*grp Extra increase in BMD with calcium of g/cm 3 per year, (95%CI: 0.006;0.0151, / 67 P=0.006). Calcium: Estimated response profiles Comparison of models for the mean Goodness of fit is measured by likelihood. Better fitting models have large values of likelihood and therefore small values of deviance: - log Likelihood. ATT: Use the method=ml-option in proc mixed. Compute the difference in deviances (called - log Q) and compare to a χ -distribution with df= no. params. Only nested models can be compared this way. Example: linear trend vs unrestricted response profiles. log Q = = 13.7 χ (9 3) = χ (6) P = BUT: Is linear evolution at all plausible? 3 / 67 4 / 67

7 Technical note on likelihood-types When testing a submodel for the mean, the deviances of the compared models must be computed with the full (or conventional) likelihood proc mixed data=calcium method=ml; not the residual likelihood (reml) which is default in mixed. Residuals for linear trend model But don t forget: Most hypothesis about the mean can be tested using just the default F-tests (optimal choice). proc mixed data=calcium; class grp girl visit treat; model bmd = time grp*time treat*visit/ ddfm=kr solution; repeated visit / type=un subject=girl; run; Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time 0... time*grp 0... visit*treat / 67 6 / 67 Deviation from linearity is not that pronounced. Outline The unstructured covariance Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data 7 / 67 So far we have made no assumptions about covariance. Advantages We make no wrong assumptions about the covariance of our observations. No need to think more about them. We gain insight in the actual structure of the covariance by looking at the estimates. Drawbacks 8 / 67 We use quite a lot of parameters to describe the covariance structure. Thus our analysis becomes less efficient. No good with small data sets; The results may be unstable. It can only be used in case of balanced data, i.e. all subjects have to be measured at identical times.

8 To get a view of the covariance Output for unstructured covariance To get an unbiased view of the covariance we must ensure that out model for the mean is correct. We can use an unrestricted model as a starting point. Now we want to see the estimated covariances and correlations, so we use the r and rcorr options in proc mixed: repeated visit / type=un subject=girl r rcorr; Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col / / 67 Models for the covariance Stationary covariance patterns Most often covariance display distinct features. E.g. decreasing correlation with increasing time span between observations as in the calcium data. We gain power by incorporating these features in our model. Possibilities: Unstructured covariance (lecture 1) Covariance pattern models (lecture ) Variance components / random effects (lecture 3) A huge selection is available with proc mixed. Large selection of models for equidistant observations. Assumption: variances and correlations are stationary: The variance is constant over time. Correlation depend only on the time-distance between the observations not the specific times of measurements. Examples: compound symmetry, autoregressive, autoregressive moving average, and the Toeplitz models. proc mixed type Cov(Y ij, Y ik ) no.par CS σ [I {j = k} + ρ I {j k}] AR(1) σ ρ k j ARMA(1,1) σ [I {j = k} + γ ρ k j 1 I {j k}] 3 TOEP σ [I {j = k} + ρ k j I {j k}] n 31 / 67 3 / 67

9 Examples of stationary correlations Example: Compound symmetry Also called exchangeable covariance (more on this in lecture 3) The variance σ is the same at all time points. All paired observations are correlated with the same correlation ρ, regardles of the specific time points. The 5x5 covariance and correlation matrices are Cov = σ σ ρ σ ρ σ ρ σ ρ σ ρ σ σ ρ σ ρ σ ρ σ ρ σ ρ σ σ ρ σ ρ σ ρ σ ρ σ ρ σ σ ρ σ ρ σ ρ σ ρ σ ρ σ Cor = 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 ρ ρ ρ ρ ρ 1 In proc mixed we write: repeated visit / type=cs subject=girl r rcorr; 33 / / 67 Output for Compound Symmetry Example: AR(1)-covariance matrix Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col Covariance Parameter Estimates Cov Parm Subject Estimate CS girl Residual Note: Cov Parms are not σ = and ρ = (lecture 35 / 67 3). The so-called autoregressive covariance structure has 36 / 67 Constant variance σ over time. Correlation decreasing exponentially with the distance between the observations, Cor(Y ij, Y ik ) = ρ k j The 5x5 covariance and correlation matrices are: Cov = σ σ ρ σ ρ σ ρ 3 σ ρ 4 σ ρ σ σ ρ σ ρ σ ρ 3 σ ρ σ ρ σ σ ρ σ ρ σ ρ 3 σ ρ σ ρ σ σ ρ σ ρ 4 σ ρ 3 σ ρ σ ρ σ In proc mixed we write: Co4 = repeated visit / type=ar(1) subject=girl r rcorr; 1 ρ ρ ρ ρ 1 ρ ρ ρ ρ 1 ρ ρ 3 ρ ρ 1 ρ 4 ρ 3 ρ ρ

10 Output from TYPE=AR(1) structure Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col Covariance Parameter Estimates Cov Parm Subject Estimate AR(1) girl Residual Here σ 37 / 67 = and ρ = Heterogeneous covariance patterns The assumption that the variance does not change with time can be dropped when assuming a heterogeneous covariance pattern. No restrictions on the variances σ 1, σ,... Correlations are assumed to be stationary; They depend only on the time-distance between observations not the specific time points. Examples: the heterogeneous compound symmetry, heterogeneous autoregressive, the heterogeneous Toeplitz, and the antedependence covariance sturctures. 38 / 67 proc mixed type Cov(Y ij, Y ik ) no.par CSH σ j σ k [I {j = k} + ρ I {j k}] n + 1 ARH(1) σ j σ k ρ k j n + 1 TOEPH σ j σ k [I {j = k} + ρ k j I {j k}] n 1 k 1 ANTE(1) σ j σ k l=j ρ l n 1 Output ARH(1) structure Comparison of covariance structures Estimated R Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated R Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col Covariance Parameter Estimates CovParm Subject Estimate Var(1) girl Var() girl Var(3) girl Var(4) girl Var(5) girl ARH(1) girl Model - log L par. log Q df P-value UN ARH(1) AR(1) CS < For comparison of covariance patterns use either of the two likelihood-types (ml or reml), just don t compare one to the other. Estimates of σ1, σ,..., σ 39 / 67 5, and ρ. 40 / 67

11 Predicted response profiles Tests of treatment effect BUT: Confidence intervals and tests depend on the covariance. Tests of treatment effect at last follow-up Covariance pattern Esitmate (95% CI) P-value Assumed independence (0.000;0.0578) Compound symmetry (0.0110;0.083) < Autoregressive (0.0078;0.034) Heterogeneous autoregr (0.0073;0.0315) The estimated response profiles are almost identical for all our choices of covariance patterns -!!! 41 / 67 If data was balanced and complete we would get perfect agreement (all would be equal to the treat*time-averages). Unstructured (0.0065;0.0314) / 67 Modeling strategy As suggested by Fitzmaurice et al. (011): 1. Put up a plausible (e.g. saturated) model for the mean. Fit the data so far ignoring correlation (GLM). 3. Check the residuals for assessing the adequacy of the model for the mean and in order to get an impression of the error covariance. 4. Pick a reasonable model for the covariance (if possible test against the unstructured model). 5. Re-check the model fit. 6. Do the analysis. Note: Lecture 4 on model diagnostics among others. Alternative: Robust standard errors Goodness of fit test for the covariance might select an incorrect model due to lack of power. If data is balanced and complete estimates are unbiased but the standard errors may be biased. Instead one could use the robust sandwich covariance estimator: proc mixed empirical data=calcium; However: The sandwich covariance estimator... is useless in small samples because it is anti-conservative. needs additional weighting to get unbiased estimates when there are missing data. 43 / 67 (More 44 / 67 about sandwich covariance estimators in lectures 5 and 6.)

12 Outline Many time points and few subjects Longitudinal designs In this situation choosing a reliable model for the covariance is just about impossible. Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data Unstructured covariance has too many parameters. Compound symmetry underestimates correlation between observations close in time and overestimates correlation between observations far apart in time. We have no crystal ball for choosing a simple yet correct covariance pattern. Robust standard errors are anti-conservative. So what can we do? 45 / / 67 Reduction to independent data Case: Calcium supplements By analyzing carefully chosen characteristics for each individual we can resolve to simple analyses which have no repeated measurements issues. Maybe not optimal, but feasible and easy! Examples of useful summary statistics: The changes from baseline to endpoint The slopes for individual time effects The area under the curve (AUC) The time to peak or peak value Note: Comparing measurements for each time point in turn is not recommended without adjustment for multiple testing. 47 / 67 The overall time course looks reasonably linear, but maybe the girls have individual growth rates. 48 / 67

13 Individual regression Fit an ordinary linear regression for each girl: Y ij = A i + B i t j + ε ij, ε ij N (0, σ ) Analysis of summary statistics Comparison of individual intercepts and slopes: Just using the two-sample t-test (for independent data). Group Level at baseline (g/cm 3 ) Slope (g/cm 3 per year) P (0.8515;0.8880) (0.0361;0.046) C (0.8650;0.8985) (0.0438;0.0556) Difference (-0.015;0.0365) (0.0009;0.016) P-value Are 49 / 67 the slopes of the Calcium group systematically bigger? 50 / 67 Limitations of summary statistics Outline No baseline adjustment, hence less power. Longitudinal designs Some of the girls in the calcium study dropped out: We get less accurate slope-estimates from girls with few observations. No slope at all if drop out was rigth after baseline. And maybe those with low BMD are more likely to drop out; They think they need supplement but might have got the placebo. Can we make better use of the full data? Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data 51 / 67 5 / 67

14 Random regression We let each girl have her own level A i and her own slope B i We assume these individual parameters (A i and B i ) follow a bivariate normal distribution in the population ( Ai B i ) N (( αg(i) β g(i) The covariance is the so-called G-matrix. 53 / 67 ), ( τ a ω ab ω ab τ b )) G describes the population variance of the regression parameters, the inter-individual variation. Note the correlation ρ ab = ω ab τ a τ b ; Subjects with lower baseline might tend to also have smaller (or larger) growth rates. PROC MIXED: random regression proc mixed data=calcium plots=all; class grp girl; model bmd = time grp*time / ddfm=kr solution cl; random intercept time / type=un subject=girl g v vcorr; run; Individual intercepts and slopes are so-called random effects. They must be specified in a random-statement. Note: The intercept refers to time=0 (baseline). Here type=un refers to a unstructured specification of the G-matrix. If it is omitted, we may experience convergence problems and sometimes totally incomprehensible results. Option g prints the estimated G-matrix. (More about random effects in lecture 3). 54 / 67 Output from random regression Output from random regression Estimated G Matrix Row Effect girl Col1 Col 1 Intercept time Covariance Parameter Estimates Cov Parm Subject Estimate UN(1,1) girl UN(,1) girl UN(,) girl Residual Fit Statistics - Res Log Likelihood AIC (smaller is better) AICC (smaller is better) BIC (smaller is better) Note: ρ ab = = 0.1. Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < time < time*grp C time*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F time <.0001 time*grp We find an extra increase in BMD of g/cm 3 per year, with calcium supplement, (95% CI: , P=0.0063). 55 / / 67

15 Implied covariance The random regression implies the covariance pattern: Cov(Y ij, Y ik ) = τa + (t j + t k )ω ab + t j t k τb as a function of the G-matrix parameters and the time points. Does this fit the data well? Random regression vs UN (linear trend for the mean) log Q = = 50.5 χ (15 3) = χ (1) P = No, appearantly not. 57 / 67 Implied covariance Options v and vcorr prints the covariance and correlation. Estimated V Matrix for girl 101 Row Col1 Col Col3 Col4 Col Estimated V Correlation Matrix for girl 101 Row Col1 Col Col3 Col4 Col5 58 / Outline Nonequidistant time points Longitudinal designs Models for the mean Covariance pattern models Analysis of summary statistics The linear growth model (random regression) Unbalanced data In the calcium study the girls are only seen approximately twice a year. Perhaps we get better estimates of the slopes when replacing planned time of visit with the actual individual times? But we loose the option of an unstructured covariance. Other covariance patterns can still be fitted e.g. the autoregressive pattern, the random regression model, and the compound symmetry pattern. 59 / / 67

16 BMD vs actual time of visit Non-equidistant observations Only a limited number of covariance patterns are available in case time points are individual or non-equidistant. They are all stationary: The variance is constant over time. The correlation depend only on the time-distance between the observations, not the specific time points. The ctime-variable must be a numerical variable in SAS proc mixed Cov(Y ij, Y ik ) no. type= param CS σ [I {j = k} + ρ I {j k}] SP(POW)(ctime) σ ρ t ij t ik SP(GAU)(ctime) σ e t ij t ik /γ SP(LIN)(ctime) σ (1 ρ t k t j ) I {ρ t k t j 1}] 61 / 67 6 / 67 Random regression in observed time Random regression, using actual duration Estimated G Matrix Row Effect girl Col1 Col Syntax is exactly the same as with scheduled time points, only the name of the time-varible changes. title1 random regression (actual time) ; proc mixed data=calcium plots=all; class grp girl; model bmd = obstime grp*obstime / ddfm=kr solution cl; random intercept obstime / type=un subject=girl g; run; 1 Intercept obstime Solution for Fixed Effects Standard Effect grp Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < obstime < obstime*grp C obstime*grp P Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F obstime <.0001 obstime*grp / 67 We find an extra increase in BMD of g/cm 3 per year with 64 / 67 calcium supplement, 95% Ci: to , P=

17 Tests of treatment effect Scheduled vs observed times Comparison of estimates for different covariance structures: Covariance pattern Estimate (95% CI) P-value Assumed independence (0.0071;0.07) Compound symmetry (0.0053;0.0138) < Autoregressive (0.0038;0.016) Random regression (0.006;0.0149) Estimates and tests depend on the covariance! Estimated slopes from the two random regressions: Group Scheduled time Actual time P (0.0405;0.0493) (0.0411;0.0497) C (0.0493;0.058) (0.0498;0.0585) Difference (0.006; ) (0.006;0.0149) P-value Slightly steeper mean slopes in both groups with actual times. Slightly smaller standard errors with actual times. This would have been more pronounced if actual times of visit deviated more from the scheduled times. 65 / / 67 Concluding remarks Results depend on choice of covariance pattern Obvious bias for unrealistic models (independence, CS). More similar results for the more complex models. Not much impact of using exact times of measurement instead of planned times (visit) - WHY? Gain in modeling flexibility by rounding the times. There are sophisticated statistical arguments implying that rounding to the nearest scheduled time do not cause bias (Berkson error) But it increases variance. Choosing an apropriate model is a compromise between practical feasibility, realistic model assumptions, and interpretable results. 67 / 67

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of

More information

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences

Correlated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences Faculty of Health Sciences Longitudinal data Correlated data Longitudinal measurements Outline Designs Models for the mean Covariance patterns Lene Theil Skovgaard November 27, 2015 Random regression Baseline

More information

Analysis of variance and regression. May 13, 2008

Analysis of variance and regression. May 13, 2008 Analysis of variance and regression May 13, 2008 Repeated measurements over time Presentation of data Traditional ways of analysis Variance component model (the dogs revisited) Random regression Baseline

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 27, 2018 1 / 84 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models One-way anova with random variation The rabbit example Hierarchical models with several levels Random regression Lene Theil

More information

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.

Faculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman. Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 1 / 96 Overview One-way anova with random variation The rabbit example Hierarchical

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 One-way anova with random variation The rabbit example Hierarchical

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen Outline Data in wide and long format

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Variance components and LMMs Analysis of repeated measurements, 4th December 2014 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Topics for today Variance components and LMMs Analysis of repeated measurements, 4th December 04 Leftover from 8/: Rest of random regression example. New concepts for today:

More information

Answer to exercise: Blood pressure lowering drugs

Answer to exercise: Blood pressure lowering drugs Answer to exercise: Blood pressure lowering drugs The data set bloodpressure.txt contains data from a cross-over trial, involving three different formulations of a drug for lowering of blood pressure:

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen 2 / 28 Preparing data for analysis The

More information

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */ CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

More information

SAS Syntax and Output for Data Manipulation:

SAS Syntax and Output for Data Manipulation: CLP 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (2015) chapter 5. We will be examining the extent

More information

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard Faculty of Health Sciences Linear mixed models Analysis of repeated measurements, 10th March 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 80 Program

More information

Variance component models part I

Variance component models part I Faculty of Health Sciences Variance component models part I Analysis of repeated measurements, 30th November 2012 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Today s Class: The Big Picture ACS models using the R matrix only Introducing the G, Z, and V matrices ACS models

More information

More about linear mixed models

More about linear mixed models Faculty of Health Sciences Contents More about linear mixed models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Models for binary data

Models for binary data Faculty of Health Sciences Models for binary data Analysis of repeated measurements 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 63 Program for

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

Statistics for exp. medical researchers Regression and Correlation

Statistics for exp. medical researchers Regression and Correlation Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence

More information

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf

More information

Introduction to Random Effects of Time and Model Estimation

Introduction to Random Effects of Time and Model Estimation Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =

More information

Model Assumptions; Predicting Heterogeneity of Variance

Model Assumptions; Predicting Heterogeneity of Variance Model Assumptions; Predicting Heterogeneity of Variance Today s topics: Model assumptions Normality Constant variance Predicting heterogeneity of variance CLP 945: Lecture 6 1 Checking for Violations of

More information

Review of Unconditional Multilevel Models for Longitudinal Data

Review of Unconditional Multilevel Models for Longitudinal Data Review of Unconditional Multilevel Models for Longitudinal Data Topics: Course (and MLM) overview Concepts in longitudinal multilevel modeling Model comparisons and significance testing Describing within-person

More information

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA Paper 188-29 Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA ABSTRACT PROC MIXED provides a very flexible environment in which to model many types

More information

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015

Linear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015 university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s Program Faculty of Health Sciences Topics: Linear mixed models

More information

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104

Faculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104 Faculty of Health Sciences Correlated data More about LMMs Lene Theil Skovgaard December 4, 2015 1 / 104 Further topics Model check and diagnostics Cross-over studies Paired T-tests with missing values

More information

Variance component models

Variance component models Faculty of Health Sciences Variance component models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Topics for

More information

Review of Multilevel Models for Longitudinal Data

Review of Multilevel Models for Longitudinal Data Review of Multilevel Models for Longitudinal Data Topics: Concepts in longitudinal multilevel modeling Describing within-person fluctuation using ACS models Describing within-person change using random

More information

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1 CLDP 944 Example 3a page 1 From Between-Person to Within-Person Models for Longitudinal Data The models for this example come from Hoffman (2015) chapter 3 example 3a. We will be examining the extent to

More information

Repeated Measures Data

Repeated Measures Data Repeated Measures Data Mixed Models Lecture Notes By Dr. Hanford page 1 Data where subjects are measured repeatedly over time - predetermined intervals (weekly) - uncontrolled variable intervals between

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Multi-factor analysis of variance

Multi-factor analysis of variance Faculty of Health Sciences Outline Multi-factor analysis of variance Basic statistics for experimental researchers 2015 Two-way ANOVA and interaction Mathed samples ANOVA Random vs systematic variation

More information

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology Serial Correlation Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 017 Model for Level 1 Residuals There are three sources

More information

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1

SAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1 Psyc 945 Example page Example : Unconditional Models for Change in Number Match 3 Response Time (complete data, syntax, and output available for SAS, SPSS, and STATA electronically) These data come from

More information

ANOVA Longitudinal Models for the Practice Effects Data: via GLM

ANOVA Longitudinal Models for the Practice Effects Data: via GLM Psyc 943 Lecture 25 page 1 ANOVA Longitudinal Models for the Practice Effects Data: via GLM Model 1. Saturated Means Model for Session, E-only Variances Model (BP) Variances Model: NO correlation, EQUAL

More information

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE Biostatistics Workshop 2008 Longitudinal Data Analysis Session 4 GARRETT FITZMAURICE Harvard University 1 LINEAR MIXED EFFECTS MODELS Motivating Example: Influence of Menarche on Changes in Body Fat Prospective

More information

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models

More information

Describing Change over Time: Adding Linear Trends

Describing Change over Time: Adding Linear Trends Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Analysis of variance and regression. December 4, 2007

Analysis of variance and regression. December 4, 2007 Analysis of variance and regression December 4, 2007 Variance component models Variance components One-way anova with random variation estimation interpretations Two-way anova with random variation Crossed

More information

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED.

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Maribeth Johnson, Medical College of Georgia, Augusta, GA ABSTRACT Longitudinal data refers to datasets with multiple measurements

More information

STAT 5200 Handout #23. Repeated Measures Example (Ch. 16)

STAT 5200 Handout #23. Repeated Measures Example (Ch. 16) Motivating Example: Glucose STAT 500 Handout #3 Repeated Measures Example (Ch. 16) An experiment is conducted to evaluate the effects of three diets on the serum glucose levels of human subjects. Twelve

More information

Analyzing the Behavior of Rats by Repeated Measurements

Analyzing the Behavior of Rats by Repeated Measurements Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics 5-3-007 Analyzing the Behavior of Rats by Repeated Measurements Kenita A. Hall

More information

Topic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects

Topic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects Topic 5 - One-Way Random Effects Models One-way Random effects Outline Model Variance component estimation - Fall 013 Confidence intervals Topic 5 Random Effects vs Fixed Effects Consider factor with numerous

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages

More information

Describing Within-Person Change over Time

Describing Within-Person Change over Time Describing Within-Person Change over Time Topics: Multilevel modeling notation and terminology Fixed and random effects of linear time Predicted variances and covariances from random slopes Dependency

More information

STAT 705 Generalized linear mixed models

STAT 705 Generalized linear mixed models STAT 705 Generalized linear mixed models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 24 Generalized Linear Mixed Models We have considered random

More information

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016

Faculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016 Faculty of Health Sciences Correlated data Count variables Lene Theil Skovgaard & Julie Lyng Forman December 6, 2016 1 / 76 Modeling count outcomes Outline The Poisson distribution for counts Poisson models,

More information

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study 1.4 0.0-6 7 8 9 10 11 12 13 14 15 16 17 18 19 age Model 1: A simple broken stick model with knot at 14 fit with

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Modeling the Covariance

Modeling the Covariance Modeling the Covariance Jamie Monogan University of Georgia February 3, 2016 Jamie Monogan (UGA) Modeling the Covariance February 3, 2016 1 / 16 Objectives By the end of this meeting, participants should

More information

POWER ANALYSIS TO DETERMINE THE IMPORTANCE OF COVARIANCE STRUCTURE CHOICE IN MIXED MODEL REPEATED MEASURES ANOVA

POWER ANALYSIS TO DETERMINE THE IMPORTANCE OF COVARIANCE STRUCTURE CHOICE IN MIXED MODEL REPEATED MEASURES ANOVA POWER ANALYSIS TO DETERMINE THE IMPORTANCE OF COVARIANCE STRUCTURE CHOICE IN MIXED MODEL REPEATED MEASURES ANOVA A Thesis Submitted to the Graduate Faculty of the North Dakota State University of Agriculture

More information

Dynamic Determination of Mixed Model Covariance Structures. in Double-blind Clinical Trials. Matthew Davis - Omnicare Clinical Research

Dynamic Determination of Mixed Model Covariance Structures. in Double-blind Clinical Trials. Matthew Davis - Omnicare Clinical Research PharmaSUG2010 - Paper SP12 Dynamic Determination of Mixed Model Covariance Structures in Double-blind Clinical Trials Matthew Davis - Omnicare Clinical Research Abstract With the computing power of SAS

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Introduction to Linear Mixed Models: Modeling continuous longitudinal outcomes

Introduction to Linear Mixed Models: Modeling continuous longitudinal outcomes 1/64 to : Modeling continuous longitudinal outcomes Dr Cameron Hurst cphurst@gmail.com CEU, ACRO and DAMASAC, Khon Kaen University 4 th Febuary, 2557 2/64 Some motivational datasets Before we start, I

More information

Covariance Structure Approach to Within-Cases

Covariance Structure Approach to Within-Cases Covariance Structure Approach to Within-Cases Remember how the data file grapefruit1.data looks: Store sales1 sales2 sales3 1 62.1 61.3 60.8 2 58.2 57.9 55.1 3 51.6 49.2 46.2 4 53.7 51.5 48.3 5 61.4 58.7

More information

Topic 23: Diagnostics and Remedies

Topic 23: Diagnostics and Remedies Topic 23: Diagnostics and Remedies Outline Diagnostics residual checks ANOVA remedial measures Diagnostics Overview We will take the diagnostics and remedial measures that we learned for regression and

More information

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED Here we provide syntax for fitting the lower-level mediation model using the MIXED procedure in SAS as well as a sas macro, IndTest.sas

More information

Repeated Measures Design. Advertising Sales Example

Repeated Measures Design. Advertising Sales Example STAT:5201 Anaylsis/Applied Statistic II Repeated Measures Design Advertising Sales Example A company is interested in comparing the success of two different advertising campaigns. It has 10 test markets,

More information

Analysis of variance and regression. November 22, 2007

Analysis of variance and regression. November 22, 2007 Analysis of variance and regression November 22, 2007 Parametrisations: Choice of parameters Comparison of models Test for linearity Linear splines Lene Theil Skovgaard, Dept. of Biostatistics, Institute

More information

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models

More information

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models

Correlated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 29, 2016 One-way anova with random variation The rabbit example Hierarchical

More information

Modelling the Covariance

Modelling the Covariance Modelling the Covariance Jamie Monogan Washington University in St Louis February 9, 2010 Jamie Monogan (WUStL) Modelling the Covariance February 9, 2010 1 / 13 Objectives By the end of this meeting, participants

More information

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Paper S02-2007 RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Eli Lilly & Company, Indianapolis, IN ABSTRACT

More information

Introduction to Crossover Trials

Introduction to Crossover Trials Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter

More information

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad 1 Supplemental Materials Graphing Values for Individual Dyad Members over Time In the main text, we recommend graphing physiological values for individual dyad members over time to aid in the decision

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Independence (Null) Baseline Model: Item means and variances, but NO covariances

Independence (Null) Baseline Model: Item means and variances, but NO covariances CFA Example Using Forgiveness of Situations (N = 1103) using SAS MIXED (See Example 4 for corresponding Mplus syntax and output) SAS Code to Read in Mplus Data: * Import data from Mplus, becomes var1-var23

More information

Lecture 4. Random Effects in Completely Randomized Design

Lecture 4. Random Effects in Completely Randomized Design Lecture 4. Random Effects in Completely Randomized Design Montgomery: 3.9, 13.1 and 13.7 1 Lecture 4 Page 1 Random Effects vs Fixed Effects Consider factor with numerous possible levels Want to draw inference

More information

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials CHL 55 H Crossover Trials The Two-sequence, Two-Treatment, Two-period Crossover Trial Definition A trial in which patients are randomly allocated to one of two sequences of treatments (either 1 then, or

More information

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual change across time 2. MRM more flexible in terms of repeated

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: Summary of building unconditional models for time Missing predictors in MLM Effects of time-invariant predictors Fixed, systematically varying,

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

Describing Nonlinear Change Over Time

Describing Nonlinear Change Over Time Describing Nonlinear Change Over Time Longitudinal Data Analysis Workshop Section 8 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 8: Describing

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

Topic 20: Single Factor Analysis of Variance

Topic 20: Single Factor Analysis of Variance Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory

More information

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science. Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint

More information

Chapter 9. Multivariate and Within-cases Analysis. 9.1 Multivariate Analysis of Variance

Chapter 9. Multivariate and Within-cases Analysis. 9.1 Multivariate Analysis of Variance Chapter 9 Multivariate and Within-cases Analysis 9.1 Multivariate Analysis of Variance Multivariate means more than one response variable at once. Why do it? Primarily because if you do parallel analyses

More information

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014 LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Varians- og regressionsanalyse

Varians- og regressionsanalyse Faculty of Health Sciences Varians- og regressionsanalyse Variance component models Lene Theil Skovgaard Department of Biostatistics Variance component models Definitions and motivation One-way anova with

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA Topics: Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA What are MI and DIF? Testing measurement invariance in CFA Testing differential item functioning in IRT/IFA

More information

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences

Correlated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences Faculty of Health Sciences Variance component models Definitions and motivation Correlated data Variance component models, I Lene Theil Skovgaard November 29, 2013 One-way anova with random variation The

More information

Longitudinal Data Analysis

Longitudinal Data Analysis Longitudinal Data Analysis Mike Allerhand This document has been produced for the CCACE short course: Longitudinal Data Analysis. No part of this document may be reproduced, in any form or by any means,

More information

Biostatistics 301A. Repeated measurement analysis (mixed models)

Biostatistics 301A. Repeated measurement analysis (mixed models) B a s i c S t a t i s t i c s F o r D o c t o r s Singapore Med J 2004 Vol 45(10) : 456 CME Article Biostatistics 301A. Repeated measurement analysis (mixed models) Y H Chan Faculty of Medicine National

More information

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p ) Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was

More information

Chapter 3 ANALYSIS OF RESPONSE PROFILES

Chapter 3 ANALYSIS OF RESPONSE PROFILES Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

Lab 11. Multilevel Models. Description of Data

Lab 11. Multilevel Models. Description of Data Lab 11 Multilevel Models Henian Chen, M.D., Ph.D. Description of Data MULTILEVEL.TXT is clustered data for 386 women distributed across 40 groups. ID: 386 women, id from 1 to 386, individual level (level

More information

Random Intercept Models

Random Intercept Models Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept

More information