Stat 579: Generalized Linear Models and Extensions

Size: px
Start display at page:

Download "Stat 579: Generalized Linear Models and Extensions"

Transcription

1 Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 12 1 / 34

2 Correlated data multivariate observations clustered data repeated measurement longitudinal data spatially correlated data 2 / 34

3 Example: Multivariate: a subject s systolic and diastolic blood pressure are measured simultaneously -measurement of different characteristics Cluster setting: for a number of families, diastolic blood pressure is measured for all of their members Repeated measure: for each subject, diastolic blood pressure is recorded under several experimental conditions -measurement of the same characteristic under a different condition Longitudinal data: diastolic blood pressure is measured repeated over time for each subject 3 / 34

4 Problems of repeated measure and longitudinal design: Order effect Example: in evaluating five different advertisements, subjects may tend to give higher (or lower) ratings for advertisements shown-toward the end of the sequence than at the beginning Remedy: randomization of the treatment orders Carry over effect Example: in evaluating five different soup recipes, a bland recipe may get a higher (or lower) rating when preceded by a highly spiced recipe than when preceded by a blander recipe. Remedy: allowing sufficient time between treatments. Example: in evaluating a new drug, it is often noticed that the effect of the drug began to wear off over time. Remedy: consider a mixed model by incorporating time effect typical models: Random intercepts model, and Random slopes and intercepts model 4 / 34

5 Example: Childhood obesity n children are followed longitudinally (over time) body mass index (BMI = WT/HT 2 ) has been recorded at each time (age) interested in the relationship of BMI as a function of age data structure: time (age) 1 time (age) 2 time (age) n i 1st child y 11 y 12 y 1n1 2nd child y 21 y 22 y 2n2. nth child y n1 y n2 y nnn 5 / 34

6 Model 1: Simple linear regression model y ij = β 0 + β 1 age ij + ɛ ij where i = 1, 2., n, j = 1, 2,, n i and ɛ ij iid N(0, σ 2 ) Assume that gender is unimportant, same relationship for all children. 6 / 34

7 Model 2: Intercepts and slopes vary by child where i = 1, 2., n, j = 1, 2,, n i and ɛ ij iid N(0, σ 2 ) y ij = β 0i + β 1i age ij + ɛ ij Model 1: 2 parameters (β 0, β 1 ) Model 2: 2n parameters (β 0i, β 1i ) Responses on the same child are likely to be correlated, iid assumption may not work 7 / 34

8 Model 3: Mixed model Treat the subject specific intercepts β 0i and slopes β 1i as r.vs β 0i = β 0 + β0i, β iid 0i N(0, σ 2 0 ) β 1i = β 1 + β1i, β iid 1i N(0, σ 2 1 ) β0i, β 1i measure the difference between the population mean intercept and slope (β 0, β 1 ) and the subject specific intercept and slope (β 0i, β 1i ) y ij = β 0 + β 0i + (β 1 + β 1i)age ij + ɛ ij = β 0 + β 1 age ij + β 0i + β 1iage ij + ɛ ij β 0 + β 1 age ij : population average regression line β 0i + β 1i age ij: subject specific deviation from population average A special case of a linear mixed model (LMM): fixed effects (β 0, β 1 ) + random effects (β 0i, β 1i ) 8 / 34

9 Advantages: responses on the same individual are correlated y i1, y i2,, y ini all depend on β 0i, β 1i Model 2: 2n parameters Model 3: 5 parameters Model 2 only applies to the individuals selected from study, while Model 3, individuals are sampled as representatives of an underlying population, therefore, applying to the population 9 / 34

10 General Linear Models for Longitudinal Data y i = y i1 y i2., µ i = µ i1 µ i2., ɛ i = ɛ i1 ɛ i2. y ini µ ini ɛ ini y i s are responses at times t i1, t i2, t ini y i = µ i + ɛ i number of time points can vary by individual assume that µ i = X i ni pβ p 1, X i may be individual specific assume E(ɛ i ) = 0, Var(ɛ i ) = Var(y i ) = Σ i often assume ɛ i N ni (0, Σ i ) or equivalently, y i N ni (µ i, Σ i ) 10 / 34

11 Example: dental data Orthodontic distance is measured at ages t i1 = 8, t i2 = 10, t i3 = 12, t i4 = 14 (n i = 4) for each subject from 2 groups (boys (red) and girls (black)) > ex.data obs subject age distance gender / 34

12 > aa<-aggregate(distance~age+gender, data=ex.data, mean) #cell means > aa age gender distance / 34

13 Figure 1: Sample means at each time across children gender distance age Increasing linear trend with time, males (red) and females (black) have different intercepts and possibly different slopes. 13 / 34

14 Starting model i: subject, j: time Let Let δ i = y ij = µ ij + ɛ ij { 1 if boy 0 if girl µ ij = β 0 + β 1 δ i + β 2 t j + β 3 (δ i t j ) δ i : indicator effect for gender t j : linear time effect (δ i t j ): time and gender interactions 14 / 34

15 µ ij = β 0 + β 1 δ i + β 2 t j + β 3 (δ i t j ) For δ i = 0 (girls) µ ij = β 0 + β 1 (0) + β 2 t j + β 3 (0) = β 0 + β 2 t j For δ i = 1 (boys) µ ij = β 0 + β 1 (1) + β 2 t j + β 3 (1 t j ) = (β 0 + β 1 ) + (β 2 + β 3 )t j gender intercept slope girls β 0 β 2 boys β 0 + β 1 β 2 + β 3 15 / 34

16 Coefficient for gender and gender * time interaction measure difference in intercept and slope between boys and girls, GDP (group difference parametrization) for the model µ i = X i β 1 δ i t 1 δ i t 1 1 δ i t 2 δ i t 2 X i =...., β = 1 δ i t n δ i t n β 0 β 1 β 2 β 3 16 / 34

17 Separate group parametrization An equivalent model is to specify µ ij = β 0 (1 δ i ) + β 1 δ i + β 2 (1 δ i )t j + β 3 (δ i t j ) For δ i = 0 (girls), µ ij = β 0 + β 2 t j For δ i = 1 (boys), µ ij = β 1 + β 3 t j µ i = X i β Separate group parametrization (SGP) µ i1. µ ini = 1 δ i δ i (1 δ i )t j δ i t j δ i δ i (1 δ i )t j δ i t j β 0 β 1 β 2 β 3 17 / 34

18 For girls For boys 1 0 t t 2 0 X i = t ni t t 2 X i = t ni 18 / 34

19 Split Plot model Alternatively, we could consider the split-plot model µ ij = µ + τ i + δ j + (τδ) ij The models relates the mean to a linear function of time would appear to be more appealing. 19 / 34

20 Quadratic curves fitting Hip-replacement study: 30 patients underwent hip replacement surgery, 13 males and 17 females, the ratio of volume packed red blood cells relative to volume of whole blood recorded on a percentage basis, was supposed to be measured for each patient at week 0 before the replacement, and then at weeks 1, 2 and 3, after replacement. collected at 4 equally spaced time points 0, 1,2 3 weeks but some responses missing mean profiles appear to be quadratic in time effect also interested in whether age might response given short duration of study, makes sense to treat a subjects age as constant. 20 / 34

21 Let δ i = { 1 males 0 females y ij = µ ij + ɛ ij µ ij = β 0 + β 1 t ij + β 2 t 2 ij + β 3 δ i + β 4 δ i t ij + β 5 δ i t 2 ij For females For males µ ij = β 0 + β 1 t ij + β 2 t 2 ij µ ij = (β 0 + β 3 ) + (β 1 + β 4 )t ij + (β 2 + β 5 )t 2 ij quadratic curves shift up or down with age, but amount of shift is same for males and females. We could allow the shift to be sex dependent by includes a gender*age interaction. 21 / 34

22 Modeling the covariance matrix Σ i y i = µ i + ɛ i Var(ɛ i ) = Var(y i ) = Σ i (ni n i ) For balanced case n i = n, t 1, t 2,, t n assume no missing data further, assume that Σ i = Σ 22 / 34

23 Equicorrelation Σ = σ 2 σ 2 ρ σ 2... σ 2 ρ σ 2 This structure is also called spherical or exchangeable It might be applicable to cluster data, where ρ is called the intra-class correlation coefficient between two members of the same cluster and is a relative measure of the within-cluster similarity. Each response has variance σ 2 and all pairwise correlations are ρ Believe that correlation between responses at any two points in a time interval are constant 23 / 34

24 Compound symmetry A special case of equicorrelation, arises by enforcing ρ = σ 2 B /(σ2 B + σ2 e) for some σ 2 B and σ2 e. In this case, σ 2 = σ 2 B + σ2 e, then σb 2 + σ2 e σb 2 σb 2 σ2 B σb 2 σb 2 Σ = + σ2 e σb 2 σ2 B σb 2 σb 2 σb 2 σ2 B + σ2 e n n Recall that a split plot design has this variance structure. 24 / 34

25 Unstructured arbitrary variances and covariances Σ = σ1 2 σ 12 σ 1n σ 21 σ2 2 σ 2n..... σ n1 σ n2 σn 2 the variance-covariance matrix contains, n(n 1)/2 + n nuisance parameters to be estimated estimation of this structure may only convergence for N >> n the statistical power under this structure is reduced since the only constraint on Σ i is that it be symmetric 25 / 34

26 1 dependent Σ = σ 2 1 ρ ρ 0... ρ 1 constant variance of σ 2 lag 1, correlation = ρ, i.e., corr(y ij, y i(j±1) ) = ρ lag k correlation =0, i.e., corr(y ij, y i(j±k) ) = 0, if k > 1 observations taken more closely together in time might tend to be more alike than those taken further apart 26 / 34

27 AR(1): auto regressive order 1 (equally spaced in time) Σ = σ 2 1 ρ ρ 2 ρ n 1 1 ρ ρ n ρ 1 constant variance σ 2 corr(y ij, y i(j±k) ) = ρ k 27 / 34

28 Unequally spaced in time The above models make sense for equally spaced times. How about if times are unequally spaced, but we still have same times each subject? compound symmetry: if you believe that correlations between responses at any two points in a time interval are constant, then ok unstructured: ok 1 dependent, not for sure, if t 3 = t 2 + ɛ with ɛ small, should corr(y t1, y t2 ) = ρ, yet corr(y t1, y t3 ) = 0 AR(1), same issue as 1 dependent 28 / 34

29 Generalization of AR(1) to unequally spaced times Markov model-correlation is a function of distance between times Let d jh = t j t h, then Markov model has - var(y tj ) = σ 2 - corr(y tj, y th ) = ρ d jh 29 / 34

30 Deciding among covariance models inspection of sample covariance matrices can use formal test in some cases, model selection methods AIC, BIC, for non-nested models 30 / 34

31 Unbalanced case: or some observations are missing on some units Example: (t 1, t 2, t 3, t 4 ) = (0, 1, 2, 3) For unit i, observation at time t 3 is not available n i = 3. Let y i1 y i = y i2 y i4 denote the observations at times (t 1, t 2, t 4 ) = (0, 1, 3) Assume that var(y ij ) = σ 2 for all j, we thus want a model for Σ i = var(y i ) = σ 2 cov(y i1, y i2 ) cov(y i1, y i4 ) cov(y i2, y i1 ) σ 2 cov(y i2, y i4 ) cov(y i4, y i1 ) cov(y i4, y i2 ) σ 2 31 / 34

32 Compound symmetry: represented in the same way regardless of the missing value observations any distance apart have the same correlation Unstructured, Σ i = var(y i ) = σ 2 1 σ 12 σ 14 σ 21 σ 2 2 σ 24 σ 41 σ 42 σ 2 4 One-dependent, which says that only observations adjacent in time are correlated, this matrix becomes σ 2 ρσ 2 0 Σ i = var(y i ) = ρσ 2 σ σ 2 32 / 34

33 AR(1): Σ i = var(y i ) = σ 2 1 ρ ρ 3 ρ 1 ρ 2 ρ 3 ρ 2 1 Comments: if all observations were intended to be taken at the same times, but some are not available, the covariance matrix must be carefully constructed. 33 / 34

34 Independence assumption: Assume that observations within a unit are uncorrelated, under multivariate normality they are independent Σ i = var(ɛ i ) = σ 2 I ni Independence says that observations on the same unit are no more alike than those across units, unrealistic for longitudinal data Independence also means no correlation included by within-unit fluctuations over time -this is ok, if observations are all taken sufficiently far apart in time from one another. -most situation, cause model misspecification 34 / 34

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 14 1 / 64 Data structure and Model t1 t2 tn i 1st subject y 11 y 12 y 1n1 2nd subject

More information

4 Introduction to modeling longitudinal data

4 Introduction to modeling longitudinal data 4 Introduction to modeling longitudinal data We are now in a position to introduce a basic statistical model for longitudinal data. The models and methods we discuss in subsequent chapters may be viewed

More information

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington Analsis of Longitudinal Data Patrick J. Heagert PhD Department of Biostatistics Universit of Washington 1 Auckland 2008 Session Three Outline Role of correlation Impact proper standard errors Used to weight

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject

More information

Modelling the Covariance

Modelling the Covariance Modelling the Covariance Jamie Monogan Washington University in St Louis February 9, 2010 Jamie Monogan (WUStL) Modelling the Covariance February 9, 2010 1 / 13 Objectives By the end of this meeting, participants

More information

Intruction to General and Generalized Linear Models

Intruction to General and Generalized Linear Models Intruction to General and Generalized Linear Models Mixed Effects Models IV Henrik Madsen Anna Helga Jónsdóttir hm@imm.dtu.dk April 30, 2012 Henrik Madsen Anna Helga Jónsdóttir (hm@imm.dtu.dk) Intruction

More information

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology Serial Correlation Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 017 Model for Level 1 Residuals There are three sources

More information

REPEATED MEASURES. Copyright c 2012 (Iowa State University) Statistics / 29

REPEATED MEASURES. Copyright c 2012 (Iowa State University) Statistics / 29 REPEATED MEASURES Copyright c 2012 (Iowa State University) Statistics 511 1 / 29 Repeated Measures Example In an exercise therapy study, subjects were assigned to one of three weightlifting programs i=1:

More information

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE Biostatistics Workshop 2008 Longitudinal Data Analysis Session 4 GARRETT FITZMAURICE Harvard University 1 LINEAR MIXED EFFECTS MODELS Motivating Example: Influence of Menarche on Changes in Body Fat Prospective

More information

ST 732, Midterm Solutions Spring 2019

ST 732, Midterm Solutions Spring 2019 ST 732, Midterm Solutions Spring 2019 Please sign the following pledge certifying that the work on this test is your own: I have neither given nor received aid on this test. Signature: Printed Name: There

More information

Covariance Models (*) X i : (n i p) design matrix for fixed effects β : (p 1) regression coefficient for fixed effects

Covariance Models (*) X i : (n i p) design matrix for fixed effects β : (p 1) regression coefficient for fixed effects Covariance Models (*) Mixed Models Laird & Ware (1982) Y i = X i β + Z i b i + e i Y i : (n i 1) response vector X i : (n i p) design matrix for fixed effects β : (p 1) regression coefficient for fixed

More information

ST 790, Homework 1 Spring 2017

ST 790, Homework 1 Spring 2017 ST 790, Homework 1 Spring 2017 1. In EXAMPLE 1 of Chapter 1 of the notes, it is shown at the bottom of page 22 that the complete case estimator for the mean µ of an outcome Y given in (1.18) under MNAR

More information

Step 2: Select Analyze, Mixed Models, and Linear.

Step 2: Select Analyze, Mixed Models, and Linear. Example 1a. 20 employees were given a mood questionnaire on Monday, Wednesday and again on Friday. The data will be first be analyzed using a Covariance Pattern model. Step 1: Copy Example1.sav data file

More information

Modeling the Covariance

Modeling the Covariance Modeling the Covariance Jamie Monogan University of Georgia February 3, 2016 Jamie Monogan (UGA) Modeling the Covariance February 3, 2016 1 / 16 Objectives By the end of this meeting, participants should

More information

Applied Multivariate and Longitudinal Data Analysis

Applied Multivariate and Longitudinal Data Analysis Applied Multivariate and Longitudinal Data Analysis Longitudinal Data Analysis: General Linear Model Ana-Maria Staicu SAS Hall 522; 99-55-644; astaicu@ncsu.edu Introduction Consider the following examples.

More information

LDA Midterm Due: 02/21/2005

LDA Midterm Due: 02/21/2005 LDA.665 Midterm Due: //5 Question : The randomized intervention trial is designed to answer the scientific questions: whether social network method is effective in retaining drug users in treatment programs,

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

STAT3401: Advanced data analysis Week 10: Models for Clustered Longitudinal Data

STAT3401: Advanced data analysis Week 10: Models for Clustered Longitudinal Data STAT3401: Advanced data analysis Week 10: Models for Clustered Longitudinal Data Berwin Turlach School of Mathematics and Statistics Berwin.Turlach@gmail.com The University of Western Australia Models

More information

Introduction to Linear Mixed Models: Modeling continuous longitudinal outcomes

Introduction to Linear Mixed Models: Modeling continuous longitudinal outcomes 1/64 to : Modeling continuous longitudinal outcomes Dr Cameron Hurst cphurst@gmail.com CEU, ACRO and DAMASAC, Khon Kaen University 4 th Febuary, 2557 2/64 Some motivational datasets Before we start, I

More information

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Paper S02-2007 RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Eli Lilly & Company, Indianapolis, IN ABSTRACT

More information

Statistical Methods III Statistics 212. Problem Set 2 - Answer Key

Statistical Methods III Statistics 212. Problem Set 2 - Answer Key Statistical Methods III Statistics 212 Problem Set 2 - Answer Key 1. (Analysis to be turned in and discussed on Tuesday, April 24th) The data for this problem are taken from long-term followup of 1423

More information

Random Intercept Models

Random Intercept Models Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept

More information

,..., θ(2),..., θ(n)

,..., θ(2),..., θ(n) Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Mixed models Yan Lu Feb, 2018, week 7 1 / 17 Some commonly used experimental designs related to mixed models Two way or three way random/mixed effects

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

Variance component models part I

Variance component models part I Faculty of Health Sciences Variance component models part I Analysis of repeated measurements, 30th November 2012 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Multilevel/Mixed Models and Longitudinal Analysis Using Stata

Multilevel/Mixed Models and Longitudinal Analysis Using Stata Multilevel/Mixed Models and Longitudinal Analysis Using Stata Isaac J. Washburn PhD Research Associate Oregon Social Learning Center Summer Workshop Series July 2010 Longitudinal Analysis 1 Longitudinal

More information

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements: Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported

More information

Mixed models with correlated measurement errors

Mixed models with correlated measurement errors Mixed models with correlated measurement errors Rasmus Waagepetersen October 9, 2018 Example from Department of Health Technology 25 subjects where exposed to electric pulses of 11 different durations

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */ CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

More information

1 Introduction. 2 Example

1 Introduction. 2 Example Statistics: Multilevel modelling Richard Buxton. 2008. Introduction Multilevel modelling is an approach that can be used to handle clustered or grouped data. Suppose we are trying to discover some of the

More information

22s:152 Applied Linear Regression. Returning to a continuous response variable Y...

22s:152 Applied Linear Regression. Returning to a continuous response variable Y... 22s:152 Applied Linear Regression Generalized Least Squares Returning to a continuous response variable Y... Ordinary Least Squares Estimation The classical models we have fit so far with a continuous

More information

22s:152 Applied Linear Regression. In matrix notation, we can write this model: Generalized Least Squares. Y = Xβ + ɛ with ɛ N n (0, Σ)

22s:152 Applied Linear Regression. In matrix notation, we can write this model: Generalized Least Squares. Y = Xβ + ɛ with ɛ N n (0, Σ) 22s:152 Applied Linear Regression Generalized Least Squares Returning to a continuous response variable Y Ordinary Least Squares Estimation The classical models we have fit so far with a continuous response

More information

STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test

STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test Rebecca Barter April 13, 2015 Let s now imagine a dataset for which our response variable, Y, may be influenced by two factors,

More information

3 Repeated Measures Analysis of Variance

3 Repeated Measures Analysis of Variance 3 Repeated Measures Analysis of Variance 3.1 Introduction As we have discussed, many approaches have been taken in the literature to specifying statistical models for longitudinal data. Within the framework

More information

Outline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes

Outline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes Lecture 2.1 Basic Linear LDA 1 Outline Linear OLS Models vs: Linear Marginal Models Linear Conditional Models Random Intercepts Random Intercepts & Slopes Cond l & Marginal Connections Empirical Bayes

More information

Longitudinal Data Analysis

Longitudinal Data Analysis Longitudinal Data Analysis Mike Allerhand This document has been produced for the CCACE short course: Longitudinal Data Analysis. No part of this document may be reproduced, in any form or by any means,

More information

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses Outline Marginal model Examples of marginal model GEE1 Augmented GEE GEE1.5 GEE2 Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association

More information

Multivariate Linear Regression Models

Multivariate Linear Regression Models Multivariate Linear Regression Models Regression analysis is used to predict the value of one or more responses from a set of predictors. It can also be used to estimate the linear association between

More information

Matrix Approach to Simple Linear Regression: An Overview

Matrix Approach to Simple Linear Regression: An Overview Matrix Approach to Simple Linear Regression: An Overview Aspects of matrices that you should know: Definition of a matrix Addition/subtraction/multiplication of matrices Symmetric/diagonal/identity matrix

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Describing Change over Time: Adding Linear Trends

Describing Change over Time: Adding Linear Trends Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Course topics (tentative) The role of random effects

Course topics (tentative) The role of random effects Course topics (tentative) random effects linear mixed models analysis of variance frequentist likelihood-based inference (MLE and REML) prediction Bayesian inference The role of random effects Rasmus Waagepetersen

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Variance components and LMMs Analysis of repeated measurements, 4th December 2014 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen

More information

Variance components and LMMs

Variance components and LMMs Faculty of Health Sciences Topics for today Variance components and LMMs Analysis of repeated measurements, 4th December 04 Leftover from 8/: Rest of random regression example. New concepts for today:

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

Economics Department LSE. Econometrics: Timeseries EXERCISE 1: SERIAL CORRELATION (ANALYTICAL)

Economics Department LSE. Econometrics: Timeseries EXERCISE 1: SERIAL CORRELATION (ANALYTICAL) Economics Department LSE EC402 Lent 2015 Danny Quah TW1.10.01A x7535 : Timeseries EXERCISE 1: SERIAL CORRELATION (ANALYTICAL) 1. Suppose ɛ is w.n. (0, σ 2 ), ρ < 1, and W t = ρw t 1 + ɛ t, for t = 1, 2,....

More information

Figure 36: Respiratory infection versus time for the first 49 children.

Figure 36: Respiratory infection versus time for the first 49 children. y BINARY DATA MODELS We devote an entire chapter to binary data since such data are challenging, both in terms of modeling the dependence, and parameter interpretation. We again consider mixed effects

More information

Chapter 3 ANALYSIS OF RESPONSE PROFILES

Chapter 3 ANALYSIS OF RESPONSE PROFILES Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over

More information

Impact of serial correlation structures on random effect misspecification with the linear mixed model.

Impact of serial correlation structures on random effect misspecification with the linear mixed model. Impact of serial correlation structures on random effect misspecification with the linear mixed model. Brandon LeBeau University of Iowa file:///c:/users/bleb/onedrive%20 %20University%20of%20Iowa%201/JournalArticlesInProgress/Diss/Study2/Pres/pres.html#(2)

More information

An R # Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM

An R # Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM An R Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM Lloyd J. Edwards, Ph.D. UNC-CH Department of Biostatistics email: Lloyd_Edwards@unc.edu Presented to the Department

More information

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual change across time 2. MRM more flexible in terms of repeated

More information

Modelling Repeated Measurements of Renal Function during dialysis with cut off due to complete kidney failure

Modelling Repeated Measurements of Renal Function during dialysis with cut off due to complete kidney failure Gerard G.A. Westhoff Modelling Repeated Measurements of Renal Function during dialysis with cut off due to complete kidney failure Master thesis, defended on November 12, 2009 Thesis advisors: Prof. dr.

More information

STAT 540: Data Analysis and Regression

STAT 540: Data Analysis and Regression STAT 540: Data Analysis and Regression Wen Zhou http://www.stat.colostate.edu/~riczw/ Email: riczw@stat.colostate.edu Department of Statistics Colorado State University Fall 205 W. Zhou (Colorado State

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

Linear models Analysis of Covariance

Linear models Analysis of Covariance Esben Budtz-Jørgensen April 22, 2008 Linear models Analysis of Covariance Confounding Interactions Parameterizations Analysis of Covariance group comparisons can become biased if an important predictor

More information

Linear & Generalized Linear Mixed Models

Linear & Generalized Linear Mixed Models Linear & Generalized Linear Mixed Models KKS-Netzwerk: Fachgruppe Biometrie December 2016 Geert Verbeke geert.verbeke@kuleuven.be http://perswww.kuleuven.be/geert_verbeke Interuniversity Institute for

More information

Chapter 4: Models for Stationary Time Series

Chapter 4: Models for Stationary Time Series Chapter 4: Models for Stationary Time Series Now we will introduce some useful parametric models for time series that are stationary processes. We begin by defining the General Linear Process. Let {Y t

More information

Analyzing the Behavior of Rats by Repeated Measurements

Analyzing the Behavior of Rats by Repeated Measurements Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics 5-3-007 Analyzing the Behavior of Rats by Repeated Measurements Kenita A. Hall

More information

Measurement Error. Martin Bland. Accuracy and precision. Error. Measurement in Health and Disease. Professor of Health Statistics University of York

Measurement Error. Martin Bland. Accuracy and precision. Error. Measurement in Health and Disease. Professor of Health Statistics University of York Measurement in Health and Disease Measurement Error Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Accuracy and precision In this lecture: measurements which are

More information

Lecture 6: Linear Regression (continued)

Lecture 6: Linear Regression (continued) Lecture 6: Linear Regression (continued) Reading: Sections 3.1-3.3 STATS 202: Data mining and analysis October 6, 2017 1 / 23 Multiple linear regression Y = β 0 + β 1 X 1 + + β p X p + ε Y ε N (0, σ) i.i.d.

More information

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons: STAT 263/363: Experimental Design Winter 206/7 Lecture January 9 Lecturer: Minyong Lee Scribe: Zachary del Rosario. Design of Experiments Why perform Design of Experiments (DOE)? There are at least two

More information

Y i = η + ɛ i, i = 1,...,n.

Y i = η + ɛ i, i = 1,...,n. Nonparametric tests If data do not come from a normal population (and if the sample is not large), we cannot use a t-test. One useful approach to creating test statistics is through the use of rank statistics.

More information

Outline. Statistical inference for linear mixed models. One-way ANOVA in matrix-vector form

Outline. Statistical inference for linear mixed models. One-way ANOVA in matrix-vector form Outline Statistical inference for linear mixed models Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark general form of linear mixed models examples of analyses using linear mixed

More information

STAT 526 Advanced Statistical Methodology

STAT 526 Advanced Statistical Methodology STAT 526 Advanced Statistical Methodology Fall 2017 Lecture Note 10 Analyzing Clustered/Repeated Categorical Data 0-0 Outline Clustered/Repeated Categorical Data Generalized Linear Mixed Models Generalized

More information

Non-Stationary Time Series and Unit Root Testing

Non-Stationary Time Series and Unit Root Testing Econometrics II Non-Stationary Time Series and Unit Root Testing Morten Nyboe Tabor Course Outline: Non-Stationary Time Series and Unit Root Testing 1 Stationarity and Deviation from Stationarity Trend-Stationarity

More information

Accounting for Baseline Observations in Randomized Clinical Trials

Accounting for Baseline Observations in Randomized Clinical Trials Accounting for Baseline Observations in Randomized Clinical Trials Scott S Emerson, MD, PhD Department of Biostatistics, University of Washington, Seattle, WA 9895, USA October 6, 0 Abstract In clinical

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Design of Experiments

Design of Experiments Design of Experiments DOE Radu T. Trîmbiţaş April 20, 2016 1 Introduction The Elements Affecting the Information in a Sample Generally, the design of experiments (DOE) is a very broad subject concerned

More information

Well-developed and understood properties

Well-developed and understood properties 1 INTRODUCTION TO LINEAR MODELS 1 THE CLASSICAL LINEAR MODEL Most commonly used statistical models Flexible models Well-developed and understood properties Ease of interpretation Building block for more

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of

More information

ANOVA approaches to Repeated Measures. repeated measures MANOVA (chapter 3)

ANOVA approaches to Repeated Measures. repeated measures MANOVA (chapter 3) ANOVA approaches to Repeated Measures univariate repeated-measures ANOVA (chapter 2) repeated measures MANOVA (chapter 3) Assumptions Interval measurement and normally distributed errors (homogeneous across

More information

BIOS 2083 Linear Models c Abdus S. Wahed

BIOS 2083 Linear Models c Abdus S. Wahed Chapter 5 206 Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures

Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Describing Within-Person Fluctuation over Time using Alternative Covariance Structures Today s Class: The Big Picture ACS models using the R matrix only Introducing the G, Z, and V matrices ACS models

More information

BIOS 2083: Linear Models

BIOS 2083: Linear Models BIOS 2083: Linear Models Abdus S Wahed September 2, 2009 Chapter 0 2 Chapter 1 Introduction to linear models 1.1 Linear Models: Definition and Examples Example 1.1.1. Estimating the mean of a N(μ, σ 2

More information

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard

Linear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard Faculty of Health Sciences Linear mixed models Analysis of repeated measurements, 10th March 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 80 Program

More information

Categorical and Zero Inflated Growth Models

Categorical and Zero Inflated Growth Models Categorical and Zero Inflated Growth Models Alan C. Acock* Summer, 2009 *Alan C. Acock, Department of Human Development and Family Sciences, Oregon State University, Corvallis OR 97331 (alan.acock@oregonstate.edu).

More information

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of

More information

Thursday Morning. Growth Modelling in Mplus. Using a set of repeated continuous measures of bodyweight

Thursday Morning. Growth Modelling in Mplus. Using a set of repeated continuous measures of bodyweight Thursday Morning Growth Modelling in Mplus Using a set of repeated continuous measures of bodyweight 1 Growth modelling Continuous Data Mplus model syntax refresher ALSPAC Confirmatory Factor Analysis

More information

Lecture 3 Linear random intercept models

Lecture 3 Linear random intercept models Lecture 3 Linear random intercept models Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The response is measures at n different times, or under

More information

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples

More information

Need for Several Predictor Variables

Need for Several Predictor Variables Multiple regression One of the most widely used tools in statistical analysis Matrix expressions for multiple regression are the same as for simple linear regression Need for Several Predictor Variables

More information

Review of Multilevel Models for Longitudinal Data

Review of Multilevel Models for Longitudinal Data Review of Multilevel Models for Longitudinal Data Topics: Concepts in longitudinal multilevel modeling Describing within-person fluctuation using ACS models Describing within-person change using random

More information

Variance component models

Variance component models Faculty of Health Sciences Variance component models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Topics for

More information

Multivariate Time Series: VAR(p) Processes and Models

Multivariate Time Series: VAR(p) Processes and Models Multivariate Time Series: VAR(p) Processes and Models A VAR(p) model, for p > 0 is X t = φ 0 + Φ 1 X t 1 + + Φ p X t p + A t, where X t, φ 0, and X t i are k-vectors, Φ 1,..., Φ p are k k matrices, with

More information

Linear models Analysis of Covariance

Linear models Analysis of Covariance Esben Budtz-Jørgensen November 20, 2007 Linear models Analysis of Covariance Confounding Interactions Parameterizations Analysis of Covariance group comparisons can become biased if an important predictor

More information

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination McGill University Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II Final Examination Date: 23rd April 2007 Time: 2pm-5pm Examiner: Dr. David A. Stephens Associate Examiner: Dr. Russell Steele Please

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation Using the least squares estimator for β we can obtain predicted values and compute residuals: Ŷ = Z ˆβ = Z(Z Z) 1 Z Y ˆɛ = Y Ŷ = Y Z(Z Z) 1 Z Y = [I Z(Z Z) 1 Z ]Y. The usual decomposition

More information

Model Assumptions; Predicting Heterogeneity of Variance

Model Assumptions; Predicting Heterogeneity of Variance Model Assumptions; Predicting Heterogeneity of Variance Today s topics: Model assumptions Normality Constant variance Predicting heterogeneity of variance CLP 945: Lecture 6 1 Checking for Violations of

More information

with the usual assumptions about the error term. The two values of X 1 X 2 0 1

with the usual assumptions about the error term. The two values of X 1 X 2 0 1 Sample questions 1. A researcher is investigating the effects of two factors, X 1 and X 2, each at 2 levels, on a response variable Y. A balanced two-factor factorial design is used with 1 replicate. The

More information

Multiple linear regression S6

Multiple linear regression S6 Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

Nested Designs & Random Effects

Nested Designs & Random Effects Nested Designs & Random Effects Timothy Hanson Department of Statistics, University of South Carolina Stat 506: Introduction to Design of Experiments 1 / 17 Bottling plant production A production engineer

More information