Variance component models part I
|
|
- Cameron Simmons
- 5 years ago
- Views:
Transcription
1 Faculty of Health Sciences Variance component models part I Analysis of repeated measurements, 30th November 2012 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen
2 Plan for this lecture Variance component models (aka random effects models) for repeated measurements or clustered data. Two- and three-level random effects models (Linear models for repeated measurements.) Advantages/necessity of modeling random effects. Analysis with SAS Strategies for (mis)handling repeated measurements. Ecological analysis. 2 / 60
3 university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s What are repeated measurents? Repeated measurements refer to data where the same outcome has been measured in different situations (or at different spots) on the same individuals. I 3 / 60 Special case: longitudinal means repeatedly over time.
4 university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s What is clustered data? Repeated measurements are termed clustered data when the same outcome is measured on groups of individuals from the same families/workplaces/school classes/villages/etc. 4 / 60
5 Outline The two-level model (random effects ANOVA) SAS proc mixed Variance components in general Example: A three-level model Strategies for (mis)handling repeated measurements Ecological analyses 5 / 60
6 Case study: Retinopathy of prematurity 34 premature children were assesed for retinopathy of prematurity on one eye by four leading experts. Each eye was ranked on a scale from 0 to 5 by all four experts. Outcome: Severity of desease as measured by the score. Aims: Describe desease severity in the populatioin. Assess the difference in use of the scale among the experts. 6 / 60
7 Case study: Retinopathy of prematurity Eye Expert Mean no no 1 no 2 no 3 no 4 score We expect that severity of desease varies from child to child. 7 / 60
8 Case study: Retinopathy of prematurity 8 / 60
9 Random effects ANOVA Comparison of data from k groups (individuals/clusters): The groups are of no individual interest and it is of no relevance to test whether they have identical means (most likely they don t). The groups may be thought of as representatives from a population, that we want to describe. We don t have the same amount of information as if all observations were sampled independently. Our statistical analysis must reflect this! 9 / 60
10 The two-level model We model the deviations (variances) instead of the levels (means): score y ij = µ + a i + ε ij, i = 1,..., 34 j = 1,..., 4 = population mean + between-eyes deviation + within-eye deviation Assuming the deviations (the a i s and ε ij s) are independent and normally distributed: a i N (0, ω 2 B), ε ij N (0, σ 2 W ) The deviations between and within eyes are considered random and their variances, are the so-called variance components. 10 / 60
11 Case study: Retinopathy of prematurity 11 / 60
12 Interpretaion of model parameters All observations follow the same normal distribution y ij N (µ, ω 2 B + σ 2 W ) µ is the population mean (the grand mean). ω 2 B + σ2 W is the population variance (the total variation). But: Measurements made on the same eye are correlated with the so-called intra-class correlation Corr(y i1, y i2 ) = ρ = ω 2 B ω 2 B + σ2 W 0.82 I.e. scores of the same eye tend to look more alike than scores from different eyes. 12 / 60
13 Case study: Retinopathy of prematurity Parameter estimates: Variance components (random effects): Variation Variance component Estimate %of variation Between ω 2 B % Within σ 2 W % Total ω 2 B + σ2 W % Covariates (fixed effects): Effect Parameter Estimate 95%CI Population mean µ 1.90 (1.43, 2.37) 13 / 60
14 Outline The two-level model (random effects ANOVA) SAS proc mixed Variance components in general Example: A three-level model Strategies for (mis)handling repeated measurements Ecological analyses 14 / 60
15 Case study: retinopathy of prematurity PROC MIXED DATA=eyedata COVTEST; CLASS eye; MODEL score = / SOLUTION CL; RANDOM intercept / subject=eye; RUN; Syntax is similar to PROC GLM with a MODEL-statement specifying the (linear) relationship between outcome and the covariates. Note: fixed effects only!. Random effects are specified in a separate RANDOM-statement. Cathegorical variable (fixed as well as random) must be declared with CLASS. 15 / 60
16 SAS: proc mixed output Model Information Data Set Dependent Variable Covariance Structure Estimation Method Residual Variance Method Fixed Effects SE Method Degrees of Freedom Method WORK.EYEDATA score Variance Components REML Profile Model-Based Containment Class Level Information Class Levels Values eye Dimensions Covariance Parameters 2 Columns in X 1 Columns in Z Per Subject 1 Subjects 34 Max Obs Per Subject 4 16 / 60
17 SAS: proc mixed output Number of Observations Number of Observations Read 136 Number of Observations Used 133 Number of Observations Not Used 3 Iteration History Iteration Evaluations -2 Res Log Like Criterion Convergence criteria met. Always check that the numerical optimisation has converged. 17 / 60
18 SAS: proc mixed output Covariance Parameter Estimates Cov Parm Estimate Std.Error Value Pr > Z eye <.0001 Residual <.0001 Fit Statistics -2 Res Log Likelihood AIC (smaller is better) AICC (smaller is better) BIC (smaller is better) Solution for Fixed Effects Effect Estimate Std.Error DF t Value Pr > t Alpha Lower Upper Intercept < At last: Parameter estimates, tests, goodnes of fit criteria. 18 / 60
19 Case study: Retinopathy of prematurity Testing fixed effects level variation covariates 1 within eye expert 2 between eyes group Part of the variation within eyes could be explained by systematic differences between the experts. Part of the variation between eyes could be explained by systematic differences between the groups (e.g. treatment). 19 / 60
20 Case study: Retinopathy of prematurity Are there differences among the experts? Eye Expert Mean no no 1 no 2 no 3 no 4 score Mean / 60
21 Case study: Retinopathy of prematurity PROC MIXED DATA=eyedata; CLASS eye expert; MODEL score = expert / SOLUTION CL; RANDOM intercept / subject=eye; RUN; Solution for Fixed Effects Effect Estimate Std.Error DF t Value Pr > t Alpha Lower Upper Intercept < expert < expert expert expert Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F expert <.0001 Conclusion: Highly significant differences between experts. 21 / 60
22 Case study: Retinopathy of prematurity PROC MIXED DATA=eyedata; CLASS eye expert; MODEL score = expert / SOLUTION CL; RANDOM intercept / subject=eye; RUN; Covariance Parameter Estimates Standard Z Cov Parm Estimate Error Value Pr > Z eye <.0001 Residual <.0001 Note: Smaller estimate of residual variance because expert differences explain some of the within eye variation. 22 / 60
23 Case study: Retinopathy of prematurity PROC MIXED DATA=eyedata; CLASS eye grp; MODEL score = grp / SOLUTION CL; RANDOM intecept / subject=eye; RUN; Solution for Fixed Effects Effect Estimate Std.Error DF t Value Pr > t Alpha Lower Upper Intercept < grp grp Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F grp Conclusion: No significant difference between the two groups. 23 / 60
24 Case study: Retinopathy of prematurity Differences between scores of the same eye for expert 1 and 2: y i1 y i2 N (µ 1 µ 2, 2ωW 2 ) Typical differences (95% normal region)? ( ) ± = 0.88 ± 1.32 Differences between scores of the same eye for expert 3 and 4: y i3 y i4 N (µ 3 µ 4, 2ωW 2 ) Typical differences? ± = 0.12 ± 1.32 Don t compute normal regions unless your data is normal! 24 / 60
25 Outline The two-level model (random effects ANOVA) SAS proc mixed Variance components in general Example: A three-level model Strategies for (mis)handling repeated measurements Ecological analyses 25 / 60
26 General variance component models Generalisations of ANOVA and GLM models involving several sources of random variation, so-called variance components. Examples of sources of random variation: Environmental variation. Between regions, hospitals or countries. Biological variation. Between individuals, families or animals. Within-individual variation. Between arms, teeth, days. Variation due to uncontrollable circumstances. E.g. time of day, temperature, observer. Measurement error. 26 / 60
27 Multilevel models Variance component models are also called multilevel models. We have variation (i.e. a variance component) on each level. And possibly systematic effects (covariates) on each level. individual context/cluster context/cluster level 1 level 2 level 3 students classes schools patient clinic regions asessment eye child On all levels, we have random variation (random effect or a variance component) and possibly covariates (fixed effects). 27 / 60
28 Fixed or random effect? How do we decide whether a factor should be modeled as fixed or random? Fixed All values of the factor present (typically few, e.g. treatment). Allows inference for these particular values only. Demands a decent number of observations for each value. Random A representative sample of values of the factor is present. Allows inference to be extended beyond the values in the experiment and to the population they were sampled from. 28 / 60
29 Merits of multilevel models We get a better understanding of the various sources of variation. Certain effects may be estimated more precisely (higher power), since some sources of variation are eliminated, e.g. by making comparisons within a family. This is analogous to the paired comparison situation. When planning subsequent investigations, the knowledge of the relative sizes of the variance components will (in principle) be of help in deciding the number of repetitions needed at each level. 29 / 60
30 Drawbacks of multilevel models Their statistical analysis is more difficult. When making inference (estimation and testing), it is important to take all sources of variation into account, and effects have to be evaluated against the relevant variation. Results may be biased if one or more sources of variation are disregarded! 30 / 60
31 Necessity of multilevel models Measurements belonging together in the same cluster tend to look alike they are correlated. If we fail to take this correlation into account, we will experience: Possible bias in the mean value estimates. Too small standard errors (type 1 error) for estimates of level 2 covariates (between-cluster effects). Too large standard errors (type 2 error) for estimates of level 1 covariates (within-cluster effects) 31 / 60
32 Case study: Retinopathy of prematurity When the repeated measurements are ignored: PROC GLM DATA=eyedata; CLASS expert; MODEL score=expert / SOLUTION; RUN; Source DF Type III SS Mean Square F Value Pr > F expert Parameter Estimate Std.Error t Value Pr > t 95% Confidence Limits Intercept < expert expert expert expert Too large standard errors and type 2 error for level 1 effect! 32 / 60
33 Case study: Retinopathy of prematurity When the repeated measurements are ignored: PROC GLM DATA=eyedata; CLASS grp; MODEL score=grp / SOLUTION; RUN; Source DF Type III SS Mean Square F Value Pr > F grp Parameter Estimate Std.Error t Value Pr > t 95% Confidence Limits Intercept < grp grp Too small standard errors and type 1 error for level 2 effect! 33 / 60
34 Outline The two-level model (random effects ANOVA) SAS proc mixed Variance components in general Example: A three-level model Strategies for (mis)handling repeated measurements Ecological analyses 34 / 60
35 Case study: Prisme Outcome: Covariates: Kortisol koncentration in blood samples taken morning and evening in workers in Aarhus amt and kommune in 2007 (3536 participants) with similar follow-up in 2009 (2408 participants) Age, gender, BMI, level of education, stressors: lifeevents, Effort Reward Index. level variation covariates 3 between persons gender 2 within person: between days age, bmi, stressors 1 within person: within days time of day 35 / 60
36 Case study: Prisme Sample data from 8 men: NOTE: koncentration on logarithmic scale. 36 / 60
37 Case study: Prisme PROC MIXED DATA=prisme COVTEST; WHERE kvinde EQ 0; CLASS idnr us_aar tid; MODEL logkonc = tid / SOLUTION CL; RANDOM idnr idnr*us_aar; RUN; Covariance Parameter Estimates Cov Parm Estimate Std.Error Z Value Pr > Z idnr <.0001 idnr*us_aar 0... Residual <.0001 Type 3 Tests of Fixed Effects Num Den Effect DF DF F Value Pr > F tid <.0001 One of the variance component estimates is a zero. 37 / 60
38 Negative variance components In case on of the variance component estimates becomes negative, SAS repports a zero. What does it mean? The zero-estimate may be a chance finding due to statistical uncertainty. Or it might be the result of truly negative correlation within clusters - e.g. from competition. Correlation between days (within persons) might be negative -?? What can we do about it? Re-fit the model without the problematic random effect Use a marginal model which allows for negative correlation 38 / 60 Include exact time of day as a covariate
39 Case study: Prisme Estimated variance components Level Variation Estimate 3 between persons (ω 2 ) (10.0%) 2 between days (τ 2 ) (0.0%) 1 within days (σ 2 ) (90.0%) Total (100%) Level 2 covariates (stressors) can only have very little impact on individual cortisol koncentrations! Correlation between two measurements made on the same person: ω 2 + τ 2 ω 2 + τ 2 + σ 2 = ω 2 ω 2 + τ 2 + σ 2 = 0.10 regardless of whether they are measured on the same day or on different days! 39 / 60
40 Case study: Prisme Solution for Fixed Effects Standard Effect tid Estimate Error DF t Value Pr > t Alpha Lower Upper Intercept < tid < tid Effect of time of day is measured on log-scale. Backtransformation exp(2.0137) 7.49 yields that median levels of kortisol is an estimated 7.5 times higher in the morning than in the evening. 40 / 60
41 Outline The two-level model (random effects ANOVA) SAS proc mixed Variance components in general Example: A three-level model Strategies for (mis)handling repeated measurements Ecological analyses 41 / 60
42 Comparison of four modeling strategies Quantifying overall severity of desease in the ROP-data Four strategies for estimating the population mean method estimate (s.e.) 1: ignore repeated (0.125) 2: ordinary ANOVA (0.125) 3: average scores (0.230) 4: two-level model (0.230) 1. We (wrongfully) assume independence of all measurements 2. We estimate the mean score of these particular eyes by ordinary one-way ANOVA 3. We reduce the data to a sample of averages for the individual eyes and use one-sample formulae. 4. We estimate the population mean from the randomly sampled eyes using the two-level model. 42 / 60
43 Comments on the strategies: 1. Ignoring the repeated measurements is wrong! leads to systematic underestimation of the standard error. 2. In the ordinary one-way ANOVA the grand mean has a different meaning. leads to systematic underestimation of the standard error. 3. Taking averages is often OK. At least if the number of replicates is the same for all individuals. But we loose information on within subject variation. The expert-effect cannot be tested! 43 / 60
44 Outline The two-level model (random effects ANOVA) SAS proc mixed Variance components in general Example: A three-level model Strategies for (mis)handling repeated measurements Ecological analyses 44 / 60
45 Ecological analyses An easy way of dealing with repeated measurements: Compute summary statistics for each cluster/individual. Perform a traditional analysis on the sample of summary statistics rightfully assuming that these are independent. Summary statistics could be: Sample mean or standard deviation. AUC (area under the curve). Intercept and slope of regression line. Beware of: Loss of relevant information. Inhomogeneous variances in case of unbalanced designs. 45 / 60
46 Ecological vs two-level analysis Blood pressure and social inequity: women in 17 regions of Malmø. Covariates: Individual (level 1): low educational achievement (x) (less than 9 years of school) age group Regional (level 2): rate of people with low educational achievement (z) from the Skåne Council Statistics Office 46 / 60
47 Diastolic blood pressure and area of residence... From: 47 / 60 J. Merlo etal, J. Epidemiol Community Health 55, 2001
48 Ecological analysis Average blood pressure in region vs rate of people with low educational achievement. Size of circle indicates size of investigation. Estimate of regression coefficient: with SE Seems an important explanatory variable?!? 48 / 60
49 Results of the two-level analysis Estimates are: variances within and between regions, explained variation, and 49 / 60 intra-region correlation
50 Estimates from variance component model Covariates Individual variation Rate of variation in low education between low education between model individuals regions x σw 2 z ωb 2 none age x, age (0.170) z, age (1.345) x, z, age (0.167) (1.250) / 60
51 Ecological analysis vs the two-level model Region as a random effect could only account for 0.36% of the variation in blood pressures. Thus, regional variables such as rate of low-income will have very little impact on individual blood presures! The ecological analysis sums up the individual and the regional effects, but is not able to distinguish between the two. It overestimates the level 2 effect. It cannot be interpreted as a level 1 effect. 51 / 60
52 Individual and regional blood presure 52 / 60
53 Covariate effects on level 1 and 2 can be quite different Example: Reading ability, as a function of age and cohort: / 60
54 y Simulated data: Random effect of individual p Level Variation standard deviation (SD) 1 within individuals ˆσ W = between individuals ˆω B = / 60
55 y level 1 (individual) covariate, e.g. time/age x Level Variation SD regression coefficient 1 within individuals ˆσ W = 0.41 β x = 1.028(0.046) 2 between individuals ˆω B = / 60
56 y Add a level 2 covariate, e.g. baseline age x Level Variation SD regression coefficient 1 within individuals ˆσ W = 0.41 ˆβx = 1.033(0.046) 2 between individuals ˆω B = 1.14 ˆβz = 1.316(0.206) 56 / 60
57 Comparison of estimates within individual between individual Model ˆβx sd, ˆσ W ˆβz sd, ˆω B x (0.046) z (0.201) 1.03 x, z (0.046) (0.206) / 60
58 Effects of model misspecification Misspecification missing random effect missing z missing x Result - type 2 error for x (unpaired) - type 1 error for z (too many df s, wrong variation) - estimate of ωb 2 too big - estimate of σw 2 perhaps too big (in unbalanced designs) - estimate of ωb 2 too big or too small - estimate of σw 2 too big 58 / 60
59 Example: suicide and religion Ecological analysis of regions: Percent of suicides increases with percent of protestants. Are protestants more likely to commit suicide? Two-level model: level unit variation covariates 1 individuals within region, σ 2 W religion, x 2 regions between regions, ω 2 B % protestants, z Finding: Interaction between individual effect (x) and region covariate (z) 59 / 60
60 Example: suicide and religion More suicides among catholics in regions with many protestants. 60 / 60
Variance component models
Faculty of Health Sciences Variance component models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen Topics for
More informationAnalysis of variance and regression. December 4, 2007
Analysis of variance and regression December 4, 2007 Variance component models Variance components One-way anova with random variation estimation interpretations Two-way anova with random variation Crossed
More informationVariance components and LMMs
Faculty of Health Sciences Topics for today Variance components and LMMs Analysis of repeated measurements, 4th December 04 Leftover from 8/: Rest of random regression example. New concepts for today:
More informationVariance components and LMMs
Faculty of Health Sciences Variance components and LMMs Analysis of repeated measurements, 4th December 2014 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen
More informationMulti-factor analysis of variance
Faculty of Health Sciences Outline Multi-factor analysis of variance Basic statistics for experimental researchers 2015 Two-way ANOVA and interaction Mathed samples ANOVA Random vs systematic variation
More informationLinear mixed models. Faculty of Health Sciences. Analysis of repeated measurements, 10th March Julie Lyng Forman & Lene Theil Skovgaard
Faculty of Health Sciences Linear mixed models Analysis of repeated measurements, 10th March 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 80 Program
More informationLinear mixed models. Program. What are repeated measurements? Outline. Faculty of Health Sciences. Analysis of repeated measurements, 10th March 2015
university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s university of copenhagen d e pa rt m e n t o f b i o s tat i s t i c s Program Faculty of Health Sciences Topics: Linear mixed models
More informationVarians- og regressionsanalyse
Faculty of Health Sciences Varians- og regressionsanalyse Variance component models Lene Theil Skovgaard Department of Biostatistics Variance component models Definitions and motivation One-way anova with
More informationCorrelated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data
Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of
More informationCorrelated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models
Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 29, 2016 One-way anova with random variation The rabbit example Hierarchical
More informationCorrelated data. Variance component models. Example: Evaluate vaccine. Traditional assumption so far. Faculty of Health Sciences
Faculty of Health Sciences Variance component models Definitions and motivation Correlated data Variance component models, I Lene Theil Skovgaard November 29, 2013 One-way anova with random variation The
More informationAnalysis of variance and regression. May 13, 2008
Analysis of variance and regression May 13, 2008 Repeated measurements over time Presentation of data Traditional ways of analysis Variance component model (the dogs revisited) Random regression Baseline
More informationModels for longitudinal data
Faculty of Health Sciences Contents Models for longitudinal data Analysis of repeated measurements, NFA 016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen
More informationFaculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.
Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 27, 2018 1 / 84 Overview One-way anova with random variation The rabbit example Hierarchical
More informationCorrelated data. Overview. Example: Swelling due to vaccine. Variance component models. Faculty of Health Sciences. Variance component models
Faculty of Health Sciences Overview Correlated data Variance component models One-way anova with random variation The rabbit example Hierarchical models with several levels Random regression Lene Theil
More informationIntroduction to SAS proc mixed
Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen Outline Data in wide and long format
More informationIntroduction to SAS proc mixed
Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen 2 / 28 Preparing data for analysis The
More informationMulti-factor analysis of variance
Faculty of Health Sciences Outline Multi-factor analysis of variance Basic statistics for experimental researchers 2016 Two-way ANOVA and interaction Matched samples ANOVA Random vs systematic variation
More informationRandom Intercept Models
Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept
More informationCorrelated data. Longitudinal data. Typical set-up for repeated measurements. Examples from literature, I. Faculty of Health Sciences
Faculty of Health Sciences Longitudinal data Correlated data Longitudinal measurements Outline Designs Models for the mean Covariance patterns Lene Theil Skovgaard November 27, 2015 Random regression Baseline
More informationSAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1
CLDP 944 Example 3a page 1 From Between-Person to Within-Person Models for Longitudinal Data The models for this example come from Hoffman (2015) chapter 3 example 3a. We will be examining the extent to
More informationSubject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study
Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study 1.4 0.0-6 7 8 9 10 11 12 13 14 15 16 17 18 19 age Model 1: A simple broken stick model with knot at 14 fit with
More informationModels for binary data
Faculty of Health Sciences Models for binary data Analysis of repeated measurements 2015 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen 1 / 63 Program for
More informationFaculty of Health Sciences. Correlated data. Variance component models. Lene Theil Skovgaard & Julie Lyng Forman.
Faculty of Health Sciences Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 1 / 96 Overview One-way anova with random variation The rabbit example Hierarchical
More informationCorrelated data. Overview. Variance component models. Terminology for correlated measurements. Faculty of Health Sciences. Variance component models
Faculty of Health Sciences Overview Correlated data Variance component models Lene Theil Skovgaard & Julie Lyng Forman November 28, 2017 One-way anova with random variation The rabbit example Hierarchical
More informationSAS Syntax and Output for Data Manipulation:
CLP 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (2015) chapter 5. We will be examining the extent
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationMIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010
MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf
More informationStatistics for exp. medical researchers Regression and Correlation
Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence
More informationAnswer to exercise: Blood pressure lowering drugs
Answer to exercise: Blood pressure lowering drugs The data set bloodpressure.txt contains data from a cross-over trial, involving three different formulations of a drug for lowering of blood pressure:
More informationmultilevel modeling: concepts, applications and interpretations
multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models
More informationLecture 4. Random Effects in Completely Randomized Design
Lecture 4. Random Effects in Completely Randomized Design Montgomery: 3.9, 13.1 and 13.7 1 Lecture 4 Page 1 Random Effects vs Fixed Effects Consider factor with numerous possible levels Want to draw inference
More informationA (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data
A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models
More informationRandom Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars)
STAT:5201 Applied Statistic II Random Coefficient Model (a.k.a. multilevel model) (Adapted from UCLA Statistical Computing Seminars) School math achievement scores The data file consists of 7185 students
More informationAn Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012
An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person
More informationTopic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects
Topic 5 - One-Way Random Effects Models One-way Random effects Outline Model Variance component estimation - Fall 013 Confidence intervals Topic 5 Random Effects vs Fixed Effects Consider factor with numerous
More informationIntroduction to Within-Person Analysis and RM ANOVA
Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides
More informationIntroduction and Background to Multilevel Analysis
Introduction and Background to Multilevel Analysis Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Background and
More informationReview of CLDP 944: Multilevel Models for Longitudinal Data
Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)
36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)
More informationIntroduction to Random Effects of Time and Model Estimation
Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =
More informationUNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator
UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages
More informationThis is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
EXST3201 Chapter 13c Geaghan Fall 2005: Page 1 Linear Models Y ij = µ + βi + τ j + βτij + εijk This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationA Re-Introduction to General Linear Models (GLM)
A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 12 1 / 34 Correlated data multivariate observations clustered data repeated measurement
More informationHierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!
Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter
More informationAnalysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking
Analysis of variance and regression Contents Comparison of several groups One-way ANOVA April 7, 008 Two-way ANOVA Interaction Model checking ANOVA, April 008 Comparison of or more groups Julie Lyng Forman,
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA
More informationCorrelated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models
Faculty of Health Sciences Overview Correlated data More on variance component models Variance component models, II Cross-over studies Non-normal data Comparing measurement devices Lene Theil Skovgaard
More informationModeling Effect Modification and Higher-Order Interactions: Novel Approach for Repeated Measures Design using the LSMESTIMATE Statement in SAS 9.
Paper 400-015 Modeling Effect Modification and Higher-Order Interactions: Novel Approach for Repeated Measures Design using the LSMESTIMATE Statement in SAS 9.4 Pronabesh DasMahapatra, MD, MPH, PatientsLikeMe
More informationCorrelated data. Non-normal outcomes. Reminder on binary data. Non-normal data. Faculty of Health Sciences. Non-normal outcomes
Faculty of Health Sciences Non-normal outcomes Correlated data Non-normal outcomes Lene Theil Skovgaard December 5, 2014 Generalized linear models Generalized linear mixed models Population average models
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationAnalysis of variance. April 16, Contents Comparison of several groups
Contents Comparison of several groups Analysis of variance April 16, 2009 One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More informationIntroducing Generalized Linear Models: Logistic Regression
Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and
More informationAnalysis of variance. April 16, 2009
Analysis of variance April 16, 2009 Contents Comparison of several groups One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More informationAnalysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED.
Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Maribeth Johnson, Medical College of Georgia, Augusta, GA ABSTRACT Longitudinal data refers to datasets with multiple measurements
More informationStatistical Methods III Statistics 212. Problem Set 2 - Answer Key
Statistical Methods III Statistics 212 Problem Set 2 - Answer Key 1. (Analysis to be turned in and discussed on Tuesday, April 24th) The data for this problem are taken from long-term followup of 1423
More informationDescribing Change over Time: Adding Linear Trends
Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section
More information36-402/608 Homework #10 Solutions 4/1
36-402/608 Homework #10 Solutions 4/1 1. Fixing Breakout 17 (60 points) You must use SAS for this problem! Modify the code in wallaby.sas to load the wallaby data and to create a new outcome in the form
More informationFaculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics
Faculty of Health Sciences Regression models Counts, Poisson regression, 27-5-2013 Lene Theil Skovgaard Dept. of Biostatistics 1 / 36 Count outcome PKA & LTS, Sect. 7.2 Poisson regression The Binomial
More informationdf=degrees of freedom = n - 1
One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:
More informationRon Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)
Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October
More informationBIOSTATISTICAL METHODS
BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Cross-over Designs #: DESIGNING CLINICAL RESEARCH The subtraction of measurements from the same subject will mostly cancel or minimize effects
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More information6. Multiple regression - PROC GLM
Use of SAS - November 2016 6. Multiple regression - PROC GLM Karl Bang Christensen Department of Biostatistics, University of Copenhagen. http://biostat.ku.dk/~kach/sas2016/ kach@biostat.ku.dk, tel: 35327491
More informationUNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Applied Statistics Friday, January 15, 2016
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Applied Statistics Friday, January 15, 2016 Work all problems. 60 points are needed to pass at the Masters Level and 75 to pass at the
More informationAnalysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA
Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples
More information9. Linear Regression and Correlation
9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,
More informationover Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */
CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason
More informationThursday Morning. Growth Modelling in Mplus. Using a set of repeated continuous measures of bodyweight
Thursday Morning Growth Modelling in Mplus Using a set of repeated continuous measures of bodyweight 1 Growth modelling Continuous Data Mplus model syntax refresher ALSPAC Confirmatory Factor Analysis
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationHierarchical Linear Models (HLM) Using R Package nlme. Interpretation. 2 = ( x 2) u 0j. e ij
Hierarchical Linear Models (HLM) Using R Package nlme Interpretation I. The Null Model Level 1 (student level) model is mathach ij = β 0j + e ij Level 2 (school level) model is β 0j = γ 00 + u 0j Combined
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationSAS Code for Data Manipulation: SPSS Code for Data Manipulation: STATA Code for Data Manipulation: Psyc 945 Example 1 page 1
Psyc 945 Example page Example : Unconditional Models for Change in Number Match 3 Response Time (complete data, syntax, and output available for SAS, SPSS, and STATA electronically) These data come from
More informationSupplemental Materials. In the main text, we recommend graphing physiological values for individual dyad
1 Supplemental Materials Graphing Values for Individual Dyad Members over Time In the main text, we recommend graphing physiological values for individual dyad members over time to aid in the decision
More informationLab 11. Multilevel Models. Description of Data
Lab 11 Multilevel Models Henian Chen, M.D., Ph.D. Description of Data MULTILEVEL.TXT is clustered data for 386 women distributed across 40 groups. ID: 386 women, id from 1 to 386, individual level (level
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More informationCovariance Structure Approach to Within-Cases
Covariance Structure Approach to Within-Cases Remember how the data file grapefruit1.data looks: Store sales1 sales2 sales3 1 62.1 61.3 60.8 2 58.2 57.9 55.1 3 51.6 49.2 46.2 4 53.7 51.5 48.3 5 61.4 58.7
More informationMore about linear mixed models
Faculty of Health Sciences Contents More about linear mixed models Analysis of repeated measurements, NFA 2016 Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics, University of Copenhagen
More informationFaculty of Health Sciences. Correlated data. Count variables. Lene Theil Skovgaard & Julie Lyng Forman. December 6, 2016
Faculty of Health Sciences Correlated data Count variables Lene Theil Skovgaard & Julie Lyng Forman December 6, 2016 1 / 76 Modeling count outcomes Outline The Poisson distribution for counts Poisson models,
More informationStat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010
1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of
More informationSTA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3
STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae
More informationLecture 10: Experiments with Random Effects
Lecture 10: Experiments with Random Effects Montgomery, Chapter 13 1 Lecture 10 Page 1 Example 1 A textile company weaves a fabric on a large number of looms. It would like the looms to be homogeneous
More informationECNS 561 Multiple Regression Analysis
ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking
More informationANOVA Longitudinal Models for the Practice Effects Data: via GLM
Psyc 943 Lecture 25 page 1 ANOVA Longitudinal Models for the Practice Effects Data: via GLM Model 1. Saturated Means Model for Session, E-only Variances Model (BP) Variances Model: NO correlation, EQUAL
More informationLecture 1 Introduction to Multi-level Models
Lecture 1 Introduction to Multi-level Models Course Website: http://www.biostat.jhsph.edu/~ejohnson/multilevel.htm All lecture materials extracted and further developed from the Multilevel Model course
More informationIntroduction to Crossover Trials
Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain
More informationLinear regression and correlation
Faculty of Health Sciences Linear regression and correlation Statistics for experimental medical researchers 2018 Julie Forman, Christian Pipper & Claus Ekstrøm Department of Biostatistics, University
More informationFaculty of Health Sciences. Correlated data. More about LMMs. Lene Theil Skovgaard. December 4, / 104
Faculty of Health Sciences Correlated data More about LMMs Lene Theil Skovgaard December 4, 2015 1 / 104 Further topics Model check and diagnostics Cross-over studies Paired T-tests with missing values
More informationTime Invariant Predictors in Longitudinal Models
Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section
More informationWU Weiterbildung. Linear Mixed Models
Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes
More informationRegression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.
TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationLongitudinal Modeling with Logistic Regression
Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to
More informationLecture 10 Multiple Linear Regression
Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable
More informationInference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58
Inference ME104: Linear Regression Analysis Kenneth Benoit August 15, 2012 August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Stata output resvisited. reg votes1st spend_total incumb minister
More information