Random effects & Repeated measurements

Size: px

Start display at page:

Download "Random effects & Repeated measurements"

Mark Murphy
5 years ago
Views:

1 Random effects & Repeated measurements Bo Markussen Department of Mathematical Sciences December 20, 2017 LAST (MATH) AS / SmB Day 5 1 / 55

2 Lecture outline Hypothetical data example: Comparison of two treatments. Experimental design and random effects. Data example 1: Color of pork meat. Random effects in models with continuous response. Data example 2: Germination of Orobanche seeds. Overdispersion in logistic regression. Data example 3: Growth of Baobab trees. Repeated measurements. General concepts about analysis of repeated measurements. We will also briefly talk about analysis using summary measures. LAST (MATH) AS / SmB Day 5 2 / 55

3 Hypothetical data example: 1-way ANOVA Comparison of two treatments. Five animals per treatment treat Observations Mean > anova(lm(y~treat,data=rep1)) Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) treat Residuals [I] treat e The F-test compares variation between treatments to variation within treatments. Here treatment is non-significant (p=0.474). LAST (MATH) AS / SmB Day 5 3 / 55

4 Now measure each animal twice Almost the same observations repeated! treat Observations Mean > anova(lm(y~treat+animal,data=rbind(rep1,rep2))) Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) treat e-09 *** animal e-12 *** Residuals [I] e animal e 09 2 treat How can double measurements on the same animals suddenly change significance from (NS) to (***)?? LAST (MATH) AS / SmB Day 5 4 / 55

5 What went wrong? In the first analysis residual variation (mainly) represents variation between animals. In the second analysis residual variation represents measurement error. Treatments are compared by comparing different animals. Therefore the relevant variation for comparing treatments is variation between animals. Conclusion: First analysis is OK. Second analysis shows that the two groups of animals are different, but not necessarily that they are more different than animals are in general. LAST (MATH) AS / SmB Day 5 5 / 55

6 Factors with random effect The factor animal is not reproducible. The specific animals are of no interest beyond the present experiment, representatives of a population. These statements characterize factors which we want to include as random effects in the model. Estimates describe properties of the population, typically the standard deviation on the response. Typical factors with random effects: field, litter, replication, day, herd, and block factors in general. But possibly(!) also: strain, species, and other factors of no particular interest in a given experiment. LAST (MATH) AS / SmB Day 5 6 / 55

7 Factors with fixed (also called systematic ) effect The factor treatment is reproducible. The specific treatments are of interest beyond the present experiment that is why we made it! or other reproducible effects. they only represent themselves. These statements are typical for factors which we want to include as fixed effects in the model. Estimates describe properties of the individual treatments, typically the mean of the response. LAST (MATH) AS / SmB Day 5 7 / 55

8 Fixed effects vs. Random effects Fixed effects: Estimated mean for each level of the factor. Use many degrees of freedom (which is a bad thing). Random effects: Estimated standard deviation between the levels (using the REML method to avoid bias). Use only 1 degree of freedom (which is a good thing). Requires iterative search for parameter estimates (which is a bad thing, although it really doesn t pose an actual problem). Rough rule of thumb A block factor with 4 or fewer levels is often used with fixed effect (it is dubious to estimate variation from few points, and we don t lose many degrees of freedom anyway). LAST (MATH) AS / SmB Day 5 8 / 55

9 Digression: Interpretation of ANOVA-tables treat Observations Mean > anova(lm(y~treat+animal,data=rbind(rep1,rep2))) Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) treat e-09 *** animal e-12 *** Residuals The total variation in the response is decomposed using the right hand side of from left to right(!): SS total = N ( Yi Ȳ ) 2 = SStreat + SS animal treat + SS error i=1 MSS = SS/df: This was shown as blue circles in the Design Diagram. LAST (MATH) AS / SmB Day 5 9 / 55

10 Three models for the dataset on slide 4 (and 9) (A) Y i = α(treatment i ) + error i α(1), α(2) are constants (fixed effects). Wrong: Effect of animal is ignored. (B) Y i = α(treatment i ) + β(animal i ) + error i α(1), α(2) and β(1),..., β(10) are constants (fixed effects). Wrong: Meaningless model for testing treatment effect. (C) Y i = α(treatment i ) + B(animal i ) + error i α(1), α(2) constants (fixed effects). B(1),..., B(10) independent N (0, σ 2 B ) distributed random variables. Recommended: Model with random effect of animal. LAST (MATH) AS / SmB Day 5 10 / 55

11 Design diagrams for the two last models Random factors are designated by square brackets (B) Y i = α(treatment i ) + β(animal i ) + error i has diagram treatment 2 1 animal 10 8 [I] Remark: treatment can not be tested, since animal is nested within it. (C) Y i = α(treatment i ) + B(animal i ) + error i has diagram treat 2 1 [animal] 10 8 [I] Remark: treatment is tested against the random factor animal. Rule of thumb When testing the effect of a factor (here treatment) any factors nested within it (here animal) should usually be modelled as a factor with random effect. LAST (MATH) AS / SmB Day 5 11 / 55

12 Random effects using LabApplStat-package 20 [I] e 12 [animal] treat e plot(dd(y~treat,random=~animal,data=mydata)),"mss") LAST (MATH) AS / SmB Day 5 12 / 55

13 Random effects using R Two packages (developed by Bates et al) available: nlme and lme4 nlme also analyses repeated measurements and non-linear models. However, lme4 is maintained by the developers, allow for non-nested random effects, and may also be used for categorical responses. For models without repeated measurements I recommend the lme4-package. A methodological challenge is that random effects models for continuous responses may be estimated by two different methods. ML (maximum likelihood) is needed if you want to do likelihood-ratio tests (which we will do). REML (restricted maximum likelihood) is recommended for estimation of effects. Luckily lme4 and drop1() take care of this. So simply use the default setting, which gives REML. Model validation: Needed both for fixed and random effects. LAST (MATH) AS / SmB Day 5 13 / 55

14 Questions? And then a break! After the break we see how to use a random effect in a so-called split-plot design. This example also exemplifies Where is the effects?, and show how to estimate random effects models in R. LAST (MATH) AS / SmB Day 5 14 / 55

15 Data example 1: Color of pork meat 2 breeds { old: 10 pigs new: 10 pigs After slaughter: 6 pork chops from each pig. Storage in light or darkness for 1, 4 or 6 days. Response=redness of meat (continuous measurement) Storage 1 days 4 days 6 days Dark chop 1 chop 2 chop 3 Light chop 4 chop 5 chop 6 In total 2*10*6=120 pork chops. LAST (MATH) AS / SmB Day 5 15 / 55

16 Data example 1: Table of Variables Variable Type Range Usage breed nominal old, new fixed effect pig nominal 1,..., 20 random effect storage nominal dark, light fixed effect time ordinal 1 < 4 < 6 (days) fixed effect redness continuous [1.606; ] response pig is nested within breed (breed is the nest and pigs are the eggs ). breed is a between-pig factor. storage and time are within-pig factors. Use full factorial design for the fixed effects. LAST (MATH) AS / SmB Day 5 16 / 55

17 Example is a typical split-plot experiment Two types of experimental units: Pigs ( whole-plots ) with the whole-plot factor breed. Chops ( sub-plots ) with the sub-plot factors storage and time. Design diagram: [pig] [I] breed:storage 4 1 breed 2 1 breed:storage:time 12 2 breed:time 6 2 storage storage:time 6 2 time 3 2 LAST (MATH) AS / SmB Day 5 17 / 55

18 R code: Validation of random effects model redness i = α(breed i, storage i, time i ) + A(pig i ) + ɛ i, where A(j) N (0, σ 2 A) and ɛ i N (0, σ 2 ) # Read dataset: Recode some variables as factors redness <- read.delim("redness.txt") redness$pig <- factor(redness$pig) redness$time <- factor(redness$time) # Fit random effects model m0 <- lmer(redness~breed*storage*time+(1 pig),data=redness) # Residual plot plot(m0) # Normal quantile plots qqnorm(residuals(m0)) qqnorm(ranef(m0)$pig[,1]) LAST (MATH) AS / SmB Day 5 18 / 55

19 Two of the three validation plots Normal quantile plot for residuals not shown 6 Normal Q Q Plot resid(., type = "pearson") Sample Quantiles fitted(.) Theoretical Quantiles Residual plot (mean zero?, variance homogeneity?): Standardised residuals vs. predicted values (including random effects). Observation no. 44 appears to be an outlier. Normal quantile plot for predicted random effects. Note that we only have 20 points, one for each pig. LAST (MATH) AS / SmB Day 5 19 / 55

20 R code: Backward model reduction In the example using AIC, but we have also asked for p-values # Refit model without obs. no. 44 m1 <- lmer(y~breed*storage*time+(1 pig),data=redness[-44,]) # Model reduction. # Here with dense code using update() drop1(m1,test="chisq") drop1(m2 <- update(m1,.~.-breed:storage:time),test="chisq") drop1(m3 <- update(m2,.~.-breed:time),test="chisq") drop1(m4 <- update(m3,.~.-breed:storage),test="chisq") Syntax for lmer(): Random effects specified in terms (1 ). Other things done as for lm(). Technicality: Tests done as chi-squared test on likelihood ratio. Automated backward model selection using p-values via step() function in lmertest-package. LAST (MATH) AS / SmB Day 5 20 / 55

21 Results for the final model redness i = α(breed i ) + β(storage i, time i ) + A(pig i ) + ɛ i, where A(j) N (0, σ 2 A) and ɛ i N (0, σ 2 ) Likelihood ratio test in model m4: p(breed, df=1)=0.0141, p(storage:time,df=2)= The old breed has more redness that the new breed: lsmean(old)-lsmean(new)= (95% CI: ; ) Estimates (with FWER confidence intervals) of lsmeans for combination of storage (D, L) and time (1, 4, 6) are found and displayed (see next slide) using the R code: # FWER confidence intervals for time:storage # To do this we call the multcomp-package from lsmeans. confint(as.glht(lsmeans(m4,~time:storage))) plot(confint(as.glht(lsmeans(m4,~time:storage)))) LAST (MATH) AS / SmB Day 5 21 / 55

22 LS-means of storage:time Meat fade over time, more so under storage in light 95% family wise confidence level D:1 ( ) D:4 ( ) D:6 ( ) L:1 ( ) L:4 ( ) L:6 ( ) Linear Function LAST (MATH) AS / SmB Day 5 22 / 55

23 Quantification of sources of variation In some analyses this quantification is the main objective > summary(m4) Linear mixed model fit by REML Formula: y ~ breed + storage*time + (1 pig)... Random effects: Groups Name Variance Std.Dev. pig (Intercept) Residual Number of obs: 119, groups: pig, Total variance = = Of this 9% is due to variation between pigs (random effect of pig). And 91% comes from other sources (residual). LAST (MATH) AS / SmB Day 5 23 / 55

24 Questions? And then a break! After the break we discuss how to estimate categorical regressions with random effects. LAST (MATH) AS / SmB Day 5 24 / 55

Data example 2: Binary data Number of Orobanche seeds germinating (yes/no) in extracts of bean and cucumber roots. In total 21 batches of two different orobanche varieties as shown below. O. 75 O.

25 Data example 2: Binary data Number of Orobanche seeds germinating (yes/no) in extracts of bean and cucumber roots. In total 21 batches of two different orobanche varieties as shown below. O. 75 O. 73 Bean Cucumber Bean Cucumber yes no yes no yes no yes no Orobanche purpurea (Yarrow Broomrape), picture from Wikipedia Thus, the first batch consisted of 39 seeds of variety O.75 grown in bean roots, out of which 10 seeds germinated. LAST (MATH) AS / SmB Day 5 25 / 55

26 Data example 2: Germination of Orobanche seeds Variable Type Range Usage variety nominal O.75, O.73 fixed effect root nominal bean, cucumber fixed effect batch nominal 1,..., 21 random effect germination binary yes, no response Interest on the effect of variety and root on germination of the seeds. The particular batches are not of interest, but representatives of a population. variety 2 1 [I ] [batch] variety:root root 2 1 LAST (MATH) AS / SmB Day 5 26 / 55

27 Logistic regression with random effects Logistic regression models probability of germination via log(odds for germination of i th seed) = α(variety i, root i ) But what if the batches are different? A solution is to include batch as a random effect. Thus, for independent B(1),..., B(21) N (0, σb 2 ) we have log(odds for germination of i th seed) = α(variety i, root i ) + B(batch i ) The additional variability induced by the random factor is called overdispersion. Alternatives to random effects models (R guide Section ): Correct for overdispersion by rescaling the standard errors: done using quasibinomial-family in glm(). Estimate empirical correlation using the gee-approach. LAST (MATH) AS / SmB Day 5 27 / 55

28 Data example 2 continued Comparison with other approaches to overdispersion Test of Logistic regression model variety:root Overdispersion GLMM (what we did) p= modelled GEE p= scale= quasibinomial p= scale= Plain p= not modelled In this example plain logistic regression is questionable, and p= is not trustworthy. In practice all the 3 other methods can be used. Although they disagree on the significance in this situation. This is not a paradox, but simply different tests for the same hypothesis. If valid, then I recommend the GLMM. But this is a matter of taste. To see if you have overdispersion check whether the scale parameter in the quasibinomial analysis is (significantly) larger than 1. Alternatively, make hypothesis test on the random effect in the GLMM. LAST (MATH) AS / SmB Day 5 28 / 55

29 Summary of random effects A factor should be used as a random effect if it is... non reproducible, of no interest beyond the present experiment, representatives of a population. Often block factors are used with random effects. Rule of thumb: To test a fixed effect any factor nested within it should usually be modelled as a random effect. In these lectures we only discussed models with random intercepts. However, we may also have models with random slopes of some continuous covariate. E.g., if redness$time is continuous: lmer(y~breed*storage+(1+time pig),data=redness) Random effects also possible for categorical regression models: Related to overdispersion. rescaling (e.g. quasibinomial) and GEE discussed in Chapter GLMM (generalized linear mixed effects models) used in these lectures. LAST (MATH) AS / SmB Day 5 29 / 55

30 Questions? And then a break! After the break we discuss models for repeated measurements. LAST (MATH) AS / SmB Day 5 30 / 55

31 Repeated measurements models: Why? Data example 1: Color of pork meat. 10 pigs from both old and new breed: 7 chops from each pig. Time Storage 0 days 1 days 4 days 6 days Dark chop 1: data chop 2 chop 3 chop 4 Light not used chop 5 chop 6 chop 7 Random effect model for chops 2 to 7 (in total 2*10*6=120 observations): redness i = α(storage i, time i, breed i ) + A(pig i ) }{{} N (0,σ 2 A ) + But what if the experimental design has been like this? Time Storage 0 days 1 days 4 days 6 days Dark chop 1 chop 1 chop 1 chop 1 Light chop 2 chop 2 chop 2 chop 2 ɛ i }{{} N (0,σ 2 ) LAST (MATH) AS / SmB Day 5 31 / 55

32 example continued... Alternative design has 4 measurements for each pork chop Time Storage 0 days 1 days 4 days 6 days Dark chop 1 chop 1 chop 1 chop 1 Light chop 2 chop 2 chop 2 chop 2 Random effect model for the alternative design: redness i = α(storage i, time i, breed i ) + A(pig i ) + B(pig }{{} i, chop i ) + ɛ }{{}}{{} i N (0,σA 2 ) N (0,σB 2 ) Here the 4 measurements on the same pork chop (from the same pig) share an additional random effect B(pig,chop). N (0,σ 2 ) But perhaps measurements taken close in time are more correlated than measurements taken fare apart in time! How to model that? LAST (MATH) AS / SmB Day 5 32 / 55

33 General remarks on repeated measurements Repeated measurements originate from study designs where the experimental units have been measured several times (typically at different time points or at different spatial positions): Economic necessity, e.g. when experimental units are expensive. Experimental units may serve as their own controls. Response profile (i.e. response over time) is of scientific interest. Repeated measurements are analysed either to gain power or to investigate the response profile. A summary measure is a single number capturing the important feature of the response profile. Examples: AUC (area under the curve), mean, maximum, minimum, range between max and min, time under a pre-specified level, slope, curvature, halving time, slope after the minimum,... Summary measure preferably suggested from the scientific study, not from the statistical analysis. Summary measures reduce the repeated measurements to a single observation = statistical analysis without repeated measurements. LAST (MATH) AS / SmB Day 5 33 / 55

34 Which method to use? Summary measures vs. Random effects vs. Repeated measurements Summary measures is always an option. Unless you have particular interest in the response profile I recommend analysis of summary measures (if it has sufficient power). With few repeated measurements per subject, say 4 or less, it does not make sense to estimate the serial correlation structure. Simply use a random effect model. With many repeated measurements per subject, say 5 or more, you have enough information to estimate the serial correlation structure. This is necessary to have trustworthy p-values and confidence intervals. LAST (MATH) AS / SmB Day 5 34 / 55

35 Case study: Growth of Baobab trees under water stress Data kindly provided by Henri-Noël Bouda Baobab seeds from 3 countries and 7 provenances sown in the beginning of Plants grown under 3 water regimes (100%, 75% and 50% field capacity). Diameter and height of plants measured monthly. To measure root weight etc. some plants were harvested in August 2009, some plants in February Purpose of experiment: How does water drought effect growth of trees. Is there an interaction with country and/or provenance? LAST (MATH) AS / SmB Day 5 35 / 55

Gives overview of the data and provides an impression of

36 Individual response profiles (subject profiles) For 362 different baobab trees A good plot to make. Gives overview of the data and provides an impression of the typical time-response relationship. LAST (MATH) AS / SmB Day 5 36 / 55

37 Average response profiles A good plot to make. Gives overview of the treatment effects. Response profiles averaged within the 9 combinations of treatments and countries LAST (MATH) AS / SmB Day 5 37 / 55

38 Data organization in Excel sheet Wide form (also called horizontal organization ) of responses: diameter, height Variable Levels Description Country 3 (Burkina,Mali,Tanzanie) Country Provenance 7 (Kolangal,...,Samé) 3 provenances from Burkina, 3 from Mali, 1 from Tanzanie Plant 362 (BKol-04,...,TNku-76) Plant id Block 3 (1,2,3) Field blocks Treatment 3 (1,2,3) Water regime HarvestDate 3 (aug-09,feb-10,missing) Day of harvest Dia0209 continuous (or missing) Diameter, February Dia0110 continuous (or missing) Diameter, January 2010 Hei0209 continuous (or missing) Height, February Hei0110 continuous (or missing) Height, January 2010 LAST (MATH) AS / SmB Day 5 38 / 55

39 Long form: diameter, height for each month The long form is also referred to as the vertical organization Variable Levels Description Country 3 (Burkina,Mali,Tanzanie) Country Provenance 7 (Kolangal,...,Samé) 3 provenances from Burkina, 3 from Mali, 1 from Tanzanie Plant 362 (BKol-04,...,TNku-76) Plant id Block 3 (1,2,3) Field blocks Treatment 3 (1,2,3) Water regime HarvestDate 3 (aug-09,feb-10,missing) Day of harvest month 12 (2,3,...,13) February 09 January 10 diameter continuous (or missing) Diameter height continuous (or missing) Height LAST (MATH) AS / SmB Day 5 39 / 55

40 Long form as needed for the statistical analysis Obs Country Provenance Plant Block Treat HarvestDate month diameter height 1 Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG NA NA 9 Burkina Kolangal BKol AUG NA NA 10 Burkina Kolangal BKol AUG NA NA 11 Burkina Kolangal BKol AUG NA NA 12 Burkina Kolangal BKol AUG NA NA 13 Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG Burkina Kolangal BKol AUG LAST (MATH) AS / SmB Day 5 40 / 55

41 Table of variables Variable Type Range Usage Country Nominal Burkina, Mali, Tanzania fixed effect Provenance Nominal Kolangal,...,Samé fixed effect Remark: nested in Country Plant Nominal BKol-04,...,TNku-76 random effect subject id Block Nominal 1,2,3 fixed effect Treatment Ordinal 1 < 2 < 3 fixed effect month Nominal 2,..., 12,13 fixed effect correlation effect diameter continuous [0; 23.7] response height continuous [0; 142] response The two responses (diameter, height) analysed separately. R code via lme() in nlme-package: fixed effects are specified in model formula. random effects are specified in random option. correlations are specified in corr option. LAST (MATH) AS / SmB Day 5 41 / 55

42 Diagram of fixed and random factors [plant] treat*block 9 4 block 3 2 [I] treat*provenance treat treat*month*provenance treat*month provenance 7 6 month*provenance month In the model reduction provenance is nested within country. Are the residuals ɛ i, i = 1,..., 3412, independent? LAST (MATH) AS / SmB Day 5 42 / 55

43 Repeated measurements model diameter i = α(treat i, month i, provenance i, block i ) + A(plant i ) + B(plant }{{} i, month i ) + ɛ }{{}}{{} i N (0,σA 2 ) N (0,σB 2 ) N (0,σ 2 ) }{{} [I]-term in the design diagram Random effect A: Some plants are bigger than others. Correlated effect B: Correlated within plants ( subject id). Correlation typically decreases with increasing time distance. Uncorrelated between plants. Possible interpretation is variation between time position of growth period. Error term ɛ: Possible interpretation is measurement error. LAST (MATH) AS / SmB Day 5 43 / 55

44 Three examples of correlation structures for B diameter i = α(treat i, month i, provenance i, block i ) + A(plant i ) + B(plant i, month i ) + ɛ i (A) The model without the serial correlated effect B. In this case we have a random effect model. This model is also referred to as the random intercept model or the compound symmetry model. ( ) ( (B) Var B(plant, month i ), B(plant, month j ) = σb 2 exp Correlation has exponential decrease. ) month i month j d ( ) ( (C) Var B(plant, month i ), B(plant, month j ) = σb 2 exp month i month j 2 ρ 2 ) Correlation has Gaussian decrease. When a random effect A and an error term ɛ are present, this model is sometimes referred to as the Diggle model after Peter Diggle. LAST (MATH) AS / SmB Day 5 44 / 55

45 R code # In the data frame baobab we have the factors: # Plant, Block, Treatment, Provenance, Country long <- reshape(baobab, varying=list(c("dia0209","dia0309","dia0409"," c("hei0209","hei0309","hei0409"," v.names=c("diameter","height"), timevar="month", idvar="plant", direction="long") # Diggle model: two other models in R script mgauss <-lme(diameter~treatment*block+ Treatment*factor(month)*Provenance, random=~1 Plant, corr=corgaus(form=~month Plant,nugget=TRUE), data=long,na.action=na.omit) LAST (MATH) AS / SmB Day 5 45 / 55

46 Which repeated measurements model to use? There exists many other models than those listed on slide 44 Is the model valid? Residual plot: Non-random scatter suggests that the explanatory variables have not been used appropriately, e.g. an interaction or a quadratic term might be missing. Normal quantile plots: Residuals not on a straight line suggests that the response variable perhaps should be transformed. Semi-variogram: Compares empirical correlation structure (the dots) to the fitted theoretical correlation structure (the line). Interpretation? Random effects have a simple interpretation, which speak in favour of the compound symmetry model. The exponential decrease and the Diggle model have similar interpretations. Akaike Information Criterion (AIC): The smaller the better. For the Baobab dataset the compound symmetry model is clearly rejected by the AIC. LAST (MATH) AS / SmB Day 5 46 / 55

47 Exponential vs. Gaussian decrease: Residual plot Exponential decrease model Diggle model 4 4 Standardized residuals Standardized residuals Fitted values Fitted values plot(mexp,main="exponential decrease model") plot(mgauss,main="diggle model") LAST (MATH) AS / SmB Day 5 47 / 55

48 Exponential vs. Gaussian decrease: Normal quantile plot Note that the axes are interchanged relative to the usual qqnorm() Exponential decrease model Diggle model Quantiles of standard normal Quantiles of standard normal Standardized residuals Standardized residuals qqnorm(resid(mexp),main="exponential decrease model") qqnorm(resid(mgauss),main="diggle model") LAST (MATH) AS / SmB Day 5 48 / 55

49 Semi-variogram: γ(h) = 1 2 var( X (t + h) X (t) ) An alternative is to use Chapter 9.4 in the R guide Exponential decrease model Diggle model Semivariogram Semivariogram Distance Distance plot(variogram(mexp),ylim=c(0,0.7)) plot(variogram(mgauss),ylim=c(0,1.1)) LAST (MATH) AS / SmB Day 5 49 / 55

50 Choice of repeated measurements model Residual and normal quantile plots acceptable for all models Model Compound symmetry Exponential decrease Diggle AIC Akaike Information Criterion (AIC) clearly prefers the exponential decrease model, which we describe as: An ANOVA with random effect of plant and residual errors correlated within plants. The errors consist of an independent component and a component with exponential decreasing correlation. For the fixed effects we used the concatenation of the full factorial design of (tretment,month,provenance) and the full factorial design of (treatment,block). LAST (MATH) AS / SmB Day 5 50 / 55

51 Overview of steps in a repeated measurements analysis List and classify the factors and covariates in the design. Done (see slide 41). Make plots of individual, and perhaps averaged, response profiles. Done (see slides 36 and 37). Choose and validate a correlation structure. Done (see slides 47 to 50). Test for reduction of fixed effects: interactions, main effects, covariates etc. (as usual for ANOVA and ANCOVA models). Remember to refit models using maximum likelihood (method="ml"). Unfortunately the drop1() function does not work for lme-objects. So this has to be done by hand (see Chapter in the R guide). Automatic model selection based on AIC may be done using stepaic() from the MASS-package. Report estimates and conclusions from the final model (as usual, e.g. using the lsmeans-package). LAST (MATH) AS / SmB Day 5 51 / 55

52 Analysis of Summary measures An alternative to the repeated measurements analysis discussed above Idea: Reduce the curve for each subject to a single value. Analyze this summary measure as usual (ANOVA, regression,... ). As summary measures we could for example use: Average response over time. Area under curve (AUC, often used in medicine). Slope of curve (rate of increase). Maximal response. Position (e.g. time) of maximal response. Halving time since maximal response. Curvature: fit α + β time + γ time 2 for each individual and use ˆγ. Note: The summary measures should be computed for each subject not on the average profiles! LAST (MATH) AS / SmB Day 5 52 / 55

53 Principles for choosing summary measures Select a measure that addresses the problem under investigation. Do not choose summary measures on the basis of visual inspection of the treatment differences this is cheating. But it is OK to plot all profiles in one graph and select typical features of the curves for further investigation. You may analyze more than one summary measure. If so, then choose some that reflect different aspects of the curves. For example: AUC and average response is NOT a good combination. AUC and rate of increase might be a good combination. But be aware of the associated multiple testing problem. LAST (MATH) AS / SmB Day 5 53 / 55

54 Analysis of summary measures: Pros and Cons Advantages: Simple analysis, which is more easily communicated. Often powerful analysis if the summary measure is chosen appropriately. Model validation more easy and transparent. Disadvantages: Each curve is reduced to a single value loss of information? Which summary measure should be choose? No investigation of the temporal structure, which might be important for the problem under investigation. LAST (MATH) AS / SmB Day 5 54 / 55

55 Todays exercises Exercise 5.1: Read the exercise text, and discuss items 1 and 2 with your neighbours. Before you continue with the statistical analysis we discuss the usage of the explanatory variables in plenum. And I ll make the factor diagram for you. In exercise 1.6 this dataset was analysed using a bunch of T-tests. Exercise 5.2: The block factor round with 3 levels should be used with random effect. This violates the rules of thumb that factors with less than 5 levels can be used with fixed effect. However, this is also a matter of taste. Moreover, you should also try a model with more than 1 random effect. Exercise 5.3: You might remember this dataset as Data Example 3 in Exercise 1.2. LAST (MATH) AS / SmB Day 5 55 / 55

Non-Gaussian Response Variables

Non-Gaussian Response Variables What is the Generalized Model Doing? The fixed effects are like the factors in a traditional analysis of variance or linear model The random effects are different A generalized