11. Linear Mixed-Effects Models. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
|
|
- Jasmin Armstrong
- 5 years ago
- Views:
Transcription
1 11. Linear Mixed-Effects Models Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
2 The Linear Mixed-Effects Model y = Xβ + Zu + e X is an n p matrix of known constants β R p is an unknown parameter vector Z is an n q matrix of known constants u is a q 1 random vector e is an n 1 vector of random errors Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
3 The Linear Mixed-Effects Model y = Xβ + Zu + e The elements of β are considered to be non-random and are called fixed effects. The elements of u are random variables and are called random effects. The elements of the error vector e are always considered to be random variables. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
4 Because the model includes both fixed and random effects (in addition to the random errors), it is called a mixed-effects model or, more simply, a mixed model. The model is called a linear mixed-effects model because (as we will soon see) E(y u) = Xβ + Zu, a linear function of fixed and random effects. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
5 We assume that E(e) = 0 Var(e) = R E(u) = 0 Var(u) = G Cov(e, u) = 0. It follows that opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
6 E(y) = E(Xβ + Zu + e) = Xβ + ZE(u) + E(e) = Xβ Var(y) = Var(Xβ + Zu + e) = Var(Zu + e) = Var(Zu) + Var(e) = ZVar(u)Z + R = ZGZ + R Σ. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
7 We usually consider the special case in which [ ] ([ ] [ ]) u 0 G 0 N, e 0 0 R = y N(Xβ, ZGZ + R). opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
8 The conditional mean and variance, given the random effects, are E(y u) = Xβ + Zu and Var(y u) = R. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
9 Example 1 Suppose an experiment was conducted to compare the height of plants grown at two soil moisture levels (labeled 1 and 2). The soil moisture levels were randomly assigned to 4 pots with 2 pots per moisture level. For each moisture level, 3 seeds were planted in one pot and 2 seeds were planted in the other. After a four-week growing period, the height of each seedling was measured. Let y ijk denote the height for soil moisture level i, pot j, seedling k. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
10 Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
11 Consider the model y ijk = µ + α i + p ij + e ijk p 11, p 12, p 21, p 22 i.i.d. N(0, σ 2 p) independent of the e ijk terms, which are assumed to be iid N(0, σ 2 e). This model can be written in the form y = Xβ + Zu + e, where Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
12 y = y 111 y 112 y 113 y 121 y 122 y 211 y 212 y 213 y 221 y 222, X = µ, β = α 1 α 2, Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
13 Z = , u = p 11 p 12 p 21 p 22, e = e 111 e 112 e 113 e 121 e 122 e 211 e 212 e 213 e 221 e 222. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
14 y 111 y 112 y 113 y 121 y 122 y 211 y 212 y 213 y 221 y 222 = µ + α 1 µ + α 1 µ + α 1 µ + α 1 µ + α 1 µ + α 2 µ + α 2 µ + α 2 µ + α 2 µ + α 2 + p 11 p 11 p 11 p 12 p 12 p 21 p 21 p 21 p 22 p 22 + e 111 e 112 e 113 e 121 e 122 e 211 e 212 e 213 e 221 e 222 opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
15 y 111 = µ + α 1 + p 11 + e 111 y 112 = µ + α 1 + p 11 + e 112 y 113 = µ + α 1 + p 11 + e 113 y 121 = µ + α 1 + p 12 + e 121 y 122 = µ + α 1 + p 12 + e 122 y 211 = µ + α 2 + p 21 + e 211 y 212 = µ + α 2 + p 21 + e 212 y 213 = µ + α 2 + p 21 + e 213 y 221 = µ + α 2 + p 22 + e 221 y 222 = µ + α 2 + p 22 + e 222 opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
16 G = Var(u) = Var([p 11, p 12, p 21, p 22 ] ) = σ 2 pi 4 4 R = Var(e) = σ 2 ei Var(y) = ZGZ + R = Zσ 2 piz + σ 2 ei = σ 2 pzz + σ 2 ei. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
17 ZZ = Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
18 Thus, Var(y) = σ 2 pzz + σ 2 ei is a block diagonal matrix. The first block is Var y 111 y 112 = σ 2 p + σ 2 e σ 2 p σ 2 p σ 2 p σ 2 p + σ 2 e σ 2 p. y 113 σ 2 p σ 2 p σ 2 p + σ 2 e opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
19 Note that Var(y ijk ) = σp 2 + σe 2 i, j, k. Cov(y ijk, y ijk ) = σp 2 i, j, and k k. Cov(y ijk, y i j k ) = 0 if i i or j j. Any two observations from the same pot have covariance σp. 2 Any two observations from different pots are uncorrelated. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
20 Alternative Derivation of Variances and Covariances Var(y ijk ) = Var(µ + α i + p ij + e ijk ) = Var(p ij + e ijk ) = Var(p ij ) + Var(e ijk ) + Cov(p ij, e ijk ) + Cov(e ijk, p ij ) = σp 2 + σe = σp 2 + σe. 2 For k k, Cov(y ijk, y ijk ) = Cov(µ + α i + p ij + e ijk, µ + α i + p ij + e ijk ) = Cov(p ij + e ijk, p ij + e ijk ) = Cov(p ij, p ij ) + Cov(p ij, e ijk ) +Cov(e ijk, p ij ) + Cov(e ijk, e ijk ) = Var(p ij ) = σp. 2 Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
21 Note that Var(y) may be written as σ 2 ev where V is a block diagonal matrix with blocks of the form 1 + σp/σ 2 e 2 σp/σ 2 e 2... σp/σ 2 e 2 σ p/σ 2 e σp/σ 2 e 2... σp/σ 2 e σ 2 p/σ 2 e σ 2 p/σ 2 e σ 2 p/σ 2 e Thus, if σ 2 p/σ 2 e were known, we would have the Aitken Model. y = Xβ + ɛ, where ɛ = Zu + e N(0, σ 2 V), σ 2 σ 2 e. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
22 Thus, if σ 2 p/σ 2 e were known, we would use GLS to estimate any estimable Cβ by Cˆβ V = C(X V 1 X) X V 1 y. However, we seldom know σ 2 p/σ 2 e or, more generally, Σ or V. For the general problem where Var(y) = Σ is an unknown positive definite matrix, we can rewrite Σ as σ 2 V, where σ 2 is an unknown positive variance and V is an unknown positive definite matrix. As in our simple example, each entry of V is usually assumed to be a known function of few unknown parameters. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
23 Thus, our strategy for estimating an estimable Cβ involves estimating the unknown parameters in V to obtain Cˆβ ˆV = C(X ˆV 1 X) X ˆV 1 y. In general, Cˆβ ˆV = C(X ˆV 1 X) X ˆV 1 y is an nonlinear estimator that is an approximation to Cˆβ V = C(X V 1 X) X V 1 y, which would be the BLUE of Cβ if V were known. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
24 In special cases, Cˆβ ˆV may be a linear estimator. However, even for our simple example involving seedling height, Cˆβ ˆV is a nonlinear estimator of Cβ for C = [1, 1, 0] Cβ = µ + α 1, C = [1, 0, 1] Cβ = µ + α 2, and C = [0, 1, 1] Cβ = α 1 α 2. Confidence intervals and tests for these estimable functions are not exact. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
25 In our simple example involving seedling height, there was only one random factor (pot). When there are m random factors, we can partition Z and u as Z = [Z 1,..., Z m ] and u = u 1. u m, where u j is the vector of random effects associated with factor j (j = 1,..., m). opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
26 We can write Zu as [Z 1,..., Z m ] u 1. u m = m Z j u j. j=1 We often assume that all random effects (including random errors) are mutually independent and that the random effects associated with the jth random factor have variance σ 2 j (j = 1,..., m). Under these assumptions, Var(y) = ZGZ + R = m σj 2 Z j Z j + σei. 2 j=1 opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
27 Example 2 Consider an experiment involving 4 litters of 4 animals each. Suppose 4 treatments are randomly assigned to the 4 animals in each litter. Suppose we obtain two replicate muscle samples from each animal and measure the response of interest for each muscle sample. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
28 Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
29 Let y ijk denote the kth measure of the response for the animal from litter j that received treatment i (i = 1, 2, 3, 4; j = 1, 2, 3, 4; k = 1, 2). Suppose y ijk = µ + τ i + l j + a ij + e ijk, where β = [µ, τ 1, τ 2, τ 3, τ 4 ] R 5 is an unknown vector of fixed parameters, u = [l 1, l 2, l 3, l 4, a 11, a 21, a 31, a 41, a 12,..., a 34, a 44 ] is a vector of random effects, and opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
30 e = [e 111, e 112, e 211, e 212, e 311, e 312, e 411, e 412,..., e 441, e 442 ] is a vector of random errors. With y = [y 111, y 112, y 211, y 212, y 311, y 312, y 411, y 412,..., y 441, y 442 ], we can write the model as a linear mixed-effects model y = Xβ + Zu + e, where opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
31 X = , Z = The matrix above repeated three more times Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
32 Kronecker Product Notation A B = = a 11 a 12 a 1n a 21 a 22 a 2n B... a m1 a m2 a mn a 11 B a 12 B a 1n B a 21 B a 22 B a 2n B... a m1 B a m2 B a mn B Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
33 We can write less and be more precise using Kronecker product notation. X = [ 1 8 1, I ], Z = [ I , I ]. In this experiment, we have two random factors: litter and animal. We can partition our random effects vector u into a vector of litter effects and a vector of animal effects: Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
34 [ ] l u =, l = a l 1 l 2 l 3 l 4 a 11 a 44 a 21 a 31, a = a 41. a 12. We make the usual assumption that [ ] ([ ] [ ]) l 0 σ 2 u = N, l I 0, a 0 0 σai 2 where σl 2, σ2 a R + are unknown parameters. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
35 We can partition Z = [ I = [Z l, Z a ]. 2 1 ] We have Zu = [Z l, Z a ] [ ] l a = Z l l + Z a a and Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
36 Var(Zu) = ZGZ = [Z l, Z a ] [ ] [ ] σ 2 l I 0 Z l 0 σ 2 ai Z a = Z l (σ 2 l I)Z l + Z a(σ 2 ai)z a = σ 2 l Z lz l + σ2 az a Z a = σ 2 l I σ2 a I Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
37 We usually assume that all random effects and random errors are mutually independent and that the errors (like the effects within each factor) are identically distributed: l 0 σl 2 a N 0, I σai 2 0. e σei 2 The unknown variance parameters σl 2, σ2 a, σe 2 R + are called variance components. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
38 In this case, we have R = Var(e) = σ 2 ei. Thus, Var(y) = ZGZ + R = σl 2 Z lz l + σ2 az a Z a + σei. 2 This is a block diagonal matrix with a block as follows. (To get a block to fit on one slide, let l = σ 2 l, a = σ2 a, e = σ 2 e). opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
39 l + a + e l + a l l l l l l l + a l + a + e l l l l l l l l l + a + e l + a l l l l l l l + a l + a + e l l l l l l l l l + a + e l + a l l l l l l l + a l + a + e l l l l l l l l l + a + e l + a l l l l l l l + a l + a + e Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
40 Random Effects Specify the Correlation Structure Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
41 Without the mouse random effects, our model would correspond to an RCBD with 2 mice per treatment per litter Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
42 With no random effects, our model would correspond to a CRD with 8 mice per treatment Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
43 Review of Experimental Design Terminology Experiment An investigation in which the investigator applies some treatments to experimental units and then observes the effect of the treatments on the experimental units by measuring one or more response variables. opyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
44 Treatment a condition or set of conditions applied to experimental units in an experiment. Experimental Unit the physical entity to which a treatment is randomly assigned and independently applied. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
45 Response Variable a characteristic of an experimental unit that is measured after treatment and analyzed to assess the effects of treatments on experimental units. Observational Unit the unit on which a response variable is measured. There is often a one-to-one correspondence between experimental units and observational units, but that is not always true. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
46 In our example involving plant heights and soil moisture levels, pots were the experimental units because soil moisture levels were randomly assigned to pots. Seedlings were the observational units because the response was measured separately for each seedling. Whenever there is more than one observational unit for an experimental unit or whenever the response is measured multiple times for an experimental unit, we say we have multiple observations per experiment unit. This scenario is also referred to as subsampling or pseudo-replication. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
47 Whenever an experiment involves multiple observations per experimental unit, it is important to include a random effect for each experimental unit. Without a random effect for each experimental unit, a one-to-one correspondence between observations and experimental units is assumed. Including random effects in a model is one way to account for a lack of independence among observations that might be expected based on the design of an experiment. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
48 Completely Randomized Design (CRD) experimental design in which, for given number of experiment units per treatment, all possible assignments of treatments to experimental units are equally likely. Block a group of experimental units that, prior to treatment, are expected to be more like one another (with respect to one or more response variables) than experimental units in general. Randomized Complete Block Design (RCBD) experimental design in which separate and completely randomized treatment assignments are made for each of multiple blocks in such a way that all treatments have at least one experimental unit in each block. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
49 Blocking grouping similar experimental units together and assigning different treatments within such groups of experimental units. The experiment involving muscle samples from mice used blocking. Each litter was a block in that experiment. Each mouse was an experimental unit. Each muscle sample was an observational unit. Copyright c 2018 Dan Nettleton (Iowa State University) 11. Statistics / 49
Linear Mixed-Effects Models. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 34
Linear Mixed-Effects Models Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 34 The Linear Mixed-Effects Model y = Xβ + Zu + e X is an n p design matrix of known constants β R
More informationWhen is the OLSE the BLUE? Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 40
When is the OLSE the BLUE? Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 40 When is the Ordinary Least Squares Estimator (OLSE) the Best Linear Unbiased Estimator (BLUE)? Copyright
More informationREPEATED MEASURES. Copyright c 2012 (Iowa State University) Statistics / 29
REPEATED MEASURES Copyright c 2012 (Iowa State University) Statistics 511 1 / 29 Repeated Measures Example In an exercise therapy study, subjects were assigned to one of three weightlifting programs i=1:
More informationTHE ANOVA APPROACH TO THE ANALYSIS OF LINEAR MIXED EFFECTS MODELS
THE ANOVA APPROACH TO THE ANALYSIS OF LINEAR MIXED EFFECTS MODELS We begin with a relatively simple special case. Suppose y ijk = µ + τ i + u ij + e ijk, (i = 1,..., t; j = 1,..., n; k = 1,..., m) β =
More informationEstimating Estimable Functions of β. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 17
Estimating Estimable Functions of β Copyright c 202 Dan Nettleton (Iowa State University) Statistics 5 / 7 The Response Depends on β Only through Xβ In the Gauss-Markov or Normal Theory Gauss-Markov Linear
More informationANOVA Variance Component Estimation. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 32
ANOVA Variance Component Estimation Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 32 We now consider the ANOVA approach to variance component estimation. The ANOVA approach
More informationANOVA Variance Component Estimation. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 32
ANOVA Variance Component Estimation Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 32 We now consider the ANOVA approach to variance component estimation. The ANOVA approach
More informationML and REML Variance Component Estimation. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 58
ML and REML Variance Component Estimation Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 58 Suppose y = Xβ + ε, where ε N(0, Σ) for some positive definite, symmetric matrix Σ.
More informationLINEAR MIXED-EFFECT MODELS
LINEAR MIXED-EFFECT MODELS Studies / data / models seen previously in 511 assumed a single source of error variation y = Xβ + ɛ. β are fixed constants (in the frequentist approach to inference) ɛ is the
More informationExample 1: Two-Treatment CRD
Introduction to Mixed Linear Models in Microarray Experiments //0 Copyright 0 Dan Nettleton Statistical Models A statistical model describes a formal mathematical data generation mechanism from which an
More informationThe Gauss-Markov Model. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 61
The Gauss-Markov Model Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 61 Recall that Cov(u, v) = E((u E(u))(v E(v))) = E(uv) E(u)E(v) Var(u) = Cov(u, u) = E(u E(u)) 2 = E(u 2
More informationScheffé s Method. opyright c 2012 Dan Nettleton (Iowa State University) Statistics / 37
Scheffé s Method opyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 37 Scheffé s Method: Suppose where Let y = Xβ + ε, ε N(0, σ 2 I). c 1β,..., c qβ be q estimable functions, where
More information20. REML Estimation of Variance Components. Copyright c 2018 (Iowa State University) 20. Statistics / 36
20. REML Estimation of Variance Components Copyright c 2018 (Iowa State University) 20. Statistics 510 1 / 36 Consider the General Linear Model y = Xβ + ɛ, where ɛ N(0, Σ) and Σ is an n n positive definite
More information2. A Review of Some Key Linear Models Results. Copyright c 2018 Dan Nettleton (Iowa State University) 2. Statistics / 28
2. A Review of Some Key Linear Models Results Copyright c 2018 Dan Nettleton (Iowa State University) 2. Statistics 510 1 / 28 A General Linear Model (GLM) Suppose y = Xβ + ɛ, where y R n is the response
More informationGeneral Linear Test of a General Linear Hypothesis. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 35
General Linear Test of a General Linear Hypothesis Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 35 Suppose the NTGMM holds so that y = Xβ + ε, where ε N(0, σ 2 I). opyright
More informationPreliminaries. Copyright c 2018 Dan Nettleton (Iowa State University) Statistics / 38
Preliminaries Copyright c 2018 Dan Nettleton (Iowa State University) Statistics 510 1 / 38 Notation for Scalars, Vectors, and Matrices Lowercase letters = scalars: x, c, σ. Boldface, lowercase letters
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Mixed models Yan Lu Feb, 2018, week 7 1 / 17 Some commonly used experimental designs related to mixed models Two way or three way random/mixed effects
More informationTwo-Color Microarray Experimental Design Notation. Simple Examples of Analysis for a Single Gene. Microarray Experimental Design Notation
Simple Examples of Analysis for a Single Gene wo-olor Microarray Experimental Design Notation /3/0 opyright 0 Dan Nettleton Microarray Experimental Design Notation Microarray Experimental Design Notation
More information3. The F Test for Comparing Reduced vs. Full Models. opyright c 2018 Dan Nettleton (Iowa State University) 3. Statistics / 43
3. The F Test for Comparing Reduced vs. Full Models opyright c 2018 Dan Nettleton (Iowa State University) 3. Statistics 510 1 / 43 Assume the Gauss-Markov Model with normal errors: y = Xβ + ɛ, ɛ N(0, σ
More informationThe Aitken Model. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 41
The Aitken Model Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 41 The Aitken Model (AM): Suppose where y = Xβ + ε, E(ε) = 0 and Var(ε) = σ 2 V for some σ 2 > 0 and some known
More informationThe Multivariate Normal Distribution. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 36
The Multivariate Normal Distribution Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 36 The Moment Generating Function (MGF) of a random vector X is given by M X (t) = E(e t X
More informationLikelihood Ratio Test of a General Linear Hypothesis. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 42
Likelihood Ratio Test of a General Linear Hypothesis Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 42 Consider the Likelihood Ratio Test of H 0 : Cβ = d vs H A : Cβ d. Suppose
More informationEstimation of the Response Mean. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 27
Estimation of the Response Mean Copyright c 202 Dan Nettleton (Iowa State University) Statistics 5 / 27 The Gauss-Markov Linear Model y = Xβ + ɛ y is an n random vector of responses. X is an n p matrix
More informationThe Multivariate Normal Distribution. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 36
The Multivariate Normal Distribution Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 36 The Moment Generating Function (MGF) of a random vector X is given by M X (t) = E(e t X
More information13. The Cochran-Satterthwaite Approximation for Linear Combinations of Mean Squares
13. The Cochran-Satterthwaite Approximation for Linear Combinations of Mean Squares opyright c 2018 Dan Nettleton (Iowa State University) 13. Statistics 510 1 / 18 Suppose M 1,..., M k are independent
More informationGeneralized Linear Mixed-Effects Models. Copyright c 2015 Dan Nettleton (Iowa State University) Statistics / 58
Generalized Linear Mixed-Effects Models Copyright c 2015 Dan Nettleton (Iowa State University) Statistics 510 1 / 58 Reconsideration of the Plant Fungus Example Consider again the experiment designed to
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Mixed Effects Estimation, Residuals Diagnostics Week 11, Lecture 1
MA 575 Linear Models: Cedric E Ginestet, Boston University Mixed Effects Estimation, Residuals Diagnostics Week 11, Lecture 1 1 Within-group Correlation Let us recall the simple two-level hierarchical
More informationMixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012
Mixed-Model Estimation of genetic variances Bruce Walsh lecture notes Uppsala EQG 01 course version 8 Jan 01 Estimation of Var(A) and Breeding Values in General Pedigrees The above designs (ANOVA, P-O
More informationLecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013
Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 013 1 Estimation of Var(A) and Breeding Values in General Pedigrees The classic
More informationLecture 2: Linear and Mixed Models
Lecture 2: Linear and Mixed Models Bruce Walsh lecture notes Introduction to Mixed Models SISG, Seattle 18 20 July 2018 1 Quick Review of the Major Points The general linear model can be written as y =
More informationDeterminants. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 25
Determinants opyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 25 Notation The determinant of a square matrix n n A is denoted det(a) or A. opyright c 2012 Dan Nettleton (Iowa State
More informationMixed-Models. version 30 October 2011
Mixed-Models version 30 October 2011 Mixed models Mixed models estimate a vector! of fixed effects and one (or more) vectors u of random effects Both fixed and random effects models always include a vector
More informationXβ is a linear combination of the columns of X: Copyright c 2010 Dan Nettleton (Iowa State University) Statistics / 25 X =
The Gauss-Markov Linear Model y Xβ + ɛ y is an n random vector of responses X is an n p matrix of constants with columns corresponding to explanatory variables X is sometimes referred to as the design
More information21. Best Linear Unbiased Prediction (BLUP) of Random Effects in the Normal Linear Mixed Effects Model
21. Best Linear Unbiased Prediction (BLUP) of Random Effects in the Normal Linear Mixed Effects Model Copyright c 2018 (Iowa State University) 21. Statistics 510 1 / 26 C. R. Henderson Born April 1, 1911,
More informationvariability of the model, represented by σ 2 and not accounted for by Xβ
Posterior Predictive Distribution Suppose we have observed a new set of explanatory variables X and we want to predict the outcomes ỹ using the regression model. Components of uncertainty in p(ỹ y) variability
More informationMIXED MODELS THE GENERAL MIXED MODEL
MIXED MODELS This chapter introduces best linear unbiased prediction (BLUP), a general method for predicting random effects, while Chapter 27 is concerned with the estimation of variances by restricted
More informationMatrix Approach to Simple Linear Regression: An Overview
Matrix Approach to Simple Linear Regression: An Overview Aspects of matrices that you should know: Definition of a matrix Addition/subtraction/multiplication of matrices Symmetric/diagonal/identity matrix
More informationPeter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8
Contents 1 Linear model 1 2 GLS for multivariate regression 5 3 Covariance estimation for the GLM 8 4 Testing the GLH 11 A reference for some of this material can be found somewhere. 1 Linear model Recall
More informationLecture 6: Linear models and Gauss-Markov theorem
Lecture 6: Linear models and Gauss-Markov theorem Linear model setting Results in simple linear regression can be extended to the following general linear model with independently observed response variables
More informationLecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012
Lecture 3: Linear Models Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector of observed
More informationEstimable Functions and Their Least Squares Estimators. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 51
Estimable Functions and Their Least Squares Estimators Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 51 Consider the GLM y = n p X β + ε, where E(ε) = 0. p 1 n 1 n 1 Suppose
More informationSTAT 510 Final Exam Spring 2015
STAT 510 Final Exam Spring 2015 Instructions: The is a closed-notes, closed-book exam No calculator or electronic device of any kind may be used Use nothing but a pen or pencil Please write your name and
More informationDistributions of Quadratic Forms. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 31
Distributions of Quadratic Forms Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 31 Under the Normal Theory GMM (NTGMM), y = Xβ + ε, where ε N(0, σ 2 I). By Result 5.3, the NTGMM
More informationAnalysis of Covariance
Analysis of Covariance Timothy Hanson Department of Statistics, University of South Carolina Stat 506: Introduction to Experimental Design 1 / 11 ANalysis of COVAriance Add a continuous predictor to an
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject
More informationConstraints on Solutions to the Normal Equations. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 41
Constraints on Solutions to the Normal Equations Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 41 If rank( n p X) = r < p, there are infinitely many solutions to the NE X Xb
More informationSampling Distributions
Merlise Clyde Duke University September 8, 2016 Outline Topics Normal Theory Chi-squared Distributions Student t Distributions Readings: Christensen Apendix C, Chapter 1-2 Prostate Example > library(lasso2);
More informationTwo-Way Factorial Designs
81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like
More informationFactorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik
Factorial Treatment Structure: Part I Lukas Meier, Seminar für Statistik Factorial Treatment Structure So far (in CRD), the treatments had no structure. So called factorial treatment structure exists if
More informationMiscellaneous Results, Solving Equations, and Generalized Inverses. opyright c 2012 Dan Nettleton (Iowa State University) Statistics / 51
Miscellaneous Results, Solving Equations, and Generalized Inverses opyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 51 Result A.7: Suppose S and T are vector spaces. If S T and
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models Mixed effects models - II Henrik Madsen, Jan Kloppenborg Møller, Anders Nielsen April 16, 2012 H. Madsen, JK. Møller, A. Nielsen () Chapman & Hall
More informationUnit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs
Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand how to interpret a random effect Know the different
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationSTAT 100C: Linear models
STAT 100C: Linear models Arash A. Amini June 9, 2018 1 / 56 Table of Contents Multiple linear regression Linear model setup Estimation of β Geometric interpretation Estimation of σ 2 Hat matrix Gram matrix
More information3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is
Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are
More informationStat 217 Final Exam. Name: May 1, 2002
Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are
More informationChapter 5 Prediction of Random Variables
Chapter 5 Prediction of Random Variables C R Henderson 1984 - Guelph We have discussed estimation of β, regarded as fixed Now we shall consider a rather different problem, prediction of random variables,
More informationLinear Algebra Review
Linear Algebra Review Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Linear Algebra Review 1 / 45 Definition of Matrix Rectangular array of elements arranged in rows and
More informationPreliminary Linear Algebra 1. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 100
Preliminary Linear Algebra 1 Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 100 Notation for all there exists such that therefore because end of proof (QED) Copyright c 2012
More informationIntroduction The framework Bias and variance Approximate computation of leverage Empirical evaluation Discussion of sampling approach in big data
Discussion of sampling approach in big data Big data discussion group at MSCS of UIC Outline 1 Introduction 2 The framework 3 Bias and variance 4 Approximate computation of leverage 5 Empirical evaluation
More informationChapter 3 Best Linear Unbiased Estimation
Chapter 3 Best Linear Unbiased Estimation C R Henderson 1984 - Guelph In Chapter 2 we discussed linear unbiased estimation of k β, having determined that it is estimable Let the estimate be a y, and if
More informationBest Linear Unbiased Prediction (BLUP) of Random Effects in the Normal Linear Mixed Effects Model. *Modified notes from Dr. Dan Nettleton from ISU
Best Linear Unbiased Prediction (BLUP) of Random Effects in the Normal Linear Mixed Effects Model *Modified notes from Dr. Dan Nettleton from ISU Suppose intelligence quotients (IQs) for a population of
More informationRegression Review. Statistics 149. Spring Copyright c 2006 by Mark E. Irwin
Regression Review Statistics 149 Spring 2006 Copyright c 2006 by Mark E. Irwin Matrix Approach to Regression Linear Model: Y i = β 0 + β 1 X i1 +... + β p X ip + ɛ i ; ɛ i iid N(0, σ 2 ), i = 1,..., n
More informationSTAT22200 Spring 2014 Chapter 8A
STAT22200 Spring 2014 Chapter 8A Yibi Huang May 13, 2014 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley,
More informationPrediction of New Observations
Statistic Seminar: 6 th talk ETHZ FS2010 Prediction of New Observations Martina Albers 12. April 2010 Papers: Welham (2004), Yiang (2007) 1 Content Introduction Prediction of Mixed Effects Prediction of
More informationMLES & Multivariate Normal Theory
Merlise Clyde September 6, 2016 Outline Expectations of Quadratic Forms Distribution Linear Transformations Distribution of estimates under normality Properties of MLE s Recap Ŷ = ˆµ is an unbiased estimate
More informationWeighted Least Squares
Weighted Least Squares The standard linear model assumes that Var(ε i ) = σ 2 for i = 1,..., n. As we have seen, however, there are instances where Var(Y X = x i ) = Var(ε i ) = σ2 w i. Here w 1,..., w
More informationPh.D. Qualifying Exam Friday Saturday, January 3 4, 2014
Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Put your solution to each problem on a separate sheet of paper. Problem 1. (5166) Assume that two random samples {x i } and {y i } are independently
More informationWeighted Least Squares
Weighted Least Squares The standard linear model assumes that Var(ε i ) = σ 2 for i = 1,..., n. As we have seen, however, there are instances where Var(Y X = x i ) = Var(ε i ) = σ2 w i. Here w 1,..., w
More informationLecture Notes. Introduction
5/3/016 Lecture Notes R. Rekaya June 1-10, 016 Introduction Variance components play major role in animal breeding and genetic (estimation of BVs) It has been an active area of research since early 1950
More informationLecture 9 Multi-Trait Models, Binary and Count Traits
Lecture 9 Multi-Trait Models, Binary and Count Traits Guilherme J. M. Rosa University of Wisconsin-Madison Mixed Models in Quantitative Genetics SISG, Seattle 18 0 September 018 OUTLINE Multiple-trait
More informationWLS and BLUE (prelude to BLUP) Prediction
WLS and BLUE (prelude to BLUP) Prediction Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark April 21, 2018 Suppose that Y has mean X β and known covariance matrix V (but Y need
More informationApplied Statistics Preliminary Examination Theory of Linear Models August 2017
Applied Statistics Preliminary Examination Theory of Linear Models August 2017 Instructions: Do all 3 Problems. Neither calculators nor electronic devices of any kind are allowed. Show all your work, clearly
More informationSTAT 100C: Linear models
STAT 100C: Linear models Arash A. Amini April 27, 2018 1 / 1 Table of Contents 2 / 1 Linear Algebra Review Read 3.1 and 3.2 from text. 1. Fundamental subspace (rank-nullity, etc.) Im(X ) = ker(x T ) R
More informationRANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar
Paper S02-2007 RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Eli Lilly & Company, Indianapolis, IN ABSTRACT
More informationPrediction. is a weighted least squares estimate since it minimizes. Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark
Prediction Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark March 22, 2017 WLS and BLUE (prelude to BLUP) Suppose that Y has mean β and known covariance matrix V (but Y need not
More informationFinancial Econometrics
Material : solution Class : Teacher(s) : zacharias psaradakis, marian vavra Example 1.1: Consider the linear regression model y Xβ + u, (1) where y is a (n 1) vector of observations on the dependent variable,
More informationIf you believe in things that you don t understand then you suffer. Stevie Wonder, Superstition
If you believe in things that you don t understand then you suffer Stevie Wonder, Superstition For Slovenian municipality i, i = 1,, 192, we have SIR i = (observed stomach cancers 1995-2001 / expected
More informationChapter 11 MIVQUE of Variances and Covariances
Chapter 11 MIVQUE of Variances and Covariances C R Henderson 1984 - Guelph The methods described in Chapter 10 for estimation of variances are quadratic, translation invariant, and unbiased For the balanced
More informationREGRESSION WITH SPATIALLY MISALIGNED DATA. Lisa Madsen Oregon State University David Ruppert Cornell University
REGRESSION ITH SPATIALL MISALIGNED DATA Lisa Madsen Oregon State University David Ruppert Cornell University SPATIALL MISALIGNED DATA 10 X X X X X X X X 5 X X X X X 0 X 0 5 10 OUTLINE 1. Introduction 2.
More informationStat 511 Outline (Spring 2009 Revision of 2008 Version)
Stat 511 Outline Spring 2009 Revision of 2008 Version Steve Vardeman Iowa State University September 29, 2014 Abstract This outline summarizes the main points of lectures based on Ken Koehler s class notes
More information. a m1 a mn. a 1 a 2 a = a n
Biostat 140655, 2008: Matrix Algebra Review 1 Definition: An m n matrix, A m n, is a rectangular array of real numbers with m rows and n columns Element in the i th row and the j th column is denoted by
More informationChapter 5: Joint Probability Distributions
Chapter 5: Joint Probability Distributions Seungchul Baek Department of Statistics, University of South Carolina STAT 509: Statistics for Engineers 1 / 19 Joint pmf Definition: The joint probability mass
More informationTopic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]
Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] The completely randomized design Treatments are randomly assigned to e.u. such
More informationMIT Spring 2015
Regression Analysis MIT 18.472 Dr. Kempthorne Spring 2015 1 Outline Regression Analysis 1 Regression Analysis 2 Multiple Linear Regression: Setup Data Set n cases i = 1, 2,..., n 1 Response (dependent)
More informationStat 511 Outline (Spring 2008 Revision of 2004 Version)
Stat 511 Outline (Spring 2008 Revision of 2004 Version) Steve Vardeman Iowa State University February 1, 2008 Abstract This outline summarizes the main points of lectures based on Ken Koehler s class notes
More informationWeak Instruments and the First-Stage Robust F-statistic in an IV regression with heteroskedastic errors
Weak Instruments and the First-Stage Robust F-statistic in an IV regression with heteroskedastic errors Uijin Kim u.kim@bristol.ac.uk Department of Economics, University of Bristol January 2016. Preliminary,
More informationLecture 7 Correlated Characters
Lecture 7 Correlated Characters Bruce Walsh. Sept 2007. Summer Institute on Statistical Genetics, Liège Genetic and Environmental Correlations Many characters are positively or negatively correlated at
More informationSTAT 525 Fall Final exam. Tuesday December 14, 2010
STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will
More informationFirst Year Examination Department of Statistics, University of Florida
First Year Examination Department of Statistics, University of Florida August 20, 2009, 8:00 am - 2:00 noon Instructions:. You have four hours to answer questions in this examination. 2. You must show
More informationHeteroskedasticity. Part VII. Heteroskedasticity
Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least
More informationChapter 4: Models for Stationary Time Series
Chapter 4: Models for Stationary Time Series Now we will introduce some useful parametric models for time series that are stationary processes. We begin by defining the General Linear Process. Let {Y t
More informationQuestions and Answers on Heteroskedasticity, Autocorrelation and Generalized Least Squares
Questions and Answers on Heteroskedasticity, Autocorrelation and Generalized Least Squares L Magee Fall, 2008 1 Consider a regression model y = Xβ +ɛ, where it is assumed that E(ɛ X) = 0 and E(ɛɛ X) =
More informationLecture 4. Basic Designs for Estimation of Genetic Parameters
Lecture 4 Basic Designs for Estimation of Genetic Parameters Bruce Walsh. Aug 003. Nordic Summer Course Heritability The reason for our focus, indeed obsession, on the heritability is that it determines
More informationSTAT 540: Data Analysis and Regression
STAT 540: Data Analysis and Regression Wen Zhou http://www.stat.colostate.edu/~riczw/ Email: riczw@stat.colostate.edu Department of Statistics Colorado State University Fall 205 W. Zhou (Colorado State
More informationAnalysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington
Analsis of Longitudinal Data Patrick J. Heagert PhD Department of Biostatistics Universit of Washington 1 Auckland 2008 Session Three Outline Role of correlation Impact proper standard errors Used to weight
More informationWhat is Experimental Design?
One Factor ANOVA What is Experimental Design? A designed experiment is a test in which purposeful changes are made to the input variables (x) so that we may observe and identify the reasons for change
More informationSolutions to Odd-Numbered End-of-Chapter Exercises: Chapter 14
Introduction to Econometrics (3 rd Updated Edition) by James H. Stock and Mark W. Watson Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 14 (This version July 0, 014) 015 Pearson Education,
More informationSample Size / Power Calculations
Sample Size / Power Calculations A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two
More information