Oct Analysis of variance models. One-way anova. Three sheep breeds. Finger ridges. Random and. Fixed effects model. The random effects model

Size: px
Start display at page:

Download "Oct Analysis of variance models. One-way anova. Three sheep breeds. Finger ridges. Random and. Fixed effects model. The random effects model"

Transcription

1 s s Oct / 34

2 s Consider N = n 0 + n n k 1 observations, which form k groups, of sizes n 0, n 1,..., n k 1. The r-th group has sample mean Ȳ r The overall mean (for all groups combined) is Ȳ = N 1 Y i. Observations in the r-th group are N(m r, σ 2 W ), for r = 0 to k 1: Y = b 0 + b 1 X b k 1 X k 1 + e e is N(0, σ 2 W ), and X 1... X k 1 are indicator variables. 2 / 34

3 Indicator variables s For an observation in group 0, all indicator variables are zero. Otherwise, for r 1, the value of X r for an observation in the r-th group is 1, and all other indicators are zero. In terms of group means, regression parameters are: b 0 = m 0, and b r = m r m 0 for r = 1... k 1 The null hypothesis : m 0 = m 1 = = m k 1 (no difference among groups) is equivalent to the standard multiple regression hypothesis b 1 = b 2 = = b k 1 = 0. 3 / 34

4 Parameter estimates s Estimates of the parameters are b0 = Ȳ 0, br = Ȳ r Ȳ 0, for r = 1... k 1 The fitted value for an observation in the r-th group is Ŷ = b 0 + b 1 X b k 1 X k 1 = Ȳ r (the sample mean for the r-th group) 4 / 34

5 Between-groups sum of squares s Regression sum of squares is k 1 S B = n r (Ȳ r Ȳ) 2, r=0 also referred to as the between-group sum of squares. Residual sum of squares S W is the sum of the residual sum of squares for each group ( within-group sum of squares). Source DF Sum Sq Mean Sq F Between k 1 S B M B M B /M W Within N k S W M W Total N 1 S yy M W (also written S 2 ) estimates the residual σ 2 W. 5 / 34

6 Hand calculation s T r is the subtotal for the r-th group, T = T 0 + T T k 1 is the total of all observations. S yy = N i=1 Yi 2 T2 N, S k 1 B = r=0 T 2 r n r T2 N S W is calculated as the difference between S yy and S B. 6 / 34

7 s A random sample is taken from each of 3 sheep, and blood Cu content measured on the 16 sheep. Blackface Welsh Cross / 34

8 Hand calculation of sheep data s Blackface Welsh Cross Total (5) (5) (6) (16) Uncorrected sum of squares for all 16 measurements is Total (corrected) sum of squares = = Between sum of squares = = / 34

9 Anova table s Source DF Sum Sq Mean Sq F Between 2 Within 13 Total 15 9 / 34

10 Anova table s Source DF Sum Sq Mean Sq F Between Within 13 Total / 34

11 Anova table s Source DF Sum Sq Mean Sq F Between Within Total / 34

12 Anova table s Source DF Sum Sq Mean Sq F Between Within Total The F test is highly significant (P < 0.001) 9 / 34

13 in identical twins s Fam Twin 1 Twin Fam Twin 1 Twin / 34

14 s Special case (all groups of size two) Fam sum diff Between family sum of squares Fam sum diff ( )/ /24 = = Within family sum of squares ( )/2 = / 34

15 anova table (finger ridges) s Source DF Sum Sq Mean Sq F Between families Within families Total The F test is very highly significant P < / 34

16 s The ANOVA calculations for these two examples are similar, but the interpretation of the mean squares is different. In the first example, different are regarded as, and we are primarily interested in differences (contrasts) between. Examples of factors which are usually treated as fixed are experimental treatments, sex, breed. 13 / 34

17 Random effects s In the second example, differences between particular families are not of interest. Instead, we regard the twelve family means as sampled from a N(m, σ 2 B ) distribution. Primary interest is in the of σ 2 B and σ2 W. Differences between families are treated as random effects. Examples of random effects are families, litters, or sires, where the levels are a sample of those possible and a repeat experiment would use different levels. 14 / 34

18 Fixed-effect s In the fixed-, F tests the hypothesis m 0 = m 1 = = m k 1, or equivalently, b 1 = b 2 = = b k 1 = 0. In the sheep example, F = on 2 and 13 d.f. (very highly significant). Differences between are established beyond reasonable doubt. 15 / 34

19 Comparing means s Mean values for each breed: Blackface Welsh Cross Estimated standard error of a difference between two means based on n 1 and n 2 observations is MW (1/n 1 + 1/n 2 ) where M W is the within group mean square from the ANOVA. Standard errors of differences between breed means are (Blackface v Welsh), and (otherwise). 16 / 34

20 Comparing means s If m 1 = m 2, the difference between means Ȳ 1 Ȳ 2 divided by its estimated s.e. has a t distn with N k = 13 d.f. Comparison Estimate SE t Blackface v Welsh Blackface v Cross Welsh v Cross The upper 2.5% point of t on 13 d.f. is Difference between Blackface and Cross does not quite reach 5% significance. The Welsh breed has significantly higher measurements than the other two. Confidence intervals are calculated in the usual way. 17 / 34

21 Contrasts s The general form of a contrast is a 1 m 1 + a 2 m a k m k, where a 1 + a a k = 0. The estimate of the contrast is a 1 Ȳ 1 + a 2 Ȳ a k Ȳ k, with estimated s.e. ( M W a 2 1 /n 1 + a 2 2 /n a 2 k /n ) k The t statistic for testing whether the contrast is zero is the ratio of the estimated contrast to its s.e. 18 / 34

22 Another contrast s In the sheep example, a simple genetic hypothesis is H 0 : m B + m W 2m C = 0. Estimate of the contrast Ȳ B + Ȳ W 2Ȳ C is normally distd with zero mean (if H 0 is true) and estimated M W (1/n B + 1/n W + 4/n C ) Here the estimate is ( ) = 0.66, with estimated 0.732(1/5 + 1/5 + 4/6) = t = 0.66/ = with 13 d.f. Not significant, there is no evidence against the genetic. 19 / 34

23 Random Equal group sizes s To the fixed- we add the assumption that m 1... m k are drawn randomly from an N(m, σ 2 B ) distn. We now have E(Y i ) = m, var (Y i ) = σ 2 B + σ 2 W. Expectations of between and within group mean squares are DF Mean Sq E(Mean Sq) Between groups k 1 M B σ 2 W + nσ2 B Within groups N k M W σ 2 W The F statistic tests H 0 : σ 2 B = 0. The overall mean Ȳ estimates m, with (σ 2 W + nσ2 B )/N. 20 / 34

24 s In the random the total of an observation σ 2 B + σ2 W is partitioned into σ2 B and σ2 W. Estimates are obtained by equating observed and expected mean squares: σ 2 W = M W, σ 2 B = (M B M W )/n. Source DF Sum Sq Mean Sq E(Mean Sq) Between families σ 2 W + 2σ2 B Within families σ 2 W Total In the finger ridges example, σ 2 W = 14.29, σ 2 B = / 34

25 Intraclass correlation s The intraclass correlation coefficient σ 2 B /(σ2 W + σ2 B ) measures the extent to which individuals in the same group are more similar than individuals in different groups. An estimate of the intraclass correlation is σ 2 B σ 2 B + σ 2 W M B M W = M B + (n 1)M W In the finger ridges example, σ 2 W = 14.29, σ 2 B = , and the estimated intraclass correlation is / 34

26 Cross-classification s In the sheep example, we may want to allow for the possibility that the measurement is affected by the sex of the animal. Suppose that the first three Blackface, first two Welsh, and first three Cross sheep are male, and the rest female. We now have a cross- classification of the data by breed and sex. Male Female Blackface Welsh Cross / 34

27 Two factors, cross-classification s breed - factor(rep(c( Blackface, Welsh, Cross ), each = 6)) sex - factor(rep(1:2, times = 3, each = 3), labels = c( male, female )) Cu - c(6.5,7.9,7.4,6.8,8.1,na, 10.4,9.8,NA,11.1,10.6,9.2, 6.9,9.2,8.4,7.6,9.7,8.9) fit - lm(cu breed + sex + breed:sex) anova(fit) Df Sum Sq Mean Sq F value breed sex breed:sex Residuals / 34

28 Sequential sums of squares s anova(fit) Df Sum Sq Mean Sq F value breed sex breed:sex Residuals ) Sum of squares for (ignoring sex). 2) Extra sum of squares obtained by fitting sex after breed (an additive : fitted values are sums of breed differences and the sex difference). 3) Sex difference is allowed to depend on breed. This is equivalent to a one-way anova with six levels, one for each combination of breed and sex. 25 / 34

29 Sequence of fits s Intercept b 0 has been omitted everywhere. breed only Blackface Welsh Cross Male 0 b 1 b 2 Female 0 b 1 b 2 additive Blackface Welsh Cross Male 0 b 1 b 2 Female c 1 b 1 + c 1 b 2 + c 1 saturated Blackface Welsh Cross Male 0 b 1 b 2 Female c 1 b 1 + c 1 + d 1 b 2 + c 1 + d 2 d 1 and d 2 are interaction terms (differences of differences). 26 / 34

30 s Mouse Litter An experiment to investigate variability in skin thickness of mice used 5 litters of 3 mice. For each mouse, skin thickness was measured twice by independent determinations. 27 / 34

31 example s In the random the total of an observation σ 2 B + σ2 W is partitioned into σ2 B and σ2 W. Estimates are obtained by equating observed and expected mean squares: σ 2 W = M W, σ 2 B = (M B M W )/n. Source DF Sum Sq Mean Sq E(Mean Sq) Between families σ 2 W + 2σ2 B Within families σ 2 W Total In the finger ridges example, σ 2 W = 14.29, σ 2 B = / 34

32 Mouse skin thickness example s Source DF SSQ MSQ E(MSQ) Litters σ 2 W + 2σ2 M + 6σ2 L Mice in litters σ 2 W + 2σ2 M Residual σ 2 W Estimated of Between observations, within mice 62.3 Between mice, within litters Between litters / 34

33 The mixed s There are several complications which can affect anova. a) It may be necessary to allow for (e.g. sex of mouse). b) The data may be unbalanced. For example, the number of readings per mouse may vary, perhaps because of missing observations. A solution to these problems is to use a mixed (with both fixed and random effects), and estimate by maximum likelihood. lme( ), lmer( ), MCMCglmm( ), asreml, / 34

34 A mixed s A simple example of a mixed is obtained if in the skin thickness example we treat litters as fixed rather than random. Expected mean squares are unchanged, except that the σ 2 L term in the expected mean square for litters is replaced by the sample of the. The test for litter effects is unchanged: F = between litters mean square between mice within litters mean square Compare the last part of farms example on problem sheet / 34

35 s Main assumptions underlying the anova procedure are: 1) Data are normally distributed. 2) The σw 2 is constant. Diagnostic plots, e.g. of residuals against fitted values, are used to check these assumptions (as for regression). If any of the assumptions is false, the situation can sometimes be improved by a transformation of the data. The most common transformations are the square root and logarithmic. These are most useful in correcting a tendency for the of an observation to increase with its size. 32 / 34

36 s breed - factor(rep(c( Blackface, Welsh, Cross ), c(5,5,6))) Cu - c(6.5,7.9,7.4,6.8,8.1,10.4,9.8,11.1,10.6,9.2, 6.9,9.2,8.4,7.6,9.7,8.9) lmfit - lm(cu breed) anova(lmfit) # anova table plot(lmfit) # diagnostic plots # alternatively... aovfit - aov(cu breed) summary(aovfit) # anova table 33 / 34

37 Two factors, nested classification s If litter and mouse are factors with 5 and 15 levels, aov(skin litter + Error(mouse)) produces two residual mean squares, a) between mice within litters (10 d.f.) b) between observations within mice (15 d.f.) Mean square with 10 d.f. is the denominator in the F test for differences among litters. The coding which numbers mice 1-3 within litters aov(skin litter + Error(litter:mouse)) works but is not recommended. 34 / 34

Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope

Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope Oct 2017 1 / 28 Minimum MSE Y is the response variable, X the predictor variable, E(X) = E(Y) = 0. BLUP of Y minimizes average discrepancy var (Y ux) = C YY 2u C XY + u 2 C XX This is minimized when u

More information

Multiple regression. Partial regression coefficients

Multiple regression. Partial regression coefficients Multiple regression We now generalise the results of simple linear regression to the case where there is one response variable Y and two predictor variables, X and Z. Data consist of n triplets of values

More information

13 Simple Linear Regression

13 Simple Linear Regression B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 3 Simple Linear Regression 3. An industrial example A study was undertaken to determine the effect of stirring rate on the amount of impurity

More information

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement

More information

Food consumption of rats. Two-way vs one-way vs nested ANOVA

Food consumption of rats. Two-way vs one-way vs nested ANOVA Food consumption of rats Lard Gender Fresh Rancid 709 592 Male 679 538 699 476 657 508 Female 594 505 677 539 Two-way vs one-way vs nested NOV T 1 2 * * M * * * * * * F * * * * T M1 M2 F1 F2 * * * * *

More information

Mixed-Models. version 30 October 2011

Mixed-Models. version 30 October 2011 Mixed-Models version 30 October 2011 Mixed models Mixed models estimate a vector! of fixed effects and one (or more) vectors u of random effects Both fixed and random effects models always include a vector

More information

Unbalanced Data in Factorials Types I, II, III SS Part 1

Unbalanced Data in Factorials Types I, II, III SS Part 1 Unbalanced Data in Factorials Types I, II, III SS Part 1 Chapter 10 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 14 When we perform an ANOVA, we try to quantify the amount of variability in the data accounted

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61

More information

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012 Mixed-Model Estimation of genetic variances Bruce Walsh lecture notes Uppsala EQG 01 course version 8 Jan 01 Estimation of Var(A) and Breeding Values in General Pedigrees The above designs (ANOVA, P-O

More information

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons: STAT 263/363: Experimental Design Winter 206/7 Lecture January 9 Lecturer: Minyong Lee Scribe: Zachary del Rosario. Design of Experiments Why perform Design of Experiments (DOE)? There are at least two

More information

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification, Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Remedial Measures, Brown-Forsythe test, F test

Remedial Measures, Brown-Forsythe test, F test Remedial Measures, Brown-Forsythe test, F test Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 7, Slide 1 Remedial Measures How do we know that the regression function

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3 STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

1 Multiple Regression

1 Multiple Regression 1 Multiple Regression In this section, we extend the linear model to the case of several quantitative explanatory variables. There are many issues involved in this problem and this section serves only

More information

2.1 Linear regression with matrices

2.1 Linear regression with matrices 21 Linear regression with matrices The values of the independent variables are united into the matrix X (design matrix), the values of the outcome and the coefficient are represented by the vectors Y and

More information

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference. Understanding regression output from software Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals In 1966 Cyril Burt published a paper called The genetic determination of differences

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

SPH 247 Statistical Analysis of Laboratory Data

SPH 247 Statistical Analysis of Laboratory Data SPH 247 Statistical Analysis of Laboratory Data March 31, 2015 SPH 247 Statistical Analysis of Laboratory Data 1 ANOVA Fixed and Random Effects We will review the analysis of variance (ANOVA) and then

More information

STAT 501 EXAM I NAME Spring 1999

STAT 501 EXAM I NAME Spring 1999 STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your

More information

CAS MA575 Linear Models

CAS MA575 Linear Models CAS MA575 Linear Models Boston University, Fall 2013 Midterm Exam (Correction) Instructor: Cedric Ginestet Date: 22 Oct 2013. Maximal Score: 200pts. Please Note: You will only be graded on work and answers

More information

Supplementary File 3: Tutorial for ASReml-R. Tutorial 1 (ASReml-R) - Estimating the heritability of birth weight

Supplementary File 3: Tutorial for ASReml-R. Tutorial 1 (ASReml-R) - Estimating the heritability of birth weight Supplementary File 3: Tutorial for ASReml-R Tutorial 1 (ASReml-R) - Estimating the heritability of birth weight This tutorial will demonstrate how to run a univariate animal model using the software ASReml

More information

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector

More information

Nesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik

Nesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik Nesting and Mixed Effects: Part I Lukas Meier, Seminar für Statistik Where do we stand? So far: Fixed effects Random effects Both in the factorial context Now: Nested factor structure Mixed models: a combination

More information

Lecture 4. Basic Designs for Estimation of Genetic Parameters

Lecture 4. Basic Designs for Estimation of Genetic Parameters Lecture 4 Basic Designs for Estimation of Genetic Parameters Bruce Walsh. Aug 003. Nordic Summer Course Heritability The reason for our focus, indeed obsession, on the heritability is that it determines

More information

Multiple Regression: Example

Multiple Regression: Example Multiple Regression: Example Cobb-Douglas Production Function The Cobb-Douglas production function for observed economic data i = 1,..., n may be expressed as where O i is output l i is labour input c

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

STAT 525 Fall Final exam. Tuesday December 14, 2010

STAT 525 Fall Final exam. Tuesday December 14, 2010 STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 Lecture 3: Linear Models Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector of observed

More information

The First Thing You Ever Do When Receive a Set of Data Is

The First Thing You Ever Do When Receive a Set of Data Is The First Thing You Ever Do When Receive a Set of Data Is Understand the goal of the study What are the objectives of the study? What would the person like to see from the data? Understand the methodology

More information

Resemblance among relatives

Resemblance among relatives Resemblance among relatives Introduction Just as individuals may differ from one another in phenotype because they have different genotypes, because they developed in different environments, or both, relatives

More information

10.1 Simple Linear Regression

10.1 Simple Linear Regression 10.1 Simple Linear Regression Ulrich Hoensch Tuesday, December 1, 2009 The Simple Linear Regression Model We have two quantitative random variables X (the explanatory variable) and Y (the response variable).

More information

Simple Linear Regression

Simple Linear Regression Chapter 2 Simple Linear Regression Linear Regression with One Independent Variable 2.1 Introduction In Chapter 1 we introduced the linear model as an alternative for making inferences on means of one or

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

School of Mathematical Sciences. Question 1

School of Mathematical Sciences. Question 1 School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 8 and Assignment 7 Solutions Question 1 Figure 1: The residual plots do not contradict the model assumptions of normality, constant

More information

Multiple comparisons - subsequent inferences for two-way ANOVA

Multiple comparisons - subsequent inferences for two-way ANOVA 1 Multiple comparisons - subsequent inferences for two-way ANOVA the kinds of inferences to be made after the F tests of a two-way ANOVA depend on the results if none of the F tests lead to rejection of

More information

ASReml Tutorial: D1 GLM & GLMM p. 1. ASReml tutorial D1 GLM & GLMM. Arthur Gilmour

ASReml Tutorial: D1 GLM & GLMM p. 1. ASReml tutorial D1 GLM & GLMM. Arthur Gilmour ASReml tutorial D1 GLM & GLMM Arthur Gilmour ASReml Tutorial: D1 GLM & GLMM p. 1 ASReml tutorial D1 GLM & GLMM Arthur Gilmour ASReml Tutorial: D1 GLM & GLMM p. 2 Generalized Linear Mixed models Iteratively

More information

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38 BIO5312 Biostatistics Lecture 11: Multisample Hypothesis Testing II Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/8/2016 1/38 Outline In this lecture, we will continue to

More information

CHAPTER 10. Regression and Correlation

CHAPTER 10. Regression and Correlation CHAPTER 10 Regression and Correlation In this Chapter we assess the strength of the linear relationship between two continuous variables. If a significant linear relationship is found, the next step would

More information

Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares

Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares K&W introduce the notion of a simple experiment with two conditions. Note that the raw data (p. 16)

More information

Chapter 1 Linear Regression with One Predictor

Chapter 1 Linear Regression with One Predictor STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the

More information

STA 101 Final Review

STA 101 Final Review STA 101 Final Review Statistics 101 Thomas Leininger June 24, 2013 Announcements All work (besides projects) should be returned to you and should be entered on Sakai. Office Hour: 2 3pm today (Old Chem

More information

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a

More information

STAT5044: Regression and Anova. Inyoung Kim

STAT5044: Regression and Anova. Inyoung Kim STAT5044: Regression and Anova Inyoung Kim 2 / 47 Outline 1 Regression 2 Simple Linear regression 3 Basic concepts in regression 4 How to estimate unknown parameters 5 Properties of Least Squares Estimators:

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont. TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted

More information

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013 Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 013 1 Estimation of Var(A) and Breeding Values in General Pedigrees The classic

More information

Math 423/533: The Main Theoretical Topics

Math 423/533: The Main Theoretical Topics Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information

Multiple Regression Examples

Multiple Regression Examples Multiple Regression Examples Example: Tree data. we have seen that a simple linear regression of usable volume on diameter at chest height is not suitable, but that a quadratic model y = β 0 + β 1 x +

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

STAT5044: Regression and Anova

STAT5044: Regression and Anova STAT5044: Regression and Anova Inyoung Kim 1 / 25 Outline 1 Multiple Linear Regression 2 / 25 Basic Idea An extra sum of squares: the marginal reduction in the error sum of squares when one or several

More information

Statistics 5100 Spring 2018 Exam 1

Statistics 5100 Spring 2018 Exam 1 Statistics 5100 Spring 2018 Exam 1 Directions: You have 60 minutes to complete the exam. Be sure to answer every question, and do not spend too much time on any part of any question. Be concise with all

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Coefficient of Determination

Coefficient of Determination Coefficient of Determination ST 430/514 The coefficient of determination, R 2, is defined as before: R 2 = 1 SS E (yi ŷ i ) = 1 2 SS yy (yi ȳ) 2 The interpretation of R 2 is still the fraction of variance

More information

STK4900/ Lecture 3. Program

STK4900/ Lecture 3. Program STK4900/9900 - Lecture 3 Program 1. Multiple regression: Data structure and basic questions 2. The multiple linear regression model 3. Categorical predictors 4. Planned experiments and observational studies

More information

Lecture 3: Inference in SLR

Lecture 3: Inference in SLR Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

Latin Square Designs (LS) Clarice G. B. Demétrio

Latin Square Designs (LS) Clarice G. B. Demétrio Estatística Experimental: Session 4 1 Latin Square Designs (LS) based on Chris Brien s notes (University of South Australia, Adelaide), http://chris.brien.name/ee2/ Clarice G. B. Demétrio Departamento

More information

Chapter 11. Analysis of Variance (One-Way)

Chapter 11. Analysis of Variance (One-Way) Chapter 11 Analysis of Variance (One-Way) We now develop a statistical procedure for comparing the means of two or more groups, known as analysis of variance or ANOVA. These groups might be the result

More information

Lecture 10. Factorial experiments (2-way ANOVA etc)

Lecture 10. Factorial experiments (2-way ANOVA etc) Lecture 10. Factorial experiments (2-way ANOVA etc) Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regression and Analysis of Variance autumn 2014 A factorial experiment

More information

Sociology 593 Exam 1 February 14, 1997

Sociology 593 Exam 1 February 14, 1997 Sociology 9 Exam February, 997 I. True-False. ( points) Indicate whether the following statements are true or false. If false, briefly explain why.. There are IVs in a multiple regression model. If the

More information

Lecture 14 Topic 10: ANOVA models for random and mixed effects. Fixed and Random Models in One-way Classification Experiments

Lecture 14 Topic 10: ANOVA models for random and mixed effects. Fixed and Random Models in One-way Classification Experiments Lecture 14 Topic 10: ANOVA models for random and mixed effects To this point, we have considered only the Fixed Model (Model I) ANOVA; now we will extend the method of ANOVA to other experimental objectives.

More information

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018 UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018 Work all problems. 60 points are needed to pass at the Masters Level and 75

More information

Model Specification and Data Problems. Part VIII

Model Specification and Data Problems. Part VIII Part VIII Model Specification and Data Problems As of Oct 24, 2017 1 Model Specification and Data Problems RESET test Non-nested alternatives Outliers A functional form misspecification generally means

More information

Lecture 15. Hypothesis testing in the linear model

Lecture 15. Hypothesis testing in the linear model 14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma

More information

Generalized linear mixed models for biologists

Generalized linear mixed models for biologists Generalized linear mixed models for biologists McMaster University 7 May 2009 Outline 1 2 Outline 1 2 Coral protection by symbionts 10 Number of predation events Number of blocks 8 6 4 2 2 2 1 0 2 0 2

More information

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Linear models and their mathematical foundations: Simple linear regression

Linear models and their mathematical foundations: Simple linear regression Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction

More information

L6: Regression II. JJ Chen. July 2, 2015

L6: Regression II. JJ Chen. July 2, 2015 L6: Regression II JJ Chen July 2, 2015 Today s Plan Review basic inference based on Sample average Difference in sample average Extrapolate the knowledge to sample regression coefficients Standard error,

More information

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3 Table of contents TABLE OF CONTENTS...1 1 INTRODUCTION TO MIXED-EFFECTS MODELS...3 Fixed-effects regression ignoring data clustering...5 Fixed-effects regression including data clustering...1 Fixed-effects

More information

Homework 2: Simple Linear Regression

Homework 2: Simple Linear Regression STAT 4385 Applied Regression Analysis Homework : Simple Linear Regression (Simple Linear Regression) Thirty (n = 30) College graduates who have recently entered the job market. For each student, the CGPA

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 009 MODULE 4 : Linear models Time allowed: One and a half hours Candidates should answer THREE questions. Each question carries

More information

Introduction to Linear regression analysis. Part 2. Model comparisons

Introduction to Linear regression analysis. Part 2. Model comparisons Introduction to Linear regression analysis Part Model comparisons 1 ANOVA for regression Total variation in Y SS Total = Variation explained by regression with X SS Regression + Residual variation SS Residual

More information

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables. One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

FACTORIAL DESIGNS and NESTED DESIGNS

FACTORIAL DESIGNS and NESTED DESIGNS Experimental Design and Statistical Methods Workshop FACTORIAL DESIGNS and NESTED DESIGNS Jesús Piedrafita Arilla jesus.piedrafita@uab.cat Departament de Ciència Animal i dels Aliments Items Factorial

More information

Lecture 1: Linear Models and Applications

Lecture 1: Linear Models and Applications Lecture 1: Linear Models and Applications Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction to linear models Exploratory data analysis (EDA) Estimation

More information

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable, Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/2 01 Examination Date Time Pages Final December 2002 3 hours 6 Instructors Course Examiner Marks Y.P.

More information

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Inference ME104: Linear Regression Analysis Kenneth Benoit August 15, 2012 August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Stata output resvisited. reg votes1st spend_total incumb minister

More information

Announcements. Final Review: Units 1-7

Announcements. Final Review: Units 1-7 Announcements Announcements Final : Units 1-7 Statistics 104 Mine Çetinkaya-Rundel June 24, 2013 Final on Wed: cheat sheet (one sheet, front and back) and calculator Must have webcam + audio on at all

More information

Lecture 22 Mixed Effects Models III Nested designs

Lecture 22 Mixed Effects Models III Nested designs Lecture 22 Mixed Effects Models III Nested designs 94 Introduction: Crossed Designs The two-factor designs considered so far involve every level of the first factor occurring with every level of the second

More information

22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)

22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) 22s:152 Applied Linear Regression Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) We now consider an analysis with only categorical predictors (i.e. all predictors are

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

SCHOOL OF MATHEMATICS AND STATISTICS

SCHOOL OF MATHEMATICS AND STATISTICS RESTRICTED OPEN BOOK EXAMINATION (Not to be removed from the examination hall) Data provided: Statistics Tables by H.R. Neave MAS5052 SCHOOL OF MATHEMATICS AND STATISTICS Basic Statistics Spring Semester

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative

More information

STAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing

STAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing STAT763: Applied Regression Analysis Multiple linear regression 4.4 Hypothesis testing Chunsheng Ma E-mail: cma@math.wichita.edu 4.4.1 Significance of regression Null hypothesis (Test whether all β j =

More information