Example: Poisondata. 22s:152 Applied Linear Regression. Chapter 8: ANOVA
|
|
- Cornelius Anderson
- 5 years ago
- Views:
Transcription
1 s:5 Applied Linear Regression Chapter 8: ANOVA Two-way ANOVA Used to compare populations means when the populations are classified by two factors (or categorical variables) For example sex and occupation for predicting salary advantages compared to doing two separate one-way ANOVAs more efficient to study two factors simultaneously rather than separately can reduce residual variation in a model by including a second factor believed to affect the response (more predictors) can study interaction between factors Example: Poisondata The data give the survival times (in 0 hour units) in axfactorialexperiment,thefactorsbeing (a) three poisons and (b) four treatments. Each combination of the two factors is used for four animals, the allocation to animals being completely randomized. The variables in the dataset are: Time : Survival time (in 0 hour units) Poison : Factor with levels,, : Factor with levels,,, Cells in a two-way ANOVA: Notation first categorical variable: factor A with a levels second categorical variable: factor B with b levels n ij is the sample size for level i of A and level j of B N is total observations Cell means model: Y ijk = µ ij + ijk with ijk iid N(0, σ ) µ ij is population mean for level i of A and level j of B (there are ab distinct cell means) Allows for freely fit mean in each cell It will take ab parameters to describe the mean structure. Y ijk is kth observation from population with level i of A and level j of B
2 Effects model: Y ijk = µ + α i + β j + γ ij + ijk α i is the main effect parameter for level i of A β j is the main effect parameter for level j of B γ ij is the interaction parameter for level i of A and level j of B There are (+a+b+ab)parameterstodescribethe mean structure, this is an over-parameterization. We ll need to use constraints to make the parameters estimable. But, the effects model can be useful when there are only main effects present (no interaction) because it s a much simpler model to interpret (more on this later). Y ijk is kth observation from population with level i of A and level j of B 5 Example: Poisondata Time : Survival time (in 0 hour units) Poison : Factor with levels,, : Factor with levels,,, > poison.data=read.delim("poison.data) > head(poison.data) Poison Time > levels() NULL ## R sees it as numeric, redefine as categorical: > =as.factor() > is.factor() [] TRUE > levels() [] "" "" "" "" 6 > levels(poison) NULL ##Seen as numeric, redefine as categorical: > Poison=as.factor(Poison) > is.factor(poison) [] TRUE > levels(poison) [] "" "" "" Get the cell means: > means=tapply(time,list(poison,),mean) > means Get the number of observations in each cell: > table(poison,) Poison 7 This is a balanced -way ANOVA. To get the appropriate tests in R in a - way ANOVA, we will use the sum-to-zero constraints for the dummy regressors. These are: i α i =0, j β j =0and i,j γ ij =0 > contrasts(poison) ## Reassign using sum-to-zero constraint: > contrasts(poison)=contr.sum(levels(poison)) > contrasts(poison) [,] [,] ## Do the same for the other factor: > contrasts()
3 > contrasts()=contr.sum(levels()) > contrasts() [,] [,] [,] You can get a quick plot of the means using the interaction.plot function. > interaction.plot(poison,,time) mean of Time More sophisticated plot: > poison.num=as.numeric(poison) > plot(c(0.5,.5),range(time),type="n",xlab="poisson", ylab="survival Time",axes=F,main="Survival Time vs. Poison by group (jittered)") > axis() > axis(,at=:,c("","","")) > box() > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) > lines(c(,,),means[,],col=) > lines(c(,,),means[,],col=) > lines(c(,,),means[,],col=) > lines(c(,,),means[,],col=) > legend(,.,c("trt ","TRT ","TRT ","TRT "),col=c(,,,),lty=) Poison 9 0 Survival Time Survival Time vs. Poison by group (jittered) Poisson TRT TRT TRT TRT Fit the model that includes interaction and get the overall F -test: > lm.out=lm(time ~ Poison + + Poison:) > summary(lm.out) Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) < e-6 *** Poison e-05 *** Poison * e-05 *** e-06 *** * Poison: Poison: Poison: Poison: Poison: Poison: Signif. codes: 0 *** 0.00 ** 0.0 * Residual standard error: 0.9 on 6 degrees of freedom Multiple R-squared: 0.75,Adjusted R-squared: 0.65 F-statistic: 9.0 on and 6 DF, p-value:.986e-07 The overall F -test p-value is quite small, and we reject the overall F -test. At least one of the terms (Poison,, or Poison:) in the model is significant.
4 The summary() command also provides a test for each dummy regressor for each factor, but we re probably more interested in testing for each factor as a whole, initially. The function Anova (notice the capital A) is a function in the car library. We can use this to compute type III sums of squares, and do appropriate tests for the effects in the model. > Anova(lm.out,type="III") Anova Table (Type III tests) Response: Time (Intercept) <.e-6 *** Poison.00.7.e-07 *** e-06 *** Poison: Residuals Signif. codes: 0 *** 0.00 ** 0.0 * NOTE: We need to be careful using the anova function for unbalanced data, because it computes type I sums of squares (sequential sums of squares) for testing the significance of the terms. We want type III sums of squares. In models that include interaction (as this one does), a common first step is to test for the significance of the interaction. If there is no interaction, we perceive the true profile plot of the means to be parallel. mean of Time Poison The interaction is NOT significant in this model. Poison: So, these lines (connecting the cell means) are not significantly different than parallel. Amodellikethis,withnosignificantinteraction, is said to have only main effects. We can plot the main effects fitted model, which shows parallel lines: Main effects model (Survival Time vs. Poison by ) When the interaction is not significant, we can go back and fit the reduced model (without interaction), and use this reduced model for the rest of the analysis. > lm.no.interaction=lm(time ~ Poison + ) Survival Time Poisson TRT TRT TRT TRT > Anova(lm.no.interaction,type="III") Anova Table (Type III tests) Response: Time (Intercept) <.e-6 *** Poison e-07 *** e-06 *** Residuals.0509 Signif. codes: 0 *** 0.00 ** 0.0 * Since there is no significant interaction, we can say The effect is the same for all levels of Poison. Or, at each Poison level, the effect is the same. Effects model with only Main Effects: Y ijk = µ + α i + β j + ijk Both main effects are significant here. 5 6
5 We can get the parameter estimates like usual: > lm.no.interaction$coefficients (Intercept) Poison Poison You can also get the fitted values using the lm() output: > cbind(poison,,lm.no.interaction$fitted) Poison There will be unique fitted values in this -way ANOVA (one for each cell). 7 We could also plug-in the estimated parameters to the main effects model to get the fitted values... Ŷ ijk = ˆµ + ˆα i + ˆβ j To plot the main effects fitted model with a profile plot, we can get the fitted means using the parameters, and then connect the relevant fitted values. > coeff=lm.no.interaction$coefficients > coeff (Intercept) Poison Poison > ## Sum the relevant parameter estiamates to get the fitted values: > mu.=sum(coeff[c(,,)]) > mu.=sum(coeff[c(,,)]) > mu.=sum(coeff[c(,)])-sum(coeff[c(,)]) > mu.=sum(coeff[c(,,5)]) > mu.=sum(coeff[c(,,5)]) > mu.=sum(coeff[c(,5)])-sum(coeff[c(,)]) > mu.=sum(coeff[c(,,6)]) > mu.=sum(coeff[c(,,6)]) > mu.=sum(coeff[c(,6)])-sum(coeff[c(,)]) > mu.=sum(coeff[c(,)])-sum(coeff[c(,5,6)]) > mu.=sum(coeff[c(,)])-sum(coeff[c(,5,6)]) > mu.=coeff[]-sum(coeff[c(,)])-sum(coeff[c(,5,6)]) 8 ## Create vector of means for each treatment: > TRT..means=c(mu.,mu.,mu.) > TRT..means=c(mu.,mu.,mu.) > TRT..means=c(mu.,mu.,mu.) > TRT..means=c(mu.,mu.,mu.) > plot(c(0.5,.5),range(time),type="n",xlab="poisson", ylab="survival Time",axes=F,main="Main effects model (Survival Time vs. Poison by )") > axis() > axis(,at=:,c("","","")) > box() > poison.num=as.numeric(poison) > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) > points(jitter(poison.num[==""],factor=0.5), Time[==""],pch="",col=) Survival Time Main effects model (Survival Time vs. Poison by ) Poisson TRT TRT TRT TRT > lines(c(,,),trt..means,col=) > lines(c(,,),trt..means,col=) > lines(c(,,),trt..means,col=) > lines(c(,,),trt..means,col=) > legend(,.,c("trt ","TRT ","TRT ","TRT "),col=c(,,,),lty=) 9 0
6 If there IS significant interaction, we shouldn t consider the tests for main effects (i.e. don t try to use the tests for Poison or in the Anova() output if Poison: was significant). Instead, we should test for the effect separately at each Poison level (because the effect depends on the level of Poison, and really, there isn t a common global effect to be testing all at once). mean of Time Poison Another option for testing for interaction is to use a Partial F -test. > lm.no.interaction=lm(time ~ Poison + ) > anova(lm.no.interaction,lm.out) Analysis of Variance Table Model : Time ~ Poison + Model : Time ~ Poison + + Poison: Res.Df RSS Df Sum of Sq F Pr(>F) This p-value is the same as the one seen in... > Anova(lm.out,type="III") Anova Table (Type III tests) Response: Time (Intercept) <.e-6 *** Poison.00.7.e-07 *** e-06 *** Poison: Residuals Signif. codes: 0 *** 0.00 ** 0.0 * Different types of interaction ( factors/ levels) Cells: Different types of interaction ( factors/ levels) Factor B is in same direction for all levels of Factor A: (but not parallel lines) No interaction: (parallel lines) Factor B is in different direction for different levels of Factor A: (lines cross)
7 Assumptions of two-way ANOVA independent simple random samples of size n ij from each of a b normal populations. constant variance across all populations ijk are normally distributed as N(0, σ ) A general ANOVA table for two-way ANOVA Source Sum of Squares df Mean Square F A SS A a- B SS B b- AB SS AB (a-)(b-) SS A a = MS A SS B b = MS B SS AB (a )(b ) = MS AB MS A MSE MS B MSE MS AB MSE Residuals RSS n-ab RSS n ab = MSE Total TSS N- Sums of squares We still have the same fundamental relationship that TSS = RegSS + RSS. The RegSS represents the variability explained by the model, which includes the terms of main effects for factor A and B and the interaction between A and B, in the full model. 5 If we have equal sample sizes in each cell (a balanced design), then TSS = SS A + SS B + SS AB + RSS But if we have an unbalanced design, then this equation does not hold (i.e. RegSS = SS A +SS B +SS AB ). In this case, we can still test for effects using the Type III sums of squares (as with the Anova() function) looking for a significant effect given all other factors are accounted for. 6
22s:152 Applied Linear Regression. Take random samples from each of m populations.
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More information22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More information22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)
22s:152 Applied Linear Regression Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) We now consider an analysis with only categorical predictors (i.e. all predictors are
More information22s:152 Applied Linear Regression
22s:152 Applied Linear Regression Chapter 7: Dummy Variable Regression So far, we ve only considered quantitative variables in our models. We can integrate categorical predictors by constructing artificial
More informationRecall that a measure of fit is the sum of squared residuals: where. The F-test statistic may be written as:
1 Joint hypotheses The null and alternative hypotheses can usually be interpreted as a restricted model ( ) and an model ( ). In our example: Note that if the model fits significantly better than the restricted
More information22s:152 Applied Linear Regression. 1-way ANOVA visual:
22s:152 Applied Linear Regression 1-way ANOVA visual: Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 Y We now consider an analysis
More informationUnbalanced Data in Factorials Types I, II, III SS Part 1
Unbalanced Data in Factorials Types I, II, III SS Part 1 Chapter 10 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 14 When we perform an ANOVA, we try to quantify the amount of variability in the data accounted
More information1.) Fit the full model, i.e., allow for separate regression lines (different slopes and intercepts) for each species
Lecture notes 2/22/2000 Dummy variables and extra SS F-test Page 1 Crab claw size and closing force. Problem 7.25, 10.9, and 10.10 Regression for all species at once, i.e., include dummy variables for
More informationResidual Analysis for two-way ANOVA The twoway model with K replicates, including interaction,
Residual Analysis for two-way ANOVA The twoway model with K replicates, including interaction, is Y ijk = µ ij + ɛ ijk = µ + α i + β j + γ ij + ɛ ijk with i = 1,..., I, j = 1,..., J, k = 1,..., K. In carrying
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More information1 Use of indicator random variables. (Chapter 8)
1 Use of indicator random variables. (Chapter 8) let I(A) = 1 if the event A occurs, and I(A) = 0 otherwise. I(A) is referred to as the indicator of the event A. The notation I A is often used. 1 2 Fitting
More informationNC Births, ANOVA & F-tests
Math 158, Spring 2018 Jo Hardin Multiple Regression II R code Decomposition of Sums of Squares (and F-tests) NC Births, ANOVA & F-tests A description of the data is given at http://pages.pomona.edu/~jsh04747/courses/math58/
More informationLab 3 A Quick Introduction to Multiple Linear Regression Psychology The Multiple Linear Regression Model
Lab 3 A Quick Introduction to Multiple Linear Regression Psychology 310 Instructions.Work through the lab, saving the output as you go. You will be submitting your assignment as an R Markdown document.
More informationMultiple Regression: Example
Multiple Regression: Example Cobb-Douglas Production Function The Cobb-Douglas production function for observed economic data i = 1,..., n may be expressed as where O i is output l i is labour input c
More informationRegression and the 2-Sample t
Regression and the 2-Sample t James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Regression and the 2-Sample t 1 / 44 Regression
More informationGeneral Linear Statistical Models - Part III
General Linear Statistical Models - Part III Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Interaction Models Lets examine two models involving Weight and Domestic in the cars93 dataset.
More informationFACTORIAL DESIGNS and NESTED DESIGNS
Experimental Design and Statistical Methods Workshop FACTORIAL DESIGNS and NESTED DESIGNS Jesús Piedrafita Arilla jesus.piedrafita@uab.cat Departament de Ciència Animal i dels Aliments Items Factorial
More informationR 2 and F -Tests and ANOVA
R 2 and F -Tests and ANOVA December 6, 2018 1 Partition of Sums of Squares The distance from any point y i in a collection of data, to the mean of the data ȳ, is the deviation, written as y i ȳ. Definition.
More information22s:152 Applied Linear Regression. Chapter 5: Ordinary Least Squares Regression. Part 1: Simple Linear Regression Introduction and Estimation
22s:152 Applied Linear Regression Chapter 5: Ordinary Least Squares Regression Part 1: Simple Linear Regression Introduction and Estimation Methods for studying the relationship of two or more quantitative
More informationActivity #12: More regression topics: LOWESS; polynomial, nonlinear, robust, quantile; ANOVA as regression
Activity #12: More regression topics: LOWESS; polynomial, nonlinear, robust, quantile; ANOVA as regression Scenario: 31 counts (over a 30-second period) were recorded from a Geiger counter at a nuclear
More informationWorkshop 7.4a: Single factor ANOVA
-1- Workshop 7.4a: Single factor ANOVA Murray Logan November 23, 2016 Table of contents 1 Revision 1 2 Anova Parameterization 2 3 Partitioning of variance (ANOVA) 10 4 Worked Examples 13 1. Revision 1.1.
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationMODELS WITHOUT AN INTERCEPT
Consider the balanced two factor design MODELS WITHOUT AN INTERCEPT Factor A 3 levels, indexed j 0, 1, 2; Factor B 5 levels, indexed l 0, 1, 2, 3, 4; n jl 4 replicate observations for each factor level
More informationOne-way ANOVA (Single-Factor CRD)
One-way ANOVA (Single-Factor CRD) STAT:5201 Week 3: Lecture 3 1 / 23 One-way ANOVA We have already described a completed randomized design (CRD) where treatments are randomly assigned to EUs. There is
More informationAnalysis of variance. Gilles Guillot. September 30, Gilles Guillot September 30, / 29
Analysis of variance Gilles Guillot gigu@dtu.dk September 30, 2013 Gilles Guillot (gigu@dtu.dk) September 30, 2013 1 / 29 1 Introductory example 2 One-way ANOVA 3 Two-way ANOVA 4 Two-way ANOVA with interactions
More informationMultiple Predictor Variables: ANOVA
Multiple Predictor Variables: ANOVA 1/32 Linear Models with Many Predictors Multiple regression has many predictors BUT - so did 1-way ANOVA if treatments had 2 levels What if there are multiple treatment
More informationStat 412/512 TWO WAY ANOVA. Charlotte Wickham. stat512.cwick.co.nz. Feb
Stat 42/52 TWO WAY ANOVA Feb 6 25 Charlotte Wickham stat52.cwick.co.nz Roadmap DONE: Understand what a multiple regression model is. Know how to do inference on single and multiple parameters. Some extra
More informationWeek 7 Multiple factors. Ch , Some miscellaneous parts
Week 7 Multiple factors Ch. 18-19, Some miscellaneous parts Multiple Factors Most experiments will involve multiple factors, some of which will be nuisance variables Dealing with these factors requires
More informationTests of Linear Restrictions
Tests of Linear Restrictions 1. Linear Restricted in Regression Models In this tutorial, we consider tests on general linear restrictions on regression coefficients. In other tutorials, we examine some
More informationST430 Exam 2 Solutions
ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationNested 2-Way ANOVA as Linear Models - Unbalanced Example
Linear Models Nested -Way ANOVA ORIGIN As with other linear models, unbalanced data require use of the regression approach, in this case by contrast coding of independent variables using a scheme not described
More informationComparing Nested Models
Comparing Nested Models ST 370 Two regression models are called nested if one contains all the predictors of the other, and some additional predictors. For example, the first-order model in two independent
More informationStatistics Lab #6 Factorial ANOVA
Statistics Lab #6 Factorial ANOVA PSYCH 710 Initialize R Initialize R by entering the following commands at the prompt. You must type the commands exactly as shown. options(contrasts=c("contr.sum","contr.poly")
More information20.1. Balanced One-Way Classification Cell means parametrization: ε 1. ε I. + ˆɛ 2 ij =
20. ONE-WAY ANALYSIS OF VARIANCE 1 20.1. Balanced One-Way Classification Cell means parametrization: Y ij = µ i + ε ij, i = 1,..., I; j = 1,..., J, ε ij N(0, σ 2 ), In matrix form, Y = Xβ + ε, or 1 Y J
More informationInference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58
Inference ME104: Linear Regression Analysis Kenneth Benoit August 15, 2012 August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Stata output resvisited. reg votes1st spend_total incumb minister
More informationANOVA (Analysis of Variance) output RLS 11/20/2016
ANOVA (Analysis of Variance) output RLS 11/20/2016 1. Analysis of Variance (ANOVA) The goal of ANOVA is to see if the variation in the data can explain enough to see if there are differences in the means.
More informationLinear Regression. In this lecture we will study a particular type of regression model: the linear regression model
1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor
More information5. Linear Regression
5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4
More informationStatistics 512: Solution to Homework#11. Problems 1-3 refer to the soybean sausage dataset of Problem 20.8 (ch21pr08.dat).
Statistics 512: Solution to Homework#11 Problems 1-3 refer to the soybean sausage dataset of Problem 20.8 (ch21pr08.dat). 1. Perform the two-way ANOVA without interaction for this model. Use the results
More informationLecture 15. Hypothesis testing in the linear model
14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma
More informationLinear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is.
Linear regression We have that the estimated mean in linear regression is The standard error of ˆµ Y X=x is where x = 1 n s.e.(ˆµ Y X=x ) = σ ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. 1 n + (x x)2 i (x i x) 2 i x i. The
More informationTwo-Way Factorial Designs
81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like
More information36-707: Regression Analysis Homework Solutions. Homework 3
36-707: Regression Analysis Homework Solutions Homework 3 Fall 2012 Problem 1 Y i = βx i + ɛ i, i {1, 2,..., n}. (a) Find the LS estimator of β: RSS = Σ n i=1(y i βx i ) 2 RSS β = Σ n i=1( 2X i )(Y i βx
More informationExplanatory Variables Must be Linear Independent...
Explanatory Variables Must be Linear Independent... Recall the multiple linear regression model Y j = β 0 + β 1 X 1j + β 2 X 2j + + β p X pj + ε j, i = 1,, n. is a shorthand for n linear relationships
More informationInference for Regression
Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationStat 5303 (Oehlert): Analysis of CR Designs; January
Stat 5303 (Oehlert): Analysis of CR Designs; January 2016 1 > resin
More informationSTAT22200 Spring 2014 Chapter 8A
STAT22200 Spring 2014 Chapter 8A Yibi Huang May 13, 2014 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley,
More informationChapter 3: Multiple Regression. August 14, 2018
Chapter 3: Multiple Regression August 14, 2018 1 The multiple linear regression model The model y = β 0 +β 1 x 1 + +β k x k +ǫ (1) is called a multiple linear regression model with k regressors. The parametersβ
More informationST430 Exam 1 with Answers
ST430 Exam 1 with Answers Date: October 5, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textook are permitted but you may use a calculator.
More informationLecture 6 Multiple Linear Regression, cont.
Lecture 6 Multiple Linear Regression, cont. BIOST 515 January 22, 2004 BIOST 515, Lecture 6 Testing general linear hypotheses Suppose we are interested in testing linear combinations of the regression
More informationVariance Decomposition and Goodness of Fit
Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings
More informationSimple Linear Regression
Simple Linear Regression MATH 282A Introduction to Computational Statistics University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/ eariasca/math282a.html MATH 282A University
More information1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative
More information11 Factors, ANOVA, and Regression: SAS versus Splus
Adapted from P. Smith, and expanded 11 Factors, ANOVA, and Regression: SAS versus Splus Factors. A factor is a variable with finitely many values or levels which is treated as a predictor within regression-type
More information2-way analysis of variance
2-way analysis of variance We may be considering the effect of two factors (A and B) on our response variable, for instance fertilizer and variety on maize yield; or therapy and sex on cholesterol level.
More informationSTAT 705 Chapter 16: One-way ANOVA
STAT 705 Chapter 16: One-way ANOVA Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 21 What is ANOVA? Analysis of variance (ANOVA) models are regression
More information22s:152 Applied Linear Regression. Returning to a continuous response variable Y...
22s:152 Applied Linear Regression Generalized Least Squares Returning to a continuous response variable Y... Ordinary Least Squares Estimation The classical models we have fit so far with a continuous
More informationBiostatistics 380 Multiple Regression 1. Multiple Regression
Biostatistics 0 Multiple Regression ORIGIN 0 Multiple Regression Multiple Regression is an extension of the technique of linear regression to describe the relationship between a single dependent (response)
More informationChapter 8 Conclusion
1 Chapter 8 Conclusion Three questions about test scores (score) and student-teacher ratio (str): a) After controlling for differences in economic characteristics of different districts, does the effect
More informationStats fest Analysis of variance. Single factor ANOVA. Aims. Single factor ANOVA. Data
1 Stats fest 2007 Analysis of variance murray.logan@sci.monash.edu.au Single factor ANOVA 2 Aims Description Investigate differences between population means Explanation How much of the variation in response
More information1 Multiple Regression
1 Multiple Regression In this section, we extend the linear model to the case of several quantitative explanatory variables. There are many issues involved in this problem and this section serves only
More informationGenerating OLS Results Manually via R
Generating OLS Results Manually via R Sujan Bandyopadhyay Statistical softwares and packages have made it extremely easy for people to run regression analyses. Packages like lm in R or the reg command
More informationLecture 10. Factorial experiments (2-way ANOVA etc)
Lecture 10. Factorial experiments (2-way ANOVA etc) Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regression and Analysis of Variance autumn 2014 A factorial experiment
More information22s:152 Applied Linear Regression. In matrix notation, we can write this model: Generalized Least Squares. Y = Xβ + ɛ with ɛ N n (0, Σ)
22s:152 Applied Linear Regression Generalized Least Squares Returning to a continuous response variable Y Ordinary Least Squares Estimation The classical models we have fit so far with a continuous response
More informationLinear model selection and regularization
Linear model selection and regularization Problems with linear regression with least square 1. Prediction Accuracy: linear regression has low bias but suffer from high variance, especially when n p. It
More information1 Introduction 1. 2 The Multiple Regression Model 1
Multiple Linear Regression Contents 1 Introduction 1 2 The Multiple Regression Model 1 3 Setting Up a Multiple Regression Model 2 3.1 Introduction.............................. 2 3.2 Significance Tests
More informationCh 3: Multiple Linear Regression
Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery
More informationExtensions of One-Way ANOVA.
Extensions of One-Way ANOVA http://www.pelagicos.net/classes_biometry_fa18.htm What do I want You to Know What are two main limitations of ANOVA? What two approaches can follow a significant ANOVA? How
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationMultiple Predictor Variables: ANOVA
What if you manipulate two factors? Multiple Predictor Variables: ANOVA Block 1 Block 2 Block 3 Block 4 A B C D B C D A C D A B D A B C Randomized Controlled Blocked Design: Design where each treatment
More informationLecture 2. The Simple Linear Regression Model: Matrix Approach
Lecture 2 The Simple Linear Regression Model: Matrix Approach Matrix algebra Matrix representation of simple linear regression model 1 Vectors and Matrices Where it is necessary to consider a distribution
More information14 Multiple Linear Regression
B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 14 Multiple Linear Regression 14.1 The multiple linear regression model In simple linear regression, the response variable y is expressed in
More informationSTAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing
STAT763: Applied Regression Analysis Multiple linear regression 4.4 Hypothesis testing Chunsheng Ma E-mail: cma@math.wichita.edu 4.4.1 Significance of regression Null hypothesis (Test whether all β j =
More informationBooklet of Code and Output for STAC32 Final Exam
Booklet of Code and Output for STAC32 Final Exam December 7, 2017 Figure captions are below the Figures they refer to. LowCalorie LowFat LowCarbo Control 8 2 3 2 9 4 5 2 6 3 4-1 7 5 2 0 3 1 3 3 Figure
More informationSimple Linear Regression
Simple Linear Regression September 24, 2008 Reading HH 8, GIll 4 Simple Linear Regression p.1/20 Problem Data: Observe pairs (Y i,x i ),i = 1,...n Response or dependent variable Y Predictor or independent
More informationLinear Model Specification in R
Linear Model Specification in R How to deal with overparameterisation? Paul Janssen 1 Luc Duchateau 2 1 Center for Statistics Hasselt University, Belgium 2 Faculty of Veterinary Medicine Ghent University,
More information5. Linear Regression
5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4
More informationSTAT 350: Summer Semester Midterm 1: Solutions
Name: Student Number: STAT 350: Summer Semester 2008 Midterm 1: Solutions 9 June 2008 Instructor: Richard Lockhart Instructions: This is an open book test. You may use notes, text, other books and a calculator.
More informationGeneral Linear Statistical Models
General Linear Statistical Models Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin This framework includes General Linear Statistical Models Linear Regression Analysis of Variance (ANOVA) Analysis
More informationVIII. ANCOVA. A. Introduction
VIII. ANCOVA A. Introduction In most experiments and observational studies, additional information on each experimental unit is available, information besides the factors under direct control or of interest.
More informationStat 5102 Final Exam May 14, 2015
Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA
More informationSTAT 213 Interactions in Two-Way ANOVA
STAT 213 Interactions in Two-Way ANOVA Colin Reimer Dawson Oberlin College 14 April 2016 Outline Last Time: Two-Way ANOVA Interaction Terms Reading Quiz (Multiple Choice) If there is no interaction present,
More informationLecture 19 Multiple (Linear) Regression
Lecture 19 Multiple (Linear) Regression Thais Paiva STA 111 - Summer 2013 Term II August 1, 2013 1 / 30 Thais Paiva STA 111 - Summer 2013 Term II Lecture 19, 08/01/2013 Lecture Plan 1 Multiple regression
More informationOne-way analysis of variance
Analysis of variance From R.R. Sokal and F.J. Rohlf, Biometry, 2nd Edition (1981): A knowledge of analysis of variance is indispensable to any modern biologist and, after you have mastered it, you will
More informationBiostatistics for physicists fall Correlation Linear regression Analysis of variance
Biostatistics for physicists fall 2015 Correlation Linear regression Analysis of variance Correlation Example: Antibody level on 38 newborns and their mothers There is a positive correlation in antibody
More informationmovies Name:
movies Name: 217-4-14 Contents movies.................................................... 1 USRevenue ~ Budget + Opening + Theaters + Opinion..................... 6 USRevenue ~ Opening + Opinion..................................
More informationLecture 4 Multiple linear regression
Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters
More informationSCHOOL OF MATHEMATICS AND STATISTICS
RESTRICTED OPEN BOOK EXAMINATION (Not to be removed from the examination hall) Data provided: Statistics Tables by H.R. Neave MAS5052 SCHOOL OF MATHEMATICS AND STATISTICS Basic Statistics Spring Semester
More informationSTK4900/ Lecture 3. Program
STK4900/9900 - Lecture 3 Program 1. Multiple regression: Data structure and basic questions 2. The multiple linear regression model 3. Categorical predictors 4. Planned experiments and observational studies
More informationMatrices and vectors A matrix is a rectangular array of numbers. Here s an example: A =
Matrices and vectors A matrix is a rectangular array of numbers Here s an example: 23 14 17 A = 225 0 2 This matrix has dimensions 2 3 The number of rows is first, then the number of columns We can write
More informationSTAT 213 Two-Way ANOVA II
STAT 213 Two-Way ANOVA II Colin Reimer Dawson Oberlin College May 2, 2018 1 / 21 Outline Two-Way ANOVA: Additive Model FIT: Estimating Parameters ASSESS: Variance Decomposition Pairwise Comparisons 2 /
More informationExercise 2 SISG Association Mapping
Exercise 2 SISG Association Mapping Load the bpdata.csv data file into your R session. LHON.txt data file into your R session. Can read the data directly from the website if your computer is connected
More informationExtensions of One-Way ANOVA.
Extensions of One-Way ANOVA http://www.pelagicos.net/classes_biometry_fa17.htm What do I want You to Know What are two main limitations of ANOVA? What two approaches can follow a significant ANOVA? How
More informationSimple Linear Regression
Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.
More informationSchool of Mathematical Sciences. Question 1
School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 8 and Assignment 7 Solutions Question 1 Figure 1: The residual plots do not contradict the model assumptions of normality, constant
More informationSAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model
Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,
More information