Oct Analysis of variance models. One-way anova. Three sheep breeds. Finger ridges. Random and. Fixed effects model. The random effects model
|
|
- Damian Nichols
- 5 years ago
- Views:
Transcription
1 s s Oct / 34
2 s Consider N = n 0 + n n k 1 observations, which form k groups, of sizes n 0, n 1,..., n k 1. The r-th group has sample mean Ȳ r The overall mean (for all groups combined) is Ȳ = N 1 Y i. Observations in the r-th group are N(m r, σ 2 W ), for r = 0 to k 1: Y = b 0 + b 1 X b k 1 X k 1 + e e is N(0, σ 2 W ), and X 1... X k 1 are indicator variables. 2 / 34
3 Indicator variables s For an observation in group 0, all indicator variables are zero. Otherwise, for r 1, the value of X r for an observation in the r-th group is 1, and all other indicators are zero. In terms of group means, regression parameters are: b 0 = m 0, and b r = m r m 0 for r = 1... k 1 The null hypothesis : m 0 = m 1 = = m k 1 (no difference among groups) is equivalent to the standard multiple regression hypothesis b 1 = b 2 = = b k 1 = 0. 3 / 34
4 Parameter estimates s Estimates of the parameters are b0 = Ȳ 0, br = Ȳ r Ȳ 0, for r = 1... k 1 The fitted value for an observation in the r-th group is Ŷ = b 0 + b 1 X b k 1 X k 1 = Ȳ r (the sample mean for the r-th group) 4 / 34
5 Between-groups sum of squares s Regression sum of squares is k 1 S B = n r (Ȳ r Ȳ) 2, r=0 also referred to as the between-group sum of squares. Residual sum of squares S W is the sum of the residual sum of squares for each group ( within-group sum of squares). Source DF Sum Sq Mean Sq F Between k 1 S B M B M B /M W Within N k S W M W Total N 1 S yy M W (also written S 2 ) estimates the residual σ 2 W. 5 / 34
6 Hand calculation s T r is the subtotal for the r-th group, T = T 0 + T T k 1 is the total of all observations. S yy = N i=1 Yi 2 T2 N, S k 1 B = r=0 T 2 r n r T2 N S W is calculated as the difference between S yy and S B. 6 / 34
7 s A random sample is taken from each of 3 sheep, and blood Cu content measured on the 16 sheep. Blackface Welsh Cross / 34
8 Hand calculation of sheep data s Blackface Welsh Cross Total (5) (5) (6) (16) Uncorrected sum of squares for all 16 measurements is Total (corrected) sum of squares = = Between sum of squares = = / 34
9 Anova table s Source DF Sum Sq Mean Sq F Between 2 Within 13 Total 15 9 / 34
10 Anova table s Source DF Sum Sq Mean Sq F Between Within 13 Total / 34
11 Anova table s Source DF Sum Sq Mean Sq F Between Within Total / 34
12 Anova table s Source DF Sum Sq Mean Sq F Between Within Total The F test is highly significant (P < 0.001) 9 / 34
13 in identical twins s Fam Twin 1 Twin Fam Twin 1 Twin / 34
14 s Special case (all groups of size two) Fam sum diff Between family sum of squares Fam sum diff ( )/ /24 = = Within family sum of squares ( )/2 = / 34
15 anova table (finger ridges) s Source DF Sum Sq Mean Sq F Between families Within families Total The F test is very highly significant P < / 34
16 s The ANOVA calculations for these two examples are similar, but the interpretation of the mean squares is different. In the first example, different are regarded as, and we are primarily interested in differences (contrasts) between. Examples of factors which are usually treated as fixed are experimental treatments, sex, breed. 13 / 34
17 Random effects s In the second example, differences between particular families are not of interest. Instead, we regard the twelve family means as sampled from a N(m, σ 2 B ) distribution. Primary interest is in the of σ 2 B and σ2 W. Differences between families are treated as random effects. Examples of random effects are families, litters, or sires, where the levels are a sample of those possible and a repeat experiment would use different levels. 14 / 34
18 Fixed-effect s In the fixed-, F tests the hypothesis m 0 = m 1 = = m k 1, or equivalently, b 1 = b 2 = = b k 1 = 0. In the sheep example, F = on 2 and 13 d.f. (very highly significant). Differences between are established beyond reasonable doubt. 15 / 34
19 Comparing means s Mean values for each breed: Blackface Welsh Cross Estimated standard error of a difference between two means based on n 1 and n 2 observations is MW (1/n 1 + 1/n 2 ) where M W is the within group mean square from the ANOVA. Standard errors of differences between breed means are (Blackface v Welsh), and (otherwise). 16 / 34
20 Comparing means s If m 1 = m 2, the difference between means Ȳ 1 Ȳ 2 divided by its estimated s.e. has a t distn with N k = 13 d.f. Comparison Estimate SE t Blackface v Welsh Blackface v Cross Welsh v Cross The upper 2.5% point of t on 13 d.f. is Difference between Blackface and Cross does not quite reach 5% significance. The Welsh breed has significantly higher measurements than the other two. Confidence intervals are calculated in the usual way. 17 / 34
21 Contrasts s The general form of a contrast is a 1 m 1 + a 2 m a k m k, where a 1 + a a k = 0. The estimate of the contrast is a 1 Ȳ 1 + a 2 Ȳ a k Ȳ k, with estimated s.e. ( M W a 2 1 /n 1 + a 2 2 /n a 2 k /n ) k The t statistic for testing whether the contrast is zero is the ratio of the estimated contrast to its s.e. 18 / 34
22 Another contrast s In the sheep example, a simple genetic hypothesis is H 0 : m B + m W 2m C = 0. Estimate of the contrast Ȳ B + Ȳ W 2Ȳ C is normally distd with zero mean (if H 0 is true) and estimated M W (1/n B + 1/n W + 4/n C ) Here the estimate is ( ) = 0.66, with estimated 0.732(1/5 + 1/5 + 4/6) = t = 0.66/ = with 13 d.f. Not significant, there is no evidence against the genetic. 19 / 34
23 Random Equal group sizes s To the fixed- we add the assumption that m 1... m k are drawn randomly from an N(m, σ 2 B ) distn. We now have E(Y i ) = m, var (Y i ) = σ 2 B + σ 2 W. Expectations of between and within group mean squares are DF Mean Sq E(Mean Sq) Between groups k 1 M B σ 2 W + nσ2 B Within groups N k M W σ 2 W The F statistic tests H 0 : σ 2 B = 0. The overall mean Ȳ estimates m, with (σ 2 W + nσ2 B )/N. 20 / 34
24 s In the random the total of an observation σ 2 B + σ2 W is partitioned into σ2 B and σ2 W. Estimates are obtained by equating observed and expected mean squares: σ 2 W = M W, σ 2 B = (M B M W )/n. Source DF Sum Sq Mean Sq E(Mean Sq) Between families σ 2 W + 2σ2 B Within families σ 2 W Total In the finger ridges example, σ 2 W = 14.29, σ 2 B = / 34
25 Intraclass correlation s The intraclass correlation coefficient σ 2 B /(σ2 W + σ2 B ) measures the extent to which individuals in the same group are more similar than individuals in different groups. An estimate of the intraclass correlation is σ 2 B σ 2 B + σ 2 W M B M W = M B + (n 1)M W In the finger ridges example, σ 2 W = 14.29, σ 2 B = , and the estimated intraclass correlation is / 34
26 Cross-classification s In the sheep example, we may want to allow for the possibility that the measurement is affected by the sex of the animal. Suppose that the first three Blackface, first two Welsh, and first three Cross sheep are male, and the rest female. We now have a cross- classification of the data by breed and sex. Male Female Blackface Welsh Cross / 34
27 Two factors, cross-classification s breed - factor(rep(c( Blackface, Welsh, Cross ), each = 6)) sex - factor(rep(1:2, times = 3, each = 3), labels = c( male, female )) Cu - c(6.5,7.9,7.4,6.8,8.1,na, 10.4,9.8,NA,11.1,10.6,9.2, 6.9,9.2,8.4,7.6,9.7,8.9) fit - lm(cu breed + sex + breed:sex) anova(fit) Df Sum Sq Mean Sq F value breed sex breed:sex Residuals / 34
28 Sequential sums of squares s anova(fit) Df Sum Sq Mean Sq F value breed sex breed:sex Residuals ) Sum of squares for (ignoring sex). 2) Extra sum of squares obtained by fitting sex after breed (an additive : fitted values are sums of breed differences and the sex difference). 3) Sex difference is allowed to depend on breed. This is equivalent to a one-way anova with six levels, one for each combination of breed and sex. 25 / 34
29 Sequence of fits s Intercept b 0 has been omitted everywhere. breed only Blackface Welsh Cross Male 0 b 1 b 2 Female 0 b 1 b 2 additive Blackface Welsh Cross Male 0 b 1 b 2 Female c 1 b 1 + c 1 b 2 + c 1 saturated Blackface Welsh Cross Male 0 b 1 b 2 Female c 1 b 1 + c 1 + d 1 b 2 + c 1 + d 2 d 1 and d 2 are interaction terms (differences of differences). 26 / 34
30 s Mouse Litter An experiment to investigate variability in skin thickness of mice used 5 litters of 3 mice. For each mouse, skin thickness was measured twice by independent determinations. 27 / 34
31 example s In the random the total of an observation σ 2 B + σ2 W is partitioned into σ2 B and σ2 W. Estimates are obtained by equating observed and expected mean squares: σ 2 W = M W, σ 2 B = (M B M W )/n. Source DF Sum Sq Mean Sq E(Mean Sq) Between families σ 2 W + 2σ2 B Within families σ 2 W Total In the finger ridges example, σ 2 W = 14.29, σ 2 B = / 34
32 Mouse skin thickness example s Source DF SSQ MSQ E(MSQ) Litters σ 2 W + 2σ2 M + 6σ2 L Mice in litters σ 2 W + 2σ2 M Residual σ 2 W Estimated of Between observations, within mice 62.3 Between mice, within litters Between litters / 34
33 The mixed s There are several complications which can affect anova. a) It may be necessary to allow for (e.g. sex of mouse). b) The data may be unbalanced. For example, the number of readings per mouse may vary, perhaps because of missing observations. A solution to these problems is to use a mixed (with both fixed and random effects), and estimate by maximum likelihood. lme( ), lmer( ), MCMCglmm( ), asreml, / 34
34 A mixed s A simple example of a mixed is obtained if in the skin thickness example we treat litters as fixed rather than random. Expected mean squares are unchanged, except that the σ 2 L term in the expected mean square for litters is replaced by the sample of the. The test for litter effects is unchanged: F = between litters mean square between mice within litters mean square Compare the last part of farms example on problem sheet / 34
35 s Main assumptions underlying the anova procedure are: 1) Data are normally distributed. 2) The σw 2 is constant. Diagnostic plots, e.g. of residuals against fitted values, are used to check these assumptions (as for regression). If any of the assumptions is false, the situation can sometimes be improved by a transformation of the data. The most common transformations are the square root and logarithmic. These are most useful in correcting a tendency for the of an observation to increase with its size. 32 / 34
36 s breed - factor(rep(c( Blackface, Welsh, Cross ), c(5,5,6))) Cu - c(6.5,7.9,7.4,6.8,8.1,10.4,9.8,11.1,10.6,9.2, 6.9,9.2,8.4,7.6,9.7,8.9) lmfit - lm(cu breed) anova(lmfit) # anova table plot(lmfit) # diagnostic plots # alternatively... aovfit - aov(cu breed) summary(aovfit) # anova table 33 / 34
37 Two factors, nested classification s If litter and mouse are factors with 5 and 15 levels, aov(skin litter + Error(mouse)) produces two residual mean squares, a) between mice within litters (10 d.f.) b) between observations within mice (15 d.f.) Mean square with 10 d.f. is the denominator in the F test for differences among litters. The coding which numbers mice 1-3 within litters aov(skin litter + Error(litter:mouse)) works but is not recommended. 34 / 34
Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope
Oct 2017 1 / 28 Minimum MSE Y is the response variable, X the predictor variable, E(X) = E(Y) = 0. BLUP of Y minimizes average discrepancy var (Y ux) = C YY 2u C XY + u 2 C XX This is minimized when u
More informationMultiple regression. Partial regression coefficients
Multiple regression We now generalise the results of simple linear regression to the case where there is one response variable Y and two predictor variables, X and Z. Data consist of n triplets of values
More information13 Simple Linear Regression
B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 3 Simple Linear Regression 3. An industrial example A study was undertaken to determine the effect of stirring rate on the amount of impurity
More informationStatistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018
Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement
More informationFood consumption of rats. Two-way vs one-way vs nested ANOVA
Food consumption of rats Lard Gender Fresh Rancid 709 592 Male 679 538 699 476 657 508 Female 594 505 677 539 Two-way vs one-way vs nested NOV T 1 2 * * M * * * * * * F * * * * T M1 M2 F1 F2 * * * * *
More informationMixed-Models. version 30 October 2011
Mixed-Models version 30 October 2011 Mixed models Mixed models estimate a vector! of fixed effects and one (or more) vectors u of random effects Both fixed and random effects models always include a vector
More informationUnbalanced Data in Factorials Types I, II, III SS Part 1
Unbalanced Data in Factorials Types I, II, III SS Part 1 Chapter 10 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 14 When we perform an ANOVA, we try to quantify the amount of variability in the data accounted
More informationAnalysis of Variance
Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61
More informationMixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012
Mixed-Model Estimation of genetic variances Bruce Walsh lecture notes Uppsala EQG 01 course version 8 Jan 01 Estimation of Var(A) and Breeding Values in General Pedigrees The above designs (ANOVA, P-O
More informationSTAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:
STAT 263/363: Experimental Design Winter 206/7 Lecture January 9 Lecturer: Minyong Lee Scribe: Zachary del Rosario. Design of Experiments Why perform Design of Experiments (DOE)? There are at least two
More informationNormal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,
Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More information(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.
FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December
More informationRemedial Measures, Brown-Forsythe test, F test
Remedial Measures, Brown-Forsythe test, F test Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 7, Slide 1 Remedial Measures How do we know that the regression function
More informationCh 3: Multiple Linear Regression
Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery
More informationSTA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3
STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More information1 Multiple Regression
1 Multiple Regression In this section, we extend the linear model to the case of several quantitative explanatory variables. There are many issues involved in this problem and this section serves only
More information2.1 Linear regression with matrices
21 Linear regression with matrices The values of the independent variables are united into the matrix X (design matrix), the values of the outcome and the coefficient are represented by the vectors Y and
More informationNature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.
Understanding regression output from software Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals In 1966 Cyril Burt published a paper called The genetic determination of differences
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationInference for Regression Inference about the Regression Model and Using the Regression Line
Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about
More informationSPH 247 Statistical Analysis of Laboratory Data
SPH 247 Statistical Analysis of Laboratory Data March 31, 2015 SPH 247 Statistical Analysis of Laboratory Data 1 ANOVA Fixed and Random Effects We will review the analysis of variance (ANOVA) and then
More informationSTAT 501 EXAM I NAME Spring 1999
STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your
More informationCAS MA575 Linear Models
CAS MA575 Linear Models Boston University, Fall 2013 Midterm Exam (Correction) Instructor: Cedric Ginestet Date: 22 Oct 2013. Maximal Score: 200pts. Please Note: You will only be graded on work and answers
More informationSupplementary File 3: Tutorial for ASReml-R. Tutorial 1 (ASReml-R) - Estimating the heritability of birth weight
Supplementary File 3: Tutorial for ASReml-R Tutorial 1 (ASReml-R) - Estimating the heritability of birth weight This tutorial will demonstrate how to run a univariate animal model using the software ASReml
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationNesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik
Nesting and Mixed Effects: Part I Lukas Meier, Seminar für Statistik Where do we stand? So far: Fixed effects Random effects Both in the factorial context Now: Nested factor structure Mixed models: a combination
More informationLecture 4. Basic Designs for Estimation of Genetic Parameters
Lecture 4 Basic Designs for Estimation of Genetic Parameters Bruce Walsh. Aug 003. Nordic Summer Course Heritability The reason for our focus, indeed obsession, on the heritability is that it determines
More informationMultiple Regression: Example
Multiple Regression: Example Cobb-Douglas Production Function The Cobb-Douglas production function for observed economic data i = 1,..., n may be expressed as where O i is output l i is labour input c
More informationLecture 9: Linear Regression
Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression
More informationSTAT 525 Fall Final exam. Tuesday December 14, 2010
STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will
More informationLecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012
Lecture 3: Linear Models Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector of observed
More informationThe First Thing You Ever Do When Receive a Set of Data Is
The First Thing You Ever Do When Receive a Set of Data Is Understand the goal of the study What are the objectives of the study? What would the person like to see from the data? Understand the methodology
More informationResemblance among relatives
Resemblance among relatives Introduction Just as individuals may differ from one another in phenotype because they have different genotypes, because they developed in different environments, or both, relatives
More information10.1 Simple Linear Regression
10.1 Simple Linear Regression Ulrich Hoensch Tuesday, December 1, 2009 The Simple Linear Regression Model We have two quantitative random variables X (the explanatory variable) and Y (the response variable).
More informationSimple Linear Regression
Chapter 2 Simple Linear Regression Linear Regression with One Independent Variable 2.1 Introduction In Chapter 1 we introduced the linear model as an alternative for making inferences on means of one or
More informationChapter 12 - Lecture 2 Inferences about regression coefficient
Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous
More informationSchool of Mathematical Sciences. Question 1
School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 8 and Assignment 7 Solutions Question 1 Figure 1: The residual plots do not contradict the model assumptions of normality, constant
More informationMultiple comparisons - subsequent inferences for two-way ANOVA
1 Multiple comparisons - subsequent inferences for two-way ANOVA the kinds of inferences to be made after the F tests of a two-way ANOVA depend on the results if none of the F tests lead to rejection of
More informationASReml Tutorial: D1 GLM & GLMM p. 1. ASReml tutorial D1 GLM & GLMM. Arthur Gilmour
ASReml tutorial D1 GLM & GLMM Arthur Gilmour ASReml Tutorial: D1 GLM & GLMM p. 1 ASReml tutorial D1 GLM & GLMM Arthur Gilmour ASReml Tutorial: D1 GLM & GLMM p. 2 Generalized Linear Mixed models Iteratively
More informationDr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38
BIO5312 Biostatistics Lecture 11: Multisample Hypothesis Testing II Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/8/2016 1/38 Outline In this lecture, we will continue to
More informationCHAPTER 10. Regression and Correlation
CHAPTER 10 Regression and Correlation In this Chapter we assess the strength of the linear relationship between two continuous variables. If a significant linear relationship is found, the next step would
More informationKeppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares
Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares K&W introduce the notion of a simple experiment with two conditions. Note that the raw data (p. 16)
More informationChapter 1 Linear Regression with One Predictor
STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the
More informationSTA 101 Final Review
STA 101 Final Review Statistics 101 Thomas Leininger June 24, 2013 Announcements All work (besides projects) should be returned to you and should be entered on Sakai. Office Hour: 2 3pm today (Old Chem
More informationSection 9c. Propensity scores. Controlling for bias & confounding in observational studies
Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a
More informationSTAT5044: Regression and Anova. Inyoung Kim
STAT5044: Regression and Anova Inyoung Kim 2 / 47 Outline 1 Regression 2 Simple Linear regression 3 Basic concepts in regression 4 How to estimate unknown parameters 5 Properties of Least Squares Estimators:
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationRegression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.
TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted
More informationLecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013
Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 013 1 Estimation of Var(A) and Breeding Values in General Pedigrees The classic
More informationMath 423/533: The Main Theoretical Topics
Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)
More informationAnalysis of Variance
Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also
More informationMultiple Regression Examples
Multiple Regression Examples Example: Tree data. we have seen that a simple linear regression of usable volume on diameter at chest height is not suitable, but that a quadratic model y = β 0 + β 1 x +
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationSTAT5044: Regression and Anova
STAT5044: Regression and Anova Inyoung Kim 1 / 25 Outline 1 Multiple Linear Regression 2 / 25 Basic Idea An extra sum of squares: the marginal reduction in the error sum of squares when one or several
More informationStatistics 5100 Spring 2018 Exam 1
Statistics 5100 Spring 2018 Exam 1 Directions: You have 60 minutes to complete the exam. Be sure to answer every question, and do not spend too much time on any part of any question. Be concise with all
More informationReview of Econometrics
Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,
More informationCoefficient of Determination
Coefficient of Determination ST 430/514 The coefficient of determination, R 2, is defined as before: R 2 = 1 SS E (yi ŷ i ) = 1 2 SS yy (yi ȳ) 2 The interpretation of R 2 is still the fraction of variance
More informationSTK4900/ Lecture 3. Program
STK4900/9900 - Lecture 3 Program 1. Multiple regression: Data structure and basic questions 2. The multiple linear regression model 3. Categorical predictors 4. Planned experiments and observational studies
More informationLecture 3: Inference in SLR
Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form
More informationEconometrics. 4) Statistical inference
30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution
More informationLatin Square Designs (LS) Clarice G. B. Demétrio
Estatística Experimental: Session 4 1 Latin Square Designs (LS) based on Chris Brien s notes (University of South Australia, Adelaide), http://chris.brien.name/ee2/ Clarice G. B. Demétrio Departamento
More informationChapter 11. Analysis of Variance (One-Way)
Chapter 11 Analysis of Variance (One-Way) We now develop a statistical procedure for comparing the means of two or more groups, known as analysis of variance or ANOVA. These groups might be the result
More informationLecture 10. Factorial experiments (2-way ANOVA etc)
Lecture 10. Factorial experiments (2-way ANOVA etc) Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regression and Analysis of Variance autumn 2014 A factorial experiment
More informationSociology 593 Exam 1 February 14, 1997
Sociology 9 Exam February, 997 I. True-False. ( points) Indicate whether the following statements are true or false. If false, briefly explain why.. There are IVs in a multiple regression model. If the
More informationLecture 14 Topic 10: ANOVA models for random and mixed effects. Fixed and Random Models in One-way Classification Experiments
Lecture 14 Topic 10: ANOVA models for random and mixed effects To this point, we have considered only the Fixed Model (Model I) ANOVA; now we will extend the method of ANOVA to other experimental objectives.
More informationUNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018 Work all problems. 60 points are needed to pass at the Masters Level and 75
More informationModel Specification and Data Problems. Part VIII
Part VIII Model Specification and Data Problems As of Oct 24, 2017 1 Model Specification and Data Problems RESET test Non-nested alternatives Outliers A functional form misspecification generally means
More informationLecture 15. Hypothesis testing in the linear model
14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma
More informationGeneralized linear mixed models for biologists
Generalized linear mixed models for biologists McMaster University 7 May 2009 Outline 1 2 Outline 1 2 Coral protection by symbionts 10 Number of predation events Number of blocks 8 6 4 2 2 2 1 0 2 0 2
More informationBIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES
BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method
More informationEstimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.
Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.
More informationLinear models and their mathematical foundations: Simple linear regression
Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction
More informationL6: Regression II. JJ Chen. July 2, 2015
L6: Regression II JJ Chen July 2, 2015 Today s Plan Review basic inference based on Sample average Difference in sample average Extrapolate the knowledge to sample regression coefficients Standard error,
More informationTABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3
Table of contents TABLE OF CONTENTS...1 1 INTRODUCTION TO MIXED-EFFECTS MODELS...3 Fixed-effects regression ignoring data clustering...5 Fixed-effects regression including data clustering...1 Fixed-effects
More informationHomework 2: Simple Linear Regression
STAT 4385 Applied Regression Analysis Homework : Simple Linear Regression (Simple Linear Regression) Thirty (n = 30) College graduates who have recently entered the job market. For each student, the CGPA
More informationInference for Regression Simple Linear Regression
Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationEXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY
EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 009 MODULE 4 : Linear models Time allowed: One and a half hours Candidates should answer THREE questions. Each question carries
More informationIntroduction to Linear regression analysis. Part 2. Model comparisons
Introduction to Linear regression analysis Part Model comparisons 1 ANOVA for regression Total variation in Y SS Total = Variation explained by regression with X SS Regression + Residual variation SS Residual
More informationOne-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.
One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)
More information9 Correlation and Regression
9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More informationA discussion on multiple regression models
A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More informationFACTORIAL DESIGNS and NESTED DESIGNS
Experimental Design and Statistical Methods Workshop FACTORIAL DESIGNS and NESTED DESIGNS Jesús Piedrafita Arilla jesus.piedrafita@uab.cat Departament de Ciència Animal i dels Aliments Items Factorial
More informationLecture 1: Linear Models and Applications
Lecture 1: Linear Models and Applications Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction to linear models Exploratory data analysis (EDA) Estimation
More informationPART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,
Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/2 01 Examination Date Time Pages Final December 2002 3 hours 6 Instructors Course Examiner Marks Y.P.
More informationInference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58
Inference ME104: Linear Regression Analysis Kenneth Benoit August 15, 2012 August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Stata output resvisited. reg votes1st spend_total incumb minister
More informationAnnouncements. Final Review: Units 1-7
Announcements Announcements Final : Units 1-7 Statistics 104 Mine Çetinkaya-Rundel June 24, 2013 Final on Wed: cheat sheet (one sheet, front and back) and calculator Must have webcam + audio on at all
More informationLecture 22 Mixed Effects Models III Nested designs
Lecture 22 Mixed Effects Models III Nested designs 94 Introduction: Crossed Designs The two-factor designs considered so far involve every level of the first factor occurring with every level of the second
More information22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)
22s:152 Applied Linear Regression Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) We now consider an analysis with only categorical predictors (i.e. all predictors are
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationSCHOOL OF MATHEMATICS AND STATISTICS
RESTRICTED OPEN BOOK EXAMINATION (Not to be removed from the examination hall) Data provided: Statistics Tables by H.R. Neave MAS5052 SCHOOL OF MATHEMATICS AND STATISTICS Basic Statistics Spring Semester
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More information1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative
More informationSTAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing
STAT763: Applied Regression Analysis Multiple linear regression 4.4 Hypothesis testing Chunsheng Ma E-mail: cma@math.wichita.edu 4.4.1 Significance of regression Null hypothesis (Test whether all β j =
More information