Lecture 4. Checking Model Adequacy
|
|
- Jasper Parrish
- 6 years ago
- Views:
Transcription
1 Lecture 4. Checking Model Adequacy Montgomery: 3-4, Page 1
2 Model Checking and Diagnostics Model Assumptions 1 Model is correct 2 Independent observations 3 Errors normally distributed 4 Constant variance y ij = (y.. +(y i. y.. )) + (y ij y i. ) y ij = ŷ ij + ˆɛ ij observed = predicted + residual Note that the predicted response at treatment i is ŷ ij =ȳ i. Diagnostics use predicted responses and residuals. Page 2
3 Diagnostics Normality Histogram of residuals Normal probability plot / QQ plot Shapiro-Wilk Test Constant Variance Plot ˆɛ ij vs ŷ ij (residual plot) Bartlett s or Levene s Test Independence Plot ˆɛ ij vs time/space Plot ˆɛ ij vs variable of interest Outliers Page 3
4 Constant Variance In some experiments, error variance (σ 2 i ) depends on the mean response E(y ij )=µ i = µ + τ i. So the constant variance assumption is violated. Size of error (residual) depends on mean response (predicted value) Residual plot Plot ˆɛ ij vs ŷ ij Is the range constant for different levels of ŷ ij More formal tests: Bartlett s Test Modified Levene s Test. Page 4
5 Bartlett s Test H 0 : σ 2 1 = σ 2 2 =...= σ 2 a Test statistic: χ 2 0 = q c where q =(N a)log 10 S 2 p a i=1 (n i 1)log 10 S 2 i c =1+ 1 3(a 1) ( a i=1 (n i 1) 1 (N a) 1 ) and S 2 i is the sample variance of the ith population and S2 p is the pooled sample variance. Decision Rule: reject H 0 when χ 2 0 >χ 2 α,a 1. Remark: sensitive to normality assumption. Page 5
6 Modified Levene s Test For each fixed i, calculate the median m i of y i1,y i2,...,y ini. Compute the absolute deviation of observation from sample median:[ d ij = y ij m i for i =1, 2,...,aand j =1, 2,...,n i, Apply ANOVA to the deviations: d ij Use the usual ANOVA F -statistic for testing H 0 : σ1 2 =...= σa 2. Page 6
7 options ls=80 ps=65; title1 Diagnostics Example ; data one; infile c:\saswork\data\tensile.dat ; input percent strength time; proc glm data=one; class percent; model strength=percent; means percent / hovtest=bartlett hovtest=levene; output out=diag p=pred r=res; proc sort; by pred; symbol1 v=circle i=sm50; title1 Residual Plot ; proc gplot; plot res*pred/frame; run; proc univariate data=diag normal noprint; var res; qqplot res / normal (L=1 mu=est sigma=est); histogram res / normal; run; Page 7
8 run; proc sort; by time; symbol1 v=circle i=sm75; title1 Plot of residuals vs time ; proc gplot; plot res*time / vref=0 vaxis=-6 to 6 by 1; run; symbol1 v=circle i=sm50; title1 Plot of residuals vs time ; proc gplot; plot res*time / vref=0 vaxis=-6 to 6 by 1; run; Page 8
9 Diagnostics Example Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Levene s Test for Homogeneity of strength Variance ANOVA of Squared Deviations from Group Means Sum of Mean Source DF Squares Square F Value Pr > F percent Error Bartlett s Test for Homogeneity of strength Variance Source DF Chi-Square Pr > ChiSq percent Page 9
10 Non-constant Variance: Impact and Remedy Does not affect F-test dramatically when experiment is balanced Why concern? Comparison of treatments depends on MSE Incorrect intervals and comparison results Variance-Stabilizing Transformations Common transformations x, log(x), 1/x, arcsin( x), and 1/ x Box-Cox transformations 1. approximate the relationship σ i = θµ β i X 1 β 2. use maximum likelihood principle Distribution often more normal after transformation, then the transformation is Page 10
11 Ideas for Finding Proper Transformations Consider response Y with mean E(Y )=µ and variance Var(Y )=σ 2. That σ 2 depends on µ leads to nonconstant variances for different µ. Let f be a transformation and Ỹ = f(y ); What is the mean and variance of Ỹ? Approximate f(y ) by a linear function (Delta Method): f(y ) f(µ)+(y µ)f (µ) Mean µ = E(Ỹ )=E(f(Y )) E(f(µ)) + E((Y µ)f (µ)) = f(µ) Variance σ 2 = Var(Ỹ ) [f (µ)] 2 Var(Y )=[f (µ)] 2 σ 2 f is a good transformation if σ 2 does not depend on µ anymore. So, Ỹ has constant variance for different f(µ). Page 11
12 Transformations Suppose σ 2 is a function of µ, that is σ 2 = g(µ) Want to find transformation f such that Var(Ỹ ) does not depend on µ. Ỹ = f(y ) has constant variance: Have shown Var(Ỹ ) [f (µ)] 2 σ 2 [f (µ)] 2 g(µ) Want to choose f such that [f (µ)] 2 g(µ) c Examples g(µ) =µ (Poisson) f(x) = 1 µ dµ f(x) = X g(µ) =µ(1 µ) (Binomial) f(x) = 1 dµ f(x) = arcsin( X) µ(1 µ) g(µ) =µ 2β (Box-Cox) f(x) = µ β dµ f(x) =X 1 β g(µ) =µ 2 (Box-Cox) f(x) = 1 µ dµ f(x) = log X Page 12
13 Identify Box-Cox Transformation Using Data: Approximate Method From the previous slide, if σ = θµ β, the transformation is Y 1 β β 1; f(y )= log Y β =1 So it is crucial to estimate β based on data y ij, i =1,...,a. We have logσ i = logθ + βlogµ i Let s i and ȳ i. be the sample standard deviations and means. Because ˆσ i = s i and ˆµ i =ȳ i., approximately, where i =1,...,a. logs i = constant + βlogȳ i., We can plot logs i against logȳ i., fit a straight line and use the slope to estimate β. Page 13
14 Identify Box-Cox Transformation: Formal Method 1. For a fixed λ, perform analysis of variance on y ij (λ) = y λ ij 1 λẏ λ 1 λ 0 ẏ log y ij λ =0 whereẏ = a n i y ij i=1 j=1 1/N. 2. Step 1 generates a transformed data y ij (λ). Apply ANOVA to the new data and obtain SS E. Because SS E depends on λ, it is denoted by SS E (λ). Repeat 1 and 2 for various λ in an interval, e.g., [-2,2], and record SS E (λ) 3 Find λ 0 which minimizes SS E (λ) and pick up a meaningful λ in the neighborhood of λ 0. Denote it again by λ. 4 The transformation is: ỹ ij = y λ 0 ij if λ 0 0; ỹ ij = log y ij if λ 0 =0. Page 14
15 An Example: boxcox.dat trt response Page 15
16 Approximate Method: trans.sas options nocenter ps=65 ls=80; title1 Increasing Variance Example ; data one; infile c:\saswork\data\boxcox.dat ; input trt resp; proc glm data=one; class trt; model resp=trt; output out=diag p=pred r=res; title1 Residual Plot ; symbol1 v=circle i=none; proc gplot data=diag; plot res*pred /frame; proc univariate data=one noprint; var resp; by trt; output out=two mean=mu std=sigma; data three; set two; logmu = log(mu); logsig = log(sigma); proc reg; model logsig = logmu; title1 Mean vs Std Dev ; symbol1 v=circle i=rl; proc gplot; plot logsig*logmu / regeqn; run; Page 16
17 Residual Plot Page 17
18 Plot of logs i vs logµ i Page 18
19 Formal Method: trans1.sas options ls=80 ps=65 nocenter; title1 Box-Cox Example ; data one; infile c:\saswork\data\boxcox.dat ; input trt resp; logresp = log(resp); proc univariate data=one noprint; var logresp; output out=two mean=mlogresp; data three; set one; if _n_ eq 1 then set two; ydot = exp(mlogresp); do l=-1.0 to 1.0 by.25; den = l*ydot**(l-1); if abs(l) eq 0 then den = 1; yl=(resp**l -1)/den; if abs(l) < then yl=ydot*log(resp); output; end; Page 19
20 keep trt yl l; proc sort data=three out=three; by l; proc glm data=three noprint outstat=four; class trt; model yl=trt; by l; data five; set four; if _SOURCE_ eq ERROR ; keep l SS; proc print data=five; run; symbol1 v=circle i=sm50; proc gplot; plot SS*l; run; Page 20
21 SS E (λ) and λ OBS L SS OBS L SS Page 21
22 Plot of SS E (λ) vs λ Page 22
23 Using Proc Transreg proc transreg data=one; model boxcox(y/lambda=-2.0 to 2.0 by 0.1)=class(trt); run; The TRANSREG Procedure Transformation Information for BoxCox(y) Lambda R-Square Log Like : : : * * < * * < Best Lambda : : : * Confidence Interval Convenient Lambda Page 23
24 Nonnormality trt nitrogen Page 24
25 Test ---Statistic p Value----- Shapiro-Wilk W Pr < W Page 25
26 Kruskal-Wallis Test: a Nonparametric alternative a treatments, H 0 : a treatments are not different. Rank the observations y ij in ascending order Replace each observation by its rank R ij (assign average for tied observations) Test statistic H = 1 S 2 [ a i=1 where S 2 = 1 N 1 R 2 i. n i N(N+1)2 4 [ a i=1 Decision Rule: reject H 0 if H>χ 2 α,a 1. ] χ 2 a 1 ni j=1 R2 ij N(N+1)2 4 Let F 0 be the F -test statistic in ANOVA based on R ij. Then ] F 0 = H/(a 1) (N 1 H)/(N a) Page 26
27 options nocenter ps=65 ls=80; data new; input strain nitrogen cards; ; proc npar1way; class strain; var nitrogen; run; Page 27
28 The NPAR1WAY Procedure Analysis of Variance for Variable nitrogen Classified by Variable strain strain N Mean Source DF Sum of Squares Mean Square F Value Pr > F Among Within Page 28
29 The NPAR1WAY Procedure Wilcoxon Scores (Rank Sums) for Variable nitrogen Classified by Variable strain Sum of Expected Std Dev Mean strain N Scores Under H0 Under H0 Score Average scores were used for ties. Kruskal-Wallis Test Chi-Square DF 5 Pr > Chi-Square Page 29
Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3
Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the weight percent
More informationLecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3
Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Fall, 2013 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the
More informationLecture 3. Experiments with a Single Factor: ANOVA Montgomery Sections 3.1 through 3.5 and Section
Lecture 3. Experiments with a Single Factor: ANOVA Montgomery Sections 3.1 through 3.5 and Section 15.1.1 1 Lecture 3 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic
More informationOutline. Topic 20 - Diagnostics and Remedies. Residuals. Overview. Diagnostics Plots Residual checks Formal Tests. STAT Fall 2013
Topic 20 - Diagnostics and Remedies - Fall 2013 Diagnostics Plots Residual checks Formal Tests Remedial Measures Outline Topic 20 2 General assumptions Overview Normally distributed error terms Independent
More informationComparison of a Population Means
Analysis of Variance Interested in comparing Several treatments Several levels of one treatment Comparison of a Population Means Could do numerous two-sample t-tests but... ANOVA provides method of joint
More informationSingle Factor Experiments
Single Factor Experiments Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 4 1 Analysis of Variance Suppose you are interested in comparing either a different treatments a levels
More informationTopic 23: Diagnostics and Remedies
Topic 23: Diagnostics and Remedies Outline Diagnostics residual checks ANOVA remedial measures Diagnostics Overview We will take the diagnostics and remedial measures that we learned for regression and
More informationIntroduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes)
Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Asheber Abebe Discrete and Statistical Sciences Auburn University Contents 1 Completely Randomized Design
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationSAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c
Inference About the Slope ffl As with all estimates, ^fi1 subject to sampling var ffl Because Y jx _ Normal, the estimate ^fi1 _ Normal A linear combination of indep Normals is Normal Simple Linear Regression
More informationLecture 10: 2 k Factorial Design Montgomery: Chapter 6
Lecture 10: 2 k Factorial Design Montgomery: Chapter 6 Page 1 2 k Factorial Design Involving k factors Each factor has two levels (often labeled + and ) Factor screening experiment (preliminary study)
More informationLecture 7: Latin Square and Related Design
Lecture 7: Latin Square and Related Design Montgomery: Section 4.2-4.3 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study for possible differences between four gasoline
More informationWeek 7.1--IES 612-STA STA doc
Week 7.1--IES 612-STA 4-573-STA 4-576.doc IES 612/STA 4-576 Winter 2009 ANOVA MODELS model adequacy aka RESIDUAL ANALYSIS Numeric data samples from t populations obtained Assume Y ij ~ independent N(μ
More informationLecture 5: Comparing Treatment Means Montgomery: Section 3-5
Lecture 5: Comparing Treatment Means Montgomery: Section 3-5 Page 1 Linear Combination of Means ANOVA: y ij = µ + τ i + ɛ ij = µ i + ɛ ij Linear combination: L = c 1 µ 1 + c 1 µ 2 +...+ c a µ a = a i=1
More informationTopic 2. Chapter 3: Diagnostics and Remedial Measures
Topic Overview This topic will cover Regression Diagnostics Remedial Measures Statistics 512: Applied Linear Models Some other Miscellaneous Topics Topic 2 Chapter 3: Diagnostics and Remedial Measures
More informationLec 3: Model Adequacy Checking
November 16, 2011 Model validation Model validation is a very important step in the model building procedure. (one of the most overlooked) A high R 2 value does not guarantee that the model fits the data
More informationOne-way ANOVA Model Assumptions
One-way ANOVA Model Assumptions STAT:5201 Week 4: Lecture 1 1 / 31 One-way ANOVA: Model Assumptions Consider the single factor model: Y ij = µ + α }{{} i ij iid with ɛ ij N(0, σ 2 ) mean structure random
More informationLecture 12: 2 k Factorial Design Montgomery: Chapter 6
Lecture 12: 2 k Factorial Design Montgomery: Chapter 6 1 Lecture 12 Page 1 2 k Factorial Design Involvingk factors: each has two levels (often labeled+and ) Very useful design for preliminary study Can
More informationLecture 11: Simple Linear Regression
Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink
More informationLecture 4. Random Effects in Completely Randomized Design
Lecture 4. Random Effects in Completely Randomized Design Montgomery: 3.9, 13.1 and 13.7 1 Lecture 4 Page 1 Random Effects vs Fixed Effects Consider factor with numerous possible levels Want to draw inference
More informationSAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model
Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationTopic 20: Single Factor Analysis of Variance
Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory
More informationChapter 1 Linear Regression with One Predictor
STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the
More informationunadjusted model for baseline cholesterol 22:31 Monday, April 19,
unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol
More informationAssessing Model Adequacy
Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for inferences. In cases where some assumptions are violated, there are
More informationOutline. Analysis of Variance. Acknowledgements. Comparison of 2 or more groups. Comparison of serveral groups
Outline Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression10_2/index.html Comparison of serveral groups Model checking Marc Andersen, mja@statgroup.dk
More informationAnalysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking
Analysis of variance and regression Contents Comparison of several groups One-way ANOVA April 7, 008 Two-way ANOVA Interaction Model checking ANOVA, April 008 Comparison of or more groups Julie Lyng Forman,
More informationTopic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects
Topic 5 - One-Way Random Effects Models One-way Random effects Outline Model Variance component estimation - Fall 013 Confidence intervals Topic 5 Random Effects vs Fixed Effects Consider factor with numerous
More informationLecture 9: Factorial Design Montgomery: chapter 5
Lecture 9: Factorial Design Montgomery: chapter 5 Page 1 Examples Example I. Two factors (A, B) each with two levels (, +) Page 2 Three Data for Example I Ex.I-Data 1 A B + + 27,33 51,51 18,22 39,41 EX.I-Data
More informationOne-Way Analysis of Variance (ANOVA) There are two key differences regarding the explanatory variable X.
One-Way Analysis of Variance (ANOVA) Also called single factor ANOVA. The response variable Y is continuous (same as in regression). There are two key differences regarding the explanatory variable X.
More informationData are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)
BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data
More informationOutline. Analysis of Variance. Comparison of 2 or more groups. Acknowledgements. Comparison of serveral groups
Outline Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~jufo/varianceregressionf2011.html Comparison of serveral groups Model checking Marc Andersen, mja@statgroup.dk
More informationAnalysis of Covariance
Analysis of Covariance (ANCOVA) Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 10 1 When to Use ANCOVA In experiment, there is a nuisance factor x that is 1 Correlated with y 2
More informationAnalysis of variance. April 16, Contents Comparison of several groups
Contents Comparison of several groups Analysis of variance April 16, 2009 One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More informationAnalysis of variance. April 16, 2009
Analysis of variance April 16, 2009 Contents Comparison of several groups One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More informationChapter 8 (More on Assumptions for the Simple Linear Regression)
EXST3201 Chapter 8b Geaghan Fall 2005: Page 1 Chapter 8 (More on Assumptions for the Simple Linear Regression) Your textbook considers the following assumptions: Linearity This is not something I usually
More informationAnalysis of Variance
1 / 70 Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression11_2 Marc Andersen, mja@statgroup.dk Analysis of variance and regression for health researchers,
More informationPLS205 Lab 2 January 15, Laboratory Topic 3
PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationChapter 11. Analysis of Variance (One-Way)
Chapter 11 Analysis of Variance (One-Way) We now develop a statistical procedure for comparing the means of two or more groups, known as analysis of variance or ANOVA. These groups might be the result
More informationLecture 7: Latin Squares and Related Designs
Lecture 7: Latin Squares and Related Designs Montgomery: Section 4.2 and 4.3 1 Lecture 7 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study of four gasoline additives(a,b,c,
More informationOverview Scatter Plot Example
Overview Topic 22 - Linear Regression and Correlation STAT 5 Professor Bruce Craig Consider one population but two variables For each sampling unit observe X and Y Assume linear relationship between variables
More informationLecture 7 Remedial Measures
Lecture 7 Remedial Measures STAT 512 Spring 2011 Background Reading KNNL: 3.8-3.11, Chapter 4 7-1 Topic Overview Review Assumptions & Diagnostics Remedial Measures for Non-normality Non-constant variance
More informationOutline. Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping
Topic 19: Remedies Outline Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping Regression Diagnostics Summary Check normality of the residuals
More informationTwo-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek
Two-factor studies STAT 525 Chapter 19 and 20 Professor Olga Vitek December 2, 2010 19 Overview Now have two factors (A and B) Suppose each factor has two levels Could analyze as one factor with 4 levels
More informationa = 4 levels of treatment A = Poison b = 3 levels of treatment B = Pretreatment n = 4 replicates for each treatment combination
In Box, Hunter, and Hunter Statistics for Experimenters is a two factor example of dying times for animals, let's say cockroaches, using 4 poisons and pretreatments with n=4 values for each combination
More informationNonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown
Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding
More information1 One-way Analysis of Variance
1 One-way Analysis of Variance Suppose that a random sample of q individuals receives treatment T i, i = 1,,... p. Let Y ij be the response from the jth individual to be treated with the ith treatment
More informationE509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.
E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189
More informationTopic 14: Inference in Multiple Regression
Topic 14: Inference in Multiple Regression Outline Review multiple linear regression Inference of regression coefficients Application to book example Inference of mean Application to book example Inference
More informationLecture 10: Experiments with Random Effects
Lecture 10: Experiments with Random Effects Montgomery, Chapter 13 1 Lecture 10 Page 1 Example 1 A textile company weaves a fabric on a large number of looms. It would like the looms to be homogeneous
More information22s:152 Applied Linear Regression. Take random samples from each of m populations.
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationIn many situations, there is a non-parametric test that corresponds to the standard test, as described below:
There are many standard tests like the t-tests and analyses of variance that are commonly used. They rest on assumptions like normality, which can be hard to assess: for example, if you have small samples,
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationTopic 29: Three-Way ANOVA
Topic 29: Three-Way ANOVA Outline Three-way ANOVA Data Model Inference Data for three-way ANOVA Y, the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to b Factor C with levels
More informationTopic 28: Unequal Replication in Two-Way ANOVA
Topic 28: Unequal Replication in Two-Way ANOVA Outline Two-way ANOVA with unequal numbers of observations in the cells Data and model Regression approach Parameter estimates Previous analyses with constant
More information22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationEXST7015: Estimating tree weights from other morphometric variables Raw data print
Simple Linear Regression SAS example Page 1 1 ********************************************; 2 *** Data from Freund & Wilson (1993) ***; 3 *** TABLE 8.24 : ESTIMATING TREE WEIGHTS ***; 4 ********************************************;
More informationFailure Time of System due to the Hot Electron Effect
of System due to the Hot Electron Effect 1 * exresist; 2 option ls=120 ps=75 nocenter nodate; 3 title of System due to the Hot Electron Effect ; 4 * TIME = failure time (hours) of a system due to drift
More informationK. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =
K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing
More informationOutline Topic 21 - Two Factor ANOVA
Outline Topic 21 - Two Factor ANOVA Data Model Parameter Estimates - Fall 2013 Equal Sample Size One replicate per cell Unequal Sample size Topic 21 2 Overview Now have two factors (A and B) Suppose each
More informationBE640 Intermediate Biostatistics 2. Regression and Correlation. Simple Linear Regression Software: SAS. Emergency Calls to the New York Auto Club
BE640 Intermediate Biostatistics 2. Regression and Correlation Simple Linear Regression Software: SAS Emergency Calls to the New York Auto Club Source: Chatterjee, S; Handcock MS and Simonoff JS A Casebook
More informationEXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"
EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically
More informationOutline. Topic 22 - Interaction in Two Factor ANOVA. Interaction Not Significant. General Plan
Topic 22 - Interaction in Two Factor ANOVA - Fall 2013 Outline Strategies for Analysis when interaction not present when interaction present when n ij = 1 when factor(s) quantitative Topic 22 2 General
More informationThis is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
EXST3201 Chapter 13c Geaghan Fall 2005: Page 1 Linear Models Y ij = µ + βi + τ j + βτij + εijk This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
More informationAnswer Keys to Homework#10
Answer Keys to Homework#10 Problem 1 Use either restricted or unrestricted mixed models. Problem 2 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean
More informationChapter 16. Nonparametric Tests
Chapter 16 Nonparametric Tests The statistical tests we have examined so far are called parametric tests, because they assume the data have a known distribution, such as the normal, and test hypotheses
More informationAssignment 9 Answer Keys
Assignment 9 Answer Keys Problem 1 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean 26.00 + 34.67 + 39.67 + + 49.33 + 42.33 + + 37.67 + + 54.67
More informationNonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I
1 / 16 Nonparametric tests Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I Nonparametric one and two-sample tests 2 / 16 If data do not come from a normal
More informationdm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" = -/\<>*"; ODS LISTING;
dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************; *** Moore, David
More informationChapter 6 Multiple Regression
STAT 525 FALL 2018 Chapter 6 Multiple Regression Professor Min Zhang The Data and Model Still have single response variable Y Now have multiple explanatory variables Examples: Blood Pressure vs Age, Weight,
More informationLecture 7 Randomized Complete Block Design (RCBD) [ST&D sections (except 9.6) and section 15.8]
Lecture 7 Randomized Complete Block Design () [ST&D sections 9.1 9.7 (except 9.6) and section 15.8] The Completely Randomized Design () 1. It is assumed that all experimental units (EU's) are uniform..
More informationSTAT5044: Regression and Anova
STAT5044: Regression and Anova Inyoung Kim 1 / 49 Outline 1 How to check assumptions 2 / 49 Assumption Linearity: scatter plot, residual plot Randomness: Run test, Durbin-Watson test when the data can
More informationChapter 2 Inferences in Simple Linear Regression
STAT 525 SPRING 2018 Chapter 2 Inferences in Simple Linear Regression Professor Min Zhang Testing for Linear Relationship Term β 1 X i defines linear relationship Will then test H 0 : β 1 = 0 Test requires
More informationLinear Combinations. Comparison of treatment means. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 6 1
Linear Combinations Comparison of treatment means Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 6 1 Linear Combinations of Means y ij = µ + τ i + ǫ ij = µ i + ǫ ij Often study
More informationOdor attraction CRD Page 1
Odor attraction CRD Page 1 dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************;
More informationThe ε ij (i.e. the errors or residuals) are normally distributed. This assumption has the least influence on the F test.
Lecture 11 Topic 8: Data Transformations Assumptions of the Analysis of Variance 1. Independence of errors The ε ij (i.e. the errors or residuals) are statistically independent from one another. Failure
More informationHandout 1: Predicting GPA from SAT
Handout 1: Predicting GPA from SAT appsrv01.srv.cquest.utoronto.ca> appsrv01.srv.cquest.utoronto.ca> ls Desktop grades.data grades.sas oldstuff sasuser.800 appsrv01.srv.cquest.utoronto.ca> cat grades.data
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Exploring Data: Distributions Look for overall pattern (shape, center, spread) and deviations (outliers). Mean (use a calculator): x = x 1 + x
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationLecture notes on Regression & SAS example demonstration
Regression & Correlation (p. 215) When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable individually, and you can also
More informationStatistics 512: Applied Linear Models. Topic 1
Topic Overview This topic will cover Course Overview & Policies SAS Statistics 512: Applied Linear Models Topic 1 KNNL Chapter 1 (emphasis on Sections 1.3, 1.6, and 1.7; much should be review) Simple linear
More informationTopic 32: Two-Way Mixed Effects Model
Topic 3: Two-Way Mixed Effects Model Outline Two-way mixed models Three-way mixed models Data for two-way design Y is the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to
More informationTesting Independence
Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1
More informationTopic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts
Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction The analysis of covariance (ANCOVA) is a technique that is occasionally useful for improving the precision of an experiment.
More informationCourse Information Text:
Course Information Text: Special reprint of Applied Linear Statistical Models, 5th edition by Kutner, Neter, Nachtsheim, and Li, 2012. Recommended: Applied Statistics and the SAS Programming Language,
More informationChapter 3. Diagnostics and Remedial Measures
Chapter 3. Diagnostics and Remedial Measures So far, we took data (X i, Y i ) and we assumed Y i = β 0 + β 1 X i + ǫ i i = 1, 2,..., n, where ǫ i iid N(0, σ 2 ), β 0, β 1 and σ 2 are unknown parameters,
More information6 Single Sample Methods for a Location Parameter
6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationIX. Complete Block Designs (CBD s)
IX. Complete Block Designs (CBD s) A.Background Noise Factors nuisance factors whose values can be controlled within the context of the experiment but not outside the context of the experiment Covariates
More informationPLS205 Winter Homework Topic 8
PLS205 Winter 2015 Homework Topic 8 Due TUESDAY, February 10, at the beginning of discussion. Answer all parts of the questions completely, and clearly document the procedures used in each exercise. To
More informationSAS code for Levene s Test and Weighted Least Squares DM LOG; CLEAR; OUT; CLEAR; ; ODS GRAPHICS ON; OPTIONS NODATE NONUMBER;
4.5 4.9 Weighted WeightedLeast Least Squares Squares and and Transformation Transformation Example Example 97 8 SAS code for Levene s Test and Weighted Least Squares M LOG; LEAR; OUT; LEAR; ; OS GRAPHIS
More informationWrap-up. The General Linear Model is a special case of the Generalized Linear Model. Consequently, we can carry out any GLM as a GzLM.
Model Based Statistics in Biology. Part V. The Generalized Linear Model. Analysis of Continuous Data ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10, 11), Part IV (Ch13,
More informationOutline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013
Topic 19 - Inference - Fall 2013 Outline Inference for Means Differences in cell means Contrasts Multiplicity Topic 19 2 The Cell Means Model Expressed numerically Y ij = µ i + ε ij where µ i is the theoretical
More informationTopic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]
Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] The completely randomized design Treatments are randomly assigned to e.u. such
More informationPubH 7405: REGRESSION ANALYSIS SLR: DIAGNOSTICS & REMEDIES
PubH 7405: REGRESSION ANALYSIS SLR: DIAGNOSTICS & REMEDIES Normal Error RegressionModel : Y = β 0 + β ε N(0,σ 2 1 x ) + ε The Model has several parts: Normal Distribution, Linear Mean, Constant Variance,
More informationPLS205 Lab 6 February 13, Laboratory Topic 9
PLS205 Lab 6 February 13, 2014 Laboratory Topic 9 A word about factorials Specifying interactions among factorial effects in SAS The relationship between factors and treatment Interpreting results of an
More informationBooklet of Code and Output for STAC32 Final Exam
Booklet of Code and Output for STAC32 Final Exam December 7, 2017 Figure captions are below the Figures they refer to. LowCalorie LowFat LowCarbo Control 8 2 3 2 9 4 5 2 6 3 4-1 7 5 2 0 3 1 3 3 Figure
More information