Multiple Sample Numerical Data
|
|
- Briana Richard
- 6 years ago
- Views:
Transcription
1 Multiple Sample Numerical Data Analysis of Variance, Kruskal-Wallis test, Friedman test University of California, San Diego Instructor: Ery Arias-Castro 1 / 32 Multiple unpaired numerical samples Suppose we have multiple unpaired samples: The ith observation from group j is denoted Y ij. group 1 group 2 group J Y 1,1 Y 1,2 Y 1,J Y 2,1 Y 2,2 Y 2,J... Y n1,1 Y n2,2 Y nj,j There are n j observations in group j, so that N = n 1 + +n J is the total sample size. An experiment where the number of observations per group is the same is said to be have a balanced design. Summary statistics: summarize each sample independently of the others. Graphics: side-by-side boxplots. Example: Smoking and Heart Rate We consider the smokers data (Larsen & Marx, case study ). non light moderate heavy Non-smokers, light smokers, moderate smokers and heavy smokers (six in each group) undertook sustained physical exercise. Their heart rates were measured after resting for three minutes. (The observations are not paired.) Main question: Is smoking associated with a decrease in fitness level? 2 / 32 3 / 32
2 Testing for Equality of Means Let µ j be the population mean for group j. Consider testing H 0 : µ 1 = = µ J versus H 1 : not all µ j s are equal 4 / 32 One-Way Analysis of Variance (ANOVA) Let Y.j and S 2 j denote the jth sample mean and variance: Y.j = 1 n j n j Y ij, S 2 j = 1 n j 1 n j (Y ij Y.j ) 2 Let Y.. denote the (all-groups-combined) grand mean: Y.. = 1 N n j Y ij = n j N Y.j 5 / 32 One-Way Analysis of Variance (ANOVA) Compute the between groups sum of squares (aka treatment sum of squares) SS T = n j (Y.j Y.. ) 2 Compute the within groups sum of squares (aka error sum of squares) SSE = n j (Y ij Y.j ) 2 = (n j 1)Sj 2 The F-test rejects for large values of F = SS T/(J 1) SSE/(N J) Theory. If the observations are iid normal, then F has the so-called F-distribution with J 1 and N J degrees of freedom. 6 / 32
3 One-way ANOVA table Traditionally, the computations above are gathered in an ANOVA table: Df SS MS = SS/Df F-ratio p-value Treatment J 1 SS T SS T /J F = MS T /MS E P(F J 1,N J >F) Residuals N J SSE SSE/(N J) Sometimes, an ANOVA table also includes the total sum of squares: SS Tot = n j (Y ij Y.. ) 2 Note that SS Tot = SS T +SSE. 7 / 32 One-Way ANOVA model assumptions The p-value relies on the assumption that, at least approximately, and that these are independent. This is often expressed as where ε ij iid N(0,σ 2 ) are the measurement errors. µ = 1 N J n jµ j is the grand mean. Y ij N(µ j,σ 2 ) Y ij = µ+α j +ε ij α j = µ j µ is the jth treatment effect. Note that J n jα j = 0. Testing µ 1 = = µ J is equivalent to testing α 1 = = α J = 0. 8 / 32
4 Checking Assumptions We need to check that the ε ij s satisfy the following properties: 1. They have the same variance (homoscedasticity). 2. They are approximately normally distributed (important if the sample sizes are small). 3. They are independent of each other. The errors are not available to us. As a substitute we use the residuals: e ij = Y ij Y.j 1. We use a residual plot to check for homoscedasticity. 2. We use a Q-Q plot of the residuals to check for (approximate) normality. 3. Independence is usually assessed based on how the data was collected. While a the assumption of normality is cured in the large-sample limit (because of the CLT), the assumption of homoscedasticity is not. However there is a variant (the analog of the Welch s t-test) that does not require homoscedasticity. Tukey s Honest Significant Differences Although the F-test is significant, in the boxplots it appears that there is not much difference between heart rates for non smokers and light smokers We might want to know which groups have different population means. Tukey s honest significant differences provides a method to perform all (or some selected tests) with an overall level of 5%. The level is exact under the same assumptions as in ANOVA. In our example, only the differences between heavy and light (or non) smokers are statistically significant. This may seem paradoxical, as we accept µ light = µ moderate, and µ moderate = µ heavy 9 / 32 while rejecting µ light = µ heavy 10 / 32
5 Calibration by bootstrap The one-way ANOVA F-test and Tukey s HSD both assume (approximate) normality and equal variances. (Although the Welch ANOVA F-test does not assume equal variances.) However, just as in the two-sample situation, a bootstrap calibration is possible. For the F-test, it works as follows: 1. Start by centering each group by removing its sample mean. This is to place ourselves under the null where the means are all equal. 2. Let B be a large integer. For b = 1,...B do the following: (a) For each (centered) group j = 1,...,J, sample n j observations with replacement. (b) Compute the corresponding F-ratio F b. 3. The p-value is #{b : F b F obs }+1 B +1 (roughly) the fraction of bootstrapped statistics (F b : b = 1,...,B) that are at least as large as the observed statistic F obs. 11 / 32 Permutation test Just as in the two-sample situation, a calibration by permutation is appropriate if we are interested in comparing the group distributions (goodness-of-fit testing) instead of merely comparing the means. This leads to testing H 0 : all J samples have the same distribution, meaning, (Y ij : i = 1,...,n j ;j = 1,...,J) are iid versus H 1 : this is not the case Let Z 1,...,Z n be the concatenated sample (Y ij : i = 1,...,n j ;j = 1,...,J). Under the null, the Z l s are exchangeable because the Y ij s are iid. 12 / 32 A permutation test based on the treatment sum of squares works as follows: 1. For each permutation π of {1,...,N}, let Yij π = Z π(i) for i = n j 1 +1,...,n j (where n 0 = 0) and compute SS π T = n j (Y π.j Y..) 2 2. The (exact) p-value is the fraction of SS T π that are greater or equal to the observed SS T obs. 13 / 32
6 Even for small n, computing the exact p-value as done above may not be feasible. Indeed, assuming all the Y ij s are distinct, there are (n 1,...,n J )! = (n 1 + +n J )! n 1! n J! non-equivalent permutations. This number is very large even for small sample sizes. When this is the case, we estimate the p-value by Monte Carlo sampling. We sample B permutations (B is a large integer) uniformly at random. Kruskal-Wallis Test 14 / 32 The Kruskal-Wallis test is a rank-based method, and as such, is a form of permutation test. The null hypothesis is the same as with any permutation test, that the J samples come from the same distribution (i.e., same population). Let R ij denote the overall rank of Y ij and define the rank-sum for group j The test rejects for large values of D = 12 N(N +1) n j R.j = R ij R 2.j n j 3(N +1) If the design is balanced, meaning n 1 = = n J, this is equivalent to rejecting for large values of R 2.j 15 / 32
7 To compare it with one-way ANOVA, note that D = (N 1) ( R.j n j R.. n j N n j ( R ij R.. N ) 2 ) 2 So the total sum of squares is used in the denominator as opposed to the error sum of squares as in regular one-way ANOVA. As with any rank-based method, the KW test is distribution-free (important for tabulation in the pre-personal computer age) and invariant with respect to strictly increasing transformations for the data. Theory. Under the null, D has asymptotically (all the sample sizes tend to infinity in proportion to each other) a chi-squared distribution with J 1 degrees of freedom. Repeated measures: multiple paired numerical samples Suppose now that we have multiple paired samples. This is often referred to as repeated measures, and presented as follows: treatment 1 treatment 2 treatment J subject 1 Y 1,1 Y 1,2 Y 1,J subject 2 Y 2,1 Y 2,2 Y 2,J... subject I Y I,1 Y I,2 Y I,J 16 / 32 There are I subjects (or experimental units) in the study and each subject receives all J treatments. This often occurs over a period of time. This typically leads to a two-way analysis of variance, where the subjects are the blocks. (This is covered later in the slides.) 17 / 32
8 Friedman test The Friedman test is a rank-based procedure for repeated measures designs, to test H 0 : for all i = 1,...,I, (Y i,1,...,y i,j ) are exchangeable with the alternative H 1 being the negation of H 0. Let R ij be the rank of Y i,j among (Y i,1,...,y i,j ). The treatment responses are compared within each subject. Define the sum of the ranks for treatment j: R.j = I R ij The test rejects for large values of G = 12 IJ(J +1) R 2.j 3I(J +1) Theory. Under the null, G has asymptotically (as I ) the chi-squared distribution with J 1 degrees of freedom. Two-way designs Suppose we have a two-way design with two factors: Treatment: I levels, indexed by i. Blocking: J blocks, indexed by j. These define I J groups, also called cells, indexed by pairs (i,j) : i = 1,...,I;j = 1,...,J. Let Y i,j,k denote the kth observation in cell (i,j). For simplicity, we assume the design is balanced, with the same number n of observations per group. The observations in each cell, (Y i,j,k : k = 1,...,n), are assumed iid. Summary statistics: for each group independently. Graphics: side-by-side boxplots and interactions plot. 18 / / 32
9 Pictorially, the data looks as follows: it is divided into groups indexed by the treatment and the blocking variables. Treatment 1 Treatment 2 Treatment k Block 1 Y 1,1,1,...,Y 1,1,n Y 2,1,1,...,Y 2,1,n Y k,1,1,...,y k,1,n Block 2 Y 1,2,1,...,Y 1,2,n Y 2,2,1,...,Y 2,2,n Y k,2,1,...,y k,2,n.... Block b Y 1,b,1,...,Y 1,b,n Y 2,b,1,...,Y 2,b,n Y k,b,1,...,y k,b,n 20 / 32 Example: orange juice vs ascorbic acid in tooth growth We consider the ToothGrowth data, already loaded in R. The response is the length of odontoblasts (continuous). Supplement type: VC = ascorbic acid or OJ = orange juice (categorical). Dose is either 0.5, 1 or 2 mg (discrete). Questions: 1. Does the delivery method have any effect on tooth growth when controlling for dosage? 2. Is there any interaction between delivery method and dosage in their effect on tooth growth? 21 / 32 Population means Let µ ij denote the population mean of group (i,j). Define µ.. = 1 IJ I µ ij, µ i. = 1 J µ ij, µ.j = 1 I I µ ij, We have the decomposition: µ ij = µ+α i +β j +γ ij Grand mean: µ (same as µ.. ). Treatment effects: α i = µ i. µ Blocking effects: β j = µ.j µ Interactions terms: γ ij = µ ij µ i. µ.j +µ 22 / 32
10 Note that: I α i = 0, β j = 0, I γ ij = 0, j, γ ij = 0, i 23 / 32 Hypotheses about the means Testing for a treatment effect H T : α i = µ i. µ.. = 0, i This is often the main testing problem considered. Testing for a blocking effect H B : β j = µ.j µ.. = 0, j This is often less important since the blocking variable is usually known to affect the response variable. Testing for interactions H TB : γ ij = µ ij µ i. µ.j +µ.. = 0, i,j 24 / 32 Sample means Define Y ij. = 1 n n k=1 Y ijk, Y i.. = 1 nj n k=1 Y ijk, Y.j. = 1 ni I n k=1 Y ijk Y... = 1 nji I n k=1 Y ijk Y ij. is the method of moments estimate (MME) for µ ij Y i.. is the MME estimate for µ i. Y.j. is the MME estimate for µ.j Y... is the MME estimate for µ.. 25 / 32
11 Sums of squares The treatment sum of squares The blocks sum of squares The interactions sum of squares SS T = nj SS B = ni I (Y i.. Y... ) 2 (Y.j. Y... ) 2 I SS TB = n (Y ij. Y.j. Y i.. +Y... ) 2 26 / 32 Sums of squares The error sum of squares SSE = I n (Y ijk Y ij. ) 2 k=1 Assume equal variance Var(Y ijk ) = σ 2, i,j,k We have: SS df E(SS/df) SS T I 1 σ 2 + nj I 1 i α2 i SS B J 1 σ 2 + ni J 1 j β2 j SS TB (I 1)(J 1) σ 2 + SSE (n 1)JI σ 2 n (I 1)(J 1) ij γ2 ij 27 / 32
12 F-tests For H T, reject for large For H B, reject for large For H TB, reject for large F T = SS T/(I 1) SSE/((n 1)JI) F B = SS B/(J 1) SSE/((n 1)JI) F TB = SS TB/(I 1)(J 1) SSE/((n 1)JI) 28 / 32 Theory. Assume the Y ijk s are independent normal with equal variance. Under H T, F T has the F-distribution with I 1 and (n 1)JI degrees of freedom. Under H B, F B has the F-distribution with J 1 and (n 1)JI degrees of freedom. Under H TB, F TB has the F-distribution with (I 1)(J 1) and (n 1)JI degrees of freedom. 29 / 32 Two-way ANOVA table Traditionally, the computations above are gathered in an ANOVA table: Df SS MS = SS/Df F-ratio p-value Treatment I 1 SS T SS T I 1 Blocking J 1 SS B SS B J 1 SS TB Interactions (I 1)(J 1) SS TB (I 1)(J 1) SSE Residuals (n 1)JI SSE (n 1)JI Sometimes, an ANOVA table also includes the total sum of squares: F T = MS T MS E P ( F dft,df E >F T ) F B = MS B MS E P ( F dfb,df E >F B ) F TB = MS TB MS E P ( F dftb,df E >F TB ) SS Tot = I n (Y ijk Y... ) 2 k=1 Note that SS Tot = SS T +SS B +SS TB +SSE NOTE: sometimes the table is different and the tests for treatment and blocking are performed as if it were a one-way ANOVA. 30 / 32
13 Two-Way ANOVA model assumptions and diagnostics The p-value relies on the assumption that Y ijk iid N(µ ij,σ 2 ). This is equivalent to where the ε ijk s are the measurement errors. Y ijk = µ+α i +β j +γ ij +ε ijk Checking the assumptions on the ε ijs s is done as before based the residuals: e ijk = Y ijk Y ij. If we are fitting a model without interactions, i.e. Y ijk = µ+α i +β j +ε ijk then we need to check that the model is appropriate. This can be done via an interactions plot, or a residual plot. Permutation test A calibration by permutation is also possible for the null where the groups within each block come from the same population. This allows for the blocking variable to have an effect on the response, which it typically does. But it does not allow for interactions. Therefore, consider testing H 0 : the observations within each block are iid meaning, for each j = 1,...,J, (Y ijk : i = 1,...,I;k = 1,...,n) are iid 31 / 32 versus H 1 : this is not the case Choose a test statistic, for example, consider the treatment sum of squares. Permute the observations within each block, independently of the other blocks. This is done many times (say, B = 2,000 times) and the statistic is computed every time. The p-value is computed as usual. 32 / 32
Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)
Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ
More informationBivariate Paired Numerical Data
Bivariate Paired Numerical Data Pearson s correlation, Spearman s ρ and Kendall s τ, tests of independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationChapter 12. Analysis of variance
Serik Sagitov, Chalmers and GU, January 9, 016 Chapter 1. Analysis of variance Chapter 11: I = samples independent samples paired samples Chapter 1: I 3 samples of equal size J one-way layout two-way layout
More informationMultiple Sample Categorical Data
Multiple Sample Categorical Data paired and unpaired data, goodness-of-fit testing, testing for independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationI i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.
Serik Sagitov, Chalmers and GU, February, 08 Solutions chapter Matlab commands: x = data matrix boxplot(x) anova(x) anova(x) Problem.3 Consider one-way ANOVA test statistic For I = and = n, put F = MS
More informationAnalysis of Variance
Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61
More informationrobustness, efficiency, breakdown point, outliers, rank-based procedures, least absolute regression
Robust Statistics robustness, efficiency, breakdown point, outliers, rank-based procedures, least absolute regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationSimple Linear Regression
Simple Linear Regression MATH 282A Introduction to Computational Statistics University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/ eariasca/math282a.html MATH 282A University
More informationQUEEN MARY, UNIVERSITY OF LONDON
QUEEN MARY, UNIVERSITY OF LONDON MTH634 Statistical Modelling II Solutions to Exercise Sheet 4 Octobe07. We can write (y i. y.. ) (yi. y i.y.. +y.. ) yi. y.. S T. ( Ti T i G n Ti G n y i. +y.. ) G n T
More information22s:152 Applied Linear Regression. Take random samples from each of m populations.
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationMultiple Linear Regression
Multiple Linear Regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html 1 / 42 Passenger car mileage Consider the carmpg dataset taken from
More information22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationMATH Notebook 3 Spring 2018
MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................
More informationStatistics for EES Factorial analysis of variance
Statistics for EES Factorial analysis of variance Dirk Metzler June 12, 2015 Contents 1 ANOVA and F -Test 1 2 Pairwise comparisons and multiple testing 6 3 Non-parametric: The Kruskal-Wallis Test 9 1 ANOVA
More informationDr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38
BIO5312 Biostatistics Lecture 11: Multisample Hypothesis Testing II Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/8/2016 1/38 Outline In this lecture, we will continue to
More informationChapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests
Chapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests Throughout this chapter we consider a sample X taken from a population indexed by θ Θ R k. Instead of estimating the unknown parameter, we
More informationOne-Sample Numerical Data
One-Sample Numerical Data quantiles, boxplot, histogram, bootstrap confidence intervals, goodness-of-fit tests University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationhttp://www.statsoft.it/out.php?loc=http://www.statsoft.com/textbook/ Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences
More informationMultiple comparisons - subsequent inferences for two-way ANOVA
1 Multiple comparisons - subsequent inferences for two-way ANOVA the kinds of inferences to be made after the F tests of a two-way ANOVA depend on the results if none of the F tests lead to rejection of
More informationTwo-Way Analysis of Variance - no interaction
1 Two-Way Analysis of Variance - no interaction Example: Tests were conducted to assess the effects of two factors, engine type, and propellant type, on propellant burn rate in fired missiles. Three engine
More informationEmpirical Power of Four Statistical Tests in One Way Layout
International Mathematical Forum, Vol. 9, 2014, no. 28, 1347-1356 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/imf.2014.47128 Empirical Power of Four Statistical Tests in One Way Layout Lorenzo
More informationDisadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means
Stat 529 (Winter 2011) Analysis of Variance (ANOVA) Reading: Sections 5.1 5.3. Introduction and notation Birthweight example Disadvantages of using many pooled t procedures The analysis of variance procedure
More informationTopic 22 Analysis of Variance
Topic 22 Analysis of Variance Comparing Multiple Populations 1 / 14 Outline Overview One Way Analysis of Variance Sample Means Sums of Squares The F Statistic Confidence Intervals 2 / 14 Overview Two-sample
More informationSTAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis
STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis Rebecca Barter April 6, 2015 Multiple Testing Multiple Testing Recall that when we were doing two sample t-tests, we were testing the equality
More informationCentral Limit Theorem ( 5.3)
Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationOutline Topic 21 - Two Factor ANOVA
Outline Topic 21 - Two Factor ANOVA Data Model Parameter Estimates - Fall 2013 Equal Sample Size One replicate per cell Unequal Sample size Topic 21 2 Overview Now have two factors (A and B) Suppose each
More informationStatistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data
Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data 1999 Prentice-Hall, Inc. Chap. 10-1 Chapter Topics The Completely Randomized Model: One-Factor
More informationWhat Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)
What Is ANOVA? One-way ANOVA ANOVA ANalysis Of VAriance ANOVA compares the means of several groups. The groups are sometimes called "treatments" First textbook presentation in 95. Group Group σ µ µ σ µ
More informationOne-way ANOVA. Experimental Design. One-way ANOVA
Method to compare more than two samples simultaneously without inflating Type I Error rate (α) Simplicity Few assumptions Adequate for highly complex hypothesis testing 09/30/12 1 Outline of this class
More informationComparing the means of more than two groups
Comparing the means of more than two groups Chapter 15 Analysis of variance (ANOVA) Like a t-test, but can compare more than two groups Asks whether any of two or more means is different from any other.
More informationG. Nested Designs. 1 Introduction. 2 Two-Way Nested Designs (Balanced Cases) 1.1 Definition (Nested Factors) 1.2 Notation. 1.3 Example. 2.
G. Nested Designs 1 Introduction. 1.1 Definition (Nested Factors) When each level of one factor B is associated with one and only one level of another factor A, we say that B is nested within factor A.
More informationOne-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups
One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The
More informationLecture 14: ANOVA and the F-test
Lecture 14: ANOVA and the F-test S. Massa, Department of Statistics, University of Oxford 3 February 2016 Example Consider a study of 983 individuals and examine the relationship between duration of breastfeeding
More informationTheorem A: Expectations of Sums of Squares Under the two-way ANOVA model, E(X i X) 2 = (µ i µ) 2 + n 1 n σ2
identity Y ijk Ȳ = (Y ijk Ȳij ) + (Ȳi Ȳ ) + (Ȳ j Ȳ ) + (Ȳij Ȳi Ȳ j + Ȳ ) Theorem A: Expectations of Sums of Squares Under the two-way ANOVA model, (1) E(MSE) = E(SSE/[IJ(K 1)]) = (2) E(MSA) = E(SSA/(I
More informationNonparametric Location Tests: k-sample
Nonparametric Location Tests: k-sample Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)
More informationBattery Life. Factory
Statistics 354 (Fall 2018) Analysis of Variance: Comparing Several Means Remark. These notes are from an elementary statistics class and introduce the Analysis of Variance technique for comparing several
More informationExample: Four levels of herbicide strength in an experiment on dry weight of treated plants.
The idea of ANOVA Reminders: A factor is a variable that can take one of several levels used to differentiate one group from another. An experiment has a one-way, or completely randomized, design if several
More informationANOVA: Analysis of Variance
ANOVA: Analysis of Variance Marc H. Mehlman marcmehlman@yahoo.com University of New Haven The analysis of variance is (not a mathematical theorem but) a simple method of arranging arithmetical facts so
More informationWeek 14 Comparing k(> 2) Populations
Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.
More informationUnit 12: Analysis of Single Factor Experiments
Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter
More informationiron retention (log) high Fe2+ medium Fe2+ high Fe3+ medium Fe3+ low Fe2+ low Fe3+ 2 Two-way ANOVA
iron retention (log) 0 1 2 3 high Fe2+ high Fe3+ low Fe2+ low Fe3+ medium Fe2+ medium Fe3+ 2 Two-way ANOVA In the one-way design there is only one factor. What if there are several factors? Often, we are
More informationSolutions to Final STAT 421, Fall 2008
Solutions to Final STAT 421, Fall 2008 Fritz Scholz 1. (8) Two treatments A and B were randomly assigned to 8 subjects (4 subjects to each treatment) with the following responses: 0, 1, 3, 6 and 5, 7,
More informationEcon 3790: Business and Economic Statistics. Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 13, Part A: Analysis of Variance and Experimental Design Introduction to Analysis of Variance Analysis
More informationANOVA: Analysis of Variance
ANOVA: Analysis of Variance Marc H. Mehlman marcmehlman@yahoo.com University of New Haven The analysis of variance is (not a mathematical theorem but) a simple method of arranging arithmetical facts so
More informationANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College An example ANOVA situation Example (Treating Blisters) Subjects: 25 patients with blisters Treatments: Treatment A, Treatment
More informationAnalysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes
Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes 2 Quick review: Normal distribution Y N(µ, σ 2 ), f Y (y) = 1 2πσ 2 (y µ)2 e 2σ 2 E[Y ] =
More informationAnalysis of variance (ANOVA) Comparing the means of more than two groups
Analysis of variance (ANOVA) Comparing the means of more than two groups Example: Cost of mating in male fruit flies Drosophila Treatments: place males with and without unmated (virgin) females Five treatments
More informationSection 4.6 Simple Linear Regression
Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval
More informationSTAT 401A - Statistical Methods for Research Workers
STAT 401A - Statistical Methods for Research Workers One-way ANOVA Jarad Niemi (Dr. J) Iowa State University last updated: October 10, 2014 Jarad Niemi (Iowa State) One-way ANOVA October 10, 2014 1 / 39
More informationAnalysis of Variance
Analysis of Variance Math 36b May 7, 2009 Contents 2 ANOVA: Analysis of Variance 16 2.1 Basic ANOVA........................... 16 2.1.1 the model......................... 17 2.1.2 treatment sum of squares.................
More informationTopic 29: Three-Way ANOVA
Topic 29: Three-Way ANOVA Outline Three-way ANOVA Data Model Inference Data for three-way ANOVA Y, the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to b Factor C with levels
More informationCHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC
CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests
More informationFactorial designs. Experiments
Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response
More informationSAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model
Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,
More informationLecture 11 Analysis of variance
Lecture 11 Analysis of variance Dr. Wim P. Krijnen Lecturer Statistics University of Groningen Faculty of Mathematics and Natural Sciences Johann Bernoulli Institute for Mathematics and Computer Science
More informationBiostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test
Biostatistics 270 Kruskal-Wallis Test 1 ORIGIN 1 Kruskal-Wallis Test The Kruskal-Wallis is a non-parametric analog to the One-Way ANOVA F-Test of means. It is useful when the k samples appear not to come
More information21.0 Two-Factor Designs
21.0 Two-Factor Designs Answer Questions 1 RCBD Concrete Example Two-Way ANOVA Popcorn Example 21.4 RCBD 2 The Randomized Complete Block Design is also known as the two-way ANOVA without interaction. A
More informationCHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication
CHAPTER 4 Analysis of Variance One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication 1 Introduction In this chapter, expand the idea of hypothesis tests. We
More informationOne-Way Analysis of Variance (ANOVA)
1 One-Way Analysis of Variance (ANOVA) One-Way Analysis of Variance (ANOVA) is a method for comparing the means of a populations. This kind of problem arises in two different settings 1. When a independent
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the
More informationGroup comparison test for independent samples
Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences between means. Supposing that: samples come from normal populations
More informationFactorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik
Factorial Treatment Structure: Part I Lukas Meier, Seminar für Statistik Factorial Treatment Structure So far (in CRD), the treatments had no structure. So called factorial treatment structure exists if
More informationThe legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.
1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design
More informationST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data
ST4241 Design and Analysis of Clinical Trials Lecture 7: Non-parametric tests for PDG data Department of Statistics & Applied Probability 8:00-10:00 am, Friday, September 2, 2016 Outline Non-parametric
More informationMa 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA
Ma 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA March 6, 2017 KC Border Linear Regression II March 6, 2017 1 / 44 1 OLS estimator 2 Restricted regression 3 Errors in variables 4
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationTwo-Way Factorial Designs
81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like
More informationPSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests
PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution
More informationOne-way ANOVA (Single-Factor CRD)
One-way ANOVA (Single-Factor CRD) STAT:5201 Week 3: Lecture 3 1 / 23 One-way ANOVA We have already described a completed randomized design (CRD) where treatments are randomly assigned to EUs. There is
More information1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative
More informationIntroduction to Nonparametric Statistics
Introduction to Nonparametric Statistics by James Bernhard Spring 2012 Parameters Parametric method Nonparametric method µ[x 2 X 1 ] paired t-test Wilcoxon signed rank test µ[x 1 ], µ[x 2 ] 2-sample t-test
More information1 One-way Analysis of Variance
1 One-way Analysis of Variance Suppose that a random sample of q individuals receives treatment T i, i = 1,,... p. Let Y ij be the response from the jth individual to be treated with the ith treatment
More informationAnalysis of variance (ANOVA) ANOVA. Null hypothesis for simple ANOVA. H 0 : Variance among groups = 0
Analysis of variance (ANOVA) ANOVA Comparing the means of more than two groups Like a t-test, but can compare more than two groups Asks whether any of two or more means is different from any other. In
More informationLec 3: Model Adequacy Checking
November 16, 2011 Model validation Model validation is a very important step in the model building procedure. (one of the most overlooked) A high R 2 value does not guarantee that the model fits the data
More information1 Introduction to One-way ANOVA
Review Source: Chapter 10 - Analysis of Variance (ANOVA). Example Data Source: Example problem 10.1 (dataset: exp10-1.mtw) Link to Data: http://www.auburn.edu/~carpedm/courses/stat3610/textbookdata/minitab/
More informationCh 3: Multiple Linear Regression
Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery
More informationSTAT 705 Chapter 16: One-way ANOVA
STAT 705 Chapter 16: One-way ANOVA Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 21 What is ANOVA? Analysis of variance (ANOVA) models are regression
More informationSEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics
SEVERAL μs AND MEDIANS: MORE ISSUES Business Statistics CONTENTS Post-hoc analysis ANOVA for 2 groups The equal variances assumption The Kruskal-Wallis test Old exam question Further study POST-HOC ANALYSIS
More informationLec 1: An Introduction to ANOVA
Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to
More informationChapter Seven: Multi-Sample Methods 1/52
Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze
More information2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018
Math 403 - P. & S. III - Dr. McLoughlin - 1 2018 2 Hand-out 2 Dr. M. P. M. M. M c Loughlin Revised 2018 3. Fundamentals 3.1. Preliminaries. Suppose we can produce a random sample of weights of 10 year-olds
More informationHandling Categorical Predictors: ANOVA
Handling Categorical Predictors: ANOVA 1/33 I Hate Lines! When we think of experiments, we think of manipulating categories Control, Treatment 1, Treatment 2 Models with Categorical Predictors still reflect
More informationAgonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?
Agonistic Display in Betta splendens: Data Analysis By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More information10/31/2012. One-Way ANOVA F-test
PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 1. Situation/hypotheses 2. Test statistic 3.Distribution 4. Assumptions One-Way ANOVA F-test One factor J>2 independent samples
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationResearch Methods II MICHAEL BERNSTEIN CS 376
Research Methods II MICHAEL BERNSTEIN CS 376 Goal Understand and use statistical techniques common to HCI research 2 Last time How to plan an evaluation What is a statistical test? Chi-square t-test Paired
More informationMore about Single Factor Experiments
More about Single Factor Experiments 1 2 3 0 / 23 1 2 3 1 / 23 Parameter estimation Effect Model (1): Y ij = µ + A i + ɛ ij, Ji A i = 0 Estimation: µ + A i = y i. ˆµ = y..  i = y i. y.. Effect Modell
More informationGroup comparison test for independent samples
Group comparison test for independent samples Samples come from normal populations with possibly different means but a common variance Two independent samples: z or t test on difference between means Three,
More information9 One-Way Analysis of Variance
9 One-Way Analysis of Variance SW Chapter 11 - all sections except 6. The one-way analysis of variance (ANOVA) is a generalization of the two sample t test to k 2 groups. Assume that the populations of
More informationAnalysis of variance
Analysis of variance 1 Method If the null hypothesis is true, then the populations are the same: they are normal, and they have the same mean and the same variance. We will estimate the numerical value
More informationLec 5: Factorial Experiment
November 21, 2011 Example Study of the battery life vs the factors temperatures and types of material. A: Types of material, 3 levels. B: Temperatures, 3 levels. Example Study of the battery life vs the
More informationStatistical. Psychology
SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationThe One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)
The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures
More informationPLSC PRACTICE TEST ONE
PLSC 724 - PRACTICE TEST ONE 1. Discuss briefly the relationship between the shape of the normal curve and the variance. 2. What is the relationship between a statistic and a parameter? 3. How is the α
More informationANOVA: Comparing More Than Two Means
ANOVA: Comparing More Than Two Means Chapter 11 Cathy Poliak, Ph.D. cathy@math.uh.edu Office Fleming 11c Department of Mathematics University of Houston Lecture 25-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More information