Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38

Size: px
Start display at page:

Download "Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38"

Transcription

1 BIO5312 Biostatistics Lecture 11: Multisample Hypothesis Testing II Dr. Junchao Xia Center of Biophysics and Computational Biology Fall /8/2016 1/38

2 Outline In this lecture, we will continue to discuss more topics in Chapter 12: One-way ANOVA Overall Comparison of Group Means Comparisons of Specific Groups Multiple Comparisons Bonferroni Approach False Discovery Rate Relationship Between Multiple Testing and Regression Analysis Two-way ANOVA 11/8/2016 2/38

3 One-Way ANOVA Fixed-Effects Model Suppose there are k groups with n i observations in the ith group. The jth observation in the ith group will be denoted by y ij. Let us assume the following model. y ij = µ + i + e ij Where µ is a constant, i is a constant specific to the ith group, and e ij is an error term, which is normally distributed with mean 0 and variance 2. A typical observation from the ith group is normally distributed with mean µ+ i and variance 2. i will be a random variable for the random-effects model. NS: Nonsmokers PS: Passive Smokers NI: Noninhaling Smokers LS: Light Smokers MS: Moderate Smokers HS: Heavy Smokers Pulmonary function was assessed by force mid-expiratory Flow (FEF). How do you compare the means of the six groups? 11/8/2016 3/38

4 One-Way ANOVA: Assumptions In a one-way analysis of variance, or a one-way ANOVA model, the means of an arbitrary number of groups, each of which follows a normal distribution with the same variance, can be compared. Whether the variability in the data comes mostly from variability within groups or can truly be attributed to variability between groups can also be determined. Interpretation of the parameters of a one-way ANOVA fixed-effects model 1.µ represents the underlying mean of all groups taken together. 2. i represents the difference between the mean of the ith group and the overall mean. 3.e ij represents random error about the mean µ+ i for an individual observation from the ith group. 11/8/2016 4/38

5 One-Way ANOVA: Decomposition of Variability y ij y ( y ij y i ) ( y i y), k Total SS Within SS Between SS n j i 1 j 1 ( y y) j i 1 j 1 j i 1 j 1 Short computational form for the Between SS and Within SS ij 2 k n ( y ij y ) i 2 k n ( y i y) where y..= sum of the observations across all groups, i.e., the grand total of all observations over all groups and n = total number of observations over all groups. 11/8/2016 5/38 2

6 Overall Comparison of Group Means: F Test Between Mean Square = Between MS = Between SS/(k-1) Within Mean Square = Within MS = Within SS/(n-k) The significance test is based on the ratio of Between MS to Within MS. If this ratio is large, then we reject H 0 ; if it is small, we accept H 0. Under H 0, the ratio follows an F distribution with k - 1 and n - k degrees of freedom. Overall F test for One-way ANOVA To test the hypothesis H 0 : i = 0 for all i vs. H 1 : at least one i 0, use the following procedure. 1.Compute the Between SS, Between MS, Within SS, and Within MS. 2.Compute the test statistic F = Between MS/Within MS, which follows an F distribution with k - 1 and n - k df under H 0. If F > F k-1,n-k,1- then reject H 0. If F F k-1,n-k,1- then accept H 0. 3.The exact p-value is given by the area to the right of F under an F k-1,n-k distribution = Pr(F k-1,n-k > F) 11/8/2016 6/38

7 Comparisons of Specific Groups: t Test for Comparison of Pairs Overall F test: H 0 : all group means are equal vs. H 1 : at least two group means are different. This test lets us detect when at least two groups have different underlying means, but it does not let us determine which of the groups have means that differ from each other. t Test for Comparison of Pairs of Groups on One-Way ANOVA (Least Significant Difference (LSD) Method) If we want to test whether groups 1 and 2 have means that are significantly different from each other. Under either hypothesis, Y 1 ~N(µ+ 1, 2 /n 1 ), And Y 2 ~N(µ+ 2, 2 /n 2 ) The difference of the sample means will be used as a test criterion H 0 : 1-2 =0 t n-k distribution under H 0. If 2 is generally unknown, the best estimate of it, denoted by s 2, is substituted by S 2 = [(n 1-1)s (n 2-1)s 22 ]/(n 1 +n 2-2), or by pooled estimate of the variance for oneway ANOVA: 11/8/2016 7/38

8 Comparisons of a Collection of Specific Groups: Linear Contrasts A linear contrast (L) is any linear combination of the individual group means such that the linear coefficients add up to 0. Specifically, t Test for Linear Contrasts in One-Way ANOVA Suppose we want to test the hypothesis H 0 : µ L = 0 vs. H 1 : µ L 0, using a two-sided test with significance level = where y ij ~ N(µ+ i, 2 ), 1.Compute the pooled estimate of the variance s 2 = Within MS from the one-way ANOVA. 2.Compute the linear contrast 3.Compute the test statistic with t n-k distribution, 11/8/2016 8/38

9 Multiple Comparisons Bonferroni Approach Comparison of interested groups can not be specified before looking at the actural data. Several procedures referred to as multiple-comparisons procedures, ensure that too many falsely significant difference are not declared. It ensures that the overall probability of declaring any significant differences between all possible pairs of groups is maintained at some fixed significance level. One of the simplest such procedure is the Bonferroni adjustment. Suppose we want to compare two specific groups, arbitrarily labeled as group 1 and group 2, among k groups. To test the hypothesis H 0 : 1 = 2 vs. H 1 : 1 2, use the following procedure 1.Compute the pooled estimate of the variance s 2 = Within MS from the one-way ANOVA. 2.Compute the test statistic 3.For a two-sided level test, let * = /( k 2) If t > t n-k,1- */2 or t < t n-k, */2 then reject H 0. If t > t n-k, */2 t t n-k,1- */2 then accept H 0. This test is called the Bonferroni multiple-comparisons procedure. defined a significance level for the family of m=( k 2) tests. 11/8/2016 9/38

10 Multiple Comparisons Bonferroni Approach In a study with k groups, there are c=( k 2) possible two-group comparisons. Suppose each two-group comparison is conducted at the * level of significance. Let E be the event that at least one of the two-group comparisons is statistically significant. Pr(E) = Pr(none of the two-group comparisons is statistically significant) = 1-. If each of the two-group comparisons were independent, then from the multiplication law of probability, Pr(E) = (1- *) c where c = ( k 2) When * is small, (1- *) c 1-c *. Therefore, 1- = (1- *) c 1-c *. 11/8/ /38

11 Multiple Comparisons Bonferroni Approach The results of the multiple-comparisons procedure are typically displayed as shown. A line is drawn between the names or numbers of each pair of means that is not significantly different. This plot allows us to visually summarize the results of many comparisons of pairs of means in one concise display. Multiple-comparisons procedures are more strict than ordinary t tests if more than two means are being compared. As k increases, c = ( k 2) increases and therefore * = /c decreases. The critical value, t n-k,1- */2, therefore increases because as k increases, the degrees of freedom (n-k) decreases and the percentile 1- */2 increases, both of which result in a larger critical value. 11/8/ /38

12 Multiple Comparisons.vs. LSD procedure When is multiple-comparisons procedure used over LSD procedure? It is recommended that multiple-comparisons procedures should be used if there are many groups and not all comparisons between individual groups have been planned in advance. If there are relatively few groups and only specific comparisons of interest are intended, which have been planned in advance, preferably stated in a protocol, then ordinary t tests (i.e., the LSD procedure) is used. Multiple-comparisons procedure is applicable for comparing pairs of means. In some situations, linear contrasts involving more complex comparisons than simple contrasts based on pairs of means are of interest. If linear contrasts, which have not been planned in advance, are suggested by looking at the data, then a multiple comparisons procedure might be used to ensure that under H 0, the probability of detecting any significant linear contrast is no larger than. 11/8/ /38

13 Multiple-Comparisons for Linear Contrasts: Scheffé s Procedure Suppose we want to test the hypothesis H 0 : µ L = 0 vs. H 1 : µ L 0, at significance level, where We have k groups, with n i subjects in the ith group and a total of subjects overall. To use Scheffé s multiple-comparisons procedure in this situation, perform the following steps: 1.Compute the test statistic 2.If t > c 2 = (k-1) F k-1,n-k,1- or t < c 1 = - (k-1) F k-1,n-k,1- then reject H 0. If c 1 t c 2, then accept H 0. 11/8/ /38

14 General Procedure for Comparing the Means of k Groups of Samples 11/8/ /38

15 The False Discovery Rate Cardiovascular Disease, Genetics A subsample of 520 cases of cardiovascular disease (CVD) and 1100 controls was obtained among men in a prospective cohort study. This type of study is called a nested case control study. Baseline blood samples were obtained from men in the subsample and analyzed for 50 candidate singlenucleotide polymorphisms (SNPs). Each SNP was coded as 0 if homozygous wild type (the most common), 1 if heterozygote, and 2 if homozygous mutant. The association of each SNP with CVD was assessed using contingency-table methods. A chi-square test for trend was run for each SNP. This yielded 50 separate p - values. If the Bonferroni approach in Equation were used, then α* =.05/50 =.001. Thus, with such a low value for α* it is likely that very few of the hypotheses would be rejected, resulting in a great loss in power. Instead, an alternative approach based on the false-discovery rate (FDR) was used to control for the problem of multiple testing. 11/8/ /38

16 The False Discovery Rate In some settings, particularly in genetic studies with many hypotheses, control of the experiment-wise type I error sometimes does not seem a reasonable approach to controlling for multiple comparisons because it results in very conservative inferential procedures. The FDR approach was developed by Benjamini and Hochberg. The primary goal is not to control the overall experiment-wise type I error rate. The FDR attempts to control the proportion of false-positive results among reported statistically significant results. An advantage of the FDR approach is that is less conservative than the Bonferroni procedure and as a result yields more power to detect genuine positive effects. 11/8/ /38

17 False-Discovery-Rate (FDR) Testing Procedure 1. Suppose we have conducted k separate tests with p-values = p 1,, p k 2. For convenience we will renumber the tests so that p 1 p 2 p k 3. Define q i = kp i /i, i = 1,,k where i = rank of the p-values among the k tests. 4. Let FDR i = false-discovery rate for the ith test be defined by min (q i,, q k ) 5. Find the largest i such that FDR i < FDR 0 = critical level for the FDR (usually 0.05). 6. Reject H 0 for the hypotheses 1,, i, and accept H 0 for the remaining hypotheses. 11/8/ /38

18 False-Discovery-Rate (FDR): Example Stem-and-leaf plot and box plot of the p-values from the tests of 50 SNPs. 11/8/ /38

19 False-Discovery-Rate (FDR): Example 11/8/ /38

20 Case Study: Effects of Lead Exposure The mean and standard deviation of mean finger-wrist tapping score (MAXFWT) for each group are given. MAXFWT scores are similar in the currently exposed and previously exposed groups and are lower than the corresponding mean score in the control group. 11/8/ /38

21 Case Study: Box Plots 11/8/ /38

22 Case Study: Effects of Lead Exposure Overall F test p > 0.05: The results are considered not statistically significant. 0.01< p < 0.05:* The results are statistically significant < p < 0.01: ** The results are highly significant. p < 0.001: *** The results are very highly significant. 11/8/ /38

23 Case Study: Comparison Group Means (LSD) 11/8/ /38

24 Case Study: Use of Dummy Use of dummy variables to represent a categorical variable with k categories Suppose we have a categorical variable C with k categories. To represent that variable in a multiple-regression model, we construct k -1 dummy variables of the form: The category omitted (category 1) is referred to as the reference group. 11/8/ /38

25 Case Study: Multiple Regression Model The subjects in each category of C have a unique profile in terms of x 1,, x k-1. To relate the categorical variable C to an outcome variable y, we use the multiple regression model y = + 1 x x k-1 x k-1 + e The average value of y for subjects in category 1 (the reference category) =, the average value of y for subjects in category 2 = represents the difference between the average value of y for subjects in category 2 vs. the average value of y for subjects in the reference category. j represents the difference between the average value of y for subjects in category (j+1) vs. the reference category, j = 1,.., k-1. One-way ANOVA hypothesis H 0 : all underlying group means are the same vs. H 1 : at least two underlying group means are different. Multiple-regression Hypothesis: H 0 : all j =0 vs. H 1 : at least one of j 0. 11/8/ /38

26 Case Study: Relationship between Multiple Linear Regression and ANOVA Suppose we wish to compare the underlying mean among k groups where the observations in group j are assumed to be normally distributed with mean = µ j = µ+ j and variance = 2. To test the hypothesis H 0 : j = 0 for all j = 1,,k vs. H 1 : at least two j are different, we can use one o the two equivalent procedures: (1) We can perform the overall F test for one-way ANOVA. (2) Or we can set up a multiple-regression model of the form Where y is the outcome variable and x j = 1 if a subject is in group (j+1) and = 0 otherwise, j = 1,, k-1. The Between SS and Within SS for the one-way ANOVA model in procedure 1 are the same as the Regression SS and residual SS for the multiple linear-regression model in procedure 2. The F statistics and p- values are the same as well. 11/8/ /38

27 Case Study: Relationship between Multiple Linear Regression and ANOVA To compare the underlying mean of the (j+1)th group vs. the reference group, we can use one of the two equivalent procedures: (3) We can use the LSD procedure based on one-way ANOVA, where we compute the t statistic And s 2 = Within MS, (4) Or we can compute the t statistic The test statistic and the p-values are the same under procedures 3 and 4. 11/8/ /38

28 Case Study: Effects of Lead Exposure Relationship between Multiple Linear Regression and ANOVA To compare the underlying mean of the (j + 1)th and the (l+1)th groups, we can use one of two equivalent procedures: (5) We can use the LSD procedure based on one-way ANOVA, whereby we compute the t statistic and s 2 = Within MS (6) Or The standard error of b j b l and test statistic t can usually be obtained as a linear-contrast option for multiple-regression programs available in most statistical packages. Another way to compute se(b j b l ) is to print out the variance-covariance matrix of the regression coefficients, which is an option in most statistical packages. If there are k regression coefficients, then the (j, j)th element of this matrix is the variance of b j =Var(b j ). The (j, l)th element of this matrix is the covariance between b j and b l = Cov(b j, b l ). Then, we have Var(b j b l ) = Var(b j ) + Var(b l ) 2 Cov(b j, b l ) and se(b j b l ) = Var(b j -b l ). 11/8/ /38

29 Case Study: Relationship between Multiple Linear Regression and ANOVA 11/8/ /38

30 Two-Way ANOVA In one-way ANOVA, only one variable is used to defined the groups. In some instances, the groups being considered can be classified by two variables, arranged in the form of an RxC contingency table. To assess the effects of each variable after controlling for the effects of the other variable we use a technique called the two-way ANOVA. An interaction effect between two variables is defined as one in which the effect of one variable depends on the level of the other variable. SV: Strict Vegetarians, eat no animal products LV: Lactovegetarians, eat dairy products but no other animal foods NOR: Normals eat a standard US diet Hyertension, Nutrition Case: mean systolic blood pressure(sbp) by dietary group and sex 11/8/ /38

31 Two-way ANOVA General model can be written as Where y ijk = µ + i + j + ij + e ijk Y ijk is the SBP of the kth person in the ith dietary group and the jth sex group µ is a constant Two-Way ANOVA: General Model i is a constant representing the effect of dietary group j is a constant representing the effect of sex group ij is a constant representing the interaction effect between dietary group and sex e ijk is an error term, which is assumed to be normally distributed with mean 0 and variance 2 By convention 11/8/ /38

32 y y i Decomposition of Variability : overall mean from all data : mean of the ith row y j y ij : mean for theith row and : mean of the jth column jth column error term row effect column effect interaction effect y ijk y ( y ijk y ij ) ( y i y ) ( y j y ) ( y ij y i y j y ) r 2 T i, j i 1 j 1 k 1 S r c, Total SS c n n ( y i, j i 1 j 1 k 1 S ijk (( y 2 I y ij y ) S 2 i 2 B1 ) ( y, SS interaction,between SS for G1 r i, j i 1 j 1 k 1 j c n y ( y )) i 2 y r ) c 2 i, j i 1 j 1 k 1 S 2 E n S r 2 B2 ( y, Between SS for G2 c i, j i 1 j 1 k 1 ijk n y ( y ij, SS Error ) i 2 y ) 2 11/8/ /38

33 Hypothesis Testing in Two-Way ANOVA To test the following hypotheses 1. Test for the presence of row effects: H 0 : all i = 0 vs. H 1 : at least one i 0. This is at least for the effect of dietary group on SBP level after controlling for the effect of sex. 2. Test for the presence of column effects: H 0 : all j = 0 vs. H 1 : at least one j 0. this is a test for the effect of sex on SBP level after controlling for the effect of dietary group. 3. Test for the presence of interaction effects: H 0 : all ij = 0 vs. H 1 : at least one ij 0. this is a test of whether or not there is a differential effect of dietary group between males and females. 11/8/ /38

34 Two dummy or indicator variables were set up to represent study group (x1,x2) where x1 = 1 if a person is in the first (SV) group = 0 otherwise x2 = 1 if a person is in the second (LV) group = 0 otherwise Normal group is the reference group. A variable x3 represents sex x3 = 1 if male = 0 if female. Two-Way ANOVA: Example Then, the multiple regression model can be written as y = + 1 x x x 3 + e 11/8/ /38

35 Two-Way ANOVA: Example 11/8/ /38

36 The Kruskal-Wallis Test for Non-Normal Distribution Data When we want to compare means among more than two samples, but either the underlying distribution is far from being normal or we have ordinal data. In such situations, a nonparametric alternative to the oneway ANOVA is used. 11/8/ /38

37 The Kruskal-Wallis test To compare the means of k samples (k > 2) using nonparametric methods, use the following procedure 1. Pool the observations over all samples, thus constructing a combined sample of size N = n i 2. Assign ranks to the individual observations, using the average rank in the case of tied observations 3. Compute the rank sum R i for each of the k samples. 4. If there are ties or not, compute the test statistic respectively, Where t j refers to the number of observations (i.e., the frequency) with the same value in the jth cluster of tied observations and g is the number of tied groups. 5. For a level test, if H > 2 k-1,1- then reject H 0. Otherwise accept H To assess statistical significance, the p-value is given by p =Pr( 2 k-1 > H) This test procedure should be used only if minimum n i 5. 11/8/ /38

38 Summary In this lecture for Chapter 12, we discussed One-way ANOVA methods enable us to relate a normally distributed outcome variable to the levels of a single categorical independent variable. Two different ANOVA models based on a fixed- or random-effects model were considered. In a fixed-effects model, the levels of categorical variable are determined in advance. A major objective of this design is to test the hypothesis that the mean level of the dependent variable is different for different groups defined by the categorical variable. More complex comparisons such as testing for dose-response relationships involving more than two levels of the categorical variable can also be accomplished using linear-contrast methods. In two-way ANOVA we jointly compare the mean levels of an outcome variable according to the levels of two categorical variables. 11/8/ /38

39 The End 11/8/ /38

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46 BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics

More information

Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test

Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test Biostatistics 270 Kruskal-Wallis Test 1 ORIGIN 1 Kruskal-Wallis Test The Kruskal-Wallis is a non-parametric analog to the One-Way ANOVA F-Test of means. It is useful when the k samples appear not to come

More information

22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)

22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) 22s:152 Applied Linear Regression Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) We now consider an analysis with only categorical predictors (i.e. all predictors are

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical

More information

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests

More information

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons: STAT 263/363: Experimental Design Winter 206/7 Lecture January 9 Lecturer: Minyong Lee Scribe: Zachary del Rosario. Design of Experiments Why perform Design of Experiments (DOE)? There are at least two

More information

22s:152 Applied Linear Regression. 1-way ANOVA visual:

22s:152 Applied Linear Regression. 1-way ANOVA visual: 22s:152 Applied Linear Regression 1-way ANOVA visual: Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 Y We now consider an analysis

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

22s:152 Applied Linear Regression. Take random samples from each of m populations.

22s:152 Applied Linear Regression. Take random samples from each of m populations. 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

Analysis of Variance Bios 662

Analysis of Variance Bios 662 Analysis of Variance Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-21 13:34 BIOS 662 1 ANOVA Outline Introduction Alternative models SS decomposition

More information

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA. Analysis of Variance Read Chapter 14 and Sections 15.1-15.2 to review one-way ANOVA. Design of an experiment the process of planning an experiment to insure that an appropriate analysis is possible. Some

More information

Week 14 Comparing k(> 2) Populations

Week 14 Comparing k(> 2) Populations Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.

More information

Multiple Sample Numerical Data

Multiple Sample Numerical Data Multiple Sample Numerical Data Analysis of Variance, Kruskal-Wallis test, Friedman test University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html 1 /

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials. One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine

More information

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014 Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of

More information

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination McGill University Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II Final Examination Date: 20th April 2009 Time: 9am-2pm Examiner: Dr David A Stephens Associate Examiner: Dr Russell Steele Please

More information

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1) Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#7:(Mar-23-2010) Genome Wide Association Studies 1 The law of causality... is a relic of a bygone age, surviving, like the monarchy,

More information

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis Rebecca Barter April 6, 2015 Multiple Testing Multiple Testing Recall that when we were doing two sample t-tests, we were testing the equality

More information

Lecture 5: ANOVA and Correlation

Lecture 5: ANOVA and Correlation Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions

More information

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs) The One-Way Independent-Samples ANOVA (For Between-Subjects Designs) Computations for the ANOVA In computing the terms required for the F-statistic, we won t explicitly compute any sample variances or

More information

Multinomial Logistic Regression Models

Multinomial Logistic Regression Models Stat 544, Lecture 19 1 Multinomial Logistic Regression Models Polytomous responses. Logistic regression can be extended to handle responses that are polytomous, i.e. taking r>2 categories. (Note: The word

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method

More information

BIOMETRICS INFORMATION

BIOMETRICS INFORMATION BIOMETRICS INFORMATION Index of Pamphlet Topics (for pamphlets #1 to #60) as of December, 2000 Adjusted R-square ANCOVA: Analysis of Covariance 13: ANCOVA: Analysis of Covariance ANOVA: Analysis of Variance

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

ANOVA: Analysis of Variance

ANOVA: Analysis of Variance ANOVA: Analysis of Variance Marc H. Mehlman marcmehlman@yahoo.com University of New Haven The analysis of variance is (not a mathematical theorem but) a simple method of arranging arithmetical facts so

More information

More about Single Factor Experiments

More about Single Factor Experiments More about Single Factor Experiments 1 2 3 0 / 23 1 2 3 1 / 23 Parameter estimation Effect Model (1): Y ij = µ + A i + ɛ ij, Ji A i = 0 Estimation: µ + A i = y i. ˆµ = y..  i = y i. y.. Effect Modell

More information

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity

More information

Unit 12: Analysis of Single Factor Experiments

Unit 12: Analysis of Single Factor Experiments Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Lecture 4 Multiple linear regression

Lecture 4 Multiple linear regression Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model 1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor

More information

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?)

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?) 12. Comparing Groups: Analysis of Variance (ANOVA) Methods Response y Explanatory x var s Method Categorical Categorical Contingency tables (Ch. 8) (chi-squared, etc.) Quantitative Quantitative Regression

More information

What is a Hypothesis?

What is a Hypothesis? What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

Chapter 12. Analysis of variance

Chapter 12. Analysis of variance Serik Sagitov, Chalmers and GU, January 9, 016 Chapter 1. Analysis of variance Chapter 11: I = samples independent samples paired samples Chapter 1: I 3 samples of equal size J one-way layout two-way layout

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

BIO 682 Nonparametric Statistics Spring 2010

BIO 682 Nonparametric Statistics Spring 2010 BIO 682 Nonparametric Statistics Spring 2010 Steve Shuster http://www4.nau.edu/shustercourses/bio682/index.htm Lecture 8 Example: Sign Test 1. The number of warning cries delivered against intruders by

More information

Lecture 28. Ingo Ruczinski. December 3, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Lecture 28. Ingo Ruczinski. December 3, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University Lecture 28 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University December 3, 2015 1 2 3 4 5 1 Familywise error rates 2 procedure 3 Performance of with multiple

More information

Regression models. Categorical covariate, Quantitative outcome. Examples of categorical covariates. Group characteristics. Faculty of Health Sciences

Regression models. Categorical covariate, Quantitative outcome. Examples of categorical covariates. Group characteristics. Faculty of Health Sciences Faculty of Health Sciences Categorical covariate, Quantitative outcome Regression models Categorical covariate, Quantitative outcome Lene Theil Skovgaard April 29, 2013 PKA & LTS, Sect. 3.2, 3.2.1 ANOVA

More information

One-Way Analysis of Variance (ANOVA)

One-Way Analysis of Variance (ANOVA) 1 One-Way Analysis of Variance (ANOVA) One-Way Analysis of Variance (ANOVA) is a method for comparing the means of a populations. This kind of problem arises in two different settings 1. When a independent

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

Analysis of variance (ANOVA) Comparing the means of more than two groups

Analysis of variance (ANOVA) Comparing the means of more than two groups Analysis of variance (ANOVA) Comparing the means of more than two groups Example: Cost of mating in male fruit flies Drosophila Treatments: place males with and without unmated (virgin) females Five treatments

More information

ANOVA: Analysis of Variance

ANOVA: Analysis of Variance ANOVA: Analysis of Variance Marc H. Mehlman marcmehlman@yahoo.com University of New Haven The analysis of variance is (not a mathematical theorem but) a simple method of arranging arithmetical facts so

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Analysis of Variance: Part 1

Analysis of Variance: Part 1 Analysis of Variance: Part 1 Oneway ANOVA When there are more than two means Each time two means are compared the probability (Type I error) =α. When there are more than two means Each time two means are

More information

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing

More information

ANOVA CIVL 7012/8012

ANOVA CIVL 7012/8012 ANOVA CIVL 7012/8012 ANOVA ANOVA = Analysis of Variance A statistical method used to compare means among various datasets (2 or more samples) Can provide summary of any regression analysis in a table called

More information

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic. Serik Sagitov, Chalmers and GU, February, 08 Solutions chapter Matlab commands: x = data matrix boxplot(x) anova(x) anova(x) Problem.3 Consider one-way ANOVA test statistic For I = and = n, put F = MS

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM STAT 301, Fall 2011 Name Lec 4: Ismor Fischer Discussion Section: Please circle one! TA: Sheng Zhgang... 341 (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan... 345 (W 1:20) / 346 (Th

More information

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination McGill University Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II Final Examination Date: 23rd April 2007 Time: 2pm-5pm Examiner: Dr. David A. Stephens Associate Examiner: Dr. Russell Steele Please

More information

22s:152 Applied Linear Regression

22s:152 Applied Linear Regression 22s:152 Applied Linear Regression Chapter 7: Dummy Variable Regression So far, we ve only considered quantitative variables in our models. We can integrate categorical predictors by constructing artificial

More information

Comparing the means of more than two groups

Comparing the means of more than two groups Comparing the means of more than two groups Chapter 15 Analysis of variance (ANOVA) Like a t-test, but can compare more than two groups Asks whether any of two or more means is different from any other.

More information

MATH Notebook 3 Spring 2018

MATH Notebook 3 Spring 2018 MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,

More information

Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes

Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes 2 Quick review: Normal distribution Y N(µ, σ 2 ), f Y (y) = 1 2πσ 2 (y µ)2 e 2σ 2 E[Y ] =

More information

Regression With a Categorical Independent Variable

Regression With a Categorical Independent Variable Regression With a Independent Variable Lecture 10 November 5, 2008 ERSH 8320 Lecture #10-11/5/2008 Slide 1 of 54 Today s Lecture Today s Lecture Chapter 11: Regression with a single categorical independent

More information

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain

More information

Statistiek II. John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa. February 13, Dept of Information Science

Statistiek II. John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa. February 13, Dept of Information Science Statistiek II John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa Dept of Information Science j.nerbonne@rug.nl February 13, 2014 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated

More information

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Inference ME104: Linear Regression Analysis Kenneth Benoit August 15, 2012 August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58 Stata output resvisited. reg votes1st spend_total incumb minister

More information

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables. One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)

More information

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

One-way between-subjects ANOVA. Comparing three or more independent means

One-way between-subjects ANOVA. Comparing three or more independent means One-way between-subjects ANOVA Comparing three or more independent means Data files SpiderBG.sav Attractiveness.sav Homework: sourcesofself-esteem.sav ANOVA: A Framework Understand the basic principles

More information

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

Analysis of variance

Analysis of variance Analysis of variance 1 Method If the null hypothesis is true, then the populations are the same: they are normal, and they have the same mean and the same variance. We will estimate the numerical value

More information

Outline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013

Outline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013 Topic 19 - Inference - Fall 2013 Outline Inference for Means Differences in cell means Contrasts Multiplicity Topic 19 2 The Cell Means Model Expressed numerically Y ij = µ i + ε ij where µ i is the theoretical

More information

Specific Differences. Lukas Meier, Seminar für Statistik

Specific Differences. Lukas Meier, Seminar für Statistik Specific Differences Lukas Meier, Seminar für Statistik Problem with Global F-test Problem: Global F-test (aka omnibus F-test) is very unspecific. Typically: Want a more precise answer (or have a more

More information

Hypothesis Testing: Chi-Square Test 1

Hypothesis Testing: Chi-Square Test 1 Hypothesis Testing: Chi-Square Test 1 November 9, 2017 1 HMS, 2017, v1.0 Chapter References Diez: Chapter 6.3 Navidi, Chapter 6.10 Chapter References 2 Chi-square Distributions Let X 1, X 2,... X n be

More information

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

10/31/2012. One-Way ANOVA F-test

10/31/2012. One-Way ANOVA F-test PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 1. Situation/hypotheses 2. Test statistic 3.Distribution 4. Assumptions One-Way ANOVA F-test One factor J>2 independent samples

More information

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data

More information

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts Statistical methods for comparing multiple groups Lecture 7: ANOVA Sandy Eckel seckel@jhsph.edu 30 April 2008 Continuous data: comparing multiple means Analysis of variance Binary data: comparing multiple

More information

Regression With a Categorical Independent Variable

Regression With a Categorical Independent Variable Regression ith a Independent Variable ERSH 8320 Slide 1 of 34 Today s Lecture Regression with a single categorical independent variable. Today s Lecture Coding procedures for analysis. Dummy coding. Relationship

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Statistics for EES Factorial analysis of variance

Statistics for EES Factorial analysis of variance Statistics for EES Factorial analysis of variance Dirk Metzler June 12, 2015 Contents 1 ANOVA and F -Test 1 2 Pairwise comparisons and multiple testing 6 3 Non-parametric: The Kruskal-Wallis Test 9 1 ANOVA

More information

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs The Analysis of Variance (ANOVA) The analysis of variance (ANOVA) is a statistical technique

More information

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. A quick and easy indicator of dispersion is a. Arithmetic mean b. Variance c. Standard deviation

More information

Analysis of variance (ANOVA) ANOVA. Null hypothesis for simple ANOVA. H 0 : Variance among groups = 0

Analysis of variance (ANOVA) ANOVA. Null hypothesis for simple ANOVA. H 0 : Variance among groups = 0 Analysis of variance (ANOVA) ANOVA Comparing the means of more than two groups Like a t-test, but can compare more than two groups Asks whether any of two or more means is different from any other. In

More information

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels. Contingency Tables Definition & Examples. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels. (Using more than two factors gets complicated,

More information

Regression With a Categorical Independent Variable

Regression With a Categorical Independent Variable Regression With a Categorical Independent Variable Lecture 15 March 17, 2005 Applied Regression Analysis Lecture #15-3/17/2005 Slide 1 of 29 Today s Lecture» Today s Lecture» Midterm Note» Example Regression

More information