ANOVA CIVL 7012/8012

Similar documents
Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

SIMPLE REGRESSION ANALYSIS. Business Statistics

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

STAT Chapter 10: Analysis of Variance

Categorical Predictor Variables

Lecture 7: Hypothesis Testing and ANOVA

Statistics Introductory Correlation

Variance Decomposition and Goodness of Fit

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Analysis of Variance: Part 1

STAT 115:Experimental Designs

Chap The McGraw-Hill Companies, Inc. All rights reserved.

22s:152 Applied Linear Regression. Take random samples from each of m populations.

Lecture 6 Multiple Linear Regression, cont.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Ch 2: Simple Linear Regression

Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest.

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

16.3 One-Way ANOVA: The Procedure

20.0 Experimental Design

Finding Relationships Among Variables

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017

Chapter 10: Analysis of variance (ANOVA)

Textbook Examples of. SPSS Procedure

Wolf River. Lecture 15 - ANOVA. Exploratory analysis. Wolf River - Data. Sta102 / BME102. October 22, 2014

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Chapter 11 - Lecture 1 Single Factor ANOVA

Sleep data, two drugs Ch13.xls

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Inferences for Regression

1 Introduction to One-way ANOVA

Wolf River. Lecture 15 - ANOVA. Exploratory analysis. Wolf River - Data. Sta102 / BME102. October 26, 2015

One-Way Analysis of Variance (ANOVA) Paul K. Strode, Ph.D.

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

ANOVA: Analysis of Variation

Inference for the Regression Coefficient

Inference for Regression Simple Linear Regression

Chapter 4. Regression Models. Learning Objectives

Chapter 4: Regression Models

Hypothesis Testing hypothesis testing approach

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Announcements. Unit 4: Inference for numerical variables Lecture 4: ANOVA. Data. Statistics 104

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

STAT 350 Final (new Material) Review Problems Key Spring 2016

One-way ANOVA. Experimental Design. One-way ANOVA

The Multiple Regression Model

Lecture 9: Linear Regression

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Correlation 1. December 4, HMS, 2017, v1.1

Simple Linear Regression: One Qualitative IV

Statistics and Quantitative Analysis U4320

A discussion on multiple regression models

Exam details. Final Review Session. Things to Review

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Chapter 10. Design of Experiments and Analysis of Variance

Week 14 Comparing k(> 2) Populations

PubH 7405: REGRESSION ANALYSIS. MLR: INFERENCES, Part I

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Analysis of Variance

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Multiple Linear Regression

Basic Business Statistics, 10/e

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Unit 27 One-Way Analysis of Variance

ANOVA: Analysis of Variance

Assignment #7. Chapter 12: 18, 24 Chapter 13: 28. Due next Friday Nov. 20 th by 2pm in your TA s homework box

Analysis of Variance

ANOVA: Analysis of Variance

Analysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร

Can you tell the relationship between students SAT scores and their college grades?

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)

ANOVA (Analysis of Variance) output RLS 11/20/2016

ST430 Exam 2 Solutions

Analysis of variance

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

Workshop Research Methods and Statistical Analysis

Types of Statistical Tests DR. MIKE MARRAPODI

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Comparing Several Means: ANOVA

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Review of the General Linear Model

One-Way Analysis of Variance (ANOVA)

10 One-way analysis of variance (ANOVA)

Linear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is.

Transcription:

ANOVA CIVL 7012/8012

ANOVA ANOVA = Analysis of Variance A statistical method used to compare means among various datasets (2 or more samples) Can provide summary of any regression analysis in a table called ANOVA Table Developed by statistician and evolutionary biologist Ronald Fisher in 1921

ANOVA Table Basic Information contains Estimates of Variance Estimates used to answer Inferential questions of regression analysis Analysis of Variance and regression analysis are closely related Usually employed in comparisons involving several population means

Why the name, ANOVA Why Not ANOME, where ME=Means Although means are compared, but Comparisons are made using estimates of variances The ANOVA test statistics used are actually ratios of estimates of variance

ANOVA vs. REGRESSION Independent Variables ANOVA: must be treated as nominal REGRESSION: can be of any mixture (nominal, ordinal, interval) ANOVA is a special case of regression analysis For multivariable analysis or regression, the technique is called Analysis of Covariance (ANACOVA)

FACTORS AND LEVELS Assume a nominal (categorical) variable with k categories: Then number of dummy variables = k 1 These (k 1) variables collectively describe the basic nominal variable The basic nominal variable is called FACTOR The different categories of the FACTOR are referred to as its LEVELS

FIXED AND RANDOM FACTORS RANDOM FACTOR Whose LEVELs may be regarded as a sample from some large population of levels Example, Subjects, Litters, Observers, Days, Weeks FIXED FACTOR Whose LEVELs are the only ones of interest Example, Gender, Age, Marital Status, Education BOTH: locations, treatments, drugs, exposures

Types of ANOVA Several types depending on experimental designs and situations for which they have been developed One way (one factor, fixed effects) Two way (two factors, random effects) Two way with repeated measures (two factors, random effects) Fully nested (hierarchical factors) Kruskal-Wallis (non-parametric one way) Friedman (non-parametric two way)

THE PROBLEM (One Way ANOVA) To Determine whether the population means are all equal or not. Given k means (denoted as μ 1, μ 2,, μ k ), the basic null hypothesis of interest is: H 0 : μ 1 = μ 2 = = μ k The Alternate hypothesis is given by: H A : "The k population means are not all equal"

Assumptions (One Way ANOVA) All populations involved follow normal distribution Variance of the dependent variable is the same in each population Random samples have been selected from each populations or groups Each experimental unit sampled has been recorded with a specified dependent variable value

ANOVA Table Source Between groups/ Treatment groups/model Within Groups/Error Degrees of freedom (df) Sum of Squares (SS) Total N 1 SSY Mean Square (MS) k 1 SST MST = SST k 1 N k SSE MSE = SSE N k F-value/F MST MSE k = number of population means N = Total number of observations SST = Sum of squares between groups SSE = Sum of squares within groups/ Residual sum of squares/error sum of Squares SSY = Total sum of squares MST = Mean square Treatment/Mean Square between groups MSE = Mean square Error

Partition of Variance Y Y i Y i (amount unexplained by regression) Y i Y i തY Y = തY Y i = መβ 0 + መβ 1 X i Y i തY Y i തY (total amount unexplained) (amount explained by regression) Y = መβ 0 + መβ 1 X X i X

Partition of Variance (Cont.) Total unexplained variation Variation in all observations In other words, OR, = Variation due to regression = Variation between each observation and its group mean + Unexplained residual variation + Variation between each group mean and the overall mean SSY = SST + SSE k n i k k n i (Y ij തY) 2 = n i (Y i തY) 2 + (Y ij തY i ) 2 i=1 j=1 i=1 i=1 j=1

F Statistics For a one-way ANOVA, the test statistic is equal to the ratio of MST and MSE This ratio is known to follow an F distribution The test statistics is calculated as, F = MST MSE If F (observed) > F (Critical) Reject Null hypothesis If F (observed) F (Critical) Fail to reject Null hypothesis

F Distribution F distribution table is used to find the critical value Required: Degrees of freedom of Numerator (MST) Degrees of freedom of Denominator (MSE) Value of alpha (0.05, 0.1, ) Table C.7 (Textbook page 572-573)

EXAMPLE Suppose the National Transportation Safety Board (NTSB) wants to examine the safety of compact cars and full-size cars. It collects a sample of three for each of the treatments (cars types). Using the hypothetical data provided below, test whether the mean pressure applied to the driver s head during a crash test is equal for each types of car. Use α = 5% Compact cars 643 484 655 456 702 402 Full size cars

EXAMPLE (Cont.) Step 1 State the null and alternate hypothesis H 0 : μ 1 = μ 2 H A : Atleast one mean pressure is not ststistically equal Step 2 Calculate the appropriate test statistic (Find sum of squares, mean squares) and critical value and then compare Example shown in Excel file (example_anova.xlsx)

Example-1: Complete ANOVA Table

Example-1: Answer

Example-2: Complete ANOVA Table

Example-2: Solution

Example-3 N=20

Example-3: Solution