Chapter 11 The Analysis of Variance

Similar documents
Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

Chapter 8. Inferences about More Than Two Population Central Values

Summary of the lecture in Biostatistics

Lecture Notes Types of economic variables

Simple Linear Regression

ENGI 3423 Simple Linear Regression Page 12-01

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

: At least two means differ SST

STA302/1001-Fall 2008 Midterm Test October 21, 2008

Probability and. Lecture 13: and Correlation

Econometric Methods. Review of Estimation

Special Instructions / Useful Data

Multiple Linear Regression Analysis

Objectives of Multiple Regression

Bootstrap Method for Testing of Equality of Several Coefficients of Variation

Chapter 13 Student Lecture Notes 13-1

Analysis of Variance with Weibull Data

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

Chapter 8: Statistical Analysis of Simulated Data

Permutation Tests for More Than Two Samples

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Simulation Output Analysis

ESS Line Fitting

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Lecture 1 Review of Fundamental Statistical Concepts

Lecture 3. Sampling, sampling distributions, and parameter estimation

Simple Linear Regression

Statistics MINITAB - Lab 5

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

Chapter 14 Logistic Regression Models

Lecture 8: Linear Regression

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

A New Family of Transformations for Lifetime Data

Statistics: Unlocking the Power of Data Lock 5

MEASURES OF DISPERSION

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67.

The Randomized Block Design

Sampling Theory MODULE X LECTURE - 35 TWO STAGE SAMPLING (SUB SAMPLING)

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Outline. treatments. Introduction

STK3100 and STK4100 Autumn 2017

Simple Linear Regression Analysis

Extreme Value Charts and Anom Based on Inverse Rayleigh Distribution

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

22 Nonparametric Methods.

STATISTICAL INFERENCE

Suggested Answers, Problem Set 4 ECON The R 2 for the unrestricted model is by definition u u u u

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

4. Standard Regression Model and Spatial Dependence Tests

Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests. Soccer Goals in European Premier Leagues

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2

STK3100 and STK4100 Autumn 2018

Chapter -2 Simple Random Sampling

Chapter 11 Systematic Sampling

Correlation and Simple Linear Regression

Improving coverage probabilities of confidence intervals in random effects meta-analysis with publication bias

Quantitative analysis requires : sound knowledge of chemistry : possibility of interferences WHY do we need to use STATISTICS in Anal. Chem.?

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION

Wu-Hausman Test: But if X and ε are independent, βˆ. ECON 324 Page 1

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

Lecture Notes Forecasting the process of estimating or predicting unknown situations

5.1 Properties of Random Numbers

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1

Lesson 3. Group and individual indexes. Design and Data Analysis in Psychology I English group (A) School of Psychology Dpt. Experimental Psychology

Class 13,14 June 17, 19, 2015

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

Linear Regression with One Regressor

Chapter -2 Simple Random Sampling

Module 7. Lecture 7: Statistical parameter estimation

Chapter 3 Sampling For Proportions and Percentages

Dr. Shalabh. Indian Institute of Technology Kanpur

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

Logistic regression (continued)

THE ROYAL STATISTICAL SOCIETY 2010 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2 STATISTICAL INFERENCE

Module 7: Probability and Statistics

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

LINEAR REGRESSION ANALYSIS

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

Functions of Random Variables

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Chapter 3 Multiple Linear Regression Model

ENGI 4421 Propagation of Error Page 8-01

Chapter 2 Simple Linear Regression

PROPERTIES OF GOOD ESTIMATORS

Simple Linear Regression - Scalar Form

Bias Correction in Estimation of the Population Correlation Coefficient

Applied Statistics and Probability for Engineers, 5 th edition February 23, b) y ˆ = (85) =

Chapter 5 Properties of a Random Sample

Lecture 3 Probability review (cont d)

MULTIDIMENSIONAL HETEROGENEOUS VARIABLE PREDICTION BASED ON EXPERTS STATEMENTS. Gennadiy Lbov, Maxim Gerasimov

Transcription:

Chapter The Aalyss of Varace. Oe Factor Aalyss of Varace. Radomzed Bloc Desgs (ot for ths course) NIPRL

. Oe Factor Aalyss of Varace.. Oe Factor Layouts (/4) Suppose that a expermeter s terested populatos wth uow populato meas µ, µ, K, µ The oe factor aalyss of varace methodology s approprate for comparg three of more populatos. x j The observato represets the j th observato from the th populato. The sample from populato cossts of the observatos x, K, x If the sample szes, K, are all equal, the the data set s balaced, ad f the sample szes are uequal, the the data set s ubalaced. NIPRL

.. Oe Factor Layouts (/4) The total sample sze of the data set s A data set of ths d s called a oe-way or oe factor layout. The sgle factor s sad to have levels correspodg to the populatos uder cosderato. Completely radomzed desgs : the expermet s performed by radomly allocatg a total of uts amog the populatos. Modelg assumpto where the error terms T x = µ + ε j j = + L+ T Equvaletly, j ε j d ~ (, σ ) x N µ d ~ N (0, σ ) NIPRL 3

.. Oe Factor Layouts (3/4) Pot estmates of the uow populato meas H µ = L = µ H 0 : x + L+ x µ = x =, : µ µ, for some ad j A j Acceptace of the ull hypothess dcates that there s o evdece that ay of the populato meas are uequal. Rejecto of the ull hypothess mples that there s evdece that at least some of the populato meas are uequal. NIPRL 4

.. Oe Factor Layouts (4/4) Example 60 : Collapse of Bloced Arteres level : steoss = 0.78 level : steoss = 0.7 level 3 : steoss = 0.65 µ µ µ 3 0.6 + L + 8.3 = x = =.09.7 + L+ 7.6 = x = = 5.086 4 9.6 + L+ 6.6 = x3 = = 7.330 0 H : µ = µ = µ 0 3 NIPRL 5

.. Parttog the Total Sum of Squares (/5) Treatmet Sum of Squares x x + L+ x = = SSTr = ( x x ) = T x + L+ A measure of the varablty betwee the factor levels. T x x x x x x x = = = = SSTr = ( ) = + = x x + x = x x T T = = T NIPRL 6

.. Parttog the Total Sum of Squares (/5) Error sum of squares SSE = ( x x ) = j= j A measure of the varablty wth the factor levels. SSE = ( j ) = j j + = j= = j= = j= = j= x x x x x x xj x x xj x = j= = = = j= = = + = NIPRL 7

.. Parttog the Total Sum of Squares (3/5) Total sum of squares SST = ( x x ) = j= j A measure of the total varablty the data set SST = ( j ) = j j + = j= = j= = j= = j= x x x x x x xj Tx Tx xj Tx = j= = j= = + = SST = SSTr + SSE NIPRL 8

.. Parttog the Total Sum of Squares (4/5) P-value cosderatos The plausblty of the ull hypothess that the factor level meas are all equal depeds upo the relatve sze of the sum of squares for treatmets, SSTr, to the sum of squares for error, SSE. NIPRL 9

.. Parttog the Total Sum of Squares (5/5) Example 60 : Collapse of Bloced Arteres x x x 3 x =.09, = 5.086, = 7.330 0.6 + L6.6 = = 4.509 35 = j= x = 0.6 + L+ 6.6 = 770.39 j = j= = T SST = x x = 770.39 (35 4.509 ) = 34.5 j SSTr = x x T = + + SSE=SST-SSTr=34.5-04.0=38.5 (.09 ) (4 5.086 ) (0 7.330 ) (35 4.509 ) = 04.0 NIPRL 0

..3 The Aalyss of Varace Table (/5) s = j= ( x x ) j SSE = ( ) s = Mea square error SSE MSE = = d.f. SSE T χ MSE ~ σ sce ( ) = T E(MSE) T E χ T T = σ So, MSE s a ubased pot estmate of the error varace σ NIPRL

..3 The Aalyss of Varace Table (/5) Mea squares for treatmets SSTr SSTr MSTr = = d.f. - ( ) = σ µ µ µ + L µ E(MSTr) = + ( why?) where If the factor level meas are µ all equal, the χ E (M STr) = σ ad M ST r ~ σ ( why?) These results ca be used to develop a method for calculatg the p- value of the ull hypothess Whe ths ull hypothess s true, the F -statstc MSTr χ /( ) F = = ~ F, (?) T why MSE χ /( ) T T T NIPRL

..3 The Aalyss of Varace Table (3/5) p-value = P( X F) where X ~ F T, F, T dstrbuto p value F = MSTr MSE NIPRL 3

..3 The Aalyss of Varace Table (4/5) Aalyss of varace table for oe factor layout Source d.f. Sum of square s Mea squares F-statstc P-value Treatmets SSTr MSTr = SSTr MSTr F = P F, MSE ( F ) T Error T SSE MSE = SSE T total T SST NIPRL 4

..3 The Aalyss of Varace Table (5/5) Example 60 : Collapse of Bloced Arteres The degrees of freedom for treatmets s SSTr 04.0 MSTr = = = 0.0 The degrees of freedom for error s SSE 38.5 MSE = = = 4.33 3 3 MSTr 0.0 F = = = 3.6 MSE 4.33 p value = P X X F ( 3.6) 0 where ~, T = 3 = T = 35 3 = 3 Cosequetly, the ull hypothess that the average flowrate at collapse s the same for all three amouts of steoss s ot plausble. NIPRL 5

..4 Parwse Comparsos of the Factor Level Meas Whe the ull hypothess s rejected, the expermeter ca follow up the aalyss wth parwse comparsos of the factor level meas to dscover whch oes have bee show to be dfferet ad by how much. Wth factor levels there are (-)/ parwse dffereces µ µ <, NIPRL 6

α A set of cofdece level smultaeous cofdece tervals for these parwse dffereces are qα,, v qα,, v µ µ x x s +, x x + s + where s = σ = MSE q α,, v(see pp. 888-889) s a crtcal pot that s the upper pot of the Studetzed rage dstrbuto wth parameter ad degrees of freedom. v = T α NIPRL 7

These cofdece tervals are smlar to the t-tervals Dfferece : s used stead of q α, κ, ν / /, T-tervals have a dvdual cofdece level whereas ths set of smultaeous cofdece tervals have a overall cofdece level All of the (-)/ cofdece tervals cota ther respectve parameter value µ µ q α κ ν t α /, ν s larger tha,, / If the cofdece terval for the dfferece µ cotas µ zero, the there s o evdece that the meas at factor levels ad are dfferet t α ν NIPRL 8

Example 60 : Collapse of Bloced Arteres s = σ = MSE = 4.33 =.080 Wth 3 degrees of freedom for error, the crtcal pt s q 0.05,3,3 = 3.48 the overall cofdece level s -0.05=0.95 Idvdual cofdece tervals have cofdece levels of -0.096 0.98 The cofdece terval for µ µ.080 3.48.080 3.48 µ µ.09 5.086 +,.09 5.086 + +, 4 4 = ( 3.877.06, 3.877 +.06) = ( 5.939,.84) NIPRL 9

.080 3.48.080 3.48 µ µ 3.09 7.330,.09 7.330, + + + 0 0 = ( 6..36, 6. +.36) = ( 8.357, 3.884).080 3.48.080 3.48 µ µ 3 5.086 7.330,5.086 7.330, + + + 4 0 4 0 = (.44.9,.44 +.9) = ( 4.364, 0.5) Noe of these three cofdece tervals cotas zero, ad so the expermet has establshed that each of the three steoss levels results a dfferet average flow rate at collapse. NIPRL 0

..5 Sample Sze Determato The sestvty afforded by a oe factor aalyss of varace depeds upo the sample szes, K, The power of the test of the ull hypothess that the factor level meas are all equal crease as the sample szes crease. A crease the sample sze results a decrease the legths of the parwse cofdece tervals. If the sample szes are uequal, L = sqα, κ, ν + If the sample szes are all equal to, L = sq / α, κ, ν NIPRL

If pror to expermetato, a expermeter decdes that a cofdece terval legth o large tha L s requred, the the sample sze s 4s q α, κ, ν L The expermeter eeds to estmate the value of s = σ The crtcal pot, gets, larger as the umber of factor levels creases, whch results a larger sample sze requred. q α κ ν NIPRL

..6 Model Assumptos Modelg assumpto of the aalyss of varace Observatos are dstrbuted depedetly wth ormal dstrbuto that has a commo varace The depedece of the data observatos ca be judged from the maer whch a data set s collected. The ANOVA s farly robust to the dstrbuto of data, so that t provdes farly accurate results as log as the dstrbuto s ot very far from a ormal dstrbuto. The equalty of the varaces for each of the factor levels ca be judged from a comparso of the sample varaces or from a comparso of the legths of boxplots of the observatos at each factor level. NIPRL 3

Summary problems. Ca the equalty of two populato meas be tested by ANOVA?. Whe does the F-statstc follow a F-dstrbuto? 3. Why do you use the q-values stead of the t-quatles parwse comparsos of multple meas? SSE / σ 4. Does follow a ch-square dstrbuto uder both of a ull ad ts alteratve hypothess? NIPRL 4