What is Experimental Design?

Similar documents
What If There Are More Than. Two Factor Levels?

PLSC PRACTICE TEST ONE

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

Sleep data, two drugs Ch13.xls

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

Stat 217 Final Exam. Name: May 1, 2002

Chapter 5 Introduction to Factorial Designs Solutions

The Random Effects Model Introduction

One-way ANOVA. Experimental Design. One-way ANOVA

Chapters 4-6: Inference with two samples Read sections 4.2.5, 5.2, 5.3, 6.2

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G

Chap The McGraw-Hill Companies, Inc. All rights reserved.

OHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd

Chapter 12. Analysis of variance

Two-Sample Inferential Statistics

Masters Comprehensive Examination Department of Statistics, University of Florida

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3

with the usual assumptions about the error term. The two values of X 1 X 2 0 1

Analysis of Variance

5 Basic Steps in Any Hypothesis Test

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions


Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

Randomized Blocks, Latin Squares, and Related Designs. Dr. Mohammad Abuhaiba 1

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Lecture 7: Hypothesis Testing and ANOVA

STAT Final Practice Problems

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

This gives us an upper and lower bound that capture our population mean.

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

COGS 14B: INTRODUCTION TO STATISTICAL ANALYSIS

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Statistics For Economics & Business

ANOVA: Comparing More Than Two Means

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Factorial Analysis of Variance

Lec 1: An Introduction to ANOVA

Mathematical Notation Math Introduction to Applied Statistics

Chapter 10: Chi-Square and F Distributions

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Chapter 4: Randomized Blocks and Latin Squares

A discussion on multiple regression models

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

If we have many sets of populations, we may compare the means of populations in each set with one experiment.

Math 628 In-class Exam 2 04/03/2013

Introduction to Analysis of Variance. Chapter 11

QUEEN MARY, UNIVERSITY OF LONDON

Ch 3: Multiple Linear Regression

Design and Analysis of Experiments Prof. Jhareswar Maiti Department of Industrial and Systems Engineering Indian Institute of Technology, Kharagpur

Analysis of Variance: Part 1

Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate.

Survey of Smoking Behavior. Survey of Smoking Behavior. Survey of Smoking Behavior

WELCOME! Lecture 13 Thommy Perlinger

Summary of Chapters 7-9

Stat 6640 Solution to Midterm #2

INTRODUCTION TO ANALYSIS OF VARIANCE

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]

Confidence Intervals, Testing and ANOVA Summary

Unit 12: Analysis of Single Factor Experiments

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Sampling Distributions: Central Limit Theorem

Lec 5: Factorial Experiment

STAT 115:Experimental Designs

Group comparison test for independent samples

Ch 2: Simple Linear Regression

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

IX. Complete Block Designs (CBD s)

1. The (dependent variable) is the variable of interest to be measured in the experiment.

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Institute of Actuaries of India

Psychology 282 Lecture #4 Outline Inferences in SLR

χ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies.

Wolf River. Lecture 19 - ANOVA. Exploratory analysis. Wolf River - Data. Sta 111. June 11, 2014

Residual Analysis for two-way ANOVA The twoway model with K replicates, including interaction,

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

ST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant

Analysis of Variance and Co-variance. By Manza Ramesh

ANOVA: Comparing More Than Two Means

Basic Business Statistics, 10/e

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

BIOS 2083 Linear Models c Abdus S. Wahed

MATH Notebook 3 Spring 2018

PLS205 Lab 2 January 15, Laboratory Topic 3

RCB - Example. STA305 week 10 1

Department of Mathematics & Statistics STAT 2593 Final Examination 17 April, 2000

Week 12 Hypothesis Testing, Part II Comparing Two Populations

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Hypothesis testing: Steps

6 Sample Size Calculations

Transcription:

One Factor ANOVA

What is Experimental Design? A designed experiment is a test in which purposeful changes are made to the input variables (x) so that we may observe and identify the reasons for change in the output response (y). Objectives: Which variables are most influential on the response y? Where to set the influential x s so y is near a desired value? Where to set the influential x s so variability in y is small?

Principals of design Replication: repetition of the basic experiment Allows the estimation experimental error Obtain a more precise estimate of the effect of a factor Randomization: both the allocation of the experimental material and the order in which the individual runs are performed are randomly determined Blocking: a block is a homogeneous portion of the experimental material Used to increase the precision of an experiment

Business Example A manufacturer is interested in maximizing the tensile strength of a new synthetic fiber that will be used to make cloth for men s shirt. From previous tests, the manufacturer knows: the strength is affected by the percentage of cotton in the fiber The range of the percentage is 10% to 40% Experiment with a Single Factor

Example continued Maximizing the tensile strength of a new synthetic fiber used to make shirts* Strength is affected by the % of cotton in the fiber Test five levels of cotton percentage 15%, 20%, 25%, 30%, 35% Test five specimens at each cotton % So, a = 5 levels of the factor, and n = 5 replicates All 25 runs are made in random order *DCM pp 50

Example continued Experimental Runs This randomized test sequence, known as a Completely Randomized Design (CRD), is necessary to prevent the effects of unknown nuisance variables from contaminating the results Test Sequences Run Numbers Percentage of Cotton 1 8 20 2 18 30 3 10 20 24 19 30 25 3 15

Data for the Cotton Percentage Example Cotton Percentage Observations 1 2 3 4 5 Total Average 15 7 7 15 11 9 49 9.8 20 12 17 12 18 18 77 15.4 25 14 18 18 19 19 88 17.6 30 19 25 22 19 23 108 21.6 35 7 10 11 15 11 54 10.8 376 15.04

Tensile strength 10 15 20 25 Tensile strength 10 15 20 25 Tensile Strength Vs Cotton Percentage 15 20 25 30 35 Cotton percentage 15 20 25 30 35 Cotton percentage Box plots Scatter diagram

We strongly suspect that: Exploratory Results Cotton content affects tensile strength Around 30% cotton would result in maximum strength A more objective analysis: Should we perform a t-test on all possible pairs of means? No, results in a substantial increase in the type I error The appropriate procedure for testing the equality of several means is the analysis of variance (ANOVA)

ANOVA The name ANOVA is derived from a partitioning of total variability into its component parts: ANalysisOfVAriance The total sum of squares is a measure of overall variability in the data: a i=1 n j=1 SS total = (yij y.. ) 2 The total variability in the data can be partitioned into a sum of squares of the differences between the treatment averages and the grand average, PLUS a sum of squares of the differences of observations within treatments from the treatment average: SS total = SS treatment +SS within

ANOVA Degrees of freedom There are N = an total observations, so SS total has N -1 degrees of freedom. There are a levels of the factor, so SS treatment has a - 1 degrees of freedom. Thus, we have N a degrees of freedom for SS within

ANOVA The analysis of variance identity provides us with two estimates of σ 2 one based on the inherent variability within treatments and one based on the variability between treatments SS total = SS treatment +SS within If there are no differences in the treatment means, then these two estimates should be very similar. If they are not, we suspect that the observed differences must be caused by differences in treatment means.

ANOVA for the Cotton Percentage Data Source of Variation Sum of SS Squares treatment Degrees of Freedom Mean Square F 0 Test Statistics Cotton Percentage 475.76 4 118.94 F 0 =14.76 Error 161.20 20 8.06 SS within Total 636.96 24

ANOVA Additional Concepts A factor is fixed if the levels of a factor are predetermined and the experimenter is interested only in those particular levels (e.g. 250, 300, or 350 F). A factor is classified as random if the levels are selected at random from a population of levels (e.g. 254, 287, and 326 F). When there is only one factor, the classification does not have any effect on how the data are analyzed.

Fixed Effects Model y ij = μ + τ i + ε ij y ij : the ij th observation μ: grand mean, a parameter common to all treatments τ i : treatment effect, a parameter unique to the i th treatment, ε ij : random error

Least Squares Estimate The Sum of squares of error : a n L = (y ij μ τ i ) 2 i=1 j=1 Choose μ and τ that minimize L: L μ μ,τi = 0 L τ μ,τi = 0

Least Squares Estimate Adding a constraint: a τ i i=1 = 0 The least squares estimate for μ, τ i : μ = y.. τ i = y i. y.., i = 1, 2,, a

Cotton Percentage example -continued- Estimate of the overall mean: μ = y.. = 376 25 = 15.04

Cotton Percentage example -continued- Estimate of treatment effects: τ 1 = y 1. y.. = 9.80 15.04 = 5.24 τ 2 = y 2. y.. = 15.40 15.04 = 0.36 τ 3 = y 3. y.. = 17.60 15.04 = 2.56 τ 4 = y 4. y.. = 21.60 15.04 = 6.56 τ 5 = y 5. y.. = 10.80 15.04 = 4.24 *see slide 7 for the data

Confidence Interval (CI) 100(1 α) percent CI on the i th treatment mean: y i. ± tα 2,N a MS E n 100(1 α) percent CI on the difference of two treatments mean: y i. y j. ± tα 2,N a 2MS E n

Cotton Percentage example -continued- 95% CI on the mean of treatment 4: [21.60 ± 2.086 8.06 5 ] y 4. t 0.025,20 MS E n

Temperature Example Determine the effects of temperature on process yields Case I: Two levels of temperature setting Case II: Three levels of temperature setting

Temperature Vs Process yields Week # 1 Temperature Day 250 300 M 2.4 2.6 Tu 2.7 2.4 W 2.2 2.8 Th 2.5 2.5 F 2.0 2.2 Week #3 Week # 2 M 2.5 2.7 Tu 2.8 2.3 W 2.9 3.1 Th 2.4 2.9 F 2.1 2.2 Week # 4

Yield 2.0 2.2 2.4 2.6 2.8 3.0 3.2 Exploratory analysis: Time Sequence plot T1 T2 Any difference? T 2 =350 F T 1 =250 F TGIF? M Tu W Th F M Tu W Th F Time

Analysis t-test General form of t Statistics for one population θ θ t = s θ θ: the parameter to be estimated θ: the sample statistic that is the estimate of θ s θ : the estimator of the standard deviation of θ

Temperature Data I t-test Our interest : the difference μ 1 μ 2 Let θ = X 1 X 2 σ X1 X 2 = σ 1 2 n 1 + σ 2 2 n 2 Generally, σ 1, σ 2 unknown, use estimates: o Two methods of estimating σ X1 X 2, yielding exact and approximate t-test.

Exact t-test Exact t-test: t = θ = X 1 X 2 X 1 X 2 d s p 1/n 1 + 1/n 2 zero or a constant Estimate of σ x1 X 2 in the form of pooled variance where n 1 and n 2 are sample sizes, s p2 = (n 1 1)s 2 2 1 +(n 2 1)s 2, n 1 +n 2 2 s a2 = 1 n a 1 n a x ai x 2 i=1 a., a = 1, 2

Temperature Data I exact t-test For the temperature data: 2.45 2.57 0 T= =-0.893 0.3005 1/10+1/10 P(t 0.893)=0.191 p-value = 0.191, at 5% significance level, fail to reject H 0 Conclusion -- two scenarios: μ 1 μ 2 OR Highly variable data in each sample

Exact t-test assumptions Normality of population Independence of samples Independence of observations Small (n<30) and equal or similar sample sizes(n 1 n 2 ) Equal variance σ 12 = σ 22 = σ 2

Alternative approach: Confidence Interval for difference One sided test, such as μ 1 < μ 2 (X 1 X 2 ) ± t α,d.f. s x1 x 2 Two Sided test, such as μ 1 μ 2 (X 1 X 2 ) ± tα 2,d.f.s x 1 x 2 Bounds can be calculated and the decision can be made based on interval information

ANOVA for Temperature Data I Source of Variations d.f. SS MS F Temperature a-1= 1 0.072 0.072 0.797 SS treatment Within an-a= 18 1.626 0.090 SS within Total an-1= 19 1.698 Test Statistics Under H 0, F~ F(a-1,n-1), p=0.3838 a SS treatment = n y i. y 2.. i=1 a i=1 n j=1 a i=1 n j=1 SS within = (yij y i. ) 2 SS total = (yij y.. ) 2 a: levels of treatment (temperature), a=2 n: replication of each treatment, n=10 Numerator of s p in the exact t test! ANOVA uses SAME assumptions as t test!

Temperature Data II Variations: Week to Week? Day to Day? Temperature Day 250 300 350 M 2.4 2.6 3.2 Tu 2.7 2.4 3.0 W 2.2 2.8 3.1 Th 2.5 2.5 2.8 F 2.0 2.2 2.5 M 2.5 2.7 2.9 Tu 2.8 2.3 3.1 W 2.9 3.1 3.4 Th 2.4 2.9 3.2 F 2.1 2.2 2.6

Process Yield (Tons) 2.0 2.2 2.4 2.6 2.8 3.0 3.2 3.4 Exploratory Analysis: Temperature Data II- box plots H 0 : μ 1 = μ 2 = μ 3 H A : At least one of the mean is different. 250 300 350 Temperature (F) variability within each setting about the same

ANOVA for Temperature Data(3 levels) Source of Variations d.f. SS MS F Temperature 2 1.545 0.7725 8.91 Within 27 2.342 0.0867 Total 29 3.887 p-value=.001 Reject H 0 SS temp = 3 i=1 3 i=1 10 j=1 n 10 j=1 y ij 2 3 i=1 3 i=1 10 j=1 an 10 j=1 y ij SS total = y ij2 ( y ij ) 2 /an 2 SS within =SS total -SS temp

Typical Data for a Single Factor Experiments Treatment (level) Observations Totals Averages 1 y 11 y 12 Y 1n y 1. y 1. 2 y 21 y 22 y 2n y 2. y 2. a...... i y i1 y i2 y in y i........ a y a1 y a2 y an y a. y a. n y.. y..

Clinical Trial Example A clinical trial is designed to estimate the efficacy of an experimental drug (D) compared to placebo (P) in congestive heart failure (CHF) Efficacy measure: the rate of change per week in distance walked after being administered therapy (D or P) Baseline measure of left ventricular ejection fraction (LVEF) --- the lower the LVEF, the more serious CHF 2 investigators Design and Analysis Tandem

Rate of Change in Distance Walked per Week by Drug, Investigator, and Baseline LVEF Investigator LVEF 15 15<LVEF 25 25<LVEF 30 30<LVEF D P D P D P D P 1 0 69 842-451 319-178 165-141 56 1276 107-132 -59 8-342 -533 327-20 -131 1075 191 173-244 -255-109 1173-181 -107-5 -220 2-144 -248 228-84 168-286 -59-383 604-71 -657-316 885 155 291-631 1221 144-211 -90 745 30 0-397 497 75 37-123 540 10 168-96 -27

Analysis of CHF data Means, standard deviations and sample sizes: X D = 240.6, S D = 451.2, n D = 32, X P = 98.5, S P = 322.4, n p = 31, The two-sample t with d.f. = 61 is T = X D X P S D 2 /n D + S P 2 /n P = 3.43 corresponds to a p-value = 0.001 : reject H 0, drug is significantly more efficacious than placebo overall

ANOVA for CHF data Original data Rank transform Factor d.f. F P F P Drug 1 11.66 0.001 15.96 0.0002 Investigator 1 0.09 0.77 0.02 0.9 LVEF 1 1.48 0.23 1.18 0.28 Inv. * drug 1 0.21 0.65 0.06 0.81 LVEF*drug 1 1.76 0.19 2.63 0.11

Exploratory Data Analysis

Box plots by Investigator

Box plots Investigator effect pooled

Mean Distance by LVEF category

Mean Distance (D-P) by LVEF category