Assignment 9 Answer Keys

Similar documents
Answer Keys to Homework#10

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks.

unadjusted model for baseline cholesterol 22:31 Monday, April 19,

23. Fractional factorials - introduction

SAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c

20g g g Analyze the residuals from this experiment and comment on the model adequacy.

ST3232: Design and Analysis of Experiments

CSCI 688 Homework 6. Megan Rose Bryant Department of Mathematics William and Mary

19. Blocking & confounding

Lecture 10: 2 k Factorial Design Montgomery: Chapter 6

COMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION

Unreplicated 2 k Factorial Designs

5.3 Three-Stage Nested Design Example

Chapter 8 (More on Assumptions for the Simple Linear Regression)

Residuals from regression on original data 1

Handout 1: Predicting GPA from SAT

Lecture 11: Blocking and Confounding in 2 k design

Statistics 512: Solution to Homework#11. Problems 1-3 refer to the soybean sausage dataset of Problem 20.8 (ch21pr08.dat).

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3

Module 6: Model Diagnostics

Lecture 12: 2 k Factorial Design Montgomery: Chapter 6

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

Experimental design (KKR031, KBT120) Tuesday 11/ :30-13:30 V

R version ( ) Copyright (C) 2009 The R Foundation for Statistical Computing ISBN

Institutionen för matematik och matematisk statistik Umeå universitet November 7, Inlämningsuppgift 3. Mariam Shirdel

BE640 Intermediate Biostatistics 2. Regression and Correlation. Simple Linear Regression Software: SAS. Emergency Calls to the New York Auto Club

Chapter 11: Factorial Designs

Unit 6: Fractional Factorial Experiments at Three Levels

The 2 k Factorial Design. Dr. Mohammad Abuhaiba 1

Assignment 6 Answer Keys

Lecture notes on Regression & SAS example demonstration

T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum

2.830 Homework #6. April 2, 2009

Fractional Factorial Designs

Analysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking

3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value.

Experimental design (DOE) - Design

EE290H F05. Spanos. Lecture 5: Comparison of Treatments and ANOVA

Lecture 12: 2 k p Fractional Factorial Design

Stat 427/527: Advanced Data Analysis I

Single Factor Experiments

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes)

Chapter 6 The 2 k Factorial Design Solutions

Outline. Analysis of Variance. Acknowledgements. Comparison of 2 or more groups. Comparison of serveral groups

This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.

Reference: Chapter 6 of Montgomery(8e) Maghsoodloo

1) Answer the following questions as true (T) or false (F) by circling the appropriate letter.

Fractional Factorial Designs

Analysis of variance. April 16, Contents Comparison of several groups

Contents. TAMS38 - Lecture 8 2 k p fractional factorial design. Lecturer: Zhenxia Liu. Example 0 - continued 4. Example 0 - Glazing ceramic 3

Analysis of variance. April 16, 2009

Biological Applications of ANOVA - Examples and Readings

APPENDIX 1. Binodal Curve calculations

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA

Chapter 6 The 2 k Factorial Design Solutions

Outline. Topic 20 - Diagnostics and Remedies. Residuals. Overview. Diagnostics Plots Residual checks Formal Tests. STAT Fall 2013

Outline. Analysis of Variance. Comparison of 2 or more groups. Acknowledgements. Comparison of serveral groups

Analysis of Variance

Comparison of a Population Means

CHAPTER 2 SIMPLE LINEAR REGRESSION

ESTIMATION METHODS FOR MISSING DATA IN UN-REPLICATED 2 FACTORIAL AND 2 FRACTIONAL FACTORIAL DESIGNS

STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007

2 k, 2 k r and 2 k-p Factorial Designs

EXST7015: Estimating tree weights from other morphometric variables Raw data print

11 Factors, ANOVA, and Regression: SAS versus Splus

Geometry Problem Solving Drill 08: Congruent Triangles

Data Set 8: Laysan Finch Beak Widths

Math Treibergs. Peanut Oil Data: 2 5 Factorial design with 1/2 Replication. Name: Example April 22, Data File Used in this Analysis:

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression

Analysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total

STAT 350: Summer Semester Midterm 1: Solutions

Orthogonal contrasts for a 2x2 factorial design Example p130

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =

Odor attraction CRD Page 1

The One-Quarter Fraction

STA 303H1F: Two-way Analysis of Variance Practice Problems

Formal Statement of Simple Linear Regression Model

Reference: CHAPTER 7 of Montgomery(8e)

STAT451/551 Homework#11 Due: April 22, 2014

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Chapter 13 Experiments with Random Factors Solutions

Advanced Digital Design with the Verilog HDL, Second Edition Michael D. Ciletti Prentice Hall, Pearson Education, 2011

Topic 2. Chapter 3: Diagnostics and Remedial Measures

General Linear Model (Chapter 4)

Statistics GIDP Ph.D. Qualifying Exam Methodology May 26 9:00am-1:00pm

The hypergeometric distribution - theoretical basic for the deviation between replicates in one germination test?

Forestry 430 Advanced Biometrics and FRST 533 Problems in Statistical Methods Course Materials 2010

Strategy of Experimentation II

USE OF COMPUTER EXPERIMENTS TO STUDY THE QUALITATIVE BEHAVIOR OF SOLUTIONS OF SECOND ORDER NEUTRAL DIFFERENTIAL EQUATIONS

CS 5014: Research Methods in Computer Science

Statistics GIDP Ph.D. Qualifying Exam Methodology May 26 9:00am-1:00pm

STAT 350. Assignment 4

Lecture 11: Simple Linear Regression

Chapter 30 Design and Analysis of

dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" = -/\<>*"; ODS LISTING;

Unit 9: Confounding and Fractional Factorial Designs

Transcription:

Assignment 9 Answer Keys Problem 1 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean 26.00 + 34.67 + 39.67 + + 49.33 + 42.33 + + 37.67 + + 54.67 + + + 42.33 Now factorial effects A and AB can be calculated by A = 1 [ ȳ( ) + ȳ(+ ) ȳ( + ) + ȳ(+ + ) ȳ( +) + ȳ(+ +) ȳ( + +) + ȳ(+ + +)] 4 = 1 ( 26.00 + 34.67 39.67 + 49.33 42.33 + 37.67 54.67 + 42.33) 4 = 0.33, AB = 1 [ȳ( ) ȳ(+ ) ȳ( + ) + ȳ(+ + ) + ȳ( +) ȳ(+ +) ȳ( + +) + ȳ(+ + +)] 4 = 1 (26.00 34.67 39.67 + 49.33 + 42.33 37.67 54.67 + 42.33) 4 = 1.67. Other factorial effects can be calculated in a similar way. All the effects are summarized in the following table. Effects A B AB C AC BC ABC Estimates 0.33 11.33 1.67 6.83 8.83 2.83 2.17 From the table, effects B,C and AC appear to be large (significant). (b) The ANOVA table from the SAS GLM procedure is shown below (model SS is replaced by the SS for individual effects), following which are the estimates of all the effects from this procedure. Source DF Squares Mean Square F Value Pr > F A 1 0.6666667 0.6666667 0.02 0.8837 B 1 770.6666667 770.6666667 25.55 0.0001 C 1 280.1666667 280.1666667 9.29 0.0077 AB 1 16.6666667 16.6666667 0.55 0.4681 AC 1 468.1666667 468.1666667 15.52 0.0012 BC 1 48.1666667 48.1666667 1.60 0.2245 ABC 1 28.1666667 28.1666667 0.93 0.3483 Error 16 482.666667 30.166667 Corrected Total 23 2095.333333

R-Square Coeff Var Root MSE y Mean 0.769647 13.45082 5.492419 40.83333 Standard Parameter Estimate Error t Value Pr > t A 0.3333333 2.24227067 0.15 0.8837 B 11.3333333 2.24227067 5.05 0.0001 AB -1.6666667 2.24227067-0.74 0.4681 C 6.8333333 2.24227067 3.05 0.0077 AC -8.8333333 2.24227067-3.94 0.0012 BC -2.8333333 2.24227067-1.26 0.2245 ABC -2.1666667 2.24227067-0.97 0.3483 Notice that the significant effects from the ANOVA table are B, C and AC, which are consistent with my conclusions in part (a). Also, the estimates are the same. (c) From (b), only effects B,C and AC are significant. So our model should include only these three effects plus the main effect A. If we introduce variables x 1, x 2 and x 3 as follows: x 1 = { 1, if A =, 1, if A = +. x 2 = { 1, if B =, 1, if B = +. x 3 = { 1, if C =, 1, if C = +. then the fitted regression model will have the following form with the coefficients equal to 1/2 of the corresponding effect estimates: y = ȳ + A 2 x 1 + B 2 x 2 + C 2 x 3 + AC 2 x 1x 3. Plugging in ȳ = 40.83 and the effect estimates from (a) gives the fitted regression model y = 40.83 + 0.17x 1 + 5.67x 2 + 3.42x 3 4.42x 1 x 3. (d) The diagnostic plots in Figure 1 are: normal probability Q-Q plot, plot of residuals versus factor A (cutting speed), plot of residuals versus factor B (tool geometry), plot of residuals versus factor C (cutting angle), and plot of residuals versus predicted values. Also, the results of formal normality tests are listed below. Tests for Normality Test --Statistic--- -----p Value------ Shapiro-Wilk W 0.953002 Pr < W 0.3143 Kolmogorov-Smirnov D 0.13291 Pr > D >0.1500 Cramer-von Mises W-Sq 0.059821 Pr > W-Sq >0.2500 Anderson-Darling A-Sq 0.401988 Pr > A-Sq >0.2500 The normal Q-Q plot and the normality tests shows that the normality assumption is valid. None of the four residual plots has shown unequal variances or potential outliers. (e) The main effect plots for B and C are in Figure 2. The interaction plot for A and C is in Figure 3. We would like to choose the factor levels such that the life of a machine tool is maximized. According to the above three plots, the optimal levels would be (A,B,C) = (,+,+).

Figure 1: Diagnostic Plots (f) In the regression model in part (c), if we set B to be at the low level (x 2 = 1) and the high level (x 2 = 1) respectively, we will end up with the following two models: y = 35.17 + 0.17x 1 + 3.42x 3 4.42x 1 x 3 when x 2 = 1, y = 46.5 + 0.17x 1 + 3.42x 3 4.42x 1 x 3 when x 2 = 1. The contour plots generated from these two models are shown in Figure 4. The trends in both plots suggest that to obtain the longest life for the machine tool, A (cutting speed) should be set at the low level and C (cutting angle) should be set at the high level. Also, the response values in the contour plot for high level of B are always larger than those in the contour plot for low level of B. So B (tool geometry) should be set at the high level. These results are consistent with what we found in part (e).

Figure 2: Main Effect Plots Figure 3: Interaction Plots (g) The standard error of the factorial effects is caculated by MSE 30.17 S.E. = n2 k 2 = = 2.24. 3 23 2 Problem 2 (a) The estimates of the factorial effects are summarized in the table below.

Figure 4: Contour Plots for Problem 1(f) Effect Estimate A 101.625 B 1.625 C 7.375 D 306.125 AB 7.875 AC 24.875 AD 153.625 BC 43.875 BD 0.625 CD 2.125 ABC 15.625 ABD 4.125 ACD 5.625 BCD 25.375 ABCD 40.125 And the normal QQ plot of these effect estimates are given in Figure 5, from which we can see that effects A, D and AD are significant. All the other effects approximately lie on a straight line and thus are insignificant. Figure 5: Normal Q-Q Plot for Effects (b) The ANOVA result for the model including only effects A,D and AD is shown below (model SS replaced by the effect SS).

Sum of Source DF Squares Mean Square F Value Pr > F A 1 41310.5625 41310.5625 23.77 0.0004 D 1 374850.0625 374850.0625 215.66 <.0001 AD 1 94402.5625 94402.5625 54.31 <.0001 Error 12 20857.7500 1738.1458 Corrected Total 15 531420.9375 R-Square Coeff Var Root MSE y Mean 0.960751 5.372129 41.69108 776.0625 All the three p-values are very small (< 0.0005), so these three effects are significant which confirms my findings in (a). (c) If we introduce variables x 1 and x 4 as follows: x 1 = { 1, if A =, 1, if A = +. x 4 = { 1, if D =, 1, if D = +. then the regression model relating the etch rate to the significant process variables A,D and AD is y = ȳ + A 2 x 1 + D 2 x 4 + AD 2 x 1x 4 = 776.0625 50.8125x 1 + 153.0625x 4 76.8125x 1 x 4, where the overall mean ȳ and the effects estimates are obtained from output in (b) and result in (a) respectively. (d) The diagnostic plots in Figure 6 are: normal probability Q-Q plot, plot of residuals versus factor A (anode-cathode gap), plot of residuals versus factor D (power applied to the cathode), and plot of residuals versus predicted values. Also, the results of formal normality tests are listed below. Tests for Normality Test --Statistic--- -----p Value------ Shapiro-Wilk W 0.969676 Pr < W 0.8333 Kolmogorov-Smirnov D 0.113048 Pr > D >0.1500 Cramer-von Mises W-Sq 0.045931 Pr > W-Sq >0.2500 Anderson-Darling A-Sq 0.268611 Pr > A-Sq >0.2500 The normal Q-Q plot and the normality tests shows that the normality assumption is valid. None of the four residual plots has shown unequal variances or potential outliers. Hence the model fits well. (e) Only effects A,D and AD are significant. So this 2 4 design can be projected to a 2 2 design, whose ANOVA table is already given in part (b). (f) The interacton plot for A and D is in Figure 7. The plot shows that A and D interact with each other: if D is set at low level, then the response mean increases as A changes from low level to high level; whereas, if D is at high level, then the response mean decreases as A changes from low level to high level. (g) The contour plot of the etch rate using the model in (c) is as shown in Figure 8.

Figure 6: Diagnostic Plots The trend in the contour plot suggests that to maximize the etch rate, we should set the levels as (A,D) = (,+). (h) According to the trend in the contour plot in part (g), the contour corresponding to etch rate = 800 should be between the contours for etch rate = 757.91 and etch rate = 826.88, and thus will approximately pass the point (1,1) in the plot. This suggests that the levels should be set as (A, D) = (+, +) to attain the target etch rate 800.

Figure 7: Interaction Plot for A and D Figure 8: Contour Plot