Institutionen för matematik och matematisk statistik Umeå universitet November 7, Inlämningsuppgift 3. Mariam Shirdel

Similar documents
a) Prepare a normal probability plot of the effects. Which effects seem active?

CSCI 688 Homework 6. Megan Rose Bryant Department of Mathematics William and Mary

STAT451/551 Homework#11 Due: April 22, 2014

Chapter 5 Introduction to Factorial Designs Solutions

APPENDIX 1. Binodal Curve calculations

3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value.

Chapter 6 The 2 k Factorial Design Solutions

ST3232: Design and Analysis of Experiments

Chapter 30 Design and Analysis of

Contents. 2 2 factorial design 4

Contents. TAMS38 - Lecture 10 Response surface. Lecturer: Jolanta Pielaszkiewicz. Response surface 3. Response surface, cont. 4

20g g g Analyze the residuals from this experiment and comment on the model adequacy.

Assignment 9 Answer Keys

23. Fractional factorials - introduction

Contents. TAMS38 - Lecture 8 2 k p fractional factorial design. Lecturer: Zhenxia Liu. Example 0 - continued 4. Example 0 - Glazing ceramic 3

Answer Keys to Homework#10

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks.

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant

Confidence Interval for the mean response

Chapter 6 The 2 k Factorial Design Solutions

Experimental design (DOE) - Design

Strategy of Experimentation III

Histogram of Residuals. Residual Normal Probability Plot. Reg. Analysis Check Model Utility. (con t) Check Model Utility. Inference.

Simple Linear Regression. Steps for Regression. Example. Make a Scatter plot. Check Residual Plot (Residuals vs. X)

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

Multiple Regression Examples

The 2 k Factorial Design. Dr. Mohammad Abuhaiba 1

Orthogonal contrasts for a 2x2 factorial design Example p130

Design of Experiments SUTD - 21/4/2015 1

Steps for Regression. Simple Linear Regression. Data. Example. Residuals vs. X. Scatterplot. Make a Scatter plot Does it make sense to plot a line?

Six Sigma Black Belt Study Guides

USE OF COMPUTER EXPERIMENTS TO STUDY THE QUALITATIVE BEHAVIOR OF SOLUTIONS OF SECOND ORDER NEUTRAL DIFFERENTIAL EQUATIONS

SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot.

Unreplicated 2 k Factorial Designs

Chapter 13 Experiments with Random Factors Solutions

SMAM 314 Exam 42 Name

Chapter 11: Factorial Designs

SMAM 314 Practice Final Examination Winter 2003

19. Blocking & confounding

School of Mathematical Sciences. Question 1. Best Subsets Regression

Model Building Chap 5 p251

The One-Quarter Fraction

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Design of Experiments SUTD 06/04/2016 1

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3

Design and Analysis of

Confounding and fractional replication in 2 n factorial systems

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

Multiple Linear Regression

MATH602: APPLIED STATISTICS

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23

1. Least squares with more than one predictor

Fractional Factorial Designs

Models with qualitative explanatory variables p216

Homework 04. , not a , not a 27 3 III III

CS 5014: Research Methods in Computer Science

TWO-LEVEL FACTORIAL EXPERIMENTS: REGULAR FRACTIONAL FACTORIALS

LEARNING WITH MINITAB Chapter 12 SESSION FIVE: DESIGNING AN EXPERIMENT

CS 5014: Research Methods in Computer Science. Experimental Design. Potential Pitfalls. One-Factor (Again) Clifford A. Shaffer.

Unit 6: Fractional Factorial Experiments at Three Levels

A Statistical Approach to the Study of Qualitative Behavior of Solutions of Second Order Neutral Differential Equations

EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS

(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.)

SMAM 319 Exam 1 Name. 1.Pick the best choice for the multiple choice questions below (10 points 2 each)

Analysis of Bivariate Data

Session 3 Fractional Factorial Designs 4

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

MULTIPLE LINEAR REGRESSION IN MINITAB

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

ANOVA: Analysis of Variation

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

Data Set 8: Laysan Finch Beak Widths

LECTURE 10: LINEAR MODEL SELECTION PT. 1. October 16, 2017 SDS 293: Machine Learning

28. SIMPLE LINEAR REGRESSION III

Chemometrics Unit 4 Response Surface Methodology

School of Mathematical Sciences. Question 1

Chapter 12: Multiple Regression

ESTIMATION METHODS FOR MISSING DATA IN UN-REPLICATED 2 FACTORIAL AND 2 FRACTIONAL FACTORIAL DESIGNS

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables

Statistics GIDP Ph.D. Qualifying Exam Methodology May 26 9:00am-1:00pm

Introduction to Regression

Design of Engineering Experiments Chapter 5 Introduction to Factorials

Statistics GIDP Ph.D. Qualifying Exam Methodology May 26 9:00am-1:00pm


Correlation & Simple Regression

[4+3+3] Q 1. (a) Describe the normal regression model through origin. Show that the least square estimator of the regression parameter is given by

Chapter 14 Multiple Regression Analysis

Reference: Chapter 6 of Montgomery(8e) Maghsoodloo

1 Introduction to One-way ANOVA

Lecture 10: 2 k Factorial Design Montgomery: Chapter 6

PRODUCT QUALITY IMPROVEMENT THROUGH RESPONSE SURFACE METHODOLOGY : A CASE STUDY

Examination paper for TMA4255 Applied statistics

CHAPTER 6 A STUDY ON DISC BRAKE SQUEAL USING DESIGN OF EXPERIMENTS

Design of Engineering Experiments Part 5 The 2 k Factorial Design

Unit 5: Fractional Factorial Experiments at Two Levels

Design and Analysis of Multi-Factored Experiments

Construction of Mixed-Level Orthogonal Arrays for Testing in Digital Marketing

Transcription:

Institutionen för matematik och matematisk statistik Umeå universitet November 7, 2011 Inlämningsuppgift 3 Mariam Shirdel (mash0007@student.umu.se) Kvalitetsteknik och försöksplanering, 7.5 hp

1 Uppgift 1: 7.8 An article in the IEEE Transactions on Semiconductor Manufacturing (Vol. 5, No. 3, 1992, pp. 214-222) describes an experiment to investigate the surface charge on a silicon wafer. The factors thought to influence induced surface charge are cleaning method (spin rinse dry, or SRD, and spin dry, or SD) and the position on the wafer where the charge was measured. The surface charge ( 10 11 q/cm 3 ) response data are shown. (a) Compute the estimates of the effects and their standard errors for this design. (b) Construct two-factor interaction plots and comment on the interaction of the factors. (c) Use the t ratio to determine the significance of each effect with α = 0.05. Comment on your findings. (d) Compute an approximate 95% CI for each effect. Compare your results with those in part (c) and comment. (e) Perform an analysis of variance of the appropriate regression model for this design. Include in your analysis hypothesis tests for each coefficient, as well as residual analysis. State your final conclusions about the adequacy of the model. Compare your results to part (c) and comment. 1.1 a Using Minitab one gets Estimated Effects and Coefficients for Surface charge Term Effect Coef SE Coef T P Constant -1.000 0.4462-2.24 0.055 Method -5.593-2.797 0.4462-6.27 0.000 Position -1.280-0.640 0.4462-1.43 0.189 Method*Position -1.220-0.610 0.4462-1.37 0.209 where effect is the estimated effects and SE Coef is the standard error. 1.2 b From Minitab one can get an interaction plot for method and position. 1

The interaction plot does not indicate a significant interaction between method and position. 1.3 c The t ratios and the corresponding P-value can be found in section 1.1 a at α = 0.05. The P-value for method is the only P-value < α, thus method is the only significant factor at α = 0.05. 1.4 d A 95% confidence interval is given by the effect estimate 2*(se(Effect)), where se(effect) = 2*(se(Coefficient)). The se(coefficient) can be found in 1.1 a, in column SE Coef. se(coefficient) is the same for all the terms, se(effect) = 2*(0.4462) = 0.8924. And 2*(se(Effect)) = 2*0.8924 = 1.7848. The approximate 95% confidence interval on the effect of method is -5.593 ± 1.7848 or (-7.38, - 3.81). The approximate 95% confidence interval on the effect of position is -1.28 ± 1.7848 or (-3.06, 0.505). The approximate 95% confidence interval on the effect of method*position is -1.22 ± 1.7848 or (-3.005, 0.565). 1.5 e The regression analysis by using Minitab becomes The regression equation is Surface charge = - 1.00-2.80 Method - 0.640 Position - 0.610 Method*Position Predictor Coef SE Coef T P Constant -1.0000 0.4462-2.24 0.055 Method -2.7967 0.4462-6.27 0.000 Position -0.6400 0.4462-1.43 0.189 Method*Position -0.6100 0.4462-1.37 0.209 2

If one looks at the regression analysis, the only significant factor appears to be method (P-value < α = 0.05). This result is equivalent to that obtained in part c. Thus, leaving only method, the regression analysis for the final model becomes The regression equation is Surface charge = - 1.00-2.80 Method Predictor Coef SE Coef T P Constant -1.0000 0.4873-2.05 0.067 Method -2.7967 0.4873-5.74 0.000 S = 1.68791 R-Sq = 76.7% R-Sq(adj) = 74.4% Analysis of Variance Source DF SS MS F P Regression 1 93.856 93.856 32.94 0.000 Residual Error 10 28.490 2.849 Total 11 122.347 Unusual Observations Surface Obs Method charge Fit SE Fit Residual St Resid 4 1.00-7.580-3.797 0.689-3.783-2.46R R denotes an observation with a large standardized residual. The analysis of variance indicates that the final regression model is adequate for the set of data. This is clear from the fact that the P-value 0.000. Though it appears that observation 4 has a large standardized residual. Looking at the normal probability plot of the residuals there appears to be deviation from normality. Looking closer at the residuals versus method one can see that the residuals for the high level of method are more spread out than for the low level. 3

4

2 Uppgift 2: 7.14 The data shown here represent a single replicate of a 2 5 design that is used in an experiment to study the compressive strength of concrete. The factors are mix (A), time (B), laboratory (C ), temperature (D), and drying time (E). (1) = 700 e = 800 a = 900 ae = 1200 b = 3400 be = 3500 ab = 5500 abe = 6200 c = 600 ce = 600 ac = 1000 ace = 1200 bc = 3000 bce = 3000 abc = 5300 abce = 5500 d = 1000 de = 1900 ad = 1100 ade = 1500 bd = 3000 bde = 4000 abd = 6100 abde = 6500 cd = 800 cde = 1500 acd = 1100 acde = 2000 bcd = 3300 bcde = 3400 abcd = 6000 abcde = 6800 (a) Estimate the factor effects. (b) Which effects appear important? Use a normal probability plot. (c) Determine an appropriate model and analyze the residuals from this experiment. Comment on the adequacy of the model. (d) If it is desirable to maximize the strength, in which direction would you adjust the process variables? 2.1 a By using Minitab we can estimate the factor effects, as seen below. Estimated Effects and Coefficients for Strength Term Effect Coef Constant 2887.50 A 1462.50 731.25 B 3537.50 1768.75 C -137.50-68.75 D 475.00 237.50 E 425.00 212.50 A*B 1200.00 600.00 A*C 125.00 62.50 A*D 62.50 31.25 A*E 62.50 31.25 B*C -100.00-50.00 B*D -12.50-6.25 B*E -12.50-6.25 C*D 112.50 56.25 5

C*E -62.50-31.25 D*E 225.00 112.50 A*B*C -62.50-31.25 A*B*D 200.00 100.00 A*B*E 50.00 25.00 A*C*D 75.00 37.50 A*C*E 100.00 50.00 A*D*E -87.50-43.75 B*C*D 100.00 50.00 B*C*E -75.00-37.50 B*D*E -62.50-31.25 C*D*E 37.50 18.75 A*B*C*D -12.50-6.25 A*B*C*E 12.50 6.25 A*B*D*E 0.00 0.00 A*C*D*E 150.00 75.00 B*C*D*E -25.00-12.50 A*B*C*D*E 62.50 31.25 2.2 b To find out which effects are important a normal plot of the effects is made by using Minitab. From this plot we can conclude that A, B, D, E, AB, DE and ABD appear to be significant. 2.3 c By using the result in b the regression analysis and final model becomes The regression equation is Strength = 2888 + 731 A + 1769 B + 237 D + 212 E + 600 AB + 113 DE + 100 ABD 6

Predictor Coef SE Coef T P Constant 2887.50 39.16 73.74 0.000 A 731.25 39.16 18.68 0.000 B 1768.75 39.16 45.17 0.000 D 237.50 39.16 6.07 0.000 E 212.50 39.16 5.43 0.000 AB 600.00 39.16 15.32 0.000 DE 112.50 39.16 2.87 0.008 ABD 100.00 39.16 2.55 0.017 S = 221.501 R-Sq = 99.1% R-Sq(adj) = 98.9% Analysis of Variance Source DF SS MS F P Regression 7 132717500 18959643 386.44 0.000 Residual Error 24 1177500 49063 Total 31 133895000 Source DF Seq SS A 1 17111250 B 1 100111250 D 1 1805000 E 1 1445000 AB 1 11520000 DE 1 405000 ABD 1 320000 Obs A Strength Fit SE Fit Residual St Resid 1-1.00 700.0 550.0 110.8 150.0 0.78 2 1.00 900.0 1012.5 110.8-112.5-0.59 3-1.00 3400.0 3087.5 110.8 312.5 1.63 4 1.00 5500.0 5550.0 110.8-50.0-0.26 5-1.00 600.0 550.0 110.8 50.0 0.26 6 1.00 1000.0 1012.5 110.8-12.5-0.07 7-1.00 3000.0 3087.5 110.8-87.5-0.46 8 1.00 5300.0 5550.0 110.8-250.0-1.30 9-1.00 1000.0 1000.0 110.8-0.0-0.00 10 1.00 1100.0 1062.5 110.8 37.5 0.20 11-1.00 3000.0 3137.5 110.8-137.5-0.72 12 1.00 6100.0 6000.0 110.8 100.0 0.52 13-1.00 800.0 1000.0 110.8-200.0-1.04 14 1.00 1100.0 1062.5 110.8 37.5 0.20 15-1.00 3300.0 3137.5 110.8 162.5 0.85 16 1.00 6000.0 6000.0 110.8-0.0-0.00 7

17-1.00 800.0 750.0 110.8 50.0 0.26 18 1.00 1200.0 1212.5 110.8-12.5-0.07 19-1.00 3500.0 3287.5 110.8 212.5 1.11 20 1.00 6200.0 5750.0 110.8 450.0 2.35R 21-1.00 600.0 750.0 110.8-150.0-0.78 22 1.00 1200.0 1212.5 110.8-12.5-0.07 23-1.00 3000.0 3287.5 110.8-287.5-1.50 24 1.00 5500.0 5750.0 110.8-250.0-1.30 25-1.00 1900.0 1650.0 110.8 250.0 1.30 26 1.00 1500.0 1712.5 110.8-212.5-1.11 27-1.00 4000.0 3787.5 110.8 212.5 1.11 28 1.00 6500.0 6650.0 110.8-150.0-0.78 29-1.00 1500.0 1650.0 110.8-150.0-0.78 30 1.00 2000.0 1712.5 110.8 287.5 1.50 31-1.00 3400.0 3787.5 110.8-387.5-2.02R 32 1.00 6800.0 6650.0 110.8 150.0 0.78 R denotes an observation with a large standardized residual. Based on the ANOVA, the P-value < α = 0.05 for the regression model, the model appears to be adequate. Looking at the residuals, there are two observation, 20 and 31, that have a residual which is larger than the rest. Though looking at the normal probability plot the residuals appear to be following a straight line. The residuals also look randomly scattered inn the versus fits plot. Looking at the residuals versus the different significant factors we see that they all look good. 8

9

10

11

2.4 d The regression equation is Strength = 2888 + 731 A + 1769 B + 237 D + 212 E + 600 AB + 113 DE + 100 ABD. To maximize strength, the variables A, B, D, and E should be increased. Variable C is not significant, thus any level of C would be acceptable. 12

3 Uppgift 3: H8.7 An article by J.J. Pignatiello, Jr. and J.S. Ramberg in the Journal of Quality Technology (Vol. 17, 1985, pp. 198-206) describes the use of a replicated fractional factorial to investigate the effect of five factors on the free height of leaf springs used in automotive application. The factors are A = furnace temperature, B = heating time, C = transfer time, D = hold down time, and E = quench oil temperature. The data are shown below: A B C D E Free height - - - - - 7.78 7.78 7.81 + - - + - 8.15 8.18 7.88 - + - + - 7.50 7.56 7.50 + + - - - 7.59 7.56 7.75 - - + + - 7.54 8.00 7.88 + - + - - 7.69 8.09 8.06 - + + - - 7.56 7.52 7.44 + + + + - 7.56 7.81 7.69 - - - - + 7.50 7.25 7.12 + - - + + 7.88 7.88 7.44 - + - + + 7.50 7.56 7.50 + + - - + 7.63 7.75 7.56 - - + + + 7.32 7.44 7.44 + - + - + 7.56 7.69 7.62 - + + - + 7.18 7.18 7.25 + + + + + 7.81 7.50 7.59 (a) Write out the alias stucture of this design. What is the resolution of this design? (b) Analyze the data. What factors influence the mean free height? (c) Calculate the range and standard deviation of the free height for each run. Is there any indication that any of these factors affect variability in the free height? (d) Analyze the residuals from this experiment, and comment on your findings. (e) Is this the best possible design for five factors in 16 runs? Specifically, can you find a fractional design for five factors in 16 runs with a higher resolution than this one? 3.1 a Looking at the table above and using Minitab, one can see that the identity operator, I = ABCD. Thus, the resolution is IV. The alias structure of this design is: 13

3.2 b Alias Structure I + A*B*C*D A + B*C*D B + A*C*D C + A*B*D D + A*B*C E + A*B*C*D*E A*B + C*D A*C + B*D A*D + B*C A*E + B*C*D*E B*E + A*C*D*E C*E + A*B*D*E D*E + A*B*C*E A*B*E + C*D*E A*C*E + B*D*E A*D*E + B*C*E Doing a design of experiment analysis in Minitab leads to Factorial Fit: Mean free height versus A; B; C; D; E Estimated Effects and Coefficients for Mean free height (coded units) Term Effect Coef Constant 7.6256 A 0.2421 0.1210 B -0.1638-0.0819 C -0.0496-0.0248 D 0.0912 0.0456 14

E -0.2387-0.1194 A*B -0.0296-0.0148 A*C 0.0013 0.0006 A*D -0.0229-0.0115 A*E 0.0637 0.0319 B*E 0.1529 0.0765 C*E -0.0329-0.0165 D*E 0.0396 0.0198 A*B*E 0.0021 0.0010 A*C*E 0.0196 0.0098 A*D*E -0.0596-0.0298 S = * PRESS = * Analysis of Variance for Mean free height (coded units) Source DF Seq SS Adj SS Adj MS F P Main Effects 5 0.612820 0.612820 0.122564 * * A 1 0.234417 0.234417 0.234417 * * B 1 0.107256 0.107256 0.107256 * * C 1 0.009834 0.009834 0.009834 * * D 1 0.033306 0.033306 0.033306 * * E 1 0.228006 0.228006 0.228006 * * 2-Way Interactions 7 0.125999 0.125999 0.018000 * * A*B 1 0.003501 0.003501 0.003501 * * A*C 1 0.000006 0.000006 0.000006 * * A*D 1 0.002101 0.002101 0.002101 * * A*E 1 0.016256 0.016256 0.016256 * * B*E 1 0.093534 0.093534 0.093534 * * C*E 1 0.004334 0.004334 0.004334 * * D*E 1 0.006267 0.006267 0.006267 * * 3-Way Interactions 3 0.015752 0.015752 0.005251 * * A*B*E 1 0.000017 0.000017 0.000017 * * A*C*E 1 0.001534 0.001534 0.001534 * * A*D*E 1 0.014201 0.014201 0.014201 * * Residual Error 0 * * * Total 15 0.754572 The normal plot of the effects indicates that the significant factors are A, B, E and the interaction B*E. To study this in more detail all the higher order interactions are excluded except for two-factor interactions. This yields Factorial Fit: Mean free height versus A; B; C; D; E Estimated Effects and Coefficients for Mean free height (coded units) Term Effect Coef SE Coef T P Constant 7.6256 0.01812 420.95 0.000 A 0.2421 0.1210 0.01812 6.68 0.007 15

B -0.1638-0.0819 0.01812-4.52 0.020 C -0.0496-0.0248 0.01812-1.37 0.265 D 0.0912 0.0456 0.01812 2.52 0.086 E -0.2387-0.1194 0.01812-6.59 0.007 A*B -0.0296-0.0148 0.01812-0.82 0.474 A*C 0.0013 0.0006 0.01812 0.03 0.975 A*D -0.0229-0.0115 0.01812-0.63 0.572 A*E 0.0637 0.0319 0.01812 1.76 0.177 B*E 0.1529 0.0765 0.01812 4.22 0.024 C*E -0.0329-0.0165 0.01812-0.91 0.431 D*E 0.0396 0.0198 0.01812 1.09 0.354 S = 0.0724617 PRESS = 4.99482 R-Sq = 97.91% R-Sq(pred) = 0.00% R-Sq(adj) = 89.56% Analysis of Variance for Mean free height (coded units) Source DF Seq SS Adj SS Adj MS F P Main Effects 5 0.612820 0.612820 0.122564 23.34 0.013 A 1 0.234417 0.234417 0.234417 44.65 0.007 B 1 0.107256 0.107256 0.107256 20.43 0.020 C 1 0.009834 0.009834 0.009834 1.87 0.265 D 1 0.033306 0.033306 0.033306 6.34 0.086 E 1 0.228006 0.228006 0.228006 43.42 0.007 2-Way Interactions 7 0.125999 0.125999 0.018000 3.43 0.170 A*B 1 0.003501 0.003501 0.003501 0.67 0.474 A*C 1 0.000006 0.000006 0.000006 0.00 0.975 A*D 1 0.002101 0.002101 0.002101 0.40 0.572 A*E 1 0.016256 0.016256 0.016256 3.10 0.177 B*E 1 0.093534 0.093534 0.093534 17.81 0.024 C*E 1 0.004334 0.004334 0.004334 0.83 0.431 D*E 1 0.006267 0.006267 0.006267 1.19 0.354 Residual Error 3 0.015752 0.015752 0.005251 Total 15 0.754572 Mean free Obs StdOrder height Fit SE Fit Residual St Resid 1 1 7.79000 7.77104 0.06532 0.01896 0.60 2 2 8.07000 8.02938 0.06532 0.04062 1.29 3 3 7.52000 7.55854 0.06532-0.03854-1.23 4 4 7.63333 7.65437 0.06532-0.02104-0.67 5 5 7.80667 7.82771 0.06532-0.02104-0.67 6 6 7.94667 7.98521 0.06532-0.03854-1.23 7 7 7.50667 7.46604 0.06532 0.04063 1.29 8 8 7.68667 7.66771 0.06532 0.01896 0.60 9 9 7.29000 7.30896 0.06532-0.01896-0.60 10 10 7.73333 7.77396 0.06532-0.04062-1.29 16

11 11 7.52000 7.48146 0.06532 0.03854 1.23 12 12 7.64667 7.62562 0.06532 0.02104 0.67 13 13 7.40000 7.37896 0.06532 0.02104 0.67 14 14 7.62333 7.58479 0.06532 0.03854 1.23 15 15 7.20333 7.24396 0.06532-0.04062-1.29 16 16 7.63333 7.65229 0.06532-0.01896-0.60 Looking at the P-values, where α = 0.05, for the T-test (or F-test, since the P-values conclude the same for both tests), one can see that the P-values for A, B, E and B*E are below α, thus they are indeed significant and they do influence the mean free height. The free height is thus free height = 7.6256 + 0.121*A - 0.082*B - 0.1194*E + 0.0765*B*E, which equals free height = 7.6256 + 0.121*furnace temperature - 0.082*heating time - 0.1194*quench oil time + 0.0765*heating time*quench oil time. The versus fits plot for the residuals looks good and the normal probability plot is alright. The value of the residuals are small and nothing is out of the ordinary. 3.3 c The range and standard deviation were calculated by the use of Minitab, the result is in the table below. 17

Range Standard deviation 0.03 0.017321 0.03 0.165227 0.06 0.034641 0.19 0.102144 0.46 0.238607 0.40 0.222785 0.12 0.061101 0.25 0.125033 0.38 0.193132 0.44 0.254034 0.06 0.034641 0.19 0.096090 0.12 0.069282 0.13 0.065064 0.07 0.040415 0.31 0.159478 Constructing a DOE for the range by using Minitab yields Term Effect Coef Constant 0.21937 A 0.11375 0.05688 B -0.12625-0.06312 C 0.02625 0.01313 D 0.06125 0.03062 E -0.01375-0.00687 A*B 0.04375 0.02188 A*C -0.03375-0.01688 A*D 0.03625 0.01812 A*E -0.00375-0.00188 B*E 0.01625 0.00812 18

C*E -0.13625-0.06812 D*E -0.02125-0.01063 A*B*E 0.03125 0.01562 A*C*E 0.04875 0.02437 A*D*E 0.13875 0.06937 S = * PRESS = * Analysis of Variance for Range (coded units) Source DF Seq SS Adj SS Adj MS F P Main Effects 5 0.134031 0.134031 0.0268063 * * A 1 0.051756 0.051756 0.0517563 * * B 1 0.063756 0.063756 0.0637562 * * C 1 0.002756 0.002756 0.0027563 * * D 1 0.015006 0.015006 0.0150062 * * E 1 0.000756 0.000756 0.0007562 * * 2-Way Interactions 7 0.094644 0.094644 0.0135205 * * A*B 1 0.007656 0.007656 0.0076563 * * A*C 1 0.004556 0.004556 0.0045563 * * A*D 1 0.005256 0.005256 0.0052562 * * A*E 1 0.000056 0.000056 0.0000563 * * B*E 1 0.001056 0.001056 0.0010562 * * C*E 1 0.074256 0.074256 0.0742562 * * D*E 1 0.001806 0.001806 0.0018063 * * 3-Way Interactions 3 0.090419 0.090419 0.0301396 * * A*B*E 1 0.003906 0.003906 0.0039062 * * A*C*E 1 0.009506 0.009506 0.0095062 * * A*D*E 1 0.077006 0.077006 0.0770062 * * Residual Error 0 * * * Total 15 0.319094 Looking at the normal plot of the effects it indicates that the interactions C*E and A*D*E are significant. Since A*D*E is aliased with B*C*E, B*C*E was included in the analysis, this leads to Factorial Fit: Range versus A; B; C; E Estimated Effects and Coefficients for Range (coded units) Term Effect Coef SE Coef T P Constant 0.21937 0.01948 11.26 0.000 A 0.11375 0.05688 0.01948 2.92 0.022 B -0.12625-0.06312 0.01948-3.24 0.014 C 0.02625 0.01313 0.01948 0.67 0.522 E -0.01375-0.00687 0.01948-0.35 0.735 B*C 0.03625 0.01812 0.01948 0.93 0.383 B*E 0.01625 0.00812 0.01948 0.42 0.689 C*E -0.13625-0.06812 0.01948-3.50 0.010 19

B*C*E 0.13875 0.06937 0.01948 3.56 0.009 S = 0.0779136 PRESS = 0.222008 R-Sq = 86.68% R-Sq(pred) = 30.43% R-Sq(adj) = 71.46% Analysis of Variance for Range (coded units) Source DF Seq SS Adj SS Adj MS F P Main Effects 4 0.119025 0.119025 0.0297563 4.90 0.033 A 1 0.051756 0.051756 0.0517563 8.53 0.022 B 1 0.063756 0.063756 0.0637562 10.50 0.014 C 1 0.002756 0.002756 0.0027563 0.45 0.522 E 1 0.000756 0.000756 0.0007562 0.12 0.735 2-Way Interactions 3 0.080569 0.080569 0.0268562 4.42 0.048 B*C 1 0.005256 0.005256 0.0052562 0.87 0.383 B*E 1 0.001056 0.001056 0.0010562 0.17 0.689 C*E 1 0.074256 0.074256 0.0742562 12.23 0.010 3-Way Interactions 1 0.077006 0.077006 0.0770062 12.69 0.009 B*C*E 1 0.077006 0.077006 0.0770062 12.69 0.009 Residual Error 7 0.042494 0.042494 0.0060705 Total 15 0.319094 Obs StdOrder Range Fit SE Fit Residual St Resid 1 1 0.030000 0.108125 0.058435-0.078125-1.52 2 2 0.300000 0.221875 0.058435 0.078125 1.52 3 3 0.060000 0.068125 0.058435-0.008125-0.16 4 4 0.190000 0.181875 0.058435 0.008125 0.16 5 5 0.460000 0.373125 0.058435 0.086875 1.69 6 6 0.400000 0.486875 0.058435-0.086875-1.69 7 7 0.120000 0.128125 0.058435-0.008125-0.16 8 8 0.250000 0.241875 0.058435 0.008125 0.16 9 9 0.380000 0.353125 0.058435 0.026875 0.52 10 10 0.440000 0.466875 0.058435-0.026875-0.52 11 11 0.060000 0.068125 0.058435-0.008125-0.16 12 12 0.190000 0.181875 0.058435 0.008125 0.16 13 13 0.120000 0.068125 0.058435 0.051875 1.01 14 14 0.130000 0.181875 0.058435-0.051875-1.01 15 15 0.070000 0.133125 0.058435-0.063125-1.22 16 16 0.310000 0.246875 0.058435 0.063125 1.22 20

Looking at the P-values in the ANOVA table for the main effects and interactions one can see that the two- and three-way interactions are significant, thus all of the main effects will be significant as well, even though the P- values for C and E are not in agreement. The range thus becomes range = 0.21937 + 0.05688*A - 0.06312*B + 0.01313*C - 0.00687*E + 0.01812*B*C + 0.00812*B*E - 0.06812*C*E + 0.06937*B*C*E, which equals range = 0.21937 + 0.05688*furnace temperature - 0.06312*heating time + 0.01313*transfer time - 0.00687*quench oil temperature + 0.01812*heating time*transfer time + 0.00812*heating time*quench oil temperature - 0.06812*transfer time*quench oil temperature + 0.06937*heating time*transfer time*quench oil temperature. The residuals in the table are small and both the normal probability plot and the versus fits for the residuals look good. Constructing a DOE for the standard deviation by using Minitab yields Factorial Fit: Standard deviation versus A; B; C; D; E 21

Estimated Effects and Coefficients for Standard deviation (coded units) Term Effect Coef Constant 0.11744 A 0.06259 0.03129 B -0.07149-0.03574 C 0.01057 0.00528 D 0.03536 0.01768 E -0.00684-0.00342 A*B 0.01540 0.00770 A*C -0.02185-0.01093 A*D 0.01906 0.00953 A*E -0.00329-0.00165 B*E 0.00877 0.00438 C*E -0.07148-0.03574 D*E -0.00468-0.00234 A*B*E 0.01556 0.00778 A*C*E 0.01997 0.00999 A*D*E 0.07643 0.03822 S = * PRESS = * Analysis of Variance for Standard deviation (coded units) Source DF Seq SS Adj SS Adj MS F P Main Effects 5 0.0417479 0.0417479 0.0083496 * * A 1 0.0156698 0.0156698 0.0156698 * * B 1 0.0204425 0.0204425 0.0204425 * * C 1 0.0004466 0.0004466 0.0004466 * * D 1 0.0050018 0.0050018 0.0050018 * * E 1 0.0001872 0.0001872 0.0001872 * * 2-Way Interactions 7 0.0251881 0.0251881 0.0035983 * * A*B 1 0.0009483 0.0009483 0.0009483 * * A*C 1 0.0019098 0.0019098 0.0019098 * * A*D 1 0.0014533 0.0014533 0.0014533 * * A*E 1 0.0000433 0.0000433 0.0000433 * * B*E 1 0.0003074 0.0003074 0.0003074 * * C*E 1 0.0204385 0.0204385 0.0204385 * * D*E 1 0.0000875 0.0000875 0.0000875 * * 3-Way Interactions 3 0.0259333 0.0259333 0.0086444 * * A*B*E 1 0.0009684 0.0009684 0.0009684 * * A*C*E 1 0.0015959 0.0015959 0.0015959 * * A*D*E 1 0.0233690 0.0233690 0.0233690 * * Residual Error 0 * * * Total 15 0.0928693 22

Standard SE St Obs StdOrder deviation Fit Fit Residual Resid 1 1 0.017321 0.017321 * 0.000000 * 2 2 0.165227 0.165227 * 0.000000 * 3 3 0.034641 0.034641 * -0.000000 * 4 4 0.102144 0.102144 * 0.000000 * 5 5 0.238607 0.238607 * -0.000000 * 6 6 0.222785 0.222785 * 0.000000 * 7 7 0.061101 0.061101 * -0.000000 * 8 8 0.125033 0.125033 * -0.000000 * 9 9 0.193132 0.193132 * 0.000000 * 10 10 0.254034 0.254034 * 0.000000 * 11 11 0.034641 0.034641 * 0.000000 * 12 12 0.096090 0.096090 * -0.000000 * 13 13 0.069282 0.069282 * 0.000000 * 14 14 0.065064 0.065064 * 0.000000 * 15 15 0.040415 0.040415 * -0.000000 * 16 16 0.159478 0.159478 * 0.000000 * Looking at the normal plot of the effects it indicates that the factors A, B and the interactions C*E and A*D*E are significant. Since A*D*E is aliased with B*C*E, B*C*E was included in the analysis, this leads to Factorial Fit: Standard deviation versus A; B; C; E Estimated Effects and Coefficients for Standard deviation (coded units) Term Effect Coef SE Coef T P Constant 0.11744 0.009708 12.10 0.000 A 0.06259 0.03129 0.009708 3.22 0.015 B -0.07149-0.03574 0.009708-3.68 0.008 C 0.01057 0.00528 0.009708 0.54 0.603 E -0.00684-0.00342 0.009708-0.35 0.735 B*C 0.01906 0.00953 0.009708 0.98 0.359 B*E 0.00877 0.00438 0.009708 0.45 0.665 C*E -0.07148-0.03574 0.009708-3.68 0.008 B*C*E 0.07643 0.03822 0.009708 3.94 0.006 S = 0.0388311 PRESS = 0.0551445 R-Sq = 88.63% R-Sq(pred) = 40.62% R-Sq(adj) = 75.65% Analysis of Variance for Standard deviation (coded units) Source DF Seq SS Adj SS Adj MS F P Main Effects 4 0.0367462 0.0367462 0.0091865 6.09 0.020 A 1 0.0156698 0.0156698 0.0156698 10.39 0.015 B 1 0.0204425 0.0204425 0.0204425 13.56 0.008 C 1 0.0004466 0.0004466 0.0004466 0.30 0.603 E 1 0.0001872 0.0001872 0.0001872 0.12 0.735 23

2-Way Interactions 3 0.0221992 0.0221992 0.0073997 4.91 0.038 B*C 1 0.0014533 0.0014533 0.0014533 0.96 0.359 B*E 1 0.0003074 0.0003074 0.0003074 0.20 0.665 C*E 1 0.0204385 0.0204385 0.0204385 13.55 0.008 3-Way Interactions 1 0.0233690 0.0233690 0.0233690 15.50 0.006 B*C*E 1 0.0233690 0.0233690 0.0233690 15.50 0.006 Residual Error 7 0.0105550 0.0105550 0.0015079 Total 15 0.0928693 Standard Obs StdOrder deviation Fit SE Fit Residual St Resid 1 1 0.017321 0.059979 0.029123-0.042658-1.66 2 2 0.165227 0.122569 0.029123 0.042658 1.66 3 3 0.034641 0.037098 0.029123-0.002457-0.10 4 4 0.102144 0.099687 0.029123 0.002457 0.10 5 5 0.238607 0.199401 0.029123 0.039206 1.53 6 6 0.222785 0.261991 0.029123-0.039206-1.53 7 7 0.061101 0.061772 0.029123-0.000671-0.03 8 8 0.125033 0.124362 0.029123 0.000671 0.03 9 9 0.193132 0.192288 0.029123 0.000844 0.03 10 10 0.254034 0.254878 0.029123-0.000844-0.03 11 11 0.034641 0.034071 0.029123 0.000570 0.02 12 12 0.096090 0.096660 0.029123-0.000570-0.02 13 13 0.069282 0.035878 0.029123 0.033404 1.30 14 14 0.065064 0.098468 0.029123-0.033404-1.30 15 15 0.040415 0.068652 0.029123-0.028237-1.10 16 16 0.159478 0.131241 0.029123 0.028237 1.10 Looking at the P-values in the ANOVA table for the main effects and interactions one can see that the two- and three-way interactions are significant, thus all of the main effects will be significant as well, even though the P- 24

values for C and E are not in agreement. The standard deviation thus becomes standard deviation = 0.11744 + 0.03129*A - 0.03574*B + 0.00528*C - 0.00342*E + 0.00953*B*C + 0.00438*B*E - 0.03574*C*E + 0.03822*B*C*E, which equals standard deviation = 0.11744 + 0.03129 *furnace temperature - 0.03574*heating time + 0.00528*transfer time - 0.00342*quench oil temperature + 0.00953*heating time*transfer time + 0.00438*heating time*quench oil temperature - 0.03574*transfer time*quench oil temperature + 0.03822*heating time*transfer time*quench oil temperature. The residuals in the table are small and the versus fits for the residuals look good. The normal probability plot does not appear to be satisfactory. 3.4 d Some of the residuals have already been discussed above, the rest, for the free height, follow here 25

26

27

The normal probability plot and the versus fits plot for the free height both look good. All the residuals in the plots for the different factors look good too. 3.5 e This is not the best possible design for five factors in 16 runs. It is possible to construct a resolution V design by setting the generator equal to the highest order interaction, ABCDE, thus the design would have a higher resolution. 28

4 Uppgift 4: 7.40 In their book, Empirical Model Building and Response Surfaces (Hoboken, NJ: John Wiley Sons, 1987), G. E. P. Box and N. R. Draper describe an experiment with three factors. The data shown in the following table are a variation of the original experiment on page 247 of their book. Suppose that these data were collected in a semiconductor manufacturing process. x 1 x 2 x 3 y 1 y 2-1 -1-1 24.00 12.49 0-1 -1 120.33 8.39 1-1 -1 213.67 42.83-1 0-1 86.00 3.46 0 0-1 136.63 80.41 1 0-1 340.67 16.17-1 1-1 112.33 27.57 0 1-1 256.33 4.62 1 1-1 271.67 23.63-1 -1 0 81.00 0.00 0-1 0 101.67 17.67 1-1 0 357.00 32.91-1 0 0 171.33 15.01 0 0 0 372.00 0.00 1 0 0 501.67 92.50-1 1 0 264.00 63.50 0 1 0 427.00 88.61 1 1 0 730.67 21.08-1 -1 1 220.67 133.82 0-1 1 239.67 23.46 1-1 1 422.00 18.52-1 0 1 199.00 29.44 0 0 1 485.33 44.67 1 0 1 673.67 158.21-1 1 1 176.67 55.51 0 1 1 501.00 138.94 1 1 1 1010.00 142.45 (a) The response y 1 is the average of three readings on resistivity for a single wafer. Fit a quadratic model to this response. (b) The response y 2 is the standard deviation of the three resistivity measurements. Fit a linear model to this response. (c) Where would you recommend we set x 1, x 2, and x 3 if the objective is to hold mean resistivity at 500 and minimize standard deviation? 4.1 a By using Minitab a quadratic model was fitted to y 1, as seen below. Estimated Regression Coefficients for y1 Term Coef SE Coef T P 29

Constant 327.62 38.76 8.453 0.000 x1 177.00 17.94 9.866 0.000 x2 109.43 17.94 6.099 0.000 x3 131.47 17.94 7.328 0.000 x1*x1 32.01 31.08 1.030 0.317 x2*x2-22.38 31.08-0.720 0.481 x3*x3-29.06 31.08-0.935 0.363 x1*x2 66.03 21.97 3.005 0.008 x1*x3 75.47 21.97 3.435 0.003 x2*x3 43.58 21.97 1.983 0.064 S = 76.1182 PRESS = 337786 R-Sq = 92.69% R-Sq(pred) = 74.92% R-Sq(adj) = 88.81% Analysis of Variance for y1 Source DF Seq SS Adj SS Adj MS F P Regression 9 1248237 1248237 138693 23.94 0.000 Linear 3 1090558 1090558 363519 62.74 0.000 x1 1 563929 563929 563929 97.33 0.000 x2 1 215531 215531 215531 37.20 0.000 x3 1 311097 311097 311097 53.69 0.000 Square 3 14219 14219 4740 0.82 0.502 x1*x1 1 6146 6146 6146 1.06 0.317 x2*x2 1 3006 3006 3006 0.52 0.481 x3*x3 1 5066 5066 5066 0.87 0.363 Interaction 3 143461 143461 47820 8.25 0.001 x1*x2 1 52317 52317 52317 9.03 0.008 x1*x3 1 68350 68350 68350 11.80 0.003 x2*x3 1 22794 22794 22794 3.93 0.064 Residual Error 17 98498 98498 5794 Total 26 1346735 Obs StdOrder y1 Fit SE Fit Residual St Resid 1 1 24.000 75.377 54.320-51.377-0.96 2 2 120.330 78.874 44.553 41.456 0.67 3 3 213.670 146.381 54.320 67.289 1.26 4 4 86.000 97.576 44.553-11.576-0.19 5 5 136.630 167.100 38.758-30.470-0.47 6 6 340.670 300.636 44.553 40.034 0.65 7 7 112.330 75.005 54.320 37.325 0.70 8 8 256.330 210.558 44.553 45.772 0.74 9 9 271.670 410.122 54.320-138.452-2.60 R 10 10 81.000 116.846 44.553-35.846-0.58 11 11 101.670 195.814 38.758-94.144-1.44 12 12 357.000 338.792 44.553 18.208 0.30 13 13 171.330 182.628 38.758-11.298-0.17 30

14 14 372.000 327.624 38.758 44.376 0.68 15 15 501.670 536.630 38.758-34.960-0.53 16 16 264.000 203.641 44.553 60.359 0.98 17 17 427.000 414.665 38.758 12.335 0.19 18 18 730.670 689.700 44.553 40.970 0.66 19 19 220.670 100.200 54.320 120.470 2.26 R 20 20 239.670 254.638 44.553-14.968-0.24 21 21 422.000 473.087 54.320-51.087-0.96 22 22 199.000 209.565 44.553-10.565-0.17 23 23 485.330 430.031 38.758 55.299 0.84 24 24 673.670 714.509 44.553-40.839-0.66 25 25 176.670 274.161 54.320-97.491-1.83 26 26 501.000 560.656 44.553-59.656-0.97 27 27 1010.000 911.162 54.320 98.838 1.85 R denotes an observation with a large standardized residual. Estimated Regression Coefficients for y1 using data in uncoded units Term Coef Constant 327.624 x1 177.001 x2 109.426 x3 131.466 x1*x1 32.0056 x2*x2-22.3844 x3*x3-29.0578 x1*x2 66.0283 x1*x3 75.4708 x2*x3 43.5833 In both the estimated regression and ANOVA we find that the P-value < α = 0.05 for the linear terms and interaction terms, they appear to be significant. The P-value < α = 0.05 for x 1, x 2, x 3, x 1 x 2 and x 1 x 3, thus they are significant for the model. The P-value for x 2 x 3 is 0.064, which is really close to α, but I m still putting it as insignificant. The quadratic model for y 1 is y 1 = 327.624+177.001 x 1 +109.426 x 2 +131.466 x 3 +66.0283 x 1 x 2 +75.4708 x 1 x 3. Looking at the normal probability plot and the versus fits plot of the residuals, they both look good, the residuals are all scattered in the versus fits plot and they re following a somewhat straight line in the normal proability plot. Observations 9 and 19 have larger residuals than the rest. 31

4.2 b By using Minitab a linear model was fitted to y 2, as seen below. Estimated Regression Coefficients for y2 Term Coef SE Coef T P Constant 48.00 7.808 6.147 0.000 x1 11.53 9.563 1.205 0.240 x2 15.32 9.563 1.602 0.123 x3 29.19 9.563 3.052 0.006 S = 40.5739 PRESS = 53048.3 R-Sq = 36.71% R-Sq(pred) = 11.32% R-Sq(adj) = 28.45% Analysis of Variance for y2 Source DF Seq SS Adj SS Adj MS F P Regression 3 21957.3 21957.3 7319.1 4.45 0.013 Linear 3 21957.3 21957.3 7319.1 4.45 0.013 x1 1 2392.0 2392.0 2392.0 1.45 0.240 x2 1 4226.5 4226.5 4226.5 2.57 0.123 x3 1 15338.8 15338.8 15338.8 9.32 0.006 Residual Error 23 37863.6 37863.6 1646.2 Total 26 59820.9 Obs StdOrder y2 Fit SE Fit Residual St Resid 1 1 12.490-8.048 18.312 20.538 0.57 2 2 8.390 3.480 15.617 4.910 0.13 3 3 42.830 15.008 18.312 27.822 0.77 32

4 4 3.460 7.276 15.617-3.816-0.10 5 5 80.410 18.804 12.346 61.606 1.59 6 6 16.170 30.331 15.617-14.161-0.38 7 7 27.570 22.599 18.312 4.971 0.14 8 8 4.620 34.127 15.617-29.507-0.79 9 9 23.630 45.655 18.312-22.025-0.61 10 10 0.000 21.144 15.617-21.144-0.56 11 11 17.670 32.672 12.346-15.002-0.39 12 12 32.910 44.200 15.617-11.290-0.30 13 13 15.010 36.467 12.346-21.457-0.56 14 14 0.000 47.995 7.808-47.995-1.21 15 15 92.500 59.523 12.346 32.977 0.85 16 16 63.500 51.791 15.617 11.709 0.31 17 17 88.610 63.319 12.346 25.291 0.65 18 18 21.080 74.846 15.617-53.766-1.44 19 19 133.820 50.336 18.312 83.484 2.31 R 20 20 23.460 61.864 15.617-38.404-1.03 21 21 18.520 73.391 18.312-54.871-1.52 22 22 29.440 65.659 15.617-36.219-0.97 23 23 44.670 77.187 12.346-32.517-0.84 24 24 158.210 88.715 15.617 69.495 1.86 25 25 55.510 80.982 18.312-25.472-0.70 26 26 138.940 92.510 15.617 46.430 1.24 27 27 142.450 104.038 18.312 38.412 1.06 R denotes an observation with a large standardized residual. Estimated Regression Coefficients for y2 using data in uncoded units Term Coef Constant 47.9952 x1 11.5278 x2 15.3233 x3 29.1917 In both the estimated regression and ANOVA we find that the P-value < α = 0.05 for the linear terms, they appear to be significant. The P-value < α = 0.05 for x 3, thus it is significant for the model. The linear model for y 2 is y 2 = 47.9952 + 29.1917 x 3. Looking at the versus fits plot of the residuals, it looka good, the residuals are all scattered in the versus fits plot. Observations 19 has a larger residual than the rest. The residuals in the normal probability plot seem to follow a slight curvature instead of a straight line. 33

4.3 c The formula for the standard deviation is y 2 = 47.9952 + 29.1917 x 3, where 1 < x 3 < 1. To minimize the standard deviation we pick x 3 = 1. A contour plot was made of y 2 vs x 2 and x 3. From this contour plot we see that y 2 has the smallest area to the bottom left. Since x 3 = 1 I choose x 2 = 0, so x 2 and x 3 are in the smallest area, with a minimized standard deviation. To attain the desired level of 500 for the mean, the quadratic equation found in part a will be solved with y = 500, x 2 = 0 and x 3 = 1 to find x 1. We have 500 = 327.624 + 177.001 x 1 131.466 75.4708 x 1 which leads to an x 1 3 in this particular situation (x 2 = 0, x 3 = 1, y = 500). 34

5 Uppgift 5: H12.1 An article in Industrial Quality Control (1956, pp. 5-8) describes an experiment to investigate the effect of glass type and phosphor type on the brightness of a television tube. The response measured is the current necessary (in microamps) to obtain a specified brightness level. The data are shown here. Analyze the data and draw conclusions. 5.1 Analysis Phosphor Type Glass Type 1 2 3 1 280 300 290 290 310 285 285 295 290 2 230 260 220 235 240 225 240 235 230 To check if the factors (glass type, phosphor type) influence the brightness level a Two-way ANOVA is made, where we look at the equality of row treatment effects (glass type) { H 0 : τ 1 = τ 2 = 0 H 1 : τ i 0 for at least one i and the equality of column treatment effects (phosphor type) { H 0 : β 1 = β 2 = β 3 = 0 H 1 : β i 0 for at least one i. We also want to know if the row and column treatments interact { H 0 : (τβ) ij = 0 H 1 : (τβ) ij 0 for at least one ij. The null hypothesis, H 0, is rejected if the P-value < α, here α = 0.05. Inserting the data in Minitab and doing a Two-way ANOVA yields: 35

Two-way ANOVA: Brightness level versus Glass Type; Phosphor Type Source DF SS MS F P Glass Type 1 14450.0 14450.0 273.79 0.000 Phosphor Type 2 933.3 466.7 8.84 0.004 Interaction 2 133.3 66.7 1.26 0.318 Error 12 633.3 52.8 Total 17 16150.0 S = 7.265 R-Sq = 96.08% R-Sq(adj) = 94.44% Individual 95% CIs For Mean Based on Glass Pooled StDev Type Mean -----+---------+---------+---------+---- 1 291.667 (--*-) 2 235.000 (--*-) -----+---------+---------+---------+---- 240 260 280 300 Individual 95% CIs For Mean Based on Phosphor Pooled StDev Type Mean -------+---------+---------+---------+-- 1 260.000 (-------*-------) 2 273.333 (-------*-------) 3 256.667 (-------*-------) -------+---------+---------+---------+-- 256.0 264.0 272.0 280.0 Phosphor Type Glass Type Brightness level RESI1 1 1 280-5.0000 1 1 290 5.0000 1 1 285 0.0000 1 2 230-5.0000 1 2 235 0.0000 1 2 240 5.0000 2 1 300-1.6667 2 1 310 8.3333 2 1 295-6.6667 2 2 260 15.0000 2 2 240-5.0000 2 2 235-10.0000 3 1 290 1.6667 3 1 285-3.3333 3 1 290 1.6667 3 2 220-5.0000 3 2 225 0.0000 3 2 230 5.0000 36

Residual Frequency Residual Percent Residual Residual Plots for Brightness level 99 Normal Probability Plot Versus Fits 90 10 50 0 10 1-10 0 Residual 10-10 220 240 260 Fitted Value 280 300 Histogram Versus Order 4 3 10 2 0 1 0-10 -5 0 5 Residual 10 15-10 2 4 6 8 10 12 Observation Order 14 16 18 Residuals Versus Glass Type (response is Brightness level) 15 10 5 0-5 -10 1.0 1.2 1.4 1.6 Glass Type 1.8 2.0 37

Residual Residuals Versus Phosphor Type (response is Brightness level) 15 10 5 0-5 -10 1.0 1.5 2.0 Phosphor Type 2.5 3.0 38

We find that the P-value for glass type is 0, which is less than α = 0.05, the null hypothesis can be rejected, thus glass type is significant and affects the brightness level. The P-value for phosphor type is 0.004, which is less than α = 0.05, the null hypothesis can be rejected, thus phosphor type is significant and affects the brightness level. The P-value for the interaction between the glass- and phosphor type is 0.318, which is bigger than α = 0.05, thus the null hypothesis cannot be rejected and the interaction is insignificant and there is no effect from the interaction on the brightness level. The normal probability plot for the brightness level looks alright, except for one of the residuals of phosphor- and glass type 2, where the residual has the value of 15, which can be seen in the column RESI1. It stands out a bit compared to the others, but most of the residuals follow a straight line, which indicates that the model seems to fit the data well. The coefficient of determination, R 2 = 0.9608, which is very high, meaning that the model is well fitted to the data material. The versus fits plot doesn t show any particular pattern for the residuals. There might be a slight tendency for the variance of the residuals to increase. Looking at the plots of the residuals versus glass- and phosphor type respectively, one can see that the two plots both have a slight inequality of variance. And the combination of glass type 2 and phosphor type 2 might have a larger variance than the other combinations. 39