10.0 REPLICATED FULL FACTORIAL DESIGN

Size: px
Start display at page:

Download "10.0 REPLICATED FULL FACTORIAL DESIGN"

Transcription

1 10.0 REPLICATED FULL FACTORIAL DESIGN (Updated Spring, 001) Pilot Plant Example ( 3 ), resp - Chemical Yield% Lo(-1) Hi(+1) Temperature 160 o 180 o C Concentration 10% 40% Catalyst A B Test# Temp Conc Catalyst Yield x 1 x x 3 y i1 y i y X Concentration 46, 44 79, ,58 69, 67 X 3 Catalyst 50, 54 81, 85-59, 61 74, X 1 Temperature Standard Order 1st variable column (-1,+1,-1...) Alternates every 0 value nd variable column (-1,-1,+,+,-,-,+,+) Alternates every 1 values 3rd variable column (-,-,-,-,+,+,+,+,-,-,-,-,+,+,+,+,...) Alternates every values 59

2 k th variable - alternates every k-1 values Underlying Model y= + b 0 + b 1 x 1 + b x + b 3 x 3 b 1 x 1 x + b 13 x 1 x 3 + b 3 x x 3 + b 13 x 1 x x 3 + ε Avg. of all responses at high level of x = = Avg. of all responses at low level for x = = E 1 = =3.0 Geometrical interpret E 1 = Number of + Signs = 8 E 1 = E 1 = = y s (We will work with these) Calculation Matrix Test I Y i

3 Avg = E 1 = E 1 = = = = E 13 = = Summary Avg = 64.5 E 1 =1.5 E 1 = 3.0 E 13 =10.0 E = -5.0 E 3 = 0.0 E 3 = 1.5 E 13 =0.5 Which of these are important? Which of the effects are distinguishable from the noise in the experimental environment? Lets say we run experiment numerous times. Sample effects, E s, are distributed normally. E = Linear Combination of y s We will look at three ways to figure out the important effects Replication of experiment Normal Prob. Plot Make an assumption about higher - order effects being negligible. Replication -> Important Effects Our experiment is replicated, calculate a sample variance for each test. 61

4 Test y i1 y i y S y Let s assume σ y is constant for all the tests. Calculate pooled sample variance estimate of σ y Σs P S P = = = 8 Pilot Plant Example: Repl. 3 Factorial Test s p x 1 x x 3 y i1 y i y

5 Calculated effects: E i is an estimate of Avg = 64.5 E 1 =1.5 E 1 = 3.0 E 13 =10.0 E = -5.0 E 3 = 0.0 E 3 = 1.5 E 13 =0.5 µ E S P = m i = 1 s i m where m= k, n is the number of trials for each test condition, N is total number of tests, and N=mn Under H o : ( µ E1 = µ E = = 0) The E s have arisen from a normal distribution centered at 0. We may, based on each calculated effect, develop a confidence interval for the true mean effect E i + t s eff -> How do we estimate σ eff First let s develop a confidence interval for the true average Avg in our case m n i = 1 j = 1 y ij = = 64.5 mn Var (Avg) = Var y + y + + y N = Var N [ y + y + + y] Nσ y = = > as expected N N σ y 63

6 8 = = = 0.5 N 16 s S p avg ---- s avg = ( 1 α )% Confidence interval for average (95% C.I.) is Avg ± t αs avg = 64.5 ± t 8, s avg = 64.5 ± (.306) ( 0.707) ν 1, --- Since 0 is not on C.I., reject H o that = 0 µ avg Confidence Intervals for Other Effects We calculate effects, generally speaking by: E = y (+) - y (-) Each based on N/ observations (8 for our example) Var[E] = Var [y (+)] + Var [y (-)] y+ y + + y Var[y (+)] = Var N 4 = Var [y+y+...+y] = N N ---σ y = ---σ N y N Var [E] = ---σ N y + ---σ N y For our example, 4 s eff = 4 σ eff = ---σ N y 4 ---s N P = = 16 64

7 Total number of trials = 16, s eff = Sometimes called the Std. error of an effect 100 ( 1 α) % confidence interval Avg ± t α ν 1 95% confidence interval is E i + (.306) (1.414) E i % confidence interval for each effect E 1 = * E 1 = E = * E 13 = * E 3 = E 3 = E 13 = , s avg does not lie on C.I. for E 1, E, & E 13. The calculated (sample) effects E 1, E, & E 13 have arisen from a normal distribution not centered at 0. Interpreting Significant Effects E 1 = 3.0 Collapse or average the design in X and X 3 directions 65

8 . y Average y for all (+) levels of X E 1 = X 1 E = Collapse or average the design in X 1 and X 3 directions y Average y for all (-) levels of X E = X E 13 = 10.0 Collapse or average the design in X direction. 66

9 X Since X 1 is involved in interaction, much care must be used in interpreting the main effect, E X 1 y +X X X y X 3 +X X Mathematical Model For 3 factorial design pilot plant example we tacitly assume that response can be characterized as: y = b 0 + b 1 x 1 + b x + b 3 x 3 + b 1 x 1 x + b 13 x 1 x 3 + b 3 x x 3 + b 13 x 1 x x 3 + ε We ran tests and fit this equation to the data: ŷ = bˆ 0 + bˆ 1x 1 + bˆ x + bˆ 3x 3 + bˆ 1 x 1 x + bˆ 13 x 1 x 3 + bˆ 3 x x 3 + bˆ 13 x 1 x x 3 67

10 We now know that only bˆ 0, bˆ 1, bˆ & bˆ 13 are important. The fitted model becomes: ŷ = bˆ 0 + bˆ 1x 1 + bˆ x + bˆ 13 x 1 x 3 ŷ = E 1 Avg E x E 13 x x1 x 3 ŷ = x x +5x 1 x 3 For each unique combination of x 1, x, & x 3 a predicted response may be calculated. ŷ = (-1) -.5 (-1) +5(-1) (-1) Test# x 1 x x 3 y ij y i ŷ i e ij = y ij - ŷ We must examine the model residuals to search for model inadequacies. Plots: e ij vs. run order e vs. y e ij vs. x 1, x,... (y ij - y i ) vs. y i Using the Model for Process Improvement ŷ = x x +5x 1 x 3 What values of x 1, x, & x 3 maximize predicted yield? How does the response behave as a function of x 1, x, & x 3? What is the shape of the response surface? 68

11 x 1 = 1 Temp -- ( Temp HI + Temp LO ) ( Temp HI Temp LO ) Temp MidTemp x 1 = = UnitChangeInTemp Temp When Temp = x 1 = = When x 1 = -0. Temp = > Temp = 170 +(-.) (10) = Only can use this transformation for quantitative variables. Recall that: Low (-1) High(+1) x 1 Temp x Conc x 3 Catalyst A B Based on other criteria, catalyst set to A, i.e., x 3 = -1 So, ŷ = x x +5x 1 (-1) or ŷ = x x The presence of interactions warp the plane. Remember, 3 ways to figure out the significant effects: 1. Use replication to get (estimate of ) (estimate of ) s p. Assume that higher order interactions are negligible (estimate of σ eff ) 3. Normal probability plot. σ y s eff s eff σ eff 69

12 Higher Order Interactions Negligible Study surface finish (Ra value) produced by a machining operation. Variable Low (-1) High(+1) 1-Speed (ft/min) Feed (in./rev) Depth of Cut (in.) Nose Radius (in.) 1/64 3/ SCEA 15 o 30 o Calculated Effects - Effect Estimates Avg = E 13 =.658 E 1 = E 14 = E = E 15 = E 3 = E 134 = E 4 = E 135 = E 5 = 0.04 E 145 = E 1 = E 34 = E 13 = E 35 = E 14 = E 45 = E 15 = -1.4 E 345 = E 3 = E 134 = E 4 = E 135 = E 5 = E 145 = E 34 = E 1345 = E 35 =.041 E 345 = -0.9 E 45 = E 1345 =

13 Test Table Test R a

14 Comments As we go from generally the proportion of sig. effects decreases Response surfaces generally fairly smooth - absence of higher order interactions Rarely find 3 - factor interactions significant, even rarer to find 4 - factor interactions significant even rarer to find 5 - factor interactions significant and so on. Let s assume that higher order interactions are negligible. Specifically, let s assume that 3 - factor and higher order interactions are negligible. The calculated effect estimates are then non-zero only because of experimental error. Normal Dist N (0, σ eff ) Calculated Effects Value (True effect value=0 for high order interactions) Under our assumption, all calculated 3-factor interactions come from a normal dist n with a standard deviation of centered at 0. Use the σ eff calculated effects to estimate. σ eff ( ) Number of 3rd and higher order E i s eff = All Higher Order Effects In above equation, the number of higher order effects equals the number of degrees of freedom. This is because that we have already assumed zero mean for the normal distribution. If we take 3rd and higher order interactions, (.658) + (.134 ) + + ( 4.1) = = s eff 7

15 s eff =.536 is an estimate of. σ eff Construct 100 ( 1 α) % Confidence Interval for each effect estimate E i ± t α ν 1, s eff --- where ν is the number of 3 - factor & higher order interactions used to calculate s eff. For our example, ν = 16, α = 0.05, The C.I. is E i ± (.1) (.536) = E i ± We can conclude that the average, E, E 4, & E 4 are significant. One final way to figure out which effects are significant is the normal probability plot. Normal Probability Plot Draw 100 values from a normal dist n, and plot the CDF for it. Draw another 100 values from an unknown dist n and plot the CDF for it too. Is the unknown dist n normal? Does the S shaped appearance in the data match the S shapes of the normal CDF? Let s change the vertical scale on the CDF plot so that the normal CDF becomes a straight line Linear Scale Scale streched out about Mean of the data x 73

16 Normal Not Normal If the sampled data from the unknown dist n appears to be well described by a line on such a plot, then the unknown dist n may be reasonably approximated as normal. How can we use a normal probability plot to help us decide which effects are important? Calculate the effects Rank the effects from smallest to largest (don t include the average) Calculate the cumulative prob. associated with each effect i -- P i = N Plot the points (E i, P i ) on Normal probability paper (Home-made or purchased). 74

14.0 RESPONSE SURFACE METHODOLOGY (RSM)

14.0 RESPONSE SURFACE METHODOLOGY (RSM) 4. RESPONSE SURFACE METHODOLOGY (RSM) (Updated Spring ) So far, we ve focused on experiments that: Identify a few important variables from a large set of candidate variables, i.e., a screening experiment.

More information

4. The 2 k Factorial Designs (Ch.6. Two-Level Factorial Designs)

4. The 2 k Factorial Designs (Ch.6. Two-Level Factorial Designs) 4. The 2 k Factorial Designs (Ch.6. Two-Level Factorial Designs) Hae-Jin Choi School of Mechanical Engineering, Chung-Ang University Introduction to 2 k Factorial Designs Special case of the general factorial

More information

4. Design of Experiments (DOE) (The 2 k Factorial Designs)

4. Design of Experiments (DOE) (The 2 k Factorial Designs) 4. Design of Experiments (DOE) (The 2 k Factorial Designs) Hae-Jin Choi School of Mechanical Engineering, Chung-Ang University 1 Example: Golfing How to improve my score in Golfing? Practice!!! Other than

More information

MEEM Design of Experiments Final Take-home Exam

MEEM Design of Experiments Final Take-home Exam MEEM 5990 - Design of Experiments Final Take-home Exam Assigned: April 19, 2005 Due: April 29, 2005 Problem #1 The results of a replicated 2 4 factorial experiment are shown in the table below: Test X

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value.

3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value. 3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value. One-way ANOVA Source DF SS MS F P Factor 3 36.15??? Error??? Total 19 196.04 Completed table is: One-way

More information

The 2 k Factorial Design. Dr. Mohammad Abuhaiba 1

The 2 k Factorial Design. Dr. Mohammad Abuhaiba 1 The 2 k Factorial Design Dr. Mohammad Abuhaiba 1 HoweWork Assignment Due Tuesday 1/6/2010 6.1, 6.2, 6.17, 6.18, 6.19 Dr. Mohammad Abuhaiba 2 Design of Engineering Experiments The 2 k Factorial Design Special

More information

2.830 Homework #6. April 2, 2009

2.830 Homework #6. April 2, 2009 2.830 Homework #6 Dayán Páez April 2, 2009 1 ANOVA The data for four different lithography processes, along with mean and standard deviations are shown in Table 1. Assume a null hypothesis of equality.

More information

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses. 1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately

More information

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments The hypothesis testing framework The two-sample t-test Checking assumptions, validity Comparing more that

More information

Permutation Tests. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods

Permutation Tests. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods Permutation Tests Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods The Two-Sample Problem We observe two independent random samples: F z = z 1, z 2,, z n independently of

More information

Chapter 5 Introduction to Factorial Designs Solutions

Chapter 5 Introduction to Factorial Designs Solutions Solutions from Montgomery, D. C. (1) Design and Analysis of Experiments, Wiley, NY Chapter 5 Introduction to Factorial Designs Solutions 5.1. The following output was obtained from a computer program that

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1 Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)

More information

Remedial Measures, Brown-Forsythe test, F test

Remedial Measures, Brown-Forsythe test, F test Remedial Measures, Brown-Forsythe test, F test Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 7, Slide 1 Remedial Measures How do we know that the regression function

More information

Factorial designs (Chapter 5 in the book)

Factorial designs (Chapter 5 in the book) Factorial designs (Chapter 5 in the book) Ex: We are interested in what affects ph in a liquide. ph is the response variable Choose the factors that affect amount of soda air flow... Choose the number

More information

Bias Variance Trade-off

Bias Variance Trade-off Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]

More information

Unreplicated 2 k Factorial Designs

Unreplicated 2 k Factorial Designs Unreplicated 2 k Factorial Designs These are 2 k factorial designs with one observation at each corner of the cube An unreplicated 2 k factorial design is also sometimes called a single replicate of the

More information

Inference in Regression Analysis

Inference in Regression Analysis Inference in Regression Analysis Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 4, Slide 1 Today: Normal Error Regression Model Y i = β 0 + β 1 X i + ǫ i Y i value

More information

5.1 Introduction. # of successes # of trials. 5.2 Part 1: Maximum Likelihood. MTH 452 Mathematical Statistics

5.1 Introduction. # of successes # of trials. 5.2 Part 1: Maximum Likelihood. MTH 452 Mathematical Statistics MTH 452 Mathematical Statistics Instructor: Orlando Merino University of Rhode Island Spring Semester, 2006 5.1 Introduction An Experiment: In 10 consecutive trips to the free throw line, a professional

More information

Experimental design (KKR031, KBT120) Tuesday 11/ :30-13:30 V

Experimental design (KKR031, KBT120) Tuesday 11/ :30-13:30 V Experimental design (KKR031, KBT120) Tuesday 11/1 2011-8:30-13:30 V Jan Rodmar will be available at ext 3024 and will visit the examination room ca 10:30. The examination results will be available for

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2 LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2 Data Analysis: The mean egg masses (g) of the two different types of eggs may be exactly the same, in which case you may be tempted to accept

More information

Response Surface Methodology

Response Surface Methodology Response Surface Methodology Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 27 1 Response Surface Methodology Interested in response y in relation to numeric factors x Relationship

More information

1 Statistical inference for a population mean

1 Statistical inference for a population mean 1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known

More information

STAT Final Practice Problems

STAT Final Practice Problems STAT 48 -- Final Practice Problems.Out of 5 women who had uterine cancer, 0 claimed to have used estrogens. Out of 30 women without uterine cancer 5 claimed to have used estrogens. Exposure Outcome (Cancer)

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Simple linear regression

Simple linear regression Simple linear regression Biometry 755 Spring 2008 Simple linear regression p. 1/40 Overview of regression analysis Evaluate relationship between one or more independent variables (X 1,...,X k ) and a single

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Inference in Normal Regression Model. Dr. Frank Wood

Inference in Normal Regression Model. Dr. Frank Wood Inference in Normal Regression Model Dr. Frank Wood Remember We know that the point estimator of b 1 is b 1 = (Xi X )(Y i Ȳ ) (Xi X ) 2 Last class we derived the sampling distribution of b 1, it being

More information

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning SMA 6304 / MIT 2.853 / MIT 2.854 Manufacturing Systems Lecture 10: Data and Regression Analysis Lecturer: Prof. Duane S. Boning 1 Agenda 1. Comparison of Treatments (One Variable) Analysis of Variance

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

20g g g Analyze the residuals from this experiment and comment on the model adequacy.

20g g g Analyze the residuals from this experiment and comment on the model adequacy. 3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value. One-way ANOVA Source DF SS MS F P Factor 3 36.15??? Error??? Total 19 196.04 3.11. A pharmaceutical

More information

Statistics and Quantitative Analysis U4320. Segment 10 Prof. Sharyn O Halloran

Statistics and Quantitative Analysis U4320. Segment 10 Prof. Sharyn O Halloran Statistics and Quantitative Analysis U4320 Segment 10 Prof. Sharyn O Halloran Key Points 1. Review Univariate Regression Model 2. Introduce Multivariate Regression Model Assumptions Estimation Hypothesis

More information

Chapter 7: Statistical Inference (Two Samples)

Chapter 7: Statistical Inference (Two Samples) Chapter 7: Statistical Inference (Two Samples) Shiwen Shen University of South Carolina 2016 Fall Section 003 1 / 41 Motivation of Inference on Two Samples Until now we have been mainly interested in a

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Analysis of variance

Analysis of variance Analysis of variance 1 Method If the null hypothesis is true, then the populations are the same: they are normal, and they have the same mean and the same variance. We will estimate the numerical value

More information

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23 2.4. ASSESSING THE MODEL 23 2.4.3 Estimatingσ 2 Note that the sums of squares are functions of the conditional random variables Y i = (Y X = x i ). Hence, the sums of squares are random variables as well.

More information

Statistical View of Least Squares

Statistical View of Least Squares Basic Ideas Some Examples Least Squares May 22, 2007 Basic Ideas Simple Linear Regression Basic Ideas Some Examples Least Squares Suppose we have two variables x and y Basic Ideas Simple Linear Regression

More information

Measuring the fit of the model - SSR

Measuring the fit of the model - SSR Measuring the fit of the model - SSR Once we ve determined our estimated regression line, we d like to know how well the model fits. How far/close are the observations to the fitted line? One way to do

More information

Multivariate Distributions

Multivariate Distributions IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate

More information

Regression Models - Introduction

Regression Models - Introduction Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent

More information

Formal Statement of Simple Linear Regression Model

Formal Statement of Simple Linear Regression Model Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor

More information

EE290H F05. Spanos. Lecture 5: Comparison of Treatments and ANOVA

EE290H F05. Spanos. Lecture 5: Comparison of Treatments and ANOVA 1 Design of Experiments in Semiconductor Manufacturing Comparison of Treatments which recipe works the best? Simple Factorial Experiments to explore impact of few variables Fractional Factorial Experiments

More information

Comparison of Two Population Means

Comparison of Two Population Means Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have

More information

Simple and Multiple Linear Regression

Simple and Multiple Linear Regression Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where

More information

CBA4 is live in practice mode this week exam mode from Saturday!

CBA4 is live in practice mode this week exam mode from Saturday! Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as

More information

What is Experimental Design?

What is Experimental Design? One Factor ANOVA What is Experimental Design? A designed experiment is a test in which purposeful changes are made to the input variables (x) so that we may observe and identify the reasons for change

More information

MTH 452 Mathematical Statistics

MTH 452 Mathematical Statistics MTH 452 Mathematical Statistics Instructor: Orlando Merino University of Rhode Island Spring Semester, 2006 1 5.1 Introduction An Experiment: In 10 consecutive trips to the free throw line, a professional

More information

Univariate ARIMA Forecasts (Theory)

Univariate ARIMA Forecasts (Theory) Univariate ARIMA Forecasts (Theory) Al Nosedal University of Toronto April 10, 2016 Al Nosedal University of Toronto Univariate ARIMA Forecasts (Theory) April 10, 2016 1 / 23 All univariate forecasting

More information

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants. The idea of ANOVA Reminders: A factor is a variable that can take one of several levels used to differentiate one group from another. An experiment has a one-way, or completely randomized, design if several

More information

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs STATISTICS 4 Summary Notes. Geometric and Exponential Distributions GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs P(X = x) = ( p) x p x =,, 3,...

More information

UCLA STAT 233 Statistical Methods in Biomedical Imaging

UCLA STAT 233 Statistical Methods in Biomedical Imaging UCLA STAT 233 Statistical Methods in Biomedical Imaging Instructor: Ivo Dinov, Asst. Prof. In Statistics and Neurology University of California, Los Angeles, Spring 2004 http://www.stat.ucla.edu/~dinov/

More information

5. Blocking and Confounding

5. Blocking and Confounding 5. Blocking and Confounding Hae-Jin Choi School of Mechanical Engineering, Chung-Ang University 1 Why Blocking? Blocking is a technique for dealing with controllable nuisance variables Sometimes, it is

More information

Design of Experiments SUTD - 21/4/2015 1

Design of Experiments SUTD - 21/4/2015 1 Design of Experiments SUTD - 21/4/2015 1 Outline 1. Introduction 2. 2 k Factorial Design Exercise 3. Choice of Sample Size Exercise 4. 2 k p Fractional Factorial Design Exercise 5. Follow-up experimentation

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61

More information

15.063: Communicating with Data

15.063: Communicating with Data 15.063: Communicating with Data Summer 2003 Recitation 6 Linear Regression Today s Content Linear Regression Multiple Regression Some Problems 15.063 - Summer '03 2 Linear Regression Why? What is it? Pros?

More information

Regression Models - Introduction

Regression Models - Introduction Regression Models - Introduction In regression models, two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent variable,

More information

Tentamentsskrivning: Statistisk slutledning 1. Tentamentsskrivning i Statistisk slutledning MVE155/MSG200, 7.5 hp.

Tentamentsskrivning: Statistisk slutledning 1. Tentamentsskrivning i Statistisk slutledning MVE155/MSG200, 7.5 hp. Tentamentsskrivning: Statistisk slutledning 1 Tentamentsskrivning i Statistisk slutledning MVE155/MSG200, 7.5 hp. Tid: 14 mars 2017, kl 14.00-18.00 Examinator och jour: Serik Sagitov, tel. 031-772-5351,

More information

Introduction to hypothesis testing

Introduction to hypothesis testing Introduction to hypothesis testing Review: Logic of Hypothesis Tests Usually, we test (attempt to falsify) a null hypothesis (H 0 ): includes all possibilities except prediction in hypothesis (H A ) If

More information

Masters Comprehensive Examination Department of Statistics, University of Florida

Masters Comprehensive Examination Department of Statistics, University of Florida Masters Comprehensive Examination Department of Statistics, University of Florida May 6, 003, 8:00 am - :00 noon Instructions: You have four hours to answer questions in this examination You must show

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

Biostatistics Quantitative Data

Biostatistics Quantitative Data Biostatistics Quantitative Data Descriptive Statistics Statistical Models One-sample and Two-Sample Tests Introduction to SAS-ANALYST T- and Rank-Tests using ANALYST Thomas Scheike Quantitative Data This

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Review. December 4 th, Review

Review. December 4 th, Review December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter

More information

Relating Graph to Matlab

Relating Graph to Matlab There are two related course documents on the web Probability and Statistics Review -should be read by people without statistics background and it is helpful as a review for those with prior statistics

More information

The Components of a Statistical Hypothesis Testing Problem

The Components of a Statistical Hypothesis Testing Problem Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one

More information

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous. COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida First Year Examination Department of Statistics, University of Florida August 20, 2009, 8:00 am - 2:00 noon Instructions:. You have four hours to answer questions in this examination. 2. You must show

More information

Chapter 6 The 2 k Factorial Design Solutions

Chapter 6 The 2 k Factorial Design Solutions Solutions from Montgomery, D. C. (004) Design and Analysis of Experiments, Wiley, NY Chapter 6 The k Factorial Design Solutions 6.. A router is used to cut locating notches on a printed circuit board.

More information

Density Temp vs Ratio. temp

Density Temp vs Ratio. temp Temp Ratio Density 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Density 0.0 0.2 0.4 0.6 0.8 1.0 1. (a) 170 175 180 185 temp 1.0 1.5 2.0 2.5 3.0 ratio The histogram shows that the temperature measures have two peaks,

More information

HYPERGEOMETRIC and NEGATIVE HYPERGEOMETIC DISTRIBUTIONS

HYPERGEOMETRIC and NEGATIVE HYPERGEOMETIC DISTRIBUTIONS HYPERGEOMETRIC and NEGATIVE HYPERGEOMETIC DISTRIBUTIONS A The Hypergeometric Situation: Sampling without Replacement In the section on Bernoulli trials [top of page 3 of those notes], it was indicated

More information

Chapter 14. Linear least squares

Chapter 14. Linear least squares Serik Sagitov, Chalmers and GU, March 5, 2018 Chapter 14 Linear least squares 1 Simple linear regression model A linear model for the random response Y = Y (x) to an independent variable X = x For a given

More information

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions Chapter 9 Inferences from Two Samples 9. Inferences About Two Proportions 9.3 Inferences About Two s (Independent) 9.4 Inferences About Two s (Matched Pairs) 9.5 Comparing Variation in Two Samples Objective

More information

Chapter 4: Randomized Blocks and Latin Squares

Chapter 4: Randomized Blocks and Latin Squares Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the

More information

OPTIMIZATION OF FIRST ORDER MODELS

OPTIMIZATION OF FIRST ORDER MODELS Chapter 2 OPTIMIZATION OF FIRST ORDER MODELS One should not multiply explanations and causes unless it is strictly necessary William of Bakersville in Umberto Eco s In the Name of the Rose 1 In Response

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

Statistical Methods in Natural Resources Management ESRM 304

Statistical Methods in Natural Resources Management ESRM 304 Statistical Methods in Natural Resources Management ESRM 304 Statistical Methods in Natural Resources Management I. Estimating a Population Mean II. Comparing two Population Means III. Reading Assignment

More information

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Chap The McGraw-Hill Companies, Inc. All rights reserved. 11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview

More information

B. Maddah INDE 504 Discrete-Event Simulation. Output Analysis (1)

B. Maddah INDE 504 Discrete-Event Simulation. Output Analysis (1) B. Maddah INDE 504 Discrete-Event Simulation Output Analysis (1) Introduction The basic, most serious disadvantage of simulation is that we don t get exact answers. Two different runs of the same model

More information

Regression Analysis. Simple Regression Multivariate Regression Stepwise Regression Replication and Prediction Error EE290H F05

Regression Analysis. Simple Regression Multivariate Regression Stepwise Regression Replication and Prediction Error EE290H F05 Regression Analysis Simple Regression Multivariate Regression Stepwise Regression Replication and Prediction Error 1 Regression Analysis In general, we "fit" a model by minimizing a metric that represents

More information

Objective Experiments Glossary of Statistical Terms

Objective Experiments Glossary of Statistical Terms Objective Experiments Glossary of Statistical Terms This glossary is intended to provide friendly definitions for terms used commonly in engineering and science. It is not intended to be absolutely precise.

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16 Stats Review Chapter 14 Revised 8/16 Note: This review is meant to highlight basic concepts from the course. It does not cover all concepts presented by your instructor. Refer back to your notes, unit

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

STA1000F Summary. Mitch Myburgh MYBMIT001 May 28, Work Unit 1: Introducing Probability

STA1000F Summary. Mitch Myburgh MYBMIT001 May 28, Work Unit 1: Introducing Probability STA1000F Summary Mitch Myburgh MYBMIT001 May 28, 2015 1 Module 1: Probability 1.1 Work Unit 1: Introducing Probability 1.1.1 Definitions 1. Random Experiment: A procedure whose outcome (result) in a particular

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution

More information

Simulation. Where real stuff starts

Simulation. Where real stuff starts 1 Simulation Where real stuff starts ToC 1. What is a simulation? 2. Accuracy of output 3. Random Number Generators 4. How to sample 5. Monte Carlo 6. Bootstrap 2 1. What is a simulation? 3 What is a simulation?

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 65 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Review In the previous lecture we considered the following tests: The independent

More information

Statistical Inference. Part IV. Statistical Inference

Statistical Inference. Part IV. Statistical Inference Part IV Statistical Inference As of Oct 5, 2017 Sampling Distributions of the OLS Estimator 1 Statistical Inference Sampling Distributions of the OLS Estimator Testing Against One-Sided Alternatives Two-Sided

More information

TMA4267 Linear Statistical Models V2017 (L19)

TMA4267 Linear Statistical Models V2017 (L19) TMA4267 Linear Statistical Models V2017 (L19) Part 4: Design of Experiments Blocking Fractional factorial designs Mette Langaas Department of Mathematical Sciences, NTNU To be lectured: March 28, 2017

More information

Design of Engineering Experiments Part 5 The 2 k Factorial Design

Design of Engineering Experiments Part 5 The 2 k Factorial Design Design of Engineering Experiments Part 5 The 2 k Factorial Design Text reference, Special case of the general factorial design; k factors, all at two levels The two levels are usually called low and high

More information

Ch. 7: Estimates and Sample Sizes

Ch. 7: Estimates and Sample Sizes Ch. 7: Estimates and Sample Sizes Section Title Notes Pages Introduction to the Chapter 2 2 Estimating p in the Binomial Distribution 2 5 3 Estimating a Population Mean: Sigma Known 6 9 4 Estimating a

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information