Vladimir Spokoiny Design of Experiments in Science and Industry

Size: px
Start display at page:

Download "Vladimir Spokoiny Design of Experiments in Science and Industry"

Transcription

1 W eierstraß-institut für Angew andte Analysis und Stochastik Vladimir Spokoiny Design of Experiments in Science and Industry Mohrenstr. 39, Berlin June, 23, 2009 DoE Vladimir Spokoiny 0-1

2 Outline 1 Experiments in Science and Industry 2 Simple comparative experiments, t-test: 3 Experiments with a single factor, ANOVA 4 Randomised blocks, Latin square, Graeco-Latin square: 5 Factorial and fractional factorial designs: DoE Vladimir Spokoiny 1-2

3 Experiments in Science and Industry The primary goal in scientific research is usually to show the statistical significance of an effect that a particular factor exerts on the dependent variable of interest. In industrial settings, the primary goal is usually to extract the maximum amount of unbiased information regarding the factors affecting a production process from as few (costly) observations as possible. DoE Vladimir Spokoiny 1-3

4 Computational Problems There are basically two general issues to which Experimental Design is addressed: 1 How to design an optimal experiment, and 2 How to analyze the results of an experiment. DoE Vladimir Spokoiny 1-4

5 Outline 1 Experiments in Science and Industry 2 Simple comparative experiments, t-test: 3 Experiments with a single factor, ANOVA 4 Randomised blocks, Latin square, Graeco-Latin square: 5 Factorial and fractional factorial designs: DoE Vladimir Spokoiny 2-5

6 Experimental data or Results Experimental data: Y 1,..., Y n Sample size n ; randomness and Experimental errors : the observations Y i are random (uncertain) and independent. Empirical mean: Y = (Y Y n )/n DoE Vladimir Spokoiny 2-6

7 Population and distribution Population Y : infinite or very large collection of data. Population mean IE(Y ) :) η = IE(Y ) = 1 N Y Population variance V (Y ) : σ 2 = IE(Y η) 2 = 1 N (Y η) 2. Population distribution: every Y i follows some reference distribution. Normal populations: every Y i η and variance σ 2. is normally distributed with mean Each variable σ 1 (Y i η) is standard normal. DoE Vladimir Spokoiny 2-7

8 Empirical (descriptive) measures of the Sample Sample Y 1,..., Y n. Sample mean: Y = (Y Y n )/n Sample variance (population mean known) ṡ 2 = 1 (Y i η) 2 n Sample variance (population mean unknown) s 2 = 1 n 1 i (Y i Y ) 2 i Student (t-) distribution: (Y η)/s follows the t-distribution with n 1 degrees of freedom. DoE Vladimir Spokoiny 2-8

9 Significance t-test Given two samples: n A observations with treatment A ; n B observations with treatment B. Sample means Y A, Y B ; population means η A, η B ; Sample variances: sa 2 = 1 (Y Y A ) 2 sb 2 n = 1 (Y Y B ) 2 A n B A B Pooled sample variance ( s 2 1 = (Y Y A ) 2 + n A + n B A B (Y Y B ) 2 ) DoE Vladimir Spokoiny 2-9

10 Significance t-test Hypothesis H 0 : η A = η B ; Alternative (two-sided) H 1 : η A η B ; Alternative (one-sided) H + 1 : η A > η B or η A < η B. Consider t 0 = (Y A Y B ) (η A η B ) s 1/n A + 1/n B If the individual errors are homogeneous between samples, then t 0 is t-distributed with n A + n B 2 degrees of freedom. Test statistic t = (Y A Y B ) s 1/n A + 1/n B Under the hypothesis, t = t 0. DoE Vladimir Spokoiny 2-10

11 Significance t-test. Cont Test level (size): given α > 0 IP(reject H 0 when correct) α. Quantiles of t 0 : given α < 1, define q α by IP(t 0 > q α ) = α. One-sided test: reject H 0 if t > q α. Two-sided test: reject H 0 if t > q α/2. Used that IP( t 0 > q α/2 ) = 2IP(t > q α/2 ) = α. DoE Vladimir Spokoiny 2-11

12 Outline 1 Experiments in Science and Industry 2 Simple comparative experiments, t-test: 3 Experiments with a single factor, ANOVA 4 Randomised blocks, Latin square, Graeco-Latin square: 5 Factorial and fractional factorial designs: DoE Vladimir Spokoiny 3-12

13 Comparing K treatment means Given K samples, K 2, each of size n k for k = 1,..., K. Sample means in k -treatment: Y k = 1 n k k -treatment sum of squares (SS): i Y ki S k = i (Y ki Y k ) 2 k -treatment sample variance s 2 k = S k/(n k 1) DoE Vladimir Spokoiny 3-13

14 Pooled measures Pooled sample size N = n n K. Pooled (grand) mean: Y = 1 Y ki N Within-treatment sum of squares: S R = S S K Between treatment sum of squares: S T = k k n k (Y k Y ) 2 Within- and between-treatment mean square: s 2 R = S R N k i s 2 T = S T K 1 DoE Vladimir Spokoiny 3-14

15 Analysis of Variance Table Overall variation S D = k (Y ki Y ) 2. i Then S D = k (Y ki Y k ) 2 + i k n k (Y k Y ) 2 = S T + S R. DoE Vladimir Spokoiny 3-15

16 Analysis of Variance Table source of sum of degree of mean variation squares freedom square average S A = NY 2 ν A = 1 s 2 A = S A/ν A between treatments S T = k n k(y k Y ) 2 ν T = K 1 s 2 T = S T /ν T within treatments total S R = k k i (Y ki Y ) 2 i (Y ki Y k ) 2 ν R = N K s 2 R = S R/ν R N DoE Vladimir Spokoiny 3-16

17 Treatment effect Basic model: Y ki = η k + ε ki, ε ki N(0, σ 2 ) i.i.d. where η k is the true treatment mean. Averaged treatment mean: η = 1 N nk η k. Treatment effect: τ k = η k η. Model: Y ki = η + τ k + ε ki Data analysis: Y ki = Y + (Y k Y ) + (Y ki Y k ) DoE Vladimir Spokoiny 3-17

18 Analysis of Variance Table. F-test No effect hypothesis: η 1 =... = η K η. Then all the values Y k and Y are of mean η. Under homogeneous normal errors, the ratio s 2 T /s2 R = ν 1 T ν 1 R k n k(y k Y ) 2 i (Y ki Y k ) 2 k has a fixed Fisher distribution F K 1,N K with (K 1, N K) degrees of freedom. If f α is the (1 α) -quantile of F K 1,N K Large values of S T /S R over f α indicate that the hypothesis of no effect is presumably wrong. DoE Vladimir Spokoiny 3-18

19 Analysis of Variance. Some issues Outlier check: ε ki > q α σ. Treatment dependent error variance: requires some correction of the test. Categorical data (binary, Poisson, multinomial, etc): apply Generalized Linear Models in place of linear: Y ki P η+τk where (P υ ) is a fixed exponential family (usually with canonical parameter). Interactions DoE Vladimir Spokoiny 3-19

20 Outline 1 Experiments in Science and Industry 2 Simple comparative experiments, t-test: 3 Experiments with a single factor, ANOVA 4 Randomised blocks, Latin square, Graeco-Latin square: 5 Factorial and fractional factorial designs: DoE Vladimir Spokoiny 4-20

21 Randomized blocks Y ki, observation with k th treatment applied to the i th block, k = 1,..., K and i = 1,..., n. Model: Y ki = η + β i + τ k + ε ki where η is general mean, β i, block effect, τ k, treatment effect. Data analysis: Y ki = Y + (Y i Y ) + (Y k Y ) + (Y ki Y i Y k + Y ) where Y is grand average, Y i, block average, Y k, treatment average, and Y ki Y i Y k + Y, residual. DoE Vladimir Spokoiny 4-21

22 Analysis of Variance Table source of variation sum of squares degree of freedom average (correction factor) S = nky 2 1 between blocks S B = K i (Y i Y ) 2 n 1 between treatments S T = n k (Y k Y ) 2 K 1 residuals S R = k i (Y ki Y k ) 2 (n 1)(K 1) total k i (Y ki Y i Y k + Y ) 2 DoE Vladimir Spokoiny 4-22 N = nk

23 Test of no effect The no effect hypothesis H 0 : η 1 =... = η K = η can be tested by the value of the ratio st 2 sr 2 = ν 1 T ν 1 R k n(y k Y ) 2 i (Y ki Y k ). 2 k Additivity Y ki = η + β i + τ k + ε ki implies that the block effect cancels, yielding the Fisher distribution F K 1,(n 1)(K 1). Gain of random block against completely random design: smaller variance of the test statistic. DoE Vladimir Spokoiny 4-23

24 Latin Square Design Latin square designs are used when the factors of interest have more than two levels there are no (or only negligible) interactions between factors. Feature: each row and each column receives each treatment exactly once. Example: Car Driver A B D C 2 D C A B 3 B D C A 4 C A B D DoE Vladimir Spokoiny 4-24

25 Latin Square Design: Additive model No interaction yields additive model Y ijk = η + β i + γ j + τ k + ε ijk where β i are row effects, γ j are column effects, τ k are treatment effects, i, j, k = 1,..., K. Requires only K 2 measurements (instead of K 3 ). Hypotheses tested: (i) all τ k are zeros (no treatment effect) (ii) all γ j are zero (no column effect) (iii) all β i are zero (no row effect) DoE Vladimir Spokoiny 4-25

26 Latin Square Design: Analysis of variance Data analysis: Y ijk = Ŷijk + ε ijk with Ŷ ijk = Y + (Y i Y ) + (Y j Y ) + (Y k Y ), where Y = K 2 Y ijk is Grand average, Y i is the row average, Y j is the column average, Y k is the treatment average. Sum of squares: S B = i (Y i Y ) 2, S C = j (Y j Y ) 2, S T = k (Y k Y ) 2, and S R = ijk (Y ijk Ŷijk) 2. DoE Vladimir Spokoiny 4-26

27 Latin Square Design: Analysis of variance Degrees of freedom: τ B = τ C = τ T = K 1, τ R = (K 1)(K 2) Mean squares: sb 2 = 1 (Y i Y ) 2, sc 2 τ = 1 (Y j Y ) 2, B τ C i st 2 = 1 (Y k Y ) 2, sr 2 τ = 1 (Y ijk T τ Ŷijk) 2 R k Ratio of mean squares as F-test: sb 2 /s2 R, row effect; sc 2 /s2 R, column effect ; st 2 /s2 R, treatment effect. DoE Vladimir Spokoiny 4-27 j ijk

28 Graeco-Latin Square Design Useful to eliminate more than two sources of variability. Allows to study of K treatments simultaneously with three different blocking variables. Constructed from two Latin squares by superimposing: A B C B C A C A B and α β χ χ α β β χ α results in Aα Bβ Cχ Bχ Cα Aβ Cβ Aχ Bα DoE Vladimir Spokoiny 4-28

29 Outline 1 Experiments in Science and Industry 2 Simple comparative experiments, t-test: 3 Experiments with a single factor, ANOVA 4 Randomised blocks, Latin square, Graeco-Latin square: 5 Factorial and fractional factorial designs: DoE Vladimir Spokoiny 5-29

30 Factorial design at two levels General factorial design: l k levels for the factor k, k = 1,..., K. Requires l 1... l K experimental runs. 2 K factorial design: K factor at two levels each. Requires 2 K runs. One can assume that each factor X k + and. takes the categorical values DoE Vladimir Spokoiny 5-30

31 Calculation of main effects main effect k = Y + k Y k Gain efficiency if variables act additively: 2 3 factorial is as precise as 24 one-fact.-at-a-time method. Permits to measure interaction effects. DoE Vladimir Spokoiny 5-31

32 Two-fact. interaction effects Consider any two factors, e.g. X 1, X 2. Compute average effects (AE) Define the interaction effect Alternatively Y 21 + = AE(X 2 X 1 = +), Y 21 = AE(X 2 X 1 = ). X 1 X 2 = 1 2[ Y 21 + Y 21 ]. X 1 X 2 = 2[ 1 ] Y 12 + Y 12 It suffices to check this formula for two factors only. DoE Vladimir Spokoiny 5-32

33 Three-fact. interaction effects Now consider any three factors, e.g. X 1, X 2, X 3. Define Y = 1 [ AE(X1 X 2 = +, X 3 = +) AE(X 1 X 2 =, X 3 = +) ] 2 Y 123 = 1 [ AE(X1 X 2 = +, X 3 = ) AE(X 1 X 2 =, X 3 = ) ] 2 and the interaction effect X 1 X 2 X 3 X 1 X 2 X 3 = 1 2( Y Y 123 ). DoE Vladimir Spokoiny 5-33

34 Signs for calculating effects from 2 3 mean DoE Vladimir Spokoiny 5-34

35 Fractional Factorial Design at Two Levels Problem: the number of runs of the full 2 K geometrically with K. factorial design grows Let K = 7. The number of effects: average main eff. 2-fact. 3-fact. 4-fact. 5-fact. 6-fact. 7-fact Redundancy: some of interactions are negligible. DoE Vladimir Spokoiny 5-35

Lecture 7: Latin Square and Related Design

Lecture 7: Latin Square and Related Design Lecture 7: Latin Square and Related Design Montgomery: Section 4.2-4.3 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study for possible differences between four gasoline

More information

Chapter 4: Randomized Blocks and Latin Squares

Chapter 4: Randomized Blocks and Latin Squares Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the

More information

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced

More information

STAT22200 Spring 2014 Chapter 13B

STAT22200 Spring 2014 Chapter 13B STAT22200 Spring 2014 Chapter 13B Yibi Huang May 27, 2014 13.3.1 Crossover Designs 13.3.4 Replicated Latin Square Designs 13.4 Graeco-Latin Squares Chapter 13B - 1 13.3.1 Crossover Design (A Special Latin-Square

More information

Lecture 7: Latin Squares and Related Designs

Lecture 7: Latin Squares and Related Designs Lecture 7: Latin Squares and Related Designs Montgomery: Section 4.2 and 4.3 1 Lecture 7 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study of four gasoline additives(a,b,c,

More information

The Random Effects Model Introduction

The Random Effects Model Introduction The Random Effects Model Introduction Sometimes, treatments included in experiment are randomly chosen from set of all possible treatments. Conclusions from such experiment can then be generalized to other

More information

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1) Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ

More information

Factorial designs. Experiments

Factorial designs. Experiments Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information

Math 423/533: The Main Theoretical Topics

Math 423/533: The Main Theoretical Topics Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Hypothesis Testing One Sample Tests

Hypothesis Testing One Sample Tests STATISTICS Lecture no. 13 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 12. 1. 2010 Tests on Mean of a Normal distribution Tests on Variance of a Normal

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Chapter 12. Analysis of variance

Chapter 12. Analysis of variance Serik Sagitov, Chalmers and GU, January 9, 016 Chapter 1. Analysis of variance Chapter 11: I = samples independent samples paired samples Chapter 1: I 3 samples of equal size J one-way layout two-way layout

More information

Stat 6640 Solution to Midterm #2

Stat 6640 Solution to Midterm #2 Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end

More information

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4 Contents Factorial design TAMS38 - Lecture 6 Factorial design, Latin Square Design Lecturer: Zhenxia Liu Department of Mathematics - Mathematical Statistics 28 November, 2017 Complete three factor design

More information

Paired comparisons. We assume that

Paired comparisons. We assume that To compare to methods, A and B, one can collect a sample of n pairs of observations. Pair i provides two measurements, Y Ai and Y Bi, one for each method: If we want to compare a reaction of patients to

More information

Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions

Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions Solutions from Montgomery, D. C. (008) Design and Analysis of Experiments, Wiley, NY Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions 4.. The ANOVA from a randomized complete block

More information

Assignment 6 Answer Keys

Assignment 6 Answer Keys ssignment 6 nswer Keys Problem 1 (a) The treatment sum of squares can be calculated by SS Treatment = b a ȳi 2 Nȳ 2 i=1 = 5 (5.40 2 + 5.80 2 + 10 2 + 9.80 2 ) 20 7.75 2 = 92.95 Then the F statistic for

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

EE290H F05. Spanos. Lecture 5: Comparison of Treatments and ANOVA

EE290H F05. Spanos. Lecture 5: Comparison of Treatments and ANOVA 1 Design of Experiments in Semiconductor Manufacturing Comparison of Treatments which recipe works the best? Simple Factorial Experiments to explore impact of few variables Fractional Factorial Experiments

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions

Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions Solutions from Montgomery, D. C. (008) Design and Analysis of Exeriments, Wiley, NY Chater 4 Randomized Blocks, Latin Squares, and Related Designs Solutions 4.. The ANOVA from a randomized comlete block

More information

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Chap The McGraw-Hill Companies, Inc. All rights reserved. 11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview

More information

Unit 12: Analysis of Single Factor Experiments

Unit 12: Analysis of Single Factor Experiments Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter

More information

Vladimir Spokoiny Foundations and Applications of Modern Nonparametric Statistics

Vladimir Spokoiny Foundations and Applications of Modern Nonparametric Statistics W eierstraß-institut für Angew andte Analysis und Stochastik Vladimir Spokoiny Foundations and Applications of Modern Nonparametric Statistics Mohrenstr. 39, 10117 Berlin spokoiny@wias-berlin.de www.wias-berlin.de/spokoiny

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

Lecture 6: Linear models and Gauss-Markov theorem

Lecture 6: Linear models and Gauss-Markov theorem Lecture 6: Linear models and Gauss-Markov theorem Linear model setting Results in simple linear regression can be extended to the following general linear model with independently observed response variables

More information

Topic 22 Analysis of Variance

Topic 22 Analysis of Variance Topic 22 Analysis of Variance Comparing Multiple Populations 1 / 14 Outline Overview One Way Analysis of Variance Sample Means Sums of Squares The F Statistic Confidence Intervals 2 / 14 Overview Two-sample

More information

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses. 1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately

More information

One-way ANOVA. Experimental Design. One-way ANOVA

One-way ANOVA. Experimental Design. One-way ANOVA Method to compare more than two samples simultaneously without inflating Type I Error rate (α) Simplicity Few assumptions Adequate for highly complex hypothesis testing 09/30/12 1 Outline of this class

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

Lec 1: An Introduction to ANOVA

Lec 1: An Introduction to ANOVA Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

Incomplete Block Designs

Incomplete Block Designs Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs) The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures

More information

Randomized Blocks, Latin Squares, and Related Designs. Dr. Mohammad Abuhaiba 1

Randomized Blocks, Latin Squares, and Related Designs. Dr. Mohammad Abuhaiba 1 Randomized Blocks, Latin Squares, and Related Designs Dr. Mohammad Abuhaiba 1 HomeWork Assignment Due Sunday 2/5/2010 Solve the following problems at the end of chapter 4: 4-1 4-7 4-12 4-14 4-16 Dr. Mohammad

More information

Lecture 20: Linear model, the LSE, and UMVUE

Lecture 20: Linear model, the LSE, and UMVUE Lecture 20: Linear model, the LSE, and UMVUE Linear Models One of the most useful statistical models is X i = β τ Z i + ε i, i = 1,...,n, where X i is the ith observation and is often called the ith response;

More information

Lec 5: Factorial Experiment

Lec 5: Factorial Experiment November 21, 2011 Example Study of the battery life vs the factors temperatures and types of material. A: Types of material, 3 levels. B: Temperatures, 3 levels. Example Study of the battery life vs the

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Probability Sampling Procedures Collection of Data Measures

More information

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College An example ANOVA situation Example (Treating Blisters) Subjects: 25 patients with blisters Treatments: Treatment A, Treatment

More information

2.830 Homework #6. April 2, 2009

2.830 Homework #6. April 2, 2009 2.830 Homework #6 Dayán Páez April 2, 2009 1 ANOVA The data for four different lithography processes, along with mean and standard deviations are shown in Table 1. Assume a null hypothesis of equality.

More information

STAT Final Practice Problems

STAT Final Practice Problems STAT 48 -- Final Practice Problems.Out of 5 women who had uterine cancer, 0 claimed to have used estrogens. Out of 30 women without uterine cancer 5 claimed to have used estrogens. Exposure Outcome (Cancer)

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Math 36b May 7, 2009 Contents 2 ANOVA: Analysis of Variance 16 2.1 Basic ANOVA........................... 16 2.1.1 the model......................... 17 2.1.2 treatment sum of squares.................

More information

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik Factorial Treatment Structure: Part I Lukas Meier, Seminar für Statistik Factorial Treatment Structure So far (in CRD), the treatments had no structure. So called factorial treatment structure exists if

More information

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement

More information

Comparing two independent samples

Comparing two independent samples In many applications it is necessary to compare two competing methods (for example, to compare treatment effects of a standard drug and an experimental drug). To compare two methods from statistical point

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are

More information

Cuckoo Birds. Analysis of Variance. Display of Cuckoo Bird Egg Lengths

Cuckoo Birds. Analysis of Variance. Display of Cuckoo Bird Egg Lengths Cuckoo Birds Analysis of Variance Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 29th November 2005 Cuckoo birds have a behavior in which they lay their

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/term

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61

More information

Generalized Linear Models (GLZ)

Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the

More information

Analysis of Variance and Design of Experiments-I

Analysis of Variance and Design of Experiments-I Analysis of Variance and Design of Experiments-I MODULE VIII LECTURE - 35 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS MODEL Dr. Shalabh Department of Mathematics and Statistics Indian

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analysis of Variance and Design of Experiment-I MODULE IX LECTURE - 38 EXERCISES Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Example (Completely randomized

More information

STAT 115:Experimental Designs

STAT 115:Experimental Designs STAT 115:Experimental Designs Josefina V. Almeda 2013 Multisample inference: Analysis of Variance 1 Learning Objectives 1. Describe Analysis of Variance (ANOVA) 2. Explain the Rationale of ANOVA 3. Compare

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46 BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics

More information

Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ]

Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ] Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections 9.10 9.15] 7.1. Introduction The Randomized Complete Block Design is commonly used to improve the ability of an experiment to detect

More information

Figure 9.1: A Latin square of order 4, used to construct four types of design

Figure 9.1: A Latin square of order 4, used to construct four types of design 152 Chapter 9 More about Latin Squares 9.1 Uses of Latin squares Let S be an n n Latin square. Figure 9.1 shows a possible square S when n = 4, using the symbols 1, 2, 3, 4 for the letters. Such a Latin

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 9 Jakub Mućk Econometrics of Panel Data Meeting # 9 1 / 22 Outline 1 Time series analysis Stationarity Unit Root Tests for Nonstationarity 2 Panel Unit Root

More information

14 Multiple Linear Regression

14 Multiple Linear Regression B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 14 Multiple Linear Regression 14.1 The multiple linear regression model In simple linear regression, the response variable y is expressed in

More information

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent: Activity #10: AxS ANOVA (Repeated subjects design) Resources: optimism.sav So far in MATH 300 and 301, we have studied the following hypothesis testing procedures: 1) Binomial test, sign-test, Fisher s

More information

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Application of Variance Homogeneity Tests Under Violation of Normality Assumption Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com

More information

PLSC PRACTICE TEST ONE

PLSC PRACTICE TEST ONE PLSC 724 - PRACTICE TEST ONE 1. Discuss briefly the relationship between the shape of the normal curve and the variance. 2. What is the relationship between a statistic and a parameter? 3. How is the α

More information

10. Issues on the determination of trial size

10. Issues on the determination of trial size 10. Issues on the determination of trial size 10.1. The general theory Review of hypothesis testing Null and alternative hypotheses. Simple and composite hypotheses. Test statistic and critical value.

More information

Inference in Regression Analysis

Inference in Regression Analysis Inference in Regression Analysis Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 4, Slide 1 Today: Normal Error Regression Model Y i = β 0 + β 1 X i + ǫ i Y i value

More information

11 Hypothesis Testing

11 Hypothesis Testing 28 11 Hypothesis Testing 111 Introduction Suppose we want to test the hypothesis: H : A q p β p 1 q 1 In terms of the rows of A this can be written as a 1 a q β, ie a i β for each row of A (here a i denotes

More information

Multivariate Regression

Multivariate Regression Multivariate Regression The so-called supervised learning problem is the following: we want to approximate the random variable Y with an appropriate function of the random variables X 1,..., X p with the

More information

Outline Topic 21 - Two Factor ANOVA

Outline Topic 21 - Two Factor ANOVA Outline Topic 21 - Two Factor ANOVA Data Model Parameter Estimates - Fall 2013 Equal Sample Size One replicate per cell Unequal Sample size Topic 21 2 Overview Now have two factors (A and B) Suppose each

More information

STAT 430 (Fall 2017): Tutorial 8

STAT 430 (Fall 2017): Tutorial 8 STAT 430 (Fall 2017): Tutorial 8 Balanced Incomplete Block Design Luyao Lin November 7th/9th, 2017 Department Statistics and Actuarial Science, Simon Fraser University Block Design Complete Random Complete

More information

Lecture 15 - ANOVA cont.

Lecture 15 - ANOVA cont. Lecture 15 - ANOVA cont. Statistics 102 Colin Rundel March 18, 2013 One-way ANOVA Example - Alfalfa Example - Alfalfa (11.6.1) Researchers were interested in the effect that acid has on the growth rate

More information

MAS353 Experimental Design

MAS353 Experimental Design MAS353 Experimental Design M. Farrow University of Newcastle upon Tyne Semester 2, 2005-6 1 Principles of Experimental Design 1.1 Why plan experiments? Planned experiments save time and money and help

More information

Formal Statement of Simple Linear Regression Model

Formal Statement of Simple Linear Regression Model Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor

More information

These are multifactor experiments that have

These are multifactor experiments that have Design of Engineering Experiments Nested Designs Text reference, Chapter 14, Pg. 525 These are multifactor experiments that have some important industrial applications Nested and split-plot designs frequently

More information

Other hypotheses of interest (cont d)

Other hypotheses of interest (cont d) Other hypotheses of interest (cont d) In addition to the simple null hypothesis of no treatment effects, we might wish to test other hypothesis of the general form (examples follow): H 0 : C k g β g p

More information

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous. COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may

More information

Two or more categorical predictors. 2.1 Two fixed effects

Two or more categorical predictors. 2.1 Two fixed effects Two or more categorical predictors Here we extend the ANOVA methods to handle multiple categorical predictors. The statistician has to watch carefully to see whether the effects being considered are properly

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Example - Alfalfa (11.6.1) Lecture 14 - ANOVA cont. Alfalfa Hypotheses. Treatment Effect

Example - Alfalfa (11.6.1) Lecture 14 - ANOVA cont. Alfalfa Hypotheses. Treatment Effect (11.6.1) Lecture 14 - ANOVA cont. Sta102 / BME102 Colin Rundel March 19, 2014 Researchers were interested in the effect that acid has on the growth rate of alfalfa plants. They created three treatment

More information

Stat 427/527: Advanced Data Analysis I

Stat 427/527: Advanced Data Analysis I Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample

More information

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80 The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80 71. Decide in each case whether the hypothesis is simple

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Statistics for EES Factorial analysis of variance

Statistics for EES Factorial analysis of variance Statistics for EES Factorial analysis of variance Dirk Metzler June 12, 2015 Contents 1 ANOVA and F -Test 1 2 Pairwise comparisons and multiple testing 6 3 Non-parametric: The Kruskal-Wallis Test 9 1 ANOVA

More information

Power & Sample Size Calculation

Power & Sample Size Calculation Chapter 7 Power & Sample Size Calculation Yibi Huang Chapter 7 Section 10.3 Power & Sample Size Calculation for CRDs Power & Sample Size for Factorial Designs Chapter 7-1 Power & Sample Size Calculation

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

Multiple Predictor Variables: ANOVA

Multiple Predictor Variables: ANOVA Multiple Predictor Variables: ANOVA 1/32 Linear Models with Many Predictors Multiple regression has many predictors BUT - so did 1-way ANOVA if treatments had 2 levels What if there are multiple treatment

More information

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part : Sample Problems for the Elementary Section of Qualifying Exam in Probability and Statistics https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 2: Sample Problems for the Advanced Section

More information

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests

More information