INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS

Similar documents
Contents. Acknowledgments. xix

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Practical Statistics for the Analytical Scientist Table of Contents

Transition Passage to Descriptive Statistics 28

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

sphericity, 5-29, 5-32 residuals, 7-1 spread and level, 2-17 t test, 1-13 transformations, 2-15 violations, 1-19

Sleep data, two drugs Ch13.xls

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

Formula for the t-test

DESIGN AND ANALYSIS OF EXPERIMENTS Third Edition

CONTENTS PREFACE XI ABOUT THE AUTHOR XIII. CHAPTER 1 Research Strategies and the Control of Nuisance Variables 1

Sociology 6Z03 Review II

Mathematical Notation Math Introduction to Applied Statistics

Institute of Actuaries of India

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

EE290H F05. Spanos. Lecture 5: Comparison of Treatments and ANOVA

1 Introduction to One-way ANOVA

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.

Inferences for Regression

Homework 2: Simple Linear Regression

INTRODUCTION TO ANALYSIS OF VARIANCE

Tables Table A Table B Table C Table D Table E 675

Experimental Design and Data Analysis for Biologists

Difference in two or more average scores in different groups

Unit 27 One-Way Analysis of Variance

Turning a research question into a statistical question.

STATISTICS; An Introductory Analysis. 2nd hidition TARO YAMANE NEW YORK UNIVERSITY A HARPER INTERNATIONAL EDITION

Review of Statistics 101

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Analysis of Variance and Co-variance. By Manza Ramesh

Analysis of Variance

Statistical. Psychology

16.400/453J Human Factors Engineering. Design of Experiments II

BIOMETRICS INFORMATION

Lecture 18: Simple Linear Regression

THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook

STATISTICS 479 Exam II (100 points)

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

An Old Research Question

ANOVA: Analysis of Variation

Six Sigma Black Belt Study Guides

1 Introduction to Minitab

Data Set 8: Laysan Finch Beak Widths

Inference for the Regression Coefficient

Data Analysis as a Decision Making Process

Chapter 1 Statistical Inference

Chapter 3. Measuring data

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

A Second Course in Statistics: Regression Analysis

Ohio Department of Education Academic Content Standards Mathematics Detailed Checklist ~Grade 9~

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

Factorial designs. Experiments

Research Methodology Statistics Comprehensive Exam Study Guide

HANDBOOK OF APPLICABLE MATHEMATICS

Review of Multiple Regression

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

Willmar Public Schools Curriculum Map

Math 423/533: The Main Theoretical Topics

Sampling Distributions: Central Limit Theorem

Regression and Models with Multiple Factors. Ch. 17, 18

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Algebra 1 Mathematics: to Hoover City Schools

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

Prentice Hall Stats: Modeling the World 2004 (Bock) Correlated to: National Advanced Placement (AP) Statistics Course Outline (Grades 9-12)

West Windsor-Plainsboro Regional School District Algebra Grade 8

CL Algebra I Alignment to OAS CARNEGIE LEARNING TEXTBOOK

Using SPSS for One Way Analysis of Variance

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Formal Statement of Simple Linear Regression Model

Algebra I High School Math Solution Alignment to Oklahoma Academic Standards

Fairfield Public Schools

Dublin City Schools Mathematics Graded Course of Study Algebra I Philosophy

with the usual assumptions about the error term. The two values of X 1 X 2 0 1

Confidence Intervals, Testing and ANOVA Summary

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

DISCOVERING STATISTICS USING R

Hypothesis testing: Steps

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Exam details. Final Review Session. Things to Review

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

Correlation & Simple Regression

Chapter 11. Correlation and Regression

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Simple Linear Regression: One Quantitative IV

RCB - Example. STA305 week 10 1

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Revised: 2/19/09 Unit 1 Pre-Algebra Concepts and Operations Review

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Prentice Hall Mathematics, Geometry 2009 Correlated to: Connecticut Mathematics Curriculum Framework Companion, 2005 (Grades 9-12 Core and Extended)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Unbalanced Designs & Quasi F-Ratios

Transcription:

GEORGE W. COBB Mount Holyoke College INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS Springer

CONTENTS To the Instructor Sample Exam Questions To the Student Acknowledgments xv xxi xxvii xxix 1. INTRODUCTION TO EXPERIMENTAL DESIGN 1 1. The challenge of planning a good experiment 1 The content of an experiment: three decisions 2 Three sources of variability: one we want and two we don't 4 Three kinds of variability: one we want, one we can live with, and one that threatens disaster 6 Chance error and bias compared 10 Design in a nutshell: Isolate the effects of interest control what you can, and randomize the rest 13 2. Three basic principles and four experimental designs 13 Random assignment and the one-way randomized basic factorial (RBF[1]) design 14 Blocking, and the one-way complete block (CB[l]) design 16 Factorial crossing, and the two-way basic factorial (BF[2]) design 19 Kelly's experiment: a split plot / repeated measures design, or SP/RM 21 3. The factor structure of the four experimental designs 22 Factor structure of the one-way basic factorial (BF[1]) design 24 Factor structure of the one-way complete block (CBflJ) design 25 Factor structure of the two-way basic factorial (BF[2]) design 26 Factor structure of the split plot / repeated measures (SP/RM) design 27 Appendix: Other ways to guard against bias 30 2. INFORMAL ANALYSIS AND CHECKING ASSUMPTIONS 38 1. What analysis of variance does 39 2. The Six Fisher Assumptions 42 Two assumptions about unknown true values 42 Four assumptions about residual errors 43 3. Informal analysis, part 1: parallel dot graphs and choosing a scale 45 Averages and parallel dot graphs 46 Choosing a new scale 48

viii CONTENTS 4. Informal analysis, part 2: interaction graph for the log concentrations 52 Interaction graphs 53 Scatterplots 54 3. FORMAL ANOVA: DECOMPOSING THE DATA AND MEASURING VARIABILITY, TESTING HYPOTHESES AND ESTIMATING TRUE DIFFERENCES 61 1. Decomposing the data 62 2. Computing mean squares to measure average variability 64 SS: Sum of squares = measure of overall variability 66 df: degrees of freedom = units of information about residual error 68 MS: Mean square = average variability per unit of information 70 3. Standard deviation = root mean square for residuals 71 4- Formal hypothesis testing: are the effects detectable? 74 The analysis of variance table: summarizing the evidence 75 The logic of the F-test 77 5. Confidence intervals: the likely size of true differences 80 Intervals for the effect of day length on hearts and brains 80 Using the 95% distance to think about design issues 82 Appendix 1: The general decomposition rule 87 Appendix 2: Introduction to the logic of hypothesis tests and confidence intervals 92 4. DECISIONS ABOUT THE CONTENT OF AN EXPERIMENT 108 1. The response 110 Two restrictions 110 Reliability and validity: how good is the response? 114 2. Conditions 118 Isolate the effects of interest 118 Experiments versus observational studies 121 3. Material 125 Units 125 Material should be representative and uniform 126 Appendix: Supplementary Examples 135 5. RANDOMIZATION AND THE BASIC FACTORIAL DESIGN 148 1. The basic factorial design ("What you do") 149 Experimental version: the completely randomised 1 (CR) design. 150 Observational versions of the basic factorial design 152

CONTENTS ix 2. Informal analysis 155 Averages and outliers 155 Parallel dot graphs 157 3. Factor structure ("What you get") 164 4- Decomposition and analysis of variance for one-way BF designs 168 Decomposition of a balanced one-way design 170 Degrees of freedom for a balanced BF[l] design 173 Mean squares, the standard deviation, and the F-test 175 5. Using a computer [Optional] 181 6. Algebraic notation for factor structure [Optional] 185 Notation for the model 185 Notation for the estimates 187 Appendix: Supplementary Examples 196 6. INTERACTION AND THE PRINCIPLE OF FACTORIAL CROSSING 199 1. Factorial crossing and the two-way basic factorial design, or BF[2] 200 Factorial crossing 200 Two-way basic factorial designs: "what you do" 202 Factor structure of the BF[2] design: "what you get" 205 2. Interaction and the interaction graph 209 Interaction as a difference of differences 209 The interaction graph 211 3. Decomposition and ANOVA for the two-way design 216 Decomposition 217 Degrees of freedom 220 Mean squares, standard deviation, and F-tests 221 4. Using a computer [Optional] 226 5. Algebraic notation for the two-way BF design [Optional] 230 Appendix: Supplementary Examples 239 7. THE PRINCIPLE OF BLOCKING 243 1. Blocking and the complete block design (CB) 245 The randomized complete block experiment: what you do 245 Similar units 246 Ways to get blocks 247 Observational studies in complete blocks 248 Factor structure of the one-way CB design 249 2. Two nuisance factors: the Latin square design (LS) 251 Randomization in the LS plan 254

i CONTENTS Factor structure of the LS design 256 Variations on the simplest Latin square plan 256 3. The split plot/repeated measures design (SP/RM) 259 What you do 260 Variations on the basic SP/RM design 262 What you get 263 Factor structure of the basic SP/RM 263 Crossing versus nesting 264 4. Decomposition and analysis of variance 267 The complete block design 267 The Latin square design 271 The split plot/repeated measures design 274 Two kinds of units, two kinds of chance error 276 5. Scatterplots for data sets with blocks 282 Within-bloclcs scatterplots 282 Scatterplots of res. vs. fit 285 6. Using a computer. [Optional] 289 7. Algebraic notation for the CB, LS and SP/RM designs 293 Appendix: Supplementary Examples 303 8. WORKING WITH THE FOUR BASIC DESIGNS 310 1. Comparing and recognizing design structures 311 Preliminary steps: recognizing the units 311 A checklist for analyzing designs 314 2. Choosing a design structure: deciding about blocking 320 Thinking about factors 320 Deciding whether to use blocks 321 Choosing a design: Examples 323 3. Informal analysis: examples 329 4- Recognizing alternatives to ANOVA 339 Mistaking multiple measurements for levels of a factor 341 Mistaking categories of a nominal response for levels of a factor 344 Mistaking numbers on an interval/ratio scale for levels of a factor 347 Appendix: Supplementary Examples 357 9. EXTENDING THE BASIC DESIGNS BY FACTORIAL CROSSING 370 1. Extending the BF design: general principles 370 Question 1. Hotf do you choose the factors to cross? 371

CONTENTS xi Question 2. How do you carry out the experimental plan? 372 Question 3. How do you find the factor structure? 373 Question 4. How is the informal analysis different? 373 2. Three or more crossed factors of interest 375 3. Compound within-blocks factors 381 Two alternative models for residual error structure 384 4. Graphical methods for 3-factor interactions 392 5. Analysis of variance 397 F-ratios for compound within-blocks factors 397 Preliminary F-tests and pooling mean squares 400 Appendix: Supplementary Examples 406 1O. DECOMPOSING A DATA SET 414 1. The basic decomposition step and the BF[1] design 416 The basic decomposition step [SWEEP] 416 Complete decomposition of a balanced BF[l] data set 419 2. Decomposing data from balanced designs 424 Decomposition rule and examples 425 Counting degrees of freedom by subtraction 432 11. COMPARISONS, CONTRASTS, AND CONFIDENCE INTERVALS 436 1. Comparisons: confidence intervals and tests 437 Strategies for choosing comparisons 437 Interpreting the comparisons using standard errors 439 The true value estimated by a comparison 442 The standard error for a comparison 445 Confidence intervals and tests for comparisons 446 2. Adjustments for multiple comparisons 453 Arguments for and against adjusting 454 A strategy for adjusting (if you must) 455 3. Between-blocks factors and compound within-blocks factors 461 The SP/RMll] 461 Pooling Mean Squares 463 The CB12] design 464 The CB and SP/RM families: the "CWIC" rule for choosing MSs 466 4. Linear estimators and orthogonal contrasts [Optional] 470 Linear estimators 470 Orthogonal contrasts (comparisons) 473

xii CONTENTS 12. THE FISHER ASSUMPTIONS AND HOW TO CHECK THEM 482 1. Same SDs (s) 483 Checking the assumption 484 Finding a transformation 486 2. Independent chance errors (I) 497 Ad hoc scatterplots 498 Within-blocks scatterplots 500 The Huynh-Feldt condition for designs with blocks 504 3. The normality assumption (N) 507 Histograms for residuals 507 Normal plots for residuals 509 4- Effects are additive (A) and constant (C) 516 Plots for checking the assumptions 519 Transforming to malce the assumptions fit 521 5. Estimating replacement values for outliers 525 13. OTHER EXPERIMENTAL DESIGNS AND MODELS 535 1. New factor structures built by crossing and nesting 536 Crossing vs nesting: familiar examples, three pictures, four tests 536 Examples of designs with nested factors 541 Purely hierarchical designs 542 Finding the list of factors: a rule 546 Decomposing data sets with nested factors 549 2. New uses for old factor structures: fixed versus random effects 554 Fixed and random factors 555 Using expected mean squares to choose denominators for F-ratios 562 3. Models with mixed interaction effects 571 Preview: Restricted vs unrestricted models for mixed interactions 571 The unrestricted model for mixed interaction terms 574 The restricted model for mixed interaction terms 577 Generalized complete block designs 581 4. Expected mean squares and F-ratios 585 Finding EMSs and choosing denominators 586 Using Minitab 595 Pseudo F ratios: 596 The logic of the EMS rule, part I: Double decomposition diagrams 599 The logic of the EMS rule, part II 606 14. CONTINUOUS CARRIERS: A VISUAL APPROACH TO REGRESSION, CORRELATION AND ANALYSIS OF COVARIANCE 614 1. Regression 614 Introduction to line-fitting 615

CONTENTS xiii Fitting a regression line by least squares 621 Assumptions, tests, and confidence intervals 630 Transformations, extensions and limitations 634 2. Balloon summaries and correlation 652 The correlation coefficient 652 Balloon-based estimates for scatterplots 654 The correlation coefficient and the regression effect 656 Correlation and fraction of variability "explained" 661 Correlation equals, the standardized regression slope 664 3. Analysis of Covariance 673 Why and when to use ANCOVA 673 How to fit the ANCOVA model: computing rules 678 15. SAMPLING DISTRIBUTIONS AND THE ROLE OF THE ASSUMPTIONS 690 1. The logic of hypothesis testing 692 Structure of the argument 692 Sampling distribution of a test statistic 694 2. Ways to think about sampling distributions 702 Finding an equivalent single-draw box model 702 Representing distributions geometrically: dot graphs and histograms 705 Key features: EVs, SEs, and percentiles 709 3. Four fundamental families of distributions 720 The standard normal distribution: a useful fiction 721 The chi-square family: sums of squares for N(0,l) outcomes 732 The t family 733 The F-family 734 4. Sampling distributions for linear estimators 739 The meaning of 95% confidence, and its relation to the t family 739 The sampling distribution of (Est-True)/SE 743 5. Approximate sampling distributions for F-ratios 750 6. Why (and when) are the models reasonable? 754 Sampling from a population 754 Measurement error 759 Randomisation 760 Tables 767 Data Sources 774 Data Index 782 Subject Index 786