How to Plan Experiments

How to Plan Experiments
- Formulate a hypothesis
- Choose independent variable values
- Experimental planning
  - Scientific method
  - Statistical design of experiments
- Setting specifications

Formulating a Hypothesis
- Must be objectively testable
- Be careful when stating the hypothesis: are you stating the null hypothesis or the alternative hypothesis?
  - Alternative hypothesis H_A: testing whether A > B
  - Null hypothesis H_0: testing whether A = B
  - A fine point, but it can cause you to miss what's really going on because of your own preconceived notions of the system's behavior
- Example: "r_A(T_1) > r_A(T_2) in a packed bed with porous catalytic particles."
- Better stated: "We will experimentally determine whether r_A(T_1) is equal to r_A(T_2) in a packed bed with porous catalytic particles when Q, P, C_A,0, and d_p are fixed."
- A properly formulated hypothesis demonstrates an understanding of the important variables
  - Naturally suggests controls
  - Indicates which factors/conditions/treatments will be tested
- Sometimes the prediction will be wrong
  - This does not mean the experiment is a failure
  - Leaps in learning/understanding are often the result of an incorrect prediction
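One way to test the null hypothesis H_0 (the two rates are equal) on replicate measurements is sketched below. The slides do not name a specific test; a two-sided permutation test is swapped in here because it needs only the Python standard library. The function name permutation_test, its parameters, and the rate values are all hypothetical.

```python
import random
from statistics import mean


def permutation_test(a, b, n_perm=5000, seed=0):
    """Two-sided permutation test of H_0: both samples share one distribution.

    Returns an estimated p-value for the absolute difference in means.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:len(a)]) - mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Hypothetical replicate measurements of r_A at two temperatures.
rates_T1 = [0.52, 0.49, 0.55, 0.51, 0.53]
rates_T2 = [0.40, 0.43, 0.38, 0.41, 0.42]

p_value = permutation_test(rates_T1, rates_T2)
print(f"p-value for H_0 (rates equal): {p_value:.4f}")
```

A small p-value is evidence against H_0; a large one means the data do not distinguish the two rates, which is different from showing that they are equal.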

Formulating a Hypothesis
Example: The lens flattens due to stretching.
[Figure: radius of curvature versus lens diameter, with a schematic of the original and stretched lens shapes (labels h_0, R_0, 2a_0, h, R, 2a, F) and an 8-arm model.]

Formulating a Hypothesis
Example: Lens stiffness comes from its architecture.
[Figure: elastic modulus (Pa) versus distance from center of nucleus (mm), showing data and a model fit with R^2 = 0.808.]

Independent Variable Value Selection
- Need to know something about your system
- Look for the dynamic range
[Figure: reaction rate versus concentration and versus particle size; region labels include threshold, intermediate regime (dynamic range), saturation, concentration limited, and diffusion limited.]

Independent Variable Value Selection
- Linearly spaced points: x_{i+1} = x_i + 1
- Log spaced points: x_{i+1} = 2 x_i
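The two spacing rules above can be sketched in a few lines of Python. The function names and the generalization to an arbitrary step and ratio are illustrative assumptions; the slide uses a step of 1 and a ratio of 2.

```python
def linear_points(start, step, n):
    """n points with x_{i+1} = x_i + step."""
    return [start + i * step for i in range(n)]


def log_points(start, ratio, n):
    """n points with x_{i+1} = ratio * x_i."""
    return [start * ratio**i for i in range(n)]


print(linear_points(1, 1, 5))  # [1, 2, 3, 4, 5]
print(log_points(1, 2, 5))     # [1, 2, 4, 8, 16]
```

Log spacing covers several orders of magnitude with few points, which helps when the location of the dynamic range is not yet known.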

One Factor at a Time (OFAT)
- Vary one independent variable at a time until the value of the dependent variable is optimal across the range of the independent variable tested
- AKA "the scientific method"
- Example: Minimize the heat input Q to a distillation column while maintaining a known distillate composition y_d
  - Can easily find Q for a given C_A,0
  - Will the minimum Q be the same if C_A,0 increases/decreases?
  - OFAT gives us no insight into this problem: a change in C_A,0 will result in a change in y_d

Statistical Experimental Design (DOE)
- Test all independent variables at carefully selected values
- Use math to tell you the relative importance of each factor and their interactions
- Example: Minimize the heat input Q to a distillation column while maintaining a known distillate composition y_d
  - Determine the range of feasible operating conditions (e.g. Q, C_A,0) and input these values to the software
  - Measure the response y_d at each of these points
  - Now can control the system such that y_d is maintained despite changing C_A,0 by systematically varying Q

Statistical Experimental Design
- Scientific Method - OFAT
  - OFAT: One Factor At a Time
  - Manipulate one variable x_1 while holding the other variables (x_2 ... x_n) constant until the best response y(x_1) is found, then move on to the next variable x_2. Repeat for all x_n, then hope that the interactions aren't important.
  - Time consuming and inefficient ("1% inspiration, 99% perspiration", Thomas Edison)
  - Gives no insight into interactions
  - Optimum response y(x_1) will generally not hold for all values of x_2
- Better Method - DOE
  - DOE: Design Of Experiments
  - Choose values of all variables, then compute the optimum response.
  - Efficient
  - Gives explicit values for interactions
  - Optimum response will be the true optimum response
- See http://www.statease.com/pubs/doeprimer.pdf.

One Factor at a Time (OFAT)
1. Vary x_1 until optimal y(x_1) is found. Hold x_1 constant at this value.
2. Vary x_2 until optimal y(x_2) is found. Hold x_1 and x_2 constant at these values.
3. ...
4. Vary x_k until optimal y(x_k) is found. Hold x_1 ... x_k constant at these values.
5. Hope that y(x_1, x_2, ..., x_k) is really the optimum
Image from http://www.compassdude.com
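The numbered OFAT procedure above can be sketched in Python to show how an interaction misleads it. The response function, the 0 to 10 level grid, and the starting point are hypothetical. The exhaustive grid is not a designed experiment; it stands in only to reveal the true optimum for comparison.

```python
from itertools import product


def response(x1, x2):
    """Hypothetical response with an x1*x2 interaction; maximum of 100 at (7, 7)."""
    a, b = x1 - 7, x2 - 7
    return 100 - (a * a + b * b - 1.5 * a * b)


levels = range(0, 11)  # candidate settings for each factor


def ofat(start_x2=0):
    """One OFAT pass: optimize x1 with x2 held fixed, then x2 with x1 held fixed."""
    best_x1 = max(levels, key=lambda x1: response(x1, start_x2))
    best_x2 = max(levels, key=lambda x2: response(best_x1, x2))
    return (best_x1, best_x2), response(best_x1, best_x2)


def exhaustive():
    """Evaluate every combination of levels and keep the best one."""
    best = max(product(levels, levels), key=lambda p: response(*p))
    return best, response(*best)


print("OFAT:     ", ofat())        # ((2, 3), 89.0)
print("Full grid:", exhaustive())  # ((7, 7), 100.0)
```

Because the best x_1 depends on where x_2 is held, a single OFAT pass from (0, 0) stops at (2, 3), short of the true optimum at (7, 7).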

Statistical Design of Experiments (DOE)
1. Set limits on each x_i
2. Choose a type of experimental design
3. Use software to generate candidate points
4. Test each point in the candidate matrix
5. Determine effects of each x_i and interactions x_i x_j
6. Determine the true optimum

Convexity and Search Complexity
- Previous example was convex
  - Any search would eventually find an optimum
- What about non-convex surfaces?

Types of DOE Designs
- Screening experiments
  - Used to determine which independent variables significantly impact your response
  - Gives linear estimate of factor effects
  - Different designs for different resolutions
    - Full factorial
    - Fractional factorial
- Response surfaces
  - Used to determine mapping between independent and dependent variables
  - Different designs for different systems
    - D-optimal
    - Mixtures
    - Box-Behnken
    - Central composite

Screening Experiments
- Full factorial design
  - 2^k experiments required
  - Exposes all primary effects and interactions, both two-factor (2FI) and higher order; for k = 3: x_1, x_2, x_3, x_1 x_2, x_1 x_3, x_2 x_3, x_1 x_2 x_3
  - In MATLAB, use ff2n(k) to get X
- Fractional factorial design
  - Sacrifice resolution for expediency
  - Possible to reduce the number of required experiments to only k+1 for main effects x_1, x_2, ..., x_k
  - Main effects may be confounded with interactions
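For readers without MATLAB, a two-level full factorial design matrix can be generated with the Python standard library. This is a rough analogue, not a port of ff2n: the function name full_factorial and the -1/+1 coding of low and high levels are choices made for this sketch.

```python
from itertools import product


def full_factorial(k):
    """Return all 2**k runs, each factor coded -1 (low) or +1 (high)."""
    return [list(run) for run in product((-1, 1), repeat=k)]


design = full_factorial(3)
for run in design:
    print(run)
print(len(design), "runs")
```

Each factor appears at its low and high level equally often, so every column of the design sums to zero.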

Confounding Factors
- Confounding factors: potential effects which mask each other in DOE analysis in fractional factorial designs
- Blocking variables: variables which can't be controlled, but accounting for them may elucidate results
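A minimal sketch of confounding, assuming a half-fraction of a three-factor design in which the third factor is set by the generator x_3 = x_1 x_2. The generator choice is illustrative. The result has k + 1 = 4 runs.

```python
from itertools import product

# Half-fraction of a 3-factor design: run the full factorial in x1 and x2,
# then set x3 = x1 * x2 (an illustrative generator choice).
runs = [(x1, x2, x1 * x2) for x1, x2 in product((-1, 1), repeat=2)]

x3_column = [r[2] for r in runs]
x1x2_column = [r[0] * r[1] for r in runs]

# The two columns are identical, so the data cannot separate the main effect
# of x3 from the x1*x2 interaction: they are confounded (aliased).
print(runs)
print(x3_column == x1x2_column)  # True
```

Any effect estimated from the x_3 column is really the sum of the x_3 main effect and the x_1 x_2 interaction, which is the price paid for halving the number of runs.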

Statistical Experiment Design
- Rapidly evaluate the dependence of the dependent variable on any number of independent variables and their interactions
- Solid mathematical foundation
- Provably minimal number of experiments
- Available software tools do most of the work for you
- Factorial design
  - Examine k factors and their interactions
  - 2^k experiments
- Fractional factorial design
  - Examine k factors
  - Some interactions will be confounded with primary factors
  - <2^k experiments (down to k+1 when only looking at main factors)
  - Useful for initial screening for important factors

How Linear DOE Works
1. Select independent variables
2. Select bounds of each independent variable
3. Select the level of interaction to examine; usually look at two-factor interactions (2FI)
4. Determine the standard order experimental plan
5. Randomize the order of experimental runs
The MR5 (minimum resolution 5: main effects and 2FI) layout for six factors in the case study: + means the factor is set to the high bound value; - means it is set to the low bound value.
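Steps 4 and 5 can be sketched as follows. This uses a full factorial in three factors for brevity, not the six-factor MR5 layout from the case study. The function names and the seed are hypothetical.

```python
import random
from itertools import product


def standard_order(k):
    """2**k runs in standard (Yates) order: the first factor alternates fastest."""
    return [tuple(reversed(run)) for run in product("-+", repeat=k)]


def randomized(plan, seed=None):
    """Return a shuffled copy of the plan; the original order is left intact."""
    shuffled = list(plan)
    random.Random(seed).shuffle(shuffled)
    return shuffled


plan = standard_order(3)
run_order = randomized(plan, seed=42)  # seed is only for reproducibility
for number, run in enumerate(run_order, start=1):
    print(number, " ".join(run))
```

Randomizing the run order keeps slow drifts (ambient temperature, catalyst aging, operator fatigue) from lining up with any one factor.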

How Linear DOE Works
6. Perform the experiments listed in the test matrix
7. Apply ANOVA and a linear regression model to determine effects and interactions
8. Use the trained regression model to make predictions about your system
9. Investigate nonlinear response to important factors and interactions using response surface methods (if desired/necessary)

$$y(x_1, \ldots, x_n) = c_0 + c_4 x_4 + c_{14} x_1 x_4$$

Linear model with all main effects and 2FI:

$$y(x_1, \ldots, x_n) = c_0 + \sum_{i=1}^{6} c_i x_i + \sum_{j=2}^{6} c_{1j} x_1 x_j + \sum_{k=3}^{6} c_{2k} x_2 x_k + \sum_{l=4}^{6} c_{3l} x_3 x_l + \sum_{m=5}^{6} c_{4m} x_4 x_m + c_{56} x_5 x_6$$

Linear optimization methods are used to find the optimal value(s) of y(x_1, ..., x_n) within the bounds of the independent variables.
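The regression part of step 7 can be sketched for the special case of a two-level full factorial coded -1/+1. There the model columns are orthogonal, so each least-squares coefficient is the column-weighted sum of the responses divided by the number of runs. The ANOVA significance test is not included. The function name, the three-factor design, and the noise-free responses are hypothetical.

```python
from itertools import combinations, product


def fit_main_and_2fi(runs, y):
    """Least-squares coefficients for intercept, main effects, and 2FI.

    Valid as written only for a two-level full factorial coded -1/+1, where
    the model columns are mutually orthogonal and each coefficient reduces to
    sum(column * y) / N.
    """
    n, k = len(runs), len(runs[0])
    coef = {"c0": sum(y) / n}
    for i in range(k):
        coef[f"c{i + 1}"] = sum(r[i] * yi for r, yi in zip(runs, y)) / n
    for i, j in combinations(range(k), 2):
        coef[f"c{i + 1}{j + 1}"] = sum(r[i] * r[j] * yi for r, yi in zip(runs, y)) / n
    return coef


runs = list(product((-1, 1), repeat=3))
# Hypothetical noise-free responses generated from a known model.
y = [10 + 2 * x1 - 3 * x2 + 1.5 * x1 * x2 for x1, x2, x3 in runs]

for name, value in fit_main_and_2fi(runs, y).items():
    print(name, value)
```

The fit recovers c0 = 10, c1 = 2, c2 = -3, and c12 = 1.5, with every term involving x_3 at zero. For fractional or unbalanced designs a general least-squares solver would be needed instead.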

How Response Surface Methods Work
1. Select important factors and interactions based on pilot study
2. Select bounds on these values
3. Input into software
4. Conduct experiments
5. Input responses into software; the software will analyze and report the most suitable model(s)
6. Select type of model (e.g. quadratic, cubic)
7. Predict conditions which yield the optimum response
Figure 1. The real contour of the response to factors A and B: interactions are important.
Figure 2. This is how OFAT sees the relationship between the response variable and factor A.
Figure 3. This is how OFAT sees the relationship between the response variable and factor B.
Figure 4. Response surface shows a pure interaction of two factors.
Example from http://www.chemicalprocessing.com/articles/2006/166.html
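Steps 6 and 7 can be illustrated with the simplest possible case: one factor, three coded levels, and a quadratic model. Real response surface designs involve several factors and more runs. The function names and the three response values are hypothetical.

```python
def fit_quadratic(y_low, y_mid, y_high):
    """Coefficients (a, b, c) of y = a + b*x + c*x**2 through coded x = -1, 0, +1."""
    a = y_mid
    b = (y_high - y_low) / 2
    c = (y_high + y_low) / 2 - y_mid
    return a, b, c


def stationary_point(a, b, c):
    """Location and value of the stationary point (a maximum when c < 0)."""
    x_star = -b / (2 * c)  # requires c != 0, i.e. real curvature
    return x_star, a + b * x_star + c * x_star**2


a, b, c = fit_quadratic(6.0, 9.0, 8.0)  # hypothetical responses
x_star, y_star = stationary_point(a, b, c)
print(a, b, c)         # 9.0 1.0 -2.0
print(x_star, y_star)  # 0.25 9.125
```

The predicted optimum lies between the tested levels, which is why it should be confirmed with a follow-up run before it is trusted.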

Setting Specifications
General approach, with example: chemical reactor
- Extract dependent variable specifications from stated project goals
  - Capital budget ($100,000)
  - Product purity (3 ± 0.001 M)
  - Production rate (100 M/hr)
  - Product temperature (35 ± 0.1 °C)
- Render initial feasibility decision based on order of magnitude estimates
  - Review literature
  - Preliminary design calculations
  - Your experience and expertise
  - Cost charts
- Relate stated goals to independent (control) variables by identifying suitable models
  - CSTR, PFR, laminar flow reactor?
  - Rate equations
- Identify important independent (control) variables based on suitable models
  1. Reactor size
  2. Feed concentration
  3. Feed flowrate
  4. Temperature
- Specify independent variable tolerances by back-calculating from dependent variable tolerances using models
  1. ±10 °C, ±1 °C, or ±0.1 °C?
  2. ±1 M, ±0.1 M, or ±0.01 M?
  3. ±1 m^3/s, ±0.1 m^3/s, or ±0.01 m^3/s?
  4. ±1 m^3, ±0.1 m^3, or ±0.01 m^3?
- Identify suitable measurement and control equipment
  1. Thermostat/on-off switch + heater
  2. Refractometer + injector
  3. Flowmeter + control valves
  4. Drilling/lathe/computer-guided laser
- Sanity check: can we really do this
  - with proposed constraints?
  - within proposed tolerances?
  - without violating laws of thermodynamics?
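Back-calculating an independent variable tolerance from a dependent variable tolerance can be sketched with a first-order sensitivity estimate. The model here is an assumption chosen for illustration: a steady-state CSTR with a first-order reaction, C = C_in / (1 + k*tau), with every variable other than the feed concentration held fixed. The values of k*tau and C_in are hypothetical.

```python
def cstr_outlet(c_in, k_tau):
    """Steady-state outlet concentration of a first-order CSTR: C = C_in / (1 + k*tau)."""
    return c_in / (1 + k_tau)


def allowed_input_tolerance(model, x, output_tol, h=1e-6):
    """First-order estimate: tolerance on x = output tolerance / |dy/dx|."""
    slope = (model(x + h) - model(x - h)) / (2 * h)  # central difference
    return output_tol / abs(slope)


K_TAU = 4.0  # hypothetical value of k*tau
C_IN = 15.0  # hypothetical feed concentration (M), giving 3 M at the outlet

tol = allowed_input_tolerance(lambda c: cstr_outlet(c, K_TAU), C_IN, output_tol=0.001)
print(f"outlet = {cstr_outlet(C_IN, K_TAU)} M, feed tolerance = +/-{tol:.4f} M")
```

Under these assumptions a ±0.001 M purity specification translates to roughly ±0.005 M on the feed concentration, which then guides the choice of measurement and control equipment.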

Summary
- With enough time and money, anything is possible
  - You never have enough time or money
  - Don't let perfect be the enemy of good enough
- Learning to identify the important factors in an experiment, process, or equipment design is an invaluable skill
- Learn to identify the limits and constraints of your system
  - The system could be a company, process, or component design
- Accurately assessing and specifying your project can save you and your company/lab time and money

Conclusions
- DOE maximizes information obtained per experiment
  - Save effort, time, and money
  - Improve process control and stability
  - More robust than the scientific method
- Examples here are for systems with only 2-3 independent variables
  - Benefits of DOE are even more obvious in higher dimensions
- Even in the worst-case scenario, DOE is as good as or better than OFAT