Session 8: Statistical Analysis of Measurements

Similar documents
Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Session 15: Measuring Substrate-Independent Young s Modulus of Thin Films

Lecture on Null Hypothesis Testing & Temporal Correlation

Slides for Data Mining by I. H. Witten and E. Frank

Statistical Foundations:

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

CBA4 is live in practice mode this week exam mode from Saturday!

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Session 11: Complex Modulus of Viscoelastic Polymers

Chapter 23. Inference About Means

Sampling Distributions: Central Limit Theorem

P-values and statistical tests 3. t-test

Single Sample Means. SOCY601 Alan Neustadtl

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

Hypothesis Testing. Mean (SDM)

You may not use your books/notes on this exam. You may use calculator.

Introduction to Business Statistics QM 220 Chapter 12

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

Exam 2 (KEY) July 20, 2009

(Re)introduction to statistics: dusting off the cobwebs

POLI 443 Applied Political Research

Hypothesis Testing in Action

Statistics Part IV Confidence Limits and Hypothesis Testing. Joe Nahas University of Notre Dame

Hypothesis Testing hypothesis testing approach formulation of the test statistic

z-test, t-test Kenneth A. Ribet Math 10A November 28, 2017

Advanced Experimental Design

Ch. 7. One sample hypothesis tests for µ and σ

HYPOTHESIS TESTING. Hypothesis Testing

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Statistics for IT Managers

Hypotheses and Errors

Inference About Two Means: Independent Samples

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

Hypothesis Testing in Action: t-tests

Chapter 8: Confidence Intervals

Inferential Statistics

Lecture 30. DATA 8 Summer Regression Inference

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Application of Chi-Square and T-Test in Architectural Research Methods

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

One-Way Analysis of Variance (ANOVA) Paul K. Strode, Ph.D.

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

The t-statistic. Student s t Test

Statistical Methods in Natural Resources Management ESRM 304

Hypothesis testing: Steps

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

T-TEST FOR HYPOTHESIS ABOUT

16.400/453J Human Factors Engineering. Design of Experiments II

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

79 Wyner Math Academy I Spring 2016

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

2008 Winton. Statistical Testing of RNGs

EXAM 3 Math 1342 Elementary Statistics 6-7

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Testing a Claim about the Difference in 2 Population Means Independent Samples. (there is no difference in Population Means µ 1 µ 2 = 0) against

Factorial Analysis of Variance

Statistics Workshop Ramsey A. Foty, Ph.D. Department of Surgery UMDNJ-RWJMS

Confidence Intervals for Comparing Means

Unit 6 - Introduction to linear regression

Inferences for Correlation

COGS 14B: INTRODUCTION TO STATISTICAL ANALYSIS

Part III: Unstructured Data

February 5, 2018 START HERE. measurement scale. Means. number of means. independent measures t-test Ch dependent measures t-test Ch 16.

Statistics For Economics & Business

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

6. CONFIDENCE INTERVALS. Training is everything cauliflower is nothing but cabbage with a college education.

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Confidence Interval Estimation

AMS 315/576 Lecture Notes

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

Chapter 5: HYPOTHESIS TESTING

The Student s t Distribution

Hypothesis testing and the Gamma function

Hypothesis testing: Steps

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

χ 2 Test for Frequencies January 15, 2019 Contents

POLI 443 Applied Political Research

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should

Psych 230. Psychological Measurement and Statistics

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

23. MORE HYPOTHESIS TESTING

T test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016

Ordinary Least Squares Regression Explained: Vartanian

Chapter 23: Inferences About Means

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Keysight Technologies Measuring Stress-Strain Curves for Shale Rock by Dynamic Instrumented Indentation. Application Note

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

Inferences About Two Proportions

Statistics for EES 3. From standard error to t-test

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing

Transcription:

Session 8: Statistical Analysis of Measurements Jennifer Hay Factory Application Engineer Nano-Scale Sciences Division Agilent Technologies jenny.hay@agilent.com To view previous sessions: https://agilenteseminar.webex.com/agilenteseminar/onstage/g.php?p=117&t=m Page 1

Drawing conclusions We perform 10 indentations on each of two materials. Material A: H = 4.91±0.3 Material B: H = 5.17±0.1 Is this difference in hardness significant or not? Page

Designing experiments We have changed our film-deposition process and wish to know if that change has caused the hardness to change by more than %. Indentation tests on our films typically have a standard deviation of 5%. How many indentations must we do on each film (old and new) in order to determine whether the hardness of the new film differs by more than % from that of the old film? Page 3

History of the Student s t-test (From Wikipedia) The t-statistic was introduced in 1908 by William Sealy Gosset, a chemist working for the Guinness brewery in Dublin, Ireland ("Student" was his pen name) Gosset devised the t-test as a cheap way to monitor the quality of stout. He published the test in Biometrika in 1908, but was forced to use a pen name by his employer, who regarded the fact that they were using statistics as a trade secret. In fact, Gosset's identity was known to fellow statisticians. Page 4

Student-t test - Overview Student-t test is a statistical test used to determine, to a reasonable degree of confidence, whether two observation sets obtain from different populations. Null hypothesis: The two observations sets obtain from the same population. If the difference between the two averages is sufficiently large, relative to measurement scatter, we reject the null hypothesis and conclude that the two measurement sets obtain from different underlying populations. Exemplary conclusion: With 95% confidence, we conclude that material A is significantly harder than material B. Page 5

Assumptions Populations are normally distributed. (An observation set is a limited, random sampling of a population.) Observation sets are independent (not paired). (An example of paired sets would be test scores of a group of students before studying and test scores for that same group of students after studying.) Second population may have a mean that is either higher or lower than first population. (Two-tailed t-test, not onetailed t-test. One-tailed t-test is used when you start by assuming that the controlled variable MUST cause the second population to have a mean which is greater or less than the first; i.e. studying MUST cause higher grades.) Page 6

Assumptions Populations are normally distributed. (An observation set is a limited representative of population.) Observation sets are independent (not paired). (An example of paired sets would be test scores of a group of students before studying and test scores for that same group of students after studying.) Second population may have a mean that is either higher or lower than first population (Two-tailed t-test). Page 7

Step-by-step procedure for using Student s t-test to determine if a measured difference is signficant 1. Decide on a confidence level. Page 8

What confidence level? The student s t-test involves calculating the statistical probability (p) that the two measurement sets could have come from the same population. Observation set 1 Observation set p =7.6% Page 9

What confidence level? Observation set 1 Observation set p =7.6% Page 10

What confidence level? Observation set 1 Observation set p =7.6% Page 11

What confidence level? Observation set 1 Observation set p =7.6% Page 1

What confidence level? Observation set 1 Observation set p =7.6% The experimenter must define what threshold probability is to be accepted as significant (confidence level). This is a personal judgment in the light of tradition. Aside: The selection of confidence level is one example by which Michael Polanyi contradicts the notion that objectivity is synonymous with impersonal in science (Personal Knowledge). Page 13

Is this significant? Observation set 1 Observation set p =4.% In the biological/medical communities, the tradition is to consider any probability less than 5% as significant, so if the observations are made in those fields, then yes. But this tradition (of p < 0.05) derives largely from the relative difficulty of making experimental observations in biology. Page 14

Probability depends on number of observations Observation set 1 Observation set p =4.% Say (for a moment) that the two observation sets derive from two populations that really are different. Then the probability returned by the t-test will decrease as the number of observations increases. If observations are cheap (as they are with indentation), there is no reason to settle for p < 0.05 for significance. Page 15

Threshold on probability of common population Confidence level Probability threshold 5% 95% 1% 99% 0.1% 99.9% Confidence level Page 16

Threshold on probability of common population Confidence level Probability threshold 5% 95% 1% 99% 0.1% 99.9% Confidence level Page 17

Student s t-test, step-by-step 1. Decide on a confidence level (JLH recommends 99% for indentation testing, but 95% is also common).. Calculate average and standard deviation for each observation set (X 1, s 1 ; X, s ). 3. Calculate degrees of freedom, df, where N 1 and N are the number of observations in sets 1 and, respectively: df = N 1 +N - Page 18

Student s t-test, step-by-step 4. Look up Z crit in a t-distribution table by looking for the value of df in the first column and then finding the corresponding value under the column for a two-tailed comparison at 99% (or other desired) confidence. Page 19

t-distribution table (http://en.wikipedia.org/wiki/student's_t-distribution) Page 0

Student s t-test, step-by-step 4. Look up Z crit in a t-distribution table by looking for the value of df in the first column and then finding the corresponding value under the column for a two-tailed comparison at 99% (or other desired) confidence. Page 1

Student s t-test, step-by-step 4. Look up Z crit in a t-distribution table by looking for the value of df in the first column and then finding the corresponding value under the column for a two-tailed comparison at 99% (or other desired) confidence. 5. Calculate the value of the test statistic, Z, as: Z X 1 X s 1 s N1 N Page

Student s t-test, step-by-step 4. Look up Z crit in a t-distribution table by looking for the value of df in the first column and then finding the corresponding value under the column for a two-tailed comparison at 99% (or other desired) confidence. 5. Calculate the value of the test statistic, Z, as: Z X 1 X s 1 s N1 N 6. Compare Z and Z crit. If Z is greater, reject the null hypothesis and conclude that the observation sets are significantly different (drawn from different populations). Page 3

Example (H A = 4.91±0.3; H B = 5.17±0.1; N = 10) We perform 10 indentations on each of two materials. Material A: H = 4.91±0.3 Material B: H = 5.17±0.1 Is this difference in hardness significant or not? Page 4

Example (H A = 4.91±0.3; H B = 5.17±0.1; N = 10) 1. Decide on a confidence level: 99%.. Calculate average and standard deviation for each observation set (X 1 =4.91, s 1 =0.3; X =5.17, s =0.1). 3. Calculate degrees of freedom: df = 10+10- = 18. 4. Look up Z crit Page 5

t-distribution table (http://en.wikipedia.org/wiki/student's_t-distribution) Page 6

Example (H A = 4.91±0.3; H B = 5.17±0.1; N = 10) 1. Decide on a confidence level: 99%.. Calculate average and standard deviation for each observation set (X 1 =4.91, s 1 =0.3; X =5.17, s =0.1). 3. Calculate degrees of freedom: df = 10+10- = 18. 4. Look up Z crit :.878 (Z crit =.101 at 95% confidence). Page 7

Example (H A = 4.91±0.3; H B = 5.17±0.1; N = 10) 1. Decide on a confidence level: 99%.. Calculate average and standard deviation for each observation set (X 1 =4.91, s 1 =0.3; X =5.17, s =0.1). 3. Calculate degrees of freedom: df = 10+10- = 18. 4. Look up Z crit :.878 (Z crit =.101 at 95% confidence). 5. Calculate the value of the test statistic: Z Z 4.91 5.17 (0.3) 10 (0.1) 10.64 Page 8

Example (H A = 4.91±0.3; H B = 5.17±0.1; N = 10) 1. Decide on a confidence level: 99%.. Calculate average and standard deviation for each observation set (X 1 =4.91, s 1 =0.3; X =5.17, s =0.1). 3. Calculate degrees of freedom: df = 10+10- = 18. 4. Look up Z crit :.878 (Z crit =.101 at 95% confidence). 5. Calculate the value of the test statistic: Z =.64. 6. Compare Z and Z crit. Page 9

Example (H A = 4.91±0.3; H B = 5.17±0.1; N = 10) 1. Decide on a confidence level: 99%.. Calculate average and standard deviation for each observation set (X 1 =4.91, s 1 =0.3; X =5.17, s =0.1). 3. Calculate degrees of freedom: df = 10+10- = 18. 4. Look up Z crit :.878 (Z crit =.101 at 95% confidence). 5. Calculate the value of the test statistic: Z =.64. 6. Compare Z and Z crit. Conclusion: At the level of 99% confidence, the two observation sets are not sigificantly different. (But at 95% they are.) Page 30

What s N got to do with it? Z 4.91 5.17 (0.3) 10 (0.1) 10.64 Page 31

What s N got to do with it? Z 4.91 5.17 (0.3) 10 (0.1) 10.64 0 0 Page 3

What s N got to do with it? Z 4.91 5.17 (0.3) 10 (0.1) 10.64 3.37 0 0 Page 33

What s N got to do with it? Z 4.91 5.17 (0.3) 10 (0.1) 10.64 3.37 >.88 0 0 Whereas we could NOT conclude this difference to be significant at 99% confidence with only 10 indents, we CAN conclude it to be signficant at 99% confidence with 0 indents!! Page 34

A word of warning The student s t-test can discern significant difference between two observation sets. IT DOES NOT discern the cause of that difference for you. If the experiment is well designed, then the difference (if there is one) can be credited to the influence of a single independent parameter (e.g. tempering time). The well designed experiment minimizes the influence of all other physical variables (sample temperature, frame stiffness, indenter shape). Page 35

Using the student s t-test to estimate the number of observations needed to conclude significant difference at a particular confidence level Page 36

Solving Student s t-test for N Page 37 Z N N X X crit 1 1 1 s s Set N = N 1 = N Z N X X crit s s 1 1 Isolate variables which depend on N X X Z N crit 1 1 s s

Simplifications Page 38 X X Z N crit 1 1 s s F X X q F X q Z N crit 1 1 1 1 F F q Z N crit 1 1 Z crit =.58 for N = (99%); Z crit = 3.36 for N = 5 (99%)

N/Z crit as a function of N Calculate a column of N/Z crit and plot as a function of N. Page 39

N/Z crit as a function of N Page 40

Solving Student s t-test for N Page 41 F F q Z N crit 1 1 F F q b mn 1 1 b F F q m N 1 1 1

Functionality N 1 m q 1 F b 1F Page 4

Example We have changed our film-deposition process and wish to know if that change has caused the hardness to change by more than %. Indentation tests on our films typically have a standard deviation of 5%. How many indentations must we do on each film (old and new) in order to determine whether the hardness of the new film differs by more than % from that of the old film? N 1 q m 1 F b 1F 1 (0.05) 1 0.98 N 0. 3 N 79.1 80 0.151 10.98 Page 43

Example: Detect a % change with 5% std. dev. Page 44

Summary The Student s t-test provides a means for discerning whether the difference between two observation sets is significant. The experimentalist must select the threshold probability (of a common population) below which the null hypothesis will be rejected. This is done with consideration for tradition and the cost of individual observations. With some simplification, the Student s t-test can be used to predict the number of observations needed to discern a particular difference. Page 45

Session 9: Theory of indentation creep Wednesday, June 1, 013, 11:00 (New York) Abstract In addition to hardness and Young s modulus, instrumented indentation can be used to characterize creep in metals. This is because hardness is a manifestation of the yield stress of the metal. Under conditions of creep, the yield stress depends on temperature and strain rate. As a manifestation of yield stress, hardness is not a constant, but instead depends on temperature and strain rate just as yield stress does. By quantifying these dependencies, instrumented indentation can be used to determine the stress exponent and activation energy for creep. To register: https://agilenteseminar.webex.com/agilenteseminar/onstage/g.php?p=117&t=m Page 46

Suggested reading for next session Lucas, B. N., and W. C. Oliver. "Indentation Power-Law Creep of High-Purity Indium." Metallurgical and Materials Transactions A-Physical Metallurgy and Materials Science 30.3 (1999): 601-610. Agilent application note: In Situ Young s Modulus and Strain-Rate Sensitivity of Lead-Free SAC 105 Solder (http://cp.literature.agilent.com/litweb/pdf/5991-144en.pdf) Page 47

Upcoming workshops June : Theory & Practice of Instrumented Indentation Testing Course, Sunday, 1:00-5:00p.m., The Westin Lombard Yorktown Center, Lombard, Illinois. A short course offered in conjunction with the SEM 013 Annual Conference & Exposition on Experimental and Applied Mechanics (http://sem.org/conf-ac-top.asp) June 5-6: Nano Measure 013 A symposium for sensing and understanding nano-scale phenomena, University of Warsaw, Poland. Page 48

Page 49 Thank you!