PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

Similar documents
Chapter 11: Simple Linear Regression and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics Chapter 4

Laboratory 3: Method of Least Squares

Laboratory 1c: Method of Least Squares

Comparison of Regression Lines

Economics 130. Lecture 4 Simple Linear Regression Continued

Chapter 13: Multiple Regression

Statistics for Economics & Business

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

U-Pb Geochronology Practical: Background

Measurement Uncertainties Reference

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

Statistics for Business and Economics

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Statistics MINITAB - Lab 2

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

/ n ) are compared. The logic is: if the two

Lecture 4 Hypothesis Testing

Some basic statistics and curve fitting techniques

Lecture 6: Introduction to Linear Regression

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore

Introduction to Regression

Negative Binomial Regression

Linear Regression Analysis: Terminology and Notation

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Basic Business Statistics, 10/e

APPENDIX 2 FITTING A STRAIGHT LINE TO OBSERVATIONS

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

x = , so that calculated

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

Chapter 8 Indicator Variables

Linear Approximation with Regularization and Moving Least Squares

e i is a random error

Basically, if you have a dummy dependent variable you will be estimating a probability.

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

The Ordinary Least Squares (OLS) Estimator

Properties of Least Squares

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

NUMERICAL DIFFERENTIATION

Biostatistics 360 F&t Tests and Intervals in Regression 1

January Examinations 2015

STAT 3008 Applied Regression Analysis

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

x i1 =1 for all i (the constant ).

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

SIMPLE LINEAR REGRESSION

SPANC -- SPlitpole ANalysis Code User Manual

First Year Examination Department of Statistics, University of Florida

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

Composite Hypotheses testing

Cathy Walker March 5, 2010

Statistical analysis using matlab. HY 439 Presented by: George Fortetsanakis

18. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III

β0 + β1xi and want to estimate the unknown

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Topic- 11 The Analysis of Variance

Limited Dependent Variables

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Lecture 3 Stat102, Spring 2007

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

A REVIEW OF ERROR ANALYSIS

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Chapter 3 Describing Data Using Numerical Measures

Statistics II Final Exam 26/6/18

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

The Geometry of Logit and Probit

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

β0 + β1xi. You are interested in estimating the unknown parameters β

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

Chapter 14 Simple Linear Regression

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger

AS-Level Maths: Statistics 1 for Edexcel

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

Chapter 4: Regression With One Regressor

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

: 5: ) A

Basic Statistical Analysis and Yield Calculations

Learning Objectives for Chapter 11

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Lecture 6 More on Complete Randomized Block Design (RBD)

Diagnostics in Poisson Regression. Models - Residual Analysis

Chapter 15 Student Lecture Notes 15-1

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

Chapter 15 - Multiple Regression

Transcription:

PHYS 45 Sprng semester 7 Lecture : Dealng wth Expermental Uncertantes Ron Refenberger Brck anotechnology Center Purdue Unversty Lecture Introductory Comments Expermental errors (really expermental uncertantes) are common to all measurements They are never known exactly eed to know how to dentfy and deal wth them Sources of error: Random vs. Systematc Know how to calculate Mean and Standard devaton Understand error propagaton

Outlne Random vs. Systematc errors Mean and standard devaton Gaussan probablty dstrbuton functon (pdf) Standard error of the mean Combnng errors Regresson analyss Best ft to straght lne Plottng EXCEL Lnest uncertantes n fttng coeffcents Small sample problem Student t-factors and confdence levels The test 3 Random vs. Systematc Random Errors Affect each measurement dfferently Produce results dstrbuted around some mean value Can be reduced by takng multple measurements Random error and systematc errors combne by ether addng the squares of the separate errors and takng square root (uncorrelated errors) or by addng the errors (worst case error). EXAMPLE: Suppose a mass s measured to have a mass of 5 g usng a mass balance havng a calbraton accuracy of 5%. What s the total uncertanty n the measurement? Standard (uncorrelated) uncertanty Standard Fractonal Error Random Error Systematc Error Systematc Errors Affect all measurements n the same way Often dffcult to know Don t average out Maxmum uncertanty (worst case) ote: Unts are dfferent! Max. Fractonal Error Random Error Systematc Error.5.4.5.9 5 5 Standard Fractonal Error.4.64 Absolute Error.95g 4.5g Absolute Standard Error.645g 3.g Result 5 4.5 g Result 5 3 g 4 Standard s a word whch ndcates that a measurement represents a sample from a populaton

Gudelnes Beware of quotng too many sgnfcant fgures n your results do you mean.847 or.8? Errors can often be better estmated usng statstcs Errors are only approxmately known only quote or at most sgnfcant fgures. (.8.6) x 8 m/s vs. (.8.583) x 8 m/s Statstcal tests are ndcators. You must also be guded by ntuton, judgement, and common sense. 5 Calculatng the Mean and Standard Devaton If a quantty x s measured tmes, you can calculate the mean x and std. devaton s measurements x x s x x useful number to quantfy how data fluctuates about ts mean As The Gaussan probablty dstrbuton functon (pdf) <x>- <x>+ True Value Best Estmate Mean <x> x Varance s Standard devaton s Standard error of the mean m s/ <x> 68% of the dstrbuton between <x>- and <x>+ 95% of the dstrbuton between <x>-.96 and <x>+.96 ormal (Standard) dstrbuton calculator: http://onlnestatbook.com//calculators/normal_dst.html x 6 3

Why does random nose have a Gaussan shape? probablty of obtanng a specfc outcome when a quantty y o s measured n the presence of four ndependent and random sources of nose all wth the same sze. r Possble Devatons Total Devaton Measured Outcome Multplcty Probablty - - - - -4 y o -4 6.5% 3 - - - + - - + - - + - - + - - - - - + + - + - + - + + - + - - + + - + - + + - - - + + + + - + + + + - + + + + - - y o - 4 5.% y o 6 37.5% y o + 4 5.% 4 + + + + 4 y o + 4 6.5% Probablty 4 sources of error each contrbutng y.4.3... yo-4 y yo- y yo yo+ y yo+4 y Possble Values of Measurement 7 Combnng Errors When a result depends on more than one quantty, you must SYSTEMATICALLY combne errors Gven : x and y Quoted result :( x y) or ( x y) General Rule: IF z f x x y z x y z x y z z IF z x y or x y x z x y x y IF z xyor z z y z x y x y Quoted result :( xy) z or ( x ) y z df z x dx These formulas assume uncorrelated errors n x and y 8 4

Example Measurng the Focal Length of a Lens f o f focal length f, o o f f f o o - f o o - - o o o o - f o o o - -. o o o o o o o o o f o o o o Puttng n numbers:.6.4 cm o..3 cm f 7.9 f.6. f o 79.8 47.6. 4.4 f 7.9. cm..4.6.3.6. o o Largest contrbuton 9 To save tme, use onlne uncertanty calculator Choosng standard error, obtan f=7.9. cm same answer as on prevous slde http://denethor.wlu.ca/data/xc.shtml http://web.mst.edu/~gbert/java/uncertanty.html#operaton 5

Regresson Analyss noun: regress ˈrēˌɡres/ the acton of returnng to a former or less developed state The Regresson Effect Most data usually contans ) a predctable component ("sgnal") AD ) a statstcally ndependent unpredctable component ("nose"). The best we can hope for s to ft that part of the data due to the sgnal. Any reasonable ft should exhbt less varablty than the actual measurements. The fttng process therefore mples a regresson to the mean. Best Ft to a Straght Lne Specal Case of Regresson Analyss Collecton of data ponts {x, y } How to determne best values for a and b? Y The least-squares method mnmzes the sum of the squares of the devatons of the expermental data from the straght lne ft n an unbased fashon. One of the man lmtatons s an mplct assumpton that the errors n the ndependent varable (x) are neglgble.e. zero. X y ax b y y y R-squared s the percent of varance explaned by the model. R measures the percentage reducton n mean-squared-error that the regresson model acheves relatve to a nave model y=constant. 6

Best Ft to a Straght Lne Lnear Regresson y ax b To fnd the slope a that mnmzes,wetakethe dervatve wth respect to a and set t to zero. y ax bx y ax b a a yx x x a b DFaGb To fnd the ntercept b that mnmzes,wetake the dervatve wth respect to b and set t to zero. y ax b b b y ax b y x a b H GaJb yx x x y D F G H J 3 Results - Lnear Regresson Two equatons, two unknowns DG FH H J b and a b G FJ G G Uncertantes n a and b are gven by Goodness of Ft R F J FJ G FJ G b a y ax b y y 4 7

In Excel, the best ft to a straght lne s easy Example Speed of Lght Experment: Measure tme for lght pulse to go from pont A to pont B when separaton between A and B s known Default Plot Dstance Travelled 45 4 35 3 5 5 5 4 6 8 4 6 Tme delay 5 Better Roundtrp Dstance Travelled (m) 5 4 3 y =.9x R² =.998 4 8 6 Tme delay (nsec) 6 8

Best Tme delay (nsec) 6 8 4 y = 3.5897x R² =.9894 Speed of Lght 3 4 5 Roundtrp Dstance Travelled (m) Fgure : Tme delay vs. roundtrp dstance travelled for a lght pulse reflected from a mrror. The dashed lne s a best ft straght lne to the data wth an R value of.989. The nverse of the slope of the straght lne gves the speed of lght as.78 x 8 m/s. 7 Uncertantes n Ftted Coeffcents EXCEL Lnest Typcal Data round trp uncertanty dstance (m) (m) t (ns). In Excel, grab a x3 block of cells uncertanty (ns) 3.9. 3.6.5.7. 3.4 3.6 4.. 83. 5.3 9.6.4.4 6. 4..4 46.3 7.8 Steps to follow (on a PC). Ht the F key 3. Type =Lnest(range n y, range n x,,) 4. Ht the Ctrl Shft Enter keys = a a R Best Ft Straght Lne: Y=(3.79.8)X + (-5.884.64) Tme delay (nsec) 6 8 4-4 Data best ft b b stnd. error n y Speed of Lght 3 4 5 Roundtrp Dstance Travelled (m) 3.79 5.88.8 4.64.99 5.5 c = (.6. 3 ) x 8 m/s 8 9

The Small Sample Problem Statstcal methods assume a large number of observatons lead to relable estmates for the mean and standard devaton. Seldom the case. Whensmallernumberofobservatonsare made, less confdence n accuracy of calculated values. Can ths reduced level of confdence be quantfed? An addtonal statstc based on the degrees of freedom (dof), gves a multplcatve factor called the Student t-factor. W. S. Gosset developed ths approach n the early 9 s and publshed t under the pseudonym Student. Gosset s statstcs are partcularly applcable when dscussng the small sample problem. 9 The Student-t factors The dof s close to the number of data ponts. Varous constrants reduce the dof to an nteger value less than. If a problem has r constrants, then the dof=-r. As an example, from the prevous analyss we have 5 data ponts whch we used to determne the slope and standard devaton of 3.79.8. Snce we have calculated the mean () and standard devaton () from data, the dof=5-=3. At the 5% confdence level, the mean and standard devaton s 3.79 (.8x.765) =3.79.4 nsec/m At the 95% confdence level, the mean and standard devaton s 3.79 (.8x3.8) =3.79.57 nsec/m Table : t-factors for varous levels of confdence dof t(5%) t(9%) t(95%) t(99%). 6.3.7 63.7.86.9 4.3 9.9 3.765.35 3.8 5.84 4.74.3.78 4.6 5.77..57 4.3 6.78.94.45 3.7 7.7.89.36 3.5 8.76.86.3 3.36 9.73.83.6 3.5.7.8.3 3.7 5.69.75.3.95.687.7.9.85 3.683.7.4.75.674.64.96.58 When dof=, results are those from a Gaussan pdf

The test Suppose you measure the same quantty n a number of dfferent ways. Can the results be averaged to provde a better estmate for the quantty measured? Yes, f all the measurements are drawn from the same populaton. Test usng the statstc. Example: Measure the focal length of a convergng lens usng four dfferent technques Technque Measured focal length Geometry radus of curvature.9. cm Ray tracng.6. cm 3 Object at nfnty 9.3.3 cm 4 Bessel s conjugate focal ponts 9.9. cm 5 4 3 Focal Length Convergng Lens Radus of curvature Bessel s conjugate method Object at Ray tracng 8. 9.... 3. Measured focal length (cm) The test - graphcally y y n y n + Probablty y + + x n x x... x n

Step : Calculate weghted mean and weghted standard devaton 4 cm 4 w 4 9.99.73.85 cm w f 9.99.9 cm best answer?? w Step : Calculate and reduced from the data st x,, are results usng technque, etc. x x x Replace each... wth the weghted mean w x w fw f w f3 w f4 w 3 4...3..9 9.99.6 9.99 9.3 9.99 9.9 9.99 6.84 dof 43 6.84 reduced.8 dof 3 The f s the focal length determned by method 3 Step 3: Consult the probablty dstrbuton functon (pdf) f( ;dof) d( ) s the probablty that a partcular value of falls between and +d( ). + d( ) The probablty dstrbuton functon f( ;dof) for dof=,,3,4,5,.. as a functon of. The dstrbuton becomes more symmetrc as dof (=) ncreases. 4

reduced Step 4: Calculate, P f dof d P s the cumulatve probablty assocated wth a partcular value of reduced obtaned for the problem at hand. P (n percent) s the probablty that f another experment s conducted, the new data wll yeld a new value of reduced that falls between and the current reduced. SUMMARY: Re-measure the same x s (n ths case focal lengths) Calculate a new value of reduced from new measurements. P s the probablty that the new reduced wll on average be less than or equal to the orgnal reduced. You want P to be close to. For the measurements dscussed here.8 P f,3 d.48 Type: =CHISQ.DIST(.8,3,) nto an empty Excel cell and Excel should return the value.48. 5 Step 5: Interpret the result There s roughly a 48% chance that repeated measurements would gve values that would produce a value of.8 or less. Conversely, there s a 5% chance f the measurements are repeated, we mght get a value of >.8. We can nfer n an unbased and statstcal way that the result f 9.99.9 cm w w would only be obtaned agan, after performng a new set of measurements, wth a confdence level of about 5%. 6 3

Up ext Experment : Reflecton of Lght from Mrrors 7 4