Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Similar documents
Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

How to Describe Accuracy

Statistics: Error (Chpt. 5)

ANALYTICAL CHEMISTRY - CLUTCH 1E CH STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS

-However, this definition can be expanded to include: biology (biometrics), environmental science (environmetrics), economics (econometrics).

Analytical Chemistry. Course Philosophy

Chapter 3 Experimental Error

Statistical Analysis of Chemical Data Chapter 4

Lecture 3. - all digits that are certain plus one which contains some uncertainty are said to be significant figures

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

MECHANICAL ENGINEERING SYSTEMS LABORATORY

SPH3U1 Lesson 03 Introduction. 6.1 Expressing Error in Measurement

Practical Statistics for the Analytical Scientist Table of Contents

Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error

Introduction to Statistics and Error Analysis

We would like to describe this population. Central tendency (mean) Variability (standard deviation) ( X X ) 2 N

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

Proposed Procedures for Determining the Method Detection Limit and Minimum Level

Counting Statistics and Error Propagation!

Describing distributions with numbers

Application of mathematical, statistical, graphical or symbolic methods to maximize chemical information.

COUNTING ERRORS AND STATISTICS RCT STUDY GUIDE Identify the five general types of radiation measurement errors.

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs

Describing distributions with numbers

E509A: Principle of Biostatistics. GY Zou

Wilcoxon Test and Calculating Sample Sizes

Uncertainty, Error, and Precision in Quantitative Measurements an Introduction 4.4 cm Experimental error

USE OF TOLERANCES IN THE DETERMINATION OF ACTIVE INGREDIENT CONTENT IN SPECIFICATIONS FOR PLANT PROTECTION PRODUCTS

Protocol for the design, conducts and interpretation of collaborative studies (Resolution Oeno 6/2000)

Unit 4. Statistics, Detection Limits and Uncertainty. Experts Teaching from Practical Experience

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution.

2. A Basic Statistical Toolbox

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Inference for Distributions Inference for the Mean of a Population

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews

2011 Pearson Education, Inc

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

Minimum Power for PCL

EPAs New MDL Procedure What it Means, Why it Works, and How to Comply

Topic 2 Measurement and Calculations in Chemistry

Chem 321 Lecture 4 - Experimental Errors and Statistics 9/5/13

Class Objectives MISTAKES AND ERRORS MISTAKES AND ERRORS CE 211 SURVEYING ENGINEERING FALL 2011 CLASS 03: THEORY OF ERROS 8/26/2011

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:

Chapter 7 Comparison of two independent samples

Precision Correcting for Random Error

The SuperBall Lab. Objective. Instructions

03.1 Experimental Error

Sampling: A Brief Review. Workshop on Respondent-driven Sampling Analyst Software

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions

Advanced Experimental Design

ECON Introductory Econometrics. Lecture 2: Review of Statistics

Data Analysis for University Physics

Averaging, Errors and Uncertainty

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

Validation Report. Calculator for Extrapolation of Net Weight in Conjunction with a Hypergeometric Sampling Plan

COPYRIGHTED MATERIAL INTRODUCTION CHAPTER 1

Stephen Scott.

Control Charts Confidence Interval Student s t test. Quality Control Charts. Confidence Interval. How do we verify accuracy?

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

AUTOMATED TEMPLATE MATCHING METHOD FOR NMIS AT THE Y-12 NATIONAL SECURITY COMPLEX

Appendix II Calculation of Uncertainties

Brian M. Tissue page 1 of 10

COPYRIGHTED MATERIAL INTRODUCTION CHAPTER 1

you-try-it-01.xlsx Step-by-Step Guide ver. 3/1/2010

Multiple Comparisons

Inferential Statistics

Relating Graph to Matlab

MEASUREMENT AND STATISTICAL TREATMENT OF EMPERICAL DATA

Scenario 5: Internet Usage Solution. θ j

Physics 509: Bootstrap and Robust Parameter Estimation

Analysis of 2x2 Cross-Over Designs using T-Tests

FUNDAMENTAL CONCEPTS IN MEASUREMENT & EXPERIMENTATION (continued) Measurement Errors and Uncertainty:

Harris: Quantitative Chemical Analysis, Eight Edition CHAPTER 03: EXPERIMENTAL ERROR

1] What is the 95% confidence interval for 6 measurements whose average is 4.22 and with a standard deviation of 0.07?

MEASUREMENT VARIATION

Understanding Inference: Confidence Intervals I. Questions about the Assignment. The Big Picture. Statistic vs. Parameter. Statistic vs.

4.1 Hypothesis Testing

Theory of Probability Sir Harold Jeffreys Table of Contents

Applications in Differentiation Page 3

Solutions to Final STAT 421, Fall 2008

Harris: Quantitative Chemical Analysis, Eight Edition CHAPTER 03: EXPERIMENTAL ERROR

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

STUDYING RADIOCARBON PAPERS MAY SOLVE THE ENIGMA OF THE RADIOCARBON DATING OF THE SHROUD OF TURIN.

Math 120 Chapter 3 additional notes

Precision and Accuracy Assessing your Calorimeter

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

English SI (mks) cgs. 1 psi = 1 lb f /in 2 Pascal (Pa) = 1 N/m 2 = 1 kg/(m s 2 ) lbm/ft 3 kg/m 3 g/cm 3 = mass/(length) 3

Error Analysis, Statistics and Graphing Workshop

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Precision and Accuracy Assessing your Calorimeter

Sampling Distributions

Chem 321 Lecture 5 - Experimental Errors and Statistics 9/10/13

The Union and Intersection for Different Configurations of Two Events Mutually Exclusive vs Independency of Events

Multicollinearity occurs when two or more predictors in the model are correlated and provide redundant information about the response.

Math 10 - Compilation of Sample Exam Questions + Answers

Philipp Koskarti

Stat 427/527: Advanced Data Analysis I

Transcription:

Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low (improper shielding and grounding of an instrument or error in the preparation of standards). 3. Random error - unpredictably high or low (pressure changes or temperature changes). Precision = ability to control random error. Accuracy = ability to control systematic error.

Statistics Experimental measurements always have some random error, so no conclusion can be drawn with complete certainty. Statistics give us a tool to accept conclusions that have a high probability of being correct. Deals with random error! There is always some uncertainty in a measurement. Challenge is to minimize this uncertaintity!!

Standard Deviation of the Mean Another very common way chemists represent error a measurement is to report a value called the standard deviation. The standard deviation of a small sampling is: s N i ( x N 1 x) s = standard deviation of the mean N = number of measurements or points x i = each individual measurement i 2 Trials Measurements 1 21.56 2 27.25 3 25.53 4 24.99 5 24.43 Mean 24.75 s 2.07 x = sample mean RSD = (s/mean) x 100 Coefficient of variance = (s) 2 24.75 ± 2.07

Standard Error of the Mean Another very common way to represent error is to report a value called the standard error. The standard error is related to standard deviation: S. E. s N s = standard deviation of the mean N = number of measurements or points 24.75 ± 0.93 Trials Measurements 1 21.56 2 27.25 3 25.53 4 24.99 5 24.43 Mean 24.75 s 2.07 S.E. 0.93

Confidence Limit Another common statistical tool for reporting the uncertainty (precision) of a measurement is the confidence limit (CL). C. L. t s N Table 22.1: Confidence Limit t-values as a function of (N-1) N-1 90% 95% 99% 99.5% 2 2.920 4.303 9.925 14.089 3 2.353 3.182 5.841 7.453 4 2.132 2.776 4.604 5.598 5 2.015 2.571 4.032 4.773 6 1.943 2.447 3.707 4.317 7 1.895 2.365 3.500 4.029 8 1.860 2.306 3.355 3.832 9 1.833 2.262 3.205 3.690 10 1.812 2.228 3.169 3.581 1.9. L. (2.776) 2.4 5 Trials Measurements 1 21.56 2 27.25 3 25.53 4 24.99 5 24.43 Mean 24.75 s 2.07 S.E. 0.93 C.L. 2.56 C 24.75 ± 2.56

Population versus Sample Size The term population is used when an infinite sampling occurred or all possible subjects were analyzed. Obviously, we cannot repeat a measurement an infinite number of times so quite often the idea of a population is theoretical. Sample size is selected to reflect population. μ 1σ 68.3% μ 2σ 95.5% μ 3σ 99.7% Median = middle number is a series of measurements Range = difference between the highest and lowest values

Q-Test Q-test is a statistical tool used to identify an outlier within a data set. Q exp x q = suspected outlier x n+1 = next nearest data point w = range (largest smallest data point in the set) Critical Rejection Values for Identifying an Outlier: Q-test Q crit N 90% CL 95% CL 99% CL 3 0.941 0.970 0.994 4 0.765 0.829 0.926 5 0.642 0.710 0.821 6 0.560 0.625 0.740 7 0.507 0.568 0.680 8 0.468 0.526 0.634 9 0.437 0.493 0.598 10 0.412 0.466 0.568 x q w x Cup 1 78 2 82 3 81 4 77 5 72 6 79 7 82 8 81 9 78 10 83 ppm Caffeine Mean 79.3 s 3.3 C.L. 95% 2.0

Calculation Q-test Example - Perform a Q-test on the data set from Table on previous page and determine if you can statistically designate data point #5 as an outlier within a 95% CL. If so, recalculate the mean, standard deviation and the 95% CL. Strategy Organize the data from highest to lowest data point and use Equation to calculate Q exp. Solution Ordering the data from Table 22.3 from highest to lowest results in Q exp 72 77 11 0.455 Using the Q crit table, we see that Q crit =0.466. Since Q exp < Q crit, you must keep the data point.

Grubbs Test The recommended way of identifying outliers is to use the Grubb s Test. A Grubb s test is similar to a Q-test however G exp is based upon the mean and standard deviation of the distribution instead of the next-nearest neighbor and range. Table: Critical Rejection Values for Identifying an Outlier: G-test G exp x q s x G crit N 90% C.L. 95% C.L. 99% C.L. 3 1.1.53 1.154 1.155 4 1.463 1.481 1.496 5 1.671 1.715 1.764 6 1.822 1.887 1.973 7 1.938 2.020 2.139 8 2.032 2.127 2.274 9 2.110 2.215 2.387 10 2.176 2.290 2.482 If G exp is greater than the critical G-value (G crit ) found in the Table then you are statistically justified in removing your suspected outlier.

How would you determine if the value is high or not? Is the value high or within the confidence interval of the normal counts? Mean = 5.1 x 10 6 counts s = 1.9 x 10 6 counts 5.1 ± 1.9 (x 10 6 counts) std. dev. 1.9 C. L. (2.776) 5 2.4 5.1 ± 2.4 (x 10 6 counts) 95% C.L. 5.6 (x 10 6 counts) is within the normal range at this confidence interval.

Student s t Good to report mean values at the 95% confidence interval Confidence interval From a limited number of measurements, it is possible to find population mean and standard deviation.

Student s t